lmzheng
/

fine-tuned-judge

Feature Extraction

text-generation-inference

Model card Files Files and versions

fine-tuned-judge / README.md

lmzheng's picture

Update README.md

0364f87 about 2 years ago

|

221 Bytes

	This is a 3-way classifier judge model fine-tuned on the Chatbot Arena human preference dataset. The base model is llama 13B.
	More details can be found in the Appendix. F of this [paper](https://arxiv.org/abs/2306.05685).