| This is a 3-way classifier judge model fine-tuned on the Chatbot Arena human preference dataset. The base model is LLaMA-13B. | |
| More details can be found in Appendix F of this [paper](https://arxiv.org/abs/2306.05685). |