metadata
license: apache-2.0
datasets:
- lvwerra/stack-exchange-paired
language:
- en
library_name: adapter-transformers
pipeline_tag: text-generation
tags:
- reward_model
Reward Model GPT2
fine-tuned GPT2 to a reward model.
The model is designed to generate human-like responses to questions in Stack Exchange domains of programming, mathematics, physics, and more.
For more info check out the blog post and github example.