reward_model_gpt2_stack_exchange / README.md

qgyd2021

Update README.md

b86d6be about 2 years ago

preview code

raw

history blame

625 Bytes

metadata

license: apache-2.0
datasets:
  - lvwerra/stack-exchange-paired
language:
  - en
library_name: adapter-transformers
pipeline_tag: text-generation
tags:
  - reward_model

Reward Model GPT2

fine-tuned GPT2 to a reward model.

The model is designed to generate human-like responses to questions in Stack Exchange domains of programming, mathematics, physics, and more.

For more info check out the blog post and github example.