alex-treebeard commited on
Commit
756d849
·
verified ·
1 Parent(s): 7c1b437

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +10 -0
README.md ADDED
@@ -0,0 +1,10 @@
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ datasets:
4
+ - trl-lib/tldr
5
+ base_model:
6
+ - google/gemma-3-270m-it
7
+ pipeline_tag: summarization
8
+ ---
9
+
10
+ This is a fine-tune of gemma, trained using GRPO with a reward function to incentivise ~40 char outputs beginning with "I" such that it outputs TL;DR summaries for reddit comments.