File size: 299 Bytes
756d849
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
---
license: apache-2.0
datasets:
- trl-lib/tldr
base_model:
- google/gemma-3-270m-it
pipeline_tag: summarization
---

This is a fine-tune of gemma, trained using GRPO with a reward function to incentivise ~40 char outputs beginning with "I" such that it outputs TL;DR summaries for reddit comments.