thejaminator commited on
Commit
e79d8eb
·
verified ·
1 Parent(s): 4dc626c

verl GRPO trained model at step 50

Browse files
Files changed (1) hide show
  1. README.md +9 -0
README.md ADDED
@@ -0,0 +1,9 @@
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ base_model: thejaminator/checkpoints_multiple_datasets_layer_1_decoder-fixed
3
+ library_name: peft
4
+ tags:
5
+ - lora
6
+ - peft
7
+ pipeline_tag: text-generation
8
+ ---
9
+