smohammadi/tinyllama_rm_sentiment_1b

Text Classification

Generated from Trainer

text-generation-inference


smohammadi committed on Jun 28, 2024

Commit 8dad0d0 · verified · 1 Parent(s): ae8eb87

Update README.md

Files changed (1)

README.md +26 -2

README.md CHANGED

@@ -24,8 +24,32 @@ It achieves the following results on the evaluation set:
 ## Model description
-More information needed
 ## Intended uses & limitations
 More information needed

 ## Model description
+Trained using:
+```
+python trl/examples/scripts/rm/rm.py \
+--dataset_name trl-internal-testing/sentiment-trl-style \
+--dataset_train_split train \
+--dataset_eval_split test \
+--model_name_or_path TinyLlama/TinyLlama_v1.1 \
+--chat_template simple_concat \
+--learning_rate 3e-6 \
+--per_device_train_batch_size 32 \
+--per_device_eval_batch_size 32 \
+--gradient_accumulation_steps 1 \
+--logging_steps 1 \
+--eval_strategy steps \
+--max_token_length 1024 \
+--max_prompt_token_lenth 1024 \
+--remove_unused_columns False \
+--num_train_epochs 1 \
+--eval_steps 100 \
+ --output_dir models/ppo_torchtune/tinyllama/tinyllama_rm_sentiment_1b \
+ --push_to_hub
+```
+on the "dataset-processor" branch of trl:
+git clone -b "dataset-processor" https://github.com/huggingface/trl
 ## Intended uses & limitations
 More information needed
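
The README gives the training command first and the branch checkout second; to reproduce the run, the clone presumably has to come first. A minimal dry-run sketch of that ordering follows. The helpers `run` and `setup_and_train` and the `DRY_RUN` variable are hypothetical, introduced only for illustration; the clone URL, branch name, script path, and flags are copied unchanged from the diff above, including the spelling of `--max_prompt_token_lenth`. Installing dependencies is not covered by the README and is left out here.

```shell
#!/bin/sh
# Dry-run sketch: prints the commands by default; set DRY_RUN=0 to execute them.
# `run`, `setup_and_train`, and DRY_RUN are hypothetical helpers, not part of trl.

run() {
    if [ "${DRY_RUN:-1}" = "1" ]; then
        echo "$*"        # show the command instead of running it
    else
        "$@"
    fi
}

setup_and_train() {
    # Step 1: fetch the branch named in the README (creates ./trl).
    run git clone -b dataset-processor https://github.com/huggingface/trl || return 1

    # Step 2: run the reward-model script with the README's flags, unchanged.
    # The flag spelling "--max_prompt_token_lenth" is kept as the README has it.
    run python trl/examples/scripts/rm/rm.py \
        --dataset_name trl-internal-testing/sentiment-trl-style \
        --dataset_train_split train \
        --dataset_eval_split test \
        --model_name_or_path TinyLlama/TinyLlama_v1.1 \
        --chat_template simple_concat \
        --learning_rate 3e-6 \
        --per_device_train_batch_size 32 \
        --per_device_eval_batch_size 32 \
        --gradient_accumulation_steps 1 \
        --logging_steps 1 \
        --eval_strategy steps \
        --max_token_length 1024 \
        --max_prompt_token_lenth 1024 \
        --remove_unused_columns False \
        --num_train_epochs 1 \
        --eval_steps 100 \
        --output_dir models/ppo_torchtune/tinyllama/tinyllama_rm_sentiment_1b \
        --push_to_hub
}

setup_and_train
```

With the default `DRY_RUN=1` the script only prints the two commands, which makes it safe to inspect before launching a real run with `DRY_RUN=0`.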