Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
VoxCPM
Log In
Sign Up
thejaminator
/
1e-5_hf_test_repeat-step-100
like
0
Text Generation
PEFT
Safetensors
lora
arxiv:
1910.09700
Model card
Files
Files and versions
xet
Community
Use this model
29ddc5d
1e-5_hf_test_repeat-step-100
1.4 GB
1 contributor
History:
2 commits
thejaminator
verl GRPO trained model at step 100
29ddc5d
verified
14 days ago
sft_policy
verl GRPO trained model at step 100
14 days ago
.gitattributes
Safe
1.52 kB
initial commit
14 days ago
README.md
5.19 kB
verl GRPO trained model at step 100
14 days ago
adapter_config.json
977 Bytes
verl GRPO trained model at step 100
14 days ago
adapter_model.safetensors
698 MB
xet
verl GRPO trained model at step 100
14 days ago