AI & ML interests
None yet
Organizations
Models: 15
caijanfeng/DeepSeek-R1-Distill-Qwen-1.5B-GRPO
caijanfeng/Qwen-2.5-7B-Simple-RL • Text Generation • 8B • 6
caijanfeng/Qwen2.5-7B-Open-R1-Distill-OpenR1-Math-220k
caijanfeng/Qwen2.5-1.5B-Open-R1-Distill-OpenR1-Math-220k • Text Generation • 2B • 3
caijanfeng/Qwen2.5-7B-Open-R1-GRPO • Text Generation • 8B • 33
caijanfeng/Qwen-2.5-1.5B-Simple-RL
caijanfeng/Qwen2.5-7B-Open-R1-Distill-5epoch • Text Generation • 8B • 2
caijanfeng/Qwen2.5-1.5B-Open-R1-Distill-5epoch-repeat • Text Generation • 2B • 1
caijanfeng/Qwen2.5-1.5B-Open-R1-Distill-repeat • Text Generation • 2B • 2
caijanfeng/Qwen2.5-1.5B-Open-R1-Distill • Text Generation • 2B