CodeDPO/qwen25-ins-7b-coderm_new_margin_scalebt-7b-reinforce-plus-episode_1
Text Generation
Transformers
Safetensors
qwen2
conversational
text-generation-inference
arxiv:1910.09700
30.5 GB
1 contributor
History: 2 commits
Latest commit: DongfuJiang, "Upload Qwen2ForCausalLM", d3cba65 (verified), 10 months ago
File                              Size       Scan  Storage  Last commit              Updated
.gitattributes                    1.52 kB    Safe  -        initial commit           10 months ago
README.md                         5.17 kB    Safe  -        Upload Qwen2ForCausalLM  10 months ago
config.json                       737 Bytes  -     -        Upload Qwen2ForCausalLM  10 months ago
generation_config.json            242 Bytes  Safe  -        Upload Qwen2ForCausalLM  10 months ago
model-00001-of-00007.safetensors  4.98 GB    -     xet      Upload Qwen2ForCausalLM  10 months ago
model-00002-of-00007.safetensors  4.78 GB    -     xet      Upload Qwen2ForCausalLM  10 months ago
model-00003-of-00007.safetensors  4.93 GB    -     xet      Upload Qwen2ForCausalLM  10 months ago
model-00004-of-00007.safetensors  4.93 GB    -     xet      Upload Qwen2ForCausalLM  10 months ago
model-00005-of-00007.safetensors  5 GB       -     xet      Upload Qwen2ForCausalLM  10 months ago
model-00006-of-00007.safetensors  3.66 GB    -     xet      Upload Qwen2ForCausalLM  10 months ago
model-00007-of-00007.safetensors  2.18 GB    -     xet      Upload Qwen2ForCausalLM  10 months ago
model.safetensors.index.json      27.8 kB    Safe  -        Upload Qwen2ForCausalLM  10 months ago