zai-org
/

LongReward-llama3.1-8b-DPO

Text Generation

text-generation-inference

Model card Files Files and versions Community

LongReward-llama3.1-8b-DPO

16.1 GB

2 contributors

History: 5 commits

NeoZ123's picture

Update README.md

7311e4c verified 11 months ago

.gitattributes

1.52 kB

initial commit 11 months ago
README.md

4.54 kB

Update README.md 11 months ago
config.json

908 Bytes

Upload folder using huggingface_hub 11 months ago
configuration.json

48 Bytes

Upload folder using huggingface_hub 11 months ago
generation_config.json

185 Bytes

Upload folder using huggingface_hub 11 months ago
model-00000-of-00005.safetensors

4.36 GB
LFS

Upload folder using huggingface_hub 11 months ago
model-00001-of-00005.safetensors

4.36 GB
LFS

Upload folder using huggingface_hub 11 months ago
model-00002-of-00005.safetensors

4.36 GB
LFS

Upload folder using huggingface_hub 11 months ago
model-00003-of-00005.safetensors

872 MB
LFS

Upload folder using huggingface_hub 11 months ago
model-00004-of-00005.safetensors

2.1 GB
LFS

Upload folder using huggingface_hub 11 months ago
model.safetensors.index.json

23.9 kB

Upload folder using huggingface_hub 11 months ago
modeling_llama.py

60.1 kB

Upload folder using huggingface_hub 11 months ago
tiktoken_tokenizer.py

6.04 kB

Upload folder using huggingface_hub 11 months ago
tokenizer.tiktoken

2.18 MB

Upload folder using huggingface_hub 11 months ago
tokenizer_config.json

234 Bytes

Upload folder using huggingface_hub 11 months ago