Finetuned model has smaller model.safetensors size

by nam-withpi - opened Mar 26, 2025

Mar 26, 2025

I fine-tuned this model for a sequence classification task but the model.safetensors file of the fine-tuned checkpoint is much smaller than the base model, 2.44GB (fine-tuned checkpoint) versus 3.02GB (based model).

Also when I eval the checkpoint model from the saved checkpoint, the checkpoint performance is different from what reported in the during the training loop. I suspect that somehow the saved model is missing module weights.

Please help with this issue. Much appreciated!

Nicolas-BZRD

EuroBERT org Mar 27, 2025

Hey @nam-withpi , could you share your training and saving code with us? We'll take a look at it 🙌

wilfoderek

Mar 27, 2025

I fine-tuned this model for a sequence classification task but the model.safetensors file of the fine-tuned checkpoint is much smaller than the base model, 2.44GB (fine-tuned checkpoint) versus 3.02GB (based model).

Also when I eval the checkpoint model from the saved checkpoint, the checkpoint performance is different from what reported in the during the training loop. I suspect that somehow the saved model is missing module weights.

Please help with this issue. Much appreciated!

Have you tried for information retrieval?

lgcharpe

about 10 hours ago

In case you are still wondering about the difference in size, it is due to the droping of the lm_head which adds about 130M parameters to the model.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment