sgraham/modernbert-llm-cidoc-crm

Text Classification

Generated from Trainer

sgraham committed on Jan 15

Commit dadc208 · verified · 1 Parent(s): e442026

End of training

Files changed (3)

README.md +9 -7
model.safetensors +1 -1
training_args.bin +1 -1

README.md CHANGED

@@ -18,7 +18,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [answerdotai/ModernBERT-base](https://huggingface.co/answerdotai/ModernBERT-base) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 4.7225
 - F1: 0.0
 ## Model description
@@ -46,17 +46,19 @@ The following hyperparameters were used during training:
 - total_train_batch_size: 32
 - optimizer: Use adamw_torch_fused with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
-- num_epochs: 5
-- mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch  | Step | Validation Loss | F1  |
 |:-------------:|:------:|:----:|:---------------:|:---:|
-| No log        | 1.0    | 3    | 4.7336          | 0.0 |
-| No log        | 2.0    | 6    | 4.6789          | 0.0 |
-| No log        | 3.0    | 9    | 4.7128          | 0.0 |
-| No log        | 3.4444 | 10   | 4.7225          | 0.0 |
 ### Framework versions

 This model is a fine-tuned version of [answerdotai/ModernBERT-base](https://huggingface.co/answerdotai/ModernBERT-base) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 6.7977
 - F1: 0.0
 ## Model description
 - total_train_batch_size: 32
 - optimizer: Use adamw_torch_fused with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
+- num_epochs: 10
 ### Training results
 | Training Loss | Epoch  | Step | Validation Loss | F1  |
 |:-------------:|:------:|:----:|:---------------:|:---:|
+| No log        | 1.0    | 3    | 5.3470          | 0.0 |
+| No log        | 2.0    | 6    | 6.6234          | 0.0 |
+| No log        | 3.0    | 9    | 6.9474          | 0.0 |
+| No log        | 4.0    | 12   | 6.8799          | 0.0 |
+| No log        | 5.0    | 15   | 6.8026          | 0.0 |
+| No log        | 6.0    | 18   | 6.8010          | 0.0 |
+| No log        | 6.8889 | 20   | 6.7977          | 0.0 |
 ### Framework versions
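
The added rows of the training results table can be checked mechanically. The sketch below is only an illustration and not part of the commit: it copies the seven `+` rows into a string, parses them, and reports where the validation loss was lowest and whether F1 ever rose above zero. The names `ROWS` and `parse_rows` are hypothetical helpers introduced here.

```python
# Rows copied from the "+" side of the training results table in this diff.
ROWS = """\
| No log        | 1.0    | 3    | 5.3470          | 0.0 |
| No log        | 2.0    | 6    | 6.6234          | 0.0 |
| No log        | 3.0    | 9    | 6.9474          | 0.0 |
| No log        | 4.0    | 12   | 6.8799          | 0.0 |
| No log        | 5.0    | 15   | 6.8026          | 0.0 |
| No log        | 6.0    | 18   | 6.8010          | 0.0 |
| No log        | 6.8889 | 20   | 6.7977          | 0.0 |
"""


def parse_rows(text):
    """Return (epoch, step, validation_loss, f1) tuples from markdown table rows."""
    parsed = []
    for line in text.strip().splitlines():
        # Drop the outer pipes, then split the remaining cells.
        cells = [cell.strip() for cell in line.strip().strip("|").split("|")]
        _training_loss, epoch, step, val_loss, f1 = cells
        parsed.append((float(epoch), int(step), float(val_loss), float(f1)))
    return parsed


rows = parse_rows(ROWS)
best = min(rows, key=lambda r: r[2])   # row with the lowest validation loss
final = rows[-1]                       # last logged evaluation

print(f"lowest validation loss: {best[2]} at step {best[1]}")
print(f"final validation loss:  {final[2]} at step {final[1]}")
print("F1 ever above zero:", any(r[3] > 0 for r in rows))
```

On these rows the lowest validation loss is the first evaluation (step 3), and F1 stays at 0.0 throughout, so the extra epochs in this commit did not improve either reported metric.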

model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:e853245fffb481c4da98084143711caaecb39fc167eafb6a721553f21f200fac
 size 598713556

 version https://git-lfs.github.com/spec/v1
+oid sha256:a80d8a35b612a8a72d006bf1cfb7cd5cfbd8a5a72a83626fd7d1b5af55d35ab9
 size 598713556

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:29e899ab7067e129b7a31fa66f876704c1cc5073d1b4750c103170d668cd80a1
 size 5304

 version https://git-lfs.github.com/spec/v1
+oid sha256:9269ad190c1ca8ed7534aadf7c86f6093da7d9c0403d18bd3282f7b234274baf
 size 5304
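
Both binary files are stored as Git LFS pointers, so the diff shows only a new `oid` while `size` stays the same. As a minimal sketch (not part of the commit), a downloaded file could be checked against its pointer by comparing byte size and SHA-256 digest. The helpers `parse_lfs_pointer` and `matches_pointer` are hypothetical names introduced for illustration, and the local file path is whatever the reader downloaded.

```python
import hashlib


def parse_lfs_pointer(text):
    """Parse the key/value lines of a Git LFS pointer file."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer, chunk_size=1 << 20):
    """Return True if the file at `path` has the pointer's size and digest."""
    if pointer["algo"] != "sha256":
        raise ValueError(f"unsupported hash algorithm: {pointer['algo']}")
    hasher = hashlib.sha256()
    size = 0
    with open(path, "rb") as handle:
        # Read in chunks so large weight files are not loaded into memory at once.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
            size += len(chunk)
    return size == pointer["size"] and hasher.hexdigest() == pointer["digest"]


# New-side pointer for training_args.bin, copied from the diff above.
POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:9269ad190c1ca8ed7534aadf7c86f6093da7d9c0403d18bd3282f7b234274baf
size 5304
"""

pointer = parse_lfs_pointer(POINTER)
print(pointer["algo"], pointer["size"])
```

Calling `matches_pointer("training_args.bin", pointer)` on a local copy would then confirm that the download corresponds to this commit rather than to the parent.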