End of training

Files changed (4)

README.md CHANGED

@@ -18,7 +18,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [answerdotai/ModernBERT-base](https://huggingface.co/answerdotai/ModernBERT-base) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 6.7977
 - F1: 0.0
 ## Model description
@@ -39,11 +39,11 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 5e-05
-- train_batch_size: 8
-- eval_batch_size: 8
 - seed: 42
 - gradient_accumulation_steps: 4
-- total_train_batch_size: 32
 - optimizer: Use adamw_torch_fused with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
 - num_epochs: 10
@@ -52,13 +52,16 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch  | Step | Validation Loss | F1  |
 |:-------------:|:------:|:----:|:---------------:|:---:|
-| No log        | 1.0    | 3    | 5.3470          | 0.0 |
-| No log        | 2.0    | 6    | 6.6234          | 0.0 |
-| No log        | 3.0    | 9    | 6.9474          | 0.0 |
-| No log        | 4.0    | 12   | 6.8799          | 0.0 |
-| No log        | 5.0    | 15   | 6.8026          | 0.0 |
-| No log        | 6.0    | 18   | 6.8010          | 0.0 |
-| No log        | 6.8889 | 20   | 6.7977          | 0.0 |
 ### Framework versions

 This model is a fine-tuned version of [answerdotai/ModernBERT-base](https://huggingface.co/answerdotai/ModernBERT-base) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 5.4123
 - F1: 0.0
 ## Model description
 The following hyperparameters were used during training:
 - learning_rate: 5e-05
+- train_batch_size: 16
+- eval_batch_size: 16
 - seed: 42
 - gradient_accumulation_steps: 4
+- total_train_batch_size: 64
 - optimizer: Use adamw_torch_fused with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
 - num_epochs: 10
 | Training Loss | Epoch  | Step | Validation Loss | F1  |
 |:-------------:|:------:|:----:|:---------------:|:---:|
+| No log        | 0.5714 | 1    | 4.9327          | 0.0 |
+| No log        | 1.5714 | 2    | 5.0024          | 0.0 |
+| No log        | 2.5714 | 3    | 5.0481          | 0.0 |
+| No log        | 3.5714 | 4    | 5.1106          | 0.0 |
+| No log        | 4.5714 | 5    | 5.1695          | 0.0 |
+| No log        | 5.5714 | 6    | 5.2392          | 0.0 |
+| No log        | 6.5714 | 7    | 5.2945          | 0.0 |
+| No log        | 7.5714 | 8    | 5.3534          | 0.0 |
+| No log        | 8.5714 | 9    | 5.3930          | 0.0 |
+| No log        | 9.5714 | 10   | 5.4123          | 0.0 |
 ### Framework versions
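
Reviewer note: the batch-size lines in this hunk are linked by simple arithmetic, which the sketch below makes explicit. It assumes that `train_batch_size` is the per-device batch size and that training ran on a single device; the device count is not stated in the diff, so `num_devices=1` is an assumption, although it is consistent with both the old values (8 and 32) and the new ones (16 and 64). The function name is hypothetical and is not part of any library.

```python
def total_train_batch_size(per_device_batch_size: int,
                           gradient_accumulation_steps: int,
                           num_devices: int = 1) -> int:
    """Number of samples that contribute to one optimizer update.

    num_devices=1 is an assumption; the diff does not state the device count.
    """
    return per_device_batch_size * gradient_accumulation_steps * num_devices


# Values taken from the removed (-) and added (+) README lines.
old_total = total_train_batch_size(per_device_batch_size=8,
                                   gradient_accumulation_steps=4)
new_total = total_train_batch_size(per_device_batch_size=16,
                                   gradient_accumulation_steps=4)

print(old_total, new_total)
```

Because `gradient_accumulation_steps` is unchanged at 4, doubling `train_batch_size` doubles the number of samples per optimizer update, which fits the drop in logged optimizer steps from 20 to 10 in the results table.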

config.json CHANGED

The diff for this file is too large to render.

model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:a80d8a35b612a8a72d006bf1cfb7cd5cfbd8a5a72a83626fd7d1b5af55d35ab9
-size 598713556

 version https://git-lfs.github.com/spec/v1
+oid sha256:dae5b845e137f8c3713c3ca737597fe7d49badc29f45578b3a7fb4a2184868b1
+size 598824292

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:9269ad190c1ca8ed7534aadf7c86f6093da7d9c0403d18bd3282f7b234274baf
 size 5304

 version https://git-lfs.github.com/spec/v1
+oid sha256:b8a5e300df48c6f5de8d6a74016ff67990c4514dd7e57ba8ddcfe90cae73e5e2
 size 5304
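
Reviewer note: the `model.safetensors` and `training_args.bin` entries above are Git LFS pointer files, not the binaries themselves. Each pointer records a spec version, a SHA-256 object ID, and a size in bytes. The sketch below shows one way to check a downloaded file against such a pointer. The helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical, and the code assumes the three-field `key value` layout shown in this diff rather than every extension the LFS spec allows.

```python
import hashlib

# New model.safetensors pointer, copied from the added (+) lines of the diff.
NEW_MODEL_POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:dae5b845e137f8c3713c3ca737597fe7d49badc29f45578b3a7fb4a2184868b1
size 598824292
"""


def parse_lfs_pointer(text: str) -> dict:
    """Parse a Git LFS pointer made of 'key value' lines into a dict."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path: str, pointer: dict, chunk_size: int = 1 << 20) -> bool:
    """Return True if the file at `path` has the size and hash in `pointer`."""
    hasher = hashlib.new(pointer["algo"])
    size = 0
    with open(path, "rb") as f:
        # Read in chunks so large weight files are never fully held in memory.
        for chunk in iter(lambda: f.read(chunk_size), b""):
            hasher.update(chunk)
            size += len(chunk)
    return size == pointer["size"] and hasher.hexdigest() == pointer["digest"]


pointer = parse_lfs_pointer(NEW_MODEL_POINTER)
print(pointer["algo"], pointer["size"])
```

With a local copy of the weights, `matches_pointer("model.safetensors", pointer)` would confirm that the file corresponds to this commit; the path here is illustrative.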