alexlop/detr_t5_captioning_model

text2text-generation

Generated from Trainer

text-generation-inference


alexlop committed on May 28

Commit 52300da · verified · 1 Parent(s): fd941d4

detr-t5-medical-captioning

Files changed (3)

README.md +5 -13
model.safetensors +1 -1
training_args.bin +1 -1

README.md CHANGED

@@ -16,8 +16,8 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [t5-small](https://huggingface.co/t5-small) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.2843
-- Rougel: 0.1179
 ## Model description
@@ -42,23 +42,15 @@ The following hyperparameters were used during training:
 - seed: 42
 - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
-- num_epochs: 10
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Rougel |
 |:-------------:|:-----:|:----:|:---------------:|:------:|
-| 1.0915        | 1.0   | 236  | 1.0522          | 0.1784 |
-| 0.6349        | 2.0   | 472  | 0.5972          | 0.1179 |
-| 0.5186        | 3.0   | 708  | 0.4307          | 0.1179 |
-| 0.5036        | 4.0   | 944  | 0.3281          | 0.4648 |
-| 0.4559        | 5.0   | 1180 | 0.3233          | 0.1179 |
-| 0.3973        | 6.0   | 1416 | 0.3013          | 0.3958 |
-| 0.4032        | 7.0   | 1652 | 0.2943          | 0.1179 |
-| 0.3626        | 8.0   | 1888 | 0.2892          | 0.1179 |
-| 0.3783        | 9.0   | 2124 | 0.2878          | 0.1179 |
-| 0.413         | 10.0  | 2360 | 0.2843          | 0.1179 |
 ### Framework versions

 This model is a fine-tuned version of [t5-small](https://huggingface.co/t5-small) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.9834
+- Rougel: 0.1565
 ## Model description
 - seed: 42
 - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
+- num_epochs: 2
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Rougel |
 |:-------------:|:-----:|:----:|:---------------:|:------:|
+| 1.3246        | 1.0   | 236  | 1.2553          | 0.1565 |
+| 0.9549        | 2.0   | 472  | 0.9834          | 0.1565 |
 ### Framework versions

model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:d921af22e2699e03bad483400d86c44f17d93d964f9a0f54442a8b313a310e91
 size 242041896

 version https://git-lfs.github.com/spec/v1
+oid sha256:d78973b1986e90c854e2892a4b3514125873723548be5745e1d1a9a218119ce4
 size 242041896

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:3710c348a61bbd9b095fb5ce2407f35ea389943a3cb083c84c16893460e927b2
 size 5368

 version https://git-lfs.github.com/spec/v1
+oid sha256:baae9994b055f7496a6f4887f7576f7e5e8bc655719478ffa89969174c8c3920
 size 5368
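
Both binary files are stored as Git LFS pointers, so the diffs above change only the `oid` line while `size` stays the same. One way to check that a locally downloaded file corresponds to the new pointer is to compare its byte size and SHA-256 digest against the pointer fields. The following is a minimal sketch under that assumption; the helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical and are not part of this repository or of any Hugging Face library.

```python
import hashlib
from pathlib import Path


def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer file (hypothetical helper)."""
    fields = {}
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid field has the form '<algorithm>:<hex digest>', e.g. 'sha256:d789...'.
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer: dict) -> bool:
    """Return True if the file at `path` has the size and digest the pointer records."""
    p = Path(path)
    if p.stat().st_size != pointer["size"]:
        return False
    hasher = hashlib.new(pointer["algo"])
    with p.open("rb") as f:
        # Read in 1 MiB chunks so large weight files are not loaded into memory at once.
        for chunk in iter(lambda: f.read(1 << 20), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]


# Pointer text copied from the new side of the model.safetensors diff above.
NEW_SAFETENSORS_POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:d78973b1986e90c854e2892a4b3514125873723548be5745e1d1a9a218119ce4
size 242041896
"""

pointer = parse_lfs_pointer(NEW_SAFETENSORS_POINTER)
# matches_pointer("model.safetensors", pointer) would then check a local copy,
# assuming the file has been downloaded to the working directory.
```

Checking the size first is a cheap early exit: a file of the wrong length cannot match, so the digest is only computed when the sizes agree.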