End of training
Browse files
README.md
CHANGED
|
@@ -5,9 +5,24 @@ tags:
|
|
| 5 |
- generated_from_trainer
|
| 6 |
datasets:
|
| 7 |
- code_x_glue_tc_text_to_code
|
|
|
|
|
|
|
| 8 |
model-index:
|
| 9 |
- name: codet5-small-java-v1-text-to-code
|
| 10 |
-
results:
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 11 |
---
|
| 12 |
|
| 13 |
<!-- This model card has been generated automatically according to the information the Trainer had access to. You
|
|
@@ -16,6 +31,13 @@ should probably proofread and complete it, then remove this comment. -->
|
|
| 16 |
# codet5-small-java-v1-text-to-code
|
| 17 |
|
| 18 |
This model is a fine-tuned version of [Salesforce/codet5-small](https://huggingface.co/Salesforce/codet5-small) on the code_x_glue_tc_text_to_code dataset.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 19 |
|
| 20 |
## Model description
|
| 21 |
|
|
@@ -43,6 +65,16 @@ The following hyperparameters were used during training:
|
|
| 43 |
- num_epochs: 4
|
| 44 |
- mixed_precision_training: Native AMP
|
| 45 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 46 |
### Framework versions
|
| 47 |
|
| 48 |
- Transformers 4.36.0.dev0
|
|
|
|
| 5 |
- generated_from_trainer
|
| 6 |
datasets:
|
| 7 |
- code_x_glue_tc_text_to_code
|
| 8 |
+
metrics:
|
| 9 |
+
- rouge
|
| 10 |
model-index:
|
| 11 |
- name: codet5-small-java-v1-text-to-code
|
| 12 |
+
results:
|
| 13 |
+
- task:
|
| 14 |
+
name: Sequence-to-sequence Language Modeling
|
| 15 |
+
type: text2text-generation
|
| 16 |
+
dataset:
|
| 17 |
+
name: code_x_glue_tc_text_to_code
|
| 18 |
+
type: code_x_glue_tc_text_to_code
|
| 19 |
+
config: default
|
| 20 |
+
split: validation
|
| 21 |
+
args: default
|
| 22 |
+
metrics:
|
| 23 |
+
- name: Rouge1
|
| 24 |
+
type: rouge
|
| 25 |
+
value: 57.1969
|
| 26 |
---
|
| 27 |
|
| 28 |
<!-- This model card has been generated automatically according to the information the Trainer had access to. You
|
|
|
|
| 31 |
# codet5-small-java-v1-text-to-code
|
| 32 |
|
| 33 |
This model is a fine-tuned version of [Salesforce/codet5-small](https://huggingface.co/Salesforce/codet5-small) on the code_x_glue_tc_text_to_code dataset.
|
| 34 |
+
It achieves the following results on the evaluation set:
|
| 35 |
+
- Loss: 0.7705
|
| 36 |
+
- Rouge1: 57.1969
|
| 37 |
+
- Rouge2: 40.0098
|
| 38 |
+
- Rougel: 55.326
|
| 39 |
+
- Rougelsum: 56.119
|
| 40 |
+
- Gen Len: 16.8335
|
| 41 |
|
| 42 |
## Model description
|
| 43 |
|
|
|
|
| 65 |
- num_epochs: 4
|
| 66 |
- mixed_precision_training: Native AMP
|
| 67 |
|
| 68 |
+
### Training results
|
| 69 |
+
|
| 70 |
+
| Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
|
| 71 |
+
|:-------------:|:-----:|:-----:|:---------------:|:-------:|:-------:|:-------:|:---------:|:-------:|
|
| 72 |
+
| 0.7434 | 1.0 | 6250 | 0.8148 | 55.9045 | 38.592 | 54.0278 | 54.7633 | 16.796 |
|
| 73 |
+
| 0.6708 | 2.0 | 12500 | 0.7868 | 56.3354 | 38.9843 | 54.5278 | 55.2197 | 16.751 |
|
| 74 |
+
| 0.6309 | 3.0 | 18750 | 0.7741 | 56.9883 | 39.8626 | 55.1321 | 55.9173 | 16.8495 |
|
| 75 |
+
| 0.6262 | 4.0 | 25000 | 0.7705 | 57.1969 | 40.0098 | 55.326 | 56.119 | 16.8335 |
|
| 76 |
+
|
| 77 |
+
|
| 78 |
### Framework versions
|
| 79 |
|
| 80 |
- Transformers 4.36.0.dev0
|