TheRamsay commited on
Commit
322b721
·
verified ·
1 Parent(s): 6248493

End of training

Browse files
Files changed (2) hide show
  1. README.md +13 -13
  2. generation_config.json +0 -1
README.md CHANGED
@@ -21,7 +21,7 @@ model-index:
21
  metrics:
22
  - name: Wer
23
  type: wer
24
- value: 1.0114942528735633
25
  ---
26
 
27
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -31,8 +31,8 @@ should probably proofread and complete it, then remove this comment. -->
31
 
32
  This model is a fine-tuned version of [](https://huggingface.co/) on the common_voice_17_0 dataset.
33
  It achieves the following results on the evaluation set:
34
- - Loss: 2.3875
35
- - Wer: 1.0115
36
 
37
  ## Model description
38
 
@@ -51,7 +51,7 @@ More information needed
51
  ### Training hyperparameters
52
 
53
  The following hyperparameters were used during training:
54
- - learning_rate: 0.0005
55
  - train_batch_size: 16
56
  - eval_batch_size: 8
57
  - seed: 42
@@ -67,15 +67,15 @@ The following hyperparameters were used during training:
67
 
68
  | Training Loss | Epoch | Step | Validation Loss | Wer |
69
  |:-------------:|:-------:|:-----:|:---------------:|:------:|
70
- | 0.5699 | 2.1942 | 2000 | 0.5029 | 0.9499 |
71
- | 0.4929 | 4.3884 | 4000 | 0.4351 | 0.9349 |
72
- | 0.4568 | 6.5826 | 6000 | 0.3928 | 0.9080 |
73
- | 0.4274 | 8.7767 | 8000 | 0.3524 | 0.8848 |
74
- | 0.389 | 10.9709 | 10000 | 0.3127 | 0.8331 |
75
- | 0.6396 | 13.1651 | 12000 | 0.5697 | 0.9324 |
76
- | 2.5933 | 15.3593 | 14000 | 2.3935 | 1.0109 |
77
- | 2.5858 | 17.5535 | 16000 | 2.3892 | 1.0120 |
78
- | 2.5724 | 19.7477 | 18000 | 2.3875 | 1.0115 |
79
 
80
 
81
  ### Framework versions
 
21
  metrics:
22
  - name: Wer
23
  type: wer
24
+ value: 0.6237000547345375
25
  ---
26
 
27
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 
31
 
32
  This model is a fine-tuned version of [](https://huggingface.co/) on the common_voice_17_0 dataset.
33
  It achieves the following results on the evaluation set:
34
+ - Loss: 0.2006
35
+ - Wer: 0.6237
36
 
37
  ## Model description
38
 
 
51
  ### Training hyperparameters
52
 
53
  The following hyperparameters were used during training:
54
+ - learning_rate: 3e-05
55
  - train_batch_size: 16
56
  - eval_batch_size: 8
57
  - seed: 42
 
67
 
68
  | Training Loss | Epoch | Step | Validation Loss | Wer |
69
  |:-------------:|:-------:|:-----:|:---------------:|:------:|
70
+ | 0.3837 | 2.1942 | 2000 | 0.3241 | 0.8196 |
71
+ | 0.3176 | 4.3884 | 4000 | 0.2855 | 0.7830 |
72
+ | 0.2886 | 6.5826 | 6000 | 0.2620 | 0.7499 |
73
+ | 0.2659 | 8.7767 | 8000 | 0.2431 | 0.7154 |
74
+ | 0.2464 | 10.9709 | 10000 | 0.2285 | 0.6877 |
75
+ | 0.2252 | 13.1651 | 12000 | 0.2163 | 0.6552 |
76
+ | 0.2132 | 15.3593 | 14000 | 0.2087 | 0.6461 |
77
+ | 0.2083 | 17.5535 | 16000 | 0.2032 | 0.6286 |
78
+ | 0.2034 | 19.7477 | 18000 | 0.2006 | 0.6237 |
79
 
80
 
81
  ### Framework versions
generation_config.json CHANGED
@@ -1,7 +1,6 @@
1
  {
2
  "_from_model_config": true,
3
  "bos_token_id": 50256,
4
- "decoder_start_token_id": 50256,
5
  "eos_token_id": 50256,
6
  "transformers_version": "4.45.2"
7
  }
 
1
  {
2
  "_from_model_config": true,
3
  "bos_token_id": 50256,
 
4
  "eos_token_id": 50256,
5
  "transformers_version": "4.45.2"
6
  }