DanielChenJH commited on
Commit
02df7d7
·
verified ·
1 Parent(s): 71c62f9

Model save

Browse files
Files changed (1) hide show
  1. README.md +2 -5
README.md CHANGED
@@ -40,15 +40,12 @@ The following hyperparameters were used during training:
40
  - train_batch_size: 3
41
  - eval_batch_size: 8
42
  - seed: 42
43
- - distributed_type: multi-GPU
44
- - num_devices: 3
45
  - gradient_accumulation_steps: 2
46
- - total_train_batch_size: 18
47
- - total_eval_batch_size: 24
48
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
49
  - lr_scheduler_type: constant
50
  - lr_scheduler_warmup_ratio: 0.03
51
- - num_epochs: 1
52
 
53
  ### Training results
54
 
 
40
  - train_batch_size: 3
41
  - eval_batch_size: 8
42
  - seed: 42
 
 
43
  - gradient_accumulation_steps: 2
44
+ - total_train_batch_size: 6
 
45
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
46
  - lr_scheduler_type: constant
47
  - lr_scheduler_warmup_ratio: 0.03
48
+ - num_epochs: 3
49
 
50
  ### Training results
51