s-mizuki-nlp commited on
Commit
b781fc6
·
verified ·
1 Parent(s): e55a3a6

v0.3 was developed by instruction-tuning of v0.1 base model.

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -23,7 +23,7 @@ coding contents, etc (see the Training Datasets section of the base model) for c
23
  The instruction-tuned models (Instruct) were built by supervised fine-tuning (SFT) on the synthetic data specially built for Japanese.
24
  See the Swallow Model Index section to find other model variants.
25
 
26
- **Note**: [Llama-3.1-Swallow-70B-Instruct-v0.3](https://huggingface.co/tokyotech-llm/Llama-3.1-Swallow-70B-Instruct-v0.3) model was continually pre-trained from the [meta-llama/Llama-3.1-70B](https://huggingface.co/meta-llama/Llama-3.1-70B) and then instruction-tuned with our instruction datasets.
27
 
28
  # Release History
29
 
 
23
  The instruction-tuned models (Instruct) were built by supervised fine-tuning (SFT) on the synthetic data specially built for Japanese.
24
  See the Swallow Model Index section to find other model variants.
25
 
26
+ **Note**: [Llama-3.1-Swallow-70B-Instruct-v0.3](https://huggingface.co/tokyotech-llm/Llama-3.1-Swallow-70B-Instruct-v0.3) model was an instruction-tuned version of [Llama-3.1-Swallow-70B-v0.1](https://huggingface.co/tokyotech-llm/Llama-3.1-Swallow-70B-v0.1) with our instruction datasets.
27
 
28
  # Release History
29