Update README.md
README.md
@@ -42,6 +42,27 @@ The summary of the instruction tuning data is as follows:
<center><img src="data_table.jpg" alt="Instruction Data"/></center>

# Instruction Format

We added new special tokens to the CrystalCoder tokenizer to support instruction tuning.

The special tokens used in instruction tuning are:

```
bos: <s>
eos: </s>
system_start: <|sys_start|>
system_end: <|sys_end|>
user_start: <|im_start|>
user_end: <|im_end|>
```

The instruction format is as follows:

```
<s> <|sys_start|> system prompt <|sys_end|> <|im_start|> first user utterance <|im_end|> first model response <|im_start|> next user utterance <|im_end|> next model response </s>
```
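
The format above can be assembled programmatically. The following is a minimal sketch, not the authors' code: `build_prompt` and its parameters are hypothetical names, and it assumes the single spaces in the template are literal separators, which the README does not state explicitly.

```python
from typing import List, Optional, Tuple

# Special tokens as listed in this README.
BOS, EOS = "<s>", "</s>"
SYS_START, SYS_END = "<|sys_start|>", "<|sys_end|>"
USER_START, USER_END = "<|im_start|>", "<|im_end|>"


def build_prompt(
    system_prompt: str,
    turns: List[Tuple[str, Optional[str]]],
    add_eos: bool = True,
) -> str:
    """Assemble a conversation string in the instruction format (hypothetical helper).

    `turns` is a list of (user utterance, model response) pairs. Pass None as
    the response of the last turn to leave it open for the model to complete.
    Assumption: pieces are joined with single spaces, as the template is written.
    """
    parts = [BOS, SYS_START, system_prompt, SYS_END]
    for user_utterance, model_response in turns:
        parts += [USER_START, user_utterance, USER_END]
        if model_response is not None:
            parts.append(model_response)
    if add_eos:
        parts.append(EOS)
    return " ".join(parts)


if __name__ == "__main__":
    # Inference-style prompt: the final response is left open, so no eos is added.
    print(build_prompt("You are a helpful assistant.", [("Hello!", None)], add_eos=False))
```

For generation, the last response is left as `None` and `add_eos=False`, so the string ends right after `<|im_end|>`. If your tokenizer already prepends `<s>` when encoding, you may need to drop the leading `bos` from the string to avoid duplicating it.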

# Reproducing the Results

We will release the training code and the training data soon. Our training code is based on [Megatron-LM](https://github.com/NVIDIA/Megatron-LM), with some modifications to support our training data format and Maximal Update Parametrization (μP).