Update README.md
README.md
CHANGED
@@ -22,6 +22,8 @@ The global all-reduce was done with custom int8 all-reduce kernels to reduce the
 
 For more detailed technical insights, please refer to our [technical paper](https://github.com/PrimeIntellect-ai/prime).
 
+**Note: The model will immediately output EOS token if the BOS token is not set. This is a result of the tensor packing used during training. This can result in terrible eval scores.**
+
 ## Usage
 ```python
 import torch