This model aims to test the conversion between Megatron-LM and transformers. It is a small GPT-2-like model that has been used to debug the script. Use it only for integration tests
Downloads last month
15,840
Safetensors
Model size
16.2M params
Tensor type
BF16
·
Model tree for bigscience/bigscience-small-testing