A custom PyTorch implementation of the Gemma-3 270M model. For full details see the original notebook.
Download model, config, and tokenizer. Instantiate Gemma3Model, load weights and config as shown in this repo.