--- license: apache-2.0 --- The Hugging Face fast tokenizer for LLM-jp ABCI challenge 2023. The vocab size is 96,867. Requirements: - transformers>=4.34.0 - tokenizers>=0.14.0 - torch Usage: ```Python from transformers import AutoTokenizer tokenizer = AutoTokenizer.from_pretrained("llm-jp/hf-fast-tokenizer-v22b2") ```