bts_mistral_7b_v3_32k_merged / README.md

jspr

Create README.md

3700967 verified over 1 year ago

preview code

raw

history blame contribute delete

330 Bytes

Mistral-7B finetuned on a dataset of BTS fanfic at 32k context.

This model uses the alpaca format:

{"instruction": "An interaction between a user providing instructions, and an imaginative assistant providing responses.", "input": "...", "output": "..."}

Note RoPE scaling parameter 8.0, with RoPE scaling type linear