jspr
/

bts_mistral_7b_v3_32k_merged

Text Generation

text-generation-inference

Model card Files Files and versions

bts_mistral_7b_v3_32k_merged / README.md

jspr's picture

Create README.md

3700967 verified over 1 year ago

|

history blame contribute delete

330 Bytes

	Mistral-7B finetuned on a dataset of BTS fanfic at 32k context.

	This model uses the alpaca format:
	```
	{"instruction": "An interaction between a user providing instructions, and an imaginative assistant providing responses.", "input": "...", "output": "..."}
	```
	Note RoPE scaling parameter 8.0, with RoPE scaling type linear