weiweiz1/DeepSeek-V2-Lite-MXFP4-RTN


DeepSeek-V2-Lite-MXFP4-RTN / quantization_config.json

Upload folder using huggingface_hub

4cfd339 verified 4 months ago


{
  "bits": 4,
  "group_size": 32,
  "sym": true,
  "data_type": "mx_fp4e2m1",
  "batch_size": 4,
  "iters": 0,
  "nsamples": 512,
  "autoround_version": "0.5.1",
  "quant_method": "auto-round",
  "packing_format": "mx_fp",
  "scale_format": ["e8m0"],
  "scale_calculation_mode": ["even"]
}
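
The config above describes 4-bit E2M1 floating-point weights that share one E8M0 (power-of-two) scale per group of 32. The Python sketch below illustrates what those fields imply numerically. It is not auto-round's implementation: the helper names (`effective_bits_per_weight`, `quantize_group`, `quantize_row`) are hypothetical, the scale is chosen with the floor-of-log2 rule commonly described for MX formats, and the `"even"` value of `scale_calculation_mode` is not modeled because its exact semantics are not stated here.

```python
import json
import math

# Copy of the config shown above, embedded so the sketch is self-contained.
CONFIG_TEXT = """
{
  "bits": 4, "group_size": 32, "sym": true, "data_type": "mx_fp4e2m1",
  "batch_size": 4, "iters": 0, "nsamples": 512,
  "autoround_version": "0.5.1", "quant_method": "auto-round",
  "packing_format": "mx_fp", "scale_format": ["e8m0"],
  "scale_calculation_mode": ["even"]
}
"""
cfg = json.loads(CONFIG_TEXT)

# Magnitudes representable by a 4-bit E2M1 value (1 sign, 2 exponent, 1 mantissa bit).
E2M1_MAGNITUDES = (0.0, 0.5, 1.0, 1.5, 2.0, 3.0, 4.0, 6.0)
E2M1_EMAX = 2    # exponent of the largest power of two in E2M1 (4 = 2**2)
SCALE_BITS = 8   # an E8M0 scale is an 8-bit exponent with no mantissa


def effective_bits_per_weight(config):
    """Storage cost per weight: element bits plus the amortized shared scale."""
    return config["bits"] + SCALE_BITS / config["group_size"]


def quantize_group(values):
    """Quantize then dequantize one group with a shared power-of-two scale.

    Assumption: scale = 2**(floor(log2(max_abs)) - E2M1_EMAX). The E8M0
    exponent range limit is ignored in this sketch.
    """
    max_abs = max(abs(v) for v in values)
    if max_abs == 0.0:
        return 1.0, [0.0] * len(values)
    scale = 2.0 ** (math.floor(math.log2(max_abs)) - E2M1_EMAX)
    dequantized = []
    for v in values:
        # Clamp to the largest E2M1 magnitude, then snap to the nearest one.
        # Ties resolve toward the smaller magnitude in this sketch.
        mag = min(abs(v) / scale, E2M1_MAGNITUDES[-1])
        nearest = min(E2M1_MAGNITUDES, key=lambda m: abs(m - mag))
        dequantized.append(math.copysign(nearest * scale, v))
    return scale, dequantized


def quantize_row(row, group_size):
    """Apply quantize_group to consecutive groups of `group_size` values."""
    scales, dequantized = [], []
    for start in range(0, len(row), group_size):
        scale, group = quantize_group(row[start:start + group_size])
        scales.append(scale)
        dequantized.extend(group)
    return scales, dequantized


if __name__ == "__main__":
    print(effective_bits_per_weight(cfg))
    print(quantize_row([0.1 * i for i in range(64)], cfg["group_size"])[0])
```

With `"iters": 0` no tuning iterations are requested, which is consistent with the round-to-nearest (RTN) suffix in the repository name. The sketch accordingly performs plain rounding only.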