Michael Goin
mgoin
AI & ML interests
LLM inference optimization, compression, quantization, pruning, distillation
Recent Activity
new activity
6 days ago
GadflyII/GLM-4.7-Flash-MXFP4:Update MXFP4 format to compressed-tensors updated
a model 8 days ago
mgoin/Qwen3-0.6B-MXFP8 published
a model 8 days ago
mgoin/Qwen3-0.6B-MXFP8