view article Article Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries +7 6 days ago • 52
Distil Efficiency Benchmarks Collection Collection of models used in the blog post www.distillabs.ai/blog/the-10x-inference-tax-you-dont-have-to-pay • 9 items • Updated 14 days ago • 3
Quantized Qwen3.5 Collection Verified models. Compatible with Transformers v5.3 and vLLM v0.16.1rc1 (nightly). Under evaluation. • 9 items • Updated 4 days ago • 9
huihui-ai/Huihui-Qwen3.5-35B-A3B-abliterated Image-Text-to-Text • 36B • Updated 14 days ago • 37.5k • 221
Qwen3-MoE Collection Compressed Qwen3 MoE models with a reduced number of experts. See additional models at https://huggingface.co/bknyaz. • 9 items • Updated Feb 11 • 3
Cerebras REAP Collection Sparse MoE models compressed using REAP (Router-weighted Expert Activation Pruning) method • 30 items • Updated 19 days ago • 131