Article Tokenization in Transformers v5: Simpler, Clearer, and More Modular • 22 days ago • 108
Running on Zero • Qwen Image Multiple Angles 3D Camera 🎥 • 24 • Adjust camera angles in images using 3D controls or sliders
💧 LFM2.5 Collection Collection of Instruct, Base, and Japanese LFM2.5-1.2B models. • 19 items • Updated 3 days ago • 56
DEER: Draft with Diffusion, Verify with Autoregressive Models Paper • 2512.15176 • Published 23 days ago • 42
Beyond Real: Imaginary Extension of Rotary Position Embeddings for Long-Context LLMs Paper • 2512.07525 • Published Dec 8, 2025 • 57
Coupling Experts and Routers in Mixture-of-Experts via an Auxiliary Loss Paper • 2512.23447 • Published 10 days ago • 93
Probing Scientific General Intelligence of LLMs with Scientist-Aligned Workflows Paper • 2512.16969 • Published 21 days ago • 111
DataFlow: An LLM-Driven Framework for Unified Data Preparation and Workflow Automation in the Era of Data-Centric AI Paper • 2512.16676 • Published 21 days ago • 203
Z-Image: An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer Paper • 2511.22699 • Published Nov 27, 2025 • 225
Running • The Eiffel Tower Llama 📝 • 98 • Explore the Eiffel Tower Llama experiment with open-source models