Transition Models: Rethinking the Generative Learning Objective Paper • 2509.04394 • Published 4 days ago • 25
CineScale: Free Lunch in High-Resolution Cinematic Visual Generation Paper • 2508.15774 • Published 18 days ago • 20
S^2-Guidance: Stochastic Self Guidance for Training-Free Enhancement of Diffusion Models Paper • 2508.12880 • Published 22 days ago • 45
Story2Board: A Training-Free Approach for Expressive Storyboard Generation Paper • 2508.09983 • Published 26 days ago • 67
Stand-In: A Lightweight and Plug-and-Play Identity Control for Video Generation Paper • 2508.07901 • Published 29 days ago • 39
UNCAGE: Contrastive Attention Guidance for Masked Generative Transformers in Text-to-Image Generation Paper • 2508.05399 • Published Aug 7 • 16
Follow-Your-Shape: Shape-Aware Image Editing via Trajectory-Guided Region Control Paper • 2508.08134 • Published 29 days ago • 9
LongVie: Multimodal-Guided Controllable Ultra-Long Video Generation Paper • 2508.03694 • Published Aug 5 • 50
DreamVVT: Mastering Realistic Video Virtual Try-On in the Wild via a Stage-Wise Diffusion Transformer Framework Paper • 2508.02807 • Published Aug 4 • 13
CUDA-L1: Improving CUDA Optimization via Contrastive Reinforcement Learning Paper • 2507.14111 • Published Jul 18 • 23
π^3: Scalable Permutation-Equivariant Visual Geometry Learning Paper • 2507.13347 • Published Jul 17 • 64
TLB-VFI: Temporal-Aware Latent Brownian Bridge Diffusion for Video Frame Interpolation Paper • 2507.04984 • Published Jul 7 • 5
FantasyPortrait: Enhancing Multi-Character Portrait Animation with Expression-Augmented Diffusion Transformers Paper • 2507.12956 • Published Jul 17 • 23