Fu-En Yang

FuEnYang

https://fuenyang1127.github.io/

AI & ML interests

Computer Vision, Deep Learning, Vision-Language Models (VLMs), Vision-Language-Action Models (VLAs), Reasoning Models, Embodied AI

Recent Activity

upvoted a paper 1 day ago

UniReason 1.0: A Unified Reasoning Framework for World Knowledge Aligned Image Generation and Editing

upvoted a paper 1 day ago

Kimi K2.5: Visual Agentic Intelligence

upvoted a paper 1 day ago

Green-VLA: Staged Vision-Language-Action Model for Generalist Robots

View all activity

Organizations

upvoted 7 papers 1 day ago

UniReason 1.0: A Unified Reasoning Framework for World Knowledge Aligned Image Generation and Editing

Paper • 2602.02437 • Published 3 days ago • 74

EgoActor: Grounding Task Planning into Spatial-aware Egocentric Actions for Humanoid Robots via Visual-Language Models

Paper • 2602.04515 • Published 1 day ago • 32

Quant VideoGen: Auto-Regressive Long Video Generation via 2-Bit KV-Cache Quantization

Paper • 2602.02958 • Published 3 days ago • 31

3D-Aware Implicit Motion Control for View-Adaptive Human Video Generation

Paper • 2602.03796 • Published 2 days ago • 48

upvoted 3 papers 15 days ago

Future Optical Flow Prediction Improves Robot Control & Video Generation

Paper • 2601.10781 • Published 21 days ago • 19

ACoT-VLA: Action Chain-of-Thought for Vision-Language-Action Models

Paper • 2601.11404 • Published 21 days ago • 25

Being-H0.5: Scaling Human-Centric Robot Learning for Cross-Embodiment Generalization

Paper • 2601.12993 • Published 18 days ago • 75

upvoted 8 papers 21 days ago

Transition Matching Distillation for Fast Video Generation

Paper • 2601.09881 • Published 22 days ago • 32

Molmo2: Open Weights and Data for Vision-Language Models with Video Understanding and Grounding

Paper • 2601.10611 • Published 21 days ago • 28

CoF-T2I: Video Models as Pure Visual Reasoners for Text-to-Image Generation

Paper • 2601.10061 • Published 22 days ago • 30

STEP3-VL-10B Technical Report

Paper • 2601.09668 • Published 22 days ago • 193

RL-AWB: Deep Reinforcement Learning for Auto White Balance Correction in Low-Light Night-time Scenes

Paper • 2601.05249 • Published 28 days ago • 46

3AM: Segment Anything with Geometric Consistency in Videos

Paper • 2601.08831 • Published 23 days ago • 34

Motion Attribution for Video Generation

Paper • 2601.08828 • Published 23 days ago • 70

Flow Equivariant World Models: Memory for Partially Observed Dynamic Environments

Paper • 2601.01075 • Published Jan 3 • 6

upvoted 2 papers 22 days ago

Imagine-then-Plan: Agent Learning from Adaptive Lookahead with World Models

Paper • 2601.08955 • Published 23 days ago • 13

Efficient Camera-Controlled Video Generation of Static Scenes via Sparse Diffusion and 3D Rendering

Paper • 2601.09697 • Published 22 days ago • 8

Fu-En Yang

AI & ML interests

Recent Activity

Organizations

FuEnYang's activity

🎉 Free Image Generator Now Available!