Learning Video LLM with Streaming Speech Transcription at Scale (CVPR 2025)
Joya Chen PRO
chenjoya
AI & ML interests
Video LLM
Recent Activity
upvoted
a
paper
about 21 hours ago
Robix: A Unified Model for Robot Interaction, Reasoning and Planning
upvoted
a
paper
1 day ago
Why Language Models Hallucinate
upvoted
a
paper
5 days ago
Draw-In-Mind: Learning Precise Image Editing via Chain-of-Thought
Imagination