5 99 32

Ha-Yeong Choi

Ha0

https://scholar.google.com/citations?user=Jw3X6UgAAAAJ&hl=ko

hayeong0

AI & ML interests

Speech Synthesis, Voice Conversion, Generative Models

Recent Activity

upvoted a paper 1 day ago

DMax: Aggressive Parallel Decoding for dLLMs

upvoted a paper 1 day ago

HY-Embodied-0.5: Embodied Foundation Models for Real-World Agents

upvoted a paper 4 days ago

TriAttention: Efficient Long Reasoning with Trigonometric KV Compression

View all activity

Organizations

None yet

upvoted 2 papers 1 day ago

DMax: Aggressive Parallel Decoding for dLLMs

Paper • 2604.08302 • Published 3 days ago • 36

HY-Embodied-0.5: Embodied Foundation Models for Real-World Agents

Paper • 2604.07430 • Published 4 days ago • 140

upvoted a paper 4 days ago

TriAttention: Efficient Long Reasoning with Trigonometric KV Compression

Paper • 2604.04921 • Published 6 days ago • 99

upvoted a paper 12 days ago

ShotStream: Streaming Multi-Shot Video Generation for Interactive Storytelling

Paper • 2603.25746 • Published 16 days ago • 154

upvoted a paper 30 days ago

EVATok: Adaptive Length Video Tokenization for Efficient Visual Autoregressive Generation

Paper • 2603.12267 • Published about 1 month ago • 13

liked a model about 1 month ago

Qwen/Qwen3.5-0.8B

Image-Text-to-Text • 0.9B • Updated Mar 2 • 2.36M • 487

upvoted a paper 2 months ago

ERNIE 5.0 Technical Report

Paper • 2602.04705 • Published Feb 4 • 267

upvoted an article 3 months ago

Article

Scaling Real-Time Voice Agents with Cache-Aware Streaming ASR

Jan 5

•

upvoted 2 papers 3 months ago

Token-Level LLM Collaboration via FusionRoute

Paper • 2601.05106 • Published Jan 8 • 40

DreamOmni3: Scribble-based Editing and Generation

Paper • 2512.22525 • Published Dec 27, 2025 • 15

upvoted 2 papers 4 months ago

StereoWorld: Geometry-Aware Monocular-to-Stereo Video Generation

Paper • 2512.09363 • Published Dec 10, 2025 • 74

Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length

Paper • 2512.04677 • Published Dec 4, 2025 • 177

liked a dataset 5 months ago

yenopoya/thousand-voices-trauma

Updated Oct 24, 2025 • 35 • 4

upvoted a paper 6 months ago

FARMER: Flow AutoRegressive Transformer over Pixels

Paper • 2510.23588 • Published Oct 27, 2025 • 59

upvoted a paper 8 months ago

Discrete Diffusion VLA: Bringing Discrete Diffusion to Action Decoding in Vision-Language-Action Policies

Paper • 2508.20072 • Published Aug 27, 2025 • 32

liked a model 8 months ago

LiquidAI/LFM2-350M

Text Generation • 0.4B • Updated 12 days ago • 34.5k • 248

upvoted 2 papers 8 months ago

GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models

Paper • 2508.06471 • Published Aug 8, 2025 • 211

HunyuanWorld 1.0: Generating Immersive, Explorable, and Interactive 3D Worlds from Words or Pixels

Paper • 2507.21809 • Published Jul 29, 2025 • 142

liked a Space 9 months ago

Open ASR Leaderboard

🏆

1.31k

Explore speech recognition model benchmarks and request new ones

upvoted a paper 9 months ago

CSD-VAR: Content-Style Decomposition in Visual Autoregressive Models

Paper • 2507.13984 • Published Jul 18, 2025 • 26