1 51 2

YanxingLiu

lyx98

YanxingLiu

AI & ML interests

Computer Vision

Recent Activity

upvoted a paper 4 days ago

CUDA Agent: Large-Scale Agentic RL for High-Performance CUDA Kernel Generation

upvoted a paper 8 days ago

DualPath: Breaking the Storage Bandwidth Bottleneck in Agentic LLM Inference

upvoted a paper 18 days ago

Zooming without Zooming: Region-to-Image Distillation for Fine-Grained Multimodal Perception

View all activity

Organizations

None yet

upvoted a paper 4 days ago

CUDA Agent: Large-Scale Agentic RL for High-Performance CUDA Kernel Generation

Paper • 2602.24286 • Published 7 days ago • 75

upvoted a paper 8 days ago

DualPath: Breaking the Storage Bandwidth Bottleneck in Agentic LLM Inference

Paper • 2602.21548 • Published 10 days ago • 37

upvoted a paper 18 days ago

Zooming without Zooming: Region-to-Image Distillation for Fine-Grained Multimodal Perception

Paper • 2602.11858 • Published 22 days ago • 59

upvoted a paper 29 days ago

ERNIE 5.0 Technical Report

Paper • 2602.04705 • Published about 1 month ago • 262

upvoted a paper about 1 month ago

CodeOCR: On the Effectiveness of Vision Language Models in Code Understanding

Paper • 2602.01785 • Published Feb 2 • 95

upvoted a paper 3 months ago

Z-Image: An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer

Paper • 2511.22699 • Published Nov 27, 2025 • 240

upvoted 2 papers 4 months ago

Lumine: An Open Recipe for Building Generalist Agents in 3D Open Worlds

Paper • 2511.08892 • Published Nov 12, 2025 • 212

DeepEyesV2: Toward Agentic Multimodal Model

Paper • 2511.05271 • Published Nov 7, 2025 • 45

upvoted 5 papers 6 months ago

Visual Programmability: A Guide for Code-as-Thought in Chart Understanding

Paper • 2509.09286 • Published Sep 11, 2025 • 11

Visual Representation Alignment for Multimodal Large Language Models

Paper • 2509.07979 • Published Sep 9, 2025 • 84

Reasoning Vectors: Transferring Chain-of-Thought Capabilities via Task Arithmetic

Paper • 2509.01363 • Published Sep 1, 2025 • 59

InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency

Paper • 2508.18265 • Published Aug 25, 2025 • 214

A Survey of Scientific Large Language Models: From Data Foundations to Agent Frontiers

Paper • 2508.21148 • Published Aug 28, 2025 • 140

upvoted a paper 7 months ago

Intern-S1: A Scientific Multimodal Foundation Model

Paper • 2508.15763 • Published Aug 21, 2025 • 268

upvoted a collection 7 months ago

👁️ LFM2-VL

Collection

LFM2-VL is our first series of vision-language models, designed for on-device deployment. • 10 items • Updated 10 days ago • 63

upvoted 5 papers 7 months ago

YanxingLiu

AI & ML interests

Recent Activity

Organizations

lyx98's activity

🎉 Free Image Generator Now Available!