6 9 15

Qiying Yu

qiying

AI & ML interests

None yet

Recent Activity

upvoted a paper about 1 month ago

Seed Diffusion: A Large-Scale Diffusion Language Model with High-Speed Inference

upvoted a paper 3 months ago

Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

updated a dataset 5 months ago

BytedTsinghua-SIA/DAPO-Math-17k

View all activity

Organizations

upvoted a paper about 1 month ago

Seed Diffusion: A Large-Scale Diffusion Language Model with High-Speed Inference

Paper • 2508.02193 • Published Aug 4 • 129

upvoted a paper 3 months ago

Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

Paper • 2506.01939 • Published Jun 2 • 180

upvoted a paper 6 months ago

DAPO: An Open-Source LLM Reinforcement Learning System at Scale

Paper • 2503.14476 • Published Mar 18 • 139

upvoted a collection about 1 year ago

Phi-3

Collection

Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. • 26 items • Updated May 1 • 574

upvoted an article about 1 year ago

Article

Cosmopedia: how to create large-scale synthetic data for pre-training Large Language Models

and 2 others •

Mar 20, 2024

• 104

upvoted 2 papers over 1 year ago

EVA-CLIP-18B: Scaling CLIP to 18 Billion Parameters

Paper • 2402.04252 • Published Feb 6, 2024 • 29

Generative Multimodal Models are In-Context Learners

Paper • 2312.13286 • Published Dec 20, 2023 • 37

upvoted a paper almost 2 years ago

CapsFusion: Rethinking Image-Text Data at Scale

Paper • 2310.20550 • Published Oct 31, 2023 • 27

upvoted a paper about 2 years ago

Generative Pretraining in Multimodality

Paper • 2307.05222 • Published Jul 11, 2023 • 22

Qiying Yu

AI & ML interests

Recent Activity

Organizations

qiying's activity

Cosmopedia: how to create large-scale synthetic data for pre-training Large Language Models