garrethlee (Garreth Lee)

upvoted 2 papers 8 months ago

FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language

Paper • 2506.20920 • Published Jun 26, 2025 • 77

OWSM v4: Improving Open Whisper-Style Speech Models via Data Scaling and Cleaning

Paper • 2506.00338 • Published May 31, 2025 • 10

upvoted a changelog 9 months ago

Changelog

Xet is now the default storage option for new users and organizations

May 23, 2025

• 76

upvoted a collection 11 months ago

Llama 4

Collection

Llama 4 release • 13 items • Updated Apr 29, 2025 • 695

upvoted an article 11 months ago

Article

Speeding Up LLM Decoding with Advanced Universal Assisted Generation Techniques

Mar 24, 2025

•

20

upvoted an article 12 months ago

Article

FastRTC: The Real-Time Communication Library for Python

Feb 25, 2025

•

172

upvoted 3 articles about 1 year ago

Article

1 Billion Classifications

Feb 13, 2025

•

45

Article

KV Caching Explained: Optimizing Transformer Inference Efficiency

Jan 30, 2025

•

233

Article

How biased is Whisper ? Evaluating Whisper Models for Robustness to Diverse English Accents

Jan 29, 2025

•

17

upvoted a paper about 1 year ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 377

upvoted 2 articles over 1 year ago

Article

SmolLM - blazingly fast and remarkably powerful

+1

Jul 16, 2024

•

441

Article

🇨🇿 BenCzechMark - Can your LLM Understand Czech?

+11

Oct 1, 2024

•

23

upvoted a paper over 1 year ago

Programming Every Example: Lifting Pre-training Data Quality like Experts at Scale

Paper • 2409.17115 • Published Sep 25, 2024 • 64

Garreth Lee

AI & ML interests

Organizations

FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language

OWSM v4: Improving Open Whisper-Style Speech Models via Data Scaling and Cleaning

Xet is now the default storage option for new users and organizations

Llama 4

Speeding Up LLM Decoding with Advanced Universal Assisted Generation Techniques

FastRTC: The Real-Time Communication Library for Python

1 Billion Classifications

KV Caching Explained: Optimizing Transformer Inference Efficiency

How biased is Whisper ? Evaluating Whisper Models for Robustness to Diverse English Accents

Qwen2.5 Technical Report

SmolLM - blazingly fast and remarkably powerful

🇨🇿 BenCzechMark - Can your LLM Understand Czech?

Programming Every Example: Lifting Pre-training Data Quality like Experts at Scale

Garreth Lee

AI & ML interests

Organizations

garrethlee's activity

Xet is now the default storage option for new users and organizations

Speeding Up LLM Decoding with Advanced Universal Assisted Generation Techniques

FastRTC: The Real-Time Communication Library for Python

1 Billion Classifications

KV Caching Explained: Optimizing Transformer Inference Efficiency

How biased is Whisper ? Evaluating Whisper Models for Robustness to Diverse English Accents

SmolLM - blazingly fast and remarkably powerful

🇨🇿 BenCzechMark - Can your LLM Understand Czech?

🎉 Free Image Generator Now Available!