13 15 18

Garreth Lee

garrethlee

AI & ML interests

None yet

Recent Activity

liked a dataset 4 days ago

HuggingFaceM4/FineVision

liked a model 4 days ago

google/embeddinggemma-300m

liked a dataset 24 days ago

nvidia/Granary

View all activity

Organizations

liked a dataset 4 days ago

HuggingFaceM4/FineVision

Viewer • Updated 4 days ago • 24.2M • 49.6k • 225

liked a model 4 days ago

google/embeddinggemma-300m

liked a dataset 24 days ago

nvidia/Granary

Viewer • Updated 25 days ago • 116M • 24.8k • 132

upvoted a paper 2 months ago

FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language

Paper • 2506.20920 • Published Jun 26 • 70

upvoted a paper 3 months ago

OWSM v4: Improving Open Whisper-Style Speech Models via Data Scaling and Cleaning

Paper • 2506.00338 • Published May 31 • 10

upvoted a changelog 3 months ago

Changelog

Xet is now the default storage option for new users and organizations

May 23

• 73

liked a Space 5 months ago

1.66k

Dia 1.6B

👯

Generate realistic dialogue from a script, using Dia!

upvoted a collection 5 months ago

Llama 4

Collection

Llama 4 release • 13 items • Updated Apr 29 • 617

upvoted 2 articles 6 months ago

Article

Speeding Up LLM Decoding with Advanced Universal Assisted Generation Techniques

and 8 others •

Mar 24

• 20

Article

FastRTC: The Real-Time Communication Library for Python

and 1 other •

Feb 25

• 172

liked a Space 7 months ago

3.16k

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLM on large GPU Clusters

upvoted 3 articles 7 months ago

Article

1 Billion Classifications

•

Feb 13

• 45

Article

KV Caching Explained: Optimizing Transformer Inference Efficiency

•

Jan 30

• 128

Article

How biased is Whisper ? Evaluating Whisper Models for Robustness to Diverse English Accents

•

Jan 29

• 17

liked a model 8 months ago

deepseek-ai/DeepSeek-R1

Text Generation • 685B • Updated Mar 27 • 368k • • 12.7k

upvoted a paper 9 months ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 374

updated a Space 9 months ago

101

Number Tokenization Blog

📈

Explore how tokenization affects arithmetic in LLMs

liked a dataset 9 months ago

HuggingFaceFW/fineweb-2

Viewer • Updated Jun 27 • 5.02B • 41.3k • 628

liked a Space 9 months ago

101

Number Tokenization Blog

📈

Explore how tokenization affects arithmetic in LLMs

updated a Space 9 months ago

README

🐠

Garreth Lee

AI & ML interests

Recent Activity

Organizations

garrethlee's activity

Xet is now the default storage option for new users and organizations

Dia 1.6B

Speeding Up LLM Decoding with Advanced Universal Assisted Generation Techniques

FastRTC: The Real-Time Communication Library for Python

The Ultra-Scale Playbook

1 Billion Classifications

KV Caching Explained: Optimizing Transformer Inference Efficiency

How biased is Whisper ? Evaluating Whisper Models for Robustness to Diverse English Accents

Number Tokenization Blog

Number Tokenization Blog

README