Garreth Lee

garrethlee

AI & ML interests

None yet

Recent Activity

liked a dataset 4 days ago

HuggingFaceM4/FineVision

Viewer • Updated 4 days ago • 24.2M • 49.6k • 225

liked a model 4 days ago

google/embeddinggemma-300m

liked a dataset 24 days ago

nvidia/Granary

Viewer • Updated 25 days ago • 116M • 24.8k • 132

liked a Space 5 months ago

1.66k

Dia 1.6B

👯

Generate realistic dialogue from a script, using Dia!

liked a Space 7 months ago

3.16k

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLM on large GPU Clusters

liked a model 8 months ago

deepseek-ai/DeepSeek-R1

Text Generation • 685B • Updated Mar 27 • 368k • 12.7k

liked a dataset 9 months ago

HuggingFaceFW/fineweb-2

Viewer • Updated Jun 27 • 5.02B • 41.3k • 628

liked a Space 9 months ago

101

Number Tokenization Blog

📈

Explore how tokenization affects arithmetic in LLMs

liked a Space 10 months ago

Hub LFS Analysis

📈

An analysis of LFS files on the Hub.

liked a model 10 months ago

GoToCompany/gemma2-9b-cpt-sahabatai-v1-instruct

9B • Updated Nov 6, 2024 • 2.62k • 45

liked a Space 10 months ago

Sahabat-AI Chatbot (Gemma2 9b)

😻

Chatbot

liked 2 datasets 10 months ago

indolem/IndoMMLU

Updated Oct 11, 2023 • 509 • 18

PleIAs/common_corpus

Viewer • Updated Jun 10 • 470M • 15.1k • 306

liked 3 Spaces 11 months ago

Scaling FineWeb to 1000+ languages: Step 1: finding signal in 100s of evaluation tasks

📝

Evaluate multilingual models using FineTasks

122

TxT360: Trillion Extracted Text

📖

Explore TxT360: A Large-Scale Deduplicated Dataset for LLM Pretraining

971

Model Memory Utility

🚀

Calculate vRAM needed for model training and inference

liked a Space about 1 year ago

1.06k

FineWeb: decanting the web for the finest text data at scale

🍷

Generate high-quality web text data for LLM training

liked a model over 1 year ago

mistralai/Mistral-7B-Instruct-v0.2

Text Generation • 7B • Updated Jul 24 • 462k • 2.95k