Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
312.6
TFLOPS
92
66
183
Yaowei Zheng
hiyouga
Follow
zzzzzqqqqq's profile picture
Prat618's profile picture
osadai's profile picture
2721 followers
·
36 following
https://github.com/hiyouga
llamafactory_ai
hiyouga
AI & ML interests
LLM Training System
Recent Activity
liked
a model
13 days ago
microsoft/VibeVoice-1.5B
liked
a model
19 days ago
internlm/Intern-S1-mini
new
activity
21 days ago
google/gemma-3-270m-it:
ValueError During SFT Fine-tuning with Gamma3 Model
View all activity
Organizations
hiyouga
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
liked
a model
13 days ago
microsoft/VibeVoice-1.5B
Text-to-Speech
•
3B
•
Updated
8 days ago
•
237k
•
1.57k
liked
a model
19 days ago
internlm/Intern-S1-mini
Image-Text-to-Text
•
9B
•
Updated
15 days ago
•
7.56k
•
93
liked
a dataset
26 days ago
nvidia/Llama-Nemotron-VLM-Dataset-v1
Viewer
•
Updated
7 days ago
•
2.86M
•
7.41k
•
139
liked
a model
26 days ago
janhq/Jan-v1-4B
Text Generation
•
4B
•
Updated
16 days ago
•
12.7k
•
327
liked
a model
27 days ago
openbmb/MiniCPM-V-4
Image-Text-to-Text
•
4B
•
Updated
28 days ago
•
22.3k
•
459
liked
a dataset
28 days ago
allenai/WildChat-4.8M
Viewer
•
Updated
29 days ago
•
3.2M
•
7.33k
•
98
liked
a model
about 1 month ago
openai/gpt-oss-20b
Text Generation
•
22B
•
Updated
13 days ago
•
8.93M
•
•
3.45k
liked
a dataset
about 1 month ago
JT-LM/JIUTIAN-TReB
Updated
about 1 hour ago
•
397
•
2
liked
a Space
about 2 months ago
Running
16
16
Megatron Memory Estimator
👁
Estimate GPU memory usage for Megatron models
liked
a model
about 2 months ago
moonshotai/Kimi-K2-Instruct
Text Generation
•
Updated
4 days ago
•
403k
•
•
2.14k
liked
a dataset
about 2 months ago
data-for-agents/insta-150k-v3
Viewer
•
Updated
May 28
•
146k
•
162
•
15
liked
a model
2 months ago
zai-org/GLM-4.1V-9B-Thinking
Image-Text-to-Text
•
10B
•
Updated
12 days ago
•
267k
•
•
733
liked
a dataset
3 months ago
Saigyouji-Yuyuko1000/dapo17k
Viewer
•
Updated
Jun 23
•
17.9k
•
225
•
2
liked
2 models
3 months ago
reducto/RolmOCR
Image-to-Text
•
8B
•
Updated
Apr 2
•
115k
•
512
nanonets/Nanonets-OCR-s
Image-Text-to-Text
•
4B
•
Updated
Jun 20
•
274k
•
1.5k
liked
a dataset
3 months ago
open-thoughts/OpenThoughts3-1.2M
Viewer
•
Updated
Jun 9
•
1.2M
•
7.18k
•
157
liked
2 models
3 months ago
open-thoughts/OpenThinker3-7B
Text Generation
•
8B
•
Updated
Jun 9
•
4.58k
•
•
125
ByteDance-Seed/BAGEL-7B-MoT
Any-to-Any
•
15B
•
Updated
Jun 23
•
822
•
1.12k
liked
a Space
3 months ago
Running
485
485
AI Deadlines
⚡
Manage project deadlines with AI assistance
liked
a dataset
4 months ago
ByteDance-Seed/mga-fineweb-edu
Viewer
•
Updated
May 19
•
846M
•
2.01k
•
34
Load more