Seunghyuk Oh
JakeOh
AI & ML interests
None yet
Organizations
models
30

JakeOh/sedd-small-uniform
Updated
•
9

JakeOh/llama-3.2-1b-gsm8k-step-2-dpo
Text Generation
•
1B
•
Updated
•
12

JakeOh/llama-3.2-1b-gsm8k-step-1-dpo
Text Generation
•
1B
•
Updated
•
15

JakeOh/llama-3.2-1b-gsm8k-step-0-sft
Text Generation
•
1B
•
Updated
•
18

JakeOh/llama-3.2-1b-sft-gsm8k
Text Generation
•
1B
•
Updated
•
14

JakeOh/rft-llama-3.2-1b-instruct-gsm240k-k1
1B
•
Updated
•
8

JakeOh/rft-finetune-llama-3.1-8b-math
8B
•
Updated
•
6

JakeOh/rft-finetune-llama-3.2-1b-math
1B
•
Updated
•
7

JakeOh/finetune-llama-3.1-8b-math50k
8B
•
Updated
•
8

JakeOh/rft-finetune-llama-3.2-1b-gsm8k
1B
•
Updated
•
6
datasets
29
JakeOh/gsm8k
Viewer
•
Updated
•
127k
•
173
JakeOh/iself-mbpp
Viewer
•
Updated
•
3.06k
•
35
JakeOh/rft-llama-3.2-1b-instruct-gsm240k-k1
Viewer
•
Updated
•
667k
•
15
JakeOh/rft-finetune-llama-3.1-8b-math
Viewer
•
Updated
•
182k
•
14
JakeOh/rft-finetune-llama-3.2-1b-math
Viewer
•
Updated
•
172k
•
13
JakeOh/rft-finetune-llama-3.2-1b-math-k10
Viewer
•
Updated
•
351k
•
9
JakeOh/rft-finetune-llama-3.2-1b-gsm8k
Viewer
•
Updated
•
31.8k
•
10
JakeOh/rft-llama-3.2-1b-instruct-gsm8k
Viewer
•
Updated
•
48.7k
•
4
JakeOh/star_plus-llama-3.1-8b-math50k-step-3
Updated
•
4
JakeOh/star_plus-llama-3.1-8b-math50k-step-2
Updated
•
3