Adam Yanxiao Zhao

sdpkjc

https://sdpkjc.me

AI & ML interests

Reinforcement Learning

Recent Activity

new activity 3 days ago

sdpkjc/SATQuest: Update dataset card: Add paper link, task categories, and tags

authored a paper 5 days ago

ComputerRL: Scaling End-to-End Online Reinforcement Learning for Computer Use Agents

authored a paper 5 days ago

SATQuest: A Verifier for Logical Reasoning Evaluation and Reinforcement Fine-Tuning of LLMs

New activity in sdpkjc/SATQuest 3 days ago

Update dataset card: Add paper link, task categories, and tags

#2 opened 5 days ago by nielsr

authored 3 papers 5 days ago

ComputerRL: Scaling End-to-End Online Reinforcement Learning for Computer Use Agents

Paper • 2508.14040 • Published 21 days ago • 2

SATQuest: A Verifier for Logical Reasoning Evaluation and Reinforcement Fine-Tuning of LLMs

Paper • 2509.00930 • Published 9 days ago • 3

CAMEL: Continuous Action Masking Enabled by Large Language Models for Reinforcement Learning

Paper • 2502.11896 • Published Feb 17

updated a collection 5 days ago

SATQuest

Collection

SATQuest Dataset Collections • 3 items • Updated 5 days ago • 1

upvoted a paper 5 days ago

SATQuest: A Verifier for Logical Reasoning Evaluation and Reinforcement Fine-Tuning of LLMs

Paper • 2509.00930 • Published 9 days ago • 3

commented on a paper 5 days ago

SATQuest: A Verifier for Logical Reasoning Evaluation and Reinforcement Fine-Tuning of LLMs

Paper • 2509.00930 • Published 9 days ago • 3

upvoted a paper 5 days ago

ComputerRL: Scaling End-to-End Online Reinforcement Learning for Computer Use Agents

Paper • 2508.14040 • Published 21 days ago • 2

updated 2 datasets about 1 month ago

sdpkjc/SATQuest-RFT-3k

Viewer • Updated Jul 30 • 3k • 27

sdpkjc/SATQuest

Viewer • Updated 3 days ago • 140 • 29

New activity in Qwen/Qwen3-1.7B 3 months ago

Fix chat template in case of multiple assistant messages and no thinking

#9 opened 3 months ago by VityaVitalich

updated a dataset 4 months ago

sdpkjc/24problems_quiz-eval-n4-1-10-24

Viewer • Updated May 22 • 55.5k • 2

published a dataset 4 months ago

sdpkjc/24problems_quiz-eval-n4-1-10-24

Viewer • Updated May 22 • 55.5k • 2

updated 2 datasets 4 months ago

sdpkjc/24problems_quiz-eval-5

Viewer • Updated May 22 • 100k • 5

sdpkjc/24problems_quiz

Viewer • Updated May 21 • 85.6k • 13

published 2 datasets 4 months ago

sdpkjc/24problems_quiz-eval-5

Viewer • Updated May 22 • 100k • 5

sdpkjc/24problems_quiz

Viewer • Updated May 21 • 85.6k • 13

upvoted a collection 4 months ago

SATQuest

Collection

SATQuest Dataset Collections • 3 items • Updated 5 days ago • 1

updated a collection 4 months ago

SATQuest

Collection

SATQuest Dataset Collections • 3 items • Updated 5 days ago • 1
