15 8 28

Yebowen Hu

huuuyeah

AI & ML interests

None yet

Recent Activity

authored a paper 11 days ago

DecipherPref: Analyzing Influential Factors in Human Preference Judgments via GPT-4

authored a paper 11 days ago

MeetingBank: A Benchmark Dataset for Meeting Summarization

authored a paper 11 days ago

InFoBench: Evaluating Instruction Following Ability in Large Language Models

View all activity

Organizations

None yet

authored 7 papers 11 days ago

DecipherPref: Analyzing Influential Factors in Human Preference Judgments via GPT-4

Paper • 2305.14702 • Published May 24, 2023 • 1

MeetingBank: A Benchmark Dataset for Meeting Summarization

Paper • 2305.17529 • Published May 27, 2023 • 1

InFoBench: Evaluating Instruction Following Ability in Large Language Models

Paper • 2401.03601 • Published Jan 7, 2024 • 7

SportsMetrics: Blending Text and Numerical Data to Understand Information Fusion in LLMs

Paper • 2402.10979 • Published Feb 15, 2024

When Reasoning Meets Information Aggregation: A Case Study with Sports Narratives

Paper • 2406.12084 • Published Jun 17, 2024

Complex Logical Instruction Generation

Paper • 2508.09125 • Published 28 days ago • 39

TCIA: A Task-Centric Instruction Augmentation Method for Instruction Finetuning

Paper • 2508.20374 • Published 12 days ago • 21

upvoted a paper 11 days ago

TCIA: A Task-Centric Instruction Augmentation Method for Instruction Finetuning

Paper • 2508.20374 • Published 12 days ago • 21

upvoted a paper 14 days ago

MMTok: Multimodal Coverage Maximization for Efficient Inference of VLMs

Paper • 2508.18264 • Published 15 days ago • 26

upvoted a paper 18 days ago

LiveMCP-101: Stress Testing and Diagnosing MCP-enabled Agents on Challenging Queries

Paper • 2508.15760 • Published 19 days ago • 44

authored a paper 18 days ago

LiveMCP-101: Stress Testing and Diagnosing MCP-enabled Agents on Challenging Queries

Paper • 2508.15760 • Published 19 days ago • 44

New activity in huuuyeah/meetingbank 27 days ago

Update README.md

#2 opened about 1 month ago by

parvezshah

liked 2 datasets about 1 month ago

casehold/casehold

Viewer • Updated Oct 4, 2023 • 585k • 426 • 18

huuuyeah/DeFine

Viewer • Updated Jul 26 • 587 • 36 • 1

updated a dataset about 1 month ago

huuuyeah/DeFine

Viewer • Updated Jul 26 • 587 • 36 • 1

published a dataset about 1 month ago

huuuyeah/DeFine

Viewer • Updated Jul 26 • 587 • 36 • 1

upvoted a paper 7 months ago

BOLT: Bootstrap Long Chain-of-Thought in Language Models without Distillation

Paper • 2502.03860 • Published Feb 6 • 25

liked a dataset 11 months ago

huuuyeah/DecipherPref

Viewer • Updated Oct 3, 2024 • 8.31k • 15 • 2

updated a dataset 11 months ago

huuuyeah/DecipherPref

Viewer • Updated Oct 3, 2024 • 8.31k • 15 • 2

liked a dataset 11 months ago

huuuyeah/SportsGen

Viewer • Updated Oct 3, 2024 • 70k • 132 • 5

Yebowen Hu

AI & ML interests

Recent Activity

Organizations

huuuyeah's activity

Update README.md