wei

fengwei

AI & ML interests

None yet

Recent Activity

liked a model about 22 hours ago

swiss-ai/Apertus-70B-Instruct-2509

liked a model 4 days ago

google/embeddinggemma-300m

upvoted a paper 4 days ago

Baichuan-M2: Scaling Medical Capability with Large Verifier System

Organizations

None yet

upvoted a paper 4 days ago

Baichuan-M2: Scaling Medical Capability with Large Verifier System

Paper • 2509.02208 • Published 7 days ago • 35

upvoted a paper 20 days ago

Seed Diffusion: A Large-Scale Diffusion Language Model with High-Speed Inference

Paper • 2508.02193 • Published Aug 4 • 129

upvoted 2 papers 21 days ago

MemOS: A Memory OS for AI System

Paper • 2507.03724 • Published Jul 4 • 153

SSRL: Self-Search Reinforcement Learning

Paper • 2508.10874 • Published 26 days ago • 91

upvoted 8 papers 28 days ago

Distilling LLM Agent into Small Models with Retrieval and Code Tools

Paper • 2505.17612 • Published May 23 • 81

Mind2Web 2: Evaluating Agentic Search with Agent-as-a-Judge

Paper • 2506.21506 • Published Jun 26 • 51

MMSearch-R1: Incentivizing LMMs to Search

Paper • 2506.20670 • Published Jun 25 • 63

Are We on the Right Way for Assessing Document Retrieval-Augmented Generation?

Paper • 2508.03644 • Published Aug 5 • 25

SitEmb-v1.5: Improved Context-Aware Dense Retrieval for Semantic Association and Long Story Comprehension

Paper • 2508.01959 • Published Aug 3 • 57

Search-o1: Agentic Search-Enhanced Large Reasoning Models

Paper • 2501.05366 • Published Jan 9 • 102

ReasonRank: Empowering Passage Ranking with Strong Reasoning Ability

Paper • 2508.07050 • Published about 1 month ago • 114

WideSearch: Benchmarking Agentic Broad Info-Seeking

Paper • 2508.07999 • Published 29 days ago • 106

upvoted a collection about 2 months ago

SmolLM3 pretraining datasets

datasets used in SmolLM3 pretraining • 15 items • Updated 28 days ago • 29

upvoted 2 papers 2 months ago

Should We Still Pretrain Encoders with Masked Language Modeling?

Paper • 2507.00994 • Published Jul 1 • 78

Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

Paper • 2506.01939 • Published Jun 2 • 180

upvoted 5 papers 3 months ago

Reflect, Retry, Reward: Self-Improving LLMs via Reinforcement Learning

Paper • 2505.24726 • Published May 30 • 271

MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention

Paper • 2506.13585 • Published Jun 16 • 263

Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9 • 260

The Diffusion Duality

Paper • 2506.10892 • Published Jun 12 • 39

Qwen3 Embedding: Advancing Text Embedding and Reranking Through Foundation Models

Paper • 2506.05176 • Published Jun 5 • 71