OffSeeker OffSeeker/OffSeeker-8B-SFT 8B • Updated Jan 17 • 2 OffSeeker/OffSeeker-8B-DPO 8B • Updated Mar 3 • 5 OffSeeker/DeepForge Preview • Updated Mar 3 • 73 OffSeeker: Online Reinforcement Learning Is Not All You Need for Deep Research Agents Paper • 2601.18467 • Published Jan 26 • 1
OffSeeker: Online Reinforcement Learning Is Not All You Need for Deep Research Agents Paper • 2601.18467 • Published Jan 26 • 1
OffSeeker OffSeeker/OffSeeker-8B-SFT 8B • Updated Jan 17 • 2 OffSeeker/OffSeeker-8B-DPO 8B • Updated Mar 3 • 5 OffSeeker/DeepForge Preview • Updated Mar 3 • 73 OffSeeker: Online Reinforcement Learning Is Not All You Need for Deep Research Agents Paper • 2601.18467 • Published Jan 26 • 1
OffSeeker: Online Reinforcement Learning Is Not All You Need for Deep Research Agents Paper • 2601.18467 • Published Jan 26 • 1