Latest work on SWE-Bench 🐛
Two new papers from SJTU & Huawei: powered by DeepSeek-V3, we've achieved a new SOTA on SWE-Bench!
We introduce two innovative approaches:
⚔️ SWE-Debate: AI agents compete and "debate" to generate the best code fix.
🧠 SWE-Exp: An AI agent learns from past repair "experience" to solve new issues more efficiently.
👇 Explore the future of software development:
SWE-Debate
📄 Paper: https://arxiv.org/abs/2507.23348
💻 Code: https://github.com/YerbaPage/SWE-Debate
SWE-Exp
📄 Paper: https://arxiv.org/abs/2507.23361
💻 Code: https://github.com/YerbaPage/SWE-Exp