
SWE-bench Report

This folder (eval/2025-09-07-19:02:17) contains the SWE-bench evaluation results, produced with the official Docker-based evaluation setup.

Summary

  • total instances: 500
  • submitted instances: 500
  • completed instances: 483
  • empty patch instances: 11
  • resolved instances: 181
  • unresolved instances: 302
  • error instances: 6
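
The counts above can be cross-checked and turned into a resolve rate with a few lines of Python. This is a minimal sketch, not part of the evaluation tooling: the dictionary keys are illustrative names, and the two relationships it checks (resolved + unresolved = completed; completed + empty patch + error = submitted) are assumptions that happen to hold for these numbers rather than something the report states.

```python
# Counts copied from the Summary list; the key names are illustrative.
summary = {
    "total": 500,
    "submitted": 500,
    "completed": 483,
    "empty_patch": 11,
    "resolved": 181,
    "unresolved": 302,
    "error": 6,
}

# Assumed relationships between the counts (not stated in the report).
completed_ok = summary["resolved"] + summary["unresolved"] == summary["completed"]
submitted_ok = (
    summary["completed"] + summary["empty_patch"] + summary["error"]
    == summary["submitted"]
)

# Instances that were never submitted, assuming "incomplete" means total - submitted.
incomplete = summary["total"] - summary["submitted"]

# Resolve rate over all instances and over completed instances only.
resolve_rate_total = summary["resolved"] / summary["total"]
resolve_rate_completed = summary["resolved"] / summary["completed"]

print(f"counts consistent: {completed_ok and submitted_ok}")
print(f"incomplete instances: {incomplete}")
print(f"resolved / total: {resolve_rate_total:.1%}")          # 36.2%
print(f"resolved / completed: {resolve_rate_completed:.1%}")  # 37.5%
```

Dividing by the total (500) rather than by the completed count (483) gives the more conservative figure, since empty patches and errors then count as failures.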

Resolved Instances

Unresolved Instances

Error Instances

Empty Patch Instances

Incomplete Instances