christian-muertz's picture
Upload eval/2025-09-05-13:46:01/README.md with huggingface_hub
58a3f72 verified

SWE-bench Report

This folder contains the evaluation results of the SWE-bench using the official evaluation docker containerization.

Summary

  • total instances: 500
  • submitted instances: 100
  • completed instances: 99
  • empty patch instances: 0
  • resolved instances: 37
  • unresolved instances: 62
  • error instances: 1

Resolved Instances

Unresolved Instances

Error Instances

Empty Patch Instances

Incomplete Instances