SWE-bench Report

This folder contains SWE-bench evaluation results produced with the official Docker-based evaluation harness.

Summary

  • total instances: 500
  • submitted instances: 13
  • completed instances: 12
  • empty patch instances: 1
  • resolved instances: 5
  • unresolved instances: 7
  • error instances: 0
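
The counts above can be cross-checked and turned into resolve rates with a short script. This is a minimal sketch, not part of the evaluation harness: the dictionary keys, the helper names `check_consistency` and `rate`, and the assumed relationships between the counts (completed = resolved + unresolved; submitted = completed + empty patch + error) are illustrative assumptions that happen to hold for the numbers reported here.

```python
# Counts copied from the Summary list above.
# The key names are hypothetical and chosen for this sketch only.
summary = {
    "total": 500,
    "submitted": 13,
    "completed": 12,
    "empty_patch": 1,
    "resolved": 5,
    "unresolved": 7,
    "error": 0,
}


def check_consistency(s):
    """Return True if the counts satisfy the assumed relationships.

    Assumption: every completed instance is either resolved or unresolved,
    and every submitted instance is completed, an empty patch, or an error.
    """
    completed_ok = s["completed"] == s["resolved"] + s["unresolved"]
    submitted_ok = s["submitted"] == s["completed"] + s["empty_patch"] + s["error"]
    return completed_ok and submitted_ok


def rate(numerator, denominator):
    """Return numerator / denominator, or 0.0 when the denominator is zero."""
    return numerator / denominator if denominator else 0.0


resolved_of_submitted = rate(summary["resolved"], summary["submitted"])
resolved_of_total = rate(summary["resolved"], summary["total"])

print(f"consistent: {check_consistency(summary)}")
print(f"resolved / submitted: {resolved_of_submitted:.1%}")
print(f"resolved / total:     {resolved_of_total:.1%}")
```

With these counts, 5 of 13 submitted instances are resolved (about 38.5%), which is 1.0% of the 500 total instances. Which denominator is appropriate depends on whether the run is meant as a partial sample or a full-benchmark score.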

Resolved Instances

Unresolved Instances

Error Instances

Empty Patch Instances

Incomplete Instances