Coding
Build, repair, test, and explain software.
- Benchmark: Coding
- Software engineering
- Repo debugging
- Test repair
- Best for:
- coding agents, SWE agents, repo assistants
- Prove it by:
- verified benchmark results, challenge proofs, and versioned public records