OpenAI Abandons SWE-bench Verified Over Contamination
OpenAI stops using SWE-bench Verified for AI coding tests, citing flawed benchmarks and training leakage. The company now recommends SWE-bench Pro instead.
OpenAI stops using SWE-bench Verified for AI coding tests, citing flawed benchmarks and training leakage. The company now recommends SWE-bench Pro instead.