OpenAI Abandons SWE-bench Verified Over Contamination
OpenAI stops using SWE-bench Verified for AI coding tests, citing flawed benchmarks and training leakage. The company now recommends SWE-bench Pro instead.
OpenAI stops using SWE-bench Verified for AI coding tests, citing flawed benchmarks and training leakage. The company now recommends SWE-bench Pro instead.
OpenAI appoints Arvind KC as CPO to scale operations and redefine workplace culture as AI reshapes how companies organize talent and work.
OpenAI’s Frontier Alliance Partners targets the gap between AI experimentation and production. Can consulting partners finally solve enterprise deployment?
Anthropic just made its biggest enterprise play yet. On February 24, 2026, the company announced a sweeping update to Claude Cowork that brings customizable plugins, 13 new connectors, department-specific automation, and cross-application orchestration between Excel and PowerPoint — all aimed at making Claude the default AI workspace for enterprise teams. The update turns Cowork from…
OpenAI shares its AI model’s attempts at the First Proof math challenge, testing research-grade reasoning on expert-level problems that push current limits.
Google’s Gemini 3.1 Pro is built for tasks where simple answers fall short. Here’s what makes it different from standard AI models.
Lyria 3 launches in Gemini app, letting users generate custom 30-second tracks from text prompts and images. Google’s latest push into AI music.
Shortcut and Hex reveal how they’re building real products on Claude Opus 4.6. Here’s what actually works and what doesn’t in production.
Anthropic establishes its first India office in Bengaluru and announces strategic partnerships to bring Claude AI to Indian businesses and developers.
Former Microsoft and GM CFO Chris Liddell appointed to Anthropic’s board. What this signals about the AI company’s growth plans and corporate strategy.
Anthropic is bringing Claude to CodePath, the US’s largest collegiate computer science program. Over 100,000 students will get access to AI coding tools.
Google Gemini just launched five new features designed specifically for students. Here’s what’s new and how they compare to competitors.