Anthropic evaluated the model’s programming capabilities using a benchmark called SWE-bench Verified. Sonnet 4.5 set a new industry record with a 82% score. The next two highest scores were also ...
US startup Anthropic on Monday announced the launch of its new generative artificial intelligence model, Claude Sonnet 4.5, ...