In a key benchmark, Claude Sonnet 4.5, a next-generation language model, runs autonomously for 30 hours on a single task.