Anthropocene began Claude Sonnet 4.5 on Monday, positioning the factitious intelligence model as “the very best coding model on this planet” in a direct challenge to the recently released OpenAI GPT-5because the two AI giants fight for dominance within the lucrative enterprise software development market.
The San Francisco-based startup says its latest model achieves top performance on critical coding benchmarks with a rating of 77.2% SWE Bench verified – a rigorous software engineering assessment – ​​in comparison with the performance of GPT-5. More notably, Anthropic says Claude Sonnet 4.5 can concentrate on complex, multi-step tasks for greater than 30 hours, representing a dramatic leap within the AI's ability to handle sustained work.
“Sonnet 4.5 achieves 77.2% in SWE bench verification (82% in parallel test time calculation). It is SOTA,” an Anthropic spokesperson told VentureBeat, using the industry acronym for “cutting-edge.” The company also highlighted the model's 50 percent rating Terminal bankone other coding benchmark wherein it claims the lead.
The announcement follows increasing pressure from OpenAI's recent advances and harsh criticism from high-profile figures similar to Elon Musk, who recently posted on X.com: “Winning was never a possible consequence for Anthropic.” When asked about Musk's statement, Anthropic declined to comment.
The release comes just seven weeks later OpenAI's GPT 5 launch in AugustThis underscores the rapid pace of competition in the factitious intelligence space as firms race to win enterprise customers who’re increasingly reliant on AI for software development. The timing is especially notable as Anthropic grapples with questions on its heavy reliance on just two major customers.
Anthropic dominates the coding market despite customer concentration risks
The competition is targeted on a market that has emerged as the primary major profitable use case for AI beyond chatbots. Anthropic controls 42% of the code generation market — greater than double OpenAI’s 21 percent share — in accordance with a Menlo Ventures survey of 150 corporate technical executives. This dominance has translated into remarkable financial performance, with the corporate achieving one Return on sales of $5 billion Earlier this yr.
However, an industry evaluation shows that coding applications cursor And GitHub Copilot approx. drive Anthropic's $1.4 billion in revenueThis creates a potentially dangerous concentration of shoppers that might leave the corporate vulnerable if one in every of the relationships fails.
“Our run-rate revenue has increased significantly, even excluding these two customers,” the Anthropic spokesman said, dismissing concerns about customer concentration. The company provided supportive quotes from Cursor CEO Michael Truell and GitHub Chief Product Officer Mario Rodriguez praising the performance of Claude Sonnet 4.5.
The latest model makes significant progress in computing capabilities with a rating of 61.4% OSWorlda benchmark that tests AI models against real computing tasks. Just 4 months ago, Claude Sonnet 4 was at the highest with 42.2%, demonstrating rapid improvement in AI's ability to interact with software interfaces.
OpenAI's aggressive pricing strategy threatens Anthropic's premium positioning
Anthropic's announcement comes as the corporate grapples with competitive pressure from GPT-5's aggressive pricing strategy. Initial analyzes show Claude Opus 4 costs about seven times as much per million tokens than GPT-5 for certain tasks, putting immediate pressure on Anthropic's premium positioning.
The price differences signal a fundamental shift in competitive dynamics that might force corporate procurement teams to rethink supplier relationships that were previously based on performance, not price. Companies managing exponentially growing AI budgets now have comparable capabilities at a fraction of the associated fee.
Nevertheless, Anthropic maintains its pricing strategy with Claude Sonnet 4.5. “The cost of Sonnet 4.5 stays the identical as Sonnet 4,” the spokesperson confirmed, leaving prices at $3 per million input tokens and $15 per million output tokens.
Claude Sonnet 4.5 offers 30-hour autonomous work sessions and improved security
Beyond performance improvements, anthropic positions Claude Sonnet 4.5 as its “best-aligned boundary model so far,” showing a major reduction in worrisome behaviors similar to sycophancy, deception, and power-seeking tendencies. The company has made “significant progress in mitigating prompt injection attacks,” a critical security risk to enterprise deployments.
The model is published at Anthropics AI Safety Level 3 (ASL-3) safeguardsThis includes classifiers to detect potentially dangerous inputs and outputs related to chemical, biological, radiological and nuclear weapons. While these protections sometimes flag normal content, Anthropic says they’ve reduced the variety of false positives by an element of ten since they were first described.
Perhaps most significantly for developers, Anthropic is publishing this Claude Agent SDK – the identical infrastructure that powers the Claude Code product. “We built Claude Code since the tool we would have liked didn’t yet exist,” the corporate said in its announcement. “The Agent SDK gives you a similar foundation to construct something that’s just as powerful for whatever problem you solve.”
International expansion accelerates as $1.5 billion copyright settlement is reached
The model launch coincides with Anthropic's aggressive international expansion as the corporate looks to diversify beyond its US-focused customer base. The startup recently announced plans to achieve this Triple the international workforce And Expand your applied AI team 5x in 2025, based on data showing that almost 80% of Claude usage now comes from outside the United States.
However, the expansion is related to significant legal costs. Anthropic recently agreed to pay $1.5 billion in a copyright agreement with authors and publishers over allegations that the corporate illegally used its books to coach AI models without permission. The settlement, approved by a federal judge last week, calls for payments of $3,000 for every publication listed within the case.
Enterprise AI spending is doubling as firms prioritize performance over cost
The rapid model releases of each firms reflect the high priority in introducing AI into firms. Have model API outputs greater than doubled, to $8.4 billion According to Menlo Ventures, it will occur in only six months as firms move from experimental projects to production deployments.
Customer behavior patterns suggest that firms consistently prioritize performance over price and upgrade to the newest models inside weeks of release, no matter cost. This behavior could work in Anthropic's favor if Claude Sonnet 4.5's performance benefits prove compelling enough to beat GPT-5's price advantage.
However, the dramatic price difference introduced by GPT-5 could overcome typical switching inertia, particularly for cost-conscious firms under budget pressure. Industry observers note that the associated fee of fixing models stays relatively low 66% of firms are upgrading inside existing providers as an alternative of fixing provider.
For firms, increasing competition leads to raised performance and lower costs through continually improved capabilities. The rapid pace of model improvements – with latest versions rolling out monthly quite than yearly – provides firms with advanced AI capabilities as vendors aggressively compete for his or her business.
While the company rivalry between Anthropic and OpenAI dominates industry headlines, the actual economic impact extends far beyond the boardrooms of Silicon Valley. The development of AI systems able to 30 hours of continuous coding work represents a fundamental shift in the best way software is built, with implications that reach to all industries that depend on technology infrastructure.
These evolving capabilities signal a significant transformation of the workplace. As AI systems change into increasingly able to complex, sustained mental work, the tech industry's competition for coding supremacy heralds similar disruptions in areas that require analytical pondering, problem solving and technical expertise.

