Today we're releasing Claude Sonnet 4.5 in Devin, alongside our new Devin Agent Preview that's over 2x faster and 12% better on our Jr. Developer Evals.
Claude Sonnet 4.5 is Anthropic's newest and most powerful model, and we believe it represents a new generation of coding models.
Over the past few days, we've been testing this model internally, and what we've found goes beyond benchmark improvements. The model behaves in fundamentally different ways: how it approaches problems, manages its own work, and structures its development process all changed enough that we had to rethink how we architect Devin.
Because Devin is an agent that plans, executes, and iterates rather than just autocompleting code, we get an unusual window into what's genuinely changed. And with Sonnet 4.5, the improvements compound across our feedback loops in exciting new ways.
Claude Sonnet 4.5 is available in Devin starting today. We're excited to see what you build with it.
Want to learn more about what makes this model different? Our testing surfaced some fascinating behaviors, from how the model manages its own context window to how it creates feedback loops to verify its work.