Mistral AI, a French startup, released Devstral 2, a 123 billion parameter open-weights coding model, on Tuesday. The model, designed to work as part of an autonomous software engineering agent, achieved a 72.2 percent score on SWE-bench Verified, a benchmark that tests AI systems' ability to solve real GitHub issues. This score places Devstral 2 among the top-performing open-weights models.
Mistral also released a new development app called Mistral Vibe, a command line interface (CLI) that allows developers to interact with the Devstral models directly in their terminal. The tool, released under the Apache 2.0 license, can scan file structures and Git status to maintain context across an entire project, make changes across multiple files, and execute shell commands autonomously. This feature enables developers to work more efficiently and effectively with the Devstral models.
According to a statement from Mistral AI, the release of Devstral 2 and Mistral Vibe marks a significant milestone in the development of open-weights AI coding models. "We're excited to bring this level of performance and functionality to developers," said a spokesperson for Mistral AI. "Our goal is to make AI-powered software engineering more accessible and efficient for everyone."
The release of Devstral 2 and Mistral Vibe has significant implications for the AI industry. Open-weights models, which are not proprietary like some of their competitors, offer developers more flexibility and control over their AI-powered software engineering tools. This could lead to increased adoption and innovation in the field.
In related news, employees of major AI companies have reportedly been paying close attention to the performance of open-weights models like Devstral 2. While the accuracy of AI benchmarks can be debated, the interest from industry insiders suggests that open-weights models are gaining traction.
Mistral AI's release of Devstral 2 and Mistral Vibe marks a significant step forward in the development of open-weights AI coding models. As the industry continues to evolve, it will be interesting to see how these models are adopted and integrated into software engineering workflows.
Share & Engage Share
Share this article