Mistral bets on ‘build-your-own AI’ as it takes on OpenAI, Anthropic in the enterprise
By Jakub Antkiewicz
•2026-03-18T08:50:12Z
French AI startup Mistral has announced Mistral Forge, a new platform designed to let enterprises build custom AI models using their own proprietary data. The platform was unveiled at Nvidia's GTC conference, an event heavily focused on enterprise AI applications this year. The launch sharpens Mistral’s focus on corporate clients, a deliberate strategy to compete with rivals like OpenAI and Anthropic that have gained significant traction in the consumer market. CEO Arthur Mensch stated the enterprise-first approach is working, with the company on track to surpass $1 billion in annual recurring revenue this year.
Unlike common approaches such as fine-tuning or retrieval-augmented generation (RAG) which adapt existing models, Mistral says Forge enables companies to train models from the ground up. This method offers greater control over model behavior and can better handle highly specialized or non-English data. Customers can build upon Mistral’s library of open-weight models, and for complex implementations, the platform includes hands-on support from forward-deployed engineers who embed with client teams, a service model used by companies like IBM and Palantir.
The move positions Mistral as a provider for organizations that cannot rely on general-purpose models trained on public internet data. Early partners include Ericsson, the European Space Agency, and chipmaker ASML, highlighting Forge's appeal to sectors with specific compliance, security, and technical needs. The company is targeting governments needing to tailor models to local languages, financial firms with strict compliance requirements, and technology companies that want to tune models to their own codebase, thereby reducing dependence on third-party model providers.
Mistral is betting that the next phase of enterprise AI adoption will be won not by the largest general-purpose model, but by providing the tools for companies to build smaller, highly specialized models that deeply understand their internal data and workflows.