OpenAI unveiled GPT-4o mini, its most cost-efficient small AI model to date, on Thursday. The company is positioning the new model as a way to make advanced AI more affordable and accessible across a wider range of applications.
Available for developers and consumers starting today, and for enterprise users next week, GPT-4o mini aims to broaden the scope of AI-driven solutions.
Summary
- Launch: OpenAI introduces GPT-4o mini, enhancing accessibility and affordability of AI.
- Performance: Outperforms existing small AI models in reasoning and multimodal tasks.
- Cost Efficiency: Significantly cheaper than previous models, priced at 15 cents per million input tokens and 60 cents per million output tokens.
- Applications: Suitable for high-volume, real-time tasks in various sectors.
Performance and Affordability
GPT-4o mini replaces GPT-3.5 Turbo as OpenAI’s smallest model, boasting superior performance in reasoning tasks and multimodal applications. It scores 82% on the MMLU benchmark and 87% on the MGSM benchmark, outshining competitors like Gemini Flash and Claude Haiku.
With a context window of 128K tokens and support for up to 16K output tokens per request, GPT-4o mini is both powerful and cost-efficient.
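To make these limits concrete, here is a minimal Python sketch of a pre-flight check for a request. It rests on assumptions that are not stated in the article: that "128K" and "16K" mean 128,000 and 16,000 tokens, and that prompt and output tokens share the same context window. The function name `fits_in_budget` is hypothetical and is not part of any OpenAI library.

```python
# Limits quoted in the article; reading "K" as 1,000 is an assumption.
CONTEXT_WINDOW_TOKENS = 128_000
MAX_OUTPUT_TOKENS = 16_000


def fits_in_budget(prompt_tokens: int, requested_output_tokens: int) -> bool:
    """Return True if a request stays within both quoted limits.

    Assumes (not confirmed by the article) that prompt tokens and output
    tokens are counted against the same context window.
    """
    if prompt_tokens < 0 or requested_output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    if requested_output_tokens > MAX_OUTPUT_TOKENS:
        return False
    return prompt_tokens + requested_output_tokens <= CONTEXT_WINDOW_TOKENS


if __name__ == "__main__":
    print(fits_in_budget(100_000, 16_000))  # long prompt, full-length output
    print(fits_in_budget(120_000, 16_000))  # prompt leaves too little room
```

A check like this only approximates real behavior, since actual token counts depend on the tokenizer used by the model.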

Competitive Edge
Compared to other small AI models such as Llama 3 8B, Claude Haiku, and Gemini Flash, GPT-4o mini excels in speed, cost-efficiency, and intelligence. Early independent tests and pre-launch evaluations point to its superior performance.
George Cameron, Co-Founder of Artificial Analysis, highlighted its median output speed of 202 tokens per second, making it ideal for speed-dependent applications.
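As a rough illustration of what that figure means in practice, the Python sketch below estimates how long a response of a given length would take to generate at the quoted median speed. It is a back-of-the-envelope calculation only: it ignores network latency and time to first token, and the function name `estimate_generation_seconds` is hypothetical.

```python
# Median output speed quoted in the article (tokens per second).
MEDIAN_OUTPUT_TOKENS_PER_SECOND = 202.0


def estimate_generation_seconds(
    output_tokens: int,
    tokens_per_second: float = MEDIAN_OUTPUT_TOKENS_PER_SECOND,
) -> float:
    """Estimate generation time, ignoring latency before the first token."""
    if output_tokens < 0:
        raise ValueError("output_tokens must be non-negative")
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    return output_tokens / tokens_per_second


if __name__ == "__main__":
    # A 500-token reply at the median speed takes roughly two and a half seconds.
    print(f"{estimate_generation_seconds(500):.2f} s")
```

Because the quoted number is a median, individual requests may run noticeably faster or slower than this estimate.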
Enterprise Tools
In addition to GPT-4o mini, OpenAI has introduced new tools for enterprise customers. Among them is the Enterprise Compliance API, which is designed to help businesses in regulated industries meet logging and audit requirements.
The new tools also give admins more control over workspace GPTs, supporting secure and efficient AI deployment.

Industry Insights
The launch of GPT-4o mini underscores a significant trend in the AI industry: the increasing popularity of small AI models due to their speed and cost efficiencies.
As developers leverage these models for high-volume, real-time tasks, the demand for affordable AI solutions continues to rise. OpenAI’s commitment to making AI accessible is evident in this latest release, promising to empower developers and businesses worldwide.
Safety Measures
Safety remains a priority for OpenAI. GPT-4o mini incorporates built-in safety measures, including filtering harmful content during pre-training and aligning model behavior with OpenAI's policies using reinforcement learning from human feedback (RLHF).
The model’s safety features are rigorously tested by external experts, ensuring reliable and secure AI applications.
Availability and Pricing
GPT-4o mini is now available through various OpenAI APIs, priced at 15 cents per million input tokens and 60 cents per million output tokens. Free, Plus, and Team users of ChatGPT can access GPT-4o mini starting today, with enterprise users gaining access next week.
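The per-token prices translate into a simple cost formula. The Python sketch below applies the two rates quoted above to a workload; it assumes prices are in U.S. dollars and that billing is strictly linear in token count, with no discounts or minimums. The function name `estimate_cost_usd` is hypothetical and is not part of any OpenAI API.

```python
from decimal import Decimal

# Prices quoted in the article, expressed in dollars per million tokens.
INPUT_USD_PER_MILLION = Decimal("0.15")
OUTPUT_USD_PER_MILLION = Decimal("0.60")
TOKENS_PER_MILLION = Decimal(1_000_000)


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the cost of a workload, assuming purely linear pricing."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        Decimal(input_tokens) * INPUT_USD_PER_MILLION
        + Decimal(output_tokens) * OUTPUT_USD_PER_MILLION
    )
    return total / TOKENS_PER_MILLION


if __name__ == "__main__":
    # Example workload: 2 million input tokens and 500,000 output tokens.
    print(f"${estimate_cost_usd(2_000_000, 500_000):.2f}")
```

Under these assumptions, a workload of 2 million input tokens and 500,000 output tokens costs 30 cents for input plus 30 cents for output, or 60 cents in total.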

What’s Next
OpenAI’s focus on reducing costs while enhancing model capabilities continues with GPT-4o mini. The future of AI looks promising, with models becoming seamlessly integrated into everyday applications. GPT-4o mini is set to pave the way for more efficient and affordable AI solutions, driving innovation and accessibility in the AI landscape.
Looking Ahead: Implications for Business and Policy
The introduction of GPT-4o mini highlights the importance of balancing innovation with affordability in the AI industry. For businesses, this new model offers a competitive edge in automating tasks and enhancing customer interactions.
Policymakers must consider the broader implications of widespread AI adoption, including regulatory compliance and data security. As AI technology evolves, stakeholders must navigate the challenges and opportunities to maximize the benefits of these advancements.



