Gemini 3.5 Flash: more expensive, but Google plan to use it for everything
EXECUTIVE SUMMARY
Google's Gemini 3.5 Flash: A New Era in AI Pricing and Accessibility
Summary
Google has launched Gemini 3.5 Flash, skipping the preview phase and making it available for a wide range of products and developers. This release comes with a significant price increase compared to its predecessors, raising questions about the pricing strategies of major AI labs.
Key Points
- Gemini 3.5 Flash was announced at Google I/O and is now generally available.
- The model ID is gemini-3.5-flash, with a knowledge cut-off in January 2025.
- It supports 1,048,576 input tokens and 65,536 output tokens.
- Pricing for Gemini 3.5 Flash is $1.50/million input tokens and $9/million output tokens.
- This new model is 3x the price of Gemini 3 Flash Preview and 6x that of Gemini 3.1 Flash-Lite.
- Google plans to roll out Gemini 3.5 Pro next month, likely at a higher price.
- The launch reflects a trend where major AI labs are testing the price tolerance of their API customers.
- A benchmark comparison shows Gemini 3.5 Flash costs significantly more than previous models.
Analysis
The release of Gemini 3.5 Flash signifies a strategic shift for Google as it integrates advanced AI capabilities across its platforms while also testing market pricing. This could impact how enterprises budget for AI services and influence competition among AI providers.
Conclusion
IT professionals should prepare for the increased costs associated with AI services and consider the implications of pricing changes on their projects. Staying informed about model capabilities and pricing structures will be crucial for effective budgeting and resource allocation.