Gemini 2.5 Flash-Lite is now ready for scaled production use
Gemini 2.5 Flash-Lite has moved from preview to general availability. It is a small, cost-efficient, high-quality model with a 1 million-token context window and multimodal capabilities, which makes it suitable for scaled production use. It is best suited to cost-sensitive applications that need to process long texts and multimodal inputs efficiently.
- Gemini 2.5 Flash-Lite is now generally available.
- Offers a 1 million-token context window and multimodal capabilities.
- Suitable for cost-sensitive, scaled production applications.
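
To get a feel for the 1 million-token context window, the sketch below roughly checks whether a text input is likely to fit. It is only an illustration: the 4-characters-per-token ratio is an assumed heuristic rather than the model's real tokenizer, and the helper names `estimate_tokens` and `fits_context` are hypothetical and not part of any Gemini SDK.

```python
import math

# Context window size as stated in the announcement.
CONTEXT_WINDOW_TOKENS = 1_000_000

# ASSUMPTION: a rough average of 4 characters per token for English text.
# The real tokenizer will give different counts.
ASSUMED_CHARS_PER_TOKEN = 4.0


def estimate_tokens(text: str, chars_per_token: float = ASSUMED_CHARS_PER_TOKEN) -> int:
    """Return a rough token estimate for `text` (hypothetical helper)."""
    return math.ceil(len(text) / chars_per_token)


def fits_context(
    text: str,
    reserved_output_tokens: int = 0,
    chars_per_token: float = ASSUMED_CHARS_PER_TOKEN,
) -> bool:
    """Check whether the estimated input plus reserved output fits the window."""
    needed = estimate_tokens(text, chars_per_token) + reserved_output_tokens
    return needed <= CONTEXT_WINDOW_TOKENS


if __name__ == "__main__":
    document = "word " * 200_000  # 1,000,000 characters of sample text
    print(estimate_tokens(document))
    print(fits_context(document, reserved_output_tokens=8_000))
```

Under these assumptions, a document of about one million characters uses roughly a quarter of the window. For production use, an exact count from the provider's own token-counting tooling would be more reliable than this estimate.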