Gemini 1.5 Flash
Google's fast, cost-efficient model created through knowledge distillation from Gemini 1.5 Pro. Designed for high-volume, latency-sensitive applications while maintaining strong capabilities. Features 1 million token context window and multimodal support (text, image, audio, video). Excels at summarization, chat, image/video captioning, and data extraction from long documents. Massive price reductions throughout 2024 (78% input, 71% output) make it extremely cost-effective. Improved performance: 7% MMLU-Pro gain, 20% math boost.
Strengths
Caveats
Capabilities
Vision
Audio
Video
Tool Use
Resources
No external resources available