Gemini 1.5 Pro
Google's flagship model with industry-leading 2 million token context window - the longest available. Sparse mixture-of-experts (MoE) Transformer architecture enables processing massive amounts of data: 2 hours of video, 19 hours of audio, codebases with 60,000 lines, or 2,000 pages of text. Matches or outperforms Gemini 1.0 Ultra on standard benchmarks while using significantly less training compute. Near-perfect retrieval (>99%) up to 10 million tokens in research tests. Major price reductions in October 2024.
Strengths
Caveats
Capabilities
Vision
Audio
Video
Tool Use
Resources
No external resources available