Llama 3.1 405B
Open Weights
Meta
Meta's largest and most capable open-weights model, rivaling top proprietary models. It is the first openly available model competitive with GPT-4, Claude 3 Opus, and Gemini 1.5 Pro. It was trained on over 16 trillion tokens using more than 16,000 H100 GPUs. It features a 128K-token context window (16x larger than Llama 3's) and supports 8 languages. Its 87.3% score on a general knowledge benchmark exceeds those of GPT-4 Turbo (86.5%) and Claude 3 Opus (86.8%). Its permissive license allows commercial use, self-hosting, and fine-tuning.
Capabilities
Vision
Audio
Video
Tool Use