Name: Llama 3.3 70B
Author: Meta

Meta's efficiency-optimized model, delivering Llama 3.1 405B-level quality at a fraction of the cost. Released in December 2024 with a breakthrough cost-performance ratio: it generates responses nearly 5x more cost-efficiently than Llama 3.1 405B while maintaining similar output quality. Features a 128K context window and achieves 86% on MMLU Chat, matching Llama 3.1 70B. Improves on Llama 3.1 70B in math (77% on MATH, up from 67.8%) and multilingual reasoning (91.1% on MGSM, up from 86.9%), and scores 92.1% on IFEval for instruction following. Pretrained on 15T tokens, with a reported cumulative 39.3M H100 GPU hours of training compute. Runs on accessible hardware (2-4 A100s).
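
The "2-4 A100s" figure can be sanity-checked with a rough weight-memory estimate. The sketch below is an illustration only, not Meta's sizing guidance: it assumes a nominal 70B parameters stored at 16-bit precision (2 bytes each) and A100 cards with 80 GB or 40 GB of memory, and it counts weights alone. The helper name `min_gpus_for_weights` is hypothetical.

```python
import math


def min_gpus_for_weights(params_billion: float, bytes_per_param: float, gpu_mem_gb: float) -> int:
    """Lower bound on the GPU count needed just to hold the model weights.

    Ignores KV cache, activations, and framework overhead, so a real
    deployment needs more headroom than this estimate suggests.
    """
    # 1e9 parameters * bytes per parameter / 1e9 bytes per GB
    weights_gb = params_billion * bytes_per_param
    return math.ceil(weights_gb / gpu_mem_gb)


PARAMS_BILLION = 70   # assumption: nominal size taken from the model name
BYTES_PER_PARAM = 2   # assumption: 16-bit weights (FP16/BF16)

for gpu_mem_gb in (80, 40):  # assumption: A100 80 GB and 40 GB variants
    count = min_gpus_for_weights(PARAMS_BILLION, BYTES_PER_PARAM, gpu_mem_gb)
    print(f"A100 {gpu_mem_gb} GB: at least {count} GPUs for weights alone")
```

Under these assumptions the weights take about 140 GB, which gives a floor of two 80 GB cards or four 40 GB cards. Long contexts add KV-cache memory on top, and quantized weights would lower the floor.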
