Yi-Large
01.AI
01.AI's proprietary flagship model with 70B parameters, released in May 2024. Built by the Chinese AI company 01.AI and trained from scratch, it improves significantly on the open-source Yi-34B model. Reportedly performs on par with GPT-4 and Claude 3 across benchmarks. Exceptional multilingual capabilities: excels in Spanish, Chinese, Japanese, German, and French per the LMSYS multilingual leaderboard. Ranked first on AlpacaEval 2.0 shortly after launch. Trained on 3.1T tokens of English and Chinese text, cleaned with cascaded deduplication and quality filtering. Uses a decoder-only transformer with pre-normalization, SwiGLU activations, RoPE (rotary position embeddings), and Grouped Query Attention.
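Of the architecture terms above, Grouped Query Attention is perhaps the least self-explanatory: several query heads share one key/value head, which shrinks the key/value cache at inference time. The sketch below illustrates only that sharing pattern and the resulting cache arithmetic. The helper names and every number in it (8 query heads, 2 key/value heads, 4 layers, head size 64, 1024 tokens) are hypothetical values chosen for illustration, not Yi-Large's actual configuration, which this entry does not state.

```python
def kv_head_for(query_head: int, n_query_heads: int, n_kv_heads: int) -> int:
    """Return the index of the key/value head that a query head reads from."""
    if n_query_heads % n_kv_heads != 0:
        raise ValueError("n_query_heads must be a multiple of n_kv_heads")
    group_size = n_query_heads // n_kv_heads
    # Consecutive query heads are grouped onto the same key/value head.
    return query_head // group_size


def kv_cache_elements(n_layers: int, n_kv_heads: int, head_dim: int, seq_len: int) -> int:
    """Count cached values: one key and one value vector per layer, head, and token."""
    return 2 * n_layers * n_kv_heads * head_dim * seq_len


# Hypothetical toy configuration (NOT Yi-Large's real settings).
N_QUERY_HEADS = 8
N_KV_HEADS = 2

# Which key/value head each of the 8 query heads uses.
mapping = [kv_head_for(h, N_QUERY_HEADS, N_KV_HEADS) for h in range(N_QUERY_HEADS)]

# Cache size with one key/value head per query head versus grouped sharing.
full_cache = kv_cache_elements(n_layers=4, n_kv_heads=N_QUERY_HEADS, head_dim=64, seq_len=1024)
grouped_cache = kv_cache_elements(n_layers=4, n_kv_heads=N_KV_HEADS, head_dim=64, seq_len=1024)

print(mapping)
print(full_cache // grouped_cache)
```

In this toy setup the cache shrinks by the ratio of query heads to key/value heads, which is the usual motivation for grouping.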
Strengths
Caveats
Capabilities
Vision
Audio
Video
Tool Use
Resources
No external resources available