Mistral 7B
Open Weights
Mistral AI
Mistral AI's compact but powerful 7.3B parameter model released September 2023. First model from Mistral AI, setting new standards for efficiency and performance in its size class. Outperforms Llama 2 13B on all benchmarks and Llama 1 34B on many benchmarks despite being significantly smaller. Approaches CodeLlama 7B code performance while maintaining strong English capabilities. Features Grouped-Query Attention for faster inference and Sliding Window Attention (4096 tokens) for efficient long sequences. Apache 2.0 license. Deployable anywhere including locally, cloud (AWS/GCP/Azure), available on HuggingFace.
Strengths
Caveats
Capabilities
Vision
Audio
Video
Tool Use
Resources
No external resources available