Skip to main content
llm.info

Mistral 7B

Open Weights

Mistral AI

Mistral AI's compact but powerful 7.3B parameter model released September 2023. First model from Mistral AI, setting new standards for efficiency and performance in its size class. Outperforms Llama 2 13B on all benchmarks and Llama 1 34B on many benchmarks despite being significantly smaller. Approaches CodeLlama 7B code performance while maintaining strong English capabilities. Features Grouped-Query Attention for faster inference and Sliding Window Attention (4096 tokens) for efficient long sequences. Apache 2.0 license. Deployable anywhere including locally, cloud (AWS/GCP/Azure), available on HuggingFace.

Strengths

Caveats

Capabilities

Vision
Audio
Video
Tool Use

Resources

No external resources available

Reviews

Comments