Mistral: Mixtral 8x22B Instruct
- 64K Context
- 0.9/M Input Tokens
- 0.9/M Output Tokens
- Mistralai
- Text 2 text
- 17 Apr, 2024
Mistral’s official instruct fine-tuned version of Mixtral 8x22B. It uses 39B active parameters out of 141B, offering unparalleled cost efficiency for its size. Its strengths include:
- strong math, coding, and reasoning
- large context length (64k)
- fluency in English, French, Italian, German, and Spanish
See benchmarks on the launch announcement here. #moe