Mistral: Mixtral 8x22B Instruct

64K Context
$0.9/M Input Tokens
$0.9/M Output Tokens

Mistralai
Text 2 text
17 Apr, 2024

Model Unavailable

Mistral’s official instruct fine-tuned version of Mixtral 8x22B. It is a sparse mixture-of-experts model that activates 39B of its 141B parameters per token, which keeps inference cost low relative to its total size. Its strengths include:

strong math, coding, and reasoning
large context length (64k)
fluency in English, French, Italian, German, and Spanish

See the launch announcement for benchmarks. #moe
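
As a rough illustration of the figures above, the following Python sketch computes the share of parameters active per token (39B of 141B) and estimates the cost of a single request at the listed rate of 0.9 per million tokens. The function names and the example token counts are hypothetical, the price is assumed to be in US dollars as for the other models on this page, and "64K" is read as 64,000 tokens, which may not match the exact limit.

```python
# Figures copied from the listing above. The currency (USD) and the exact
# context limit (64,000 tokens) are assumptions, not confirmed values.
ACTIVE_PARAMS_B = 39
TOTAL_PARAMS_B = 141
PRICE_PER_M_INPUT = 0.9
PRICE_PER_M_OUTPUT = 0.9
CONTEXT_TOKENS = 64_000


def active_fraction(active_b: float, total_b: float) -> float:
    """Share of the model's parameters used for each token."""
    return active_b / total_b


def request_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimated cost of one request at the listed per-million-token rates."""
    if input_tokens + output_tokens > CONTEXT_TOKENS:
        raise ValueError("request exceeds the assumed context window")
    total = input_tokens * PRICE_PER_M_INPUT + output_tokens * PRICE_PER_M_OUTPUT
    return total / 1_000_000


if __name__ == "__main__":
    share = active_fraction(ACTIVE_PARAMS_B, TOTAL_PARAMS_B)
    print(f"active share of parameters: {share:.1%}")
    print(f"cost of 10,000 in / 1,000 out: ${request_cost(10_000, 1_000):.4f}")
```

Roughly 28% of the parameters are active for any given token, which is why per-token compute is closer to that of a 39B dense model than a 141B one.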

Ministral 3B

Text 2 text

Ministral 3B is a 3B parameter model optimized for on-device and edge computing. It excels in knowledge, commonsense reasoning, and function-calling, outperforming larger models like Mi ...

Mistralai 125K context $0.04/M input tokens $0.04/M output tokens

Ministral 8B

Text 2 text

Ministral 8B is an 8B parameter model featuring a unique interleaved sliding-window attention pattern for faster, memory-efficient inference. Designed for edge use cases, it supports up ...

Mistralai 125K context $0.1/M input tokens $0.1/M output tokens

Mistral: Mistral 7B Instruct

Text 2 text

A high-performing, industry-standard 7.3B parameter model, with optimizations for speed and context length. *Mistral 7B Instruct has multiple version variants, and this is intended to ...

Mistralai 32K context $0.055/M input tokens $0.055/M output tokens

Mistral: Mistral Nemo

Text 2 text

A 12B parameter model with a 128k token context length built by Mistral in collaboration with NVIDIA. The model is multilingual, supporting English, French, German, Spanish, Italian, P ...

Mistralai 125K context $0.13/M input tokens $0.13/M output tokens

Mistral Tiny

Text 2 text

This model is currently powered by Mistral-7B-v0.2, and incorporates a "better" fine-tuning than Mistral 7B, inspired by community work. It's best ...

Mistralai 31.25K context $0.25/M input tokens $0.25/M output tokens

Mixtral 8x7B (base)

Text 2 text

A pretrained generative Sparse Mixture of Experts, by Mistral AI. Incorporates 8 experts (feed-forward networks) for a total of 47B parameters. Base model (not fine-tuned for instructio ...

Mistralai 32K context $0.54/M input tokens $0.54/M output tokens
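
To compare the listed prices on a concrete workload, a minimal sketch like the one below ranks the models on this page by estimated cost. The prices are copied from the listings above (the Mixtral 8x22B Instruct rate is assumed to be in US dollars like the others); the function names and the example workload are hypothetical, and context limits are ignored.

```python
# (input, output) price per million tokens, copied from the listings above.
PRICES = {
    "Ministral 3B": (0.04, 0.04),
    "Mistral 7B Instruct": (0.055, 0.055),
    "Ministral 8B": (0.1, 0.1),
    "Mistral Nemo": (0.13, 0.13),
    "Mistral Tiny": (0.25, 0.25),
    "Mixtral 8x7B (base)": (0.54, 0.54),
    "Mixtral 8x22B Instruct": (0.9, 0.9),  # currency assumed to be USD
}


def workload_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Estimated cost of a workload for one model, ignoring context limits."""
    price_in, price_out = PRICES[model]
    return (input_tokens * price_in + output_tokens * price_out) / 1_000_000


def rank_by_cost(input_tokens: int, output_tokens: int) -> list[tuple[str, float]]:
    """Return (model, cost) pairs sorted from cheapest to most expensive."""
    costs = [(name, workload_cost(name, input_tokens, output_tokens)) for name in PRICES]
    return sorted(costs, key=lambda pair: pair[1])


if __name__ == "__main__":
    # Hypothetical workload: 2M input tokens and 500K output tokens.
    for name, cost in rank_by_cost(2_000_000, 500_000):
        print(f"{name:<24} ${cost:.3f}")
```

Price alone does not capture quality or context length, so a ranking like this is only a starting point for choosing a model.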
