Ministral 8B
- 128K Context
- $0.10/M Input Tokens
- $0.10/M Output Tokens
- Mistral AI
- Text-to-text
- Oct 17, 2024
Ministral 8B is an 8B-parameter model featuring an interleaved sliding-window attention pattern for faster, more memory-efficient inference. Designed for edge use cases, it supports up to 128K context length and excels at knowledge and reasoning tasks. It outperforms peers in the sub-10B category, making it well suited to low-latency, privacy-first applications. A sketch of the attention pattern follows below.
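To make the attention pattern concrete, here is a minimal NumPy sketch of a sliding-window causal attention mask, the building block the description refers to. The exact interleaving Ministral uses is not documented here; the alternating per-layer window sizes below are a labeled assumption purely for illustration.

```python
import numpy as np

def sliding_window_causal_mask(seq_len: int, window: int) -> np.ndarray:
    """Boolean mask where True marks key positions a query may attend to.

    Query position i attends only to keys in [i - window + 1, i]:
    causal attention restricted to a fixed local window, which bounds
    the KV cache and attention cost per layer.
    """
    i = np.arange(seq_len)[:, None]  # query positions (rows)
    j = np.arange(seq_len)[None, :]  # key positions (columns)
    return (j <= i) & (j > i - window)

# Assumption: "interleaved" means alternate layers use different window
# sizes, so stacked layers mix short- and longer-range local context.
# These window values are hypothetical, not Ministral's actual config.
hypothetical_layer_windows = [512, 4096, 512, 4096]
masks = [sliding_window_causal_mask(8, w) for w in (2, 4)]
print(masks[0].astype(int))  # window=2: each token sees itself + 1 prior
```

Because each query only ever attends within its window, memory for the key-value cache grows with the window size rather than the full 128K context, which is what enables the edge-friendly inference profile the card describes.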