MiniMax: MiniMax-01

  • 976.75K Context
  • $0.2/M Input Tokens
  • $1.1/M Output Tokens
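
As a rough illustration of what these rates mean per request, here is a minimal sketch that assumes the listed prices are US dollars per one million tokens. The function name and the token counts are hypothetical and chosen only for the example; actual billing may differ from this simple calculation.

```python
# Listed rates, assumed to be US dollars per one million tokens.
INPUT_PRICE_PER_M = 0.2
OUTPUT_PRICE_PER_M = 1.1


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Return the estimated request cost in dollars at the listed rates."""
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


if __name__ == "__main__":
    # Hypothetical request: a 100,000-token prompt and a 2,000-token reply.
    print(f"${estimate_cost(100_000, 2_000):.4f}")
```

At these rates, a long prompt accounts for most of the cost even though output tokens are priced higher per token.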

MiniMax-01 combines MiniMax-Text-01 for text generation and MiniMax-VL-01 for image understanding. It has 456 billion parameters, with 45.9 billion parameters activated per inference, and can handle a context of up to 4 million tokens.

The text model adopts a hybrid architecture that combines Lightning Attention, Softmax Attention, and Mixture-of-Experts (MoE). The image model adopts the “ViT-MLP-LLM” framework and is trained on top of the text model.

To read more about the release, see: https://www.minimaxi.com/en/news/minimax-01-series-2
