MiniMax: MiniMax-01

976.75K Context
0.2/M Input Tokens
1.1/M Output Tokens

MiniMax-01 is a combines MiniMax-Text-01 for text generation and MiniMax-VL-01 for image understanding. It has 456 billion parameters, with 45.9 billion parameters activated per inference, and can handle a context of up to 4 million tokens.

The text model adopts a hybrid architecture that combines Lightning Attention, Softmax Attention, and Mixture-of-Experts (MoE). The image model adopts the “ViT-MLP-LLM” framework and is trained on top of the text model.

To read more about the release, see: https://www.minimaxi.com/en/news/minimax-01-series-2

Anthropic: Claude 3.5 Haiku (2024-10-22)

Text 2 text

Claude 3.5 Haiku features enhancements across all skill sets including coding, tool use, and reasoning. As the fastest model in the Anthropic lineup, it offers rapid response times suit ...

Rifx.Online 195.31K context $1/M input tokens $5/M output tokens

Google: Gemini Experimental 1121 (free)

Text image 2 text

Experimental release (November 21st, 2024) of Gemini. ...

Rifx.Online 8K context $0 input tokens $0 output tokens

Google: Gemma 2 9B (free)

Text 2 text

Gemma 2 9B by Google is an advanced, open-source language model that sets a new standard for efficiency and performance in its size class. Designed for a wide variety of tasks, it empo ...

Rifx.Online 4K context $0 input tokens $0 output tokens

Inflatebot: Mag Mell R1 12B

Text 2 text

Mag Mell is a merge of pre-trained language models created using mergekit, based on Mistral Nemo. It is a great roleplay and storytelling model which combines the best part ...

Rifx.Online 15.63K context $0.9/M input tokens $0.9/M output tokens

Google: LearnLM 1.5 Pro Experimental (free)

Text image 2 text

An experimental version of Gemini 1.5 Pro from Google. ...

Rifx.Online 8K context $0 input tokens $0 output tokens

Meta: Llama 3.1 70B Instruct (free)

Text 2 text

Meta's latest class of model (Llama 3.1) launched with a variety of sizes & flavors. This 70B instruct-tuned version is optimized for high quality dialogue usecases. It has demonstrate ...

Rifx.Online 8K context $0 input tokens $0 output tokens

MiniMax: MiniMax-01

Tags :

Share :

Related Posts

Anthropic: Claude 3.5 Haiku (2024-10-22)

Google: Gemini Experimental 1121 (free)

Google: Gemma 2 9B (free)

Inflatebot: Mag Mell R1 12B

Google: LearnLM 1.5 Pro Experimental (free)

Meta: Llama 3.1 70B Instruct (free)