Models

Google: Gemma 2 27B

Gemma 2 27B by Google is an open model built from the same research and technology used to create the Gemini models. Gemma models are well-suited for a variety of text generation ...

Google 8K context $0.27/M input tokens $0.27/M output tokens

Magnum 72B

Text 2 text

From the maker of Goliath, Magnum 72B is the first in a new family of models designed to achieve the prose quality of the Claude 3 models, notably Opus ...

Alpindale 16K context $3.75/M input tokens $4.5/M output tokens

FREE

Gemma 2 9B by Google is an advanced, open-source language model that sets a new standard for efficiency and performance in its size class. Designed for a wide variety of tasks, it empowers developer ...

Google 8K context $0 input tokens $0 output tokens

Google: Gemma 2 9B

Text 2 text

Gemma 2 9B by Google is an advanced, open-source language model that sets a new standard for efficiency and performance in its size class. Designed for a wide variety of tasks, it empowers developer ...

Google 8K context $0.06/M input tokens $0.06/M output tokens

Mistral: Codestral Mamba

Text 2 text

A 7.3B parameter Mamba-based model designed for code and reasoning tasks.Linear time inference, allowing for theoretically infinite sequence lengths 256k token context window Optimized for qu...

MistralAI 250K context $0.25/M input tokens $0.25/M output tokens

Mistral: Mistral Nemo

Text 2 text

A 12B parameter model with a 128k token context length built by Mistral in collaboration with NVIDIA. The model is multilingual, supporting English, French, German, Spanish, Italian, Portuguese, Chi ...

MistralAI 125K context $0.13/M input tokens $0.13/M output tokens

Qwen 2 7B Instruct

Text 2 text

Qwen2 7B is a transformer-based model that excels in language understanding, multilingual capabilities, coding, mathematics, and reasoning. It features SwiGLU activation, attention QKV bias, and gro ...

Qwen 32K context $0.054/M input tokens $0.054/M output tokens

Meta: Llama 3.1 405B (base)

Text 2 text

Meta's latest class of model (Llama 3.1) launched with a variety of sizes & flavors. This is the base 405B pre-trained version. It has demonstrated strong performance compared to leading closed-sour ...

Meta Llama 128K context $2/M input tokens $2/M output tokens

FREE

Google: Gemini Pro 1.5 Experimental

Text image 2 text

# Free

Google's latest multimodal model, supporting image and video in text or chat prompts. Optimized for language tasks including:Code generation Text generation Text editing Problem solving...

Google 1.91M context $0 input tokens $0 output tokens $0.003/M image tokens

Anthropic: Claude 3.5 Haiku (2024-10-22)

Text 2 text

Claude 3.5 Haiku features enhancements across all skill sets including coding, tool use, and reasoning. As the fastest model in the Anthropic lineup, it offers rapid response times suitable for appli ...

Anthropic 195.31K context $1/M input tokens $5/M output tokens

Anthropic: Claude 3 Opus

Text image 2 text

Claude 3 Opus is Anthropic's most powerful model for highly complex tasks. It boasts top-level performance, intelligence, fluency, and understanding. See the launch announcement and benchmark result ...

Anthropic 195.31K context $15/M input tokens $75/M output tokens $0.024/M image tokens

Anthropic: Claude 3 Sonnet

Text image 2 text

Claude 3 Sonnet is an ideal balance of intelligence and speed for enterprise workloads. Maximum utility at a lower price, dependable, balanced for scaled deployments. See the launch announcement and ...

Anthropic 195.31K context $3/M input tokens $15/M output tokens $0.005/M image tokens

Anthropic: Claude 3 Haiku

Text image 2 text

Claude 3 Haiku is Anthropic's fastest and most compact model for near-instant responsiveness. Quick and accurate targeted performance. See the launch announcement and benchmark results [here](https: ...

Anthropic 195.31K context $0.25/M input tokens $1.25/M output tokens $0.4/K image tokens

Anthropic: Claude 3.5 Haiku

Text 2 text

Claude 3.5 Haiku features enhancements across all skill sets including coding, tool use, and reasoning. As the fastest model in the Anthropic lineup, it offers rapid response times suitable for appli ...

Anthropic 195.31K context $1/M input tokens $5/M output tokens

Anthropic: Claude 3.5 Sonnet

Text image 2 text

Claude 3.5 Sonnet delivers better-than-Opus capabilities, faster-than-Sonnet speeds, at the same Sonnet prices. Sonnet is particularly good at:Coding: Autonomously writes, edits, and runs code wi...

Anthropic 195.31K context $3/M input tokens $15/M output tokens $0.005/M image tokens

Models

Google: Gemma 2 27B

Magnum 72B

Google: Gemma 2 9B (free)

Google: Gemma 2 9B

Mistral: Codestral Mamba

Mistral: Mistral Nemo

Qwen 2 7B Instruct

Meta: Llama 3.1 405B (base)

Google: Gemini Pro 1.5 Experimental

Anthropic: Claude 3.5 Haiku (2024-10-22)

Anthropic: Claude 3 Opus

Anthropic: Claude 3 Sonnet

Anthropic: Claude 3 Haiku

Anthropic: Claude 3.5 Haiku

Anthropic: Claude 3.5 Sonnet

Categories

Tags