Models

Goliath 120B

A large LLM created by combining two fine-tuned Llama 70B models into one 120B model. Combines Xwin and Euryale. Credits to@chargoddard for developing the fr...

Alpindale 6K context $9.375/M input tokens $9.375/M output tokens

Google's flagship multimodal model, supporting image and video in text or chat prompts for a text or code response. See the benchmarks and prompting guidelines from [Deepmind](https://deepmind.googl ...

Google 16K context $0.5/M input tokens $1.5/M output tokens $0.003/M image tokens

WizardLM-2 7B

Text 2 text

WizardLM-2 7B is the smaller variant of Microsoft AI's latest Wizard model. It is the fastest and achieves comparable performance with existing 10x larger opensource leading models It is a finetune ...

Microsoft Azure 31.25K context $0.055/M input tokens $0.055/M output tokens

Google: Gemini Pro 1.5

Text image 2 text

Google's latest multimodal model, supporting image and video in text or chat prompts. Optimized for language tasks including:Code generation Text generation Text editing Problem solving...

Google 1.91M context $1.25/M input tokens $5/M output tokens $0.003/M image tokens

Cohere: Command R+

Text 2 text

command-r-plus-08-2024 is an update of the Command R+ with roughly 50% higher throughput and 25% lower latencies as compared to the previous Command R+ version, while keepin ...

Cohere 125K context $2.85/M input tokens $14.25/M output tokens

Databricks: DBRX 132B Instruct

Text 2 text

DBRX is a new open source large language model developed by Databricks. At 132B, it outperforms existing open source LLMs like Llama 2 70B and Mixtral-8x7b on standard indu ...

Databricks 32K context $1.08/M input tokens $1.08/M output tokens

AI21: Jamba Instruct

Text 2 text

The Jamba-Instruct model, introduced by AI21 Labs, is an instruction-tuned variant of their hybrid SSM-Transformer Jamba model, specifically optimized for enterprise applications.256K Context Win...

Ai21 250K context $0.5/M input tokens $0.7/M output tokens

Llama 3 Euryale 70B v2.1

Text 2 text

Euryale 70B v2.1 is a model focused on creative roleplay from Sao10k.Better prompt adherence. Better anatomy / spatial awareness. Adapts much better to unique and...

Rifx.Online 8K context $0.35/M input tokens $0.4/M output tokens

Mistral: Mistral 7B Instruct

Text 2 text

A high-performing, industry-standard 7.3B parameter model, with optimizations for speed and context length. *Mistral 7B Instruct has multiple version variants, and this is intended to be the latest ...

MistralAI 32K context $0.055/M input tokens $0.055/M output tokens

Phi-3 Mini 128K Instruct

Text 2 text

Phi-3 Mini is a powerful 3.8B parameter model designed for advanced language understanding, reasoning, and instruction following. Optimized through supervised fine-tuning and preference adjustments, ...

Microsoft Azure 125K context $0.1/M input tokens $0.1/M output tokens

Phi-3 Medium 128K Instruct

Text 2 text

Phi-3 128K Medium is a powerful 14-billion parameter model designed for advanced language understanding, reasoning, and instruction following. Optimized through supervised fine-tuning and preference ...

Microsoft Azure 125K context $1/M input tokens $1/M output tokens

Google: Gemini Flash 1.5

Text image 2 text

Gemini 1.5 Flash is a foundation model that performs well at a variety of multimodal tasks such as visual understanding, classification, summarization, and creating content from image, audio and vide ...

Google 976.56K context $0.075/M input tokens $0.3/M output tokens $0.04/K image tokens

Cohere: Command

Text 2 text

Command is an instruction-following conversational model that performs language tasks with high quality, more reliably and with a longer context than our base generative models. Use of this model is ...

Cohere 4K context $0.95/M input tokens $1.9/M output tokens

Cohere: Command R

Text 2 text

Command-R is a 35B parameter model that performs conversational language tasks at a higher quality, more reliably, and with a longer context than previous models. It can be used for complex workflows ...

Cohere 125K context $0.475/M input tokens $1.425/M output tokens

FREE

Qwen 2 7B Instruct (free)

Text 2 text

# Free

Qwen2 7B is a transformer-based model that excels in language understanding, multilingual capabilities, coding, mathematics, and reasoning. It features SwiGLU activation, attention QKV bias, and gro ...

Qwen 32K context $0 input tokens $0 output tokens

Models

Goliath 120B

Google: Gemini Pro Vision 1.0

WizardLM-2 7B

Google: Gemini Pro 1.5

Cohere: Command R+

Databricks: DBRX 132B Instruct

AI21: Jamba Instruct

Llama 3 Euryale 70B v2.1

Mistral: Mistral 7B Instruct

Phi-3 Mini 128K Instruct

Phi-3 Medium 128K Instruct

Google: Gemini Flash 1.5

Cohere: Command

Cohere: Command R

Qwen 2 7B Instruct (free)

Categories

Tags