Type something to search...

Models

Cost-efficient, fast, and reliable option for use cases such as translation, summarization, and sentiment analysis. ...

Mistral Small
MistralAI
31.25K context $0.2/M input tokens $0.6/M output tokens

This model is currently powered by Mistral-7B-v0.2, and incorporates a "better" fine-tuning than Mistral 7B, inspired by community work. It's best used for larg ...

Mistral Tiny
MistralAI
31.25K context $0.25/M input tokens $0.25/M output tokens

Google's flagship text generation model. Designed to handle natural language tasks, multiturn text and code chat, and code generation. See the benchmarks and prompting guidelines from [Deepmind](htt ...

Google: Gemini Pro 1.0
Google
31.99K context $0.5/M input tokens $1.5/M output tokens $0.003/M image tokens

The NeverSleep team is back, with a Llama 3 70B finetune trained on their curated roleplay data. Striking a balance between eRP and RP, Lumimaid was designed to be serious, yet uncensored when necess ...

Llama 3 Lumimaid 70B
Meta Llama
8K context $3.375/M input tokens $4.5/M output tokens

A large LLM created by combining two fine-tuned Llama 70B models into one 120B model. Combines Xwin and Euryale. Credits to@chargoddard for developing the fr...

Goliath 120B
Alpindale
6K context $9.375/M input tokens $9.375/M output tokens

Google's flagship multimodal model, supporting image and video in text or chat prompts for a text or code response. See the benchmarks and prompting guidelines from [Deepmind](https://deepmind.googl ...

Google: Gemini Pro Vision 1.0
Google
16K context $0.5/M input tokens $1.5/M output tokens $0.003/M image tokens

WizardLM-2 7B is the smaller variant of Microsoft AI's latest Wizard model. It is the fastest and achieves comparable performance with existing 10x larger opensource leading models It is a finetune ...

WizardLM-2 7B
Microsoft Azure
31.25K context $0.055/M input tokens $0.055/M output tokens

Google's latest multimodal model, supporting image and video in text or chat prompts. Optimized for language tasks including:Code generation Text generation Text editing Problem solving...

Google: Gemini Pro 1.5
Google
1.91M context $1.25/M input tokens $5/M output tokens $0.003/M image tokens

command-r-plus-08-2024 is an update of the Command R+ with roughly 50% higher throughput and 25% lower latencies as compared to the previous Command R+ version, while keepin ...

Cohere: Command R+
Cohere
125K context $2.85/M input tokens $14.25/M output tokens

DBRX is a new open source large language model developed by Databricks. At 132B, it outperforms existing open source LLMs like Llama 2 70B and Mixtral-8x7b on standard indu ...

Databricks: DBRX 132B Instruct
Databricks
32K context $1.08/M input tokens $1.08/M output tokens

The Jamba-Instruct model, introduced by AI21 Labs, is an instruction-tuned variant of their hybrid SSM-Transformer Jamba model, specifically optimized for enterprise applications.256K Context Win...

AI21: Jamba Instruct
Ai21
250K context $0.5/M input tokens $0.7/M output tokens

Euryale 70B v2.1 is a model focused on creative roleplay from Sao10k.Better prompt adherence. Better anatomy / spatial awareness. Adapts much better to unique and...

Llama 3 Euryale 70B v2.1
Rifx.Online
8K context $0.35/M input tokens $0.4/M output tokens

A high-performing, industry-standard 7.3B parameter model, with optimizations for speed and context length. *Mistral 7B Instruct has multiple version variants, and this is intended to be the latest ...

Mistral: Mistral 7B Instruct
MistralAI
32K context $0.055/M input tokens $0.055/M output tokens

Phi-3 Mini is a powerful 3.8B parameter model designed for advanced language understanding, reasoning, and instruction following. Optimized through supervised fine-tuning and preference adjustments, ...

Phi-3 Mini 128K Instruct
Microsoft Azure
125K context $0.1/M input tokens $0.1/M output tokens

Phi-3 128K Medium is a powerful 14-billion parameter model designed for advanced language understanding, reasoning, and instruction following. Optimized through supervised fine-tuning and preference ...

Phi-3 Medium 128K Instruct
Microsoft Azure
125K context $1/M input tokens $1/M output tokens
Tags