Type something to search...

Models

Gemini 2.0 Flash Lite offers a significantly faster time to first token (TTFT) compared to Gemini Flash 1.5, while maintaining quality on par with larger models like [Gemi ...

Google: Gemini 2.0 Flash Lite
Google
1M context $0.075/M input tokens $0.3/M output tokens

Qwen2.5-VL is proficient in recognizing common objects such as flowers, birds, fish, and insects. It is also highly capable of analyzing texts, charts, icons, graphics, and layouts within images. ...

Qwen: Qwen2.5 VL 72B Instruct
Qwen
128K context $0.7/M input tokens $0.7/M output tokens

Meta's latest class of model (Llama 3) launched with a variety of sizes & flavors. This is the base 70B pre-trained version. It has demonstrated strong performance compared to leading closed-source ...

Meta: Llama 3 70B (Base)
Meta Llama
8K context $0.59/M input tokens $0.79/M output tokens

Meta's latest class of model (Llama 3) launched with a variety of sizes & flavors. This is the base 8B pre-trained version. It has demonstrated strong performance compared to leading closed-source m ...

Meta: Llama 3 8B (Base)
Meta Llama
8K context $0.05/M input tokens $0.08/M output tokens

Claude 3.7 Sonnet is an advanced large language model with improved reasoning, coding, and problem-solving capabilities. It introduces a hybrid reasoning approach, allowing users to choose between ra ...

Anthropic: Claude 3.7 Sonnet
Anthropic
195.31K context $3/M input tokens $15/M output tokens $0.005/M image tokens

Note: As this model does not return tags, thoughts will be streamed by default directly to the content field. R1 1776 is a version of DeepSeek-R1 that has been post-trained to remove censo ...

Perplexity: R1 1776
Perplexity
125K context $2/M input tokens $8/M output tokens
20% OFF

OpenAI o3-mini-high is the same model as o3-mini with reasoning_effort set to high. o3-mini is a cost-efficient language model optimized for STEM reasoning tasks, particularly ex ...

OpenAI: o3 Mini High
OpenAI
195.31K context $1.1/M input tokens $4.4/M output tokens
20% OFF

DeepSeek-R1 1. Introduction We introduce our first-generation reasoning models, DeepSeek-R1-Zero and DeepSeek-R1. DeepSeek-R1-Zero, a model trained via large-scale reinforcement learning (R ...

DeepSeek: R1
DeepSeek
160K context $3/M input tokens $8/M output tokens

Gemini Flash 2.0 offers a significantly faster time to first token (TTFT) compared to Gemini Flash 1.5, while maintaining quality on par with larger models like [Gemini Pr ...

Google: Gemini Flash 2.0
Google
976.56K context $0.1/M input tokens $0.4/M output tokens

DeepSeek R1 Distill Llama 70B is a distilled large language model based on Llama-3.3-70B-Instruct, using outputs from DeepSeek R1. The m ...

DeepSeek: DeepSeek R1 Distill Llama 70B
DeepSeek
128K context $0.23/M input tokens $0.69/M output tokens

Lunaris 8B is a versatile generalist and roleplaying model based on Llama 3. It's a strategic merge of multiple models, designed to balance creativity with improved logic and general knowledge. Crea ...

Sao10K: Llama 3 8B Lunaris
Rifx.Online
8K context $0.03/M input tokens $0.06/M output tokens

Mag Mell is a merge of pre-trained language models created using mergekit, based on Mistral Nemo. It is a great roleplay and storytelling model which combines the best part ...

Inflatebot: Mag Mell R1 12B
Rifx.Online
15.63K context $0.9/M input tokens $0.9/M output tokens

The Meta Llama 3.3 multilingual large language model (LLM) is a pretrained and instruction tuned generative model in 70B (text in/text out). The Llama 3.3 instruction tuned text only model is optimiz ...

Meta: Llama 3.3 70B Instruct
Meta Llama
128K context $0.13/M input tokens $0.4/M output tokens

text-embedding-3-small is OpenAI's cost-effective text embedding model, serving as the lightweight version in the text-embedding-3 series. This model maintains good performance while offering a more ...

text-embedding-3-small
OpenAI
$0.02/M input tokens $0 output tokens

Amazon Nova Lite 1.0 is a very low-cost multimodal model from Amazon that focused on fast processing of image, video, and text inputs to generate text output. Amazon Nova Lite can handle real-time cu ...

Amazon: Nova Lite 1.0
Amazon
292.97K context $0.06/M input tokens $0.24/M output tokens
Tags
Type something to search...