Type something to search...

Models

GPT-4.1 is a flagship large language model optimized for advanced instruction following, real-world software engineering, and long-context reasoning. It supports a 1 million token context window and ...

OpenAI: GPT-4.1
OpenAI
1023.02K context $2/M input tokens $8/M output tokens

For tasks that demand low latency, GPT‑4.1 nano is the fastest and cheapest model in the GPT-4.1 series. It delivers exceptional performance at a small size with its 1 million token context window, a ...

OpenAI: GPT-4.1 Nano
OpenAI
1023.02K context $0.1/M input tokens $0.4/M output tokens

GPT-4.1 Mini is a mid-sized model delivering performance competitive with GPT-4o at substantially lower latency and cost. It retains a 1 million token context window and scores 45.1% on hard instruct ...

OpenAI: GPT-4.1 Mini
OpenAI
1023.02K context $0.4/M input tokens $1.6/M output tokens
FREE

DeepSeek V3, a 685B-parameter, mixture-of-experts model, is the latest iteration of the flagship chat model family from the DeepSeek team. It succeeds the DeepSeek V3 m ...

DeepSeek: DeepSeek V3 0324 (free)
DeepSeek
62.5K context $0 input tokens $0 output tokens
FREE

Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It handles context windows up to 128k tokens, understands over 140 languages, and offers improved math, reasoning, ...

Google: Gemma 3 27B (free)
Google
125K context $0 input tokens $0 output tokens

Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It handles context windows up to 128k tokens, understands over 140 languages, and offers improved math, reasoning, ...

Google: Gemma 3 27B
Google
125K context $0.3/M input tokens $0.5/M output tokens
FREE

dall-e-3 ...

dall-e-3
OpenAI
$0 input tokens $0 output tokens $0.001/M request tokens
FREE

FLUX.1 Redux [dev] is an adapter for all FLUX.1 base models for image variation generation. Given an input image, FLUX.1 Redux can reproduce the image with slight variation, allowing to refine a give ...

black-forest-labs/FLUX.1-redux
Together
$0 input tokens $0 output tokens $0.025/M request tokens

...

tts-1-hd
OpenAI
$300/M input tokens $0 output tokens
FREE

FLUX.1 [schnell] is a 12 billion parameter rectified flow transformer capable of generating images from text descriptions. For more information, please read our [blog post](https://blackforestlabs ...

black-forest-labs/FLUX.1-schnell-Free
Together
$0 input tokens $0 output tokens

Gemini 2.0 Flash Lite offers a significantly faster time to first token (TTFT) compared to Gemini Flash 1.5, while maintaining quality on par with larger models like [Gemi ...

Google: Gemini 2.0 Flash Lite
Google
1M context $0.075/M input tokens $0.3/M output tokens

Qwen2.5-VL is proficient in recognizing common objects such as flowers, birds, fish, and insects. It is also highly capable of analyzing texts, charts, icons, graphics, and layouts within images. ...

Qwen: Qwen2.5 VL 72B Instruct
Qwen
128K context $0.7/M input tokens $0.7/M output tokens

Meta's latest class of model (Llama 3) launched with a variety of sizes & flavors. This is the base 70B pre-trained version. It has demonstrated strong performance compared to leading closed-source ...

Meta: Llama 3 70B (Base)
Meta Llama
8K context $0.59/M input tokens $0.79/M output tokens

Meta's latest class of model (Llama 3) launched with a variety of sizes & flavors. This is the base 8B pre-trained version. It has demonstrated strong performance compared to leading closed-source m ...

Meta: Llama 3 8B (Base)
Meta Llama
8K context $0.05/M input tokens $0.08/M output tokens

Claude 3.7 Sonnet is an advanced large language model with improved reasoning, coding, and problem-solving capabilities. It introduces a hybrid reasoning approach, allowing users to choose between ra ...

Anthropic: Claude 3.7 Sonnet
Anthropic
195.31K context $3/M input tokens $15/M output tokens $0.005/M image tokens
Tags
Type something to search...