Models

DeepSeek: R1 Distill Qwen 1.5B

DeepSeek R1 Distill Qwen 1.5B is a distilled large language model based on Qwen 2.5 Math 1.5B, using outputs from [DeepSeek R1](/deepseek/deepseek-r1 ...

DeepSeek 128K context $0.18/M input tokens $0.18/M output tokens

DeepSeek: R1 Distill Llama 8B

Text 2 text

DeepSeek R1 Distill Llama 8B is a distilled large language model based on Llama-3.1-8B-Instruct, using outputs from DeepSeek R1. The mode ...

DeepSeek 31.25K context $0.04/M input tokens $0.04/M output tokens

DeepSeek: R1 Distill Qwen 14B

Text 2 text

DeepSeek R1 Distill Qwen 14B is a distilled large language model based on Qwen 2.5 14B, using outputs from [DeepSeek R1](/deepseek/d ...

DeepSeek 62.5K context $0.15/M input tokens $0.15/M output tokens

DeepSeek: R1 Distill Qwen 32B

Text 2 text

DeepSeek R1 Distill Qwen 32B is a distilled large language model based on Qwen 2.5 32B, using outputs from DeepSeek R1. It outperfo ...

DeepSeek 128K context $0.12/M input tokens $0.18/M output tokens

DeepSeek: R1 (nitro)

Text 2 text

DeepSeek R1 is here: Performance on par with OpenAI o1, but open-sourced and with fully open reasoning tokens. It's 671B parameters in size, with 37B active in an inference pass. Fully ...

DeepSeek 160K context $3/M input tokens $8/M output tokens

MiniMax-01 is a combines MiniMax-Text-01 for text generation and MiniMax-VL-01 for image understanding. It has 456 billion parameters, with 45.9 billion parameters activated per inference, and can ha ...

Rifx.Online 976.75K context $0.2/M input tokens $1.1/M output tokens

Microsoft: Phi 4

Text 2 text

Microsoft Research Phi-4 is designed to perform well in complex reasoning tasks and can operate efficiently in situations with limited memory or where quick responses are needed. At 1 ...

Microsoft Azure 16K context $0.07/M input tokens $0.14/M output tokens

30% OFF

OpenAI: o1-preview

Text 2 text

# Discount

The latest and strongest model family from OpenAI, o1 is designed to spend more time thinking before responding. The o1 models are optimized for math, science, programming, and other STEM-related ta ...

OpenAI 125K context $15/M input tokens $60/M output tokens

40% OFF

OpenAI: o1-mini

Text 2 text

# Discount

The latest and strongest model family from OpenAI, o1 is designed to spend more time thinking before responding. The o1 models are optimized for math, science, programming, and other STEM-related ta ...

OpenAI 125K context $3/M input tokens $12/M output tokens

DeepSeek V3

Text 2 text

# New # Hot

1. Introduction We present DeepSeek-V3, a strong Mixture-of-Experts (MoE) language model with 671B total parameters with 37B activated for each token. To achieve efficient inference and cost-eff ...

DeepSeek 62.5K context $0.14/M input tokens $0.28/M output tokens

OpenAI: o1-mini

Text 2 text

The latest and strongest model family from OpenAI, o1 is designed to spend more time thinking before responding. The o1 models are optimized for math, science, programming, and other STEM-related ta ...

OpenAI 125K context $3/M input tokens $12/M output tokens

OpenAI: o1

Text image 2 text

The latest and strongest model family from OpenAI, o1 is designed to spend more time thinking before responding. The o1 model series is trained with large-scale reinforcement learning to reason using ...

OpenAI 195.31K context $15/M input tokens $60/M output tokens $0.022/M image tokens

FREE

Google: Gemini 2.0 Flash Thinking Experimental (free)

Text image 2 text

# Free

Gemini 2.0 Flash Thinking Mode is an experimental model that's trained to generate the "thinking process" the model goes through as part of its response. As a result, Thinking Mode is capable of stro ...

Google 39.06K context $0 input tokens $0 output tokens

50% OFF

EVA Llama 3.33 70b

Text 2 text

# Discount

EVA Llama 3.33 70b is a roleplay and storywriting specialist model. It is a full-parameter finetune of Llama-3.3-70B-Instruct on mixture of ...

Eva unit 01 16K context $4/M input tokens $6/M output tokens

xAI: Grok 2 Vision 1212

Text image 2 text

Grok 2 Vision 1212 advances image-based AI with stronger visual comprehension, refined instruction-following, and multilingual support. From object recognition to style analysis, it empowers develope ...

X AI 32K context $2/M input tokens $10/M output tokens $0.004/M image tokens

Models

DeepSeek: R1 Distill Qwen 1.5B

DeepSeek: R1 Distill Llama 8B

DeepSeek: R1 Distill Qwen 14B

DeepSeek: R1 Distill Qwen 32B

DeepSeek: R1 (nitro)

MiniMax: MiniMax-01

Microsoft: Phi 4

OpenAI: o1-preview

OpenAI: o1-mini

DeepSeek V3

OpenAI: o1-mini

OpenAI: o1

Google: Gemini 2.0 Flash Thinking Experimental (free)

EVA Llama 3.33 70b

xAI: Grok 2 Vision 1212

Categories

Tags