Models

Grok 4 Fast is xAI's latest multimodal model with SOTA cost-efficiency and a 2M token context window. It comes in two flavors: non-reasoning and reasoning. Read more about the model on xAI's [news po ...

X AI 1.91M context $0.2/M input tokens $0.5/M output tokens

xAI: Grok 4

Grok 4 is xAI's latest reasoning model with a 256k context window. It supports parallel tool calling, structured outputs, and both image and text inputs. Note that reasoning is not exposed, reasoning ...

X AI 250K context $3/M input tokens $15/M output tokens

MoonshotAI: Kimi K2

Kimi K2 Instruct is a large-scale Mixture-of-Experts (MoE) language model developed by Moonshot AI, featuring 1 trillion total parameters with 32 billion active per forward pass. It is optimized for ...

Rifx.Online 64K context $0.14/M input tokens $2.49/M output tokens

DeepSeek: DeepSeek V3 0324

DeepSeek V3，一个拥有685B参数的混合专家模型，是DeepSeek团队旗舰聊天模型系列的最新版本。它继承了DeepSeek V3模型，并在多种任务上表现出色。 ...

DeepSeek 62.5K context $0.27/M input tokens $1.1/M output tokens

FREE

MoonshotAI: Kimi K2 (free)

Kimi K2 Instruct is a large-scale Mixture-of-Experts (MoE) language model developed by Moonshot AI, featuring 1 trillion total parameters with 32 billion active per forward pass. It is optimized for ...

Rifx.Online 64K context $0 input tokens $0 output tokens

FREE

DeepSeek: R1 0528 (free)

DeepSeek-R1 1. 介绍我们介绍我们的第一代推理模型，DeepSeek-R1-Zero 和 DeepSeek-R1。 DeepSeek-R1-Zero 是通过大规模强化学习（RL）训练的模型，没有经过监督微调（SFT）作为初步步骤，表现出卓越的推理能力。通过 RL，DeepSeek-R1-Zero 自然展现出许多强大且有趣的推理行为。然而，DeepSeek-R ...

DeepSeek 160K context $0 input tokens $0 output tokens

FREE

tts-1-1106

...

Rifx.Online $0 input tokens $0 output tokens $0.01/M request tokens

FREE

FunAudioLLM/CosyVoice2-0.5B

...

Rifx.Online $0 input tokens $0 output tokens $0.01/M request tokens

FREE

FunAudioLLM/SenseVoiceSmall

...

Rifx.Online $0 input tokens $0 output tokens

gpt-4.1-mini

GPT-4.1 Mini 是一个中型模型，其性能与 GPT-4o 竞争，同时具有显著更低的延迟和成本。它保留了 1 million 的上下文窗口，在困难指令评估中得分 45.1%，在 MultiChallenge 中得分 35.8%，在 IFEval 中得分 84.1%。Mini 还展示了强大的编码能力（例如，在 Aider 的多语言 diff 基准测试中得分 31.6%）和视觉理解能力，使其适 ...

OpenAI 1023.02K context $0.4/M input tokens $1.6/M output tokens

gpt-4.1-mini

GPT-4.1 Mini 是一个中型模型，其性能与 GPT-4o 竞争，同时具有显著更低的延迟和成本。它保留了 1 million 的上下文窗口，在困难指令评估中得分 45.1%，在 MultiChallenge 中得分 35.8%，在 IFEval 中得分 84.1%。Mini 还展示了强大的编码能力（例如，在 Aider 的多语言 diff 基准测试中得分 31.6%）和视觉理解能力，使其适 ...

OpenAI 1023.02K context $0.4/M input tokens $1.6/M output tokens

free/gpt-4.1-nano

对于需要低延迟的任务，GPT‑4.1 nano 是 GPT-4.1 系列中速度最快、成本最低的模型。它以其 1 百万 token 的上下文窗口在小尺寸下提供卓越性能，并在 MMLU 上得分 80.1%，在 GPQA 上得分 50.3%，在 Aider polyglot coding 上得分 9.8%——甚至高于 GPT‑4o mini。它非常适合分类或自动补全等任务。 ...

OpenAI 1023.02K context $0.1/M input tokens $0.4/M output tokens

gpt-4.1