Models

FREE

A lightweight and ultra-fast variant of Llama 3.3 70B, for use when quick response times are needed most. ...

Meta: Llama 3.3 8B Instruct (free)
Rifx.Online
125K context $0 input tokens $0 output tokens

Grok Code Fast 1 is a fast, economical reasoning model that excels at agentic coding. With reasoning traces visible in the response, developers can steer Grok Code toward high-quality workflows. ...

xAI: Grok Code Fast 1
X AI
250K context $0.2/M input tokens $1.5/M output tokens

Grok 4 Fast is xAI's latest multimodal model with SOTA cost-efficiency and a 2M token context window. It comes in two flavors: non-reasoning and reasoning. Read more about the model on xAI's [news po ...

xAI: Grok 4 Fast
X AI
1.91M context $0.2/M input tokens $0.5/M output tokens

Grok 4 is xAI's latest reasoning model with a 256k context window. It supports parallel tool calling, structured outputs, and both image and text inputs. Note that reasoning is not exposed, reasoning ...

xAI: Grok 4
X AI
250K context $3/M input tokens $15/M output tokens

Kimi K2 Instruct is a large-scale Mixture-of-Experts (MoE) language model developed by Moonshot AI, featuring 1 trillion total parameters with 32 billion active per forward pass. It is optimized for ...

MoonshotAI: Kimi K2
Rifx.Online
64K context $0.14/M input tokens $2.49/M output tokens

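DeepSeek V3, a 685B-parameter Mixture-of-Experts model, is the latest iteration of the DeepSeek team's flagship chat model family. It succeeds the DeepSeek V3 model and performs well across a variety of tasks. ...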

DeepSeek: DeepSeek V3 0324
DeepSeek
62.5K context $0.27/M input tokens $1.1/M output tokens
FREE

Kimi K2 Instruct is a large-scale Mixture-of-Experts (MoE) language model developed by Moonshot AI, featuring 1 trillion total parameters with 32 billion active per forward pass. It is optimized for ...

MoonshotAI: Kimi K2 (free)
Rifx.Online
64K context $0 input tokens $0 output tokens
FREE

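DeepSeek-R1 1. Introduction We introduce our first-generation reasoning models, DeepSeek-R1-Zero and DeepSeek-R1. DeepSeek-R1-Zero, a model trained via large-scale reinforcement learning (RL) without supervised fine-tuning (SFT) as a preliminary step, demonstrates remarkable reasoning capabilities. Through RL, DeepSeek-R1-Zero naturally exhibits many powerful and interesting reasoning behaviors. However, DeepSeek-R ...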

DeepSeek: R1 0528 (free)
DeepSeek
160K context $0 input tokens $0 output tokens
FREE

...

tts-1-1106
Rifx.Online
$0 input tokens $0 output tokens $0.01/M request tokens
FREE

...

FunAudioLLM/CosyVoice2-0.5B
Rifx.Online
$0 input tokens $0 output tokens $0.01/M request tokens
FREE

...

FunAudioLLM/SenseVoiceSmall
Rifx.Online
$0 input tokens $0 output tokens

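GPT-4.1 Mini is a mid-sized model that delivers performance competitive with GPT-4o at substantially lower latency and cost. It retains a 1 million token context window and scores 45.1% on hard instruction evals, 35.8% on MultiChallenge, and 84.1% on IFEval. Mini also shows strong coding ability (e.g., 31.6% on Aider's polyglot diff benchmark) and vision understanding, making it suitable ...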

gpt-4.1-mini
OpenAI
1023.02K context $0.4/M input tokens $1.6/M output tokens

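For tasks that demand low latency, GPT-4.1 nano is the fastest and cheapest model in the GPT-4.1 series. With its 1 million token context window, it delivers exceptional performance at a small size, scoring 80.1% on MMLU, 50.3% on GPQA, and 9.8% on Aider polyglot coding, even higher than GPT-4o mini. It is ideal for tasks such as classification or autocompletion. ...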

free/gpt-4.1-nano
OpenAI
1023.02K context $0.1/M input tokens $0.4/M output tokens

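GPT-4.1 is a flagship large language model optimized for advanced instruction following, real-world software engineering, and long-context reasoning. It supports a 1 million token context window and outperforms GPT-4o and GPT-4.5 across coding (54.6% SWE-bench Verified), instruction compliance (87.4% IFEval), and multimodal understanding benchmarks. It is tuned to deliver precise code diffs, agent reliability, and high recall in large document contexts, ...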

gpt-4.1
OpenAI
1023.02K context $2/M input tokens $8/M output tokens