Google: Gemini 2.5 Flash Lite

Google: Gemini 2.5 Flash Lite

1M Context
0.1/M Input Tokens
0.4/M Output Tokens

Google
Text image 2 text
13 Oct, 2025

与模型对话

Gemini 2.5 Flash-Lite is a lightweight reasoning model in the Gemini 2.5 family, optimized for ultra-low latency and cost efficiency. It offers improved throughput, faster token generation, and better performance across common benchmarks compared to earlier Flash models. By default, “thinking” (i.e. multi-pass reasoning) is disabled to prioritize speed, but developers can enable it via the Reasoning API parameter to selectively trade off cost for intelligence.

Tags :

Google

Share :

Related Posts

Google: Gemini Flash 2.0

Text image 2 text

Gemini Flash 2.0 提供了显著更快的首次令牌时间（TTFT），相比于 Gemini Flash 1.5，同时保持与更大模型如 Gemini Pro 1.5 相当的质量。它在多模态理解、编码能力、复杂指令执行和函数调用方面引入了显著的增强。这些进步共同提供了更无缝和强大的代理体 ...

Google 976.56K context $0.1/M input tokens $0.4/M output tokens

Google: Gemini 2.0 Flash Experimental

Gemini 2.0 Flash 提供了比 Gemini 1.5 Flash 更快的首次令牌时间 (TTFT)，同时保持与更大模型如 Gemini 1.5 Pro 相当的质量。它在多模态理解、编码能力、复杂指令执行和函数调用方面引入了显著的增强。这些进步共同提供了更无缝和强大的代理体验。 ...

Google 976.56K context $0.2/M input tokens $0.6/M output tokens

FREE

Google: Gemini 2.0 Flash Experimental (free)

Gemini 2.0 Flash 提供了比 Gemini 1.5 Flash 更快的首次令牌时间 (TTFT)，同时保持与更大模型如 Gemini 1.5 Pro 相当的质量。它在多模态理解、编码能力、复杂指令执行和函数调用方面引入了显著的增强。这些进步共同提供了更无缝和强大的代理体验。 ...

Google 976.56K context $0 input tokens $0 output tokens

Google: Gemini 2.0 Flash Lite

Text image 2 text

Gemini 2.0 Flash Lite 提供了显著更快的首次令牌时间 (TTFT)，与 Gemini Flash 1.5 相比，同时在质量上与更大模型如 Gemini Pro 1.5 相当，所有这些都以极具经济性的令牌价格进行。 ...

Google 1M context $0.075/M input tokens $0.3/M output tokens

FREE

Google: Gemini Flash Lite 2.0 Preview (free)

Text image 2 text

Gemini Flash Lite 2.0 提供了显著更快的首次令牌时间 (TTFT)，相比于 Gemini Flash 1.5，同时保持与更大模型如 Gemini Pro 1.5 相当的质量。由于目前处于预览阶段，它将会受到 Google 的严格限流。该模型将在 2 月 24 日的 ...

Google 976.56K context $0 input tokens $0 output tokens

FREE

Google: Gemini 2.0 Flash Thinking Experimental (free)

Text image 2 text

Gemini 2.0 Flash Thinking Mode 是一个实验性模型，旨在生成模型在响应过程中经历的“思维过程”。因此，Thinking Mode 在响应中的推理能力比基础 Gemini 2.0 Flash 模型更强。 ...

Google 39.06K context $0 input tokens $0 output tokens