DeepSeek: DeepSeek V3.1 (free)
FREE

159.96K Context
0 Input Tokens
0 Output Tokens

DeepSeek
Text 2 text
13 Oct, 2025

DeepSeek-V3.1 is a large hybrid reasoning model (671B parameters, 37B active) that supports both thinking and non-thinking modes via prompt templates. It extends the DeepSeek-V3 base with a two-phase long-context training process, reaching up to 128K tokens, and uses FP8 microscaling for efficient inference. Users can control the reasoning behaviour with the reasoning enabled boolean. Learn more in our docs

The model improves tool use, code generation, and reasoning efficiency, achieving performance comparable to DeepSeek-R1 on difficult benchmarks while responding more quickly. It supports structured tool calling, code agents, and search agents, making it suitable for research, coding, and agentic workflows.

It succeeds the DeepSeek V3-0324 model and performs well on a variety of tasks.

DeepSeek: DeepSeek V3 0324

Text 2 text

DeepSeek V3, a 685B-parameter, mixture-of-experts model, is the latest iteration of the flagship chat model family from the DeepSeek team. It succeeds the DeepSeek V3 m ...

DeepSeek 62.5K context $0.27/M input tokens $1.1/M output tokens

FREE

DeepSeek: DeepSeek V3 0324 (free)

Text 2 text

# Free

DeepSeek V3, a 685B-parameter, mixture-of-experts model, is the latest iteration of the flagship chat model family from the DeepSeek team. It succeeds the DeepSeek V3 m ...

DeepSeek 62.5K context $0 input tokens $0 output tokens

DeepSeek V3

Text 2 text

# New # Hot

1. Introduction We present DeepSeek-V3, a strong Mixture-of-Experts (MoE) language model with 671B total parameters with 37B activated for each token. To achieve efficient inference and cost-eff ...

DeepSeek 62.5K context $0.14/M input tokens $0.28/M output tokens

DeepSeek V3

Text 2 text

# New # Hot

DeepSeek-V3 is the latest model from the DeepSeek team, building upon the instruction following and coding abilities of the previous versions. Pre-trained on nearly 15 trillion tokens, the reported ...

DeepSeek 62.5K context $0.14/M input tokens $0.28/M output tokens

FREE

DeepSeek: R1 0528 (free)

Text 2 text

# Free

DeepSeek-R1 1. Introduction We introduce our first-generation reasoning models, DeepSeek-R1-Zero and DeepSeek-R1. DeepSeek-R1-Zero, a model trained via large-scale reinforcement learning (R ...

DeepSeek 160K context $0 input tokens $0 output tokens

DeepSeek: DeepSeek R1 Distill Llama 70B

Text 2 text

DeepSeek R1 Distill Llama 70B is a distilled large language model based on Llama-3.3-70B-Instruct, using outputs from DeepSeek R1. The m ...

DeepSeek 128K context $0.23/M input tokens $0.69/M output tokens

DeepSeek: DeepSeek V3.1 (free)
FREE

Tags :

Share :

Related Posts

DeepSeek: DeepSeek V3 0324

DeepSeek: DeepSeek V3 0324 (free)

DeepSeek V3

DeepSeek V3

DeepSeek: R1 0528 (free)

DeepSeek: DeepSeek R1 Distill Llama 70B

DeepSeek: DeepSeek V3.1 (free) FREE

Tags :

Share :

Related Posts

DeepSeek: DeepSeek V3 0324

DeepSeek: DeepSeek V3 0324 (free)

DeepSeek V3

DeepSeek V3

DeepSeek: R1 0528 (free)

DeepSeek: DeepSeek R1 Distill Llama 70B

DeepSeek: DeepSeek V3.1 (free)
FREE