DeepSeek: DeepSeek V3.2 Exp

  • 160K Context
  • $0.27/M Input Tokens
  • $0.40/M Output Tokens
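
For concreteness, here is a quick sketch of what these per-million-token rates imply for a single long-context request. The token counts are hypothetical; only the arithmetic is fixed by the pricing above.

```python
# Worked example of the per-million-token pricing listed above.
input_rate = 0.27 / 1_000_000   # $ per input token
output_rate = 0.40 / 1_000_000  # $ per output token

# Hypothetical request: 120K-token prompt, 2K-token completion.
prompt_tokens, completion_tokens = 120_000, 2_000

cost = prompt_tokens * input_rate + completion_tokens * output_rate
print(f"${cost:.4f}")  # $0.0332
```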

DeepSeek-V3.2-Exp is an experimental large language model released by DeepSeek as an intermediate step between V3.1 and future architectures. It introduces DeepSeek Sparse Attention (DSA), a fine-grained sparse attention mechanism designed to improve training and inference efficiency in long-context scenarios while maintaining output quality. Users can control the reasoning behaviour with the reasoning `enabled` boolean. Learn more in our docs.
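
As a concrete illustration of the toggle described above, here is a minimal sketch of a chat-completions request that sets the reasoning boolean. The endpoint URL, model slug, and the exact shape of the `reasoning` field are assumptions made for illustration; consult the docs linked above for the authoritative schema.

```python
# Minimal sketch: toggling reasoning on an OpenAI-compatible endpoint.
# URL, model slug, and the `reasoning` payload shape are assumptions.
import os
import requests

API_URL = "https://openrouter.ai/api/v1/chat/completions"  # assumed endpoint

payload = {
    "model": "deepseek/deepseek-v3.2-exp",  # assumed model slug
    "messages": [
        {"role": "user", "content": "Prove that sqrt(2) is irrational."}
    ],
    # The boolean mentioned above: enable or disable the reasoning phase.
    "reasoning": {"enabled": True},
}

resp = requests.post(
    API_URL,
    headers={"Authorization": f"Bearer {os.environ['OPENROUTER_API_KEY']}"},
    json=payload,
    timeout=120,
)
resp.raise_for_status()
print(resp.json()["choices"][0]["message"]["content"])
```

Setting `"enabled": False` would request the non-reasoning behaviour for latency-sensitive calls, under the same assumed schema.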

The model was trained under conditions aligned with V3.1-Terminus to enable direct comparison. Benchmarking shows performance roughly on par with V3.1 across reasoning, coding, and agentic tool-use tasks, with minor tradeoffs and gains depending on the domain. This release focuses on validating architectural optimizations for extended context lengths rather than advancing raw task accuracy, making it primarily a research-oriented model for exploring efficient transformer designs.
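
To make the sparse-attention idea concrete, below is a toy sketch of fine-grained top-k sparse attention: each query attends only to its k highest-scoring keys rather than the full context. This illustrates the general technique, not DeepSeek's actual DSA implementation, which uses a learned lightweight indexer and optimized kernels; the toy version still computes dense scores before masking, so it demonstrates the math without the efficiency gains.

```python
import numpy as np

def topk_sparse_attention(q, k, v, k_top):
    """Toy fine-grained sparse attention: each query attends only to its
    k_top highest-scoring keys. Illustrative only; a real sparse kernel
    avoids materializing the full score matrix in the first place."""
    d = q.shape[-1]
    scores = q @ k.T / np.sqrt(d)  # (n_q, n_k) dense scores, toy version

    # Keep the k_top largest scores per query; mask out everything else.
    idx = np.argpartition(scores, -k_top, axis=-1)[:, -k_top:]
    masked = np.full_like(scores, -np.inf)
    np.put_along_axis(
        masked, idx, np.take_along_axis(scores, idx, axis=-1), axis=-1
    )

    # Softmax over the surviving scores only (masked entries become 0).
    w = np.exp(masked - masked.max(axis=-1, keepdims=True))
    w /= w.sum(axis=-1, keepdims=True)
    return w @ v  # (n_q, d_v)

rng = np.random.default_rng(0)
q = rng.standard_normal((8, 64))
k = rng.standard_normal((1024, 64))
v = rng.standard_normal((1024, 64))

out = topk_sparse_attention(q, k, v, k_top=32)  # each query sees 32 of 1024 keys
print(out.shape)  # (8, 64)
```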

Related Posts

DeepSeek: DeepSeek V3 0324
DeepSeek · 62.5K context · $0.27/M input tokens · $1.1/M output tokens
DeepSeek V3, a 685B-parameter, mixture-of-experts model, is the latest iteration of the flagship chat model family from the DeepSeek team. It succeeds the DeepSeek V3 m ...

DeepSeek: DeepSeek V3 0324 (free)
DeepSeek · 62.5K context · $0 input tokens · $0 output tokens · FREE
DeepSeek V3, a 685B-parameter, mixture-of-experts model, is the latest iteration of the flagship chat model family from the DeepSeek team. It succeeds the DeepSeek V3 m ...

DeepSeek: DeepSeek V3.1 (free)
DeepSeek · 159.96K context · $0 input tokens · $0 output tokens
DeepSeek-V3.1 is a large hybrid reasoning model (671B parameters, 37B active) that supports both thinking and non-thinking modes via prompt templates. It extends the DeepSeek-V3 base with a two-phase ...

DeepSeek V3
DeepSeek · 62.5K context · $0.14/M input tokens · $0.28/M output tokens
1. Introduction We present DeepSeek-V3, a strong Mixture-of-Experts (MoE) language model with 671B total parameters with 37B activated for each token. To achieve efficient inference and cost-eff ...

DeepSeek V3
DeepSeek · 62.5K context · $0.14/M input tokens · $0.28/M output tokens
DeepSeek-V3 is the latest model from the DeepSeek team, building upon the instruction following and coding abilities of the previous versions. Pre-trained on nearly 15 trillion tokens, the reported ...

DeepSeek: R1 0528 (free)
DeepSeek · 160K context · $0 input tokens · $0 output tokens
DeepSeek-R1 1. Introduction We introduce our first-generation reasoning models, DeepSeek-R1-Zero and DeepSeek-R1. DeepSeek-R1-Zero, a model trained via large-scale reinforcement learning (R ...