DeepSeek: DeepSeek V3.1 Base

Text Generation
About DeepSeek V3.1 Base

This is a base model, trained only for raw next-token prediction. Unlike instruct/chat models, it has not been fine-tuned to follow user instructions. Prompts should therefore be written like training text or completion examples rather than as plain requests (e.g., "Translate the following sentence: …" followed by the text, rather than just "Translate this"); a few-shot sketch follows below.
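
As a minimal sketch of completion-style prompting, the example below builds a few-shot prompt and asks the model to continue it. The base_url and model slug are assumptions for illustration, not this platform's documented values; any OpenAI-compatible completions endpoint would look similar.

```python
# Minimal sketch: few-shot, completion-style prompting of a base model.
# The endpoint URL and model slug below are assumptions, not documented values.
from openai import OpenAI

client = OpenAI(
    base_url="https://api.example.com/v1",  # hypothetical OpenAI-compatible endpoint
    api_key="YOUR_API_KEY",
)

# Base models continue text; give them examples to imitate, not instructions.
prompt = (
    "English: Good morning.\n"
    "French: Bonjour.\n"
    "\n"
    "English: Where is the train station?\n"
    "French: Où est la gare ?\n"
    "\n"
    "English: The weather is nice today.\n"
    "French:"
)

# Legacy-style /completions call: raw continuation, no chat template applied.
resp = client.completions.create(
    model="deepseek/deepseek-v3.1-base",  # assumed model slug
    prompt=prompt,
    max_tokens=32,
    temperature=0.0,
    stop=["\n"],  # cut generation at the end of the translated line
)
print(resp.choices[0].text.strip())
```

Because no chat template is applied, stop sequences and careful prompt formatting do the job that instruction tuning does in chat models.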

DeepSeek-V3.1 Base is a 671B-parameter open Mixture-of-Experts (MoE) language model that activates 37B parameters per forward pass and supports a 128K-token context. Trained on 14.8T tokens using FP8 mixed precision, it achieves high training efficiency and stability, with strong performance across language, reasoning, math, and coding tasks.
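
To make the total-vs-active distinction concrete: in an MoE layer, a router selects a few experts per token and only those experts run, so per-token compute tracks the active parameters (37B) rather than the total (671B). Below is a toy top-k routing layer in NumPy; it is a minimal sketch of the mechanism, not DeepSeek's actual architecture, and all sizes are illustrative.

```python
# Toy top-k Mixture-of-Experts routing in plain NumPy: a minimal sketch of the
# idea behind "671B total / 37B active", not DeepSeek's actual architecture.
import numpy as np

rng = np.random.default_rng(0)
d_model, n_experts, top_k = 16, 8, 2  # tiny illustrative sizes

# Each "expert" is a simple feed-forward weight matrix.
experts = [rng.standard_normal((d_model, d_model)) * 0.1 for _ in range(n_experts)]
router = rng.standard_normal((d_model, n_experts)) * 0.1  # routing projection

def moe_forward(x: np.ndarray) -> np.ndarray:
    """Route each token to its top-k experts; only those experts run."""
    logits = x @ router                            # (tokens, n_experts)
    top = np.argsort(logits, axis=-1)[:, -top_k:]  # indices of chosen experts
    # Softmax over only the selected experts' logits.
    sel = np.take_along_axis(logits, top, axis=-1)
    gates = np.exp(sel - sel.max(axis=-1, keepdims=True))
    gates /= gates.sum(axis=-1, keepdims=True)

    out = np.zeros_like(x)
    for t in range(x.shape[0]):                    # per token
        for k in range(top_k):
            out[t] += gates[t, k] * (x[t] @ experts[top[t, k]])
    return out

tokens = rng.standard_normal((4, d_model))
y = moe_forward(tokens)
print(f"output shape: {y.shape}, "
      f"active experts per token: {top_k}/{n_experts} "
      f"({top_k / n_experts:.0%} of expert parameters)")
```

At DeepSeek-V3.1's scale the ratio is 37B/671B ≈ 5.5% of parameters exercised per token, which is why per-token compute stays far below that of a dense 671B model.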

Specifications

Provider: DeepSeek
Context Length: 163,840 tokens
Input Types: text
Output Types: text
Category: DeepSeek
Added: 8/20/2025
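
Since the context window is counted in tokens, not characters, it can help to measure prompt length before sending a request. Here is a minimal sketch using a Hugging Face tokenizer; the repo id deepseek-ai/DeepSeek-V3.1-Base is an assumption for illustration, and any tokenizer matching the model would work the same way.

```python
# Sketch: count tokens to stay inside the listed 163,840-token context window.
# The Hugging Face repo id below is an assumption for illustration.
from transformers import AutoTokenizer

CONTEXT_LIMIT = 163_840

tokenizer = AutoTokenizer.from_pretrained("deepseek-ai/DeepSeek-V3.1-Base")

def fits_in_context(prompt: str, max_new_tokens: int = 0) -> bool:
    """True if the prompt plus reserved output tokens fits in the window."""
    n_tokens = len(tokenizer.encode(prompt))
    return n_tokens + max_new_tokens <= CONTEXT_LIMIT

print(fits_in_context("English: Good morning.\nFrench:", max_new_tokens=32))
```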
