DeepSeek
21 variants tracked. Compare specs, pricing, and capabilities side by side.
DeepSeek: DeepSeek V3
DeepSeek
DeepSeek-V3 is the latest model from the DeepSeek team, building upon the instruction following and coding abilities of the previous versions. Pre-trained on...
DeepSeek: DeepSeek V3 0324
DeepSeek
DeepSeek V3, a 685B-parameter, mixture-of-experts model, is the latest iteration of the flagship chat model family from the DeepSeek team. It succeeds the [D...
DeepSeek: DeepSeek V3.1
DeepSeek
DeepSeek-V3.1 is a large hybrid reasoning model (671B parameters, 37B active) that supports both thinking and non-thinking modes via prompt templates. It ext...
DeepSeek: DeepSeek V3.1 Terminus
DeepSeek
DeepSeek-V3.1 Terminus is an update to [DeepSeek V3.1](/deepseek/deepseek-chat-v3.1) that maintains the model's original capabilities while addressing issues...
DeepSeek: DeepSeek V3.2
DeepSeek
DeepSeek-V3.2 is a large language model designed to harmonize high computational efficiency with strong reasoning and agentic tool-use performance. It introd...
DeepSeek: DeepSeek V3.2 Exp
DeepSeek
DeepSeek-V3.2-Exp is an experimental large language model released by DeepSeek as an intermediate step between V3.1 and future architectures. It introduces D...
DeepSeek: DeepSeek V3.2 Speciale
DeepSeek
DeepSeek-V3.2-Speciale is a high-compute variant of DeepSeek-V3.2 optimized for maximum reasoning and agentic performance. It builds on DeepSeek Sparse Atten...
DeepSeek: R1
DeepSeek
DeepSeek R1 is here: Performance on par with [OpenAI o1](/openai/o1), but open-sourced and with fully open reasoning tokens. It's 671B parameters in size, wi...
DeepSeek: R1 0528
DeepSeek
May 28th update to the [original DeepSeek R1](/deepseek/deepseek-r1) Performance on par with [OpenAI o1](/openai/o1), but open-sourced and with fully open re...
DeepSeek: R1 Distill Llama 70B
DeepSeek · 70B
DeepSeek R1 Distill Llama 70B is a distilled large language model based on [Llama-3.3-70B-Instruct](/meta-llama/llama-3.3-70b-instruct), using outputs from [...
DeepSeek: R1 Distill Qwen 32B
DeepSeek · 32B
DeepSeek R1 Distill Qwen 32B is a distilled large language model based on [Qwen 2.5 32B](https://huggingface.co/Qwen/Qwen2.5-32B), using outputs from [DeepSe...
deepseek-ai/DeepSeek-Coder-V2-Lite-Instruct
deepseek-ai/DeepSeek-R1-0528-Qwen3-8B
8B
deepseek-ai/DeepSeek-R1-Distill-Llama-8B
8B
deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B
1.5B
deepseek-ai/DeepSeek-R1-Distill-Qwen-14B
14B
deepseek-ai/DeepSeek-R1-Distill-Qwen-7B
7B
deepseek-ai/DeepSeek-V2-Lite
deepseek-ai/DeepSeek-V2-Lite-Chat
deepseek-ai/deepseek-coder-6.7b-instruct
6.7B
deepseek-ai/deepseek-llm-7b-chat
7B
Track the full DeepSeek family in AIniverse
Side-by-side comparisons, real ratings, get notified when a new version drops.
Open in App