🔵 DeepSeek AI — เจาะลึกทุกโมเดล

บริษัท AI จากจีนที่สั่นสะเทือนวงการ | อัปเดต: 25 พ.ค. 2569

🔵 DeepSeek 深度求索

杭州深度求索人工智能基础技术研究有限公司 (Hangzhou DeepSeek AI Fundamental Technology Research)

ก่อตั้ง

2023

สำนักงาน

หางโจว, จีน

ผู้ก่อตั้ง

High-Flyer Capital

โมเดลหลัก

V3, V4, R1, Coder

สถาปัตยกรรม

MoE (Mixture-of-Experts)

Open-Source

✅ ทุกโมเดลหลัก

DeepSeek คือบริษัท AI จากหางโจว จีน ก่อตั้งโดย High-Flyer Capital (กองทุน量化เทรดดิ้ง) — ใช้ความเชี่ยวชาญด้าน GPU clusters และ RL สร้างโมเดลที่แข่งขันกับ OpenAI/Google ได้ในราคาที่ต่ำกว่ามาก โดดเด่นด้วยสถาปัตยกรรม Mixture-of-Experts (MoE) ที่ activate เฉพาะ subset ของ parameters — ทำให้ training/inference ถูกลงอย่างมาก โมเดลทุกตัวเปิด open-source บน HuggingFace

🤖 โมเดลทั้งหมด — ราคา & Specs

โมเดล	ประเภท	Parameters	Context	Input $/1M	Output $/1M	จุดเด่น
DeepSeek V4 Pro Flagship 2026	Pro	1.6T total 49B active	1M ✅	$0.000435	$0.00087	MoE 1.6T, 1M context, reasoning
DeepSeek V4 Flash Latest 2026	Chat	284B total 13B active	1M ✅	$0.00010	$0.00020	Ultra cheap, efficiency
DeepSeek V4 Flash FREE Free tier	Free	284B total 13B active	1M ✅	$0	$0	Free เต็มรูปแบบ!
DeepSeek V3.2 Speciale High-compute reasoning	Reasoning	671B 37B active	163K	$0.000287	$0.000431	DSA, max reasoning
DeepSeek V3.2 Current stable	Chat	671B 37B active	131K	$0.000252	$0.000378	Efficient, reasoning+agentic
DeepSeek V3.1 Terminus Updated V3.1	Chat	671B 37B active	163K	$0.00027	$0.00095	Language consistency fix
DeepSeek V3.1 Hybrid reasoning	Reasoning	671B 37B active	163K	$0.00021	$0.00079	Thinking + non-thinking mode
DeepSeek V3 0324 MoE flagship (0324)	Chat	685B MoE	163K	$0.00020	$0.00077	Open-source, coding, reasoning
DeepSeek R1 0528 Updated R1 (May 28)	Reasoning	671B 37B active	163K	$0.00050	$0.00215	reasoning, open reasoning tokens
DeepSeek R1 Original reasoning	Reasoning	671B 37B active	163K	$0.00070	$0.00250	= OpenAI o1, open-source
DeepSeek R1 Distill Qwen 32B Distilled • open-weight	Distill	32B	128K	$0.00029	$0.00029	เล็ก, ถูก, รองรับ local
DeepSeek R1 Distill Llama 70B Distilled • open-weight	Distill	70B	131K	$0.00070	$0.00080	Balanced size/performance
DeepSeek V3.1 Nex N1 Agent-centric	Agent	—	131K	$0.000135	$0.00050	Tool use, agent autonomy

💰 เปรียบเทียบราคา — DeepSeek vs ค่ายอื่น

DeepSeek V4 Flash

$0.10

per 1M input

$0.20

per 1M output

DeepSeek R1

$0.70

per 1M input

$2.50

per 1M output

Claude Sonnet 4

$3.00

per 1M input

$15.00

per 1M output

GPT-4.5

$15.00

per 1M input

$75.00

per 1M output

DeepSeek V4 Flash ถูกกว่า GPT-4.5 ถึง 150,000x (input) และ 375,000x (output)

⚙️ สถาปัตยกรรม — Mixture-of-Experts (MoE)

🔬 MoE Core Concept

Total Parameters 671B–1.6T

Active Parameters 37B–49B

Expert Count 256

Active per token 8 experts

Sparsity ~94%

Training Tokens ~15T

🏗️ V4 Architecture

V4 Total 1.6 Trillion

V4 Active 49 Billion

Context 1M tokens

Attention DeepSeek Sparse Attention

Multi-head Latent Yes (MLA)

FP8 Training Yes

🧠 V3 Architecture

V3 Total 685 Billion

V3 Active 37 Billion

Context 163K tokens

Auxiliary Loss Multi-factor

Load Balance Auxiliary-loss-free

Pretrain Tokens 14.8T

📊 Why MoE?

Cost Reduction ~90%

Same Quality ✅ vs Dense

Scalability → 1T+ params

Inference Speed → Fast routing

Training Efficiency → lower GPU

Open-Source ✅ Weights

📊 Benchmark Performance

🧮 Reasoning (AIME/MATH)

DeepSeek R1 87.5%

OpenAI o1 87.3% ≈ เท่ากัน

GPT-4.5 ~75% R1 ดีกว่า

Claude Sonnet 4 ~78% R1 ดีกว่า

Gemini 2.0 Ultra ~86% ใกล้เคียง

💻 Coding (LiveCodeBench)

DeepSeek V3.2 High

GPT-4.5 Very High

Claude Opus 4 Very High

DeepSeek Coder V2 SOTA

Grok 3 High

🌐 Multilingual

DeepSeek V3 Excellent

English ⭐⭐⭐⭐

Chinese ⭐⭐⭐⭐⭐

Thai ⭐⭐⭐

Coding ⭐⭐⭐⭐

📈 Price/Performance

DeepSeek V4 Flash 💚💚💚💚💚

DeepSeek R1 💚💚💚💚

Qwen 3.7 Max 💚💚💚

Claude Sonnet 4 💛💛💛

GPT-4.5 💛

🔓 Open-Source Models — ดาวน์โหลดฟรี!

🦙

DeepSeek V3 685B

HuggingFace • ~700GB

🦙

DeepSeek R1 671B

HuggingFace • ~670GB

🔧

R1 Distill Qwen 32B

~64GB • Local รองรับ

🔧

R1 Distill Llama 70B

~140GB • Local รองรับ

🔵

DeepSeek Coder V2

HuggingFace • open

🔵

DeepSeek Math

HuggingFace • open

🔗 HuggingFace: huggingface.co/deepseek-ai | ดาวน์โหลด weights ฟรี ใช้ local ได้!

✅ จุดเด่น

ราคาถูกมาก — ถูกกว่าค่ายใหญ่ 100-1000x

Open-source weights — ดาวน์โหลดใช้เองได้

MoE architecture — efficient มาก

R1 reasoning เทียบ o1 ได้เลย

Distilled models (32B, 70B) รองรับ local

Context 1M tokens (V4)

แข่งขันได้ใน coding + math

⚠️ ข้อจำกัด

ไม่มี multimodal ด้วย images/video ใน V3/R1

Context V3/V4 เล็กกว่า Gemini (1M vs 1M)

ไม่มี Voice/Speech built-in

จีน company — data privacy concerns

Web search ไม่มี (ต้องใช้ tools)

Output limit 16K (V3) — น้อยกว่า M2.7

📰 Timeline ความเป็นมา & Updates

24 พ.ค. 69

DeepSeek V4 Pro เปิดตัว — MoE 1.6T params, 49B active, 1M context, $0.000435/M input

2026

V4 Flash เปิดตัว — 284B total, 13B active, ultra cheap, มี free tier

28 พ.ค. 68

DeepSeek R1 0528 — update ใหม่, performance สูงขึ้น, reasoning tokens เปิดเต็ม

เม.ย. 68

V3.1 Terminus + V3.2 — DSA (DeepSeek Sparse Attention), language consistency fixed

ธ.ค. 67

DeepSeek V3 0324 — MoE 685B, 163K context, $0.20/$0.77 per 1M tokens, open-source

ม.ค. 68

DeepSeek R1 — reasoning model แข่ง o1, open-source, open weights ทั้ง 671B

2024

V2, Coder V2, Math — open-source coding + math models

2023

ก่อตั้ง — โดย High-Flyer Capital (กองทุน量化), หางโจว

💡 สรุป — DeepSeek คุ้มค่าแค่ไหน?

💰 Budget Users: ใช้ V4 Flash ฟรี หรือ $0.10/M input — ถูกที่สุดในโลก AI!
🧠 Reasoning/Math: DeepSeek R1 เทียบ o1 ได้ แต่ถูกกว่ามาก
💻 Coding: V3.2 + Coder V2 แข่งขันได้กับ GPT-4
🔓 Open-Source: ดาวน์โหลด V3/R1 มาใช้ local ฟรี
📝 Long Doc: V4 1M context — อ่านเอกสารยาวได้สบาย

⚠️ ถ้าต้องการ: Vision/Multimodal → ใช้ GPT-4o หรือ Gemini
⚠️ ถ้าต้องการ: Voice → ใช้ GPT-4o หรือ Gemini
⚠️ ถ้าต้องการ: Enterprise support → Claude หรือ GPT
⚠️ ถ้าต้องการ: Web search → ใช้ Perplexity หรือ Grok
✅ แต่ถ้า: ราคาถูก + reasoning ดี + open-source → DeepSeek สุดคุ้ม!

📝 หมายเหตุ:
• ราคาจาก api.deepseek.com / OpenRouter (25 พ.ค. 2569) — อาจเปลี่ยนแปลง
• V4 Pro $0.000435/M input คือ $0.435/1M — ถูกมากๆ
• DeepSeek เป็นบริษัทจีน — พิจารณา data privacy ก่อนใช้งาน enterprise
• คำว่า "DeepSeek 4" อาจหมายถึง V4 ที่ยังไม่ official name — current latest คือ V4 Pro / V4 Flash
• Distilled models (32B, 70B) รองรับ local deployment บน consumer GPU