Tagged "deepseek-v4"
Cross-cutting reads on this topic
DeepSeek abandons its no-outside-capital stance in a ~$7.4B maiden round led by Tencent and CATL, valuing it near $59B and reshaping open-weight economics.
#deepseek#ai-funding+6 more
2026-06-03
Read Article
H1 2026 open-weight retrospective — DeepSeek V4, Qwen 3, Llama 4 release cadence, benchmark gains vs frontier, adoption patterns, and four trend lines.
#open-weight-retrospective#h1-2026+7 more
2026-05-11
Read Article
Migrate DeepSeek V3.2 to V4 across open-weight stacks — three reasoning modes, tokenizer change, HCA/CSA attention deltas, KV-cache reduction.
#deepseek-v4#migration-playbook+7 more
2026-05-05
Read Article
DeepSeek-V4 ships April 24, 2026 as open-weight MoE: Pro (1.6T/49B active) and Flash (284B/13B), 1M context, 27% FLOPs and 10% KV cache vs V3.2.
#deepseek-v4#deepseek-v4-pro+6 more
2026-04-24
Read Article
MoE choices powering 2026 frontier models compared — total vs active params, routing strategies, sparsity ratios, and the downstream cost implications.
#mixture-of-experts#moe-architecture+8 more
2026-04-24
Read Article
Updated NIAH-2 results across 1M-context models — single-needle, multi-needle, and reasoning-over-context. Where models silently fail above 200K tokens.
#long-context#needle-in-haystack+8 more
2026-04-24
Read Article
We measured low/medium/high reasoning effort across 5 frontier models on math, code, and analysis. Quality lift, latency tax, and cost-per-correct-answer data.
#reasoning-effort#ai-benchmarks+8 more
2026-04-23
Read Article
Preview of Q2 2026 AI model releases. DeepSeek V4 at ~1T parameters, GPT-5.5 Spud with pretraining done, and Grok 5 expected by mid-2026. Timeline and specs.
#deepseek-v4#gpt-5-5+5 more
2026-04-03
Read Article
DeepSeek V4 launches with approximately 1 trillion parameters, 1M context window, and Huawei Ascend optimization. China's frontier multimodal model analysis.
#deepseek-v4#open-source-ai+4 more
2026-03-04
Read Article
DeepSeek V4 brings 1 trillion parameters, 1M token context, and Engram O(1) memory. Architecture details, leaked benchmarks, and what it means for developers.
#DeepSeek#DeepSeek V4+6 more
2026-02-14
Read Article