Topic

#open-source-llm

10 articles tagged open-source-llm.


Cross-cutting reads on this topic


AI Development

Kimi K2.7-Code Release: Benchmarks, Kimi Code & Plans

Kimi K2.7-Code is Moonshot's new open-source coding model: +21.8% on Kimi Code Bench v2, 30% fewer reasoning tokens, plus Kimi Code CLI plans from $19/mo.

#kimi-k2-7-code #moonshot-ai (+7 more)

2026-06-12


AI Development

DeepSeek V4 Launches: 1.6T MoE, 1M Context, 10% KV

DeepSeek-V4 shipped on April 24, 2026 as an open-weight MoE in two sizes: Pro (1.6T total/49B active) and Flash (284B total/13B active), with 1M context at 27% of the FLOPs and 10% of the KV cache of V3.2.

#deepseek-v4 #deepseek-v4-pro (+6 more)

2026-04-24


AI Development

Kimi K2.6: 300-Agent Swarms + Motion Frontend Guide

Moonshot's Kimi K2.6 ships 300-agent swarms, 12-hour coding runs, WebGL hero sections, and open-source SOTA on SWE-Bench Pro. Agency playbook and benchmarks.

#kimi-k2-6 #moonshot-ai (+8 more)

2026-04-20


AI Development

Qwen 3.5 Medium Models: Benchmarks, Pricing, and Guide

Qwen 3.5 medium series: Flash, 35B-A3B, 122B-A10B, and 27B. Benchmarks vs GPT-5 mini and Claude Sonnet 4.5, pricing from $0.10/M tokens.

#qwen-3-5 #alibaba-ai (+6 more)

2026-02-25


AI Development

Qwen 3.5: 397B MoE Benchmarks, Pricing & Complete Guide

Qwen 3.5-397B scores 83.6 on LiveCodeBench v6 and 91.3 on AIME26 with 17B active MoE params. Benchmarks vs GPT-5.2, Claude, and pricing details.

#Qwen 3.5 #Alibaba (+6 more)

2026-02-16


AI Development

DeepSeek V4: Engram Architecture, 1M Context & Coding Guide

DeepSeek V4 brings 1 trillion parameters, 1M token context, and Engram O(1) memory. Architecture details, leaked benchmarks, and what it means for developers.

#DeepSeek #DeepSeek V4 (+6 more)

2026-02-14


AI Development

DeepSeek R1 vs Qwen 3 vs Mistral Large: LLM Comparison

DeepSeek R1 revolutionized open-source AI with reasoning capabilities. Compare it to Qwen 3's 1T parameters and Mistral Large for enterprise deployment.

#DeepSeek R1 #Qwen 3 (+4 more)

2026-01-06


AI Development

MiniMax M2.1 Guide: Digital Employee for AI Coding

MiniMax M2.1 is a December 2025 model for coding and agentic workflows, listed on OpenRouter, with 10B active parameters and "Digital Employee" positioning.

#MiniMax M2.1 #Open-Source LLM (+3 more)

2025-12-24


Development

Local LLM Deployment: Privacy-First AI Complete Guide

Deploy Llama 3.3, Mistral 3, Qwen 3 locally with Ollama, LM Studio, or vLLM. Hardware requirements, quantization, and enterprise self-hosting patterns.

#Local LLM #Ollama (+5 more)

2025-12-23


AI Development

GLM-4.7 Guide: Z.ai's Open-Source AI Coding Model

GLM-4.7 achieves 73.8% SWE-bench and 87.4% τ²-Bench with Preserved Thinking. Complete developer guide for the $3/month Claude Code alternative.

#GLM-4.7 #Z.ai (+5 more)

2025-12-23
