Topic

#gpt-5-5

15 articles tagged gpt-5-5.

Cross-cutting reads on this topic

On June 9, 2026, ChatGPT's GPT-5.5 personalization reached Free and Go tiers, drawing on past chats, files, and Gmail. What agencies must do about data policy.
#ChatGPT #GPT-5.5 (+6 more)
2026-06-11
Claude Fable 5 leads the benchmarks; GPT-5.5 costs half as much and owns Codex. We compare coding, knowledge work, long context, and cost to find the fit.
#claude-fable-5 #gpt-5-5 (+6 more)
2026-06-09
MiniMax M3 lands at 5-17x lower cost, but Opus 4.8 leads SWE-bench Pro and GPT-5.5 wins Terminal-Bench. A full three-way agentic coding routing matrix.
#minimax-m3 #claude-opus-4-8 (+6 more)
2026-06-03
We compare Claude Opus 4.8 and GPT-5.5 on coding, agents, reasoning, and real cost — including where GPT-5.5 still wins and which model fits which job.
#claude-opus-4-8 #gpt-5-5 (+6 more)
2026-05-28
Agentic coding head-to-head: Gemini 3.5 Flash vs GPT-5.5 vs Opus 4.7. MCP Atlas, SWE-Bench Pro, Terminal-Bench, plus Antigravity 2.0 launch context.
#gemini-3-5-flash #gpt-5-5 (+8 more)
2026-05-19
Migrate GPT-5.2 to 5.5 — reasoning-effort behavior change, tool-call schema diff, cost-curve impact, structured outputs, and a phased rollout plan.
#gpt-5-5 #gpt-5-2 (+7 more)
2026-05-05
MoE choices powering 2026 frontier models compared — total vs active params, routing strategies, sparsity ratios, and the downstream cost implications.
#mixture-of-experts #moe-architecture (+8 more)
2026-04-24
Cross-modal benchmark scores — image understanding, video, OCR, ASR, code-with-vision — across GPT-5.5, Gemini 3, Claude 4.7, Qwen 3.5 Omni. 80+ data cells.
#multimodal-ai #vision-language-models (+8 more)
2026-04-24
Updated NIAH-2 results across 1M-context models — single-needle, multi-needle, and reasoning-over-context. Where models silently fail above 200K tokens.
#long-context #needle-in-haystack (+8 more)
2026-04-24
Head-to-head: GPT-5.5 and Claude Opus 4.7 on agentic coding, computer use, 1M context, pricing, and the right model for each production workload.
#gpt-5-5 #claude-opus-4-7 (+8 more)
2026-04-23
OpenAI's GPT-5.5 ships April 23, 2026, with 1M context, Thinking and Pro variants, 82.7% on Terminal-Bench, and the same latency as GPT-5.4. Pricing inside.
#gpt-5-5 #gpt-5-5-pro (+8 more)
2026-04-23
We measured low/medium/high reasoning effort across 5 frontier models on math, code, and analysis. Quality lift, latency tax, and cost-per-correct-answer data.
#reasoning-effort #ai-benchmarks (+8 more)
2026-04-23
Cross-model hallucination rates on factual recall, citation accuracy, and code reference. 5,000 prompts tested across 5 frontier models with confidence bands.
#ai-hallucination #ai-benchmarks (+8 more)
2026-04-23
MCP tool-call success across 12 task types — search, file ops, data, calendar, email. Pass-rate, retry-rate, and cost-to-completion for 5 frontier AI models.
#tool-use #mcp (+8 more)
2026-04-23
Preview of Q2 2026 AI model releases. DeepSeek V4 at ~1T parameters, GPT-5.5 Spud with pretraining done, and Grok 5 expected by mid-2026. Timeline and specs.
#deepseek-v4 #gpt-5-5 (+5 more)
2026-04-03