OpenAI changed the default model behind ChatGPT on 5 May 2026, replacing GPT-5.3 Instant with GPT-5.5 Instant across free and paid tiers. The company's framing was unusual for an AI lab: rather than emphasising new reasoning capability, the announcement focused on faster responses and fewer hallucinations in high-stakes domains — explicitly naming law, medicine, and finance. The change affects hundreds of millions of daily users.
What changed for users
For the average ChatGPT user the change is invisible: same UI, same conversation flow, same product. What shifts is latency and reliability under load. GPT-5.5 Instant is OpenAI's first publicly-defaulted Instant variant tuned more for confidence calibration than for headline benchmark performance — the company has acknowledged for a year that frontier-scale reasoning gains have plateaued and that operational quality (response time, refusal rate, source-grounded answers) is where users now feel the difference.
According to WhatLLM's May 2026 model survey, GPT-5.5 scored 60.24 on the Intelligence Index at high-effort reasoning — the first model to break the 57-point ceiling that had held since April. The Instant variant trades some of that reasoning for response speed, the trade-off OpenAI has settled on as the right default.
The "fewer hallucinations" claim
OpenAI's specific framing — "fewer hallucinations in law, medicine, finance" — is targeted at the three verticals where its enterprise customers face the most pushback from professional regulators. The American Bar Association and the World Health Organisation have both published guidance in 2025–2026 constraining how LLMs can be cited or used in regulated workflows. A measurable drop in hallucination rate inside those verticals is the kind of metric OpenAI's enterprise sales team can put in a procurement document.
The catch: OpenAI did not publish specific hallucination-rate benchmarks for the new default, and as Air Street Press noted in its May 2026 state-of-AI summary, vendor-published behaviour claims await independent verification through structured evaluations like SimpleQA-Hard or HELM-Lite.
The wider May 2026 pattern
OpenAI is not alone. Google made Gemini 3.1 Flash Lite a new gateway default around 8 May, mirroring the same speed-over-scale philosophy. After April's frontier sprint — when GPT-5.5, DeepSeek V4, Kimi K2.6, and Claude Opus 4.7 all shipped in a single window — May has been a month of operational tuning rather than model races. The labs that didn't ship in April (Anthropic past Opus 4.7, Mistral, Qwen Max) are reportedly cooking new releases; the labs that did ship are optimising for cost, latency, and customer retention.
For developers building on top of ChatGPT's API, the practical implication is that the same prompt may now return slightly different output than it did before 5 May. Applications that rely on output stability should evaluate whether their pinned model version is GPT-5.5 Instant or an earlier Instant build. LLM-Stats maintains a live tracker of model swaps for production teams who need to know.
Sources and cross-checks: Primary: WhatLLM — New AI Models May 2026. Corroborated against: Air Street Press — State of AI: May 2026 and LLM-Stats AI Updates Today. 60.24 Intelligence Index score and default-swap date verified across all three sources 18 May 2026.