OpenAI Undisclosed proprietary text-generation

GPT-5.4

Released March 5, 2026

Context Window 1.1M tokens
≈ 788 pages of text

Pricing

Input $2.50/M per million tokens
Output $15.00/M per million tokens

About

GPT-5.4 launched today — March 5, 2026. It is OpenAI’s most capable model, the first with native computer-use capabilities, and the new default across ChatGPT, the API, and Codex. It replaces GPT-5.2 Thinking and GPT-5.3-Codex in a single unified model.

What is new

GPT-5.4 is not an incremental update. It introduces three capabilities that no previous GPT model had:

Native computer use. GPT-5.4 can operate software through screenshots and keyboard/mouse commands — browsing websites, filling forms, navigating desktop environments. On OSWorld-Verified (desktop navigation), it scores 75.0%, surpassing human performance at 72.4%. This is not a plugin or tool — it is built into the model.

Tool search. Previous models loaded all tool definitions into context upfront. GPT-5.4 receives a lightweight index and loads only the definitions it needs at runtime. On Scale’s MCP Atlas benchmark with 36 MCP servers enabled, this reduced token usage by 47% with no accuracy loss. This matters for anyone building agents with large tool ecosystems.

1M token context. The context window extends to 1.05M tokens (up from 272K on GPT-5.2). Prompts exceeding 272K tokens are priced at 2x input / 1.5x output. This enables analysis of entire codebases, long document collections, or extended agent trajectories in a single request.

Native compaction. GPT-5.4 is the first model trained to support compaction — summarizing earlier context to extend effective agent trajectories without losing key information.

Benchmarks

OpenAI published extensive benchmarks comparing GPT-5.4 against GPT-5.3-Codex and GPT-5.2:

Professional work

BenchmarkGPT-5.4GPT-5.3-CodexGPT-5.2
GDPval (matches/exceeds professionals)83.0%70.9%70.9%
Investment Banking Modeling87.3%79.3%68.4%
OfficeQA68.1%65.1%63.1%

GDPval tests agent performance across 44 occupations — sales presentations, accounting spreadsheets, urgent care schedules, manufacturing diagrams. GPT-5.4 matches or exceeds industry professionals in 83% of comparisons.

Coding

BenchmarkGPT-5.4GPT-5.3-CodexGPT-5.2
SWE-Bench Pro (Public)57.7%56.8%55.6%
Terminal-Bench 2.075.1%77.3%62.2%

GPT-5.4 matches GPT-5.3-Codex on coding while being lower latency. Codex /fast mode delivers 1.5x faster token velocity.

Computer use and vision

BenchmarkGPT-5.4GPT-5.2Human
OSWorld-Verified75.0%47.3%72.4%
WebArena-Verified67.3%65.4%
Online-Mind2Web92.8%
MMMU-Pro (no tools)81.2%79.5%

The jump from 47.3% to 75.0% on OSWorld is the headline number — GPT-5.4 now outperforms humans on desktop navigation tasks.

Tool use

BenchmarkGPT-5.4GPT-5.2
BrowseComp82.7%65.8%
MCP Atlas67.2%60.6%
Toolathlon54.6%45.7%

Abstract reasoning

BenchmarkGPT-5.4GPT-5.2
ARC-AGI-1 (Verified)93.7%86.2%
ARC-AGI-2 (Verified)73.3%52.9%
Frontier Science Research33.0%25.2%
FrontierMath Tier 1–347.6%40.7%
Humanity’s Last Exam (with tools)52.1%45.5%

ARC-AGI-2 going from 52.9% to 73.3% is a 20-point jump — the largest improvement in abstract reasoning between any two consecutive GPT releases.

Factuality

GPT-5.4 is OpenAI’s most factual model:

  • Individual claims are 33% less likely to be false vs GPT-5.2
  • Full responses are 18% less likely to contain any errors

Pricing

ModelInputCached InputOutput
GPT-5.4$2.50/M$0.25/M$15.00/M
GPT-5.4 Pro$30.00/M$180.00/M
GPT-5.2$1.75/M$0.175/M$14.00/M

GPT-5.4 is priced ~43% higher on input than GPT-5.2, but OpenAI claims greater token efficiency offsets this — the model uses fewer tokens to solve equivalent problems.

Batch and Flex pricing available at half rate. Priority processing at 2x rate. Context >272K tokens priced at 2x input / 1.5x output.

API access

from openai import OpenAI
client = OpenAI()

response = client.responses.create(
    model="gpt-5.4",
    input="Analyze this codebase for security vulnerabilities.",
    reasoning={"effort": "medium"},
    text={"verbosity": "medium"}
)

Reasoning effort levels: none (default), low, medium, high, xhigh Verbosity levels: low, medium (default), high

Computer use requires the updated computer tool in the Responses API. Tool search uses tool_search capability with deferred loading.

Variants

VariantBest forContext
gpt-5.4General-purpose: reasoning, coding, agentic tasks1.05M
gpt-5.4-proMaximum performance on complex problems1.05M
gpt-5-miniCost-optimized reasoning and chat
gpt-5-nanoHigh-throughput classification and instruction following

Steerability

New in GPT-5.4 Thinking (ChatGPT): the model shows an upfront preamble of its plan before executing, so users can adjust course mid-response. This reduces back-and-forth — you see where it is headed and can redirect before it finishes.

Safety

OpenAI rates GPT-5.4 as High cyber capability under their Preparedness Framework — the first general-purpose model with this classification. Deployed with:

  • Expanded cyber safety stack with monitoring systems
  • Trusted access controls
  • Asynchronous blocking for higher-risk requests on Zero Data Retention surfaces
  • Chain-of-Thought controllability testing (model cannot easily hide its reasoning)

Timeline

GPT-5.2 Thinking remains available in ChatGPT for 3 months under Legacy Models. Retirement date: June 5, 2026.

Availability

  • ChatGPT: Rolling out now to Plus, Team, Pro users as GPT-5.4 Thinking
  • Enterprise/Edu: Available via admin early access settings
  • API: Available now as gpt-5.4 and gpt-5.4-pro
  • Codex: Available with experimental 1M context support
  • OpenRouter: Available as openai/gpt-5.4

Open-source alternatives

GPT-5.4 is proprietary with no public weights. For self-hosted alternatives:

  • Qwen3.5-397B-A17B (Apache 2.0) — 403B MoE, 17B active, open-weight frontier reasoning
  • DeepSeek V3.2 (MIT) — 685B MoE, strong reasoning + chat at 1/40th the price
  • GLM 5 (MIT) — 754B params, Z.ai’s open-weight flagship
  • LiquidAI LFM2-24B-A2B — novel architecture, efficient inference on consumer hardware

References

  • 🤗 HuggingFace huggingface.co/openai/GPT-5.4
  • 📝 Blog openai.com/index/introducing-gpt-5-4/
  • 📖 Docs developers.openai.com/api/docs/guides/latest-model
  • 🔍 Grokipedia grokipedia.com/page/GPT-5