GPT-5.6 Sol Preview: Benchmarks, Pricing, Access & How It Compares to Claude

Q: Can I use GPT-5.6 Sol right now?

Not broadly. GPT-5.6 launched as a limited preview available only through the API and Codex to a small group of trusted partners whose participation was shared with the U.S. government. OpenAI says general availability in ChatGPT, Codex, and the API is planned within weeks, but no firm date has been announced.

Q: How much does GPT-5.6 cost?

Per OpenAI's announcement, pricing is per million tokens: Sol is $5 input / $30 output, Terra is $2.50 input / $15 output, and Luna is $1 input / $6 output. GPT-5.6 also introduces prompt caching with explicit cache breakpoints and a 30-minute minimum cache life.

Q: Why is GPT-5.6 Sol's release restricted?

OpenAI says it previewed GPT-5.6 Sol's capabilities to the U.S. government ahead of launch and, at the government's request, is starting with a limited preview for trusted partners before wider release, tied to an ongoing cyber Executive Order framework.

On June 26, 2026, OpenAI began a limited preview of the GPT-5.6 series: Sol, its new flagship model, Terra, a balanced everyday model, and Luna, a fast and inexpensive tier. The headline framing is agentic coding and cybersecurity — Sol is positioned as OpenAI's strongest model yet for long-horizon terminal work and defensive security tasks. The catch is availability: this is not a normal launch. Access is currently limited to a small, government-coordinated group of trusted partners, and OpenAI has not announced a firm date for general availability.

This review sticks closely to what OpenAI stated in its official preview post. Where specific benchmark percentages are cited, they come from outlets that reviewed OpenAI's published launch-day chart, since OpenAI's own post text describes results qualitatively ("sets a new state of the art") without printing every figure inline — that distinction is flagged throughout.

What Is GPT-5.6 Sol, Terra, and Luna?

GPT-5.6 introduces a naming split OpenAI hasn't used before: the number identifies the model generation, while Sol, Terra, and Luna are durable capability tiers that can each advance on their own schedule going forward.

Sol — the flagship. Positioned for agentic coding, long-horizon reasoning, biology workflows, and cybersecurity.
Terra — a balanced everyday model. OpenAI says it's competitive with GPT-5.5 while costing about half as much.
Luna — the fastest, cheapest tier, aimed at high-volume, latency-sensitive workloads like classification and chat.

GPT-5.6 also introduces two new reasoning settings: a max effort level that gives Sol more time to reason on hard problems, and an ultra mode that dispatches subagents to parallelize complex, long-running work — similar in spirit to the multi-agent orchestration approaches showing up across the frontier lineup this year, including Claude Opus 4.8's Dynamic Workflows.

Benchmarks: What OpenAI Actually Published

OpenAI's preview post leads with three capability areas: agentic coding, biology, and cybersecurity, with fuller safety and preparedness evaluations reserved for the system card. The one benchmark chart in the launch post itself is Terminal-Bench 2.1, an agentic coding benchmark that tests command-line workflows requiring planning, iteration, and tool coordination — the same benchmark category that drove the biggest single gain in Claude Sonnet 5's launch.

Terminal-Bench 2.1: GPT-5.6 Sol vs the field

Widely reported readings of OpenAI's launch-day chart (June 26, 2026). OpenAI's post states Sol sets a new state of the art on this benchmark; exact figures below come from outlets that reviewed the published chart, not from OpenAI's running text.

GPT-5.6 familyPrior-generation / rival models

GPT-5.6 pricing by tier ($ per million tokens)

Sources: OpenAI, "Previewing GPT-5.6 Sol: a next-generation model" (Jun 26, 2026), cross-referenced against third-party chart readings. GPT-5.6 is in a restricted preview at the time of writing — these figures may be revised when OpenAI publishes a full evaluation suite at general availability.

Benchmark	Sol Ultra	Sol	GPT-5.5	Claude Mythos 5
Terminal-Bench 2.1 (agentic coding)	91.9%	88.8%	88.0%	84.3%

Note: benchmark percentages are third-party readings of OpenAI's launch-day chart, not figures OpenAI printed in its post text.

OpenAI states plainly that Sol "sets a new state of the art" on Terminal-Bench 2.1. The specific percentages most widely reported by outlets that reviewed the chart put base Sol at 88.8% and the higher-compute Sol Ultra mode at 91.9% — ahead of GPT-5.5 and the current Claude Mythos 5 and Fable 5 figures on the same benchmark. Treat the exact decimal gaps between adjacent models with some skepticism: agentic benchmarks like this one are sensitive to harness configuration, retry policy, and random seed, so a gap under one point is closer to a statistical tie than a clear ranking.

Beyond the terminal benchmark, OpenAI reports two other capability claims worth noting:

Biology (GeneBench v1): OpenAI says Sol achieves stronger results than GPT-5.5 on long-horizon genomics and quantitative-biology analysis, while using fewer tokens to get there.
Cybersecurity (ExploitBench): OpenAI says Sol is competitive with Claude Mythos Preview on ExploitBench while using roughly a third of the output tokens — a token-efficiency claim rather than a pure accuracy win. On the independently built ExploitGym benchmark (created by UC Berkeley researchers with OpenAI and other labs), OpenAI reports that Sol, Terra, and Luna all show strong gains in cyber capability as reasoning effort increases.

Importantly, OpenAI states that GPT-5.6 Sol does not cross the Cyber Critical threshold under its Preparedness Framework. In evaluations against Chromium and Firefox, the model identified bugs and exploitation building blocks but did not autonomously produce a full working exploit chain under the tested conditions.

Why Is Access So Restricted?

This is the part of the GPT-5.6 story that's arguably bigger news than the benchmarks. OpenAI says it previewed Sol's capabilities to the U.S. government ahead of launch, and, at the government's request, is starting with a limited preview for a small group of trusted partners — whose participation has reportedly been shared with the government — before wider release. OpenAI frames this as a short-term step tied to an ongoing cyber Executive Order framework, and says it does not believe this kind of government access process should become the long-term default for model releases.

The pattern is notably similar to what's currently happening on the Anthropic side of the market. Our Claude Fable 5 and Mythos 5 breakdown covers a related export-control suspension affecting Anthropic's top-tier models. Between the two labs, cyber-capable frontier models are increasingly shipping with government-coordinated gating rather than a normal public rollout — worth knowing if you're planning around either lab's top tier for the second half of 2026.

Practically, OpenAI describes a layered safeguard stack for the GPT-5.6 family: refusal training built into the model, real-time classifiers that can pause generation for review on higher-risk cyber and biology requests, account-level review across conversations for flagged activity, and differentiated access by risk level. OpenAI also says it dedicated over 700,000 A100-equivalent GPU hours to automated red-teaming aimed at finding universal jailbreaks — attacks that generalize across many prompts rather than one narrow case — ahead of this launch.

Pricing and Access

Model	Input / 1M tokens	Output / 1M tokens
Sol	$5	$30
Terra	$2.50	$15
Luna	$1	$6

GPT-5.6 also introduces more predictable prompt caching: explicit cache breakpoints and a 30-minute minimum cache life. Cache writes are billed at 1.25x the model's uncached input rate, while cache reads keep the usual 90% cached-input discount. OpenAI also confirmed plans to run Sol on Cerebras hardware at up to 750 tokens per second starting in July, though that access will initially be limited to select customers as well.

As of publication, GPT-5.6 models are available only through the API and Codex, and only to the trusted-partner cohort. There is no ChatGPT access yet, and no confirmed general-availability date — OpenAI's own language is "in the coming weeks."

Who Should Actually Pay Attention Right Now?

Most founders and agencies: there's nothing to act on yet beyond awareness. You can't buy access, and OpenAI hasn't confirmed pricing tiers or rate limits for the general release. If your roadmap depends on a specific model choice this quarter, it's safer to plan around models you can actually use today — our top 5 frontier AI models of 2026 roundup only covers models with real public availability.

Security teams and defenders: the ExploitBench and ExploitGym claims are the most relevant signal here. OpenAI is explicitly framing Sol as a defensive tool first — vulnerability research, patch development, and security education — while trying to make offensive misuse harder and more detectable. If you already have Codex or API access as part of the preview cohort, this is the area worth testing first.

Developers evaluating coding agents: Terminal-Bench 2.1 is becoming the benchmark to watch across every lab this year — it's the same category where Claude Sonnet 5 posted its biggest single-generation jump, and where our Opus 4.8 vs GPT-5.5 coding benchmark comparison already tracks a GPT-5.5-era result. Once Sol reaches general availability, it's worth re-running that comparison with actual hands-on numbers rather than vendor charts — and worth checking how it fits alongside IDE-native tools, which our Cursor Composer 2.5 review covers from the tooling side.

Editorial Take

Frequently Asked Questions

What is GPT-5.6 Sol?

GPT-5.6 Sol is OpenAI's newest flagship model, previewed on June 26, 2026 alongside two smaller tiers, Terra and Luna. It focuses on agentic coding, biology workflows, and cybersecurity, and launched with OpenAI's most robust safety stack to date.

Can I use GPT-5.6 Sol right now?

Not broadly. It's in a limited preview available only through the API and Codex to a small group of trusted partners whose participation was shared with the U.S. government. OpenAI says general availability in ChatGPT, Codex, and the API is planned within weeks, with no firm date announced yet.

How much does GPT-5.6 cost?

Per million tokens: Sol is $5 input / $30 output, Terra is $2.50 input / $15 output, and Luna is $1 input / $6 output. GPT-5.6 also introduces prompt caching with explicit cache breakpoints and a 30-minute minimum cache life.

Why is GPT-5.6 Sol's release restricted?

OpenAI says it previewed the model's capabilities to the U.S. government ahead of launch and, at the government's request, is starting with a limited trusted-partner preview before wider release, tied to an ongoing cyber Executive Order framework.

Source: OpenAI, "Previewing GPT-5.6 Sol: a next-generation model" (June 26, 2026). Benchmark percentages not printed directly in OpenAI's post text are attributed to third-party chart readings and flagged as such throughout this article.

Preview — not yet rated

We don't assign a star rating to models we can't independently test, and GPT-5.6 Sol isn't available to us or to the vast majority of readers yet. What's confirmed is real: OpenAI is claiming a genuine step up in agentic coding and cyber capability, backed by a materially heavier safety investment than prior releases. What's not confirmed is everything that depends on hands-on use — real-world reliability, how often the new safeguards interrupt legitimate work, and whether the benchmark gains hold up outside OpenAI's own harness. We'll publish a full review with our own testing once GPT-5.6 reaches general availability.

What Is GPT-5.6 Sol, Terra, and Luna?

Sol — the flagship. Positioned for agentic coding, long-horizon reasoning, biology workflows, and cybersecurity.
Terra — a balanced everyday model. OpenAI says it's competitive with GPT-5.5 while costing about half as much.
Luna — the fastest, cheapest tier, aimed at high-volume, latency-sensitive workloads like classification and chat.

Benchmarks: What OpenAI Actually Published

Terminal-Bench 2.1: GPT-5.6 Sol vs the field

GPT-5.6 familyPrior-generation / rival models

GPT-5.6 pricing by tier ($ per million tokens)

Benchmark	Sol Ultra	Sol	GPT-5.5	Claude Mythos 5
Terminal-Bench 2.1 (agentic coding)	91.9%	88.8%	88.0%	84.3%

Note: benchmark percentages are third-party readings of OpenAI's launch-day chart, not figures OpenAI printed in its post text.

Beyond the terminal benchmark, OpenAI reports two other capability claims worth noting:

Biology (GeneBench v1): OpenAI says Sol achieves stronger results than GPT-5.5 on long-horizon genomics and quantitative-biology analysis, while using fewer tokens to get there.
Cybersecurity (ExploitBench): OpenAI says Sol is competitive with Claude Mythos Preview on ExploitBench while using roughly a third of the output tokens — a token-efficiency claim rather than a pure accuracy win. On the independently built ExploitGym benchmark (created by UC Berkeley researchers with OpenAI and other labs), OpenAI reports that Sol, Terra, and Luna all show strong gains in cyber capability as reasoning effort increases.

Why Is Access So Restricted?

Pricing and Access

Model	Input / 1M tokens	Output / 1M tokens
Sol	$5	$30
Terra	$2.50	$15
Luna	$1	$6

What Is GPT-5.6 Sol, Terra, and Luna?

Benchmarks: What OpenAI Actually Published

Terminal-Bench 2.1: GPT-5.6 Sol vs the field

GPT-5.6 pricing by tier ($ per million tokens)

Why Is Access So Restricted?

Pricing and Access

Who Should Actually Pay Attention Right Now?

Editorial Take