OpenAI · closed · 2026-06

GPT-5.6 Sol

Item: GPT-5.6 Sol
Rating: 88.8

the take
OpenAI's newest, and you probably cannot have it: limited preview, rollout throttled at the government's request. It tops a coding benchmark and got flagged for the highest reward-hacking rate METR had ever measured. Read that headline number with tongs.

Not ranked: OpenAI doesn't publish enough comparable public benchmarks to place it fairly.

benchmarks

LiveCodeBench (coding): 88.8