gpt-5.4-mini Benchmark & Insights

OpenAI OpenAI API

Updated Jun 6, 2026 All models

Sample size

63 runs

in window

Accuracy

78.5%

consensus match · 65d

Confidence

95%

over 65 runs

Window end

Jun 6, 2026

most recent run

Input price

$0.75/MTok

prompt tokens

Output price

$4.50/MTok

completion tokens

Model insights

01 All 14 misses overcall, with 8 false "unsafe" flips — markedly worse than its big sibling gpt-5.4 on the same days.
02 The older gpt-5-mini scores higher at a third of the price, so this model has no clear role here.

Recent forecasts

Date

Conf.

Risk

Safe