gpt-5.4-mini Benchmark & Insights
OpenAI · OpenAI API
Updated Jun 6, 2026
Sample size: 63 runs in window
Accuracy: 78.5% (consensus match · 65d)
Confidence: 95% over 65 runs
Window end: Jun 6, 2026 (most recent run)
Input price: $0.75/MTok (prompt tokens)
Output price: $4.50/MTok (completion tokens)
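As a rough illustration of what the two rates mean per request, the sketch below prices a single call. The helper name `request_cost_usd` and the token counts are hypothetical; only the $0.75 and $4.50 per-million-token rates come from the card.

```python
# Rates from the card, in USD per million tokens (MTok).
INPUT_USD_PER_MTOK = 0.75
OUTPUT_USD_PER_MTOK = 4.50


def request_cost_usd(prompt_tokens: int, completion_tokens: int) -> float:
    """Estimate the cost of one call (hypothetical helper, not an OpenAI API)."""
    input_cost = prompt_tokens / 1_000_000 * INPUT_USD_PER_MTOK
    output_cost = completion_tokens / 1_000_000 * OUTPUT_USD_PER_MTOK
    return input_cost + output_cost


# Illustrative token counts only; real usage varies by prompt.
print(f"${request_cost_usd(2_000, 500):.6f}")
```

Because output tokens cost six times as much as input tokens at these rates, long completions tend to dominate the bill even when prompts are much larger.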
Model insights
- 01 All 14 misses are overcalls, 8 of them false "unsafe" flips. That is markedly worse than its larger sibling gpt-5.4 on the same days.
- 02 The older gpt-5-mini scores higher at a third of the price, so this model has no clear role here.
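The headline accuracy can be cross-checked against the miss count in insight 01. This is a minimal sketch, assuming the 14 misses are counted over the 65 runs quoted under Confidence. That is an assumption: the Sample size figure says 63 runs, and the page does not state which denominator applies. The helper name `accuracy_pct` is hypothetical.

```python
def accuracy_pct(misses: int, runs: int) -> float:
    """Percentage of runs that matched consensus (hypothetical helper)."""
    if runs <= 0:
        raise ValueError("runs must be positive")
    return 100 * (runs - misses) / runs


MISSES = 14  # from insight 01

# Try both run counts that appear on the card.
for runs in (65, 63):
    print(f"{runs} runs: {accuracy_pct(MISSES, runs):.1f}%")
```

Under that assumption, 65 runs reproduces the reported 78.5%, while 63 runs would give 77.8%.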