
gemma-4-31b-it Benchmark & Insights

Google Gemini API
Updated Jun 6, 2026
Sample size: 56 runs in window
Accuracy: 87.9% (consensus match, 58d)
Confidence: 100% over 58 runs
Window end: Jun 6, 2026 (most recent run)
Input price: $0.13/MTok (prompt tokens)
Output price: $0.38/MTok (completion tokens)
Model insights
  • 01 A nervous open-weight bargain: all 10 misses overcall, and it flipped to "unsafe" on 5 consensus "safe" days.
  • 02 At $0.13/$0.38 the false alarms may be an acceptable price for a model that never underrates risk.
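To put the listed prices in per-run terms, here is a minimal sketch of the cost arithmetic. The two prices come from the listing above; the function name `run_cost` and the token counts in the example are hypothetical, chosen only for illustration, and are not measurements from these benchmark runs.

```python
# Prices from the listing above, in USD per million tokens (MTok).
INPUT_PRICE_PER_MTOK = 0.13
OUTPUT_PRICE_PER_MTOK = 0.38


def run_cost(prompt_tokens: int, completion_tokens: int) -> float:
    """Return the estimated USD cost of one run at the listed prices."""
    total = (
        prompt_tokens * INPUT_PRICE_PER_MTOK
        + completion_tokens * OUTPUT_PRICE_PER_MTOK
    )
    return total / 1_000_000


# Hypothetical token counts, not taken from the benchmark.
example = run_cost(prompt_tokens=2_000, completion_tokens=500)
print(f"${example:.6f} per run")
```

Under these assumed token counts a single run costs a small fraction of a cent, so the cost of an occasional false alarm comes from how the forecast is acted on rather than from the API bill.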
Notes

Open-weight model; also available free via OpenRouter

Recent forecasts