Skip to content
FB
Frontier Benchmarks AI
Models
Wizard
Battle
Hardware
Pricing
Methodology
Download
Search
/
EN
ES
Home
Models
Wizard
Battle
Hardware
Pricing
Methodology
Download
home
/
benchmarks
/
Terminal-Bench-Hard
Coding
Terminal-Bench-Hard
Hard terminal/CLI tasks.
1 models published a score
#
Model
Company
Score
1
Claude Opus 4.5
Anthropic
44.0
← All benchmarks
How we measure