Skip to content

FB Frontier Benchmarks AI

Models Wizard Battle Hardware Pricing Methodology Download

Home Models Wizard Battle Hardware Pricing Methodology Download

home / benchmarks / Terminal-Bench-Hard

Coding

Terminal-Bench-Hard

Hard terminal/CLI tasks.

1 models published a score

#	Model	Company	Score
1	Claude Opus 4.5	Anthropic	44.0

← All benchmarks How we measure

FB Frontier Benchmarks AI

Ground truth, with a map.
The atlas of frontier LLMs.

Explore

Models (111)
Benchmarks (31)
Companies (25)
Download single-file

Trust

Methodology
Changelog
RSS feed
Sitemap

Project

A Javryx Systems project
111 models · 31 benchmarks · 25 companies

© 2026 Frontier Benchmarks AI. A Javryx Systems project.

frontierbenchmarks.com