Changelog
What is changing in Frontier Benchmarks AI. Each entry is signed with its date and tag — no retroactive edits.
Tags
- data (1) score updates, pricing, new models
- product (5) new features (views, tools, UI)
- methodology (1) criterion, formula, definition changes
-
Cross-provider Pricing Calculator
productEstimated monthly TCO per model + provider. Inputs: M tokens input/output/cached. Compares up to 12 models sorted ascending by cost. Cache support when the provider exposes it.
-
Use Case Wizard recommender
product4-step wizard to recommend the top 3 models per use case (coding, math, writing, vision, agent, RAG, summarization, translation). Composite = baseScore × coverage × priorityFactor.
-
Head-to-head Battle Mode
productSide-by-side comparison of 2-4 models benchmark by benchmark. Shareable URL with ?models=a,b,c. Global verdict with winner by wins, abstentions when no score.
-
Hardware Compatibility Checker
productDetects browser GPU/RAM/CPU (with honest Firefox/Safari limitations) and classifies each model into S/A/B/C/D/F tiers based on available VRAM and quantization. Multi-GPU 1-8.
-
Open-source model enrichment
dataAdded total params, active params (MoE), license, cross-provider pricing and install commands (Ollama / LM Studio / vLLM) to the main open-weight models.
-
Definition of "comparable" in battles
methodologyA benchmark is comparable only if 2+ models in the battle have a published score. If a model has no score, it counts as abstained (does not affect winRate).
-
Frontier Benchmarks AI launch
productFirst public version of the atlas. 62 models, 32 benchmarks, 25 companies. Catalog, individual Models, Benchmarks, Companies, Methodology and Download single-file views.