Agentic
TAU-bench
Tool agent benchmark — airline/retail customer service.
1 models published a score
| # | Model | Company | Score |
|---|---|---|---|
| 1 | Command A | Cohere | 51.7 |
Tool agent benchmark — airline/retail customer service.
| # | Model | Company | Score |
|---|---|---|---|
| 1 | Command A | Cohere | 51.7 |