Get financials on a company
Find Hightouch on Crunchbase and return their total funding, last round type, and a one-line description.
✗ Zero tool calls, $0.0005. "Let me search for Hightouch..." — then nothing. Schema parser had nothing to parse.
deepseek
Cheapest model in the matrix. Also the most likely to bail at "let me search for…" without ever calling a tool.
DeepSeek's V3.1 — the Chinese open-weights frontier model that defines the cost floor for serious reasoning. We tested via DeepSeek's first-party API.
Results by agent
Find Hightouch on Crunchbase and return their total funding, last round type, and a one-line description.
✗ Zero tool calls, $0.0005. "Let me search for Hightouch..." — then nothing. Schema parser had nothing to parse.
Read a real r/sales thread on enrichment tools and rank the top 5 by how many people recommended them.
5 tool calls, $0.046 — the model's best run. Read Reddit, counted mentions, returned the 5 tools.
Pull Google Shopping reviews for AirPods Pro 2 (USB-C) and return the top 3 recurring complaints, with verbatim quotes.
4 tool calls, $0.053. Clean pass with verbatim quotes.
Read Stripe's official docs and write a real, working webhook-verification function in TypeScript.
✗ 1 tool call, $0.0005. One Google search, then bailed. "I'll help you create a TypeScript function..." — and stopped.
Scrape Apollo.io's pricing page and return every tier (Free, Basic, Professional, Organization) with name, price, and top 3 features.
2 tool calls, $0.014. Cheapest Apollo pass in the matrix.