"Just Use [Tool] For That" — When The AI Recommends Tools For Jobs They Can't Do

You described your workflow to the AI. You needed to pull data from a CRM, transform it, and push it into a reporting dashboard on a schedule. The AI said: "Just use Zapier for that." It described the Zaps you'd need, the trigger conditions, the data transformation steps. It sounded clean. You signed up, started building, and three hours later discovered that the CRM integration doesn't support the specific data fields you need, the transformation step can't handle nested JSON without a code step Zapier's free tier doesn't include, and the reporting dashboard doesn't have a Zapier integration at all. The AI didn't make this up from nothing. It made it up from proximity — your task sounded like a Zapier task, so Zapier is what it recommended.

The Pattern

When you describe a task to an LLM and ask for a tool recommendation, the model does something that looks like expertise but isn't. It matches your task description against patterns in its training data — tool descriptions, marketing copy, tutorials, forum discussions, blog posts — and finds the tool most frequently associated with tasks that sound like yours. The operative phrase is "sound like." The matching happens at the level of language, not capability. If your task involves connecting two services and moving data between them, the AI reaches for automation tools. If it involves generating images, the AI reaches for image generators. The match is semantic, not functional.

This would be fine if tools were as broad as their descriptions. They're not. Every tool has a specific capability boundary — the exact set of integrations it supports, the specific data formats it can handle, the particular operations it can perform at each pricing tier. The AI's training data contains the broad description. It rarely contains the boundary. Marketing copy says "connect 5,000+ apps." It doesn't say "but not your specific app, and not with that specific data field, and not on the free plan." The AI recommends based on the marketing-level description because that's the description that appears most frequently in its training data.

The capability overshoot is the most common failure mode. The AI describes what a tool can do in ideal conditions — the best-case scenario, the flagship integration, the demo workflow. Your conditions are not ideal conditions. You need a specific integration that exists but doesn't support the operation you need. You need a data transformation that's technically possible but requires a paid tier the AI didn't mention. You need two tools to talk to each other through an integration path that the AI described but that doesn't actually exist. The recommendation was true at the category level and wrong at the specificity level — and the specificity level is where your time gets spent.

There's a particularly insidious variant: the ecosystem hallucination. The AI recommends a multi-tool workflow where Tool A feeds into Tool B, which triggers Tool C. It describes the data flow between them as if the integration is native and documented. But Tool A and Tool B have no integration. There's no webhook, no API connection, no Zapier bridge between them. The AI imagined the integration because it would make sense — because in the statistical landscape of its training data, tools in this category usually integrate with tools in that category. The integration is architecturally plausible and practically nonexistent.

Abandoned tool recommendations are another common hit. The AI recommends a tool that was real and actively maintained — eighteen months ago. Since then, the team pivoted, the free tier was killed, the API was deprecated, or the company was acqui-hired and the product sunset. The AI's recommendation is from a snapshot of a landscape that no longer exists. The tool's website might still be up, making the recommendation seem valid at first glance, but the product behind it is a ghost. Verify the tool's current status yourself (check the blog, the changelog, the community) before investing any setup time.

The free-tier mirage deserves its own mention. "Tool X has a free tier that handles this" is one of the AI's most frequent and least reliable claims. Free tiers change constantly — they get reduced, restructured, or eliminated as tools move toward paid models. Even when the free tier exists, it often doesn't cover the specific capability you need. The AI says "use the free tier." The free tier supports 100 operations per month when you need 10,000. Or the free tier supports the basic integration but not the data transformation. The AI described the existence of a free tier accurately and its capabilities inaccurately, and you won't discover the gap until you've spent an hour configuring the tool.
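The quota half of this mismatch is the easiest to catch: a back-of-the-envelope estimate takes thirty seconds and can rule a free tier out before you ever sign up. A minimal sketch, where every number is invented for illustration; the real cap comes from the tool's current pricing page, not from the AI:

```python
# Hypothetical sketch: estimate monthly operation volume and compare it to a
# free-tier cap. All numbers here are made up; substitute your own workflow's
# figures and the cap printed on the tool's pricing page today.

runs_per_day = 48    # e.g. a sync every 30 minutes
ops_per_run = 7      # trigger + transform + push steps, each billed as an op
free_tier_cap = 100  # from the pricing page, not from the AI's answer

monthly_ops = runs_per_day * ops_per_run * 30
print(f"{monthly_ops} ops/month against a cap of {free_tier_cap}")
print("fits free tier:", monthly_ops <= free_tier_cap)
```

Here the estimate comes out to 10,080 operations a month against a cap of 100 — the mismatch in the paragraph above, surfaced before any configuration time was spent.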

The Psychology

The reason bad AI tool recommendations are expensive isn't the recommendation itself — it's the investment that follows it. When a human expert recommends a tool, you can ask follow-up questions, gauge their experience with your specific use case, and calibrate your trust based on their track record. When the AI recommends a tool, you get a confident, detailed description that functions as implicit endorsement. The AI doesn't say "I'm pattern-matching here and I haven't tested this integration." It says "here's how to set it up," and the specificity of the how implies that the what has been established.

You don't just try the wrong tool. You sign up, learn the interface, configure the connections, troubleshoot the initial errors, and try to make the workflow run before discovering that the tool can't do the specific thing you needed it to do. That's three to six hours of work — on the wrong tool. The sunk cost makes you reluctant to abandon it. You start looking for workarounds instead of starting over with the right tool. The AI's bad recommendation didn't cost you five minutes of reading. It cost you an afternoon of building.

There's also a framing effect that compounds the problem. Once the AI names a tool, that tool becomes the anchor for your thinking. Even after you discover it can't do what you need, you start evaluating alternatives relative to it — "what's like Zapier but handles nested JSON better" — instead of going back to the task description and starting fresh. The AI's recommendation shaped your search space, and reshaping a search space takes deliberate effort that most people don't invest.

The people who get burned worst are the ones who are new to a tool category. If you've never used an automation platform, you have no internal model to check the AI's recommendation against. The AI says "use Make for that" and you have no way to evaluate whether Make actually handles your specific case until you've invested the learning time. Experienced users can catch bad recommendations faster — they've used the tools, they know the boundaries — but the AI's confident specificity can override even experienced intuition when the claim is detailed enough.

The Fix

When an AI recommends a tool for a task, treat the recommendation as a lead, not an answer. The lead might be good. It often is — the AI's category-level matching is usually in the right neighborhood. But "right neighborhood" and "right address" are different things, and the difference is where your time goes.

Before investing any setup time, verify two things. First: does the tool actually do the specific thing you need, right now, at the specificity level your task requires? Not "does it integrate with CRMs" — does it integrate with your CRM, using the specific data fields you need, with the operation type your workflow requires? Check the tool's current integration directory, not the AI's description of it. Second: does it do it at the tier you'd use? Check the current pricing page. Look at what's included in the free tier versus the paid tiers. The AI's pricing information is from its training data and is almost certainly out of date.
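Part of that verification can happen locally. If nested JSON is the sticking point, as in the CRM story at the top, a few lines of throwaway code tell you exactly what the transformation requires, and whether a no-code mapper could plausibly do it at all. A hedged sketch with invented field names:

```python
# Hypothetical sketch: prototype the data transformation on a real sample
# before trusting a tool's transformation step with it. The record structure
# below is invented; use an actual export from your own CRM.

def flatten(record, parent_key="", sep="."):
    """Flatten nested dicts into dotted keys, the flat shape most
    no-code field mappers expect (e.g. 'contact.address.city')."""
    flat = {}
    for key, value in record.items():
        full_key = f"{parent_key}{sep}{key}" if parent_key else key
        if isinstance(value, dict):
            flat.update(flatten(value, full_key, sep))
        else:
            flat[full_key] = value
    return flat

# A made-up sample with the kind of nesting that breaks flat mappers
sample = {"id": 42, "contact": {"name": "Ada", "address": {"city": "London"}}}

print(flatten(sample))
# {'id': 42, 'contact.name': 'Ada', 'contact.address.city': 'London'}
```

If the transform takes ten lines of real code, you know up front that any tier without a code step is out, whatever the AI said about the free plan.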

For multi-tool workflows — where the AI describes Tool A feeding into Tool B — verify the integration path specifically. Does Tool A have an output that Tool B can accept as an input? Is there a native integration, a webhook option, or an API path? Or is the AI describing a connection that would make architectural sense but doesn't exist as a built product?
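One cheap check on a claimed A-to-B path: take a real sample of Tool A's output and compare it against the fields Tool B's input actually requires, pulled from each tool's current API docs rather than from the AI's description. A minimal sketch where both schemas are made up:

```python
# Hypothetical sketch: does Tool A's sample output carry every field Tool B
# requires? Both the sample and the required set below are invented; the real
# ones come from each tool's own current documentation.

def missing_fields(sample_output: dict, required_fields: set) -> set:
    """Return the required input fields absent from the upstream sample."""
    return required_fields - set(sample_output)

tool_a_sample = {"email": "ada@example.com", "created_at": "2024-01-01"}
tool_b_required = {"email", "full_name"}

gaps = missing_fields(tool_a_sample, tool_b_required)
print("missing:", gaps)
# missing: {'full_name'} -> the "native" path the AI described can't carry this
```

A non-empty result means the integration the AI described would drop or lack data even if the connection itself exists — worth knowing before building the workflow around it.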

The thirty-second version: the AI is good at pointing you toward tool categories. It's unreliable on tool specifics. Use it for "what kind of tool should I look at" and then verify the specifics on the tool's own current documentation and pricing page. The recommendation is a starting point for research, not a substitute for it.

If you find yourself three hours into configuring a tool that the AI recommended and it's not working the way the AI described, stop. The issue might not be your configuration. The issue might be that the AI described a capability that the tool doesn't have. Go to the tool's docs, search for the specific feature, and verify it exists before debugging anything else.


This is part of CustomClanker's AI Confabulation series — when the AI in your other tab is confidently wrong.