AEO Article

The AI Share-of-Voice Trap: Your Brand Is 'Visible' — But Is It Winning the Sale?

CeraVe owns more AI share-of-voice in beauty than any other brand — more than twice its nearest rival in what ChatGPT recommends. It ranks #8 in purchase conversion. e.l.f., the brand the AI barely mentions, ranks last in share-of-voice and first in conversion. Across the commercial-intent ChatGPT conversations we classified, roughly 7 in 10 shoppers do not buy the brand the AI names. The leaderboard your AI-visibility dashboard shows you and the leaderboard that decides revenue are almost entirely different lists.

On this page

A mention is not a customer
The leaderboard reorders when you measure behavior
Why high visibility and low conversion can both be true
Why AI-visibility and SOV tools can't see the sale
What to do with this

Buy-through at high AI intent

~30%

of shoppers buy the brand AI recommended

Shoppers who don't buy the AI-named brand

7 in 10

across commercial-intent ChatGPT sessions

e.l.f. conversion rank vs AI SOV rank

+9 ranks

last in AI share-of-voice → first in purchase conversion

ChatGPT conversations with commercial signal

~22%

rising to 29% including medium intent

If you already own an AEO or GEO tool, you've felt this. The dashboard tells you whether ChatGPT names your brand. It cannot tell you whether that mention turned into a customer. That gap is the AI Share-of-Voice Trap — and closing it is the difference between a vanity metric and a measurement you can spend against.

A mention is not a customer

AI visibility matters because real shoppers are there. More than one in five AI conversations — roughly 22% — carry a commercial signal; about 29% once you combine high and medium intent (January–October 2025, ~282K classified ChatGPT conversations, US behavioral panel). And AI is not cannibalizing the funnel: 71.4% of the people who research with ChatGPT also use Google Search or visit a retailer site in the same month. AI is an additive layer on top of search, not a replacement for it. Being named by the assistant is a genuine top-of-funnel win.

But a win at the top of the funnel is not a sale at the bottom. When we matched commercial-intent ChatGPT sessions that named at least one brand to same-month purchases for the same real consumers, about 70% of high-intent shoppers and about 75% of medium-intent shoppers did not buy the AI-recommended brand. AI-named-brand buy-through sat around 30% at high intent. Apple was the lone outlier at 28.2% follow-through; nearly every other product brand we measured came in below 3%. The switches were not random — they were overwhelmingly intra-category, to a direct competitor: the AI named CeraVe and the shopper bought Neutrogena, the AI named Nike and the shopper bought Adidas or New Balance.

AI visibility wins the mention. It doesn't win the sale. Roughly 7 in 10 shoppers who ask an AI for a brand recommendation buy something else — usually a direct rival.

This is a conservative read. We counted only same-month purchases captured in the panel and excluded other retailers, other months, and offline conversion — so true buy-through is, if anything, modestly higher. The direction is not in doubt: AI visibility and AI-driven purchase are nearly unrelated.

The leaderboard reorders when you measure behavior

Take beauty and skincare — the highest-volume commercial category in AI conversations. Rank the brands by AI share-of-voice and you get one list. Rank the same brands by observed purchase conversion and you get a different one. CeraVe leads share-of-voice at 3.26%, more than double its nearest rival, yet converts at 1.68% — only #8. e.l.f. sits dead last in share-of-voice at 0.80% and converts at 9.08% — first in the category, a +9 rank shift. La Roche-Posay is #4 by AI visibility but effectively zero conversion, with a meaningful share of its journeys stranded in the consideration stage: high visibility, broken path to checkout.

This reorder is the whole point. No share-of-voice scraper can produce it, because it requires person-level observed purchase, not a count of AI responses. It is only visible in behavioral data. A brand can look relatively strong on AI visibility and still show a wide gap on conversion — and note that AI-vs-search visibility and AI-vs-purchase conversion are measured against different denominators (search share versus purchase), so a 'visible' brand under-converting is not a contradiction. It is the trap itself.

Why high visibility and low conversion can both be true

When a brand is named often by AI yet converts poorly, the cause is usually one of three things — and they call for different responses. First, the AI recommends the product, not the seller: NVIDIA and Intel are named constantly in hardware conversations, but the purchase flows to Amazon, Best Buy, or Newegg. That's an attribution blind spot, not a conversion failure. Second, category-level rather than product-level naming: Etsy gets named generically ('handmade,' 'personalized') and leaks the intent to whoever closes — a real funnel problem. Third, considered purchases that close offline: T-Mobile and Toyota draw heavy AI mentions but finish in-store, so a digital view understates them. A visibility score collapses all three into one number. Only the journey tells you which one you're looking at — and only one of the three is something you fix with brand work rather than measurement.

Why AI-visibility and SOV tools can't see the sale

This isn't a knock on AEO and GEO tools — it's a description of what they're built to do. Share-of-Voice, Share-of-Answer, and Share-of-Model all measure the same surface: how often a brand appears in what an AI says. That's presence. They read the AI's output; they never observe the human who read it. So when the shopper closes the tab, opens Amazon, and buys a competitor, the visibility tool sees none of it. The category's own vendors concede the point — that AI answers satisfy intent without a click, so visibility can't be tied to revenue, and that scores aren't even comparable tool-to-tool. There is no shared standard that maps a Share-of-Model number to a sale.

Closing the gap requires a different kind of data: person-level observed behavior across the platforms where shoppers actually move — AI assistants, search, retailers, and payment — not pages scraped from a model's responses, not synthetic personas, not stated-intent surveys. That's what produces the beauty reorder and the brand-switch patterns above, and it's exactly what no visibility scraper can reproduce.

Measure Predict was built for exactly this question. Ask it about your category and it returns the observed journey from AI mention to consideration to purchase — which mentions converted, which went to a rival, and why. See whether your AI mentions convert — in Predict.

What to do with this

Keep your visibility tool. It answers a real question — am I in the conversation? — and presence is a prerequisite. But stop treating a share-of-voice number as a proxy for revenue. Pair it with observed behavior so you can tell the e.l.f. case (under-mentioned, converting hard — buy it more reach) from the La Roche-Posay case (well-mentioned, leaking at checkout — fix the path) from the NVIDIA case (converting fine, just not where your dashboard can see it). Those three brands look identical on a visibility chart. They demand three completely different decisions.

The brands that win the next phase of AI search won't be the ones with the highest Share-of-Model. They'll be the ones who know which of their mentions actually convert — and act on the difference.

Find out which of your AI mentions turn into customers — and which go to a competitor. Try Predict Now.

Methodology: figures are drawn from ~282K ChatGPT conversations classified for commercial intent and from Measure's permissioned US behavioral panel of real consumers, over a January–October 2025 window. Conversion metrics differ by finding and are kept within their own framing; the switch analysis is deliberately conservative.