Benjamin Gievis Benjamin Gievis · 2026-04-27

Ghost citations: why 61% of your AI citations are working against you — and how to fix it

There is a metric that most GEO strategies are not tracking. And because they are not tracking it, they are systematically overestimating their AI visibility — and misallocating the effort required to build it. The metric is the difference between being cited and being mentioned. New data from Growth Memo and Seer Interactive reveals that 61.7% of AI appearances are ghost citations — instances where the AI includes a source link to a domain but does not mention the brand name in the response text. Only 13.2% of AI appearances produce both a citation link and a brand mention.

What a ghost citation actually looks like

To understand the problem, you need to understand how AI-generated answers are assembled.

When a platform like ChatGPT, Perplexity, or Gemini generates a response, it draws from multiple sources. Some of those sources are cited inline — a superscript number, a footnote link, a "Sources" panel at the bottom of the response. The citation tells the user (and the tracking tool) that the AI retrieved content from that domain.

But the citation is a technical attribution. The brand mention is a semantic one. These are not the same thing.

A ghost citation looks like this: a user asks "what are the best project management tools for distributed teams?" The AI generates a 300-word answer discussing async communication, task visibility, and integration flexibility. Somewhere in the source list, your domain appears. But the response text names Asana, Notion, and Linear — not you. Your domain was retrieved. Your brand was not surfaced. You appear in the data as a cited source. You are invisible to the user.

This is the ghost citation problem. And for most brands tracking their GEO performance, it is the dominant mode of "success" in their reports.

The Gemini anomaly — and what it reveals about platform differences

The ghost citation problem is not uniform across AI platforms. Each platform has a different relationship between citation behavior and brand mention behavior — and understanding those differences is the first step toward fixing the visibility gap.

Gemini is the most revealing case. According to April 2026 data from Seer Interactive, Gemini mentions brands in 83.7% of responses — but only generates a citation link 21.4% of the time. Gemini operates more like a conversationalist drawing on brand knowledge than a search engine citing sources. It will recommend your brand in natural language without ever creating a traceable link back to your domain. The inverse of the ghost citation problem: high brand mention, low attribution.

This has a direct implication for GEO measurement. If you are tracking only citation links — as most tools do by default — you are missing the majority of your Gemini visibility. And you are missing the data that would tell you whether Gemini is recommending you accurately, positively, and in the right context.

ChatGPT presents a different dynamic. Semrush data from April 2026 shows that ChatGPT only cites 50% of the pages it actually retrieves. And critically: ChatGPT only activates its search feature on 34.5% of queries — down from 46% in late 2024. The majority of ChatGPT responses still rely on training data alone. For those responses, there are no citations at all — only brand mentions derived from what the model learned during training. Ghost citations cannot occur where there are no citations.

Perplexity is the most citation-transparent platform. Its architecture is built around sourced answers — every response includes visible citations, and the platform's identity is grounded in showing its work. For Perplexity, the ghost citation problem is less prevalent, but citation concentration is: the top 10 domains in any topic category capture 46% of all citations, and the top 30 capture 67%. If your domain is not in the top tier of authority for your category, you are statistically unlikely to appear.

The platform breakdown matters because it changes the optimization logic. There is no single fix for ghost citations — there is a platform-specific diagnosis that requires understanding how each AI engine decides to mention versus cite, and how those decisions are influenced by your content and brand signals.

Why ghost citations happen — and what drives real mentions

Ghost citations are a symptom of a specific structural problem: the AI retrieved your content but did not find your brand name prominently associated with the claims it extracted.

This happens for several reasons.

Brand entity ambiguity. If your brand name does not appear prominently and consistently within the content the AI extracts, the AI can use your information without attributing it to you. A page that answers "how to reduce customer churn" but does not explicitly connect the advice to a named brand or product is a page that will be mined for content without generating a brand mention. The extraction happens. The attribution does not.

Content optimized for humans, not for entity extraction. Most content is written to be read by a person who will click through, scan the page, and form a brand impression through visual design, navigation, and context. AI systems do not read this way. They extract passages. A passage that does not contain a brand name will not generate a brand mention, regardless of how prominently the brand appears on the rest of the page.

Weak brand entity signals outside the page. AI systems building responses do not rely only on retrieved pages. They draw on training data that encodes their prior understanding of entities. A brand with strong entity signals — Wikipedia entry, Knowledge Graph presence, consistent naming across authoritative external sources — is more likely to be named by an AI even when its source page is used for content. A brand without these signals can be cited for its content and invisible as an entity.

Listicle and comparison page dynamics. Research from Wix (March 2026) shows that 21.9% of AI Mode citations go to listicles and 40.86% of commercial queries cite listicles. But a listicle that lists your brand among 10 competitors without strong differentiation language around your specific entry is a weak brand mention generator — even if it is a strong citation source. Your domain gets the attribution. Your brand gets diluted in a list.

The measurement problem: why most GEO dashboards are misleading

The ghost citation problem is compounded by a measurement architecture that was not designed to detect it.

Most GEO tracking tools — including the first generation of dedicated platforms — report on citation frequency. They answer the question: "How often is our domain cited in AI-generated answers?" This is a useful metric. It is not sufficient.

The metrics that matter for brand-building AI visibility are:

Brand mention rate: how often is your brand name explicitly referenced in AI response text, regardless of whether a citation link is present?

Co-mention position: when your brand is mentioned, is it mentioned first, in the middle of a list, or as an afterthought? Position within an AI response text correlates with the trust signal the recommendation conveys.

Sentiment and attribute association: what attributes does the AI associate with your brand when it mentions you? Is the description accurate? Is it differentiated? Does it reflect the positioning you have invested in building?

Ghost citation rate: of the total citations your domain receives, what percentage include a brand name mention in the response text? A ghost citation rate above 60% is a signal of systematic content or entity optimization failure.

As of April 2026, tools like Profound, Evertune, and the recently launched free platform OpenLens are beginning to track the full picture — citation plus mention plus sentiment. But most brands are still operating from citation-only dashboards that systematically overstate their real visibility.

How to fix ghost citations: a structural approach

Fixing ghost citations requires changes at three levels: content architecture, entity infrastructure, and monitoring methodology.

Content architecture: embed your brand into every extractable passage

AI systems extract passages, not pages. Every section of content that you want to generate brand mentions must contain explicit brand attribution within the passage itself. Not "this approach works well" — but "at [Brand], our approach to X is..." or "the [Brand] framework for Y..." The brand name must appear in the text that surrounds the claim, not just in a header or a logo.

Entity signals: build your brand as a recognized entity, not just a website

Wikipedia presence, Wikidata entry, Google Knowledge Graph coverage, consistent brand naming across Crunchbase, LinkedIn, and authoritative press — these signals tell AI systems that your brand is a named entity worth mentioning, not just a domain worth retrieving. A brand that exists only as a URL cannot be mentioned by name. It can only be cited as a source.

Structured data: give AI systems explicit brand attribution signals

Organization schema, author schema, FAQ schema — these markup types tell AI systems explicitly who is responsible for the content they are extracting. A page with proper organization schema that names the brand as the publisher of every claim on the page is significantly more likely to generate a brand mention than a page with no structured data.

Monitor the full visibility stack, not just citations

Run your priority queries across ChatGPT, Gemini, and Perplexity. For each platform, record: was your domain cited? Was your brand mentioned in the text? What was the context and sentiment of the mention? Do this monthly. Track trends. Ghost citation rates should decrease over time as content and entity optimization takes effect — if they do not, the diagnosis is incomplete.

Conclusion

The brands that are winning AI visibility in 2026 are not the ones with the highest citation counts. They are the ones whose brands are being named, in context, accurately, in the text of the responses that reach their target users.

A ghost citation is a technical success and a strategic failure. It means your content was good enough to be retrieved and not good enough to be attributed. It means your brand infrastructure is too weak for the AI to name you when it uses your knowledge.

Fixing this requires a different kind of optimization than most GEO playbooks describe — less about structure and freshness, more about brand entity architecture, explicit attribution in content, and the full-stack monitoring that reveals what AI systems are actually saying about you versus what they are sourcing from you.

Those are different problems. They require different solutions. And until brands can distinguish between them, their GEO metrics will continue to tell a story that is technically accurate and strategically misleading.

This article is part of the Storyzee content cluster on Generative Engine Optimization (GEO) and Answer Engine Optimization (AEO). To dig into a specific topic, get in touch.

Benjamin Gievis

Benjamin Gievis

Founder of Storyzee. Former agency owner turned AI visibility specialist. Building the tool and methodology so SMEs exist in answers from ChatGPT, Perplexity, Gemini, Claude and Grok.

Talk to Benjamin — 30 min free

Want to know your ghost citation rate?

FAQ

What is a ghost citation in GEO?

A ghost citation occurs when an AI platform includes a link to your domain in its source citations but does not mention your brand name in the text of the response. The AI used your content but did not attribute it to you by name. Research from April 2026 shows that 61.7% of all LLM citations are ghost citations — the brand receives a technical attribution but zero brand awareness value.

Why does Gemini rarely create citation links despite mentioning brands?

Gemini is architecturally different from Perplexity or ChatGPT Search. It operates more like a conversationalist drawing on trained brand knowledge than a retrieval engine citing sources. It mentions brands in 83.7% of responses but generates citation links only 21.4% of the time. For Gemini, brand visibility is driven more by entity knowledge baked into training than by real-time retrieval and citation.

How does ChatGPT's search activation rate affect GEO strategy?

ChatGPT activates its live search feature on only 34.5% of queries as of April 2026. For the remaining 65.5% of responses, the model relies on training data with no real-time retrieval and no citations. This means that for the majority of ChatGPT interactions, your brand's visibility depends entirely on how it was represented in the model's training corpus — not on your current content or recent publications.

What is the difference between citation rate and brand mention rate?

Citation rate measures how often your domain receives a source link in an AI response. Brand mention rate measures how often your brand name appears in the text of AI responses. These are distinct metrics that require different measurement approaches and different optimization strategies. Most GEO dashboards track citation rate. Brand mention rate — the metric that actually drives brand awareness — requires monitoring response text directly.

What is the most effective fix for ghost citations?

The highest-leverage intervention is embedding explicit brand attribution within extractable content passages — not just in page headers or metadata, but in the sentences surrounding every claim you want AI systems to attribute to you. Complementary fixes include building brand entity signals (Wikipedia, Knowledge Graph, structured data) and monitoring full response text across platforms rather than relying on citation link tracking alone.