Reddit and YouTube in AI Search: Why UGC Is the #1 Cited Source by LLMs


The number-one source that AI models cite isn't the New York Times. It isn't Wikipedia (well, except for ChatGPT). It isn't some enterprise content powerhouse with a million backlinks and a 95 domain authority.
It's Reddit. User-generated content (UGC) dominates AI search citations in ways that should make every content marketer uncomfortable.
An anonymous forum where people argue about whether a hot dog is a sandwich. That's the most-cited source across aggregated AI search platforms.
The citation leaderboard no one expected
Lily Ray, VP of SEO Strategy & Research at Amsive, has been tracking which domains AI platforms actually cite. The data, drawn from her research across ChatGPT, Perplexity, and Google AI Overviews, paints a picture that flips traditional content strategy on its head.
Here's what the numbers show:
- Perplexity cites Reddit in 46.7% of responses. YouTube is second at 13.9%.
- Google AI Overviews cite Reddit at 21%, YouTube at 18.8%, and Quora at 14.3%.
- ChatGPT leans on Wikipedia (47.9%), but Reddit still captures 11.3%.
Aggregate it across platforms, and Reddit and YouTube sit at the top. Not brand blogs. Not industry publications. User-generated content platforms where real people share real experiences.
Traditional SEO metrics (backlinks, domain authority, the stuff we've obsessed over for twenty years) only predict 4–7% of AI citation behavior, according to Ray's research presented at Tech SEO Connect 2025. That's not a "declining factor." That's near-irrelevance.
Why LLMs love what real people write
Kevin Indig at Growth Memo dug into what makes cited content different from uncited content. One finding stands out: cited text has an entity density of 20.6%, nearly three times the ~5–8% found in normal English text. Entities are specific things: brand names, product names, people, places, versions, prices.
UGC is packed with this stuff naturally. Nobody on Reddit writes "consider evaluating a leading CRM solution." They write "we switched from Salesforce to HubSpot last quarter and our close rate went up 15%." That's entity-dense. That's specific. That's exactly what LLMs can extract clean facts from.
There's a deeper reason too. Google has spent years pushing E-E-A-T (Experience, Expertise, Authoritativeness, Trustworthiness). The first E, Experience, is the one brands struggle with most. A corporate blog post about "5 Tips for Project Management" doesn't carry experience. A Reddit thread where an actual PM describes how they rescued a failing sprint using a specific tool? That's experience. LLMs can tell the difference. We break down exactly how to build E-E-A-T authority that LLMs trust.
The great decoupling: rankings and citations are no longer the same game
Kevin Indig coined the phrase "The Great Decoupling" in Growth Memo, and it captures something important: organic search rankings and AI citations have diverged. They're no longer proxies for each other.
A Reddit thread might never appear on page one of Google for a competitive keyword. But it gets cited by Perplexity, referenced by Claude, and pulled into Google's AI Overviews. Meanwhile, a perfectly optimized brand page ranking #1 for that same keyword might get zero AI citations.
UGC sites rank in AI responses despite not "optimizing" for anything. They succeed because the content itself is what LLMs are looking for: specific, experienced, entity-rich, and honest.
Indig's client data shows you can grow pipeline 2.3x faster than traffic, and you can grow pipeline while traffic stays flat or even declines. The classic SEO model assumed more rankings meant more clicks, meant more traffic, meant more leads. That chain is broken.
Eli Schwartz's counterpoint: stop optimizing for engines
Eli Schwartz, author of Product-Led SEO, argued that the entire GEO (Generative Engine Optimization) framework "misses the mark." His point: if you're optimizing for the engine rather than the user, you're playing the same game that got SEO into trouble in the first place.
Reddit and YouTube don't succeed in AI citations because someone optimized them for LLMs. They succeed because people on those platforms are answering real questions with real experience. The optimization is accidental. The authenticity is the feature.
This is the part that makes content marketers squirm. You can't "optimize" your way into this. The content has to actually be good. Not good by SEO standards, but good by "would a real person find this genuinely helpful" standards.
Reddit strategy for B2B: what works and what gets you banned
Let's be honest about the tension here. Every B2B marketer reading this is thinking: "Great, so we need a Reddit strategy."
Sort of. But not the way you're imagining.
What gets you banned:
- Creating accounts solely to promote your product
- Posting thinly disguised marketing content as "advice"
- Having your sales team upvote each other's posts
- Using intern accounts to seed product mentions
- Any form of astroturfing. Reddit's community moderators are extremely good at detecting it, and the consequences are permanent
What actually works:
- Having your engineers and product people participate in relevant subreddits as themselves, not as brand ambassadors, as humans who happen to know things
- Answering technical questions where your expertise is genuinely relevant
- Sharing honest experiences, including the limitations of your own product
- Building a posting history over months before your product is ever mentioned
- Engaging in discussions where you have nothing to sell
Companies like Datadog and Zapier have people who genuinely participate in their relevant subreddits. They answer questions, share context, occasionally mention their products when directly relevant, and they've built credibility over years, not weeks. That credibility now pays compound interest in AI citations.
There's no shortcut. If your "Reddit strategy" can be put into a Notion doc with KPIs and a weekly content calendar, it's not going to work. Reddit rewards people, not brands.
YouTube: the citation magnet hiding in plain sight
YouTube's position as the #2 most-cited source shouldn't be surprising once you think about it. Video transcripts are structured, detailed goldmines of information that LLMs can extract from.
A 20-minute YouTube tutorial on "how to set up Kubernetes monitoring" contains more specific, actionable, entity-dense information than most blog posts written about the same topic. The speaker names tools, shows configurations, explains trade-offs, and usually responds to comments with additional context.
The interesting part: YouTube videos get cited in AI responses even when the user never watches the video. The LLM extracts information from the transcript and metadata. Your video is being "read" by AI, not watched.
This means a few things for B2B brands:
- Transcripts matter more than production quality. A clearly spoken walkthrough beats a slick brand video with vague messaging.
- Specific titles outperform broad ones. "How We Cut AWS Costs 40% Using Spot Instances" gets cited. "Cloud Cost Optimization Best Practices" does not.
- Comments add citation surface. Active Q&A in your video comments creates additional entity-rich content that AI models pick up.
- Technical depth wins. The videos that get cited are how-tos, comparisons, and deep dives, not brand awareness content.
The long tail: review sites, forums, and Quora
Reddit and YouTube get the headlines, but Lily Ray's research shows a broader pattern. AI models disproportionately cite platforms where users share unfiltered experiences:
- G2 and Gartner Peer Insights: software review platforms show up heavily in bottom-of-funnel AI responses
- Quora: particularly for "how to" and "what is" queries, cited at 14.3% by Google AI Overviews
- LinkedIn: especially posts with specific data or contrarian takes from recognized practitioners
- Stack Overflow: still a dominant citation source for technical queries
- Niche forums: industry-specific communities carry weight for vertical queries
The pattern is consistent: 95% of the sources AI cites when answering questions come from platforms you don't own or control. Your brand website represents a sliver of your AI visibility footprint. Understanding how to measure AEO visibility across these platforms matters more than ever.
The honest tension: brands want control. UGC is uncontrollable.
Here's the part most "UGC strategy" blog posts skip over.
Brands are built on control. Controlled messaging. Controlled positioning. Controlled narratives. Marketing departments exist to manage how a brand is perceived.
UGC is the opposite of that. Someone on Reddit can write "we tried [your product] for three months and it was awful, here's why we switched" and that post might get cited by every AI model on the internet. You can't edit it. You can't suppress it. You can't outrank it with a sponsored post.
This is genuinely uncomfortable territory. And there's no clean solution.
What you can do:
- Make the product good enough that UGC is mostly positive. This sounds obvious. It's also the hardest and most important thing on this list.
- Respond to negative UGC publicly and constructively. AI models pick up on how brands handle criticism. A thoughtful response to a complaint can become part of the cited content.
- Encourage your actual users to share experiences. Not through incentivized review campaigns. By building a product experience worth talking about.
- Monitor what AI models say about you. Ask ChatGPT, Perplexity, and Claude about your product regularly. Know what's being cited. Know where the narrative is forming.
Building an off-site presence strategy that feeds AI visibility
Lily Ray has been clear on this point: brands need to prioritize off-site signals. Digital PR, brand mentions across Reddit, Quora, and review sites. These carry heightened importance for AI visibility. Third-party validation has moved from a secondary SEO benefit to a primary visibility driver.
Here's what a practical off-site strategy looks like:
1. Audit your current AI citations. Ask each major AI platform about your product category. Note which competitors get cited, which sources are referenced, and where your brand shows up, or doesn't.
2. Map the citation sources in your category. Which subreddits discuss your space? Which YouTube channels review tools like yours? Which G2 categories matter? Build a map of where the conversations happen.
3. Invest in genuine participation. Get your subject matter experts active on the platforms that matter. Not as brand accounts. As people with expertise.
4. Rethink your content distribution. That detailed comparison guide sitting on your blog? The data is more valuable as a Reddit comment, a YouTube video, or a LinkedIn post with specific numbers. Distribute the knowledge where AI models actually look.
5. Build relationships with the people AI cites. Industry analysts, independent reviewers, active forum contributors. These are the new "backlinks." Their mentions of your product carry more weight in AI responses than a link from a DA 90 website.
6. Cross-team collaboration is non-optional. This isn't a task for the SEO team alone. It spans PR, social, product marketing, customer success, and developer relations. AI visibility is an output of brand presence everywhere.
What this means for your AI sales presence
The rise of UGC in AI search changes the equation for how brands get discovered, and what happens after discovery.
When a buyer asks an AI assistant about your product category and gets a response citing Reddit threads, YouTube reviews, and G2 comparisons, they arrive at your site with a different set of expectations than a buyer from Google. They've already heard real user opinions. They've already formed initial impressions. They don't need another feature list.
They need a conversation. A real one. Responsive, honest, specific to their situation.
This is where AI sales agents become core infrastructure, not just conversion tools. When buyers land after exposure to UGC-driven AI citations, they expect the same specificity and directness they got from the Reddit thread that sent them there. A generic chatbot with canned responses breaks that continuity. An AI agent that can have a real conversation, answer specific questions, and respond with the same level of depth as the UGC that influenced the buyer? That's what closes the gap between AI discovery and conversion.
The entire funnel is shifting. The top is now owned by user-generated content you can't control. The middle is shaped by AI models you can't directly optimize. The bottom, the actual conversation with the buyer, is the part you can still own. Make it count. And as agentic commerce reshapes the buyer journey, even that bottom layer is being mediated by AI.




