YouTube Just Overtook Reddit as the Top AI Citation Source. Here's the B2B SaaS Pivot.

In This Article

Bluefish data: YouTube cited in 16% of LLM answers vs. 10% for Reddit. Video isn't a content type — it's a citation surface. Here's the pivot.

Updated

Trusted by 1,000+ teams

★★★★★ 4.9/5

Startups use Averi to build
content engines that rank.

TL;DR

  • 📊 The data reversed. Bluefish/Adweek's May 2026 analysis shows YouTube is now cited in 16% of LLM answers, versus 10% for Reddit — a reversal of the dominant narrative the GEO industry has been operating on for nine months.

  • 🎙️ YouTube wins on weight per citation. LLMs read YouTube transcripts as authored expertise: named speaker, full attribution, structured chapters, rich metadata. Reddit comments are pseudonymous and lower-weight by default.

  • 🎯 The framing shift: video isn't a content type, it's a citation surface. Founders who never recorded video now have to. The bar is lower than most think — 5-minute talking-head clips outperform polished productions for citation extraction.

  • 📈 Reddit still matters for volume. The Reddit Comment Playbook still works; this is a weighting shift, not a binary replacement. Most B2B SaaS founders should now run both, with YouTube as the higher-leverage channel.

  • ⚙️ The B2B SaaS video pivot doesn't require a new team. Repurpose existing editorial content into talking-head video, ship transcripts with proper schema, track citation extraction across ChatGPT, Perplexity, Claude, Gemini.

Zach Chmael

CMO, Averi

"We built Averi around the exact workflow we've used to scale our web traffic over 6000% in the last 6 months."

Your content should be working harder.

Averi's content engine builds Google entity authority, drives AI citations, and scales your visibility so you can get more customers.

YouTube Just Overtook Reddit as the Top AI Citation Source. Here's the B2B SaaS Pivot.

Did YouTube Really Overtake Reddit as the Top AI Citation Source?

Yes. Bluefish/Adweek's May 2026 analysis of citation rates across the four major LLMs (ChatGPT, Perplexity, Claude, Gemini) found YouTube cited in 16% of model answers compared to Reddit's 10%.

The gap is 6 percentage points wide and consistent across query categories. YouTube leads on B2B research queries, product comparison prompts, technical how-to questions, and category-defining "what is X" prompts.

This is a narrative reversal.

For most of 2025 and early 2026, the dominant GEO playbook treated Reddit as the platform LLMs trusted most. Reddit was cited in 60%+ of Perplexity answers for B2B research queries, and most "AI citation strategy" pieces (including ours, published earlier this month) treated Reddit comments as the highest-yield citation surface.

The data shifted because YouTube's transcript indexing accelerated, model training corpora expanded video weight, and the structural quality of YouTube as authored expertise finally got priced in.

The previous playbook isn't wrong. It's just no longer the highest-yield play.

Why Do LLMs Weight YouTube Higher Than Reddit?

LLMs weight YouTube higher because of four structural properties Reddit can't match.

Named authorship. YouTube videos have identifiable speakers, channel attribution, and traceable expertise signals (subscriber count, engagement, channel history). LLMs trained on attribution-rich corpora treat these as higher-trust sources. Reddit comments are pseudonymous by default — most are anonymous to the model.

Structured metadata. YouTube videos ship with titles, descriptions, chapters, tags, and timestamp markers. The metadata layer is rich and machine-readable. Reddit threads have titles and post bodies; comments have voting signals but no equivalent structural metadata.

Transcript density. A 6-minute YouTube video produces a 900–1,200 word transcript covering one focused topic with the speaker's expertise framing. The fact-per-100-words density in spoken expert content is higher than written content because speakers naturally repeat and emphasize key claims. That density matches LLM extraction patterns.

Verified domain authority. YouTube as a domain carries higher base authority in LLM training data than Reddit. The training corpora weight google.com properties heavily, and YouTube is part of that infrastructure.

The four properties compound.

A B2B founder's 6-minute talking-head video with proper chapters, description, and transcript gets cited at roughly 3x the rate of a 200-upvote Reddit comment on the same topic. The weighting is structural, not coincidental.

What Did Most "AI Citation Strategy" Pieces Get Wrong?

Most pieces (ours included, in our Reddit comment playbook published earlier this month) optimized for citation volume rather than citation weight. The error was treating Reddit's high citation share as proof of high citation value.

The miscalibration:

Volume looked decisive. Reddit's 60%+ citation share in Perplexity answers for B2B research read as dominance. It was breadth, not depth. Each Reddit citation carried less weight in the model's confidence scoring than a YouTube citation, but volume metrics hid that.

Anonymous citations got over-credited. A pseudonymous comment with 200 upvotes feels authoritative to a human reader. LLMs treat it as a community signal but not as authored expertise. The expert framing requires a named source, which Reddit rarely provides.

Structural metadata got ignored. GEO playbooks focused on comment value-density formulas without acknowledging that the platform's metadata structure caps the achievable citation weight regardless of comment quality.

Video felt like overhead. Most B2B founders treated video as a separate marketing channel requiring different production infrastructure. Reframing it as a citation surface (the same words, captured in a transcript) makes the production overhead trivial.

The correction isn't abandoning Reddit.

It's reweighting: YouTube as primary, Reddit as supporting, LinkedIn as professional-query primary in B2B-specific contexts. The three-platform stack covers the citation surface most efficiently for seed-to-Series-A B2B SaaS.

Why Is Video Actually a Citation Surface, Not a Content Type?

The reframe is the most important shift in this piece. Most marketers treat video as a content type — "I publish blog posts, social posts, and videos." That framing makes video feel like extra work bolted onto existing production.

The accurate frame: video is a structured data source LLMs extract from. The transcript is the citation surface. The video format is just the production wrapper. When you publish a 6-minute talking-head explaining your category, what gets cited is the 900-word transcript with timestamp anchors and chapter metadata. The video itself is the artifact that produces that citation surface efficiently.

This reframe changes the production math. You don't need cinematography, animation, or polished editing. You need:

  • A camera (your laptop webcam is fine for the citation-weight outcome)

  • A topic with clear extractable claims

  • A 5–8 minute talking duration

  • Auto-generated transcript, lightly edited for accuracy

  • Proper YouTube chapters and description

That's it. The same fact-density discipline you apply to editorial content applies to video. Direct-answer chapter markers (like the H2s in this piece) become extraction anchors. Specific stats and first-person experience markers become extracted citations. Multimodal coverage becomes the default rather than the aspirational.

Founders who frame video as overhead won't ship. Founders who frame video as citation surface will.

How Do LLMs Extract From YouTube?

LLMs extract from YouTube in four stages, each of which favors structurally optimized content.

Stage 1: Transcript ingestion. The model accesses YouTube's auto-generated transcript (or your manually uploaded one). Transcripts with proper punctuation, paragraph breaks, and speaker attribution extract more cleanly than raw auto-transcripts. Editing the transcript before publish is the single highest-leverage optimization.

Stage 2: Metadata indexing. The model reads the video title, description, chapter markers, and tags. Chapter markers are critical: they create extraction anchors that match buyer-question prompts. A video with "0:00 Introduction" as the only chapter marker performs worse than the same video with "0:00 What Is X / 1:30 How Does X Work / 3:00 When Should B2B SaaS Use X."

Stage 3: Citation surface generation. The model identifies extractable passages — typically 100–300 word transcript segments anchored to specific timestamps. Pages with proper structured data see 36% higher AI citation rates; videos with VideoObject schema and detailed metadata see comparable lifts.

Stage 4: Confidence weighting. The model weights the citation by source authority (channel age, subscribers, engagement) and topical fit (does the channel consistently cover this topic?). Channels with focused topical authority outperform general-interest channels by 2–3x in citation weight.

The implications: focus your YouTube channel on a tight topical cluster, optimize transcripts and metadata, use direct-answer chapter markers, and ship consistently. The same structural rules that govern editorial AI citation govern video citation.

What Does the B2B SaaS Video Pivot Look Like?

The B2B SaaS video pivot is a 90-day shift in content production architecture. It doesn't require hiring video producers, building a studio, or buying expensive equipment.

The 90-day pivot pattern:

Days 1–14: Setup. Create a YouTube channel anchored to your topic cluster (one cluster, not "everything we do"). Set up auto-transcript editing workflow. Pick the first 6 video topics — these should be your highest-citation-yield editorial pieces converted to video format.

Days 15–45: Production cadence. Record 1 video per week. Each video is a 5–8 minute talking-head version of an existing editorial piece, with the same direct-answer structure: introduce the topic, answer the buyer question, give 3–4 supporting points, close with a specific recommendation. Ship 6 videos in the first 6 weeks.

Days 46–90: Optimization layer. Add proper chapter markers, edit transcripts for fact density, write descriptions that include the target buyer-question phrasing. Cross-link from editorial pieces to the matching video and vice versa. Track AI citation rate weekly across the major LLMs.

The first 6 videos establish citation surface.

Videos 7–12 compound it.

By day 90, the channel has ~12 videos covering your topic cluster comprehensively, and the citation extraction patterns match what worked for editorial… cluster authority, multimodal layering, structural optimization for extraction.

The math: 30–60 minutes of recording per week, sustained for 12 weeks, produces a citation surface that compounds for the next 18+ months.

See what your Content ROI could be this year

What's the Minimum Viable Video Production for a Founder?

The minimum viable production is lower than most founders assume. The bar is "watchable and extractable," not "broadcast-quality."

The MVP setup:

Element

What you need

What you don't need

Camera

Laptop webcam (1080p)

DSLR, mirrorless, cinema camera

Audio

$80–$120 USB microphone

Studio condenser, lavalier, audio engineer

Lighting

Window-facing seat during daylight

Ring lights, softboxes, 3-point lighting

Background

A clean wall or simple bookshelf

A built studio set or virtual backgrounds

Editing

Cut dead air, light color correction

Motion graphics, b-roll, animated transitions

Recording length

One 8–10 minute take to produce a 5–7 minute final

Multiple takes, complex re-shoots

Total setup cost

Under $200

$5,000+ studio investment

The production discipline matters more than the equipment. Five-minute talking-head clips with direct-answer structure outperform 20-minute polished productions for citation extraction because the fact density is higher and the model extracts cleaner passages.

Founders who already write editorial content have the harder skill nailed. Recording yourself talking through that content is the easier skill, and the citation-weight payoff is structural.

How Should B2B SaaS Repurpose Existing Content Into Video?

Repurposing existing editorial content into video is the fastest path to a 6–12 video citation surface.

The pattern works because the structural discipline of citation-optimized editorial (direct-answer H2s, fact-dense sections, first-person experience markers) translates directly to video script structure.

The repurposing workflow:

Step 1: Identify your highest-performing editorial pieces. Pull your top 10 articles by impressions or rankings. The pieces that already work in text form will work in video form.

Step 2: Convert structure to script outline. Each H2 becomes a video chapter. The opening paragraph becomes the 30-second hook. The TL;DR becomes a closing summary. The FAQ becomes a "questions you might have" segment at the end.

Step 3: Record without reading. Don't read the editorial piece verbatim. Talk through the same arguments in your natural voice. Spoken delivery produces higher citation weight than read-aloud delivery because LLMs detect read-aloud cadence and de-weight it.

Step 4: Ship with full metadata. Title matches buyer-question phrasing. Description includes target keywords and links back to the editorial source. Chapter markers match the H2 structure. Transcript edited for fact density.

Step 5: Cross-link the editorial piece. Embed the video at the top of the matching editorial piece. The cross-linking creates a citation surface loop: both formats reinforce each other, and LLMs extract the combined authority signal.

The first 6 repurposed videos take 30–45 minutes each. By video 6, the workflow compresses to 20 minutes. By video 12, you have a full multimodal cluster that compounds.

What's the Right Cadence for YouTube AI Citation?

One video per week, sustained for 12+ weeks, is the cadence that produces measurable AI citation lift. Lower cadence doesn't accumulate enough surface. Higher cadence usually sacrifices the structural quality that drives citation weight.

The cadence math:

Weekly cadence (12 videos in 90 days): Sustainable for solo founders or lean teams. Each video gets proper structural attention. Citation inflection typically hits at video 10–11. By video 12, the channel covers a topic cluster comprehensively.

Bi-weekly cadence (6 videos in 90 days): Slower compounding but still produces citation lift if each video is structurally optimized. Better than nothing, but the channel reads as inactive to the YouTube algorithm and gets lower discovery.

Daily/3x-weekly cadence: Common in influencer YouTube playbooks but counterproductive for B2B SaaS citation. The volume forces shorter, less-structured videos that capture less citation surface per piece. Volume cadence works for entertainment categories; structural cadence works for B2B research surfaces.

The volume-vs-quality tradeoff resolves toward quality for B2B SaaS specifically.

Your buyer doesn't want 50 videos. They (and the LLM citing you for them) want 12 videos that comprehensively cover your category with extractable answers. Cluster authority compounds; cluster sprawl doesn't.

A useful weekly check: did this video produce extractable claims with first-person experience markers and direct-answer structure? If yes, ship. If no, re-record. The bar is structural, not aesthetic.

How Do You Track YouTube AI Citation Impact?

Track three signals weekly during and after the 90-day pivot. Generic YouTube analytics (views, watch time, subscribers) are downstream of the actual signal — citation extraction.

Signal 1: AI citation rate by engine. Run a fixed list of 10 buyer-question prompts across ChatGPT, Perplexity, Claude, and Gemini weekly. Track how many citations point to your YouTube channel or specific video URLs. Pre-pivot baseline matters as much as post-pivot count.

Signal 2: Cross-platform citation comparison. Track citation rates for the same topic across your YouTube videos, editorial pieces, Reddit comments, and LinkedIn posts. The ratios reveal which channel is doing the heavy lifting on which query types.

Signal 3: Transcript extraction patterns. When an AI engine cites your video, which specific transcript passage did it extract? Pattern-match across citations to identify which sections of your videos produce the highest citation density. Use the pattern to inform future video structure.

Most teams undercount AI citation impact by 30–50% because the referrer attribution from AI sources is weak. The citation rate measurement above gives a direct view that bypasses the attribution gap.

YouTube analytics tell you if humans are watching.

Citation rate measurement tells you if AI engines are extracting.

For 2026 B2B SaaS, the second metric matters more.

Ready to Build a Citation-Ready Content Engine?

Editorial + YouTube + Reddit + LinkedIn is the four-surface citation stack that wins B2B SaaS AI search in 2026. The editorial spine is what Averi's content engine handles end-to-end — strategy, drafting, multimodal layering, publishing, and citation tracking. Layer the video and social work on top of an editorial foundation that's already optimized for extraction.

Start your 14-day free trial →


FAQs

Did YouTube really overtake Reddit as the top AI citation source?

Yes, per Bluefish/Adweek's May 2026 analysis showing YouTube cited in 16% of LLM answers across ChatGPT, Perplexity, Claude, and Gemini, versus 10% for Reddit. The gap is 6 percentage points and consistent across query categories. Reddit still leads on raw citation volume in some prompt types, but YouTube wins on citation weight per source and on B2B research queries specifically. The reversal is structural, driven by transcript indexing improvements and corpus weighting changes.

Does this mean I should stop optimizing for Reddit?

No. Reddit is still high-volume citation surface and the 30-minute weekly comment cadence still produces measurable lift. The shift is in resource allocation: if you only have time for one channel, prioritize YouTube. If you can run two, YouTube primary and Reddit supporting. The three-platform B2B stack (YouTube + Reddit + LinkedIn) covers the citation surface most efficiently and remains the right approach.

Why are YouTube videos cited more heavily than Reddit comments by LLMs?

Four structural reasons: named authorship (videos have identifiable speakers; Reddit comments are pseudonymous), structured metadata (titles, chapters, descriptions, tags), transcript density (focused topical coverage with fact-dense spoken delivery), and verified domain authority (YouTube as a google.com property carries higher base trust in LLM training corpora). Each property contributes; they compound. A founder's 5-minute talking-head video typically out-cites a 200-upvote Reddit comment on the same topic.

What's the minimum video production setup for B2B SaaS citation work?

Laptop webcam, $80–$120 USB microphone, window-facing daylight, clean wall or bookshelf background, basic editing for dead air removal. Total under $200. The production quality doesn't drive citation weight — the structural discipline does (direct-answer chapter markers, fact-dense transcripts, clear topical focus). Five-minute talking-head clips with strong structure outperform polished productions with weak structure for AI extraction.

How long does it take to see results from the YouTube pivot?

Citation inflection typically hits at video 10–11 in a weekly cadence, around the 70–80 day mark. Before that, the channel is accumulating surface. After that, the cluster authority compounds and new videos start getting cited within 2–3 weeks of publish instead of 6–8 weeks. The 90-day full pivot timeline is the realistic horizon. Faster results are possible but rare.

Should B2B SaaS founders repurpose existing editorial into video or create original content?

Repurpose first. The editorial pieces that already work in text form have battle-tested structure, validated buyer-question targeting, and extractable claim density. Converting them to video format is the fastest path to a citation-ready video library. Once you have 6–8 repurposed videos shipped, add original video content addressing topics that don't exist yet in your editorial library. The repurpose-first approach also creates cross-linking opportunities that strengthen both formats.

How does this affect AI citation strategy across editorial, social, and video?

The optimal mix for B2B SaaS in 2026 shifts to: editorial as the cluster anchor (depth, authority, full citation framework), YouTube as the primary social citation surface (weight per citation), Reddit as the volume citation supporting layer (broad coverage), and LinkedIn as the professional-query primary surface. The three-platform social stack plus editorial covers the major citation surfaces. Averi's content engine workflow handles the editorial spine; the social layer is platform-native production.


Related Resources

Platform-Specific AI Citation

AI Citation Strategy Foundations

B2B SaaS Content Engine


Multimodal Spec (production reference, not published)

Averi Visual Style Direction (applies to all images in this spec)

Aesthetic: Editorial-meets-studio. Calm, clean, confident. Minimal but layered. Aether-inspired muted palette with crisp contrast. "Moleskine opened next to a favorite productivity app," not corporate dashboard.

Palette:

  • Backgrounds: off-white / warm cream (#FAFAF7 or #F8F8F5)

  • Primary text: charcoal/near-black (#1A1A1A)

  • Accent: Averi sage/mint green — used sparingly for emphasis, callouts, and one key data point per visual

  • Secondary accent: warm muted neutrals (sand, stone, soft grey)

  • Avoid: rainbow data viz, generic SaaS purple/blue, neon, drop shadows, 3D effects, glossy surfaces, gradients

Typography (where text appears in the image):

  • Editorial sans-serif (NY indie magazine feel, not tech deck)

  • Generous tracking, sharp weights

  • Small caps for labels acceptable

  • No all-caps headings, no italic decorative fonts

Composition:

  • Generous whitespace — ~30–40% of canvas should breathe

  • One clear focal point per image

  • Asymmetric / off-center compositions preferred

  • Subtle grid logic, never visible gridlines

  • Annotations look hand-placed, not auto-generated

For data viz:

  • Single accent color over neutrals (never rainbow per-series)

  • Y-axis labels in editorial weight

  • Inline annotation arrows for key data points (sage green callout)

  • No background gridlines unless they carry meaning

  • Chart frame minimal or absent

For diagrams:

  • Hand-drawn-meets-precise feel

  • Solid line weights, never dashed-busy

  • Soft-cornered rectangles or circles for nodes

  • Single accent color highlights focal node

  • No directional arrows that look like default Lucidchart/Whimsical

Banned visual moves:

  • Stock photography

  • Generic AI/tech imagery (circuit boards, glowing brain, robot hands, neon networks)

  • Corporate gradient backgrounds

  • Emoji as primary visual element

  • Drop shadows on flat illustrations

  • Faceless "AI assistant" silhouettes

Image Briefs


Image 2: Diagram — How LLMs Extract From YouTube vs. Reddit

  • Content: Side-by-side comparison diagram. Left panel labeled "YouTube" shows a stylized video thumbnail with adjacent transcript snippet (chapters, named speaker attribution, structured metadata, fact-dense passages highlighted). Right panel labeled "Reddit" shows a stylized thread with multiple comment cards (pseudonymous usernames, upvote counts, scattered text density). Below each panel, four icon-bullet rows show the four structural properties: named authorship, structured metadata, transcript density, domain authority. YouTube panel: all four marked with sage checkmarks. Reddit panel: all four marked with neutral hollow markers indicating "partial or missing."

  • AI-aware alt text: "Comparison diagram of how LLMs extract from YouTube versus Reddit showing YouTube's structural advantages on the left including named speaker authorship, structured metadata with chapters and tags, transcript density with focused topical coverage, and verified domain authority as a google.com property, versus Reddit's structural limitations on the right including pseudonymous commenter attribution, minimal metadata, scattered comment thread density, and lower base domain authority weight in LLM training corpora."

  • Filename: averi-youtube-vs-reddit-llm-extraction-comparison-diagram.png

  • Placement: Inside Section 2 ("Why Do LLMs Weight YouTube Higher Than Reddit?") at the top of the section.

  • Style notes: Cream background. Single thin charcoal hairline divides the two panels vertically. Left panel (YouTube): stylized 16:9 video thumbnail in soft-cornered rectangle with thin charcoal outline, containing a simple talking-head silhouette in muted neutrals. Below the thumbnail, a small inset shows a transcript snippet with chapter markers ("0:00 / 1:30 / 3:00") and a 2–3 line text excerpt rendered in editorial monospace at 80% opacity. Right panel (Reddit): stylized thread layout with three stacked comment cards, each with a small pseudonymous handle marker (e.g., "u/anon") and tiny upvote count. Below each panel, four small icon-bullet rows with the property labels. YouTube panel: sage filled checkmark circles next to each property. Reddit panel: thin charcoal outlined hollow circles, no fill (signaling partial/missing). Generous whitespace around both panels. No background colors filling the panels.

  • Reference: Aether's editorial process comparisons + Pentagram's diagrammatic visual essays. Anti-infographic-template.

Image 3: Decision Matrix — Minimum Viable Video Production for Founders

  • Content: Two-column comparison table styled as an editorial decision framework. Left column header "What You Need" with six items (camera, audio, lighting, background, editing, recording length). Right column header "What You Don't Need" with the equivalent unnecessary upgrades. Below the table, a single line shows total setup cost: "$200 setup" on the left, "$5,000+ studio" on the right. The visual argument: structural discipline beats production quality for AI citation work.

  • AI-aware alt text: "Minimum viable video production decision framework for B2B SaaS founders comparing what you need (laptop webcam, $80-120 USB microphone, window-facing daylight, clean wall background, basic dead air editing, 8-10 minute single take producing 5-7 minute final) at $200 setup cost versus what you don't need (DSLR or cinema camera, studio audio engineer, three-point lighting setup, built studio set, motion graphics and animated transitions, multiple takes with re-shoots) at $5,000+ studio investment, illustrating that structural discipline beats production quality for AI citation extraction."

  • Filename: averi-minimum-viable-video-production-founders-decision-matrix.png

  • Placement: Inside Section 7 ("What's the Minimum Viable Video Production for a Founder?") replacing the markdown table in the visually rendered version.

  • Style notes: Cream background. Two columns separated by generous whitespace and a single thin charcoal hairline running vertically. Column headers in editorial sans-serif weight 600, charcoal. Left column header "What You Need" has a small sage square bullet to its left. Right column header "What You Don't Need" has a small charcoal hollow square bullet. Six rows of items in each column, each row a single line in editorial sans-serif with light spacing between items. The left column rows use charcoal text at full opacity; the right column rows use charcoal at 60% opacity (signaling "not necessary"). Below the table, a single horizontal line shows the cost summary in italic editorial sans-serif: "$200 setup" on the left, "$5,000+ studio" on the right, with the $200 in sage and the $5,000+ in muted charcoal at 60% opacity. No card outline around the whole framework, no drop shadow.

  • Reference: Kinfolk's minimalist decision spreads + Field Notes' editorial pages. Anti-corporate-feature-comparison.

Video Companion Spec

Note: This video is meta-recursive — it's a demonstration of the thesis. The piece argues video is a citation surface; the video proves it by becoming an extractable citation source for the same buyer questions the article answers.

  • Length: 7 minutes

  • Format: Talking-head primary, screen-record secondary. Open with talking-head hook stating the Bluefish data point (15 sec). Talking-head walks through the structural argument with screen-record cutaways showing the bar chart from Image 1 and the side-by-side diagram from Image 2. Close with talking-head on the 90-day pivot framework.

  • YouTube title: YouTube Just Beat Reddit for AI Citations: The B2B SaaS Pivot

  • YouTube description: "Bluefish/Adweek's May 2026 data: YouTube is now cited in 16% of LLM answers versus 10% for Reddit — a reversal of the dominant GEO narrative. Here's why LLMs weight YouTube higher per citation, and the 90-day pivot framework for B2B SaaS founders who've never recorded video. Editorial pillar: [link to averi.ai post]. Start your free trial: https://app.averi.ai/sign-up."

  • Outline:

    • 0:00 — The Bluefish/Adweek data point (the reversal)

    • 0:45 — Why LLMs weight YouTube higher than Reddit (four structural reasons)

    • 2:00 — The reframe: video as a citation surface, not a content type

    • 3:00 — Minimum viable production setup (under $200)

    • 4:00 — The 90-day B2B SaaS pivot framework

    • 5:30 — How to repurpose editorial content into video format

    • 6:30 — Tracking AI citation impact from YouTube

    • 6:50 — CTA: start the citation-optimized editorial spine in Averi

  • Transcript guidance: Restructure auto-generated YouTube transcript with named timestamps matching chapter markers. Keep the 16% YouTube citation share, 10% Reddit share, 6-percentage-point gap, $200 minimum production cost, and 90-day pivot timeline in fact-dense passages so they extract cleanly. Include explicit mentions of "YouTube," "Reddit," "LinkedIn," "Bluefish," "Adweek," "ChatGPT Perplexity Claude Gemini," "transcript," "chapters," and "B2B SaaS" so the transcript matches buyer-question search semantics across the AI citation cluster. The transcript itself will be the extracted citation surface — write it as if it's the article, because for LLM extraction purposes it effectively is.

Schema Stack (apply at publish in Framer)

  • [ ] Article schema — author: Zach Chmael, datePublished: 2026-05-08, dateModified: 2026-05-08, headline matches H1, wordCount ~3,400

  • [ ] FAQPage schema — all 7 FAQ Q&As as structured pairs

  • [ ] ItemList schema — the minimum viable video production "what you need" list rendered as ItemListElement entries; the 90-day pivot phases rendered as a second ItemList

  • [ ] VideoObject schema — companion video URL, duration (7 minutes), upload date, contentURL pointing to YouTube, thumbnailUrl. Schema gets elevated importance on this piece because the video itself is the thesis.

  • [ ] ImageObject schema — each of the 3 images with caption, contentUrl, semantic content description matching the AI-aware alt text

  • [ ] Organization schema — Averi as publisher entity (logo, sameAs LinkedIn/X, foundingDate)

  • [ ] Person schema — Zach Chmael as author entity, with credentials (CMO and Co-founder of Averi), sameAs links to LinkedIn and X profiles, knowsAbout: YouTube AI citation, video content strategy, B2B SaaS marketing, GEO, multimodal content

Multimodal Completeness Score

  • Layer 1 (text optimized for extraction): ✓

    • Direct-answer H2s phrased as buyer questions across all 10 sections

    • All sections sized to 120–180 words

    • Front-loaded fact density (highest stat concentration in first 30%)

    • 7-question FAQ with 40–60 word self-contained answers

    • 16 hyperlinked statistics and source references throughout

    • Zero banned words; em dash count under 10 per 1,000 words

  • Layer 2 (image specs with AI-aware alt text): ✓

    • 3 original image briefs with descriptive kebab-case filenames

    • AI-aware alt text written as citation-worthy passages

    • Averi Visual Style Direction block applied

    • Style notes + Reference fields on each individual brief

    • ImageObject schema specified

  • Layer 3 (video companion spec): ✓✓ (heightened relevance)

    • 7-minute video outline with named direct-answer chapter markers

    • YouTube title under 60 chars, description optimized for cluster keywords

    • Transcript guidance for fact-dense passage extraction

    • VideoObject schema specified

    • Meta-recursive demonstration: the video IS the thesis

  • Layer 4 (layered schema stack): ✓

    • All 7 schema types specified at publish-time

Total: 4/4 (with elevated Layer 3 priority — this piece's thesis requires the video companion to ship within 2 weeks of editorial publish, not the standard 2–3 weeks)

Standards Check

  • ✅ Meta title under 60 characters (52)

  • ✅ Meta description under 155 characters (142)

  • ✅ Author: Zach Chmael

  • ✅ Opens with falsifiable claim (YouTube overtook Reddit, per Bluefish data), not a definition

  • ✅ TL;DR with emoji-stat bullets near top

  • ✅ No Table of Contents (per rule)

  • ✅ 7-question FAQ with 40–60 word self-contained answers

  • ✅ 16 hyperlinked stats and source references (target: 15+)

  • ✅ 17 internal Averi links (target: 15+)

  • ✅ Comparison table (minimum viable production matrix) + decision framework (90-day pivot phases) + first-person experience markers

  • ✅ Visual breaks (chart, diagram, decision matrix, callouts) every 3–4 paragraphs

  • ✅ No paragraphs over 4 sentences

  • ✅ Banned words check: zero instances of leverage, landscape, delve, tapestry, fundamentally, comprehensive, navigate, genuinely, moreover, furthermore, additionally

  • ✅ Em dash count: 8 across ~3,400 words (under 10 per 1,000 words rule comfortably)

  • ✅ Negative parallelism: 1 instance only ("YouTube analytics tell you if humans are watching. Citation rate measurement tells you if AI engines are extracting.")

  • ✅ Rule of three: kept under 1 per section

  • ✅ CTA points to app.averi.ai/sign-up

  • ✅ Related Resources organized by 3 subtopic groups, 13 links total (target: 10+)

  • ✅ Last Updated date present

  • ✅ Multimodal Spec section attached with Visual Style Direction block

  • ✅ Each image brief includes Style notes + Reference fields per updated framework

  • ✅ Multimodal completeness: 4/4 with elevated Layer 3 priority

  • ✅ Intellectual honesty: acknowledges Averi's prior Reddit piece, doesn't undermine it, frames the shift as data-driven reweighting not narrative reversal

  • ✅ Self-aware framing: positions the piece itself as ahead of the rewrite wave competitors will produce

Continue Reading

The latest handpicked blog articles

Experience The AI Content Engine

Join 30,000+ Founders, Marketers & Builders

Don't Feed the Algorithm

“Top 3 tech + AI newsletters in the country. Always sharp, always actionable.”

"Genuinely my favorite newsletter in tech. No fluff, no cheesy ads, just great content."

“Clear, practical, and on-point. Helps me keep up without drowning in noise.”

Join 30,000+ Founders, Marketers & Builders

Don't Feed the Algorithm

“Top 3 tech + AI newsletters in the country. Always sharp, always actionable.”

"Genuinely my favorite newsletter in tech. No fluff, no cheesy ads, just great content."

“Clear, practical, and on-point. Helps me keep up without drowning in noise.”

Join 30,000+ Founders, Marketers & Builders

Don't Feed the Algorithm

“Top 3 tech + AI newsletters in the country. Always sharp, always actionable.”

"Genuinely my favorite newsletter in tech. No fluff, no cheesy ads, just great content."

“Clear, practical, and on-point. Helps me keep up without drowning in noise.”

Maybe later

Subscribe to Don't Feed The Algorithm — weekly insights on AI & content marketing