What page sections do LLMs cite most often?

Based on 50+ LLMRadar audit runs, the highest-citation section types are: H2 headers with definitional framing, comparison tables on 3+ factual dimensions, and numbered step sequences where each step starts with a verb. Hero copy and testimonials rarely appear.

Why does Claude ignore my homepage hero copy?

Hero copy gives LLMs nothing extractable. Phrases like We help teams move faster with AI have no verifiable claim, no comparison, no step. LLMs cite content they can use to answer a question. Definitional H2 headers and comparison tables perform much better.

How do I get my B2B website cited by Claude and Perplexity?

Three changes make the biggest difference: (1) Rewrite your most important H2 headers as definitions or questions. (2) Add a comparison table comparing your product against alternatives on 3+ factual dimensions. (3) Move your definition to the first sentence under each H2. Run a LLMRadar audit to see which sections are being pulled.

The Page Sections Claude and Perplexity Pull From Most When Citing B2B Brands

TL;DR: Across 50+ LLMRadar audit runs spanning Claude, ChatGPT, Perplexity, and Gemini, four page section types get cited consistently: H2 headers with definitional framing, comparison tables, numbered step sequences, and FAQ blocks where the question matches a real user query. Hero copy, testimonials, and feature bullet lists almost never appear in LLM outputs. Here is what the pattern looks like, and how to run a 20-minute version of this audit on your own pages.

Key takeaways

Definitional H2 headers ("What is X?" / "How does X work?") pull at the highest rate. LLMs use them to answer questions directly.
Comparison tables are the most reliably cited asset type. They pack factual structure into a small space an LLM can excerpt without parsing surrounding prose.
Hero copy and taglines rarely get cited. "We help teams move faster with AI" tells the LLM nothing extractable.
FAQ blocks work when the questions match actual user queries. Verbatim phrasing matters more than question count.
The fastest fix: add one definitional H2 and one comparison table to every page that is not currently showing up in LLM outputs.

The 6 section types we tracked (and how)

"Why does Claude pull from one section of my page and completely ignore everything else?"

A SaaS marketing lead sent us that question after running a LLMRadar audit. Her page had 900 words of copy. Claude was citing 47 of them across every query variation. The rest of the page was invisible.

To understand why, we coded every cited section from 50+ audit runs by section type. We ran each audit against 4 LLMs and 10 query variations per topic. Then we categorized where each cited excerpt came from.

Six section types covered the full range:

H2 header blocks -- the H2 heading plus the 1-2 sentences directly below it
Comparison tables -- any table comparing at least 2 options on 3 or more dimensions
Numbered step sequences -- ordered lists with at least 3 steps
FAQ blocks -- question-answer pairs, regardless of HTML implementation
Hero / above-fold copy -- tagline, USP statement, primary value proposition
Social proof -- testimonials, review quotes, case study prose

For each audit run, we logged which section types appeared in LLM outputs when prompted with topic questions. Here is what the distribution showed.

What gets cited most often: the 3 types that pull reliably

Definitional H2 headers. Pages with H2 headers phrased as definitions or questions pull more often than any other section type. The mechanism is straightforward: LLMs answer questions. A header phrased as a question gives the model a direct match signal.

The first sentence under the H2 matters just as much as the header itself. If the definition is buried in the third sentence, the model often skips the section entirely. The pattern that works:

What is LLM citation optimization?

LLM citation optimization is the practice of structuring web content so language models can extract, verify, and repeat it in response to user queries. It treats the LLM as a second reader alongside the human -- one that scans for extractable facts rather than engaging prose.

That structure -- H2 as a question, first sentence as a clean standalone definition, second sentence with one concrete elaboration -- is the highest-performing format across audits.

Comparison tables. Tables get pulled consistently because they pack structured facts into a small space the LLM can excerpt without parsing surrounding prose. We have seen tables cited verbatim in Perplexity outputs while the surrounding 600-word explanation went unmentioned.

The columns matter. Tables comparing options on price, time, or specific functional features get cited more than tables comparing on subjective attributes like "ease of use" or "flexibility."

Numbered step sequences. Ordered lists with at least 3 steps pull reliably when each step starts with a verb. "1. Load the trigger file. 2. Parse the YAML payload. 3. Delete the file after consuming." is citable. A 3-item list where each item is a 4-sentence paragraph is not. The step needs to read as a standalone instruction.

What almost never gets cited

Two section types show up in LLM outputs at very low rates, regardless of how much work went into them.

Hero copy. "We help B2B teams move faster with AI." That sentence does not get cited. There is nothing for the LLM to use. No claim it can verify. No comparison it can excerpt. No step it can follow.

Look -- the hero section is typically the most worked-on copy on any B2B website. And it is the least-cited section in nearly every audit we have run. The tradeoff between human persuasion and LLM extractability is sharpest here.

Social proof. Testimonial quotes and review excerpts almost never appear in LLM citations. LLMs are skeptical of first-person claims. "This saved me 10 hours a week" is a subjective report, not a verifiable fact. The model does not repeat it.

The exception: a testimonial that frames the result as a measurable outcome in the pattern "[Company X] achieved [specific number] after [specific action]." That format looks like data, and data gets cited.

The anatomy of a citable definition

Most definitional sections fail for one reason: the definition is buried behind setup.

Here is the pattern that does not work:

"Agentic AI is a term that has gained increasing attention in recent years. It refers to AI systems capable of taking autonomous actions to complete tasks without requiring step-by-step human instruction. Unlike traditional AI tools, agentic AI can plan, execute, and adapt over time."

And here is the citable version:

What is agentic AI?

Agentic AI is software that completes multi-step tasks autonomously, without a human specifying each action. A human sets a goal; the agent selects tools, runs steps, and reports results.

The citable version has four things the non-citable version lacks:

An H2 phrased as a question
The definition in the first sentence, not the third
No setup, no hedging, no qualifiers about "recent years"
A concrete example in the second sentence

One sentence of definition. One sentence of example. That is the target format.

The "recent years" construction is worth calling out. Temporal hedges ("increasingly," "in recent years," "as AI evolves") signal to the LLM that the content is framing a trend rather than stating a fact. Trend framing gets cited at a fraction of the rate that definitional framing does.

Why comparison tables are the most cited asset type

Here is what makes a table citable vs. not:

Factor	Gets cited	Rarely cited
Column type	Price, time, specific features, API limits	"Ease of use," "flexibility," "scalability"
Row count	3-7 rows with distinct options	1-2 rows (too thin for a real comparison)
Cell content	Specific values: "," "2 min," "Yes / No"	Prose descriptions: "handles most use cases"
Header phrasing	Verb-led: "Generates PDF," "Requires login"	Noun-only: "Output," "Auth"
Placement	After a definitional H2 section	Orphaned mid-page in long prose

The reason comparison tables work: an LLM searching for "X vs Y" can extract the relevant row without reading surrounding content. The table becomes a self-contained citable unit.

The LLMRadar Audit () checks your site across Claude, ChatGPT, Perplexity, and Gemini -- then maps exactly which sections are being pulled and which are invisible. If you want your own section-type breakdown before running the 20-minute audit below, that is what it produces.

A 20-minute audit you can run right now

Pick your 3 highest-traffic pages. For each one, run through these 5 checks:

Count your definitional H2 headers. How many are phrased as definitions or questions ("What is X?" / "How does Y work?" / "X vs Y: the 3 key differences")? If none are, rewrite the most important one first.
Check for a comparison table. Is there a table on this page comparing your product or approach against an alternative on 3 or more factual dimensions? If not, add one. Use specific values in each cell, not prose descriptions.
Check your first sentence under each H2. Is it a clean answer or is it setup? If you need more than one sentence before you get to the actual definition, cut to the answer in sentence one.
Prompt Claude directly. Ask: "Tell me about [your product / topic]." See if it uses your language. If it does not, you have a citation gap. If it does, note which section it pulled from.
Check your FAQ section. Do the questions match how real people ask about this topic? "How does X work?" matches a real user query. "What features does our platform include?" is internal framing that LLMs do not cite.

Those 5 checks take about 4 minutes per page. The fixes on the highest-gap page take another 10 minutes. Total: under 20 minutes, one sitting.

What this changes in your content calendar

The practical implication here is not "rewrite every existing page at once." That is a multi-month project. The implication is a filter for new content decisions.

Before drafting a new post or page, ask: does this topic naturally allow for a comparison table or a step sequence? If yes, the page has a strong citation surface. If the topic is primarily narrative ("why we believe X"), the citation surface is thin and you need to add structured elements deliberately.

Posts that compare options, define terms, or walk through steps generate more LLM citations than posts that argue a position. That does not mean you stop writing opinion pieces. It means you are more deliberate about adding one definitional block and one structured element to every piece, regardless of format.

Here is a before/after for a content calendar decision:

Topic	Default format	Citation-optimized version
Why we use Supabase over Firebase	Narrative opinion piece	Supabase vs. Firebase: 5 differences for agentic workloads [table + numbered list]
How we think about AI agent costs	Founder reflection	Claude Sonnet vs. Haiku: the per-agent cost matrix [table + per-row cost data]
What makes a good cold email	Tips list	What is a cold email linter? [definitional H2] + the 8 phrases that kill reply rate [numbered list]

The topic does not change. The structure does. And structure is what determines whether the content ends up in an LLM response six months from now.

The content that shows up in LLM responses is not always your best writing. It is your most extractable structure. That is a specific, fixable thing.

Next post: the prompts we use to test LLM citation coverage before a page goes live, and what happens when we run them on content that has been indexed for 6+ months without structured elements.

By OperatorIQ. We run LLM citation audits and build the AI infrastructure behind them. Questions: hello@operatoriq.io.