
Chunking (Passage Retrieval)

Chunking is the process by which AI engines slice web pages into smaller, semantically coherent passages — typically a few hundred tokens each — that can be independently indexed, retrieved, and cited.

What is Chunking (Passage Retrieval)?

Chunking is the unglamorous, structural process that determines whether your content has any chance of being retrieved by an AI engine. Before any large language model generates an answer, the source content it might draw from has been broken down into smaller units — chunks — that are individually embedded, indexed, and made retrievable. A single web page is rarely retrieved as a whole; instead, it is split into anywhere from a handful to several dozen passages, each of which competes independently for relevance to a user query. The chunk, not the page, is the atomic unit of AI retrieval — and once you internalize that, a great deal of GEO strategy stops being abstract and becomes concrete.

The mechanics matter. Chunking strategies vary across engines and indexing pipelines, but most use some combination of fixed token windows (for example, 256 to 512 tokens per chunk), semantic boundary detection (splitting at paragraph or section breaks), and overlap (where chunks share some content with their neighbors to preserve context). The output of this process is a collection of self-contained passages, each tagged with metadata about its source page, position, and surrounding structure. When a user query is processed, the engine retrieves chunks — not URLs — and the language model composes its answer from the chunks that scored highest on semantic relevance to the query.
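
To make these mechanics concrete, here is a minimal sketch of a fixed-window chunker with overlap, in Python. It is illustrative only: the whitespace tokenizer stands in for a real subword tokenizer, and the window and overlap sizes simply mirror the ranges mentioned above.

    # Minimal illustration of fixed-window chunking with overlap.
    # Whitespace "tokens" stand in for subword tokens; real pipelines
    # also detect semantic boundaries rather than cutting mid-paragraph.
    def chunk_page(url: str, text: str, window: int = 512, overlap: int = 64) -> list[dict]:
        tokens = text.split()
        step = window - overlap  # neighboring chunks share `overlap` tokens of context
        chunks = []
        for position, start in enumerate(range(0, len(tokens), step)):
            passage = " ".join(tokens[start:start + window])
            # Each chunk carries metadata tying it back to its source page.
            chunks.append({"source": url, "position": position, "text": passage})
            if start + window >= len(tokens):
                break
        return chunks

Each resulting passage then competes independently for relevance; the metadata is what lets the engine cite the source page when a chunk wins.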

The strategic implication for content creators is direct and counterintuitive: writing long, flowing prose that builds an argument across many paragraphs may be excellent for a human reader but actively harmful for AI retrieval. If the answer to a likely user question is spread across three paragraphs that depend on each other, no single chunk will fully contain the answer, and the page may simply not surface. By contrast, a page that opens each section with a self-contained, BLUF-style answer — followed by supporting context — produces chunks that each carry retrievable, citable value on their own. This is why FAQ pages, structured comparison tables, and definition-led entries (exactly like the one you are reading) tend to dominate AI citations far in excess of their traditional SEO weight.

Chunking is also why technical content extractability — clean HTML, semantic headings, structured data, and proper paragraph breaks — translates so directly into AI visibility outcomes. A page where the HTML structure mirrors the logical structure of the content gives the chunker clean boundaries to split on, producing coherent, self-contained passages. A page built as one undifferentiated block of text, or as a JavaScript-rendered single-page application with no clear DOM structure, gives the chunker nothing to work with and produces fragmented, low-relevance passages. Two pages with identical word-for-word content can therefore have radically different AI visibility outcomes purely on the basis of how they are structured for chunking.
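
The connection between markup and chunk quality is easy to demonstrate. The sketch below, which assumes the BeautifulSoup library, splits a page at its <h2> and <h3> boundaries; how much structure the splitter recovers depends entirely on how much structure the HTML exposes.

    # Sketch: deriving passage boundaries from semantic headings.
    # Assumes beautifulsoup4 (pip install beautifulsoup4); real indexing
    # pipelines are more elaborate, but the principle holds:
    # clean heading boundaries -> clean chunk boundaries.
    from bs4 import BeautifulSoup

    def split_on_headings(html: str) -> list[dict]:
        soup = BeautifulSoup(html, "html.parser")
        sections, current = [], {"heading": None, "text": []}
        for el in soup.find_all(["h2", "h3", "p", "li"]):
            if el.name in ("h2", "h3"):
                if current["text"]:
                    sections.append(current)
                # A heading opens a new, self-contained section.
                current = {"heading": el.get_text(strip=True), "text": []}
            else:
                current["text"].append(el.get_text(strip=True))
        if current["text"]:
            sections.append(current)
        return sections

Run this on a well-structured page and you get one coherent section per topic; run it on a page with no headings and everything collapses into a single oversized section with no heading, which is exactly the failure mode described above.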

Why it matters

Key points about Chunking (Passage Retrieval)

1. The chunk — not the page or the URL — is the atomic unit of AI retrieval, meaning every paragraph and section on a page competes independently for visibility in AI-generated answers.

2. Self-contained passages dramatically outperform argument-style prose: an answer that lives entirely inside one chunk is retrievable, while an answer split across three dependent paragraphs may not surface at all.

3. HTML and structural quality directly affect chunking quality — clean semantic markup, proper headings, and clear paragraph breaks give the chunker coherent boundaries, while undifferentiated text blocks produce fragmented, low-value chunks.

4. BLUF-style writing, FAQ blocks, comparison tables, and definition-led sections are disproportionately effective for AI visibility precisely because they produce chunks that are individually complete and citable.

5. Two pages with identical content can have very different AI visibility outcomes based purely on how they are structured for chunking — making content architecture, not just content quality, a core GEO discipline.

Frequently asked questions about Chunking (Passage Retrieval)

How large is a typical chunk?
Most production AI retrieval systems use chunks in the range of 200 to 800 tokens, with 512 tokens being a common default. Some engines use much smaller passages (100 to 200 tokens) for high-precision retrieval, and some use larger ones (up to 1,500 tokens) when context preservation matters more than retrieval granularity. The exact size is rarely disclosed by AI engines, but the strategic implication is constant: each section of your content should make sense at roughly paragraph-to-short-section scale.
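
If you want a rough sense of whether your sections fall in that range, an open tokenizer gives a usable estimate. The sketch below assumes the tiktoken library and its cl100k_base encoding; exact counts vary by tokenizer, but the order of magnitude is what matters.

    # Rough sanity check: is each section chunk-sized?
    # Assumes tiktoken (pip install tiktoken); counts vary by tokenizer.
    import tiktoken

    enc = tiktoken.get_encoding("cl100k_base")

    def token_count(passage: str) -> int:
        return len(enc.encode(passage))

    # A self-contained section should land roughly in the 200-800 range.
    print(token_count("Chunking is the process by which AI engines slice web pages into smaller passages."))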
Do AI engines use the same chunking strategy?
No — chunking strategies vary across engines, retrieval pipelines, and even across query types within the same engine. Perplexity, Google AI Overviews, ChatGPT Search, and Gemini all use different chunking and embedding approaches, and these are continuously tuned. The practical takeaway is that you cannot optimize for one specific chunk size; instead, write content that produces coherent, self-contained passages at multiple scales.
Can I control how my content gets chunked?
Indirectly, yes. You cannot dictate chunk boundaries to an AI engine, but you strongly influence them through HTML structure, heading hierarchy, paragraph length, list formatting, and structured data. A page with semantic HTML, clear <h2> and <h3> boundaries, well-bounded paragraphs, and consistent FAQ or definition patterns will be chunked far more cleanly than one without — and the resulting chunks will carry more retrievable value.
How does Chunking relate to embeddings?
Chunking comes first; embeddings come second. The page is split into chunks, each chunk is then converted into an embedding (a high-dimensional vector representing its meaning), and those embeddings are what get stored in the retrieval index. When a user query arrives, the query is also embedded and matched against the chunk embeddings via vector search. Bad chunking produces incoherent embeddings; good chunking produces clean, semantically focused embeddings that retrieve well.
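
A toy version of that pipeline fits in a few lines. The sketch below assumes the sentence-transformers library and the open all-MiniLM-L6-v2 model; production engines use proprietary embedding models, but the chunk, embed, match sequence is the same. The chunks here are hypothetical one-sentence passages.

    # Toy chunk -> embed -> retrieve pipeline.
    # Assumes sentence-transformers (pip install sentence-transformers).
    from sentence_transformers import SentenceTransformer, util

    model = SentenceTransformer("all-MiniLM-L6-v2")

    chunks = [
        "Chunking splits a page into passages of a few hundred tokens each.",
        "Embeddings map each passage to a vector so meaning can be compared.",
        "BLUF writing puts the answer in the first sentence of a section.",
    ]
    chunk_vectors = model.encode(chunks)   # one vector per chunk

    query_vector = model.encode("how do AI engines break pages into passages?")
    scores = util.cos_sim(query_vector, chunk_vectors)[0]  # cosine similarity

    best = int(scores.argmax())
    print(chunks[best], float(scores[best]))

Note that what comes back is a chunk, not a URL; the metadata attached at chunking time is what connects the winning passage to its source page.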
Does Chunking apply to PDFs and other document formats?
Yes. AI engines that index PDFs, Word documents, and other formats apply chunking to those as well, often using format-specific heuristics (page breaks, section headings, table boundaries). The same principles apply: well-structured documents with clear sections and self-contained passages chunk cleanly and surface in AI answers; long, undifferentiated documents do not. This is particularly relevant for B2B brands publishing whitepapers, research reports, and technical documentation.

Related terms

BLUF (Bottom Line Up Front)

A content structuring principle originating from military communication that places the most critical information — the conclusion, recommendation, or key takeaway — in the opening sentence or paragraph, ensuring that readers and AI extraction systems capture the essential message even if they process nothing else.

Content Extractability

Content extractability measures how easily AI engines can identify, isolate, and cite specific pieces of information from your web content — determined by factors including BLUF structure, heading hierarchy, clean HTML, citable claims, FAQ blocks, and the separation of distinct ideas into parseable units that AI retrieval systems can process and quote.

Embeddings (Vector Search)

Embeddings are mathematical representations of text — high-dimensional vectors in which semantically similar concepts cluster together — that allow AI engines to retrieve content based on meaning rather than exact keyword matches.

RAG (Retrieval-Augmented Generation)

Retrieval-Augmented Generation (RAG) is the mechanism by which AI engines fetch real-time information from the web, databases, or document repositories and inject it into the language model's context window before generating an answer — enabling AI systems like Perplexity, Google AI Overviews, and ChatGPT with browsing to produce responses grounded in current, source-backed data rather than relying solely on static training knowledge.


Want to measure your AI visibility?

Our AI Visibility Intelligence Platform analyzes your brand across ChatGPT, Perplexity, Gemini, Claude, and Grok — and turns these concepts into actionable scores.