Step 1 of 8

The AI crawler landscape

Before you touch a single robots.txt line, you need to know who’s actually visiting and why. The three categories don’t share access consequences — blocking the wrong one closes off citation while leaving training open, which is the most expensive mistake you can make in this layer.

What to read for this step

AI Crawlers
Wiki
ChatGPT-User
Wiki · coming soon

After this step, you should be able to answer

Name the major AI crawlers and identify each one's User-Agent
How can an engine cite you without ever crawling you?
Which crawlers train models, and which fetch your page at answer time?