If you are a service business and your goal is citation: allow everything. There is no business model in blocking — citations drive brand, brand drives leads.
If you are a publisher monetizing content directly: typical policy is allow live-retrieval bots (OAI-SearchBot, Google-Extended for AI Overviews, PerplexityBot), block training-only bots (GPTBot, ClaudeBot, CCBot). This lets you be cited in real-time answers while preserving content for paid licensing deals.
Whatever you choose, document the policy and revisit quarterly. The bot landscape is shifting fast.
No. Blocking GPTBot today does nothing about content that was already trained on. It only affects future model versions.
Some publishers do, but adoption by AI engines is inconsistent. Robots.txt is the more reliable signal.
Use Google's robots.txt tester for Google bots, manually test for the others. Our AI Citability Checker also tests for known AI-bot blocks.
Yes — the EU AI Act includes provisions on training transparency. As of April 2026, no jurisdiction has mandated bot-policy formats, but expect regulation in 2026-2027.