This panel fetches the site's `robots.txt`, tests whether major bots can reach the page, checks for firewall blocking, and reports on LLM bot access.
## What it shows
### Robots.txt summary
The panel fetches and parses the site's `robots.txt` file, then tests it against major search engine bots:
| Bot | Search engine |
|---|---|
| Googlebot | Google |
| Bingbot | Bing |
| Slurp | Yahoo |
| DuckDuckBot | DuckDuckGo |
| Baiduspider | Baidu |
| YandexBot | Yandex |
For each bot, the panel reports:

- Allowed or Blocked status
- The specific `robots.txt` rules that apply
- Which user-agent group matched
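A minimal version of this rules check can be sketched with Python's standard `urllib.robotparser`; the rules below are hypothetical, chosen only to show per-bot matching:

```python
from urllib.robotparser import RobotFileParser

# Hypothetical rules for illustration: Bingbot is blocked entirely,
# every other bot is blocked only from /private/.
RULES = """\
User-agent: Bingbot
Disallow: /

User-agent: *
Disallow: /private/
"""

def check_bot(rules: str, bot: str, path: str) -> bool:
    """Return True if `bot` may fetch `path` under `rules`."""
    parser = RobotFileParser()
    parser.parse(rules.splitlines())
    return parser.can_fetch(bot, path)

for bot in ["Googlebot", "Bingbot", "Slurp"]:
    status = "Allowed" if check_bot(RULES, bot, "/page.html") else "Blocked"
    print(f"{bot}: {status}")
```

Here Bingbot matches its dedicated user-agent group and is Blocked, while Googlebot and Slurp fall through to the `*` group and are Allowed for `/page.html`.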
### Firewall status
The panel tests whether bots are blocked at the server/firewall level — before `robots.txt` even applies. This catches cases where a WAF (Web Application Firewall) or CDN blocks crawler traffic entirely.
For each bot, you’ll see:
- HTTP status code returned
- Whether the request was allowed or blocked
- The blocking reason, if applicable
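A firewall-level check like this amounts to sending a plain HTTP request with a bot User-Agent and inspecting the status code. The sketch below uses only the standard library; the blocked-status set and helper names are assumptions, not the panel's actual logic:

```python
import urllib.error
import urllib.request

# Status codes commonly returned when a WAF/CDN rejects bot traffic.
# This set is an assumption for this sketch.
BLOCKED_CODES = {401, 403, 429, 503}

def classify_status(code: int) -> str:
    """Map an HTTP status code to an allowed/blocked verdict."""
    return "blocked" if code in BLOCKED_CODES else "allowed"

def probe(url: str, user_agent: str) -> tuple[int, str]:
    """Fetch `url` with a bot User-Agent and classify the response."""
    req = urllib.request.Request(url, headers={"User-Agent": user_agent})
    try:
        with urllib.request.urlopen(req, timeout=10) as resp:
            code = resp.status
    except urllib.error.HTTPError as exc:
        # A WAF block typically surfaces here as 403/429/503.
        code = exc.code
    return code, classify_status(code)

# Example call (network access required):
# probe("https://example.com/", "Mozilla/5.0 (compatible; Googlebot/2.1)")
```

A real tester would compare the response against one made with a regular browser User-Agent, since a site that returns 403 to everyone is down, not bot-blocking.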
### LLM bot access

A dedicated section tests whether AI and LLM crawlers can access the site:

- GPTBot (OpenAI)
- Claude-Web (Anthropic)
- Other configured AI crawlers
Sites can publish `robots.txt` rules to control AI training and retrieval crawlers.
LLM bot rules are separate from search engine bot rules. A site can allow Googlebot while blocking GPTBot, or vice versa.
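That split can be expressed directly in `robots.txt`. A sketch that allows Googlebot while blocking the AI crawlers listed above (bot tokens as published by their operators):

```
# Allow a traditional search crawler
User-agent: Googlebot
Allow: /

# Block AI crawlers
User-agent: GPTBot
Disallow: /

User-agent: Claude-Web
Disallow: /
```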
## Export
Export the full crawling report — including all bot statuses, matched rules, and firewall results — as a CSV file.

## Common issues to look for
- Accidentally blocked bots — `Disallow: /` applying to important crawlers
- Overly broad rules — blocking entire directories that contain indexable content
- Firewall blocking crawlers — security rules that reject requests with bot user-agents
- Missing robots.txt — no file at all, which means all bots are allowed by default
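The last point — missing `robots.txt` means default-allow — can be confirmed with the standard library by parsing zero lines of rules:

```python
from urllib.robotparser import RobotFileParser

# Simulate a site with no robots.txt at all by parsing an empty rule set.
parser = RobotFileParser()
parser.parse([])

# With no rules, can_fetch() permits every bot on every path.
print(parser.can_fetch("Googlebot", "/any/path"))
print(parser.can_fetch("GPTBot", "/any/path"))
```

Both calls return `True`: an absent `robots.txt` is equivalent to allowing everything, so "no file" is only a problem if you intended to restrict crawlers.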