🤖 SEO Tool · Free Forever

Robots.txt Tester

Paste or fetch your robots.txt, then test any URL path against any crawler. See the exact rule that allows or blocks access — including wildcards.

✓ Googlebot & GPTBot ✓ Wildcard support (* and $) ✓ Fetch from URL ✓ No sign-up
robots.txt content
Website URL

Enter your domain — we'll fetch yourdomain.com/robots.txt automatically via our server. Works for any public website.

0
User-agents
0
Allow rules
0
Disallow rules
0
Sitemaps
Test a URL Path
User-agent
User-agent name
+
URL path to test
Parsed Rules by User-agent
Sitemaps Found

Frequently Asked Questions

What is robots.txt and why does it matter?
robots.txt is a text file at the root of your website that tells search engine crawlers which pages they can or cannot visit. It follows the Robots Exclusion Protocol. Misconfigured robots.txt can accidentally block Googlebot from indexing your entire site — which is why testing is critical before deploying changes.
How does path matching work with wildcards?
The asterisk (*) matches any sequence of characters. For example, Disallow: /*.pdf$ blocks all URLs ending in .pdf. The dollar sign ($) anchors the match to the end of the URL. The most specific (longest) matching rule wins. If an Allow and Disallow rule are equal length, Allow takes precedence.
What does an empty Disallow mean?
"Disallow:" with no value means "allow everything" — it's effectively the same as no rule. This is often used as a shorthand to explicitly permit all crawling: User-agent: * / Disallow:
What is GPTBot and should I block it?
GPTBot is OpenAI's crawler used to train AI models like ChatGPT. You can block it with: User-agent: GPTBot / Disallow: /. Similarly, CCBot (Common Crawl) and other AI crawlers can be blocked this way. This does not affect your Google rankings.
Does robots.txt prevent pages from appearing in Google search?
Blocking a page in robots.txt prevents Googlebot from crawling it, but Google can still index the URL if other pages link to it — it just won't see the content. To fully remove a page from search results, use a noindex meta tag or the URL Removal Tool in Google Search Console.