Question 1

How does a robots.txt tester decide allowed vs blocked?

Accepted Answer

It parses the file into user-agent groups, picks the group whose user-agent best matches the bot you entered (an exact or prefix match beats the catch-all *), then evaluates that group's Allow and Disallow rules against the path. The most specific rule — the one with the longest matching pattern — wins. If an Allow and a Disallow match with equal length, the Allow wins. If no rule matches, the path is allowed.

Question 2

What do the * and $ characters mean in a rule?

Accepted Answer

An asterisk (*) matches any sequence of characters, so Disallow: /*.pdf matches any path containing .pdf. A dollar sign ($) anchors the end of the path, so Disallow: /*.pdf$ only matches paths that end in .pdf. Both are supported by Google and Bing. Without a $, a rule matches by prefix: Disallow: /admin blocks /admin, /admin/, and /administrator.

Question 3

Why does Allow beat Disallow when both match?

Accepted Answer

That is Google's documented tie-break: when an Allow rule and a Disallow rule match a URL with the same number of characters, the less restrictive rule (Allow) wins. It lets you carve out exceptions — for example Disallow: /admin/ with Allow: /admin/public/ blocks the admin area but keeps the public sub-folder crawlable. This tester follows the same rule.

Question 4

Does the user-agent I enter matter?

Accepted Answer

Yes. robots.txt can define different rules per crawler. The tester selects the most specific matching group: if you enter Googlebot and the file has a Googlebot group, those rules apply; otherwise the User-agent: * group is used. Try the same path with * and with a named bot to see how the verdict changes.

Question 5

Can I paste a full URL instead of a path?

Accepted Answer

Yes. If you paste a full URL like https://example.com/admin/settings, the tester extracts the path and query string and tests those. robots.txt rules only apply to the path and query, not the scheme or host, so the host part is ignored.

Question 6

Does a blocked path guarantee the page won't appear in Google?

Accepted Answer

No. Disallow stops crawling, but a URL that is linked from other sites can still be indexed without its content. To keep a page out of search results, allow it to be crawled and add a noindex meta tag (or X-Robots-Tag header) instead of blocking it in robots.txt.

Question 7

What counts as the most specific rule?

Accepted Answer

Specificity is measured by the length of the rule's path pattern. Disallow: /admin/reports/ (16+ characters) is more specific than Disallow: /admin/ (8 characters), so for the path /admin/reports/q1 the longer rule decides the outcome. Wildcards and the end-anchor are counted as part of the pattern length.

Question 8

Does this tester send my robots.txt anywhere?

Accepted Answer

No. Parsing and matching run as JavaScript in your browser. Nothing is uploaded or stored, so you can safely test rules from a production or internal site.

Robots.txt Tester

How robots.txt matching works

Common mistakes this tester catches

Frequently asked

Robots.txt Tester

Robots.txt path tester

How robots.txt matching works

Common mistakes this tester catches

Frequently asked