Google Will Ignore Robots.txt Rules If It Serves A 4xx Status Code

Jan 17, 2023 - 7:41 am 1 by

Lizzi Sassman Googlebot

Here is another PSA from Gary Illyes of Google. In short, if you serve a 4xx status code with your robots.txt file, then Google will ignore the rules you have specified in that file.

Why? Well, 4xx status codes means the document is not available, so Google won't check it because the server says it is not available. Gary said this because he received a complaint or two about Google not respecting the robots.txt rules.

Gary wrote on LinkedIn, "PSA from my inbox: if you serve your robotstxt with a 403 HTTP status code, all rules in the file will be ignored by Googlebot. Client errors (4xx, except 429) mean unavailable robotstxt, as in, a 404 and a 403 are equivalent in this case."

In short, make sure your robots.txt file serves a 200 status code and Google can access it.

Forum discussion at LinkedIn.

 

Popular Categories

The Pulse of the search community

Search Video Recaps

 
Video Details More Videos Subscribe to Videos

Most Recent Articles

Search Forum Recap

Daily Search Forum Recap: January 17, 2025

Jan 17, 2025 - 10:00 am
Search Video Recaps

Search News Buzz Video Recap: Google Search Volatility Cooling, AI Overviews Penalties, Maps Pin Hack Fix, Search Market Share & More

Jan 17, 2025 - 8:01 am
Google Ads

Scary Google Ads Phishing Scam

Jan 17, 2025 - 7:51 am
Google Ads

Google Ads Search Max Coming Soon?

Jan 17, 2025 - 7:41 am
Google Search Engine Optimization

Google Updates Examples Of Events & Estimated Salary Images In Structured Data Docs

Jan 17, 2025 - 7:31 am
Google

Google Testing AI Generated What People Are Saying

Jan 17, 2025 - 7:21 am
Previous Story: Google Search Perspectives & Opinions