Crawl Budget
Crawl budget is the number of pages a search engine is willing and able to crawl on your site within a given timeframe. It matters most for large sites, where wasted crawling means important pages get fetched late or not at all.
Crawl budget is the amount of crawling a search engine will spend on your site in a given period. Search engines do not have unlimited resources, so they ration how much they fetch from any one domain. For most small sites this is a non-issue, the engine crawls everything easily. For large sites with tens of thousands or millions of URLs, crawl budget becomes a real constraint on what gets discovered and refreshed.
Google frames crawl budget as two things working together. There is crawl capacity, how much your server can handle without slowing down or erroring, and crawl demand, how much Google actually wants to crawl based on how popular and how fresh your pages are. The budget that gets spent sits at the intersection of what your server can take and what Google cares to fetch.
Crawl budget is mostly a concern for large sites. If you have a few hundred pages, you almost certainly do not need to worry about it.
Who actually needs to care
- Large ecommerce sites with huge product catalogs and faceted navigation.
- Sites with millions of URLs generated by filters, parameters, or pagination.
- Big publishers where fresh content needs to be discovered quickly.
- Any site where important pages are getting crawled late or missed entirely.
- Sites that generate endless low-value URL variations the crawler keeps chasing.
Where crawl budget gets wasted
- Infinite URL parameter combinations from sorting, filtering, and session IDs.
- Duplicate content served under many URLs without canonical consolidation.
- Long redirect chains that force the crawler through multiple hops.
- Soft 404s and error pages that the crawler keeps revisiting.
- Slow server responses that reduce how aggressively the engine will crawl.
warningWATCH OUT
If your server is slow or unstable, Google backs off to avoid overloading it, which shrinks your effective crawl budget. Performance is not just a user experience issue. A fast, reliable server lets the engine crawl more of your important pages more often.
targetHow to find the waste
Server log analysis is the only way to see what bots actually crawl. Pull your logs, filter for Googlebot, and look at which URLs eat the most crawl activity. If you find the engine burning fetches on parameter junk and duplicates instead of your money pages, you have found your problem.
How to protect your budget
- 1Block low-value URL patterns, like internal search results and infinite filters, in robots.txt.
- 2Consolidate duplicates with canonical tags so the engine stops crawling variants.
- 3Eliminate redirect chains by pointing redirects straight to the final URL.
- 4Fix soft 404s and remove links to dead pages so crawlers stop chasing them.
- 5Keep your most important pages well linked and shallow in your site structure.
Spend the budget on money pages
The goal is not to maximize total crawling. It is to make sure the limited crawling you get lands on the pages that earn traffic and revenue, not on infinite junk URLs that will never rank anyway.
One caution before you go chasing this metric: do not invent a crawl budget problem you do not have. Google has been clear that the vast majority of sites never need to think about this. If you have a few thousand clean pages and they all get crawled and indexed, your crawl budget is fine and your time is better spent on content and links. Crawl budget optimization is a large-site discipline. Reach for it when the symptoms are real, important pages crawled rarely or not at all on a site with a huge URL count, not as a default chore for every project.
Crawl budget rewards a clean, efficient site, and the surest way to manage it is to watch how bots really behave rather than guessing. For the full method, including how to read your logs and act on them, see my log file analysis guide, and the technical SEO guide for the broader foundation.
RELATED TERMS
Want this handled by someone who has measured search for 20 years?
Work with me