Gary Illyes from Google shared some more details on Googlebot, Google’s crawling ecosystem, fetching and how it processes bytes.

The article is named Inside Googlebot: demystifying crawling, fetching, and the bytes we process.

Googlebot. Google has many more than one singular crawler, it has many crawlers for many purposes. So referencing Googlebot as a singular crawler, might not be super accurate anymore. Google documented many of its crawlers and user agents over here.

Limits. Recently, Google spoke about its crawling limits. Now, Gary Illyes dug into it more. He said:

  • Googlebot currently fetches up to 2MB for any individual URL (excluding PDFs).
  • This means it crawls only the first 2MB of a resource, including the HTTP header.
  • For PDF files, the limit is 64MB.
  • Image and video crawlers typically have a wide range of threshold values, and it largely depends on the product that they’re fetching for.
  • For any other crawlers that don’t specify a limit, the default is 15MB regardless of content type.

Then what happens when Google crawls?

  1. Partial fetching: If your HTML file is larger than 2MB, Googlebot doesn’t reject the page. Instead, it stops the fetch exactly at the 2MB cutoff. Note that the limit includes HTTP request headers.
  2. Processing the cutoff: That downloaded portion (the first 2MB of bytes) is passed along to our indexing systems and the Web Rendering Service (WRS) as if it were the complete file.
  3. The unseen bytes: Any bytes that exist after that 2MB threshold are entirely ignored. They aren’t fetched, they aren’t rendered, and they aren’t indexed.
  4. Bringing in resources: Every referenced resource in the HTML (excluding media, fonts, and a few exotic files) will be fetched by WRS with Googlebot like the parent HTML. They have their own, separate, per-URL byte counter and don’t count towards the size of the parent page.

How Google renders these bytes. When the crawler accesses these bytes, it then passes it over to WRS, the web rendering service. “The WRS processes JavaScript and executes client-side code similar to a modern browser to understand the final visual and textual state of the page. Rendering pulls in and executes JavaScript and CSS files, and processes XHR requests to better understand the page’s textual content and structure (it doesn’t request images or videos). For each requested resource, the 2MB limit also applies,” Google explained.

Best practices. Google listed these best practices:

  • Keep your HTML lean: Move heavy CSS and JavaScript to external files. While the initial HTML document is capped at 2MB, external scripts, and stylesheets are fetched separately (subject to their own limits).
  • Order matters: Place your most critical elements — like meta tags, </code> elements, <code><link/></code> elements, canonicals, and essential structured data — higher up in the HTML document. This ensures they are unlikely to be found below the cutoff.</li> <li><strong>Monitor your server logs:</strong> Keep an eye on your server response times. If your server is struggling to serve bytes, our fetchers will automatically back off to avoid overloading your infrastructure, which will drop your crawl frequency.</li> </ul> <p><strong>Podcast. </strong>Google also had a podcast on the topic, here it is:</p> <figure class="wp-block-embed is-type-video is-provider-youtube wp-block-embed-youtube wp-embed-aspect-16-9 wp-has-aspect-ratio"> <div class="wp-block-embed__wrapper"> <noscript><iframe title="Google crawlers behind the scenes" width="1040" height="585" src="https://www.youtube.com/embed/JpweMBnpS4Q?feature=oembed" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share" referrerpolicy="strict-origin-when-cross-origin" allowfullscreen></iframe></noscript> </div> </figure> </div> <hr/> <p class="article-disclosure"> <em>Search Engine Land is owned by <a href="https://www.semrush.com/" rel="nofollow" target="_blank">Semrush</a>. We remain committed to providing high-quality coverage of marketing topics. Unless otherwise noted, this page’s content was written by either an employee or a paid contractor of <a href="https://www.semrush.com/" rel="nofollow" target="_blank">Semrush</a> Inc.</em> </p> <hr/> <div class="author-about py-4"> <div class="card bg-light"> <div class="row gx-0"> <div class="col-12 col-lg-auto"> <div class="authorImage p-2"> <img decoding="async" class="img-fluid rounded avatar-border" src="https://searchengineland.com/wp-content/seloads/2025/05/1630496745425.jpeg.webp" alt="Barry Schwartz" width="140" height="140"/> </div> </p></div> <div class="col-12 col-lg"> <div class="card-body author-body p-2"> <div id="authorBio-251" class="author-desc"> <p>Barry Schwartz is a technologist and a Contributing Editor to Search Engine Land and a member of the programming team for SMX events. He owns <a href="https://www.rustybrick.com/" rel="nofollow">RustyBrick</a>, a NY based web consulting firm. He also runs <a href="https://www.seroundtable.com/" rel="nofollow">Search Engine Roundtable</a>, a popular search blog on very advanced SEM topics.</p> <p>In 2019, Barry was awarded the Outstanding Community Services Award from Search Engine Land, in 2018 he was awarded the US Search Awards the “US Search Personality Of The Year,” you can learn more over here and in 2023 he was listed as a top 50 most influential PPCer by Marketing O’Clock.</p> <p>Barry can be followed <a href="https://twitter.com/rustybrick/">on X here</a> and you can learn more about <a href="https://www.rustybrick.com/barry">Barry Schwartz over here</a> or on his <a href="https://www.barryschwartz.org/">personal site</a>.</p> </p></div> </p></div> </p></div> </p></div> </p></div> </div> <p> <!-- START SIDEBAR LOWER SPACE --></p> <p><!-- END SIDEBAR LOWER SPACE --></p></div> <p><script async src="//platform.twitter.com/widgets.js" charset="utf-8"></script></p> </div> </section> <footer class="post-full-footer"> <section class="author-card"> <img alt='' src='https://secure.gravatar.com/avatar/f460abd9ebf5b32c1cc43bf3c2242f21be77a874c7ebba4652155c7e6037e259?s=60&d=mm&r=g' srcset='https://secure.gravatar.com/avatar/f460abd9ebf5b32c1cc43bf3c2242f21be77a874c7ebba4652155c7e6037e259?s=120&d=mm&r=g 2x' class='avatar avatar-60 photo' height='60' width='60' loading='lazy' decoding='async'/> <section class="author-card-content"> <h4 class="author-card-name"><a href="https://www.seositestool.com/author/abdellahgarmaligmail-com/">seositestool</a></h4> <p> Read <a href="https://www.seositestool.com/author/abdellahgarmaligmail-com/">more posts</a> by this author. </p> </section> </section> <div class="post-full-footer-right"> <a class="author-card-button" href="https://www.seositestool.com/author/abdellahgarmaligmail-com/">Read More</a> </div> </footer> <section class="post-full-comments"> </section> </article> </div> </main> <!-- Links to Previous/Next posts --> <aside class="read-next outer"> <div class="inner"> <div class="read-next-feed"> <article class="read-next-card" "> <header class="read-next-card-header"> <small class="read-next-card-header-sitetitle">— Seo Sites Tool —</small> <h3 class="read-next-card-header-title"><a href="https://www.seositestool.com/category/seo/">Seo</a></h3> </header> <div class="read-next-divider"><svg xmlns="http://www.w3.org/2000/svg" viewBox="0 0 24 24"><path d="M13 14.5s2 3 5 3 5.5-2.463 5.5-5.5S21 6.5 18 6.5c-5 0-7 11-12 11C2.962 17.5.5 15.037.5 12S3 6.5 6 6.5s4.5 3.5 4.5 3.5"/></svg> </div> <div class="read-next-card-content"> <ul> <li><a href="https://www.seositestool.com/bots-could-overtake-human-web-usage-by-2027/">Bots could overtake human web usage by 2027</a> </li> <li><a href="https://www.seositestool.com/why-traditional-keyword-research-is-failing-and-how-to-fix-it-with-the-intention-of-searching/">Why traditional keyword research is failing and how to fix it with the intention of searching</a> </li> <li><a href="https://www.seositestool.com/how-to-do-ppc-keyword-gap-analysis/">How to do PPC keyword gap analysis</a> </li> </ul> </div> <footer class="read-next-card-footer"> <a href="https://www.seositestool.com/category/seo/"> See all 1591 posts. → </a> </footer> </article> <!-- Display next post --> <article class="post-card post-4828 post type-post status-publish format-standard has-post-thumbnail hentry category-seo"> <a class="post-card-image-link" href="https://www.seositestool.com/google-expands-merchant-center-loyalty-features-to-14-countries-and-ai-surfaces/" aria-label="Read more about Google expands Merchant Center loyalty features to 14 countries and AI surfaces"> <img width="768" height="432" src="https://www.seositestool.com/wp-content/uploads/2026/04/Google-Shopping-Ads-Google-Ads-768x432.jpg" class="post-card-image wp-post-image" alt="" decoding="async" loading="lazy" srcset="https://www.seositestool.com/wp-content/uploads/2026/04/Google-Shopping-Ads-Google-Ads-768x432.jpg 768w, https://www.seositestool.com/wp-content/uploads/2026/04/Google-Shopping-Ads-Google-Ads-300x169.jpg 300w, https://www.seositestool.com/wp-content/uploads/2026/04/Google-Shopping-Ads-Google-Ads-1024x576.jpg 1024w, https://www.seositestool.com/wp-content/uploads/2026/04/Google-Shopping-Ads-Google-Ads-1536x864.jpg 1536w, https://www.seositestool.com/wp-content/uploads/2026/04/Google-Shopping-Ads-Google-Ads.jpg 1920w" sizes="auto, (max-width: 768px) 100vw, 768px" /> </a> <div class="post-card-content"> <a class="post-card-content-link" href="https://www.seositestool.com/google-expands-merchant-center-loyalty-features-to-14-countries-and-ai-surfaces/" aria-label="Read more about Google expands Merchant Center loyalty features to 14 countries and AI surfaces"> <header class="post-card-header"> <span class="post-card-tags">Seo</span> <h2 class="post-card-title">Google expands Merchant Center loyalty features to 14 countries and AI surfaces</h2> </header> <section class="post-card-excerpt"> <p>Google is giving retailers more firepower to promote loyalty program benefits directly within product listings — expanding the program internationally and into its newest AI-powered shopping experiences. What’s new. Merchants can now highlight…</p> </section> </a> <footer class="post-card-meta"> <ul class="author-list"> <li class="author-list-item"> <div class="author-name-tooltip"> seositestool </div> <a href="https://www.seositestool.com/author/abdellahgarmaligmail-com/" class="static-avatar" aria-label="Read more posts by seositestool"> <img alt='' src='https://secure.gravatar.com/avatar/f460abd9ebf5b32c1cc43bf3c2242f21be77a874c7ebba4652155c7e6037e259?s=30&d=mm&r=g' srcset='https://secure.gravatar.com/avatar/f460abd9ebf5b32c1cc43bf3c2242f21be77a874c7ebba4652155c7e6037e259?s=60&d=mm&r=g 2x' class='avatar avatar-30 photo' height='30' width='30' loading='lazy' decoding='async'/> </a> </li> </ul> <span class="reading-time">3 min read</span> </footer> </div><!-- .post-card-content --> </article> <!-- Display previous post --> <article class="post-card post-4824 post type-post status-publish format-standard has-post-thumbnail hentry category-seo"> <a class="post-card-image-link" href="https://www.seositestool.com/59-of-seo-jobs-are-now-senior-level-roles-study/" aria-label="Read more about 59% of SEO jobs are now senior-level roles: Study"> <img width="768" height="432" src="https://www.seositestool.com/wp-content/uploads/2026/03/seo-command-center-768x432.png" class="post-card-image wp-post-image" alt="" decoding="async" loading="lazy" srcset="https://www.seositestool.com/wp-content/uploads/2026/03/seo-command-center-768x432.png 768w, https://www.seositestool.com/wp-content/uploads/2026/03/seo-command-center-300x169.png 300w, https://www.seositestool.com/wp-content/uploads/2026/03/seo-command-center-1024x576.png 1024w, https://www.seositestool.com/wp-content/uploads/2026/03/seo-command-center-1536x864.png 1536w, https://www.seositestool.com/wp-content/uploads/2026/03/seo-command-center.png 1920w" sizes="auto, (max-width: 768px) 100vw, 768px" /> </a> <div class="post-card-content"> <a class="post-card-content-link" href="https://www.seositestool.com/59-of-seo-jobs-are-now-senior-level-roles-study/" aria-label="Read more about 59% of SEO jobs are now senior-level roles: Study"> <header class="post-card-header"> <span class="post-card-tags">Seo</span> <h2 class="post-card-title">59% of SEO jobs are now senior-level roles: Study</h2> </header> <section class="post-card-excerpt"> <p>SEO hiring is shifting toward senior, strategy-led roles as AI reshapes search and expands the scope of the job. A new Semrush analysis of 3,900 listings shows companies now prioritize leadership, experimentation, and…</p> </section> </a> <footer class="post-card-meta"> <ul class="author-list"> <li class="author-list-item"> <div class="author-name-tooltip"> seositestool </div> <a href="https://www.seositestool.com/author/abdellahgarmaligmail-com/" class="static-avatar" aria-label="Read more posts by seositestool"> <img alt='' src='https://secure.gravatar.com/avatar/f460abd9ebf5b32c1cc43bf3c2242f21be77a874c7ebba4652155c7e6037e259?s=30&d=mm&r=g' srcset='https://secure.gravatar.com/avatar/f460abd9ebf5b32c1cc43bf3c2242f21be77a874c7ebba4652155c7e6037e259?s=60&d=mm&r=g 2x' class='avatar avatar-30 photo' height='30' width='30' loading='lazy' decoding='async'/> </a> </li> </ul> <span class="reading-time">3 min read</span> </footer> </div><!-- .post-card-content --> </article> </div> </div> </aside> <!-- Floating header which appears on-scroll --> <div class="floating-header"> <div class="floating-header-logo"> <a href="https://www.seositestool.com"> <span>Seo Sites Tool</span> </a> </div> <span class="floating-header-divider">—</span> <div class="floating-header-title">Google explains how crawling works in 2026</div> <div class="floating-header-share"> <div class="floating-header-share-label">Share this <svg xmlns="http://www.w3.org/2000/svg" viewBox="0 0 24 24"> <path d="M7.5 15.5V4a1.5 1.5 0 1 1 3 0v4.5h2a1 1 0 0 1 1 1h2a1 1 0 0 1 1 1H18a1.5 1.5 0 0 1 1.5 1.5v3.099c0 .929-.13 1.854-.385 2.748L17.5 23.5h-9c-1.5-2-5.417-8.673-5.417-8.673a1.2 1.2 0 0 1 1.76-1.605L7.5 15.5zm6-6v2m-3-3.5v3.5m6-1v2"/> </svg> </div> <a class="floating-header-share-tw" href="https://twitter.com/share?text=Google explains how crawling works in 2026&url=https://www.seositestool.com/google-explains-how-crawling-works-in-2026/" onclick="window.open(this.href, 'share-twitter', 'width=550,height=235');return false;"> <svg xmlns="http://www.w3.org/2000/svg" viewBox="0 0 32 32"><path d="M30.063 7.313c-.813 1.125-1.75 2.125-2.875 2.938v.75c0 1.563-.188 3.125-.688 4.625a15.088 15.088 0 0 1-2.063 4.438c-.875 1.438-2 2.688-3.25 3.813a15.015 15.015 0 0 1-4.625 2.563c-1.813.688-3.75 1-5.75 1-3.25 0-6.188-.875-8.875-2.625.438.063.875.125 1.375.125 2.688 0 5.063-.875 7.188-2.5-1.25 0-2.375-.375-3.375-1.125s-1.688-1.688-2.063-2.875c.438.063.813.125 1.125.125.5 0 1-.063 1.5-.25-1.313-.25-2.438-.938-3.313-1.938a5.673 5.673 0 0 1-1.313-3.688v-.063c.813.438 1.688.688 2.625.688a5.228 5.228 0 0 1-1.875-2c-.5-.875-.688-1.813-.688-2.75 0-1.063.25-2.063.75-2.938 1.438 1.75 3.188 3.188 5.25 4.25s4.313 1.688 6.688 1.813a5.579 5.579 0 0 1 1.5-5.438c1.125-1.125 2.5-1.688 4.125-1.688s3.063.625 4.188 1.813a11.48 11.48 0 0 0 3.688-1.375c-.438 1.375-1.313 2.438-2.563 3.188 1.125-.125 2.188-.438 3.313-.875z"/></svg> </a> <a class="floating-header-share-fb" href="https://www.facebook.com/sharer/sharer.php?u=https://www.seositestool.com/google-explains-how-crawling-works-in-2026/" onclick="window.open(this.href, 'share-facebook','width=580,height=296');return false;"> <svg xmlns="http://www.w3.org/2000/svg" viewBox="0 0 32 32"><path d="M19 6h5V0h-5c-3.86 0-7 3.14-7 7v3H8v6h4v16h6V16h5l1-6h-6V7c0-.542.458-1 1-1z"/></svg> </a> </div> <progress id="reading-progress" class="progress" value="0"> <div class="progress-container"> <span class="progress-bar"></span> </div> </progress> </div> <!-- The footer at the very bottom of the screen --> <footer class="site-footer outer"> <div class="site-footer-content inner"> <section class="copyright"><a href="https://www.seositestool.com">Seo Sites Tool</a> © 2026</section> <nav class="site-footer-nav"> <a href="https://www.seositestool.com">Latest Posts</a> <a href="https://chrisyee.ca/geist" target="_blank" rel="noopener">Geist by Chris Yee</a> </nav> </div> </footer> </div> <div id="search" class="search-overlay"> <button class="search-overlay-close" aria-label="close search overlay"></button> <div class="search-overlay-content"> <form role="search" method="get" class="search-form" action="https://www.seositestool.com/"> <label> <span class="screen-reader-text">Search for:</span> <input type="search" class="search-field" placeholder="Search …" value="" name="s" /> </label> <input type="submit" class="search-submit" value="Search" /> </form> </div> </div> <script type="speculationrules"> {"prefetch":[{"source":"document","where":{"and":[{"href_matches":"\/*"},{"not":{"href_matches":["\/wp-*.php","\/wp-admin\/*","\/wp-content\/uploads\/*","\/wp-content\/*","\/wp-content\/plugins\/*","\/wp-content\/themes\/geist-2.0.2\/*","\/*\\?(.+)"]}},{"not":{"selector_matches":"a[rel~=\"nofollow\"]"}},{"not":{"selector_matches":".no-prefetch, .no-prefetch a"}}]},"eagerness":"conservative"}]} </script> <script type="text/javascript" src="https://www.seositestool.com/wp-content/themes/geist-2.0.2/built/main.js?ver=20190322" id="geist-main-js"></script> <script type="text/javascript" src="https://www.seositestool.com/wp-content/themes/geist-2.0.2/built/jquery.fitvids.js?ver=20190322" id="geist-fitvids-js"></script> <div style="display:none;"> <p><a href="https://www.seositestool.com/">https://www.seositestool.com/</a></p> </div> </body> </html> <!-- Page cached by LiteSpeed Cache 7.6.2 on 2026-04-01 13:09:59 -->