Results 1 to 2 of 2

“Crawl Caching Proxy” (BigDaddy Update) Discussed by Matt Cutts

This is a discussion on “Crawl Caching Proxy” (BigDaddy Update) Discussed by Matt Cutts within the General Discussion forums, part of the vBulletin SEO Discussion category; “Crawl Caching Proxy” (BigDaddy Update) Discussed by Matt Cutts Matt Cutts talks about the “ Crawl Caching Proxy ” Google ...

  1. #1
    Senior Member
    Real Name
    Joseph Ward
    Join Date
    Jun 2005
    Posts
    23,845
    Liked
    36 times
    Blog Entries
    9

    “Crawl Caching Proxy” (BigDaddy Update) Discussed by Matt Cutts

    “Crawl Caching Proxy” (BigDaddy Update) Discussed by Matt Cutts

    Matt Cutts talks about the “Crawl Caching Proxy” Google has introduced with the BigDaddy update. With this new caching proxy, they will reduce bandwidth consumption (for both themselves and site owners).

    How does it work?
    • Google has multiple independent bots that will crawl your site (Main index, AdSense, News Search, Blog Search, etc).
    • By using a “crawl caching proxy”, pages crawled by any one of these bots can be shared with the other services, without having to hit the website again (and, thereby, naturally consuming less bandwidth).
    Conspiracy theorists should note that Matt emphasizes:

    Quote Originally Posted by Matt Cutts
    Just as always, participating in AdSense or being in our blogsearch doesn’t get you any “extra” crawling (or ranking) in our web index whatsoever.
    FYI - Having a page get stored in the crawl caching proxy will not help it to be prioritized for crawling by the other Google crawl services. Apparently, they will still determine their crawling list independently.

    Quick Notes
    • robots.txt directives for each bot type will still be respected even if the page is pulled from the caching proxy instead of directly from the site.
    • The caching proxy is not to be confused with Google’s “Cached” links in the SERPs.
    Notes on vBSEO
    • vBSEO also focuses on reducing bandwidth consumption for faster and more efficient indexing.
    • vBSEO includes a HTML comment stripping feature that helps to reduce bandwidth consumption by a significant amount.
    • vBSEO includes gzip compression compatibility.
    • vBSEO works similar to the Google caching proxy. Our focus on 1-URL-Per-Resource helps to eliminate redundant URLs to the same content therefore also eliminating duplicate content. In addition to its other SEO advantages, this is a major bandwidth saver.
    If Google agreed to crawl 100 of your pages per day, would you prefer:

    (a) it crawled 100 unique content pages, or
    (b) it crawled 100 pages with a significant level of redundant/duplicate content?

    The answer is obvious: With vBSEO Option (a) is a reality. Without vBSEO, a vBulletin forum is loaded with redundant URLs to the same content.

    For Discussion
    • Why didn’t they have such a mechanism in place a long time ago?
    • No specific mention of how the freshness of crawled content will be affected when pulled from the caching proxy or how long a page will be stored there.
    • One might hope that they would also try (in addition to saving bandwidth) to also eliminate processing redundancy for the multiple crawling services.
    Source:
    Matt Cutts: Gadgets, Google, and SEO » Crawl caching proxy
    Last edited by Joe Ward; 04-26-2006 at 03:59 PM.

  2. #2
    Senior Member Michael's Avatar
    Real Name
    Michael Benson
    Join Date
    Sep 2005
    Location
    United Kingdom
    Posts
    776
    Liked
    0 times
    This is an excellent update for Google and site owners alike, i love how you tied vBSEO into the announcement too

Similar Threads

  1. Replies: 4
    Last Post: 04-27-2006, 03:35 PM

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •