One of Google Search’s oldest and best-known features, cache links, are being retired. Best known by the “Cached” button, those are a snapshot of a web page the last time Google indexed it. However, according to Google, they’re no longer required.

“It was meant for helping people access pages when way back, you often couldn’t depend on a page loading,” Google’s Danny Sullivan wrote. “These days, things have greatly improved. So, it was decided to retire it.”

  • @[email protected]
    link
    fedilink
    English
    810 months ago

    At least some of these tools change their “user agent” to be whatever google’s crawler is.

    When you browse in, say, Firefox, one of the headers that firefox sends to the website is “I am using Firefox” which might affect how the website should display to you or let the admin knkw they need firefox compatibility (or be used to fingerprint you…).

    You can just lie on that, though. Some privacy tools will change it to Chrome, since that’s the most common.

    Or, you say “i am the google web crawler”, which they let past the paywall so it can be added to google.

    • @sfgifz
      link
      English
      2
      edit-2
      10 months ago

      Or, you say “i am the google web crawler”, which they let past the paywall so it can be added to google.

      If I’m not wrong, Google has a set range of IP addresses for their crawlers, so not all sites will let you through just because your UA claims to be Googlebot