How to evade Google search

Dell apparently learned the hard way this week that companies must take care that information they store on the Internet and want to keep hidden is not automatically added to a search engine index for everyone on the Web to see.

Specifications for future Dell notebooks were accessible via Google's search site before the content was pulled from a Dell File Transfer Protocol (FTP) site and from Google's cache.

Google, like the other major search engines, uses automated software robots called "spiders" that crawl the Web and find sites to add to the index it maintains. Because the spiders follow links running from one Web site to others, they pick up sites on their own without Webmasters having to manually submit them to search engines.
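The link-following behavior described above can be illustrated with a minimal sketch. It assumes a hypothetical, in-memory map of pages and the links on them (the `LINKS` data and the `crawl` function are invented for illustration and say nothing about how Google's crawler is actually built). It shows only the general idea: a page is discovered if some already-discovered page links to it.

```python
from collections import deque

# Hypothetical link graph: each page maps to the pages it links to.
LINKS = {
    "a.example/": ["a.example/about", "b.example/"],
    "a.example/about": ["a.example/"],
    "b.example/": ["c.example/"],
    "c.example/": [],
    "d.example/": ["a.example/"],  # no page links here
}


def crawl(seed, links):
    """Return the pages reachable from seed, in breadth-first discovery order."""
    seen = [seed]
    queue = deque([seed])
    while queue:
        page = queue.popleft()
        for target in links.get(page, []):
            if target not in seen:
                seen.append(target)
                queue.append(target)
    return seen


index = crawl("a.example/", LINKS)
```

In this toy graph, `d.example/` never enters the index because nothing links to it, while a page that is linked from anywhere reachable is picked up without being submitted.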

Webmasters also can provide the URL, or Web address, for pages they want crawled, and they can submit detailed site maps to Google, according to Google's "information for Webmasters" pages.

Webmasters who want to keep some or all of their site hidden from the Googlebot can put a standard document called "robots.txt" at the root of the server that instructs the crawler not to download content. If content has already been indexed and its removal is urgent, the Webmaster can submit a request via Google's automatic URL removal system, but must provide an e-mail address and password first.
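As a rough illustration of how such a file is read, the sketch below feeds a small robots.txt to Python's standard `urllib.robotparser` module and asks whether a crawler may fetch two URLs. The `/private/` directory and the `www.example.com` host are assumptions made up for the example. Note that the file only asks well-behaved crawlers to stay away; it does not restrict access to the content itself.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt asking all crawlers to skip the /private/ directory.
ROBOTS_TXT = """\
User-agent: *
Disallow: /private/
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

# Ask whether a crawler identifying itself as "Googlebot" may fetch each URL.
public_allowed = parser.can_fetch(
    "Googlebot", "http://www.example.com/index.html"
)
private_allowed = parser.can_fetch(
    "Googlebot", "http://www.example.com/private/specs.html"
)
```

Under these rules the crawler may fetch the home page but is asked to skip anything under `/private/`.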

Content that has been removed from a site can still be viewed through Google's cache, which is a "snapshot" and archive of each page crawled. Webmasters can prevent pages from being cached by inserting a "noarchive" robots meta tag on them.
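One widely documented form of that code is a robots meta tag carrying the `noarchive` directive in the page's head. The sketch below, which uses a made-up page and a hypothetical `RobotsMetaParser` helper built on Python's standard `html.parser`, shows what the tag looks like and how a checker might detect it. It is an illustration only, not a description of how Google processes the tag.

```python
from html.parser import HTMLParser

# Hypothetical page that asks search engines not to keep a cached copy.
PAGE = """<html><head>
<meta name="robots" content="noarchive">
<title>Hypothetical product page</title>
</head><body>...</body></html>"""


class RobotsMetaParser(HTMLParser):
    """Collect directives from <meta name="robots"> or <meta name="googlebot">."""

    def __init__(self):
        super().__init__()
        self.directives = set()

    def handle_starttag(self, tag, attrs):
        attr = dict(attrs)
        name = (attr.get("name") or "").lower()
        if tag == "meta" and name in ("robots", "googlebot"):
            content = attr.get("content") or ""
            for directive in content.split(","):
                if directive.strip():
                    self.directives.add(directive.strip().lower())


checker = RobotsMetaParser()
checker.feed(PAGE)
cache_allowed = "noarchive" not in checker.directives
```

A Webmaster could run a check like this over a site's pages to confirm the tag is present wherever a cached copy is unwanted.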

Talkback 2 comments

    Dumba55 Anonymous -- 02/02/06 (in reply to #120128384)

    I have to admit - that was stupid on Dell's part to put a server on the internet containing the next gen laptops.

    WTF? Anonymous -- 03/02/06

    I'm assuming Dell's FTP site was at least secured using a username/password? Do Google's bots hit secured FTP sites?
