The magic that makes Google tick

By Matt Loney, ZDNet UK
02 December 2004 10:43 AM
Tags: goog, search, google, page

Page III: Google's vice-president of engineering was in London this week to talk to potential recruits about just what lies behind that search page.

Google has two crucial factors in its favour. First, the whole problem is what Hölzle refers to as embarrassingly parallel, which means that if you double the amount of hardware, you can double performance (or capacity if you prefer -- the important point is that there are no diminishing returns as there would be with less parallel problems).

The second factor in Google's favour is the falling cost of hardware. If the index size doubles, then the embarrassingly parallel nature of the problem means that Google could double the number of machines and get the same response time so it can grow linearly with traffic. "In reality (from a business point of view) we would like to grow less than linear to keep costs down," said Hölzle, "but luckily the hardware keeps getting cheaper."

So every year as the Web gets bigger and requires more hardware to index, search and return Web pages, hardware gets cheaper so it "more or less evens out" to use Hölzle's words.

As the scale of the operation increases, it introduces some particular problems that would not be an issue on smaller systems. For instance, Google uses IDE drives for all its storage. They are fast and cheap, but not highly reliable. To help deal with this, Google developed its own file system -- called the Google File System, or GFS -- which assumes an individual unit of storage can go away at any time either because of a crash, a lost disk or just because someone stepped on a cable.

The power of three
There are no disk arrays within individual PCs; instead Google stores every bit of data in triplicate on three machines on three racks on three data switches to make sure there is no single point of failure between you and the data. "We use this for hundreds of terabytes of data," said Hölzle.

Don't expect to see GFS on a desktop near you any time soon -- it is not a general-purpose file system. For instance, a GFS block size is 64MB, compared with the more usual 2KB on a desktop file system. Hölzle said Google has 30 plus clusters running GFS, some as large as 2,000 machines with petabytes of storage. These large clusters can sustain read/write speeds of 2Gbps -- a feat made possible because each PC manages 2Mbps.

Once, said Hölzle, "someone disconnected an 80-machine rack from a GFS cluster, and the computation slowed down as the system began to re-replicate and we lost some bandwidth, but it continued to work. This is really important if you have 2,000 machines in a cluster." If you have 2000 machines then you can expect to see two failures a day.

Running thousands of cheap servers with relatively high failure rates is not an easy job. Standard tools don't work at this scale, so Google has had to develop them in-house. Some of the other challenges the company continues to face include:

Debugging: "You see things on the real site you never saw in testing because some special set of circumstances that create a bug," said Hölzle. "This can create non-trivial but fun problems to work on."

Data errors: A regular IDE hard disk will have an error rate in the order of 10-15 -- that is one millionth of one billionth of the data written to it may get corrupted and the hard-disk's own error checking will not pick it up. "But when you have a petabyte of data you need to start worrying about these failures," said Hölzle. "You must expect that you will have undetected bit errors on your disk several times a month, even with hardware checking built-in, so GFS does have an extra level of checksumming. Again this is something we didn't expect, but things happen."

Spelling: Google wrote its own spell checker, and maintains that nobody know as many spelling errors as it does. The amount of computing power available at the company means it can afford to begin teaching the system which words are related -- for instance "Imperial", "College" and "London". It's a job that many CPU years, and which would not have been possible without these thousands of machines. "When you have tons of data and tons of computation you can make things work that don't work on smaller systems," said Hölzle. One goal of the company now is to develop a better conceptual understanding of text, to get from the text string to a concept.

Power density: "There is an interesting problem when you use PCs," said Hölzle. "If you go to a commercial data centre and look at what they can support, you'll see a typical design allowing for 50W to 100W per square foot. At 200W per square foot you notice the sales person still wants to sell it but their international tech guy starts sweating. At 300W per square foot they cry out in pain."

Eighty mid-range PCs in a rack, of which you will find many dozens in a Google data centre, produce over 500W per square foot. "So we're not going to blade technology," said Hölzle. "We're already too dense. Finally Intel has realised this is a problem and is now focusing more on power efficiency, but it took some time to get the message across."

Quality of search results: One big area of complaints for Google is connected to the growing prominence of commercial search results -- in particular price comparison engines and e-commerce sites. Hölzle is quick to defend Google's performance "on every metric", but admits there is a problem with the Web getting, as he puts it, "more commercial". Even three years ago, he said, the Web had much more of a grass roots feeling to it. "We have thought of having a button saying 'give me less commercial results'," but the company has shied away from implementing this yet.

ZDNet UK's Matt Loney reported from London. For more coverage on ZDNet UK Insight, click here.

Talkback 20 comments

    Can't resist being a little PC ...Anonymous -- 03/12/04

    Can't resist being a little PC and finding the parallel between Klingon and Tagalog a bit weird. The latter is a real language spoken by tens of millions of people. Its name sure sounds funny, but is that enough? Better to mention Google's other funny options like "Bork, bork, bork" and "Elmer Fudd".

    Very nice overview. Our compu ...Anonymous -- 03/12/04

    Very nice overview. Our compute farm has no direct correlation to google's, but there's still a lot we're learning from their work.

    Nice article. Always nice to g ...Anonymous -- 03/12/04

    Nice article. Always nice to get an "inside" look at one of the most popular and useful web applications on the planet. Hope to see more stuff like this - especially about Google.

    Thanks for the story. More int ...Anonymous -- 03/12/04

    Thanks for the story. More interesting insights on the Google cluster can also be found here on <a href="http://www.tnl.net/blog/entry/How_many_Google_machines">http://www.tnl.net/blog/entry/How_many_Google_machines</a>

    This was a really cool article ...Anonymous -- 03/12/04

    This was a really cool article. Thanks

    Hey, a "give me less comm ...Anonymous -- 03/12/04

    Hey, a "give me less commercial" button sounds wonderful. If you were looking for a camera then clicking this button would probably bring up technical articles, reviews, techniques etc, rather than 1000 results from shops and price comparison spammers.

    How have you guys not done a s ...Anonymous -- 04/12/04

    How have you guys not done a story on the Google Sandbox??!

    Klingon and Tagalog? I'm curio ...Anonymous -- 05/12/04

    Klingon and Tagalog? I'm curious if you were aware that Tagalog was a real language since you put it in the same sentence as Klingon.

    Why is Tagalog placed in the s ...Anonymous -- 05/12/04

    Why is Tagalog placed in the same context as Klingon? It's hardly a rare language - it's used by the entire country of the Philippines!

    Wouldn't a more culturally sensitive connotation be in order here? Tagalog is hardly in the same class as an artificial language created for a science fiction series!

    Very cool article. Great insig ...Anonymous -- 05/12/04

    Very cool article. Great insight on how the website works. Now, if only I could fit all of that power into my PC, maybe I could play some really top notch games.

    THE PAGE IS HANGING! In both I ...Anonymous -- 06/12/04

    THE PAGE IS HANGING!

    In both IE and Firefox!

    Congrats to you and Matt Loney ...Anonymous -- 07/12/04

    Congrats to you and Matt Loney; good stuff.
    Got onto you from Wired.

    Spelling and grammar make this ...Anonymous -- 07/12/04

    Spelling and grammar make this article a pain, although the subject matter is interesting. Too bad parts of it are nearly unreadable.

    Jesus, i never knew any of thi ...Anonymous -- 08/12/04

    Jesus, i never knew any of this, its really inresting, what surpised me was "Google runs its systems on cheap, no-name IU and 2U servers -- so cheap that Google refers to them as PCs. After all each one has a standard x86 PC processor, standard IDE hard disk, and standard PC reliability -- which means it is expected to fail once in three years."!!lol, using ide!!even i have sata, but with such a laugh business i can understand why they would use this as it is more cost effective.cheers !!!Mike!!...www.suprmobo.net!!

    Get a proofreader. 10-15 was s ...Anonymous -- 08/12/04

    Get a proofreader. 10-15 was supposed to be 10 to the power 15. And there are other errors and typos too. Embarr****ing.

    What amazes me is that with 20 ...Anonymous -- 09/12/04

    What amazes me is that with 200 computer doctors and 600 other computer science people, the results from Google in an average search is, many times, similar to other engines, such as Altavista and Webcrawler, with the same people and companies spamming the top slots.

    Uh...Google keeps locking up.. ...Anonymous -- 25/12/04

    Uh...Google keeps locking up....(Just Kiddin')

    Have A Merry Christmas!

    "104 interface languages ...Anonymous -- 28/12/04

    "104 interface languages including Klingon and Tagalog."

    Do your research man. You're crossing the racist line.

    Nice article Ravi Shiraguppi -- 08/10/07

    wah... Nice Article.

    Ravi Shiraguppi.
    Sangolli Rayanna nagar,
    Dharwad.Karnataka.
    INDIA

Add your opinion

Sponsored content

Power Centre - Content from our premier sponsors

Blogs

Tags

Back to top

Featured