On June 8th, Google announced on their company blog that they were using an new searching algorithm (method) for keeping track of all in information on the web. They named this new system Caffeine.
Caffeine lets us index web pages on an enormous scale. In fact, every second Caffeine processes hundreds of thousands of pages in parallel. If this were a pile of paper it would grow three miles taller every second. Caffeine takes up nearly 100 million gigabytes of storage in one database and adds new information at a rate of hundreds of thousands of gigabytes per day. You would need 625,000 of the largest iPods to store that much information; if these were stacked end-to-end they would go for more than 40 miles.
|Sidebar: How Search Engines Work
A search engine does not scour the Internet when you look for something. Instead, search engines “scape” the Internet looking for data and then build a giant list/table called an index. When you type in a search request, the search engine, looks in the index to find everything it has on your search terms and then sorts if based on factors such as popularity, how frequently the search terms are used, and other factors.
Google claims that Caffeine provides 50% more up-to-date information than their old system and has more stored information about the Internet than ever before.
Why is this big news: Analysts listen to and watch Google (G) because G’s search engines often provide more traffic to a website than when people just type the address into a web browser.
Google’s… old index had several layers, some of which were refreshed at a faster rate than others; the main layer would update every couple of weeks. To refresh a layer of the old index, we would analyze the entire web, which meant there was a significant delay between when we found a page and made it available to you.
With Caffeine, we analyze the web in small portions and update our search index on a continuous basis, globally. As we find new pages, or new information on existing pages, we can add these straight to the index. That means you can find fresher information than ever before—no matter when or where it was published.
Recently Google posted an article on their webmaster blog explaining their ranking methods. It’s all about quality. Google’s advice to owners of new websites is simple. Get involved in an online community and contribute with substance. Avoid fluff. Humor is OK, but don’t waste people’s time with ads and other junk. They discourage the purchase of links from unrelated sites or doing random link exchanges (where you trade links on sites with people who have nothing in common with you.) It is definitely a long-term game. Getting good rankings on Google is about consistency. It is not for sprinters. This race is for the turtle. Finally, Google suggests that you make it easy for novice users to link to your site. Notice below that you can post a bookmark to