Thread: BigDaddy Means Big Changes at Google

  1. #1
    3wdl's Avatar
    Super Dancer

    Status
    Offline
    Join Date
    Jun 2005
    Posts
    3,213
    Thanks
    181
    Thanked 147 Times in 108 Posts


    http://directmag.com/searchline/1-25...ddy/index.html

    One of the most popular forms of exercise among many search engine optimizers (both the third-party firms that do it for others and the advertisers who spiff up their own Web pages for better natural search rankings) is a periodic workout called "chasing the algorithm." The race begins when Google or Yahoo! updates some portion of the software that determines how it looks at Web pages and decides which are most relevant and valuable to a searcher. The engine makes that change; Web operators see their rankings rise or fall as a result; and they, or their outside search engine optimization (SEO) firm, scramble to get back the old rank by providing the new elements the search engine now needs. After a few months, the engines make another change, and it's off to the races again.

    Well, optimizers on Google are lacing up their running shoes for another race. Only this one promises to be more a marathon than the usual sprint. Google is testing a new data center infrastructure, a feat much bigger and more comprehensive than an algorithm change. Dubbed "Big Daddy" both in the search marketing blogs and forums and by the friendly folks at Google, this new data center, still in shakedown mode, will reportedly add new ground-level capabilities to the Google search function and drive those powers deep into all the algorithms with which Google searches, studies and indexes the Web.

    First, a bit of big-picture talk. Google's examination of the Web relies on a global network of data centers with different IP addresses. These decentralized servers speed the job of sending specialized Google services to users in different regions; they also share the workload of spidering the Web and comparing those discoveries to Web pages that are already in Google's index.

    The new BigDaddy data center contains new code for examining and sorting the Web and, once it has been tested fully, will become the default source for Web results, according to Google's chief search engineer Matt Cutts. In a January 4 post on his blog, Cutts said that might happen in early February or March of this year.

    But what is BigDaddy intended to do? According to Rob Sullivan, head organic search strategist at search marketing firm Enquiro, "If an algorithm update is like putting new tires on a car or installing a new stereo system, this BigDaddy is like putting in a whole new motor. They're totally revamping how Google works and resolving some long-standing issues with getting sites indexed properly."

    One of those issues is canonicalization. That's a fancy Google word for instructing a search engine how to decide which of a series of related URLs is the proper one to insert into the Google index. Say your Web site has a number of different home page URLs, including stuff.com, www.stuff.com, www.stuff.com/index.html and stuff.com/home.asp. This can come about because Web servers are often set up to accept aliases for Web pages, and to know that a request for stuff.com means someone's looking for www.stuff.com. That's a concession to users who get tired of getting error messages when they don't type in "www."

    The problem is that while these URLs may pull up the same page content, they're technically four different pages. That could skew the page count Google gets for the Web site, so that a site with 1,000 pages and two URLs for each page might look twice its real size to Google.

    It's also possible that those aliases could inadvertently contain different content or different incoming links. In that case the Google index, which looks at the value of the content and the quality of the links, could give those four pages different rankings.

    Finally, a Google search that turns up multiple entries for what is essentially the same content makes the results page that much less valuable to users. Better to select one of the URLs as the most representative and make room for other results.
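
As a rough illustration of the alias problem described above, here is a minimal Python sketch that collapses the four stuff.com home page URLs from the article into one form. The choice of www.stuff.com as the preferred host, the `DEFAULT_DOCS` list and the `canonicalize` helper are all assumptions made for the example; nothing here reflects how Google actually picks a canonical URL.

```python
from urllib.parse import urlsplit, urlunsplit

# Hypothetical list of "default document" file names; real servers vary.
DEFAULT_DOCS = {"index.html", "index.htm", "home.asp"}


def canonicalize(url: str, preferred_host: str = "www.stuff.com") -> str:
    """Map common aliases of a page onto one assumed canonical URL."""
    # Bare host names such as "stuff.com" need a scheme before parsing.
    if "://" not in url:
        url = "http://" + url
    parts = urlsplit(url)

    # Treat the host with and without "www." as the same site.
    host = parts.netloc.lower()
    bare = preferred_host[4:] if preferred_host.startswith("www.") else preferred_host
    if host in (bare, "www." + bare):
        host = preferred_host

    # Drop a trailing default document so "/index.html" becomes "/".
    path = parts.path or "/"
    head, _, tail = path.rpartition("/")
    if tail.lower() in DEFAULT_DOCS:
        path = head + "/"

    return urlunsplit((parts.scheme.lower(), host, path, parts.query, ""))


aliases = [
    "stuff.com",
    "www.stuff.com",
    "www.stuff.com/index.html",
    "stuff.com/home.asp",
]
canonical = {canonicalize(u) for u in aliases}
print(canonical)
```

With these assumptions, all four aliases reduce to a single entry, which is the outcome the article says Google wants: one representative URL instead of four competing ones.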

    "If you want to go to the Seattle Seahawks page on the NFL Web site, you'll get this long, horrendous URL," Sullivan says. "But the site also has another URL that's just 'Seattle Seahawks.' It pulls the content from the first page and just displays it under a prettier URL. So Google wants to be able to say that second page is the one people really want, and they'll attribute all the traffic, links and value to the shorter URL."

    BigDaddy is also intended to provide a solution to another long-standing Google problem: page hijacking through abuse of temporary redirects, known as 302 redirects. Nefarious Webmasters can hijack a page by replacing the page that should come up in a search with a virtual page that masquerades under the URL for the correct page. The searcher sees the correct result, but when clicked, the listing can redirect the searcher to any page the hijacker wants, including adult content or false storefronts set up to capture personal information. If a Web site suffers enough hijackings, Google will consider all of its pages contaminated and drop the site from the index.

    "302 redirects are a big hole in the system," Sullivan says. "People are using 302 redirects to hijack content and pages and many other things. By fixing this, Google will be eliminating a lot of problems."
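
To see why 302s are a weak spot, it may help to sketch the conventional way an indexer treats the two common redirect codes: a 301 (moved permanently) passes credit to the target URL, while a 302 (found, i.e. temporary) leaves the redirecting URL in the index. The `url_to_credit` function and the example.com-style URLs below are hypothetical, and this naive policy is only an assumption for illustration, not Google's actual logic.

```python
from http import HTTPStatus


def url_to_credit(source_url: str, target_url: str, status: int) -> str:
    """Return the URL a naive indexer would list for the fetched content."""
    if status == HTTPStatus.MOVED_PERMANENTLY:  # 301: the page has moved for good
        return target_url
    if status == HTTPStatus.FOUND:  # 302: temporary, so keep the original URL
        return source_url
    raise ValueError(f"not a redirect status handled here: {status}")


# A hijacker's page answers with a 302 pointing at the victim's page.
listed = url_to_credit(
    "http://hijacker.example/go",
    "http://victim.example/",
    302,
)
print(listed)
```

Under this policy the victim's content ends up listed under the hijacker's URL, and the hijacker can later point that URL anywhere.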

    Of course, how BigDaddy will fix these issues is a closely held secret. As with many other questions surrounding the compiling and ranking of its index, Google refuses to be specific for fear that too much information will only teach the bad guys how to get around the system.

    And there's something else new about BigDaddy. While search optimizers often know where to find a Google testing data center and have usually tried to go there to see how the pages they're working on are being searched and indexed, those IP addresses change often, even within a day.

    But for BigDaddy, Google's thrown open the doors. In early January, Cutts published a pair of IP addresses (66.249.93.104 and 64.233.179.104, for those who want a look) and actively called for feedback from Webmasters about problems and issues they perceived with the new system and its indexing.

    Some of these changes will bring Google's indexing technology up to par with its competitors; for example, Yahoo! and MSN have been handling 302 redirects for a year or more, although perhaps not as effectively as BigDaddy will eventually do. But other aspects of BigDaddy will help position Google to measure up to the search requirements of the future in some interesting ways, Sullivan says.

    "This will lay the groundwork for more advanced algorithms, larger databases, and being able to index different types of content more effectively," he says. For example, Google has also begun using a search crawler built on a Mozilla browser. The new search bot is more flexible, seems faster and can read non-text content more readily; that should mean that in time, it will be able to read links within images and even within Flash video, matter that gets ignored by bots that can't speak JavaScript.

    "As Web technology develops and we get richer and more interactive Web sites, [the search engines] can't just stick with just indexing hyperlinks and text," Sullivan says. "They're going to have to do everything."
    Any thoughts on this?
    James Little | Partnerships Director | TopCashBack

  2. #2
    Senior Member

    Status
    Offline
    Join Date
    Sep 2003
    Posts
    768
    Thanks
    0
    Thanked 2 Times in 2 Posts
    My favourite Big Daddy theory (not one I subscribe to) is that it is just a honey pot for identifying what sites are being "artificially" inflated through SEO. The theory is that those who are SEOing their results will punch their URL into Big Daddy and stick their head above the parapet in doing so.

    Thought it was a nice example of how twitchy people get when trying to outthink the search engines.

  3. #3
    Paul Wright's Avatar
    Fishboy

    Status
    Offline
    Join Date
    Jan 2005
    Location
    London
    Posts
    1,735
    Thanks
    32
    Thanked 20 Times in 14 Posts
    Agency Services Director | e: paul.wright@tradedoubler.com | t: 0207 798 5825


  4. #4
    3wdl's Avatar
    Super Dancer

    Quote Originally Posted by Paul Wright
    Thanks Paul - but mine has more replies.

    James Little | Partnerships Director | TopCashBack

  5. #5
    Paul Wright's Avatar
    Fishboy

    Quote Originally Posted by 3wdl
    Thanks Paul - but mine has more replies.

    I guess that's what I get for posting in the correct forum, huh? lol. It's an interesting topic and one that affects us all, so the more awareness the better.
    Agency Services Director | e: paul.wright@tradedoubler.com | t: 0207 798 5825

