Affiliate Marketing
Forum Search

Reply
 
LinkBack Thread Tools Display Modes

  #1 (permalink)  
Old 06-03-05
Registered User
 
Join Date: Aug 2003
Location: Belfast, Northern Ireland
Posts: 140
Thanks: 0
Thanked 0 Times in 0 Posts
will 68 is an unknown quantity at this point
  Page Similarity Tools

Like many others on here I had a couple of sites wiped out in the recent google change. Both these sites have been steady earners for me for over 3 years.
I have been thinking along the lines it was some sort of duplicate/similar page filter, which has stuck these sites into small sort of google black hole by this I mean both sites are fully indexed in google, but never appear in any search results. I have looked at these sites using the 2 page similarity checkers which I know of: http://www.webconfs.com/similar-page-checker.php and http://tool.motoricerca.info/similarity-analyzer.phtml using the webconfs tool I see an average page similarity of 55%, but using the motoricerca tool I sometimes see a page similarity of up to 80%.

I have always used the webconfs tool and have just recently found the motoricerca tool and comparing certain pages they both give some very different results. I just wondered what you all thought of both these tools, especially those effected by the recent update and who has used the webconfs tool and believes their site is fine as far as page similarity is concerned. Also does anyone know of any other similar page checker tools.

Will
Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!
Reply With Quote
Sponsored Links
  #2 (permalink)  
Old 06-03-05
renegade's Avatar
Moderator
 
Join Date: Aug 2003
Posts: 3,220
Thanks: 78
Thanked 16 Times in 12 Posts
renegade seems to know their stuffrenegade seems to know their stuff
You're right, the results do differ wildly but I suppose it all depends on your definition, or more importantly, Google algorithms' definition of similarity.

The motoricerca tool is able to differentiate easily between the text and HTML content so we can bet Google can do this too, there could be seperate thresholds for HTML and text similarity or any combination, and I'd love to know those values!
__________________
Joe's CantBarsed Blog | Discount Codes
Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!
Reply With Quote
  #3 (permalink)  
Old 07-03-05
justnozy's Avatar
Registered User
 
Join Date: Jan 2004
Location: Wigan
Posts: 501
Thanks: 1
Thanked 0 Times in 0 Posts
justnozy is an unknown quantity at this point
Don't think there is any doubt (maybe among other things) there was a duplicate content filter in there.

Our main earning site was controlled by a "do it & worry tomorrow" guy. I preached & preached about duplicate content - but it was always - yeh I'll sort it tomorrow - overnight 75% of pages wiped out.

I think there is also some truth in the sandbox theory though - new content does seem to get into serps very quick - with, in honesty, no due merit - then bomb - then what seems to be a good natural ranking ( if you take the time to check through your "betters" and dismiss the new stuff i.e no way should be there's ). In fact it almost seems to me better adding cr*p content than doing a short term ppc campaign - the results are faster & cheaper - but the long term effect ????

But this is of course with google. New content zooms up the serps, bombs and then seems to find it's "natural" place.

With MSN & Yahoo I have rankings that are simply ridiculously high when you look at the experience, maturity, and real content of my peers. It will be interesting to see if a "sandbox" eventually comes through.

Meanwhile of course there are two choices - dance or wait for the slow waltz
__________________
Free Competitions

Last edited by justnozy; 07-03-05 at 04:51 AM..
Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!
Reply With Quote
Reply

Bookmarks


Currently Active Users Viewing This Thread: 1 (0 members and 1 guests)
 
Thread Tools
Display Modes

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is Off
HTML code is Off
Trackbacks are On
Pingbacks are On
Refbacks are On


Affiliate Marketing RSS Feeds - Contact Us - Affiliate Marketing - Archive - Privacy Statement - Top

Content Relevant URLs by vBSEO 3.2.0 RC7