You're right, the results do differ wildly but I suppose it all depends on your definition, or more importantly, Google algorithms' definition of similarity.
The motoricerca tool is able to differentiate easily between the text and HTML content so we can bet Google can do this too, there could be seperate thresholds for HTML and text similarity or any combination, and I'd love to know those values!
|