Are there any search bots out there that are a drain on bandwidth with no real benefit?
Looking for ones that are big timewasters.
I see more and more of them in my server logs. I am tempted to just restrict bots to Yahoo, Google, Ask and a couple of others.
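A minimal sketch of that allowlist idea, written in Python so it can be checked with the standard-library parser. The user-agent tokens (`Googlebot`, `Slurp`, `Teoma`) are assumptions standing in for Google, Yahoo and Ask, and the helper name `build_allowlist_robots_txt` is made up for illustration. It only affects bots that actually honour robots.txt.

```python
from urllib.robotparser import RobotFileParser

# Assumed user-agent tokens for Google, Yahoo and Ask; check each engine's
# own documentation before relying on them.
ALLOWED_BOTS = ["Googlebot", "Slurp", "Teoma"]


def build_allowlist_robots_txt(allowed_bots):
    """Return robots.txt text that admits only the named bots (hypothetical helper)."""
    blocks = []
    for bot in allowed_bots:
        # An empty Disallow means nothing is off limits for this bot.
        blocks.append(f"User-agent: {bot}\nDisallow:\n")
    # Every other crawler is asked to stay out of the whole site.
    blocks.append("User-agent: *\nDisallow: /\n")
    return "\n".join(blocks)


robots_txt = build_allowlist_robots_txt(ALLOWED_BOTS)

# Sanity check: feed the generated text to the standard-library parser.
parser = RobotFileParser()
parser.parse(robots_txt.splitlines())

print(robots_txt)
print(parser.can_fetch("Googlebot", "/index.html"))
print(parser.can_fetch("SomeOtherBot", "/index.html"))
```

Adding "a couple of others" would just mean extending `ALLOWED_BOTS` with their tokens.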
Any comments or suggestions?
How are you going to block them?
Most of the lesser-known bots tend to ignore robots.txt anyway.
WebmasterWorld has a pretty comprehensive robots.txt (located at http://www.webmasterworld.com/robots3). Brett has spent considerable effort over the years ID'ing and banning resource-hog bots. As enormous as the place is, it ran off a single medium-spec box for years, and when he started naming Google updates, the box reached 120% of rated load before it finally fried...
Most of these are at least somewhat obedient. For the really naughty ones, you need to be banning by IP in any case, since they often run with spoofed UAs, like the standard IE one, or Googlebot.
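As a rough sketch of what banning by IP can look like at the application level, the snippet below checks a client address against a list of blocked ranges using Python's standard `ipaddress` module. The names `BLOCKED_NETWORKS` and `is_blocked` are hypothetical, and the ranges are reserved documentation addresses, not real offenders. In practice the same list would more likely live in the web server or firewall config.

```python
import ipaddress

# Hypothetical blocklist. These are reserved documentation ranges,
# used here only as placeholders for ranges pulled from your own logs.
BLOCKED_NETWORKS = [
    ipaddress.ip_network("192.0.2.0/24"),      # a whole range
    ipaddress.ip_network("198.51.100.17/32"),  # a single address
]


def is_blocked(client_ip: str) -> bool:
    """Return True if client_ip falls inside any blocked network."""
    addr = ipaddress.ip_address(client_ip)
    # The User-Agent header is deliberately ignored: it can be spoofed,
    # while the connecting IP is much harder to fake.
    return any(addr in net for net in BLOCKED_NETWORKS)


print(is_blocked("192.0.2.55"))
print(is_blocked("203.0.113.9"))
```

The check never looks at the user agent, so a bot claiming to be IE or Googlebot is still refused if it connects from a listed range.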