Affiliate Marketing
Forum Search

Reply
 
LinkBack Thread Tools Display Modes

  #1 (permalink)  
Old 03-02-05
King of The Zoo
 
Join Date: Nov 2003
Location: Wherever I lay my hat - that's my home
Posts: 1,416
Thanks: 0
Thanked 0 Times in 0 Posts
EyeOfTheTiger is an unknown quantity at this point
  Finding out about bot visits

Hi,

I've been working on my sites a little while and I use a comprehensive paid for tracking package which provides loads of useful info.

If noticed that many people talk about finding out that they have google visit their site everyday or msn visits every so often.

How do I tell this from my stats package. I have a basic allow all robots.txt but that doesn't have any tracking on. Should I put tracking on this txt file ? Would that help.

HOw do I look at "raw logs" and what am i looking for if i can find anything in these ?

Thanks

Tiger
Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!
Reply With Quote
Sponsored Links
  #2 (permalink)  
Old 03-02-05
Registered User
 
Join Date: Mar 2004
Location: Reading, UK
Posts: 301
Thanks: 0
Thanked 0 Times in 0 Posts
dmorison is an unknown quantity at this point
Hi Tiger,

Quote:
How do I tell this from my stats package
If your stats package works by using a hidden image on your pages then it is unlikely that it will include search engine information as the crawlers do not generally request image URLs. If this is the case, you will have to find your raw log files - and where they might be and what state they might be in depends on how/where your hosting is configured and what acess you have to it - can you provide more details?

Once you have got hold of your raw logs, you can identify a bot through its "User-Agent" string - the part of a log entry that normally tells you what web browser is being used by the client accessing your website.

A normal log entry looks something like this:

1.2.3.4 - - [03/Feb/2005:16:11:29 +0000] "GET / HTTP/1.1" 200 134 "" "Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1) Opera 7.54u1 [en]"

The last entry on this log record being the user-agent; which in this case is Opera. A search engine bot on the other hand will identify itself something like:

1.2.3.4 - - [03/Feb/2005:08:10:38 +0000] "GET / HTTP/1.0" 200 1887 "-" "Googlebot/2.1 (+http://www.google.com/bot.html)"

If you can open your log files into a text editor you can use the search function to look for these tell-tale signs of the search engine robots.

If you're interested in a particular search engine, have a look at the following page:

http://www.robotstxt.org/wc/active/html/index.html

If you browse the record for the search engine you want to look out for it will show you what user-agent string that search engine's robot uses.

Hope this helps!

Last edited by dmorison; 03-02-05 at 09:28 PM..
Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!
Reply With Quote
  #3 (permalink)  
Old 02-03-05
Registered User
 
Join Date: May 2004
Posts: 31
Thanks: 0
Thanked 0 Times in 0 Posts
tigger is an unknown quantity at this point
you could try http://www.darrinward.com/spydertrax.htm

its a neat little program that tells you about spider activity on your site plus its free
Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!
Reply With Quote
Reply

Bookmarks


Currently Active Users Viewing This Thread: 1 (0 members and 1 guests)
 
Thread Tools
Display Modes

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is Off
HTML code is Off
Trackbacks are On
Pingbacks are On
Refbacks are On


Affiliate Marketing RSS Feeds - Contact Us - Affiliate Marketing - Archive - Privacy Statement - Top

Content Relevant URLs by vBSEO 3.2.0 RC7