Results 1 to 4 of 4

 

Thread: Robots.txt and robots meta tag use?

  1. #1
    Registered User

    Status
    Offline
    Join Date
    Aug 2006
    Posts
    22
    Thanks
    0
    Thanked 0 Times in 0 Posts


    Hi all

    Not far away from the launch of my first site now, and have read a little about robots.txt and meta tags.

    I've got a pretty small site about 30 pages at most for launch, but this is going to grow over time. Do all search engines look at the robot meta tag, or is the robots.txt file the only way to stop bots indexing unneccessary content?

    Also read all sites should have a robots.txt file even if it's blank?? Is that the case?

    Most grateful of any advise.
    N.

  2. #2
    tbp
    Registered User

    Status
    Offline
    Join Date
    Dec 2006
    Posts
    1,998
    Thanks
    0
    Thanked 22 Times in 22 Posts
    All search engines and crawlers should read the robots.txt tag, and follow the directives within, which tells them the files or directories they aren't allowed to index.

    You can also use the meta tag:

    < remove_this_meta name="robots" content="noindex,nofollow">

    on a page to stop it being indexed, but the robots.txt file is the best solution, although you have to be very carefully with pattern matching so you don't accidentally tell them not to index your whole site!

    Its best to create a robots.txt site for every site, even if its empty, as the spiders frequently request it, and if it doesn't exist your error log files can fill up with 404 requests for the file.

    Have a look at http://www.robotstxt.org for lots of info on this topic.
    Last edited by tbp; 27-02-07 at 03:24 PM.

  3. #3
    Registered User

    Status
    Offline
    Join Date
    Aug 2006
    Posts
    22
    Thanks
    0
    Thanked 0 Times in 0 Posts
    Thanks for the advice, looks like the Robots.txt file is the way to go.

    You couldn't answer another minor question I have talking about 404's. Want to create a custom 404 page on site, rather than sending them to a standard one, in order to keep visitors on my site. How would I go about adding this to my site?

  4. #4
    tbp
    Registered User

    Status
    Offline
    Join Date
    Dec 2006
    Posts
    1,998
    Thanks
    0
    Thanked 22 Times in 22 Posts
    If you're using Apache, add the following to an .htaccess file in the root of your site:

    ErrorDocument 404 /path_to_custom_404_page.htm

    You`ll need to have the relevant permissions for the htaccess file, as some directives can be set to be ignored, so you`ll need to talk to your host if this doesnt work.



Thread Information

Users Browsing this Thread

There are currently 1 users browsing this thread. (0 members and 1 guests)

Similar Threads

  1. site not listed on Google!
    By skywalker786 in forum Affiliate Marketing Lounge
    Replies: 9
    Last Post: 20-09-06, 10:34 AM
  2. Robots Tag
    By gw3000 in forum Programming
    Replies: 4
    Last Post: 09-06-04, 12:00 PM

Bookmarks

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •  
To Top

Content Relevant URLs by vBSEO 3.5.0 RC2