Thread: WordPress robots.txt settings

  1. #1
    Registered User



    I'm sure Google has just slapped my site for duplicate content. I'm beginning to hate WordPress!

    Is there a good, up-to-date guide on basic robots.txt settings for WordPress anywhere?

    Also, should I be fooling with meta settings too?

    Ta
    Paul

  2. #2
    kickflip
    Registered User

    I use this in my robots.txt files on WordPress. I can't remember where I picked it up originally:
    Code:
    User-agent: *
    Disallow: /cgi-bin
    Disallow: /wp-admin
    Disallow: /wp-includes
    Disallow: /wp-content/plugins
    Disallow: /wp-content/cache
    Disallow: /wp-content/themes
    Disallow: /trackback
    Disallow: /feed
    Disallow: /comments
    Disallow: /category/*/*
    Disallow: */trackback
    Disallow: */feed
    Disallow: */comments
    Allow: /wp-content/uploads
    
    # Google Image
    User-agent: Googlebot-Image
    Disallow:
    Allow: /*
    
    # Google AdSense
    User-agent: Mediapartners-Google*
    Disallow:
    Allow: /*
    
    # Internet Archiver Wayback Machine
    User-agent: ia_archiver
    Disallow: /
    
    # digg mirror
    User-agent: duggmirror
    Disallow: /
    
    Sitemap: http://www.yourdomain.co.uk/sitemap.xml
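
If you want to see what rules like these actually block before uploading them, a rough check can be done with Python's standard `urllib.robotparser`. This is only a sketch: the sample paths and the `blocked_paths` helper are made up for illustration, and only the plain prefix rules are included, because the standard-library parser does simple prefix matching and does not interpret the `*` wildcard lines.

```python
from urllib.robotparser import RobotFileParser

# Prefix-only subset of the rules posted above. The wildcard lines
# (e.g. "*/feed") are left out because urllib.robotparser matches
# plain path prefixes and would treat "*" as a literal character.
RULES = """\
User-agent: *
Disallow: /wp-admin
Disallow: /wp-includes
Disallow: /wp-content/plugins
Disallow: /feed
Disallow: /trackback
Allow: /wp-content/uploads
"""


def blocked_paths(rules, paths, agent="*"):
    """Return the paths that the given rules disallow for `agent`."""
    parser = RobotFileParser()
    parser.parse(rules.splitlines())
    return [p for p in paths if not parser.can_fetch(agent, p)]


# Hypothetical URLs from a typical WordPress install.
sample = [
    "/",
    "/wp-admin/options.php",
    "/feed/",
    "/wp-content/uploads/pic.jpg",
    "/2010/03/my-post/",
]
print(blocked_paths(RULES, sample))
```

Search engines that support wildcards may treat the full file differently from this prefix-only check, so it is worth confirming the live file with the search engine's own robots.txt testing tool as well.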

  3. #3
    Registered User

    Why do you think this has anything to do with WP?

  4. #4
    Registered User

    Quote: Originally posted by gadget
    Why do you think this has anything to do with WP?

    Because initially I had a single page with some products and some text on it, and since publishing the WordPress site it dropped out of the rankings and then came back lower.

    I have a side menu that presents the products from a database in different ways, e.g. by manufacturer, by type of product, by manufacturer and type of product, etc. I'm still working on the site.

    I don't have anything other than products on these pages. They link through either to a post with a detailed description and price comparison, or directly to the item on the merchant's site. So I think they are being seen as duplicate content. I also think that the same content can be reached by going through different files in the WP directories.

    I had a quick look at duplicate content and WP and found a load of articles, mostly out of date, on setting up robots.txt to prevent duplicate content, as it is evidently a common problem.

    Just trying to close off everything obvious to see what effect it has.

  5. #5
    Registered User

    Quote: Originally posted by kiddaclo
    I use this in my robots.txt files on WordPress. I can't remember where I picked it up originally:
    Code:
    User-agent: *
    Disallow: /cgi-bin
    Disallow: /wp-admin
    Disallow: /wp-includes
    Disallow: /wp-content/plugins
    Disallow: /wp-content/cache
    Disallow: /wp-content/themes
    Disallow: /trackback
    Disallow: /feed
    Disallow: /comments
    Disallow: /category/*/*
    Disallow: */trackback
    Disallow: */feed
    Disallow: */comments
    Allow: /wp-content/uploads
    
    # Google Image
    User-agent: Googlebot-Image
    Disallow:
    Allow: /*
    
    # Google AdSense
    User-agent: Mediapartners-Google*
    Disallow:
    Allow: /*
    
    # Internet Archiver Wayback Machine
    User-agent: ia_archiver
    Disallow: /
    
    # digg mirror
    User-agent: duggmirror
    Disallow: /
    
    Sitemap: http://www.yourdomain.co.uk/sitemap.xml
    I think these are the settings from the WordPress Codex.

    I've started with that, having changed it to reflect my directory structure (and domain!), but there is normally something a bit better. I did notice someone who had changed their header.php to insert meta settings depending on what type of page/post the bot was on.
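
The header.php idea boils down to picking a robots meta value per page type. Here is a minimal sketch of that decision logic, written in Python only to show the shape of it. The page type names and the choice of which ones get `noindex` are assumptions for illustration, not anyone's actual settings; in a real theme the type would come from WordPress conditional tags such as `is_single()`, `is_category()` or `is_search()`.

```python
# Hypothetical page-type names. Which types get "noindex" is an
# assumption for illustration; adjust it to suit the site.
NOINDEX_TYPES = {"category", "tag", "date_archive", "author_archive", "search"}


def robots_meta(page_type: str) -> str:
    """Return a robots meta tag for the given (hypothetical) page type."""
    if page_type in NOINDEX_TYPES:
        # Keep listing pages out of the index but let bots follow links.
        content = "noindex,follow"
    else:
        content = "index,follow"
    return f'<meta name="robots" content="{content}" />'


for kind in ("single", "category", "search"):
    print(kind, robots_meta(kind))
```

The point of `noindex,follow` on listing pages is that the bot can still crawl through to the individual posts while the near-duplicate listings themselves stay out of the index.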

  6. #6
    Keith
    Moderator

    This robots meta plugin is great for avoiding dupe content: Meta Robots WordPress plugin - Yoast - Tweaking Websites

  7. The Following User Says Thank You to Keith For This Useful Post:

    paultp (08-03-10)

  8. #7
    Registered User

    Quote: Originally posted by paultp
    Because initially I had a single page with some products and some text on it, and since publishing the WordPress site it dropped out of the rankings and then came back lower.

    I have a side menu that presents the products from a database in different ways, e.g. by manufacturer, by type of product, by manufacturer and type of product, etc. I'm still working on the site.

    I don't have anything other than products on these pages. They link through either to a post with a detailed description and price comparison, or directly to the item on the merchant's site. So I think they are being seen as duplicate content. I also think that the same content can be reached by going through different files in the WP directories.

    I had a quick look at duplicate content and WP and found a load of articles, mostly out of date, on setting up robots.txt to prevent duplicate content, as it is evidently a common problem.

    Just trying to close off everything obvious to see what effect it has.

    Can you post an example?

  9. #8
    pete_coles
    Administrator

    Quote: Originally posted by Keith
    This robots meta plugin is great for avoiding dupe content: Meta Robots WordPress plugin - Yoast - Tweaking Websites

    On the money, mate. Simply a brilliant plugin. Joost is also excellent at keeping his plugins up to date.

  10. The Following User Says Thank You to pete_coles For This Useful Post:

    paultp (08-03-10)

  11. #9
    Registered User

    Quote: Originally posted by pete_coles
    On the money, mate. Simply a brilliant plugin. Joost is also excellent at keeping his plugins up to date.

    I think I had this site recommended to me before. Now that I understand a bit more about WordPress, I'll have another look.

    Cheers
    Paul

  12. #10
    Registered User


  13. The Following User Says Thank You to gadget For This Useful Post:

    paultp (10-03-10)
