Results 1 to 10 of 10

 

Thread: Internal duplicate content issues - how to fix?!

  1. #1
    JamesBuzz's Avatar
    Registered User

    Status
    Offline
    Join Date
    Jun 2010
    Posts
    13
    Thanks
    0
    Thanked 0 Times in 0 Posts


    What's the best way to overcome this internal dup. content problem?

    We have what are essentially product type pages with a lot of content generated by AJAX, without its own URL.

    But within each product page, users can also filter by different specs, which reduces some of the content, but still has a huge amount of the same content as the main level page.

    An example:

    A main product type page - Ford | Focus | Reviews | Carbuzz

    One filtered by a spec (engine type) - Ford | Focus | Reviews | Carbuzz

    As you'll see the content that a spider would see is v.similar.

    We can't redesign the site. Would it be a good idea to add rel=canonical to the engine level pages, so we have one decent page (the main car page) with a lot of content that doesn't appear on other of our URLs? Or is there another way to solve this?!

    Thanks!

    James

  2. #2
    apg1985's Avatar
    Registered User

    Status
    Offline
    Join Date
    Mar 2011
    Posts
    412
    Thanks
    1
    Thanked 17 Times in 11 Posts
    Can you not put a nofollow tag on the filter links? Or even remove the filter pages in the robots file

  3. #3
    JamesBuzz's Avatar
    Registered User

    Status
    Offline
    Join Date
    Jun 2010
    Posts
    13
    Thanks
    0
    Thanked 0 Times in 0 Posts
    Not sure to be honest. Anyone know?

  4. #4
    mcharalambous's Avatar
    Registered User

    Status
    Offline
    Join Date
    Mar 2011
    Posts
    18
    Thanks
    0
    Thanked 0 Times in 0 Posts
    Hmmmm it is a tricky one. I've handled something like this myself, but we totally rebuilt the whole site structure.

    It's really not worth even using a system like that at all, although effective for the user it's horrid for search engines. I'd suggest re-doing the entire way that works to make it both user and search engine friendly.

    However, i'd suggest having one landing page for each of the options and blocking out the other pages when merged. So for example, decide which option is the most important, in this case it may be the make of the car, so Ford.

    You'd set up a custom Landing page for Ford cars, and any filters/searches within that category would be blocked out by search engines using robots.txt or meta noindex. Each of the unique Ford models can still have their own landing pages, and it's those, along with the Ford landing page that will bring in the traffic so it really doesnt matter if you block out the filter pages.

    Let me know if that makes sense. If not, i'll try and show you an example...

    Thanks
    Michael

  5. #5
    Registered User

    Status
    Offline
    Join Date
    Oct 2006
    Location
    Melbourne, Australia
    Posts
    25
    Thanks
    0
    Thanked 1 Time in 1 Post
    Quote Originally Posted by apg1985 View Post
    Can you not put a nofollow tag on the filter links?
    The nofollow tag doesn't prevent the link being crawled, it just tells Google not to pass page juice to the referring url/link.

    EG - internally, good to use the nofollow attribute on a contact page. Although the page is useful in being indexed, there's no advantage in passing juice to this page or pages like this.


    Cheers

    Dave

  6. #6
    mcharalambous's Avatar
    Registered User

    Status
    Offline
    Join Date
    Mar 2011
    Posts
    18
    Thanks
    0
    Thanked 0 Times in 0 Posts
    Quote Originally Posted by AussieDave View Post
    EG - internally, good to use the nofollow attribute on a contact page. Although the page is useful in being indexed, there's no advantage in passing juice to this page or pages like this.
    Sorry Dave but this is in fact incorrect. Although make a good point above, this is no longer a used tactic as the Juice/PageRank that would flow to the contact page is lost, rather than passed to another page.

    It is never worth No-following any link internally.

    Instead, use the robots.txt file to block out search engines, or implement a Meta NoIndex tag on the pages you wish not to be crawled.

    Thanks
    Mike

  7. #7
    Registered User

    Status
    Offline
    Join Date
    Mar 2010
    Posts
    42
    Thanks
    0
    Thanked 0 Times in 0 Posts
    Personally I would ness around with the nofollow tag and test with that for a while as that should definitely help. You could even possibly talk to a programmer at freelancer.com to see what one of them can do for you - I have found some great programmers there for a good price.

  8. #8
    JamesBuzz's Avatar
    Registered User

    Status
    Offline
    Join Date
    Jun 2010
    Posts
    13
    Thanks
    0
    Thanked 0 Times in 0 Posts
    Is it possible to block only certain pages using robots.txt. e.g. Test it out on say 10 cars and monitor the results?

  9. #9
    Registered User

    Status
    Offline
    Join Date
    May 2008
    Posts
    50
    Thanks
    0
    Thanked 3 Times in 3 Posts
    You're better off using <meta name="robots" content="nofollow,noindex"> in the head of each page containing duplicate content as Google/Bing/Yahoo will not always read the robots.txt file (even though they should).

    rel=canonical is the recommended solution but I think you should apply this on a page by page basis until you're sure that Google is displaying the correct/most relevant content.
    These are mine: Voucherfrenzy - Hot Deals + Netbook news + 32 LED

  10. #10
    Registered User

    Status
    Offline
    Join Date
    Mar 2010
    Posts
    42
    Thanks
    0
    Thanked 0 Times in 0 Posts
    Thank you for that code - that actually helps me a lot. I am the worst with techie stuff! But that really helps so thank you.



Thread Information

Users Browsing this Thread

There are currently 1 users browsing this thread. (0 members and 1 guests)

Bookmarks

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •  
To Top

Content Relevant URLs by vBSEO 3.5.0 RC2