Go Back   Webmaster Forums UK SEO SEM Webmaster Community Forum - UKWW > Web Design and Website Development > Help and Tutorials for new Webmasters
Register FAQ Members List Downloads Calendar Today's Posts Webmaster Resources Webmaster Blogs
 
 

Help and Tutorials for new Webmasters Help and tutorials for people new to the Internet and webmastering in particular, there is no such thing as stupid question, please fell free to question about any aspects of webmastering you need help with.
Sub Forums::Content Management System ::Webmaster Toolbox

Reply
 
LinkBack Thread Tools Display Modes
  #1 (permalink)  
Old 05-27-2008, 05:25 PM
Senior Member
148 posts this year. Executive spud!
We have not managed to scare them away..
Last months UKWW Tokens: 6
 
Join Date: Jan 2007
Posts: 162
Thanks: 1
Thanked 1 Time in 1 Post
Nominated 0 Times in 0 Posts
TOTW/F/M Award(s): 0
Default 12 Ways Webmasters Create Duplicate Content

Quote:
Here are 12 ways people unintentionally create dupe content:

1) Build a site for the sole purpose of promoting affiliate offers, and use the canned text supplied by the agency managing the affiliate program.

2) Generate lots of pages with little unique text. Weak directory sites could be an example of this.

3) Use a CMS that allows multiple URLs to refer to the same content. For example, do you have a dynamic site where E-Commerce Hosting pulls up the exact same content as E-Commerce Hosting If so, you have duplicate content. This is made worse if your site actually refers to these pages using multiple methods. A surprising number of large sites do this.

4) Use a CMS that resolves sub domains to your main domain. As with the prior point, a surprising number of large sites have this problem as well.

5) Generate pages that differ only by simple word substitutions. The classic example of this is to generate pages for blue widgets for each state where the only difference between the pages is a simple word substitution (e.g. Alabama Blue Widgets, Arizona Blue Widgets, …).

6) Forget to implement a canonical redirect. For example, not 301 redirecting .com to .com (or vice versa) for all the pages on your site. Regardless of which form you pick to be the preferred form of URL for your site, someone out there will link to the other form, so implementing the 301 redirect will eliminate that duplicate content problem for you, as well as consolidate all the page rank from your inbound links.

7) Having your on site links back to your home page link to .com/index.html (or index.htm, or index.shtml, or …). Since most of the rest of the world will link to .com, you now have created duplicate content, and divided your page rank, if you have done this.

8) Implement printer pages, but not using robots.txt to keep them from being crawled.

9) Implement archive pages, but not using robots.txt to keep them from being crawled.

10) Using Session ID parameters on your URLs. This means every time the crawler comes to your site it thinks it is seeing different pages.

11) Implement parameters on your URLs for other tracking related purposes. One of the most popular is to implement an affiliate program. The search engine will see .com?affid=1234 as a duplicate of .com. This is made worse if you leave the “affid” on the URL throughout the user’s visit to your site. A better solution is to remove the ID when they arrive at the site, after storing the affiliate information in a cookie. Note that I have seen a case where an affiliate had a strong enough site that .com?affid=1234 started showing up in the search engines rather than .com (NOT good).

12) Implement a site where parameters on URLs are ignored. If you, or someone else, links to your site with a parameter on the URL, it will look like dupe content.
There are many ways that people intentionally create duplicate content, by various scraping techniques, but there is no need to cover that here.
Source: Ramblings About SEO Blog Archive 12 Ways Webmasters Create Duplicate Content
Digg this Post!Add Post to del.icio.usStumble this Post!Wong this Post!
Reply With Quote
  #2 (permalink)  
Old 05-28-2008, 12:28 AM
Gorkfu's Avatar
Senior Member
223 posts this year. worth their weight in gold!
Trusted Member - And full of good stuff!
Last months UKWW Tokens: 9
 
Join Date: May 2008
Location: USA
Posts: 168
Thanks: 0
Thanked 1 Time in 1 Post
Nominated 0 Times in 0 Posts
TOTW/F/M Award(s): 0
Default

This is a great list! I had something similar to number 11 happen in the past. I just want to share the issue in case anyone ever has the same problem.

Google crawled a new site I created, that I had yet to link to from anywhere. They were able to crawl it because my host automatically had directory list on as default in apache. I turned it off of course and I recommend anyone running apache do the same. You can do it by adding this to a .htaccess file.
Options -Indexes

Google ended up making like 10 links like this: sitename.com/?id=3 etc. The only way I found Google would get rid of it from their index was by making disallows in my robots.txt for each link. Deleting the URLs in Google's webmaster tools will simply not work.
Digg this Post!Add Post to del.icio.usStumble this Post!Wong this Post!
Reply With Quote
  #3 (permalink)  
Old 07-06-2008, 08:25 AM
Member
45 posts this year. i see smoke!
It looks like they have moved their luggage in.
 
Join Date: Dec 2007
Location: Timisoara, Romania
Posts: 30
Thanks: 0
Thanked 0 Times in 0 Posts
Nominated 0 Times in 0 Posts
TOTW/F/M Award(s): 0
Send a message via Yahoo to geme
Default

nice article pow-wow... keep up the good work on this forum!
__________________
BMW News | TV Online | Videoclipuri
Digg this Post!Add Post to del.icio.usStumble this Post!Wong this Post!
Reply With Quote
  #4 (permalink)  
Old 07-06-2008, 09:08 AM
tb987's Avatar
Senior Member
315 posts this year. worth their weight in gold!
Trusted Member - And full of good stuff!
Last months UKWW Tokens: 5
 
Join Date: Mar 2008
Posts: 280
Thanks: 2
Thanked 2 Times in 2 Posts
Nominated 0 Times in 0 Posts
TOTW/F/M Award(s): 0
Default

This is a great post - I have made and still continue to make all/some of these mistakes.

There is a 14th, if you use url rewrites make sure you exclude the none rewritten URLS and redirect them back to the friendly ones. I have noticed a site with /index.php?ref=blah in the index but it should be /blah/
Digg this Post!Add Post to del.icio.usStumble this Post!Wong this Post!
Reply With Quote
  #5 (permalink)  
Old 08-18-2008, 04:25 PM
Junior Member
11 posts this year. the lights are on!
User is on their way up.
Last months UKWW Tokens: 22
 
Join Date: Mar 2007
Posts: 11
Thanks: 0
Thanked 0 Times in 0 Posts
Nominated 0 Times in 0 Posts
TOTW/F/M Award(s): 0
Send a message via Skype™ to Tom_Sean
Default

This is a great list pow-wow! Thanks a lot. It's very useful.
Digg this Post!Add Post to del.icio.usStumble this Post!Wong this Post!
Reply With Quote
  #6 (permalink)  
Old 08-28-2008, 06:30 AM
Junior Member
4 posts this year. needs some grease!
New user, who has not interacted much yet.
Last months UKWW Tokens: 3
 
Join Date: Aug 2008
Posts: 3
Thanks: 0
Thanked 0 Times in 0 Posts
Nominated 0 Times in 0 Posts
TOTW/F/M Award(s): 0
Default

Quote:
Originally Posted by tb987 View Post
This is a great post - I have made and still continue to make all/some of these mistakes.

There is a 14th, if you use url rewrites make sure you exclude the none rewritten URLS and redirect them back to the friendly ones. I have noticed a site with /index.php?ref=blah in the index but it should be /blah/

Hi

I agree with you. This point must be included in this. I aslo noticed these types of sites.

thanks
Digg this Post!Add Post to del.icio.usStumble this Post!Wong this Post!
Reply With Quote
  #7 (permalink)  
Old 08-29-2008, 09:03 PM
Junior Member
1 posts this year. needs some grease!
New user, who has not interacted much yet.
Last months UKWW Tokens: 1
 
Join Date: Aug 2008
Posts: 1
Thanks: 0
Thanked 0 Times in 0 Posts
Nominated 0 Times in 0 Posts
TOTW/F/M Award(s): 0
Default

Good articles.Thank for sharing.
Digg this Post!Add Post to del.icio.usStumble this Post!Wong this Post!
Reply With Quote
  #8 (permalink)  
Old 09-01-2008, 11:32 AM
Member
49 posts this year. i see smoke!
It looks like they have moved their luggage in.
Last months UKWW Tokens: 5
 
Join Date: May 2008
Location: Taunton, uk
Posts: 48
Thanks: 1
Thanked 4 Times in 4 Posts
Nominated 0 Times in 0 Posts
TOTW/F/M Award(s): 0
Default

to add to that list, using the same meta data on more than one page is dupe content... like at my work we use the exact same meta data on each page (i've just started here so give me time.. when the contract ends with the seo firm they currently use i get to play.. yes an seo firm has done it... )

some may disagree with me but i class it as dupe content... i mean how can you describe 2+ seprate pages with the same info...
Digg this Post!Add Post to del.icio.usStumble this Post!Wong this Post!
Reply With Quote
Reply

Bookmarks



Currently Active Users Viewing This Thread: 1 (0 members and 1 guests)
 
Webmaster Resources
UK WW SEO Tools
Find UK Hosts
 
The Forum Rules
Forum Rules - MUST READ
 
Site Of the Month
BizzFace
Nominate site of the month
 
Tag Cloud
43. wholesale adsense ready affiliate scam amazon apple iphone 16gb apple iphone 16gb 3g articles australia web hosting cash casino cheap clothes communications content custard media database dgital camerals directory domain name english teacher fantasy football fantasy football league football league free handbags home income instant jewelry link bid link directory links money money making online music news nokia n96 16gb online online shop poker professor replica sam allcock seo social networking sony vaio laptop sunglasses technology themes tutor verbalized wallet wallets wanted web webhosting web hosting website widget ready wordpress xmas offer

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Trackbacks are On
Pingbacks are On
Refbacks are On
Forum Jump


All times are GMT. The time now is 05:29 AM.

UK Webmaster World Forums - Internet marketing, web development, domain names, SEO contest and discussuons.
Subscribe to our feeds   Subscribe to our feeds

Powered by vBulletin® Version 3.7.0
Copyright ©2000 - 2008, Jelsoft Enterprises Ltd.
LinkBacks Enabled by vBSEO 3.1.0