• On MovieTome: Is this supposed to be Cobra Commander?
April 11, 2008 7:00 AM PDT

Understanding duplicate content: Outside view

by Brian R. Brown

Are you being outranked by you? Is "your" content showing up in searches, but on sites that aren't yours? Do you have multiple websites that compete against each other? Well this discussion on duplicate content from external sources should be right up your alley.

Earlier in the week, I started our discussion on duplicate content by trying to lay to rest the idea of a duplicate content penalty. Now we pick up that discussion with one aspect of duplicate content . . . content duplication from other sites.

While I'd love to start out our discussion with the idea that external duplicate content is the hardest to deal with, that may not always be the case as you'll see when we talk about duplication on our own websites. For now though, we are just going to focus on content duplication from other sites.

At this point, you are probably in one of two camps--the "Yes, help me with this please," camp or the "What in the world are you talking about?" camp. So let's start by getting everyone in the same camp at least. External content duplication can come about, generally, in three ways.

Content Theft

In every aspect of life, there are those who want to get ahead through the hard work of others, even illegally or unethically. The Web is certainly no exception to this, especially given the fact that, of all the ways to take advantage of the hard efforts of others, copy-paste must certainly be the laziest--I mean easiest.

Don't feel that this is an issue that only affects big name brands and sites, because anyone who publishes online is susceptible to this kind of attack. Keep in mind that what we are talking about here is essentially copyright infringement, not phishing sites and things like that, which is a whole other level of criminal activity.

Realistically, this is probably the hardest to combat, but in many cases, probably doesn't cause as much damage as you might think. In many ways, we might thank the search engines for this. They're out to deliver the best results they can to searchers and are certainly aware of these issues. Because of this, I truly believe they work really hard to identify authoritative and original sources of content. They can compare content they find based on when they found it, as well as links leading back to that content, and while purely speculation, I would have to imagine that it would be pretty easy for the engines to assign a score to any site based on the proportion of content on the site that appears elsewhere and determine natural and unnatural patterns.

So what can you do about content theft? While you can file reports with the search engines based on the Digital Millennium Copyright Act (just search on "Google copyright infringement" or the respective search engine for specific details), the ISP that hosts the infringing domain, or seek even greater legal action, it may be better to first weigh the impact you feel it really has as well as the resources it may take to fight it and determine whether it is worth your attention to begin with. And sometimes, just an email or letter to the infringer might be enough

Content Syndication

Ironically, you are probably the most responsible for your own duplicate content on other sites. Writing content and syndicating through article directories or other content syndication services, RSS feeds of blog posts, and press release syndication will probably make up far more of your duplication woes than pirated content.

Each of these instances can be addressed though. Article writing and similar content is best kept unique and different from any content you have on your own site. When it comes to this kind of content, it is often best to develop content for the sites where it is going to be placed anyway, rather than a mass distribution. Of course, you'll also want to include a byline with a link back to your site.

Blog syndication can be handled a little differently. You may decide to include only a summary of your post, or the full post. The pros and cons here must be weighed, since a partial feed may discourage some sites from even syndicating your blog. In many cases, there may be enough differentiation between your blog and the sites where your post is syndicated anyway. However the best solution is to also include an absolute link back to the blog post on your own site. This helps signal to the search engines that your post is the source.

Press releases can be handled the same way as these other content pieces. Whether you are distributing through wire services or using RSS to syndicate from your site, including links back to your site helps signal the source. Press releases also tend to be more temporary on external sites, though you should certainly keep an archive on your own site.

Micro-Sites

The final source of external content also falls under your control. Micro-site strategy consists of creating additional websites, often around niche topical areas. This strategy evolved out of the idea that if one website was good, then many websites must be better, and would increase the chances of ranking in search engines and the number of listings for a particular search. Some view micro-sites as a good thing, while others view them as bad, however neither view is particularly accurate. Rather, it is the implementation that makes them good or bad.

Micro-site strategy is a much bigger topic, but bad implementation is directly related to our discussion of duplicate content. Most micro-site implementations result in identical or nearly identical duplication of the main website's pages on the various micro-sites. This isn't surprising since creating unique content for one site, especially for an ecommerce site, is often challenging enough without having to create unique content for multiple sites. But rather than improving or increasing rankings, the micro-sites tend to directly compete with the main site and greater resources are needed to maintain multiple sites. Needless to say, this is why most micro-site implementations are bad.

Like many things, there are a few tools that can be used in the fight against duplicate content. One tool to help you keep on top of potential content theft issues is Copyscape, that allows you to enter in your page and it comes back with a list of potential duplication.

Brian Brown is a Consultant & Natural Search Marketing Strategist for Netconcepts. He is a member of the CNET Blog Network, and is not an employee of CNET. Disclosure.
Recent posts from Searchlight
Be unique to avoid duplicate content
Selling duplicate content
Book review: How To Make Money With Your Blog
Yahoo Suggest: The Good, the Bad, and the Unbelievable
Understanding duplicate content: Outside view
Flickr adds video to photo sharing services
Duplicate content: Separating the penalty from the filter
Use SEO to optimize your recession
Add a Comment (Log in or register) (3 Comments)
  • prev
  • 1
  • next
by BrickMarketing April 11, 2008 7:55 AM PDT
How about talking about the effect that duplicate content truly has on a site and if it has any at all when your site is the one being copied? Many people are a bit confused on those subjects.
Reply to this comment
by brbrown April 14, 2008 11:58 AM PDT
As long as the steps above are being followed (including links back to the original site) then most sites probably shouldn't be impacted too deeply. The links back should help to signal the canonical source of the information. Some who steal content do it programmatically by scraping pages and many of these links will remain, and for those who do strip out the links, they are probably stealing content from many other sites and therefore don't have a very authoritative profile to begin with.

If the content appears on another site that is considerably more authoritative to than the original site, that is an occasion when the syndicated content may outrank the original site. However at this point, an authoritative site like that is more likely to include the byline or links back to the original source, and in this case, that site may get more traffic and therefore even being outranked in search results may be beneficial since that site may send more traffic or further boost the authority of the original site because of the content and the links.

But this is why it is better to try to avoid those situations anyway by offering to create unique content for those highly authoritative sites to begin with.

In a lot of cases, there is enough difference between the two pages on the differing sites that duplicate content filtering may not come into play anyway.

The bigger issue for most sites will be internal duplication...which I'll be diving into next.
Reply to this comment
by SEOCOMPANY May 11, 2009 12:32 AM PDT
A really interesting post wherein different means of content duplication is seen.
Reply to this comment
(3 Comments)
  • prev
  • 1
  • next
advertisement

Can RIM get its mojo back?

The new BlackBerry Tour, carried by Verizon and Sprint, arrives Sunday, even as RIM seems to be losing sales to exclusive devices like the iPhone and Pre.

With Chrome, Google reignites the OS wars

roundup Google Chrome OS, due in 2010, underscores the Web giant's cloud-computing ambitions and opens new competition with Microsoft.
• What Chrome OS has on Windows that Linux doesn't

About Searchlight

Search engine optimization expert Stephan Spencer and analysts from Net Concepts share late-breaking SEO tools, tips, trends, resources, news and insights. Stephan is the founder and president of Netconcepts, a web agency specializing in search engine optimized ecommerce. Clients include Discovery Channel, AOL, Home Shopping Network, Verizon SuperPages.com, and REI, to name a few. Stephan is a frequent speaker at Internet conferences around the globe. He is also a Senior Contributor to MarketingProfs.com, a monthly columnist for Practical Ecommerce, and he's been a contributor to DM News, Multichannel Merchant, Catalog Success, Catalog Age, and others. The blog is part of the CNET Blog Network and the authors are not employees of CNET. Disclosure.

Add this feed to your online news reader

Searchlight topics

advertisement
advertisement

Inside CNET News

Scroll Left Scroll Right