70 total blogs and 104 members.

A Real Estate Network

Blog

Duplicate Content, Generic Content, Scraped Content, Syndicated Content…What’s The Diff?

One of the great things about Positive Real Estate Professionals is all the functionality that can be incorporated into your PREP website; if you would like to view some of the general functionality geared toward making your site more visitor friendly, simply visit the Test Account. Keep in mind that the general functionality is by no means exhaustive nor does it include SEO tweaks. Much of the work done to make your PREP site more friendly to the various search engines and visitors is done behind the scenes.

This brings me to one of the main points of this post. The myths that surround duplicate content, generic content, and syndicated content. First, let’s use Google’s definition of duplicate content as they are the most authoritative and widely used search engine:

Duplicate content generally refers to substantive blocks of content within or across domains that either completely match other content or are appreciably similar. (Source: Deftly Dealing With Duplicate Content)

Google is very clear and direct on the issue of duplicate content and penalties stating, “Having duplicate content can affect your site in a variety of ways; but unless you’ve been duplicating deliberately, it’s unlikely that one of those ways will be a penalty. (Source: Demystifying the “duplicate content penalty“)” If however you add new and unique content that enhances the scraped or republished content you will not be penalized for duplicate content. This brings me to what I call generic content. Generic content is generally mass produced and available for use by anyone. Some of this content is quite good and some is not. The point is not the quality of the content but its widespread use on many different sites with no unique content added to it by the site’s owner. What this means is that Google sees the hundreds (or more) websites that use this content as using duplicate content and penalizes them because there is often no author credit given and no link to the content’s source. This brings up a rather hotbutton term, ’scraping.’ Scraping is a relatively new term that generally refers to the redistribution of content without the webmaster’s consent and as such does not negatively affect the person being scraped (Source: Duplicate content due to scrapers). In journalism and academic circles, scraping would be considered plagiarism. You may be wondering how using this generic and scraped content is different from syndicating your own content. There are a couple of major but important differences that I have already alluded to. First, syndicated content is not ‘copied and pasted’ onto a website it utilizes an RSS feed to acquire the content. Second, syndicated content always provides a link back to the source of the feed.

Don't Clone Me BRO!

Now for the possible penalties. According to Google:

Duplicate content on a site is not grounds for action on that site unless it appears that the intent of the duplicate content is to be deceptive and manipulate search engine results. (Source: Demystifying the “duplicate content penalty“)

To me the terms ‘deceptive’ and ‘manipulate search engine results’ are subjective but warrant a closer look. Deceptive could include trying to directly or indirectly take ownership of another person’s content. Google is very efficient at finding the original source of duplicate content, generic content, and scraped content. This original source is then indexed as the authoritative URL for that content and appears at the top of the search results. The site which uses the content with or without the consent of the webmaster is directly penalized with lower Page Rank (PR) which in turn means the site is also returned lower in search engine results. If you have ever visited one of these article farm websites you will often notice that the site which holds these generic articles has a pretty good PR while sites that use the material generally do not (or have a lesser PR than if they did not use the generic material). The manipulate search engine results would seem to imply some level of deception. Some sites will publish content that is basically keywords that are loosely strung together to create a post. In some instances there is not even an attempt to disguise a keyword page or post as an article and you basically have a bunch of keywords masquerading as content. This practice, termed ‘keyword stuffing,’ seems to fit under both the deceptive and manipulative categories. It is widely believed that keywords should not collectively comprise more than 10% of the total words in your article because Google penalizes sites that attempt to manipulate search results by keyword stuffing.

We spend numerous hours daily reading articles from authoritative websites. Some of this information is applicable and beneficial to the PREP network and some is not. We then begin testing various theories and techniques that we have encountered or devised ourselves. This testing is generally never seen by nor does it impact the members of Positive Real Estate Professionals and visitors to the network. If there is a chance that something may affect the network, we will conduct this testing on other sites that we maintain for this very purpose. Your site’s function and optimization is one of our main goals because this is how your site is found and if your site isn’t found, what is its purpose. I hope this post helps to clarify questions you may have had regarding content.

Google Buzz
Share and Enjoy:
  • Print
  • Digg
  • del.icio.us
  • Facebook
  • Mixx
  • Google Bookmarks
  • Blogplay
  • Add to favorites
  • FriendFeed
  • LinkedIn
  • MisterWong
  • Ping.fm
  • Propeller
  • Reddit
  • StumbleUpon
  • Technorati
  • Twitter
  • Yahoo! Bookmarks
  • Yahoo! Buzz
  • Tumblr
  • BlinkList
  • MySpace
  • NewsVine
Retweet this post

4 responses to “Duplicate Content, Generic Content, Scraped Content, Syndicated Content…What’s The Diff?”

  1. Mihaela Lica pamil-visions.net says:

    Well, the idea that there is no penalty always bugged me somehow. If they choose not to show your pages, they call it “filtering” – but for the webmasters affected by this “filtering” I think it is a penalty. It’s a matter of walking in different shoes. Google’s shoes say :”duplicate content penalty is a myth.” The webmasters say: “my site was penalized for having duplicate content.” Probably everything that doesn’t go according to some peoples’ will can be regarded as a penalty.

  2. @ Mihaela Lica – I can definitely see your point. By Google’s own statements, they try to find the original source of information and cluster all the URLs behind what they feel is the original URL. Google also freely admits that though they are very good at finding the original URL, they do not always get it right. This is a disturbing but not altogether unexpected admission. It does seem that if you are part of Google’s cluster but not the ‘original’ URL and therefore do not show that that is a penalty. I do think that perspective, whether Google’s or the webmaster’s plays a huge part in how this non-penalty, penalty is viewed. I am a proponent of original content with source citations if someone else’s content is being cited for authoritative or incidental purposes. This source citation is something I learned in school. IMHO, credit given to the original author coupled with new, original material can enhance and further the thinking of the source material but simply using (or plagiarizing) the source material of someone else is completely unacceptable and does nothing to further either the thinking or the conversation.

  3. James Goksina searchpro.net says:

    Very useful post. where can i find more articles on this subject ?

  4. @ James – I have linked, as a citation in this post, to several Google articles where more info can be found. These links are to Google’s official blog.

Leave a Reply

icon_wink.gif icon_neutral.gif icon_mad.gif icon_twisted.gif icon_smile.gif icon_eek.gif icon_sad.gif icon_rolleyes.gif icon_razz.gif icon_redface.gif icon_surprised.gif icon_mrgreen.gif icon_lol.gif icon_idea.gif icon_biggrin.gif icon_evil.gif icon_cry.gif icon_cool.gif icon_arrow.gif icon_confused.gif icon_question.gif icon_exclaim.gif 
CommentLuv Enabled
© 2009-2010 Positive Real Estate Professionals All Rights Reserved -- Copyright notice by Blog Copyright