The Duplicate Content Debate

From an SEO perspective, too much duplicate content can land your site in the netherworld of Google's supplemental results.

Author

P.J. Fusco

Date published June 6, 2007 Categories

Duplicate content is a much-discussed (more like debated, harangued, and maligned) SEO (define) issue that just won’t go away. That’s because more duplicate content is being generated on the Web by distributive formats than ever before. This challenges search engine algorithms to be smarter and faster when presenting users with definitive, relevant search results — usually the original content source.

The major search engines know there are many valid reasons for creating duplicate content. Affiliate sales channels need to be fed content from a centralized source. Syndication services must earn their keep by distributing similar content to dissimilar sites. Content management systems render page after page of mildly realigned content as a matter of cataloging efficiency. RSS feeds distribute another rendition of the same-old content to new venues. The list goes on.

Search engines also understand there are many not-so-valid reasons for duplicate content, like hallway pages, doorway pages, and multiple-domain microsites. Then there are the unscrupulous scrapers that snag someone else’s content, duplicate it repeatedly, and interlink it in a wasteland of Web sites. These made-for-AdSense type of sites are just the sort of thing Google likes to keep out of its indices via penalties, filters, and dampening.

Google in particular understands the inherent differences between valid and invalid content duplication. Because there are many valid reasons for creating valid duplicate content, the effects aren’t readily penalized, unless one considers Google’s supplemental results a penalty.

Make no mistake — landing in supplemental results shouldn’t be considered a penalty. After all, the page is still indexed. Granted, supplemental results are a labyrinth of auxiliary Web pages that won’t generate much search-referred traffic. But it’s better to be supplemental than not be indexed at all, isn’t it?

In Google, supplemental results usually appear as regular search results if the regular index can’t serve up a myriad of relevant results. You’ll often find Web pages from supplemental results lingering on page 10 and beyond. Of course, if a searcher is performing a highly targeted query that should only return genuinely specific results, then supplementals can be on page one.

Supplemental results are a way for Google to extend its search database while preventing low-quality Web pages from getting high levels of search-referred traffic. Since supplemental results aren’t as trusted as regular results, they don’t get as much love and attention from the Google bot. Supplementals tend not to be crawled as frequently as pages in regular search results. So pages found in the supplemental index tend to be a bit stale in terms of cache, but being there isn’t usually the result of a penalty.

Most robust, highly search-referred Web sites have at least some pages in supplemental. If you’re concerned about your site’s performance in Google, keep some historical data around that can help you define whether duplicate content is contributing to poor levels of search-referred traffic.

Know how many pages you have in your site and do some domain drilldowns (site:www.domain.com) to determine when your pages go supplemental. Keep a running monthly tally of the percentage of pages in supplemental. Watch for changes, but always review content quality. Maybe duplicate content is a primary contributor toward receiving low-quality page scores from Google; maybe it’s low-value inbound links or an unrelated technical issue. You won’t know until you do some digging, and even then it will take time to understand the trends that would indicate duplicate content at the root of all evil.

Duplicate content creates a real problem for site owners because you’re allowing a search engine to select which pages are important and which aren’t. This also wastes crawl time that could be spent in more pertinent site areas.

There are simple solutions for minimizing duplicate content, such as leveraging robots.txt to keep the spiders out of print-only and PDF versions. Doing so will dedupe your content and enhance crawling efficiency. If dynamic URLs are at the root of your site’s duplicate content, rewrites to static URLs are in good order. If you’re merging two sites into one, permanent redirects offer a timely fix. No matter what’s creating the duplicate content, there are ways to minimize its effects on search-referred traffic.

Unfortunately, the effects of duplicate content are subject to debate in nearly mythical proportion and most certainly from a myopic perspective. Duplicate content penalties are very rare. It takes a great measure of dubious intent to earn a duplicate content penalty. Better to create one great site filled with unique content that naturally earns inbound links than worry about a couple of Web pages that land in supplemental results.

Join us for Search Engine Strategies on June 12-13 in Toronto.

Want more search information? ClickZ SEM Archives contain all our search columns, organized by topic.

Subscribe to get your daily business insights

More about:

Read the next article

Explore Tech Talks

Lucy

Lucy helps organizations leverage knowledge for in... View Tech Talk
TVSquared

TVSquared is the global leader in cross-platform T... View Tech Talk
Grata

Grata is a B2B search engine for discovering small... View Tech Talk

Whitepapers

US Mobile Streaming Behavior

Whitepaper | Mobile

US Mobile Streaming Behavior

Streaming has become a staple of US media-viewing habits. Streaming video, however, still comes with a variety of pesky frustrations that viewers are ...

View resource

Winning the Data Game: Digital Analytics Tactics for Media Groups

Whitepaper | Analyzing Customer Data

Winning the Data Game: Digital Analytics Tactics for Media Groups

Winning the Data Game: Digital Analytics Tactics f...

Data is the lifeblood of so many companies today. You need more of it, all of which at higher quality, and all the meanwhile being compliant with data...

View resource

Learning to win the talent war: how digital marketing can develop its people

Whitepaper | Digital Marketing

Learning to win the talent war: how digital marketing can develop its peopl...

Learning to win the talent war: how digital market...

This report documents the findings of a Fireside chat held by ClickZ in the first quarter of 2022. It provides expert insight on how companies can ret...

View resource

Engagement To Empowerment - Winning in Today's Experience Economy

Report | Digital Transformation

Engagement To Empowerment - Winning in Today's Experience Economy

Engagement To Empowerment - Winning in Today's Exp...

Customers decide fast, influenced by only 2.5 touchpoints – globally! Make sure your brand shines in those critical moments. Read More...

View resource

Mastering voice search optimization: Talk like a local, rank like a pro

Search Marketing

Mastering voice search optimization: Talk like a local, rank like a pro

1m ClickZ News Staff

Mastering voice search optimization: Talk like a l...

Forget typing, voice search is booming. Businesses need Voice Search Optimization (VSO) to rank for conversational queries and secure top spots in sea...

View article

How to Create Impactful SEO Reports that Drive Business Success

2m ClickZ News Staff

How to Create Impactful SEO Reports that Drive Bus...

Wielding graphs and analytics has its place. But to truly capture executive attention in today’s impatient digital arena, we must step into the shoes ...

View article

How Google's Search Generative Experience (SGE) is Reshaping SEO

2m ClickZ News Staff

How Google's Search Generative Experience (SGE) is...

As the search giant delves deeper into the realm of artificial intelligence (AI), it is clear that SGE will have a profound impact on the future of SE...

View article

The secrets to getting the best SEO traffic without even ranking

11m Daniel Tannenbaum

The secrets to getting the best SEO traffic withou...

Did you know that there are ways to get to the top of Google without ranking your own site? You can still get lots of good organic traffic using alter...

View article

How SEO is changing because of ChatGPT

11m Daniel Tannenbaum

How SEO is changing because of ChatGPT

When ChatGPT was introduced in 2022, it changed the internet. Today, we speak to some startups and experts to understand how ChatGPT is changing SEO R...

View article

Winning at search: why vigilance and strategy alignment are necessary evils

Data-Driven Marketing

Winning at search: why vigilance and strategy alignment are necessary evils

11m Prasanna Dhungel

Winning at search: why vigilance and strategy alig...

As brands and agencies struggle to prioritize visibility of ever-changing SERP features, here's how they can build effective, holistic search strategi...

View article

What role does page speed play for SEO?

SEO

What role does page speed play for SEO?

1y DebugBear

What role does page speed play for SEO?

Page speed has been a ranking factor for a long time, but it has increased in importance over the last two years. Learn about Google’s Core Web Vitals...

View article

iOS 14 uncovers measurement vulnerabilities for business

322023

iOS 14 uncovers measurement vulnerabilities for business

1y Jamie Bolton

iOS 14 uncovers measurement vulnerabilities for bu...

How will marketers handle the advertising industry upheaval in regard to data and measurement? Read More...

View article

Follow us

The Duplicate Content Debate

Subscribe to get your daily business insights

Read the next article

Explore Tech Talks

Whitepapers

Whitepapers

US Mobile Streaming Behavior

US Mobile Streaming Behavior

Winning the Data Game: Digital Analytics Tactics for Media Groups

Winning the Data Game: Digital Analytics Tactics f...

Learning to win the talent war: how digital marketing can develop its peopl...

Learning to win the talent war: how digital market...

Engagement To Empowerment - Winning in Today's Experience Economy

Engagement To Empowerment - Winning in Today's Exp...

Related Articles

Mastering voice search optimization: Talk like a local, rank like a pro

Mastering voice search optimization: Talk like a l...

How to Create Impactful SEO Reports that Drive Business Success

How to Create Impactful SEO Reports that Drive Bus...

How Google's Search Generative Experience (SGE) is Reshaping SEO

How Google's Search Generative Experience (SGE) is...

The secrets to getting the best SEO traffic without even ranking

The secrets to getting the best SEO traffic withou...

How SEO is changing because of ChatGPT

How SEO is changing because of ChatGPT

Winning at search: why vigilance and strategy alignment are necessary evils

Winning at search: why vigilance and strategy alig...

What role does page speed play for SEO?

What role does page speed play for SEO?

iOS 14 uncovers measurement vulnerabilities for business

iOS 14 uncovers measurement vulnerabilities for bu...