Diagnosing Search Issues from the Query Box, Part 2

Additional ways to use query operators to gauge your site's presence.

Author

Erik Dafforn

Date published August 6, 2008 Categories

In my last column, I discussed ways to use the site: and inurl: operators to detect indexing issues with your site. In this column, I discuss additional operators (such as cache:) and the ways in which they help you diagnose search engine issues and view your site the way engines do.

Additional Uses for the Site Operator

While sites such as CopyScape do a nice job of detecting duplicate content around the Web, I sometimes like the flexibility of finding duplicate content myself. My last column showed ways of detecting unwitting duplicate content on your own site (due to canonicalization issues). But what about your content being used on other sites?

To detect this, I recommend using the site: operator to filter out your site. Scan your site to find a string of text that should appear only on your site, then plug it into a query like this:

“this is the unique string of text I found on my site” -site:yourdomain.com

The quotation marks are required to search for the exact text string. The minus sign before the site: operator tells Google to exclude your domain from results. Consequently, the only results on the SERP (define) should be third-party sites using your copy.

Keep in mind that you need to be cautious before shouting “plagiarism” or “copyright infringement.” These sites may be quoting yours in a fair-use context, or they might be directory sites that have pulled a description of your site prior to linking back to you. The best text strings to search for are longer, more obscure passages that really should be on your site only.

Searching for Specific URLs

Several years ago, you could simply enter a URL in a Google search box, and the resulting page would give you a short but helpful list of information about that particular URL, including links to related sites, the cached version of the page, links pointing to that page (although this feature is notoriously shallow in its coverage), pages that mention the specific URL text, and so on.

This sort of query was particularly helpful not so much for the links to additional information, but to quickly determine whether a specific engine had indexed a page. In short, a resulting page that said “Sorry, no information is available for the URL [URL]” was a quick way to spot an indexing problem, because that response was reserved for URLs that had either not yet indexed the page, or for pages that purposely avoided indexing (such as via the robots.txt exclusion or a robots “noarchive” meta tag).

Today, searching for a simple URL still works at MSN/Live and Yahoo. A couple years ago, however, Google changed its usage for URL queries. At Google, you must now precede a URL with the text info: to get indexing and informative link information. Make sure that you leave no space between colon and URL when performing this query.

In my opinion, this latter feature is of limited value, although it can represent a link-building opportunity, sometimes turning up less savvy sites that mention your URL as text but not as a link.

The Difference Between Cache and Text Cache

The cache: operator is a terrific tool that helps you determine whether engines see your page. Ironically, it’s not an entirely accurate way to show you exactly what engines see. I can’t emphasize this enough, so I’ll rephrase: The cached version of your page is not necessarily the exact same version of the page that engines see, monitor, and consider in their algorithms.

To see the version of your page that engines see, you must take a technological step backwards and view the text cache of the page. The text cache strips away deceptive script code, rich media, and graphics, leaving only the skeletal remains of your page, the text and links.

Consider, for example, the cache version of www.usanetwork.com. You can see some rich media and graphics and a few links, but the main body section is empty.

Contrast that view with the text cache of the same page.

While the regular cached version of a page “includes” content such as rich media and JavaScript-spawned Flash files, don’t assume Google notices or considers such content. In most cases, it’s included only because Google has pulled the script and Flash code into its index — not because it understands or weighs it.

To find the text cache version of a page at Google, you can add &strip=1 to the end of a cached URL, such as in the following:

http://64.233.167.104/search?q=cache:www.usanetwork.com&pws=0
http://64.233.167.104/search?q=cache:www.usanetwork.com&pws=0&strip=1

You can also find a link to the text cache at the top of any cached page in Google. Look for the copy “Text-only version” at the top-right of a cached page, such as this cached version of the ClickZ home page.

Conclusion

Cached pages are available at all major engines, although only Google allows use of the actual cache: operator. For Yahoo and MSN Live, you can search for a URL then find a link to the cached version on the resulting page. Also, Google is the only one of the big three that differentiates and shows an actual text cache.

Subscribe to get your daily business insights

More about:

Read the next article

Explore Tech Talks

Lucy

Lucy helps organizations leverage knowledge for in... View Tech Talk
TVSquared

TVSquared is the global leader in cross-platform T... View Tech Talk
Grata

Grata is a B2B search engine for discovering small... View Tech Talk

Whitepapers

US Mobile Streaming Behavior

Whitepaper | Mobile

US Mobile Streaming Behavior

Streaming has become a staple of US media-viewing habits. Streaming video, however, still comes with a variety of pesky frustrations that viewers are ...

View resource

Winning the Data Game: Digital Analytics Tactics for Media Groups

Whitepaper | Analyzing Customer Data

Winning the Data Game: Digital Analytics Tactics for Media Groups

Winning the Data Game: Digital Analytics Tactics f...

Data is the lifeblood of so many companies today. You need more of it, all of which at higher quality, and all the meanwhile being compliant with data...

View resource

Learning to win the talent war: how digital marketing can develop its people

Whitepaper | Digital Marketing

Learning to win the talent war: how digital marketing can develop its peopl...

Learning to win the talent war: how digital market...

This report documents the findings of a Fireside chat held by ClickZ in the first quarter of 2022. It provides expert insight on how companies can ret...

View resource

Engagement To Empowerment - Winning in Today's Experience Economy

Report | Digital Transformation

Engagement To Empowerment - Winning in Today's Experience Economy

Engagement To Empowerment - Winning in Today's Exp...

Customers decide fast, influenced by only 2.5 touchpoints – globally! Make sure your brand shines in those critical moments. Read More...

View resource

Mastering voice search optimization: Talk like a local, rank like a pro

Search Marketing

Mastering voice search optimization: Talk like a local, rank like a pro

1m ClickZ News Staff

Mastering voice search optimization: Talk like a l...

Forget typing, voice search is booming. Businesses need Voice Search Optimization (VSO) to rank for conversational queries and secure top spots in sea...

View article

How to Create Impactful SEO Reports that Drive Business Success

2m ClickZ News Staff

How to Create Impactful SEO Reports that Drive Bus...

Wielding graphs and analytics has its place. But to truly capture executive attention in today’s impatient digital arena, we must step into the shoes ...

View article

How Google's Search Generative Experience (SGE) is Reshaping SEO

2m ClickZ News Staff

How Google's Search Generative Experience (SGE) is...

As the search giant delves deeper into the realm of artificial intelligence (AI), it is clear that SGE will have a profound impact on the future of SE...

View article

The secrets to getting the best SEO traffic without even ranking

11m Daniel Tannenbaum

The secrets to getting the best SEO traffic withou...

Did you know that there are ways to get to the top of Google without ranking your own site? You can still get lots of good organic traffic using alter...

View article

How SEO is changing because of ChatGPT

11m Daniel Tannenbaum

How SEO is changing because of ChatGPT

When ChatGPT was introduced in 2022, it changed the internet. Today, we speak to some startups and experts to understand how ChatGPT is changing SEO R...

View article

Winning at search: why vigilance and strategy alignment are necessary evils

Data-Driven Marketing

Winning at search: why vigilance and strategy alignment are necessary evils

11m Prasanna Dhungel

Winning at search: why vigilance and strategy alig...

As brands and agencies struggle to prioritize visibility of ever-changing SERP features, here's how they can build effective, holistic search strategi...

View article

What role does page speed play for SEO?

SEO

What role does page speed play for SEO?

1y DebugBear

What role does page speed play for SEO?

Page speed has been a ranking factor for a long time, but it has increased in importance over the last two years. Learn about Google’s Core Web Vitals...

View article

iOS 14 uncovers measurement vulnerabilities for business

322023

iOS 14 uncovers measurement vulnerabilities for business

1y Jamie Bolton

iOS 14 uncovers measurement vulnerabilities for bu...

How will marketers handle the advertising industry upheaval in regard to data and measurement? Read More...

View article

Follow us

Diagnosing Search Issues from the Query Box, Part 2

Subscribe to get your daily business insights

Read the next article

Explore Tech Talks

Whitepapers

Whitepapers

US Mobile Streaming Behavior

US Mobile Streaming Behavior

Winning the Data Game: Digital Analytics Tactics for Media Groups

Winning the Data Game: Digital Analytics Tactics f...

Learning to win the talent war: how digital marketing can develop its peopl...

Learning to win the talent war: how digital market...

Engagement To Empowerment - Winning in Today's Experience Economy

Engagement To Empowerment - Winning in Today's Exp...

Related Articles

Mastering voice search optimization: Talk like a local, rank like a pro

Mastering voice search optimization: Talk like a l...

How to Create Impactful SEO Reports that Drive Business Success

How to Create Impactful SEO Reports that Drive Bus...

How Google's Search Generative Experience (SGE) is Reshaping SEO

How Google's Search Generative Experience (SGE) is...

The secrets to getting the best SEO traffic without even ranking

The secrets to getting the best SEO traffic withou...

How SEO is changing because of ChatGPT

How SEO is changing because of ChatGPT

Winning at search: why vigilance and strategy alignment are necessary evils

Winning at search: why vigilance and strategy alig...

What role does page speed play for SEO?

What role does page speed play for SEO?

iOS 14 uncovers measurement vulnerabilities for business

iOS 14 uncovers measurement vulnerabilities for bu...