Hidden Gems in Google Webmaster Tools, Part 2

Google's Webmaster Tools offers informative reporting that many site owners overlook. Part two of a series.

Author

Date published December 12, 2007 Categories

The first half of this series discussed some helpful Google Webmaster Tools (GWT) reports, including an external link report and ways to find rankings your site may be narrowly missing. Now, additional reports that offer benefits to GWT users.

Robots.txt Verification and Error Checking

A robots.txt file isn’t necessary for a site that performs well organically, but engines are finding more ways a robots.txt file can benefit site owners. To access this report from the main GWT area, click the “Tools” link in the left navigation, then click “Analyze robots.txt” from the submenu.

If you have a robots.txt file, this report helps you determine whether specific URLs are excluded as they should be. In the box labeled “Test URLs against this robots.txt file,” enter an actual URL from your site and click the “Check” button. In the subsequent “URL Results” field, Google will tell you whether the URL is “blocked” or “allowed.” Running tests helps determine how and when to use such characters as wildcards and trailing slashes to most effectively block those URLs you don’t want indexed.

This report is also helpful if you use your robots.txt file to provide engines with the location of your XML sitemap. Remember, however, this page doesn’t validate the XML sitemap itself. It validates only the way you refer to the file. In other words, you can point to the sitemap in a valid way, but the sitemap itself may not validate. Compare this with asking someone for directions to a specific restaurant. The directions may be accurate, but the restaurant could be out of business. Similarly, the sitemap reference in the robots.txt file can be valid, but the sitemap itself might not be.

Fortunately, GWT can also tell you whether your XML sitemap is valid. Find this report in the main “Sitemaps” section. If your sitemap feed is valid, you’ll see “OK” in the “Sitemap Status” column of that report.

One important thing to remember about excluding files via the robots.txt file is that while rare, these pages can technically show up in results pages if they have significant external link popularity. On our company blog, we have a login link for staff members. A Google search for that page shows a link to it but not a valid title or description. Google partially crawled that link but couldn’t fully access it, as it’s password protected.

Overcoming Canonical Issues

Canonical issues on a site are architectural glitches that inadvertently create multiple versions of identical URLs. One example is a site that resolves with or without the “www” prefix. Another example is a page that resolves at both the folder level, such as “/products/,” and the page level, like “/products/index.aspx.”

It’s true engines are getting better at detecting and accounting for canonical issues. It’s also true that you can never provide engines with too much information about the proper way to crawl, index, and interpret a site. So if you’re unsure about whether such a setting is necessary, my advice is utilize it.

GWT has an area that lets you account for the “www” prefix issue. From the “Tools” menu, select “Set preferred domain.” On this page you’ll see three options:

Display URLs as www.site.com (for both www.site.com and site.com)
Display URLs as site.com (for both www.site.com and site.com)
Don’t set an association

Select the appropriate choice, and click the OK button. This takes a while to take effect, and it can take even longer to undo it if you change your mind down the road. So be sure about your needs before you make a choice.

Important points to remember about this feature:

This setting is only for the “www” prefix issue. Other subdomains, such as shop.site.com, require their own versions of robots.txt, Google verification files, and so on.
Experienced coders may already have canonical redirects set up for their sites, via either their .htaccess file (for Apache servers) or their IIS (define) control panel. If so, this GWT feature is redundant and likely unnecessary. Just make sure not to send conflicting instructions to Google about this. In other words, don’t tell engines to use the “www” version via your .htaccess file and tell Google to use the “non-www” version via GWT. That’s asking for trouble.
While this report enables you to determine which sorts of URLs appear in Google results pages, there’s no evidence that the report is a “true” fix for canonical problems. In other words, there’s no reason to believe that link popularity to your “non-www” pages will somehow magically transfer to their “www” counterparts simply by using the tool.

Conclusion

Every second you spend poking around GWT is time well spent. I’ve watched its evolution closely, and I find the GWT team to be very responsive to user requests and concerns and really focused on providing data that’s truly helpful.

Next: I’ll spend some time in Yahoo’s Site Explorer and discuss ways Yahoo is informing site owners about their sites.

Want more search information? ClickZ SEM Archives contain all our search columns, organized by topic.

Subscribe to get your daily business insights

More about:

Read the next article

Explore Tech Talks

Lucy

Lucy helps organizations leverage knowledge for in... View Tech Talk
TVSquared

TVSquared is the global leader in cross-platform T... View Tech Talk
Grata

Grata is a B2B search engine for discovering small... View Tech Talk

Whitepapers

US Mobile Streaming Behavior

Whitepaper | Mobile

US Mobile Streaming Behavior

Streaming has become a staple of US media-viewing habits. Streaming video, however, still comes with a variety of pesky frustrations that viewers are ...

View resource

Winning the Data Game: Digital Analytics Tactics for Media Groups

Whitepaper | Analyzing Customer Data

Winning the Data Game: Digital Analytics Tactics for Media Groups

Winning the Data Game: Digital Analytics Tactics f...

Data is the lifeblood of so many companies today. You need more of it, all of which at higher quality, and all the meanwhile being compliant with data...

View resource

Learning to win the talent war: how digital marketing can develop its people

Whitepaper | Digital Marketing

Learning to win the talent war: how digital marketing can develop its peopl...

Learning to win the talent war: how digital market...

This report documents the findings of a Fireside chat held by ClickZ in the first quarter of 2022. It provides expert insight on how companies can ret...

View resource

Engagement To Empowerment - Winning in Today's Experience Economy

Report | Digital Transformation

Engagement To Empowerment - Winning in Today's Experience Economy

Engagement To Empowerment - Winning in Today's Exp...

Customers decide fast, influenced by only 2.5 touchpoints – globally! Make sure your brand shines in those critical moments. Read More...

View resource

Mastering voice search optimization: Talk like a local, rank like a pro

Search Marketing

Mastering voice search optimization: Talk like a local, rank like a pro

1m Idris Nagri

Mastering voice search optimization: Talk like a l...

Forget typing, voice search is booming. Businesses need Voice Search Optimization (VSO) to rank for conversational queries and secure top spots in sea...

View article

How to Create Impactful SEO Reports that Drive Business Success

2m Idris Nagri

How to Create Impactful SEO Reports that Drive Bus...

Wielding graphs and analytics has its place. But to truly capture executive attention in today’s impatient digital arena, we must step into the shoes ...

View article

How Google's Search Generative Experience (SGE) is Reshaping SEO

2m Idris Nagri

How Google's Search Generative Experience (SGE) is...

As the search giant delves deeper into the realm of artificial intelligence (AI), it is clear that SGE will have a profound impact on the future of SE...

View article

The secrets to getting the best SEO traffic without even ranking

10m Daniel Tannenbaum

The secrets to getting the best SEO traffic withou...

Did you know that there are ways to get to the top of Google without ranking your own site? You can still get lots of good organic traffic using alter...

View article

How SEO is changing because of ChatGPT

10m Daniel Tannenbaum

How SEO is changing because of ChatGPT

When ChatGPT was introduced in 2022, it changed the internet. Today, we speak to some startups and experts to understand how ChatGPT is changing SEO R...

View article

Winning at search: why vigilance and strategy alignment are necessary evils

Data-Driven Marketing

Winning at search: why vigilance and strategy alignment are necessary evils

11m Prasanna Dhungel

Winning at search: why vigilance and strategy alig...

As brands and agencies struggle to prioritize visibility of ever-changing SERP features, here's how they can build effective, holistic search strategi...

View article

What role does page speed play for SEO?

SEO

What role does page speed play for SEO?

1y DebugBear

What role does page speed play for SEO?

Page speed has been a ranking factor for a long time, but it has increased in importance over the last two years. Learn about Google’s Core Web Vitals...

View article

iOS 14 uncovers measurement vulnerabilities for business

322023

iOS 14 uncovers measurement vulnerabilities for business

1y Idris Nagri

iOS 14 uncovers measurement vulnerabilities for bu...

How will marketers handle the advertising industry upheaval in regard to data and measurement? Read More...

View article

Follow us

Hidden Gems in Google Webmaster Tools, Part 2

Subscribe to get your daily business insights

Read the next article

Explore Tech Talks

Whitepapers

Whitepapers

US Mobile Streaming Behavior

US Mobile Streaming Behavior

Winning the Data Game: Digital Analytics Tactics for Media Groups

Winning the Data Game: Digital Analytics Tactics f...

Learning to win the talent war: how digital marketing can develop its peopl...

Learning to win the talent war: how digital market...

Engagement To Empowerment - Winning in Today's Experience Economy

Engagement To Empowerment - Winning in Today's Exp...

Related Articles

Mastering voice search optimization: Talk like a local, rank like a pro

Mastering voice search optimization: Talk like a l...

How to Create Impactful SEO Reports that Drive Business Success

How to Create Impactful SEO Reports that Drive Bus...

How Google's Search Generative Experience (SGE) is Reshaping SEO

How Google's Search Generative Experience (SGE) is...

The secrets to getting the best SEO traffic without even ranking

The secrets to getting the best SEO traffic withou...

How SEO is changing because of ChatGPT

How SEO is changing because of ChatGPT

Winning at search: why vigilance and strategy alignment are necessary evils

Winning at search: why vigilance and strategy alig...

What role does page speed play for SEO?

What role does page speed play for SEO?

iOS 14 uncovers measurement vulnerabilities for business

iOS 14 uncovers measurement vulnerabilities for bu...