How Social Media Sites Should Use Internal SEO, Part 2

  |  June 10, 2009   |  Comments

More ways that social media sites could increase natural search traffic with a few SEO tweaks.

"How Social Media Sites Should Use Internal SEO, Part 1" discussed how some social media sites could enhance their own organic performance by implementing some simple architecture changes.

Continuing that theme, let's look at Facebook and Twitter (again), then change course slightly to talk about the Wolfram Alpha engine.

Facebook and Vanity URLs

Historically, public Facebook URLs have been long, cluttered, and not particularly consumer-friendly, usually ending with a long, random number string. With last week's announcement that non-numeric vanity pages will soon be available, hopefully this will end soon, with shorter URLs quickly overtaking their aging ancestors.

When and if this happens, hopefully Facebook will do what Google Profiles has already done (and what LinkedIn should do): Work to unify and consolidate those profile pages so that while any URL can be requested, only one will be considered the "authority." This will allow the older, longer URLs to live forever while allowing people to begin promoting their newer, shorter, more concise URLs.

This can be done in several different ways, including using a 301 redirect (like Google Profiles), excluding old URLs via the robots.txt file, or adding the canonical link element to the pages' HTML. And the addition of authoritative URLs to the site's XML site map will supplement any or all of these techniques.

In addition to its potential vanity URL situation, Facebook has a few fairly insignificant issues in its current URL construction, such as duplicates created due to the addition of ref= and viewas= parameters. These params and others like them make it difficult for engines to differentiate between them and cause Google to show you an abbreviated list of pages unless you click the "show all results" link, as seen in this example from Starbucks Coffee here.

While not currently a significant issue, if you add a vanity URL to the mix, the number of pages for each person or company is likely to double if consolidation steps aren't taken.

Another way that Facebook could help itself is to ensure that within a person's or company's profile, the pages are titled and described accurately. Currently, using the previous Starbucks example, all pages have identical, (branded but non-specific) titles and meta descriptions across each of Starbucks' pages, regardless of whether the page contained an overview, RSS story, photos, and so on.

(Note: Just before publishing time, Facebook clarified its rollout of vanity URLs. I'll follow up on the discussion in a future column.)

Back to Twitter

Twitter is duplicating much of its content in a couple of ways, including multiple capitalization styles and the significant amounts of its mobile site that has been indexed.

Add to that the split between http: and https: traffic. Pages resolve at both versions of the URL. Only a small fraction of URLs are indexed as https:, but 1.4 million is a big number regardless of its relative percentage.

Wolfram Alpha and Load Balancing

Wolfram Alpha isn't exactly a social media site. By its own admission it's a "computational knowledge engine." But until that niche gets a bit broader, most people will continue to call it a search engine.

With the apparent desire to offer users a fast experience, the Wolfram Alpha site uses a fairly common method of distributing the computational burden across multiple servers. Instead of seeing results on a www server subdomain, for example, you'll often see a subdomain such as "www37", which appends one or two digits to the traditional "www".

This practice results in engines crawling vast amounts of Wolfram Alpha pages that aren't necessarily on the www subdomain. The site's "Food and Nutrition" examples page, for example, exists in at least four different locations, and choosing a non-www server at random (I used www93), Google has indexed more than 50 of the pages.

Traditional redirects don't necessarily work in a situation like this because they go against the purpose of the architecture, which is to spread the traffic across multiple servers instead of consolidating it all onto www. If you find your site in this situation, however, you still have several options.

First, the site could detect engines beforehand and serve them pages only on the www subdomain. (This is really no different from the time-honored wisdom of detecting engines and stripping away session IDs.)

Second, and possibly easiest, would be to apply the canonical link element so that engines know that the www subdomain is the one that should appear in search results.

At least as of this writing, most engines, including Bing and Yahoo are still indexing Wolfram Alpha's /input/ pages. (These pages are Wolfram Alpha's equivalent of search results.) This isn't Wolfram Alpha's fault, because although at launch its robots.txt file was nearly empty, it's recently added /input/ to its disallow list. Eventually, engines will stop showing those results.

Conclusion

Due to deadline pressure and the constant need to deliver interface and features improvements, developers put basic SEO (define) items like these on the back burner. After all, the site "works" without them, right? However, they should consider them quick investments in long-tail traffic acquisition.

ClickZ Live San Francisco This Year's Premier Digital Marketing Event is #CZLSF
ClickZ Live San Francisco (Aug 11-14) brings together the industry's leading practitioners and marketing strategists to deliver 4 days of educational sessions and training workshops. From Data-Driven Marketing to Social, Mobile, Display, Search and Email, this year's comprehensive agenda will help you maximize your marketing efforts and ROI. Register today!

ABOUT THE AUTHOR

Erik Dafforn

Erik Dafforn is the executive vice president of Intrapromote LLC, an SEO firm headquartered in Cleveland, Ohio. Erik manages SEO campaigns for clients ranging from tiny to enormous and edits Intrapromote's blog, SEO Speedwagon. Prior to joining Intrapromote in 1999, Erik worked as a freelance writer and editor. He also worked in-house as a development editor for Macmillan and IDG Books. Erik has a Bachelor's degree in English from Wabash College. Follow Erik and Intrapromote on Twitter.

COMMENTSCommenting policy

comments powered by Disqus

Get the ClickZ Search newsletter delivered to you. Subscribe today!

COMMENTS

UPCOMING EVENTS

Featured White Papers

BigDoor: The Marketers Guide to Customer Loyalty

The Marketer's Guide to Customer Loyalty
Customer loyalty is imperative to success, but fostering and maintaining loyalty takes a lot of work. This guide is here to help marketers build, execute, and maintain a successful loyalty initiative.

Marin Software: The Multiplier Effect of Integrating Search & Social Advertising

The Multiplier Effect of Integrating Search & Social Advertising
Latest research reveals 68% higher revenue per conversion for marketers who integrate their search & social advertising. In addition to the research results, this whitepaper also outlines 5 strategies and 15 tactics you can use to better integrate your search and social campaigns.

WEBINARS

Jobs

    • Customer Service Representative
      Customer Service Representative (Common Sense Publishing) - Delray BeachDo you thrive on chaos? Love to multi-task? Enjoy a fast paced work environment...
    • Business Intelligence and Reporting Analyst
      Business Intelligence and Reporting Analyst (Stansberry and Associates) - BaltimoreStansberry and Associates is seeking a driven individual to fill...
    • Internet Marketing Campaign Manager
      Internet Marketing Campaign Manager (Straight North, LLC) - Fort MillWe are looking for a talented Internet Marketing Campaign Manager to join the...