Search engines don't find that much. Not only do most of the popular search engines index less than 10% of the web, but the engines are biased because they are most likely to index popular U.S. commercial sites.
Two researchers from the NEC Research Institute in Princeton, NJ, Steve Lawrence and C. Lee Giles, have published a paper in Nature Magazine that tells everyone something you already know. That is, search engines don't find that much.
Not only do most of the popular search engines index less than 10 percent of the web (Yahoo, the index created by human hands, indexes just 7.4 percent), but they say the engines are biased because they are most likely to index popular U.S. commercial sites. (As my teenage daughter might say, "Like, DUUH!")
Given the huge growth of the web and the concerted efforts of "Spamdexers" to make sure client pages are not only indexed but that they're found first, this is as surprising as news that the sun comes up in the east. But the wire services are spinning this one as another web horror storyMost of it (presumably the good stuff, most likely your stuff) can't be found.
I don't use Spamdexers (although I did once review "WebPosition Gold," a Spamdex software package), so I decided to conduct a little experiment. I searched for my own newsletter, a-clue.com, on all the major search engines (and a few minor ones).
I wasn't interested in seeing that I was indexed based on a relevant keyword like "e-commerce newsletters" or "great web stuff." I avoided the keywords I used in the WebPosition review last year. Instead I just searched for the main part of the URL, a-clue.com. Here's what I found:
Despite 30 months of weekly publication, and my 1997 submission of the site, Yahoo still hasn't found a-clue.com. But my fans should not despair. The Inktomi search engine, which finds web pages using a computer, did have it. It was even the number three listing. This makes me wonder why Yahoo hasn't used Inktomi to improve its indexing capabilities, but their profits beat street estimates this week. The stock is up. So who cares?
Next, Excite. I like Excite. They not only found a-clue.com, but the home page was the number one hit. And I've never submitted a-clue.com to Excite.
Some engines couldn't find a-clue.com at all; not on the first page of listings anyway. By my reckoning, Alta Vista, Dogpile, Direct Hit and LookSmart all failed this test. But overall results were pretty good. I'm number one with Lycos and Northern Light, third with Google, and fifth with Thunderstone.
Perhaps the most surprising finding was that a page of feedback from Chris Tyler (previously online November 30 of last year) is apparently easier for search engines to find than my main page. Hotbot has got that as its number three hit, as did GoTo.Com, and MSN found that page at number two. A feedback item from September 14 was found first by Infoseek, listing my home page as sixth.
What did I learn from all this? If you're in e-commerce, it doesn't really matter whether search engines have indexed the whole web, so long as they can find your site. Check your own site in this way and let us know what you find...you may be surprised.
On the heels of a fantastic event in New York City, ClickZ Live is taking the fun and learning to Toronto, June 23-25. With over 15 years' experience delivering industry-leading events, ClickZ Live offers an action-packed, educationally-focused agenda covering all aspects of digital marketing. Register today!
Dana Blankenhorn has been a business reporter for more than 20 years. He has written parts of five books and currently contributes to Advertising Age, Business Marketing, NetMarketing, the Chicago Tribune, Boardwatch, CLEC Magazine, and other publications. His own newsletter, A-Clue.Com, is published weekly.
Hong Kong, May 5-6, 2015
Gartner Magic Quadrant for Digital Commerce
This Magic Quadrant examines leading digital commerce platforms that enable organizations to build digital commerce sites. These commerce platforms facilitate purchasing transactions over the Web, and support the creation and continuing development of an online relationship with a consumer.
Paid Search in the Mobile Era
Google reports that paid search ads are currently driving 40+ million calls per month. Cost per click is increasing, paid search budgets are growing, and mobile continues to dominate. It's time to revamp old search strategies, reimagine stale best practices, and add new layers data to your analytics.
May 6, 2015
12:00pm ET/9:00am PT