What Every Marketer Needs to Know About Hadoop

With "big data" on everybody's lips, here's all you need to know to keep up your end of the conversation.

Author

Date published August 30, 2012 Categories

“Big data.” There’s no escaping it.

It’s catchy. It’s generic enough that everybody is using it for everything. It’s a one-size-fits-all phrase.

It’s so all-encompassing that the best definition I’ve seen recently is from Stephané Hamel who put it this way:

bigdatatweet

So with “big data” on everybody’s lips, here’s all you (the marketing executive) need to know to keep up your end of the conversation.

A. Disk drives got cheaper so we can store more data. The ways and means of collecting all sorts of data have proliferated faster than Twitter traffic or TSA lines at the airport. We have more of data, more types of data, and it’s coming at us faster (real time) than ever dreamed possible. That’s what makes up “volume, variety, velocity.”

So, the ability to replace big, honkin’ disk drives with many smaller, cheaper drives that we can wire together is the first, significant technical advance.

B. We can split up the processing. The second advance is the ability to augment the big, honkin’ processors with many smaller, cheaper servers. We have distributed the processing to the data instead of waiting for the data to rocket back and forth from disk farm to processor.

connectedcloud

So What?

So, there are two things to keep in mind when your marketing budget is being allocated to what seems like pure IT projects.

The more data you throw into the pot, the more likely you are of finding some sort of relationship (correlation) to act on. More on that can be found in a July column I called “Consilience – The Intrinsic Value of Big Data.”
This practice of splitting up the data, solving smaller problems, and bringing it back together (MapReduce) is very useful for some specific types of processing. Getting this under your belt gives you voting rights when discussing options.

Big, honkin’ analytics processors are very good at finding hidden pieces in a hurry. (Show me all the customers who have bought in the past three months after clicking on these special offers and abandoning their shopping carts.)

But those types of questions are known unknowns. You know the things you’re going to ask and the entire database is set up that way. You know you’ll want to see things by date, by region, by product line, etc. That is what gives these enterprise data warehouses their power: they are designed in advance to answer the questions you know you might ask, and they can answer them very quickly so you can refine your questions – as long as you have deep knowledge about what data you have and how it is structured in the database.

hal-seye

But the other data – the messy data – is chock-full of unknown unknowns. We know the information might be valuable, but we don’t know what to ask.

MapReduce is great as a low-cost storage medium for unstructured data and for refining that data into a more structured form for heavy analysis. Social media data, call center transcripts, clickstream data, website content, and sensor data all start out unstructured.

MapReduce is ideal for pre-processing text, turning all those tweets into numerical models of opinion (sentiment analysis), which can then be fed to the big, honkin’ analytics machines for correlation discovery and problem solving. It’s great for asking slower questions of larger amounts of data. It’s great for finding a representative sample of data so the big, honkin’ processors don’t have to juggle all of the bits at once.

So the next time somebody throws “Hadoop” into the conversation, you’ll know more than the fact that it was named after Doug Cutting’s son’s stuffed elephant.

Connected Cloud and Hal’s Eye images via Shutterstock.

Subscribe to get your daily business insights

More about:

Read the next article

Explore Tech Talks

Lucy

Lucy helps organizations leverage knowledge for in... View Tech Talk
TVSquared

TVSquared is the global leader in cross-platform T... View Tech Talk
Grata

Grata is a B2B search engine for discovering small... View Tech Talk

Whitepapers

US Mobile Streaming Behavior

Whitepaper | Mobile

US Mobile Streaming Behavior

Streaming has become a staple of US media-viewing habits. Streaming video, however, still comes with a variety of pesky frustrations that viewers are ...

View resource

Winning the Data Game: Digital Analytics Tactics for Media Groups

Whitepaper | Analyzing Customer Data

Winning the Data Game: Digital Analytics Tactics for Media Groups

Winning the Data Game: Digital Analytics Tactics f...

Data is the lifeblood of so many companies today. You need more of it, all of which at higher quality, and all the meanwhile being compliant with data...

View resource

Learning to win the talent war: how digital marketing can develop its people

Whitepaper | Digital Marketing

Learning to win the talent war: how digital marketing can develop its peopl...

Learning to win the talent war: how digital market...

This report documents the findings of a Fireside chat held by ClickZ in the first quarter of 2022. It provides expert insight on how companies can ret...

View resource

Engagement To Empowerment - Winning in Today's Experience Economy

Report | Digital Transformation

Engagement To Empowerment - Winning in Today's Experience Economy

Engagement To Empowerment - Winning in Today's Exp...

Customers decide fast, influenced by only 2.5 touchpoints – globally! Make sure your brand shines in those critical moments. Read More...

View resource

How well do you really know your customers?

Analyzing Customer Data

How well do you really know your customers?

3y Nick Ashmore

How well do you really know your customers?

Marketers have dozens of optimization tools at their disposal. But they’ve forgotten about the most sophisticated one. Read More...

View article

Q&A: ReachMobi’s CEO Matt Hoggatt on turning website visitors to subscribers

Acquisition

Q&A: ReachMobi’s CEO Matt Hoggatt on turning website visitors to subscr...

7y Leonie Mercedes

Q&A: ReachMobi’s CEO Matt Hoggatt on turning w...

According to Matt Hoggatt, CEO of mobile audience network ReachMobi, there are rich opportunities in the realm of mobile web, if only mobile companies...

View article

Five ways to boost your conversion rate without wasting ad budget

Acquisition

Five ways to boost your conversion rate without wasting ad budget

8y Tim Nichols

Five ways to boost your conversion rate without wa...

Improve the conversion rate of your ads and maximize profits without draining your entire ad budget. Read More...

View article

Three digital dilemmas that are really opportunities

Actionable Analysis

Three digital dilemmas that are really opportunities

8y Catherine Magoffin

Three digital dilemmas that are really opportuniti...

Marketers can begin to work on solutions to achieve new levels of consumer value, satisfaction, and engagement by addressing these three digital dilem...

View article

Three ways to grow your total customer community

Analyzing Customer Data

Three ways to grow your total customer community

8y Dave Evans

Three ways to grow your total customer community

Experts predict that there will be a greater emphasis on social business this year, thus making it more imperative for marketers to cultivate strong c...

View article

Reviving sluggish sales with email personalization

Acquisition

Reviving sluggish sales with email personalization

8y Guest Writer

Reviving sluggish sales with email personalization

By using personalization in the emails that are sent, retail marketers can grow their consumer base and ultimately increase ecommerce revenue. Read Mo...

View article

An absolute beginner's guide to setting up Google Analytics for your website

Analytics

An absolute beginner's guide to setting up Google Analytics for your websit...

8y Yuyu Chen

An absolute beginner's guide to setting up Google ...

Our beginner’s guide to Google Analytics teaches you how to set up an account that is linked to your site and recommends a few basic metrics to look a...

View article

How can publishers use analytics data to save themselves?

Analytics

How can publishers use analytics data to save themselves?

8y Andrew Edwards

How can publishers use analytics data to save them...

How can publishers and advertising networks best utilize analytics data to prevent the extinction of digital magazines and newspapers? Read More...

View article

Follow us

What Every Marketer Needs to Know About Hadoop

Subscribe to get your daily business insights

Read the next article

Explore Tech Talks

Whitepapers

Whitepapers

US Mobile Streaming Behavior

US Mobile Streaming Behavior

Winning the Data Game: Digital Analytics Tactics for Media Groups

Winning the Data Game: Digital Analytics Tactics f...

Learning to win the talent war: how digital marketing can develop its peopl...

Learning to win the talent war: how digital market...

Engagement To Empowerment - Winning in Today's Experience Economy

Engagement To Empowerment - Winning in Today's Exp...

Related Articles

How well do you really know your customers?

How well do you really know your customers?

Q&A: ReachMobi’s CEO Matt Hoggatt on turning website visitors to subscr...

Q&A: ReachMobi’s CEO Matt Hoggatt on turning w...

Five ways to boost your conversion rate without wasting ad budget

Five ways to boost your conversion rate without wa...

Three digital dilemmas that are really opportunities

Three digital dilemmas that are really opportuniti...

Three ways to grow your total customer community

Three ways to grow your total customer community

Reviving sluggish sales with email personalization

Reviving sluggish sales with email personalization

An absolute beginner's guide to setting up Google Analytics for your websit...

An absolute beginner's guide to setting up Google ...

How can publishers use analytics data to save themselves?

How can publishers use analytics data to save them...