big-data

MapReduce and Marketing: Are Small Bits of Big Data Meaningful?

  |  April 1, 2013   |  Comments

Start tapping technology tools that provide the ability to identify the key bits of data that are really important to your decisioning or messaging, and use those to create an ominichannel view of the customer.

There are many truisms around managing big data that I do think are actually true, like the one about how just because a technology solution can be used for some purpose doesn't mean that it is the best option for that purpose. I feel the same way about that truism in business that says the only way to solve big problems is to break it down into small problems and solve them in turn.

However, I'm not convinced that it's a good idea to follow advice that I often hear from several quarters, that you should "break down big data into small data" so that you can manage and understand it. Perhaps it's semantics, but the point is to keep big data big - that is where the power and opportunity lay.

I recommend tapping technology tools that provide the ability to identify the key bits of data that are really important to your decisioning or messaging, and use those in context to create an ominichannel view of the customer. This is only possible with effective management of the big data. Big data is not simply the sum of the small data parts. It's a view of the customer profile, intent, and behavior that is only possible because marketers have access to and can utilize all the data to improve the offer timing and content.

At the same time, there is a lot of big data that is useless to marketers - and it often gets captured and stored anyway. A better solution is to skim just what you need out of a big data set - while keeping the context intact. This is not new, and is increasingly available to marketers through their data warehouse, data management, or campaign management solution(s). MapReduce is a tool that helps marketers handle the unstructured and semi-structured resources that are not easy to analyze with traditional tools. Mapreduce.org defines "MapReduce" as a programming framework that "supports distributed computing on large data sets on clusters of computers" - essentially to simplify data processing across massive data sets. We hear lots of talk about Hadoop too, which is an open source version of MapReduce supplied by the Apache organization and the best known implementation of the MapReduce framework.

Unstructured or semi-structured data are things like web session logs, clickstream data, web analytics and optimization streams, social data, and other types that do not fit the "rows and columns" structure that is easy to analyze with relational database tools.

MapReduce can help sort through the masses of data and pull out the important parts. Many large data streams like web logs have a lot of data in them that has no long-term value. It doesn't make sense to spend a lot of time and processing power to upload data to a persistent location (the database) when you only need it for a short time. This is true for things like sentiment analysis or when publishing an event-based word cloud - when the event is over, the data is no longer needed, but the cloud itself is worth keeping.

Another great example of useless data getting in the way is an automated browse messaging scheme. What you really want is to comb through the entire web log, and find all customers who browsed but didn't buy. All the other data - the length of session, the other products viewed, the ads that were viewed, etc. - you don't need in order to trigger an email follow-up with the right product and offer based on the non-purchased item.

MapReduce is not a database. It has no querying power and no knowledge of what other data sets exist. It runs processes in parallel and is especially adept at pulling out small sets of data from the big data set and understanding them so they can be used as part of a larger picture. Lots of such jobs can be run at the same time and without any connection to each other - until the results get into the main database. Please note that it usually requires a specific expert to implement and optimize - many great database teams do not have this experience (yet).

Big data is just the latest generation of intimidating data sets - and tools like MapReduce can help tame big data by preprocessing it and passing important pieces on for further analysis. It lets you see and utilize small data inside the big data context. I think that is an important distinction - and opportunity.

Please comment below and let me know how your company is using various big data tools to help you manage big data insights.

Big Data image on home page via Shutterstock.

Tags:

ClickZ Live San Francisco This Year's Premier Digital Marketing Event is #CZLSF
ClickZ Live San Francisco (Aug 11-14) brings together the industry's leading practitioners and marketing strategists to deliver 4 days of educational sessions and training workshops. From Data-Driven Marketing to Social, Mobile, Display, Search and Email, this year's comprehensive agenda will help you maximize your marketing efforts and ROI. Register today!

ABOUT THE AUTHOR

Stephanie Miller

Stephanie Miller is a relentless customer advocate and a champion for marketers creating memorable online experiences. A digital marketing expert, she helps responsible data-driven marketers connect with the people, resources, and ideas they need to optimize response and revenue. She speaks and writes regularly and leads many industry initiatives as VP, Member Relations and Chief Listening Officer at the Direct Marketing Association (www.the-dma.org). Feedback and column ideas most welcome, to smiller AT the-dma DOT org or @stephanieSAM.

COMMENTSCommenting policy

comments powered by Disqus

Get the ClickZ Marketing newsletter delivered to you. Subscribe today!

COMMENTS

UPCOMING EVENTS

Featured White Papers

BigDoor: The Marketers Guide to Customer Loyalty

The Marketer's Guide to Customer Loyalty
Customer loyalty is imperative to success, but fostering and maintaining loyalty takes a lot of work. This guide is here to help marketers build, execute, and maintain a successful loyalty initiative.

Marin Software: The Multiplier Effect of Integrating Search & Social Advertising

The Multiplier Effect of Integrating Search & Social Advertising
Latest research reveals 68% higher revenue per conversion for marketers who integrate their search & social advertising. In addition to the research results, this whitepaper also outlines 5 strategies and 15 tactics you can use to better integrate your search and social campaigns.

WEBINARS

    Information currently unavailable

Jobs

    • Internet Marketing Campaign Manager
      Internet Marketing Campaign Manager (Straight North, LLC) - Fort MillWe are looking for a talented Internet Marketing Campaign Manager to join the...
    • Online Marketing Coordinator
      Online Marketing Coordinator (NewMarket Health) - BaltimoreWant to learn marketing from the best minds in the business? NewMarket Health, a subsidiary...
    • Call Center Manager
      Call Center Manager (Common Sense Publishing) - Delray BeachWanted: Dynamic Call Center Manager with a Proven Track Record of Improving Response...