Customer Data Munging and Reconciliation for Correlation

What custom clothiers and tailors have in common with analytics professionals.

Author

Jim Sterne

Date published August 18, 2011 Categories

You want custom made clothing? Step right up and we’ll measure you. We’ll find out how tall, wide, and thick you are in any number of places. Well, 33 places to put a specific number on it.

At least, that’s how many are used by MGL Industries and burlesque costumer Glitz by Linda Joyce and that’s not even counting ear height, glove length, or pastie size.

Now imagine that your custom clothier were to measure your neck with calipers, your arms with a yard stick, your wrist with a micrometer, your hat size with a protractor, your inseam with a tape measure, your arms with a laser range finder, and your waist with a Smart Finger.

Not only would the process be time consuming, the results would be a mishmash of not quite relatable numbers.

I’ve been pondering the dilemma of differing customer data for a while. I remain hopeful but not immediately confident. Much the same as I feel about medicine, law, and government. Will we ever be able to put all our digital data eggs into one customer warehouse basket and come out with a reliable omelet?

Data management mechanics have long been mapped out: capture, cleanse, store, extract, etc. But it’s the transformation of all that customer behavioral data that comes just before loading it all into the master warehouse that has me concerned. Customer data come in all shapes, sizes, weights, density, and value.

It is a given that any two advertising servers will record their performance in slightly different ways, that any two web analytics tools on the same site will generate different numbers, and that any two customer satisfaction indexes will differ. This is merely the problem of the man with two watches who does not know what time it really is.

This issue is put to bed by giving up hope for standard, industrial strength metrics, acknowledging that every yardstick is slightly dissimilar. Organizations succeed when they settle for internal consistency over galactic exactitude.

Data cleansing is not as problematic. It draws on the services of a data dictionary. In system A, men and women are identified as either M or W, in system B, as either M or F, and in system C, as either 1 or 2. A quick cross-reference puts all things to right as long as “Decline to state,” “A little of each,” and “Not sure yet” are accounted for.

Merging or joining all of these data so they make sense requires a thorough understanding of how each is calibrated. In one case, a week’s worth of data represents data collected between Monday morning and Sunday night. In another, it’s Sunday morning to Saturday night. In a third, it’s simply the monthly total divided by 4, 4.25, or 4.33333. Messy, but manageable.

The real tricky bit comes when trying to attribute said data to individual individuals. For that, a common key is needed. If we all have one and only one telephone number, email address, customer ID number, or ship-to address, then all the information about one person could be correlated to all of the other information about that one person. Multiply that multi-headed hydra with the number of cookies we have on the number of devices we use and the problem becomes nail-biting.

The additional challenge is something I have heard referred to as “data munging.” This is the art of associating apples and orangutans. The two have very little in common and have dramatically different attributes. Nevertheless, we are compelled to assume that their coexistence in the same database will reveal hitherto unrealized returns on investigatory investment.

Social media influences, advertising exposures, click-through activities, email opens, blog post sentiments, shopping cart inclusions, likes, shares, and +1s are not measurable in the same way by the same scale and in any standard form. And yet…

During a recent interview, Brandt Dainow from ThinkMetrics asked about the complexity of data reconciliation. I could not give him a clear answer. I was, instead, frustrated that the term “data reconciliation” would be perfect for this problem if it were not already in vogue to describe rectifying errors introduced by measurement noise.

Now that you’ve read this far, I have two pleas to make to you:

What is the proper term for the conjoining of disparate data types for the purpose of building a truly useful model – in this case of customer behavior – for the purpose of optimizing marketing?
And, does anybody have any ideas they’d like to share on how this can be done in a way that is useful across more than one instance (industry/product line/campaign)?

I’m all ears. (3 ¼” x 1 ¾” each.)

Subscribe to get your daily business insights

More about:

Read the next article

Explore Tech Talks

Lucy

Lucy helps organizations leverage knowledge for in... View Tech Talk
TVSquared

TVSquared is the global leader in cross-platform T... View Tech Talk
Grata

Grata is a B2B search engine for discovering small... View Tech Talk

Whitepapers

US Mobile Streaming Behavior

Whitepaper | Mobile

US Mobile Streaming Behavior

Streaming has become a staple of US media-viewing habits. Streaming video, however, still comes with a variety of pesky frustrations that viewers are ...

View resource

Winning the Data Game: Digital Analytics Tactics for Media Groups

Whitepaper | Analyzing Customer Data

Winning the Data Game: Digital Analytics Tactics for Media Groups

Winning the Data Game: Digital Analytics Tactics f...

Data is the lifeblood of so many companies today. You need more of it, all of which at higher quality, and all the meanwhile being compliant with data...

View resource

Learning to win the talent war: how digital marketing can develop its people

Whitepaper | Digital Marketing

Learning to win the talent war: how digital marketing can develop its peopl...

Learning to win the talent war: how digital market...

This report documents the findings of a Fireside chat held by ClickZ in the first quarter of 2022. It provides expert insight on how companies can ret...

View resource

Engagement To Empowerment - Winning in Today's Experience Economy

Report | Digital Transformation

Engagement To Empowerment - Winning in Today's Experience Economy

Engagement To Empowerment - Winning in Today's Exp...

Customers decide fast, influenced by only 2.5 touchpoints – globally! Make sure your brand shines in those critical moments. Read More...

View resource

Fospha as TikTok’s New Measurement Partner

Analytics

Fospha as TikTok’s New Measurement Partner

3m Fospha Team

Fospha as TikTok’s New Measurement Partner

Understanding media performance in digital marketing is like navigating a maze that constantly changes. The emergence of platforms like TikTok has rev...

View article

How Walgreens Boots Alliance rebuilt for first party data

Actionable analysis

How Walgreens Boots Alliance rebuilt for first party data

2y Benjamin Broomfield

How Walgreens Boots Alliance rebuilt for first par...

Boots, one of the UK’s largest Beauty and Pharmacy retailers, has a rich 170-year history. Their customers have always been at the core of this....

View article

The growing culture of marketing experimentation

Actionable Analysis

The growing culture of marketing experimentation

4y Nick Stoltz

The growing culture of marketing experimentation

Marketers are turning to new methods to analyze the effectiveness of their campaigns, and they are getting more accurate results more quickly with inc...

View article

6 ways to increase your conversion rate using behavioral data

Analytics

6 ways to increase your conversion rate using behavioral data

7y Mike O'Brien

6 ways to increase your conversion rate using beha...

SessionCam and Subway deliver practical advice on how to improve your conversion rates using behavioral data. Read More...

View article

Facebook reveals it miscalculated even more metrics

Analytics

Facebook reveals it miscalculated even more metrics

7y Al Roberts

Facebook reveals it miscalculated even more metric...

The Like and Share counts retrieved through Facebook's Graph API were inconsistent with the counts displayed through search queries in Facebook's mobi...

View article

American Apparel: driving customer centricity in an omnichannel world

Acquisition

American Apparel: driving customer centricity in an omnichannel world

8y Sophie Loras

American Apparel: driving customer centricity in a...

American Apparel's chief digital officer discussed the future of retail, the importance of delivering value to the consumer, and strategies for an IoT...

View article

Study: Marketers are clueless on cross-channel measurement

Ad Industry Metrics

Study: Marketers are clueless on cross-channel measurement

8y Mike O'Brien

Study: Marketers are clueless on cross-channel mea...

A new study from Origami Logic highlights how much marketers struggle with cross-channel measurement. How can they make data less overwhelming? Read M...

View article

Six tips on measurement from the IAB Programmatic Marketplace

Actionable Analysis

Six tips on measurement from the IAB Programmatic Marketplace

8y Mike O'Brien

Six tips on measurement from the IAB Programmatic ...

Not getting too complicated with metrics is just one important point covered in an IAB conference session all about attribution. Read More...

View article

Follow us

Customer Data Munging and Reconciliation for Correlation

Subscribe to get your daily business insights

Read the next article

Explore Tech Talks

Whitepapers

Whitepapers

US Mobile Streaming Behavior

US Mobile Streaming Behavior

Winning the Data Game: Digital Analytics Tactics for Media Groups

Winning the Data Game: Digital Analytics Tactics f...

Learning to win the talent war: how digital marketing can develop its peopl...

Learning to win the talent war: how digital market...

Engagement To Empowerment - Winning in Today's Experience Economy

Engagement To Empowerment - Winning in Today's Exp...

Related Articles

Fospha as TikTok’s New Measurement Partner

Fospha as TikTok’s New Measurement Partner

How Walgreens Boots Alliance rebuilt for first party data

How Walgreens Boots Alliance rebuilt for first par...

The growing culture of marketing experimentation

The growing culture of marketing experimentation

6 ways to increase your conversion rate using behavioral data

6 ways to increase your conversion rate using beha...

Facebook reveals it miscalculated even more metrics

Facebook reveals it miscalculated even more metric...

American Apparel: driving customer centricity in an omnichannel world

American Apparel: driving customer centricity in a...

Study: Marketers are clueless on cross-channel measurement

Study: Marketers are clueless on cross-channel mea...

Six tips on measurement from the IAB Programmatic Marketplace

Six tips on measurement from the IAB Programmatic ...