What Machines Haven't Learned Yet

Machine learning is really, really powerful and it opens up new ways of doing analysis with Big Data. But, it cannot act alone.

Author

Jim Sterne

Date published September 30, 2013 Categories

gary-angel

I had an interesting talk with Gary Angel the other day… which is the same as saying I talked to Gary Angel the other day.

Gary is the partner / principal of the Digital Analytics Center of Excellence at Ernst & Young. An engaging guy with a very large brain.

With more than 20 years of analytics under his belt, Gary knows about analytics and is more than willing to share. So, when the subject turned to machine learning, I tuned in a little tighter.

Gary is not an artificial intelligence guy. In my narrow sense of the term, I wouldn’t even call him a data scientist.

Data Scientist; One responsible for understanding and advancing the nature of data, its collection methods, and the algorithms for processing it.
– Jim Sterne’s Private Opinion Dictionary

Gary is a business problem solver who happens to use data to get the job done. So I tune into his perspective on machine learning because he’s going to base that opinion on years of practical application. He works in the field, not in the lab.

Gary and I agreed that machine learning is really, really powerful and it opens up new ways of doing analysis with Big Data. But, it cannot act alone.

As Ron Kohavi, George H. John put it in their paper: Wrappers for feature subset selection:

A universal problem that all intelligent agents must face is where to focus their attention. A problem-solving agent must decide which aspects of a problem are relevant, an expert-system designer must decide which features to use in rules, and so forth. Any learning agent must learn from experience, and discriminating between the relevant and irrelevant parts of its experience is a ubiquitous problem.

sterne-092613
This is where you come in. You need to be or find some subject matter experts who can separate the wheat from the chaff.

Big Data cannot gobble up every bit you collect and paw through tens of thousands of variables and figure out what’s important.

As Gary puts it, “Good analysis comes from someone figuring out what the right variables are.”

He then recounted an example from a Digital Analytics Association Symposium in Philadelphia about calculating what movies people are most likely to want to see next. The data was collected from set-top boxes and the machine determined that movies beginning with the letter “A” were far more likely to be preferred.

A human knows instantly that this is the result of movies being listed alphabetically and is not a valuable variable for determining the likeability of any given movie. It is not proper fodder for a recommendation engine.

If you’re crunching numbers in marketing, do you know if time-of-day is any more predictive of a purchase than geography? Search behavior? Click behavior? Shopping cart population?

This is why people with business smarts will always have a job.

Gary provided another example:

We did a segmentation analysis for an online travel aggregator, looking at purely search behavior data. We found a very interesting segmentation, but we had to put a lot of thought into what that search behavior meant.

If someone did a search, changed the data of the search, and then looked for the same destination, we could infer that they were flexible about dates. If they change the destination but didn’t change the date, we infer they were flexible about destinations.

We created those as variables in the analysis and that became a very powerful predictor for them.

But that’s not inherent in the behaviors, right? An analyst had to
figure out that changing those two things was a valuable variable for the analysis.

When we started, the obvious variable was destination – was a traveler going to Las Vegas, for example. But as we thought through the analysis, many additional variables that were even more interesting emerged. It was about how far out they were searching, how many days between the search, when the search was conducted, and what the destination date was. Added to this it was, whether they change the search, whether they change the destination, whether it was a weekend, and whether it was a weekend included in the stay. All those kinds of things turned out to be not surprisingly very important but those are things that, unless you feed them into the machine, you won’t get a good analysis.

Once the human picks out the high value variables, the machine will do a great job figuring out which ones are important.

The lesson is to use that part of your brain that it’s best at: intuition, relevance, reasoning, etc., and then let the machine do what it’s best at: calculation, tabulation, tabulation, enumeration.

Computers are incredibly fast, accurate, and stupid. Human beings are incredibly slow, inaccurate, and brilliant. Together they are powerful beyond imagination.
– Albert Einstein

Subscribe to get your daily business insights

More about:

Read the next article

Engagement To Empowerment - Winning in Today's Experience Economy

Report | Digital Transformation

Engagement To Empowerment - Winning in Today's Experience Economy

Engagement To Empowerment - Winning in Today's Exp...

Customers decide fast, influenced by only 2.5 touchpoints – globally! Make sure your brand shines in those critical moments. Read More...

View resource

Announcement Alert from Lee Arthur

Weekly briefing | Digital Transformation

Announcement Alert from Lee Arthur

Announcement Alert!! Read More

View resource

The 2023 B2B Superpowers Index

Whitepaper | Digital Transformation

The 2023 B2B Superpowers Index

The Merkle B2B 2023 Superpowers Index outlines what drives competitive advantage within the business culture and subcultures that are critical to succ...

View resource

Impact of SEO and Content Marketing

Whitepaper | Digital Transformation

Impact of SEO and Content Marketing

Making forecasts and predictions in such a rapidly changing marketing ecosystem is a challenge. Yet, as concerns grow around a looming recession and b...

View resource

How your CMS can help personalise your customer journey

Analytics

How your CMS can help personalise your customer journey

9y Chris Camps

How your CMS can help personalise your customer jo...

A great customer experience has moved from ‘nice to have’ to ‘can’t do without’. Your users expect be engaged from the moment they land on your site, ...

View article

Programmatic advertising: what's the difference between good and bad data?

Actionable Analysis

Programmatic advertising: what's the difference between good and bad data?

10y Evan Magliocca

Programmatic advertising: what's the difference be...

Marketers need to know what’s in their data and trim out the filler to provide continuous, data-driven ROI for their brands. Read More...

View article

Finding intelligence to act on from big data: a five step approach

Actionable Analysis

Finding intelligence to act on from big data: a five step approach

10y Evan Magliocca

Finding intelligence to act on from big data: a fi...

Every marketer has been sitting with his or her analytics team, reviewing an overwhelming spreadsheet of data points. It tends to hurt your eyes and y...

View article

Three ways to create insights from consumers’ click histories

Actionable Analysis

Three ways to create insights from consumers’ click histories

10y Deren Baker

Three ways to create insights from consumers’ clic...

Without any action behind it, data is just a bunch of numbers. Clickstream data is particularly valuable, providing insights about what consumers are ...

View article

A guide to understanding the different types of data available to marketers

Analytics

A guide to understanding the different types of data available to marketers

10y Kym Reynolds

A guide to understanding the different types of da...

Your customers are engaging with your business across an increasing number of touchpoints – websites, social media, in-store, mobile and t...

View article

Five ways to optimize customer loyalty in a changing landscape

Analytics

Five ways to optimize customer loyalty in a changing landscape

10y Stephen Hay

Five ways to optimize customer loyalty in a changi...

Earning and retaining customer loyalty for brands is a more complex endeavor than it used to be. Use these tips to gain consumers' devotion and fortif...

View article

Cracking the code: CMOs, measurement, and metrics

Analytics

Cracking the code: CMOs, measurement, and metrics

10y Sanjay Dholakia

Cracking the code: CMOs, measurement, and metrics

Although technological advances have simplified evaluating metrics and quantifying data from campaigns, it's still a complicated process. How should m...

View article

3 changes that will re-make your analytics

Actionable Analysis

3 changes that will re-make your analytics

10y Andrew Edwards

3 changes that will re-make your analytics

In addition to modifying how tags are managed and where they are placed, you can refine your analytics strategy and improve data collection accuracy b...

View article

Follow us

Strategy

Innovation

Insights

Stats & Tools

What Machines Haven't Learned Yet

Leave a Reply Cancel reply

Subscribe to get your daily business insights

Read the next article

Engagement To Empowerment - Winning in Today's Experience Economy

Engagement To Empowerment - Winning in Today's Exp...

Announcement Alert from Lee Arthur

Announcement Alert from Lee Arthur

The 2023 B2B Superpowers Index

The 2023 B2B Superpowers Index

Impact of SEO and Content Marketing

Impact of SEO and Content Marketing

Related Articles

How your CMS can help personalise your customer journey

How your CMS can help personalise your customer jo...

Programmatic advertising: what's the difference between good and bad data?

Programmatic advertising: what's the difference be...

Finding intelligence to act on from big data: a five step approach

Finding intelligence to act on from big data: a fi...

Three ways to create insights from consumers’ click histories

Three ways to create insights from consumers’ clic...

A guide to understanding the different types of data available to marketers

A guide to understanding the different types of da...

Five ways to optimize customer loyalty in a changing landscape

Five ways to optimize customer loyalty in a changi...

Cracking the code: CMOs, measurement, and metrics

Cracking the code: CMOs, measurement, and metrics

3 changes that will re-make your analytics

3 changes that will re-make your analytics