De-Mystifying Models

Calculating the math behind a predictive model for display advertising.

Author

George John

Date published November 25, 2009

Many ad networks and ad technology providers claim to build “predictive models” that are used in the course of buying ad space for a campaign and/or figuring out which ad to show on a given impression. Models are relatively new to display advertising, but they’re a common and fairly standard practice in other forms of direct marketing.

In the mid-1990s, “data mining” and statistical modeling became quite fashionable in the direct marketing/database marketing industry. IBM entered the fray with a product called “Intelligent Miner,” and after likely having a few too many refreshments with their ad agency team, the IBM execs agreed to run a TV spot to promote the service.

Somewhere along the way, the idea of explaining “models” to the TV-viewing audience turned into a commercial spot set on the runway of a fashion show: beautiful Swedish models found time mid-catwalk to tell each other about how they were optimizing sales of their own fashion merchandise using data mining from IBM.

It was a little confusing to have fashion models talking about statistical models, but the point was that both are models in the sense that they’re representations of something in the real world. A fashion model is supposed to show you what a particular jacket would look like if you wore it, so you can decide if you want to buy the jacket. A predictive model for real-time display ad media buying is supposed to tell you what would happen if you delivered your ad in a particular impression, so you can decide if you want to buy the impression.

One big difference between fashion models and predictive models for media buying is the desired bias in the models. Marketers in the fashion industry realize that it’s in their interest to make their models a tad optimistic: you won’t actually become tall and thin and get high cheekbones when you buy the jacket. But predictive models for display ad media buying are intended to have no bias, so that they accurately predict the likelihood of response.

Predictive models are different from what you might call “plain old bidding rules.” Rules typically include just a few features of the impression. Models are usually more granular: they blend many features together with math to arrive at some kind of score or estimated chance of response.

For example, a bidding rule might be “Bid $2.24 on all impressions on People.com for users who’ve seen fewer than three impressions of my ad in the last day.”
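As a rough illustration, a rule like this one could be written as a single condition. The `Impression` type, its field names, and the `rule_bid` function are hypothetical names chosen for this sketch, and returning 0.0 to mean “don’t bid” is an assumption, not something an exchange requires.

```python
from dataclasses import dataclass


@dataclass
class Impression:
    """Hypothetical view of a bid request; field names are assumptions."""
    site: str               # site the ad slot is on
    ads_seen_last_day: int  # times this user saw our ad in the last day


def rule_bid(imp: Impression) -> float:
    """Plain bidding rule: a fixed dollar bid if the condition holds, else no bid."""
    if imp.site == "People.com" and imp.ads_seen_last_day < 3:
        return 2.24
    return 0.0  # assumption: 0.0 means we pass on the impression
```

The rule looks at only two features and returns one of two answers, which is what makes it a rule rather than a model.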

A predictive model might be: “Take 0.92 and multiply it by itself as many times as the user has seen this ad in the last day, then multiply that number by .013 if the current page is on Yahoo, or else .006 if it’s on another site. Then multiply by the historical post-click conversion rate for this ad, and this is the estimated chance that the user will convert if we buy this impression for our ad.” In this case, we’re assuming the goal is predicting conversions, but you can model anything you can measure, including ad engagement, clicks, or other kinds of goals.
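The model described in words above can be sketched as one small function. The coefficients (0.92, .013, .006) come straight from the example; the function and parameter names are made up for illustration, and a production model would likely have many more inputs.

```python
def estimated_conversion_chance(
    ads_seen_last_day: int,
    on_yahoo: bool,
    post_click_conversion_rate: float,
) -> float:
    """Estimated chance that buying this impression leads to a conversion."""
    # 0.92 multiplied by itself once per prior exposure in the last day
    frequency_decay = 0.92 ** ads_seen_last_day
    # site-dependent factor from the example: .013 on Yahoo, .006 elsewhere
    site_factor = 0.013 if on_yahoo else 0.006
    return frequency_decay * site_factor * post_click_conversion_rate
```

Unlike the rule, the output changes smoothly with every input: each extra exposure lowers the estimate by 8 percent instead of flipping a bid on or off.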

The way a predictive model works with real-time bidding on exchanges is that your bidding server software basically gets a poke many times per second, where the exchange says, “OK, I’ve got browser #AH842DEH19 on myyearbook.com right now and I need a 300×250 ad…what do you bid?” The bid server would look up the user’s frequency (say it’s three) and the ad’s historical post-click conversion rate (let’s say it’s one in 100), then multiply (.92 by .92 by .92) by .006 by .01 to get 0.0000467, or roughly .005 percent, as the chance this particular impression will yield a conversion. If the target CPA (cost per acquisition) is $40, then we can afford to bid $40 times .0000467, or $.00187, for this impression, which equates to a $1.87 CPM (cost per thousand impressions) rate.
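To check the arithmetic, here is a minimal sketch of the last step, turning an estimated conversion chance into a bid. The `max_bid` name is hypothetical, and the sketch assumes we are willing to pay exactly the expected value of the impression, with no margin or fees.

```python
def max_bid(p_conversion: float, target_cpa: float) -> tuple[float, float]:
    """Return (bid per impression, equivalent CPM), both in dollars."""
    per_impression = target_cpa * p_conversion  # expected value of one impression
    cpm = per_impression * 1000                 # CPM is the price per 1,000 impressions
    return per_impression, cpm


# Worked example: frequency of three, a non-Yahoo site, 1-in-100 post-click rate
p = 0.92 ** 3 * 0.006 * 0.01
per_impression, cpm = max_bid(p, target_cpa=40.0)
print(f"chance={p:.7f}  bid=${per_impression:.5f}  CPM=${cpm:.2f}")
```

Run as written, the chance comes out near 0.0000467 and the CPM near $1.87.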

There’s more math to it that deals with how we figure out which features of the impression we should include in the model in the first place, what the various coefficients should be (for example, why .92 is a good decay factor for frequency), how to deal with scenarios when we don’t yet have enough data to estimate things like the historical post-click conversion rate, what to do when we’re bidding on behalf of not just one ad but many ads, and how to do “portfolio optimization” across multiple exchanges.

Good models always win versus guesswork because there are just too many factors for a person to pay attention to. Also, intuition about the kinds of users that will respond to an ad, or the kinds of Web sites they can be found on, is often good but incomplete, and models can almost always find subsegments within an intuited audience that are inefficient, or conversely new segments of inventory to buy that outperform.
