businessman

Cross-Industry Standard Process for Data Mining

  |  November 8, 2012   |  Comments

Do you really know what your business process is?

When we marketing people first started looking into web data back in the mid 1990's, we were inventing an industry. We made up words (click-through, pageview, bounce rate). We made up tools (Sawmill, Webtrends, NetGenesis). We made up processes.

smarteracronym

We were so engaged in creating books, conferences, and trade associations for this previously unknown data stream, that we kept right on creating without looking around at what others were doing.

One of the things they were working on was the Cross-Industry Standard Process for Data Mining (CRISP-DM). While the project never caught on like wildfire, the timing behind it and the basic premise make for a useful methodology for performing analysis and offers some solace for those of us who are laden with data and overwhelmed with business questions.

That makes it worth a quick review.

CRISP-DM divvies up data mining into six phases:

  1. Business understanding. What problem are you solving for?
  2. Data understanding. What do you have to work with?
  3. Data preparation. Choose and validate which data you'll use.
  4. Modeling. Create a conceptual model and draw conclusions.
  5. Evaluation. Test how well the model holds up against data.
  6. Deployment. Implement the best models for making business decisions.

handdraws

It seems brutally simple as a list, but think about your last project. Did you miss a step? Maybe skimp on some aspect?

Perhaps the trickiest piece of all is at the very start. Do you really know what the business problem is?

The business understanding step requires a cultural ability to collaboratively determine business objectives. It requires the proper background, clear, assented business objectives, and even clearer and more agreed-to business success criteria. This is usually a tricky political process and one that is often neglected.

You must properly enumerate the available resources, agree to specific requirements, identify areas of deficiency, and communally concur on risks and contingencies. Just getting the terminology straight can be a task that requires weeks of meetings and innumerable emails.

Once the costs and benefits are ironed out, specific data-mining goals have to be acknowledged and specific success criteria must be signed off.

Oh - and one more thing...failure criteria. When will you know the project is a failure and who has the right to pull the plug?

Only after all of this is in place can you create a project plan with specific sponsors recognized and specific outputs delineated. Then you can break it all down into explicit sub-tasks by specific team members.

businessman

If this sounds like a lot of work, it is. If this sounds like too much work, then corporate culture may not allow for a rigorous project. It may be time to reassess the likelihood of any project analytics getting traction.

Remember, you still have to work out how the data will be collected, validated, catalogued, cleansed, attributed, integrated, formatted, modelled, tested, evaluated, deployed, and applied to the business.

What? You thought this was going to be a piece of cake?

To help you along the way, there is a visual guide to CRISP-DM and a Decision Management Solutions' Eclipse Process Framework version (download) of the CRISP-DM methodology, which includes business rules and integration of analytics and rules. It's an open-source tool for managing methodologies both to allow developers of methodologies to share them and companies to customize them.

CRISP-DM may be a tough row to hoe, but at least you won't have to make it up as you go along.

Smarter Acronym, Hand Draws, and Business Man images via Shutterstock.

This column was originally published on August 2, 2012.

Tags:

ClickZ Live San Francisco This Year's Premier Digital Marketing Event is #CZLSF
ClickZ Live San Francisco (Aug 11-14) brings together the industry's leading practitioners and marketing strategists to deliver 4 days of educational sessions and training workshops. From Data-Driven Marketing to Social, Mobile, Display, Search and Email, this year's comprehensive agenda will help you maximize your marketing efforts and ROI. Register today!

ABOUT THE AUTHOR

Jim Sterne

Jim Sterne is an international consultant focused on measuring the value of the online marketing for creating and strengthening customer relationships. Sterne has written eight books on using the Internet for marketing, produces the eMetrics Marketing Optimization Summit and is co-founder and current chairman of the Digital Analytics Association.

COMMENTSCommenting policy

comments powered by Disqus

Get the ClickZ Analytics newsletter delivered to you. Subscribe today!

COMMENTS

UPCOMING EVENTS

Featured White Papers

BigDoor: The Marketers Guide to Customer Loyalty

The Marketer's Guide to Customer Loyalty
Customer loyalty is imperative to success, but fostering and maintaining loyalty takes a lot of work. This guide is here to help marketers build, execute, and maintain a successful loyalty initiative.

Marin Software: The Multiplier Effect of Integrating Search & Social Advertising

The Multiplier Effect of Integrating Search & Social Advertising
Latest research reveals 68% higher revenue per conversion for marketers who integrate their search & social advertising. In addition to the research results, this whitepaper also outlines 5 strategies and 15 tactics you can use to better integrate your search and social campaigns.

WEBINARS

    Information currently unavailable

Jobs

    • Interactive Product Manager
      Interactive Product Manager (Western Governors University) - Salt Lake CityWestern Governors University, one of the 20 largest universities...
    • SEO Senior Analyst
      SEO Senior Analyst (University of Phoenix (Apollo Education Group)) - San FranciscoSEO Senior Analyst   Position Summary...
    • SEM & Biddable Media Manager
      SEM & Biddable Media Manager (Kepler Group LLC) - New YorkAs an Optimization & Innovation Manager at Kepler Group, you will be on the bleeding...