Oftentimes it may be cheaper and less time-consuming to just keep data than to delete it. But how long is data valid and valuable, and how long until that data is dangerous?
"It's cheaper to keep data than delete it," is a quote from Bob Page, vice president of products at Hortonworks, from the last eMetrics Summit. It sounds absurd at first, but his premise is sound.
The cost of throwing more hardware at your storage system or increasing your rental space in the cloud might just be lower than the cost of deciding what to delete.
I'm not referring to medical records, primary research data, or financial records, etc. "Standard accounting practices," research protocols, and the IRS cover those instances. I'm talking about the customer-related data you keep for advertising and marketing purposes.
Data governance usually revolves around what data to collect, how it will be cleaned and managed, and who may access or manipulate it. But deletion seldom enters the conversation.
Philosophy, strategy, policy, and methodology of data deletion all need to be discussed, aligned, rolled out, managed, and maintained, which will take a fair amount of time and resources.
How Long Is Data Valid and Valuable?
The U.K.'s Data Protection Act says, "Personal data processed for any purpose or purposes shall not be kept for longer than is necessary for that purpose or those purposes." One finds Zen koans in the strangest places.
Today, data is collected and kept on the chance that somebody will think of any interesting question. Deleting data too soon may cause trouble.
The United Parcel Service decided the 200 and some addresses I had entered into their database were not worthy of maintaining (See "Where's My Freakin' Data You B@stards?") and obliterated it without asking me.
How Long Until Data Is Dangerous?
Your legal department will tell you that data becomes dangerous when it is kept so long that it might fall into the wrong hands: hackers or opposing attorneys.
Data protection is getting more and more attention these days and your IT department is tasked with keeping it all safe and sound. After all, your legal liability grows the more data you keep and the longer you keep it.
"Discoverability" is also a serious concern to your legal beagles. When the other side in a lawsuit asks you to produce electronic evidence, it's best if one can point to a policy that states, "customer data shall be destroyed after X years," along with a well-documented procedure that manages the deletion process.
But data also becomes dangerous when it no longer represents the truth.
Amazon can show me everything I've bought from them since my first purchase on March 26, 1996. That data is still valid. It may not be valuable, but it is still true. However, what I searched for in 1996 no longer represents my intent to purchase, is no longer true, and is actually harmful to an algorithm trying to help me find and buy new stuff.
Amazon's approach is to trust their customers to do the decision-making for them by offering you the chance to "Improve Your Recommendations." They invite you to rate the items you've purchased, identify which you bought as a gift, or simply check the box that says, "Don't use for recommendations."
How steep is your data decay curve? When does your data become toxic and corrupt the veracity of the answers you seek?
Clearly, it's necessary to delete some data from time to time. The question is, what data is not worth the effort of even worrying about?
Learn Digital Marketing Insights From Leading Brands!
ClickZ Live Chicago (Nov 3-6) will deliver over 50 sessions across 4 days and 10 individual tracks, including Data-Driven Marketing, Social, Mobile, Display, Search and Email. Check out the full agenda, or register and attend one of the best ClickZ events yet!
Jim Sterne is an international consultant who focuses on measuring the value of the Web as a medium for creating and strengthening customer relationships. Sterne has written eight books on using the Internet for marketing, is the founding president and current chairman of the Digital Analytics Association and produces the eMetrics Summit and the Media Analytics Summit.
Hong Kong, October 21-22
London, November 13-14
San Francisco, November 13-14
London, November 18-19
IBM Social Analytics: The Science Behind Social Media Marketing
80% of internet users say they prefer to connect with brands via Facebook. 65% of social media users say they use it to learn more about brands, products and services. Learn about how to find more about customers' attitudes, preferences and buying habits from what they say on social media channels.
An Introduction to Marketing Attribution: Selecting the Right Model for Search, Display & Social Advertising
If you're considering implementing a marketing attribution model to measure and optimize your programs, this paper is a great introduction. It also includes real-life tips from marketers who have successfully implemented attribution in their organizations.
October 23, 2014
1:00pm ET/10:00am PT