Oftentimes it may be cheaper and less time-consuming to just keep data than to delete it. But how long is data valid and valuable, and how long until that data is dangerous?
"It's cheaper to keep data than delete it," is a quote from Bob Page, vice president of products at Hortonworks, from the last eMetrics Summit. It sounds absurd at first, but his premise is sound.
The cost of throwing more hardware at your storage system or increasing your rental space in the cloud might just be lower than the cost of deciding what to delete.
I'm not referring to medical records, primary research data, or financial records, etc. "Standard accounting practices," research protocols, and the IRS cover those instances. I'm talking about the customer-related data you keep for advertising and marketing purposes.
Data governance usually revolves around what data to collect, how it will be cleaned and managed, and who may access or manipulate it. But deletion seldom enters the conversation.
Philosophy, strategy, policy, and methodology of data deletion all need to be discussed, aligned, rolled out, managed, and maintained, which will take a fair amount of time and resources.
How Long Is Data Valid and Valuable?
The U.K.'s Data Protection Act says, "Personal data processed for any purpose or purposes shall not be kept for longer than is necessary for that purpose or those purposes." One finds Zen koans in the strangest places.
Today, data is collected and kept on the chance that somebody will think of any interesting question. Deleting data too soon may cause trouble.
The United Parcel Service decided the 200 and some addresses I had entered into their database were not worthy of maintaining (See "Where's My Freakin' Data You B@stards?") and obliterated it without asking me.
How Long Until Data Is Dangerous?
Your legal department will tell you that data becomes dangerous when it is kept so long that it might fall into the wrong hands: hackers or opposing attorneys.
Data protection is getting more and more attention these days and your IT department is tasked with keeping it all safe and sound. After all, your legal liability grows the more data you keep and the longer you keep it.
"Discoverability" is also a serious concern to your legal beagles. When the other side in a lawsuit asks you to produce electronic evidence, it's best if one can point to a policy that states, "customer data shall be destroyed after X years," along with a well-documented procedure that manages the deletion process.
But data also becomes dangerous when it no longer represents the truth.
Amazon can show me everything I've bought from them since my first purchase on March 26, 1996. That data is still valid. It may not be valuable, but it is still true. However, what I searched for in 1996 no longer represents my intent to purchase, is no longer true, and is actually harmful to an algorithm trying to help me find and buy new stuff.
Amazon's approach is to trust their customers to do the decision-making for them by offering you the chance to "Improve Your Recommendations." They invite you to rate the items you've purchased, identify which you bought as a gift, or simply check the box that says, "Don't use for recommendations."
How steep is your data decay curve? When does your data become toxic and corrupt the veracity of the answers you seek?
Clearly, it's necessary to delete some data from time to time. The question is, what data is not worth the effort of even worrying about?
On the heels of a fantastic event in New York City, ClickZ Live is taking the fun and learning to Toronto, June 23-25. With over 15 years' experience delivering industry-leading events, ClickZ Live offers an action-packed, educationally-focused agenda covering all aspects of digital marketing. Register today!
Jim Sterne is an international consultant who focuses on measuring the value of the Web as a medium for creating and strengthening customer relationships. Sterne has written eight books on using the Internet for marketing, is the founding president and current chairman of the Digital Analytics Association and produces the eMetrics Summit and the Media Analytics Summit.
Hong Kong, May 5-6, 2015
Gartner Magic Quadrant for Digital Commerce
This Magic Quadrant examines leading digital commerce platforms that enable organizations to build digital commerce sites. These commerce platforms facilitate purchasing transactions over the Web, and support the creation and continuing development of an online relationship with a consumer.
Paid Search in the Mobile Era
Google reports that paid search ads are currently driving 40+ million calls per month. Cost per click is increasing, paid search budgets are growing, and mobile continues to dominate. It's time to revamp old search strategies, reimagine stale best practices, and add new layers data to your analytics.
May 6, 2015
12:00pm ET/9:00am PT