When Will AI Help Solve Spam?

Targeting spam with artificial intelligence and concept technologies.

Date published: January 22, 2004

I was at a project in Vermont for five weeks and had a problem that isn’t new to anyone reading this column. I was forced to use a dial-up connection to access my email. Was it slow! I used a “download message headers only” feature to view only the message envelopes, saving me from downloading all that spam. But Outlook’s filters made a mistake. My client tried to send an email three times before I realized why I never got it: The word “debt” appeared in the message’s body copy. I have a filter that zaps all email containing words such as “debt” or “mortgage.” This email wasn’t spam. It just happened to use that spam-like word.

What’s the solution? Maybe it’s artificial intelligence (AI).

Regular readers know my pre-marketing background is in AI and its e-commerce applications. I’ve been involved in the development of many technologies over the years. A few could really help fight the spam wars. Yet no one really seems to be using them for that purpose.

The problem with current spam filters is that they operate on keywords and heuristics. Outlook’s rules engine and similar spam filters use keyword matching (searching for specific words) to identify terms that indicate spam. To elude these keyword-matching filters, a lot of spam these days substitutes characters such as “$” for “S” in subject lines.
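To make the weakness concrete, here is a minimal sketch of a keyword filter. It is not Outlook's actual rules engine; the word list, function name, and sample messages are all hypothetical and chosen only for illustration.

```python
import re

# Hypothetical block list, similar in spirit to a hand-written mail rule.
BLOCKED_WORDS = {"debt", "mortgage"}

def keyword_filter(message: str) -> bool:
    """Return True if any blocked word appears as a whole word in the message."""
    words = re.findall(r"[a-z]+", message.lower())
    return any(word in BLOCKED_WORDS for word in words)

# Hypothetical sample messages.
legit = "Please review the attached proposal on restructuring the client's debt."
spam = "Lower your mortgage payments today!"
obfuscated = "Lower your m0rtgage payment$ and erase your d.e.b.t today!"

print(keyword_filter(legit))       # True: a legitimate email is zapped
print(keyword_filter(spam))        # True: plain spam is caught
print(keyword_filter(obfuscated))  # False: disguised spam slips through
```

The filter fails in both directions: it blocks the business email because of one word, and it misses the spam once a few characters are swapped.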

Service providers also use heuristics, such as the number of email messages sent from a particular address to addresses within their services, to determine if something might be spam. The problem with this, of course, is that nonspam email (such as newsletters) is also sent in bulk and exhibits the same patterns as spam. Obviously the keyword/heuristic approach isn’t working.
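A volume heuristic of this kind might look something like the sketch below. The threshold, function name, and addresses are assumptions made up for the example, not any provider's real configuration.

```python
from collections import Counter

# Hypothetical cutoff: senders at or above this many messages are flagged.
BULK_THRESHOLD = 3

def flag_bulk_senders(deliveries, threshold=BULK_THRESHOLD):
    """Return the set of senders with at least `threshold` messages.

    `deliveries` is an iterable of (sender, recipient) pairs.
    """
    counts = Counter(sender for sender, _recipient in deliveries)
    return {sender for sender, n in counts.items() if n >= threshold}

# Hypothetical delivery log for one provider.
deliveries = [
    ("deals@spammer.example", "ann@isp.example"),
    ("deals@spammer.example", "bob@isp.example"),
    ("deals@spammer.example", "cat@isp.example"),
    ("news@newsletter.example", "ann@isp.example"),
    ("news@newsletter.example", "bob@isp.example"),
    ("news@newsletter.example", "cat@isp.example"),
    ("client@partner.example", "ann@isp.example"),
]

print(flag_bulk_senders(deliveries))
```

Both the spammer and the newsletter are flagged, because the count alone cannot tell a requested bulk mailing from an unwanted one.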

Several technologies already exist that can be refitted to help fight spam. Let’s look at a few.

Content Abstraction

Content abstraction (natural language processing) technology looks at unstructured content (e.g., a news article) and creates abstracts that convey, in a short paragraph, the essence of the article. News services use this technology to create automatic abstracts to display in search results.

Two types of technologies are commonly used for this: those that output human-readable text and those that generate a “concept fingerprint” of the article. A human can’t understand this fingerprint, but the computer can. Active Navigation outputs human-readable abstracts, whereas Autonomy uses the concept fingerprint.

Why are these technologies interesting as anti-spam weapons? They can identify concepts and work on unstructured email (e.g., a message body). They’re perfect for understanding the essence of what an email communication is about. In the above example, these technologies would have understood that my client’s email containing the word “debt” was a business proposal I had to review, not a solicitation for a credit card.

These technologies are concept-based, not keyword-based. That means it doesn’t matter if someone calls it “Viagra,” “V_i_a_g_r_a,” or “that little blue pill.” The concept engine understands the email wants to sell you some type of drug.

Neural Networks

Neural networks are a technology used by credit card companies to help identify credit card fraud. The basic premise is you train a neural network to identify patterns inherent to fraudulent behavior. The system analyzes new purchase patterns and raises a red flag if it thinks a purchase may be fraudulent. Rules-based systems (like the one in Outlook) are also used to weed out common purchase situations that are most likely fraudulent.
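The training idea can be sketched with a single logistic neuron, the smallest possible "network." This is a toy under stated assumptions, not how any card issuer's system works: the two features (a purchase amount scaled to the range 0 to 1, and a flag for a foreign purchase), the training data, and all names are hypothetical.

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def train_neuron(samples, epochs=2000, lr=0.5):
    """Train one logistic neuron with stochastic gradient descent.

    `samples` is a list of (features, label) pairs, label 1 = fraud.
    """
    weights = [0.0] * len(samples[0][0])
    bias = 0.0
    for _ in range(epochs):
        for features, label in samples:
            z = bias + sum(w * x for w, x in zip(weights, features))
            error = sigmoid(z) - label  # gradient of the log loss w.r.t. z
            weights = [w - lr * error * x for w, x in zip(weights, features)]
            bias -= lr * error
    return weights, bias

def fraud_probability(features, weights, bias):
    return sigmoid(bias + sum(w * x for w, x in zip(weights, features)))

# Hypothetical history: (scaled amount, is_foreign) -> was it fraud?
history = [
    ((0.1, 0.0), 0),
    ((0.2, 0.0), 0),
    ((0.9, 1.0), 1),
    ((0.8, 1.0), 1),
]

weights, bias = train_neuron(history)
print(fraud_probability((0.15, 0.0), weights, bias))  # small local purchase
print(fraud_probability((0.85, 1.0), weights, bias))  # large foreign purchase
```

After training, the small local purchase scores below 0.5 and the large foreign one scores above it. A production system would use many more features and layers, but the premise is the same: learn the pattern from labeled examples instead of writing it by hand.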

Two weeks ago, my credit card company called me while I was in England. Within the same week I had used the card to buy plane tickets in New York and subway cards in London, and to make several large purchases in Germany. A combination of neural-network pattern matching and rules-based purchase patterns contributed to the company calling to ensure the card wasn’t stolen.

Bayesian Networks

Bayesian networks are currently used to help determine if something is spam, but they’re based on keyword tokens and heuristics surrounding the tokens (how close together they appear, where they appear, etc.), not on concepts. Bayesian networks, like neural networks, would do a terrific job if inputs were concept-based instead of keyword-based.
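The token-based approach can be illustrated with a naive Bayes classifier, which is assumed here as the simplest form of Bayesian filter; real products differ in detail. The training messages and function names are hypothetical.

```python
import math
from collections import Counter

def tokenize(text):
    return text.lower().split()

def train(messages):
    """Count tokens per class. `messages` is a list of (text, label) pairs."""
    token_counts = {"spam": Counter(), "ham": Counter()}
    message_counts = Counter()
    for text, label in messages:
        message_counts[label] += 1
        token_counts[label].update(tokenize(text))
    return token_counts, message_counts

def spam_log_odds(text, token_counts, message_counts):
    """Log odds that `text` is spam; positive leans spam, negative leans ham."""
    vocab = set(token_counts["spam"]) | set(token_counts["ham"])
    score = math.log(message_counts["spam"] / message_counts["ham"])
    for label, sign in (("spam", 1), ("ham", -1)):
        total = sum(token_counts[label].values())
        for token in tokenize(text):
            # Add-one smoothing, with one extra slot for unseen tokens.
            p = (token_counts[label][token] + 1) / (total + len(vocab) + 1)
            score += sign * math.log(p)
    return score

# Hypothetical training mail.
training = [
    ("cheap viagra online", "spam"),
    ("buy viagra now", "spam"),
    ("consolidate your debt now", "spam"),
    ("meeting agenda for monday", "ham"),
    ("proposal draft attached for review", "ham"),
    ("lunch on monday", "ham"),
]
token_counts, message_counts = train(training)

plain = spam_log_odds("buy viagra", token_counts, message_counts)
disguised = spam_log_odds("buy v_i_a_g_r_a", token_counts, message_counts)
print(plain > disguised)  # True: the disguised token carries far less evidence
```

The classifier has learned that the token "viagra" signals spam, but "v_i_a_g_r_a" is just an unseen token to it. Nothing in the model connects the two spellings to the same concept.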

Putting It All Together

Combined with concept fingerprint technologies, neural networks can be trained to identify which concepts are likely spam. As new email arrives, the content abstraction system can send the concept to the neural network, then to the rules-based system. These can weed out unwanted email.
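The flow described above can be sketched end to end. Everything here is a toy stand-in: the pattern table takes the place of a real content abstraction engine, the weight table takes the place of a trained neural network, and the trusted-sender rule, threshold, names, and addresses are hypothetical.

```python
import re

# Stand-in for a concept engine: surface patterns mapped to concept labels.
CONCEPT_PATTERNS = {
    "drug_sale": [r"v[\W_]*i[\W_]*a[\W_]*g[\W_]*r[\W_]*a", r"little blue pill"],
    "credit_offer": [r"credit card offer", r"consolidate your debt"],
    "business_proposal": [r"proposal", r"please review"],
}

# Stand-in for a trained network: how spam-like each concept is.
SPAM_WEIGHTS = {"drug_sale": 0.9, "credit_offer": 0.8, "business_proposal": 0.1}

SPAM_THRESHOLD = 0.5  # hypothetical cutoff

def extract_concepts(text):
    """Return the set of concept labels whose patterns match the text."""
    lowered = text.lower()
    return {
        concept
        for concept, patterns in CONCEPT_PATTERNS.items()
        if any(re.search(pattern, lowered) for pattern in patterns)
    }

def classify(sender, text, trusted_senders):
    """Concept extraction, then scoring, then a rules-based override."""
    concepts = extract_concepts(text)
    score = max((SPAM_WEIGHTS.get(c, 0.0) for c in concepts), default=0.0)
    if sender in trusted_senders:  # rule: never quarantine known contacts
        return "deliver"
    return "quarantine" if score >= SPAM_THRESHOLD else "deliver"

trusted = {"boss@partner.example"}
print(classify("client@corp.example",
               "Please review the attached proposal on the client's debt.",
               trusted))
print(classify("x@bulk.example", "Cheap V_i_a_g_r_a here", trusted))
print(classify("y@bulk.example", "Try that little blue pill today", trusted))
```

The proposal that mentions "debt" is delivered, while both drug pitches are quarantined regardless of spelling. In a real system the hard part is the first stage, since a hand-written pattern table is only a placeholder for genuine concept understanding.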

Obviously, spam is far from over. Keyword-based rules engines and heuristic models to identify spam aren’t as effective as we’d hoped. We must use different technologies that identify spam based on message concept, not just its use of words. Using concept fingerprints to identify spam will help eradicate much more spam.

No matter what they call that little blue pill, they’re still trying to sell it to you.

What are your thoughts? Let me know!

Until next time…

Jack
