Scott on Writing

Musings on technical writing...

Filtering Spam at the ISP Level

As spam has spiraled into a pandemic problem, many folks have come up with many ways to reduce the sheer volume of spam individuals receive.  Six months ago, or so, I wrote about a Challenge/Response Spam Blocking System I wrote to help tide the deluge of spam I was receiving.  Since then I have moved to a Bayesian approach, using the Spambayes Outlook Add-In, which I have found to be a wonderous (and free!) product.  Spambayes works so well because it is tailored to the email you receive.  Since I receive a lot of technical emails from listservs and colleagues, especially on ASP.NET, tokens like ASP.NET, ADO.NET, DataSet, loop, method, object, class, etc. are all strong indications that the email's not spam.

Anywho, one approach that many ISPs are taking is to employ universal sam blocking at the ISP level.  I have some reservations about this approach for a couple of reasons. 

  1. It assumes that there's some global spam “signature,” but the spam (and non-spam) one receives, I contend, is personalized.  That is, the probability of you getting particular types of spam likely has to do with factors on how a spammer got ahold of your email.  For example, I get spams from software companies, which I'd wager others who are not in a technical field receive less of.  I'd imagine those who regularly post their email on very high-traffic, entertainment sites, are more likely to get such targetted spam.  Too, companies you volunteerly give your email address to might sell your address, or have it obtained by another company when they go out of business.
  2. Those messages marked as spams never make it to the recepient's inbox.  I know this is the intended approach, but what about false negatives?  That is, what if a legitimate email gets flagged as a spam by the ISP?  It will never reach the recipient's inbox, where, had it, and been marked as spam by, say Spambayes, the recipient would still have an opportunity to find said email in the Junk Email folder and mark it as non-spam.

Now, I have no statistics to back up these claims (namely, that spam is somewhat personalized), but one thing I know doesn't work are universal filters at the ISP level.  Yes, these filters are nice because the computer illiterate doesn't have to concern himself with the details of installing and configuring an anti-spam solution, and, yes, these spam filters might weed out the vast majority of the spams, but at the same time they are likely cutting out a handful of legitimate emails that happen to be ensnared by the anti-spam nets.  This concern exists both from my personal experiences in issuing challenges in a C/R anti-spam system, and also from emails I've received from folks, noting how emails they send from their Web sites are routinely getting blocked up by spam blockers at the ISP level.

I tend to view spam blocking software like the idealization of the American justice system - I'd rather have a few spams get through than have one legit email be marked as a spam.  Clearly, spam blocking at the ISP level does not permit this.

posted on Tuesday, March 09, 2004 11:52 AM

Feedback

# re: Filtering Spam at the ISP Level 3/9/2004 12:01 PM Mike Swaim

What about if you get 50 spams and 5 virus laden emails per legitimate email? My ISP uses Postini for spam blocking, and it lets me see what's caught in/modify the filter. That way I don't get tons of crud in my inbox, but I can still scan the emails that get caught for legitimate email.

# re: Filtering Spam at the ISP Level 3/9/2004 12:14 PM Bryant Likes

I think there is probably a percentage of spam that is personalized, but most spam seemed to be universal. The reason I think this is because I use Spamnet by Cloudmark. It catches almost all of my universal spam and misses the personal spam. I get very little personal spam.

I would feel comfortable with an ISP implemented solution like spamnet since it would elimanate all the universal spam and allow me to deal with my personal spam.

See:
http://www.cloudmark.com

# re: Filtering Spam at the ISP Level 3/9/2004 12:20 PM Scott Mitchell

Bryant, by personalized I meant not that I get a spam that NO ONE else gets; rather, I meant I get a class of spam that, perhaps, tens of thousands of others get, but not the millions of people who get bombarded by spam get.

For example, I would imagine the likelihood of someone in France getting spam written in French is much higher than me receiving such spam. The point is, if a French-speaking individual who posts his email address to French sites, uses, say Earthlink as his ISP and lives in America, the spam guard at Earthlink would likely not catch his French spams. (Again, this is my assumption, no stats to back me up here.)

# re: Filtering Spam at the ISP Level 3/9/2004 12:35 PM Avonelle Lovhaug

I think solutions like Postini are great, since they let you go see the messages that are getting filtered, and adjust it as appropriate.

But I agree that whatever systems some other ISPs are using is not so wonderful. I have a customer right now who can't receive email messages from me. I don't know where the email messages are going - I don't get bounce messages, but he doesn't ever receive my emails. I assume they are being filtered in some way to prevent spam, but it sure doesn't help us much - we're just trying to communicate! For this particular customer, I have to remember to use a different email address.

<rant>
I don't know why the messages are rejected, although I suspect that it is because once, many, many moons ago, we made a small mistake on the Exchange server and accidentally created an open mail relay. Even though the situation has been corrected for months, it is almost impossible to get off the lists of the crazy, nazi-like, anti-spam folks. I admit it - I made a mistake once. I swear, I've paid my dues. I've done my time. My email server isn't relaying anything anymore. In fact, if I'm working off-site, I have to use web mail because even *I* can't send email through my mail server if I'm not at home. So, please, please please! Take me off the bad list!
</rant>

# re: Filtering Spam at the ISP Level 3/9/2004 2:13 PM AndrewSeven

Hi Scott,

Just thought I'd mention that the last feedback I sent seems to never have gotten through.

I recieved some delay/fail notifications.

It wasn't important but you've always replied before.
I'm sending a test feedback just after I post this.

-Andrew

# re: Filtering Spam at the ISP Level 3/12/2004 2:22 PM Scott Mitchell

A scary bit of information: a study showed 19% of opt-in email is blocked by spam filters!
http://www.clickz.com/news/article.php/3325591

There was also a two-part discussion on the failings of anti-spam technologies at SecurityFocus:
http://www.securityfocus.com/infocus/1763
http://securityfocus.com/infocus/1766

Title:  
Name:  
Url:
Protected by Clearscreen.SharpHIPEnter the code you see:
Comments   

My Links

Ads Via DevMavens

Archives

Post Categories

 

I am a Microsoft MVP for ASP.NET.
I am an ASPInsider.
<July 2008>
SMTWTFS
293012345
6789101112
13141516171819
20212223242526
272829303112
3456789

Comment Stats

DayTotal% of Total
Sunday 1896.8%
Monday 39014.0%
Tuesday 46916.8%
Wednesday 51518.5%
Thursday 54419.5%
Friday 50818.2%
Saturday 1706.1%
Total 2785100.0%

Hour1Total% of Total
12:00 AM 682.4%
1:00 AM 712.5%
2:00 AM 632.3%
3:00 AM 752.7%
4:00 AM 572.0%
5:00 AM 1093.9%
6:00 AM 1114.0%
7:00 AM 1615.8%
8:00 AM 1756.3%
9:00 AM 1505.4%
10:00 AM 1736.2%
11:00 AM 1826.5%
12:00 PM 1906.8%
1:00 PM 1766.3%
2:00 PM 1605.7%
3:00 PM 1324.7%
4:00 PM 1124.0%
5:00 PM 983.5%
6:00 PM 913.3%
7:00 PM 993.6%
8:00 PM 853.1%
9:00 PM 802.9%
10:00 PM 833.0%
11:00 PM 843.0%
Total 2785100.0%

Comments by Blog Entry Date/Time

Day Entry MadeAvg.Total
Sunday 5.14144
Monday 5.35353
Tuesday 4.35444
Wednesday 7.58644
Thursday 6.87625
Friday 5.45414
Saturday 5.03161
Total 5.802785

Hour1 Entry MadeAvg.Total
12:00 AM 5.0035
1:00 AM 1.002
5:00 AM 0.000
7:00 AM 7.0035
8:00 AM 5.45109
9:00 AM 6.34279
10:00 AM 6.41250
11:00 AM 4.28184
12:00 PM 6.98342
1:00 PM 2.87112
2:00 PM 5.29222
3:00 PM 8.54299
4:00 PM 3.9190
5:00 PM 5.78156
6:00 PM 4.52113
7:00 PM 9.32177
8:00 PM 9.06154
9:00 PM 5.14113
10:00 PM 6.2381
11:00 PM 4.5732
Total 5.802785

Learn More About Comment Stats
1 - All times GMT -8...


Blog Stats

Favorite Web Sites

My Books

My MSDN Articles