Interesting People mailing list archives

Re: Comcast blocking mail to its customers

From: David Farber <dave () farber net>
Date: Thu, 16 Oct 2008 04:19:34 -0400



Begin forwarded message:

From: Joel Snyder <Joel.Snyder () Opus1 COM>
Date: October 16, 2008 1:39:50 AM EDT
To: dave () farber net
Cc: ip <ip () v2 listbox com>, johnl () iecc com, tilghman () mail jeffandtilghman com
Subject: Re: [IP] Re:       Comcast blocking mail to its customers

John Levine's point is spot-on: ISPs don't have the luxury of buildingtheir services to support that 1% of technology-savvy folks, andanyone who forwards mail has to deal with 'backpressure' from otherISPs when the forward mail that the destination ISP doesn't want toaccept, even if it has already been cleaned of spam! (For example,Qwest won't accept mail if the envelope FROM domain name is not inDNS, which does indeed block spam, but also has a fairly high falsepositive rate.)

But I'd like to respond to Tilghman's point: yes, he's also right (tosummarize, he says "you should spam scan at SMTP time"), BUT...

But the reality is that his approach, while technically superior, isnot a particularly scalable one. Most anti-spam gateways filter forboth spam and viruses, and that takes a lot of CPU time.

Doing reputation-based message refusal (what some folks are callingblacklists or RBLs) at SMTP time is a HUGE benefit and does followTilghman's philosophy of block-at-smtp-time (rather than accept-and-blackhole, which is industry standard practice). However, doing thespam filtering, virus filtering, content analysis, and any necessarymessage splitting, all at SMTP time only really works if you havemassively fast CPUs and a fairly low trickle of a mail stream.

The problem is that mail tends to be bursty, and while the averagearrival rate may be modest enough for whatever appliance you're using,the peaks are both frequent and high. If you start returning 4xxresponses whenever load gets high, you'll have a lot of dissatisfiedusers, because you'll go into resource conservation mode fairlyquickly and frequently. And, given that there is no predictabilityabout when the other side will retry (some in seconds, others inhours), that doesn't work in practice very well.

Some appliance vendors don't follow his advice because they have poorarchitectures and cannot; others don't because they want to sell thecheapest hardware possible to keep their margins up. But some don'tjust because it doesn't work that well in real enterprise mail streams.

Obviously, a middle-ground approach would be to block when you can atSMTP time, and if you start falling behind, then fall back to 'oldschool' and queue the mail for later scanning. There are huge hostedservices that do this, but the anti-spam appliance vendors (whocontrol most of the enterprise commercial market) haven't embracedthat approach, likely for complexity reasons.

Message splitting also complicates the picture. Sometimes a messagewill be destined for many recipients (this is not uncommon in spam)and each will have a different policy for sensitivity and action.Having to figure out what the policy is and then applying differentactions based on whether all recipients are the same or not, all atSMTP time, is another bit that many anti-spam vendors have avoidedchewing.

I think that everyone who is (sane and) active in this space agreesthat Tilghman's approach is the best, but it's easy for us guys whouse and deploy the products to advocate it; I have found it moredifficult to get the developers who write the code to go down thatpath. This doesn't mean that some vendors aren't doing it, butcertainly the dominant appliance vendors are not and probably won't beanytime soon.


jms

David Farber wrote:

Begin forwarded message:
From: Tilghman Lesher <tilghman () mail jeffandtilghman com>
Date: October 15, 2008 7:32:17 PM EDT
To: johnl () iecc com
Cc: dave () farber net
Subject: Re: [IP] Re:      Comcast blocking mail to its customers
On Wednesday 15 October 2008 15:32:50 David Farber wrote:
Begin forwarded message:

From: John Levine <johnl () iecc com>
Date: October 15, 2008 3:12:36 PM EDT
To: dave () farber net
Subject: Re: [IP] Re:      Comcast blocking mail to its customers
My view is that an appropriate AUP for email should be similar to
that of a common carrier or the USPS.  It's a critical service these
days.  Using robotic methods or wholesale IP shutoffs to dump
presumptive spam into the trash is not acceptable for such a
service.
The mail stream that ISPs see is typically 95% spam these days.  That
means 20 spams for every real message, so if they were to accept and
store all the spam, that's more than an order of magnitude increasein
the size and cost of their mail system, which would be passed through
to the customers, most of whom don't want it.  And even if they did,
how much confidence do you have that you could manually sort it
correctly?  I've seen plausible studies that say that mechanical
filters are if anything better than humans at sorting large mail
streams, since mechanical filters' eyes don't glaze over.
I think you missed the part which I consider to be most important,that of
dumping presumptive spam into the trash.  The most correct method of
filtering is to do it at SMTP time and reject the email then, ratherthan
trying to either a) accept all email and bounce the stuff that is
undeliverable (this is arguably what is most wrong with some MTAs,suchas qmail, as it causes the secondary problem of backscatter) or b)acceptingall email and tossing the stuff that a mechanical filter thinks isspam(which means that a sender may never be notified that their messagewas
falsely flagged as spam).
Rejecting at SMTP time guarantees that minimal backscatter bounces are
generated and when an email is rejected as a false positive, thesender has
immediate feedback of the problem and can work to address the issue.
It is really no more computationally expensive than currentoperations (whichhave to scan all email anyway, so they might as well do it at SMTPtime). In
the case of a flood of email causing problems with scanning (the prime
argument against scanning at SMTP time, that it does not scale),that iseasily addressed within the mail protocol, simply by sending a 400-levelerror, indicating a temporary issue, which good MTAs use as anindication totry the delivery again later. Oddly, a 400-level error stops manyspam botsin their tracks, which will never reattempt delivery of the samemessage upon
receiving the first error.


--
Joel M Snyder, 1404 East Lind Road, Tucson, AZ, 85719
Senior Partner, Opus One       Phone: +1 520 324 0494
jms () Opus1 COM                http://www.opus1.com/jms




-------------------------------------------
Archives: https://www.listbox.com/member/archive/247/=now
RSS Feed: https://www.listbox.com/member/archive/rss/247/
Powered by Listbox: http://www.listbox.com

Current thread:

Comcast blocking mail to its customers David Farber (Oct 14)
- <Possible follow-ups>
- Re: Comcast blocking mail to its customers David Farber (Oct 14)
- Re: Comcast blocking mail to its customers David Farber (Oct 14)
- Re: Comcast blocking mail to its customers David Farber (Oct 15)
- Re: Comcast blocking mail to its customers David Farber (Oct 15)
- Re: Comcast blocking mail to its customers David Farber (Oct 15)
- Re: Comcast blocking mail to its customers David Farber (Oct 15)
- Re: Comcast blocking mail to its customers David Farber (Oct 16)
- Re: Comcast blocking mail to its customers David Farber (Oct 16)
- Re: Comcast blocking mail to its customers David Farber (Oct 16)
- Re: Comcast blocking mail to its customers David Farber (Oct 16)
- Re: Comcast blocking mail to its customers David Farber (Oct 16)
- Re: Comcast blocking mail to its customers David Farber (Oct 17)