Interesting People mailing list archives

The Future of Internet Immune Systems

From: David Farber <dfarber () cs cmu edu>
Date: Wed, 21 Nov 2007 18:23:14 -0500



Begin forwarded message:

From: dewayne () warpspeed com (Dewayne Hendricks)
Date: November 20, 2007 6:56:29 PM EST
To: Dewayne-Net Technology List <xyzzy () warpspeed com>
Subject: [Dewayne-Net] The Future of Internet Immune Systems

The Future of Internet Immune Systems
Written by Cory Doctorow
11/19/2007

<http://www.internetevolution.com/author.asp?section_id=479&doc_id=139358&;>

Bunhill Cemetery is just down the road from my flat in London. It’s a handsome old boneyard, a former plague pit (“Bone hill” -- as in, there are so many bones under there that the ground is actually kind of humped up into a hill). There are plenty of luminaries buried there -- John “Pilgrim’s Progress” Bunyan, William Blake, Daniel Defoe, and assorted Cromwells. But my favorite tomb is that of Thomas Bayes, the 18th-century statistician for whom Bayesian filtering is named.

Bayesian filtering is plenty useful. Here’s a simple example of how you might use a Bayesian filter. First, get a giant load of non-spam emails and feed them into a Bayesian program that counts how many times each word in their vocabulary appears, producing a statistical breakdown of the word-frequency in good emails.

Then, point the filter at a giant load of spam (if you’re having a hard time getting a hold of one, I have plenty to spare), and count the words in it. Now, for each new message that arrives in your inbox, have the filter count the relative word-frequencies and make a statistical prediction about whether the new message is spam or not (there are plenty of wrinkles in this formula, but this is the general idea).

The beauty of this approach is that you needn’t dream up “The Big Exhaustive List of Words and Phrases That Indicate a Message Is/Is Not Spam.” The filter naively calculates a statistical fingerprint for spam and not-spam, and checks the new messages against them.
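
The word-counting procedure described above can be sketched in a few lines of Python. This is a minimal illustration, not the filter any particular mail system uses: the class name WordFrequencyFilter, the tokenizer, and the add-one smoothing are all assumptions made for the example, and real filters add many of the "wrinkles" the article mentions.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Split a message into lowercase word tokens (a deliberately crude tokenizer)."""
    return re.findall(r"[a-z0-9']+", text.lower())


class WordFrequencyFilter:
    """Hypothetical naive Bayes filter: word counts for spam and non-spam ("ham")."""

    def __init__(self):
        self.counts = {"spam": Counter(), "ham": Counter()}
        self.messages = {"spam": 0, "ham": 0}

    def train(self, label, text):
        """Add one labeled message ("spam" or "ham") to the word-frequency tables."""
        self.counts[label].update(tokenize(text))
        self.messages[label] += 1

    def spam_probability(self, text):
        """Estimate P(spam | words). Assumes at least one message of each kind was trained."""
        vocabulary = set(self.counts["spam"]) | set(self.counts["ham"])
        total_messages = sum(self.messages.values())
        words = tokenize(text)
        log_score = {}
        for label in ("spam", "ham"):
            total_words = sum(self.counts[label].values())
            # Start from the prior: how common this kind of message is overall.
            score = math.log(self.messages[label] / total_messages)
            for word in words:
                # Add-one smoothing (an assumption) keeps unseen words from zeroing the score.
                score += math.log(
                    (self.counts[label][word] + 1) / (total_words + len(vocabulary))
                )
            log_score[label] = score
        # Turn the two log scores into a probability; clamp to avoid overflow in exp().
        diff = max(min(log_score["ham"] - log_score["spam"], 700.0), -700.0)
        return 1.0 / (1.0 + math.exp(diff))


spam_filter = WordFrequencyFilter()
spam_filter.train("ham", "meeting tomorrow at noon lunch")
spam_filter.train("ham", "see you at lunch tomorrow")
spam_filter.train("spam", "cheap pills buy now")
spam_filter.train("spam", "buy cheap watches now")

likely_spam = spam_filter.spam_probability("buy cheap pills")
likely_ham = spam_filter.spam_probability("lunch tomorrow")
```

Note that nobody wrote a rule saying "pills" is a spam word; the score falls out of the two word-frequency tables alone, which is the point the article is making.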

This approach -- and similar ones -- are evolving into an immune system for the Internet, and like all immune systems, a little bit goes a long way, and too much makes you break out in hives.

ISPs are loading up their network centers with intrusion detection systems and tripwires that are supposed to stop attacks before they happen. For example, there’s the filter at the hotel I once stayed at in Jacksonville, Fla. Five minutes after I logged in, the network locked me out again. After an hour on the phone with tech support, it transpired that the network had noticed that the videogame I was playing systematically polled the other hosts on the network to check if they were running servers that I could join and play on. The network decided that this was a malicious port-scan and that it had better kick me off before I did anything naughty.

It only took five minutes for the software to lock me out, but it took well over an hour to find someone in tech support who understood what had happened and could reset the router so that I could get back online.
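
How the hotel's system actually worked is not stated, but a simple threshold heuristic of the following kind would produce exactly this false positive. The sketch below is purely hypothetical: the class name ScanDetector, the host limit, and the time window are assumptions chosen for illustration.

```python
from collections import defaultdict


class ScanDetector:
    """Hypothetical tripwire: flag a source that contacts too many hosts in a short window."""

    def __init__(self, max_hosts=20, window_seconds=60.0):
        self.max_hosts = max_hosts
        self.window_seconds = window_seconds
        # source address -> {destination address: time of most recent contact}
        self.seen = defaultdict(dict)

    def observe(self, source, destination, timestamp):
        """Record one connection attempt; return True if the source now looks like a scanner."""
        recent = self.seen[source]
        recent[destination] = timestamp
        # Forget destinations last contacted before the start of the window.
        cutoff = timestamp - self.window_seconds
        for stale in [d for d, t in recent.items() if t < cutoff]:
            del recent[stale]
        return len(recent) > self.max_hosts


detector = ScanDetector(max_hosts=3, window_seconds=60.0)

# A game client polling five neighbors for servers, one per second.
game_flags = [
    detector.observe("10.0.0.5", f"10.0.0.{host}", float(second))
    for second, host in enumerate(range(10, 15))
]

# An ordinary client that only talks to two hosts.
browser_flags = [
    detector.observe("10.0.0.6", "10.0.0.1", 0.0),
    detector.observe("10.0.0.6", "10.0.0.2", 1.0),
]
```

The detector sees only "one source, many destinations, short time" and cannot tell a game's server browser from a hostile scan, so the game client is flagged as soon as it passes the limit.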


[snip]

