IDS mailing list archives

Re: Intrusion Detection Evaluation Datasets


From: "Stuart Staniford" <sstaniford () FireEye com>
Date: Thu, 12 Mar 2009 15:55:04 -0700


On Mar 12, 2009, at 8:40 AM, Zow Terry Brugger wrote:

I see a lot of people saying (correctly) that advanced (non-signature
based) NIDS can't be researched until we have good evaluation
datasets, and I see a lot of people ignoring them and doing it anyway.
Is anyone (else) actually working on fixing the data problem?

There are a number of things about the framing of this discussion that bug me (I come at this from the perspective of having spent quite a bit of time on both the research and the commercial sides of the field).

For one, the nature of the intrusion detection problem is very dynamic. Ten-plus years ago, the biggest problem was interactive attacks. Five years ago, the biggest headache for organizations was automated random-scanning worms. Today, RS worms have become much less of a big deal, and most of the action is attacks on clients, primarily via the web, and the resulting remote control of systems via bots. These are very different problems requiring pretty different approaches. And in another five years, I'm sure the main problem will be something else again. So the main nuisances on the wire keep changing, and any dataset is necessarily going to get stale very quickly. In particular, quite a lot of staleness will accumulate between a hypothetical graduate student starting and finishing a thesis.

Secondly, I think there's an assumption lurking implicitly in the search for datasets: that the appropriate focus for research is the inference algorithm, much like in the machine learning community - take a fixed data set, then try all kinds of inference algorithms to see what works best. For our problem, I don't think that's a great way of doing things. For us, the main questions are "What are the bad guys doing now?" and "What features do we need to detect what they are now doing?" Usually, if you have good features with high discrimination, most algorithms can be tweaked to do OK. If you don't have good features, no inference algorithm will save you. And if you have good features today, they'll be a lot less useful in a couple of years, and new ones will have to be invented.
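The features-over-algorithms point can be illustrated with a small, self-contained sketch. The data here is entirely hypothetical (synthetic Gaussian "connections", not anything from real traffic): with a feature that cleanly separates attack from benign, even a one-line threshold rule does well; with an overlapping feature, no choice of threshold helps much.

```python
import random

random.seed(0)

# Hypothetical synthetic "connections": label 1 = attack, 0 = benign.
# good_feature separates the classes well; bad_feature barely does.
def sample(label):
    good = random.gauss(10.0 if label else 2.0, 1.0)  # high discrimination
    bad = random.gauss(5.2 if label else 5.0, 2.0)    # low discrimination
    return good, bad, label

data = [sample(label) for label in [0, 1] * 500]

def best_threshold_accuracy(values_labels):
    """Try every observed value as a threshold; return the best accuracy."""
    best = 0.0
    for t, _ in values_labels:
        acc = sum((v > t) == bool(y) for v, y in values_labels) / len(values_labels)
        best = max(best, acc)
    return best

good_acc = best_threshold_accuracy([(g, y) for g, b, y in data])
bad_acc = best_threshold_accuracy([(b, y) for g, b, y in data])
print(f"good feature: {good_acc:.2f}, bad feature: {bad_acc:.2f}")
```

The simplest possible "algorithm" gets near-perfect accuracy on the discriminative feature and hovers near chance on the weak one; swapping in a fancier classifier changes neither outcome much.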

I think there's a lot of contribution that researchers can continue to make in this field. But you can't think of it as discovering timeless principles or something - this is much too applied a field. It's about figuring out what's happening on the wire *now*, and what can be done about it.

So forget looking for a dataset. Look for a wire. Do whatever it takes to get your institution to let you sniff the egress link - it's just about guaranteed to have plenty of attacks on it. Build, or adapt, some software to look at the packets with respect to some problem that interests you and that seems like a currently rising challenge. Spend a lot of time manually poring over the packets to figure out what is going on, and label your own data. You need to get your hands dirty. If you look at the most influential, highly cited researchers (Todd Heberlein, Vern Paxson, etc., etc.), their influential contributions were always driven off actually trying to detect attacks on real networks. In the end, intrusion detection is about detecting intrusions, just like the name says. Any amount of theoretical or algorithmic sophistication is a waste of time unless it directly contributes to that goal, and no amount of sophistication will be very exciting if it only improves the detection of five-year-old attacks (this is not to say that technical sophistication is not required for current problems - I believe it is).
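Getting started on a wire doesn't require much machinery: capture with tcpdump, then pick the records apart yourself. A minimal sketch of parsing a classic pcap capture, assuming the common little-endian, microsecond-timestamp variant (the function and variable names here are mine, for illustration only):

```python
import struct

PCAP_MAGIC_LE = 0xA1B2C3D4  # little-endian classic pcap, microsecond timestamps

def parse_pcap(data: bytes):
    """Split a classic pcap byte string into (global header fields, packet records)."""
    magic, vmaj, vmin, _tz, _sigfigs, snaplen, linktype = struct.unpack("<IHHiIII", data[:24])
    if magic != PCAP_MAGIC_LE:
        raise ValueError("not a little-endian classic pcap capture")
    offset, packets = 24, []
    while offset + 16 <= len(data):
        ts_sec, ts_usec, incl_len, _orig_len = struct.unpack("<IIII", data[offset:offset + 16])
        offset += 16
        packets.append((ts_sec, ts_usec, data[offset:offset + incl_len]))
        offset += incl_len
    return (vmaj, vmin, snaplen, linktype), packets

# Build a tiny synthetic capture (one empty Ethernet frame) to exercise the parser.
header = struct.pack("<IHHiIII", PCAP_MAGIC_LE, 2, 4, 0, 0, 65535, 1)
frame = bytes(14)  # placeholder for an Ethernet header
record = struct.pack("<IIII", 0, 0, len(frame), len(frame))
(_, _, snaplen, linktype), pkts = parse_pcap(header + record + frame)
```

From there, hand-labeling means walking `pkts` record by record and decoding the Ethernet/IP/TCP layers with further `struct.unpack` calls - which is exactly the "get your hands dirty" work described above.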

I think the problem of producing regular timely datasets that can be safely published is probably just about intractable, even if one of the funding agencies were to step up to try and fill the shoes DARPA long ago left behind. Synthetic datasets would not be that interesting, and since most attacks are now inside packet content, the challenge of reliably anonymizing the data while not affecting the traffic materially would be just about impossible (what algorithm is going to sanitize every single web developer's cookie format, for example? How could one be sure that obfuscated javascript didn't contain any personal information?).

Stuart Staniford.


