tcpdump mailing list archives

Re: bpf/pcap performance

From: Guy Harris <guy () alum mit edu>
Date: Tue, 13 Apr 2004 15:42:43 -0700


On Apr 13, 2004, at 3:18 PM, Darren Reed wrote:

Ok, now that I read it, I find it tries to do too much (in my
opinion.) The part that makes it go off and read more data is
of questionable value ?

I.e., the fact that it - like "pcap_next()" - will fill the packetbuffer if it's empty?

As for non-blocking more...

How exactly is pcap_next_ex() meant to work in non-blocking mode ?

In a fashion similar to the way "pcap_next()" does - if no packets areavailable, it should return 0, so the documentation for a return valueof 0 should indicate that.

Looking more closely at the non-blocking side of things, there seems
to be no use of either p->nonblock or p->timeout apart from in
pcap_setnonblock() ?

"p->nonblock" is there for the benefit of, among other things,"pcap-linux.c" if, as, and when it implements mmaped-capture support -"reading" isn't done with a "read()" or "recvfrom()" in that case, soit needs to know whether, if the mmapped buffer is empty, it should doa "select()" to wait for a packet or should just return "no packets".


I need to look at the "p->timeout" issue.

Maybe pcap_next_ex() should also return -2
if "p->cc == 0" prior to calling p->read_op if nonblock is true ?

No, -2 is an "end-of-file, don't bother calling 'pcap_next_ex()' everagain, there will never be another packet" return. 0 means "no packetsnow, but try again later".

Another thought that came to me on "how it might work", going back
to the quick API that I wrote up, if I could open a pcap file and
mmap it in,

...which you can do only if you have enough address space; what wouldbe best here would be a way in which a window could be mapped in, sothat really large capture files (which people *do* have and *do*process) can still be handled.

I think the current design (pcap_dispatch(), etc) is what's led
to strange API hooks like pcap_breakloop() appearing, when I think
a better design could have side stepped the need for them O:-)

Well, "pcap_breakloop()" also works around the workaround :-) for the"don't choke when we get ptraced" problem, i.e. that the implementationon some platforms treats EINTR not as an indication that it should quitbut as an indication that it should try reading again. I don't knowwhether an EINTR gets delivered if you're debugging a libpcap-basedapplication, but, if that *is* a problem and that workaround *is*necessary, some scheme is needed to let a signal handler cause the readloop to terminate.

There's pcap.h with pcap_pkthdr and pcap-int.h with pcap_sf_pkthdr.
At present, applications are meant to include pcap.h and usepcap_pkthdr.
What should happen is that internally, pcap_int_pkthdr

Presumably meaning "pcap_sf_pkthdr", or possibly a renamed"pcap_sf_pkthdr"?

should be used by libpcap


I.e., used in capture files?

and pcap_pkthdr should be used in pcap_dump() and when reading pcapdump files.


I.e., used in APIs?

That's what we do now - we have "pcap_pkthdr", used in API's, and"pcap_sf_pkthdr", used in capture files.

But that would only make sense if the pcap file format was actually
considered to be a public interface.  Is it?

I call it quasi-public - you can write code that reads and writes it(in fact, I have - the code in Ethereal), but

1) you shouldn't do so merely out of ignorance of the existence oflibpcap (and Net::Pcap and the like)

and

2) you should be prepared for us to come up with a new capture fileformat which your code won't be able to read but that the next versionof libpcap *will* be able to read (without the applications having tochange, as long as they ignore all the new capabilities of the new fileformat).

If not, I'd really like
you to consider making it a public interface.- which I think is the
other, large, ongoing email conversation here is about.

Once the new format is defined and implemented, we should have a"pcap(5)" (or "pcap(4)", or whatever - that's one problem with that manpage, different flavors of UN*X have different conventions for whichsection is to have file formats :-( ) man page that documents the newformat; it can also document the current format.

It just dawned on me that you might want a universal file format that
uses particular sizes for things and have pcap_pkthdr use what ever
happens to be the native size and that this fits the current design
and implementation.


Yes, that's what I think is the right answer.

So we're stuck having to build an API that has to be able to work on
top of a broken kernel interface implementation.  Fun.  NOT.

The only way *not* to ever be so stuck is to abandon libpcap supportfor anything other than Linux and BSD (and *maybe* Windows, if none ofthe underlying limitations get in the way of the WinPcap drivers), andeven that might require supporting only shiny new versions or supplyingkernel patches. We don't control all the mechanisms atop which libpcapis implemented.


-
This is the tcpdump-workers list.
Visit https://lists.sandelman.ca/ to unsubscribe.

Current thread:

Re: bpf/pcap performance Guy Harris (Apr 12)
- Re: bpf/pcap performance Darren Reed (Apr 12)
  - Re: bpf/pcap performance Guy Harris (Apr 12)
    - Re: bpf/pcap performance Guy Harris (Apr 12)
    - Re: bpf/pcap performance Guy Harris (Apr 12)
    - Re: bpf/pcap performance Darren Reed (Apr 13)
    - Re: bpf/pcap performance Guy Harris (Apr 13)
- <Possible follow-ups>
- Re: bpf/pcap performance Michael Richardson (Apr 12)
- Re: bpf/pcap performance Guy Harris (Apr 13)