Wireshark · Wireshark-dev: Re: [Wireshark-dev] heuristic Dissector for Dummies

Wireshark-dev: Re: [Wireshark-dev] heuristic Dissector for Dummies

From: Ulf Lamping <ulf.lamping@xxxxxx>

Date: Fri, 29 Aug 2008 20:42:40 +0200

Tom Stevens schrieb:

Hello!
Is there a simple tutorial on the web where i can find some informationabout how to write a heuristic dissector.
http://www.wireshark.org/docs/wsdg_html_chunked/ChapterDissection.html-> On this side i couldn't find anything about heuristic dissectors.
May you recommend a code snipet, where i can learn how to write aheuristic dissector by my own.
Where and how can i define the rules (pattern) that wireshark needs tofind the corresponding dissector?To what points do I have to pay particular attention when i write such adissector?

Ok, I'll try to give you some ideas about a heuristic dissector.


Why heuristic dissectors?
-------------------------

When Wireshark "receives" a packet, it has to find the right dissectorto start decoding the packet data. Often this can be done by knownconventions, e.g. the Ethernet type 0x800 means "IP on top of Ethernet"- an easy and reliable match for Wireshark.

Unfortunately, these conventions are not always available, or(accidentially or knowingly) some protocols don't care about thoseconventions and "reuse" existing "magic numbers / tokens".

For example TCP defines port 80 only for the use of HTTP traffic. But,this convention doesn't prevent anyone from using TCP port 80 for somedifferent protocol, or on the other hand using HTTP on a port numberdifferent to 80.

To solve this problem, Wireshark introduced the so called heuristicdissector mechanism to try to deal with these problems.


How Wireshark uses heuristic dissectors?
----------------------------------------

While Wireshark starts, heuristic dissectors (HD) register themselvesslightly different than "normal" dissectors, e.g. a HD can ask for anyTCP packet, as it *may* contain interesting packet data for thisdissector. In reality more than one HD will exist for e.g. TCP packet data.

So if Wireshark has to decode TCP packet data, it will first try to finda dissector registered directly for the TCP port used in that packet. Ifit finds such a registered dissector it will just hand over the packetdata to it.

In case there is no such "normal" dissector, WS will hand over thepacket data to the first matching HD. Now the HD will look into the dataand decide if that data looks like the dissector "is interested in". Thereturn value signals WS if the HD processed the data (so WS can stopworking on that packet) or the heuristic didn't matched (so WS tries thenext HD until one matches - or the data simply can't be processed).


How do these heuristics work?
-----------------------------

Difficult to give a general answer here. The usual heuristic works asfollows:

A HD looks into the first few packet bytes and search for commonpatterns that are specific to the protocol in question. Most protocolsstarts with a specific header, so a specific pattern may look like(synthetic example):

1) first byte must be 0x42

2) second byte is a type field and only can contain values between 0x20- 0x333) third byte is a flag field, where the lower 4 bits always contain thevalue 04) fourth and fifth bytes contains a 16 length field, where the valuecan't be longer than 10000 bytes

So the heuristic dissector will check incoming packet data for all ofthe 4 above conditions, and only if all of the four conditions are truethere is a good chance that the packet really contains the expectedprotocol - and the dissector continues to decode the packet data. If onecondition fails, it's very certainly not the protocol in question andthe dissector returns to WS immediately "this is not my protocol" -maybe some other heuristic dissector is interested!

Obviously, this is *not* 100% bullet proof, but the best we can offer toour users here - and improving the heuristic is always possible if itturns out that it's not good enough to distinguish between two givenprotocols.


Regards, ULFL

Follow-Ups:
- Re: [Wireshark-dev] heuristic Dissector for Dummies
  - From: Peter Johansson

References:
- [Wireshark-dev] heuristic Dissector for Dummies
  - From: Tom Stevens

Prev by Date: Re: [Wireshark-dev] Patch to support decoding LANforge packets.
Next by Date: Re: [Wireshark-dev] heuristic Dissector for Dummies
Previous by thread: [Wireshark-dev] heuristic Dissector for Dummies
Next by thread: Re: [Wireshark-dev] heuristic Dissector for Dummies
Index(es):
- Date
- Thread