Skip to main content

Python multi-engine PCAP analyse kit.

Project description

PyPCAPKit

  The pcapkit project is an open source Python program focus on PCAP parsing and analysis, which works as a stream PCAP file extractor. With support of dictdumper, it shall support multiple output report formats.

Note that the whole project supports Python 3.4 or later.


About

pcapkit is an independent open source library, using only dictdumper as its formatted output dumper.

There is a project called jspcapy works on pcapkit, which is a command line tool for PCAP extraction but now DEPRECATED.

  Unlike popular PCAP file extractors, such as Scapy, dpkt, pyshark, and etc, pcapkit uses streaming strategy to read input files. That is to read frame by frame, decrease occupation on memory, as well as enhance efficiency in some way.

Module Structure

  In pcapkit, all files can be described as following six parts.

  • Interface (pcapkit.interface) -- user interface for the pcapkit library, which standardise and simplify the usage of this library
  • Foundation (pcapkit.foundation) -- synthesise file I/O and protocol analysis, coordinate information exchange in all network layers
  • Reassembly (pcapkit.reassembly) -- base on algorithms described in RFC 815, implement datagram reassembly of IP and TCP packets
  • IPSuite (pcapkit.ipsuite) -- collection of constructors for Internet Protocol Suite
  • Protocols (pcapkit.protocols) -- collection of all protocol family, with detail implementation and methods
  • Utilities (pcapkit.utilities) -- collection of four utility functions and classes
  • CoreKit (pcapkit.corekit) -- core utilities for pcapkit implementation
  • ToolKit (pcapkit.toolkit) -- capability tools for pcapkit implementation
  • DumpKit (pcapkit.dumpkit) -- dump utilities for pcapkit implementation

Engine Comparison

  Besides, due to complexity of pcapkit, its extraction procedure takes around 0.01 seconds per packet, which is not ideal enough. Thus, pcapkit introduced alternative extraction engines to accelerate this procedure. By now, pcapkit supports Scapy, DPKT, and PyShark. Plus, pcapkit supports two strategies of multiprocessing (server & pipeline). For more information, please refer to the document.

Engine Performance (seconds per packet)
dpkt 0.0003609057267506917
scapy 0.002443440357844035
default 0.017523006995519
pipeline 0.014550424114863079
server 0.04667099356651306
pyshark 0.0792640733718872

 

Installation

Note that pcapkit supports Python versions since 3.4

  Simply run the following to install the current version from PyPI:

pip install pypcapkit

  Or install the latest version from the git repository:

git clone https://github.com/JarryShaw/PyPCAPKit.git
cd pypcapkit
pip install -e .
# and to update at any time
git pull

  And since pcapkit supports various extraction engines, and extensive plug-in functions, you may want to install the optional ones:

# for DPKT only
pip install pypcapkit[DPKT]
# for Scapy only
pip install pypcapkit[Scapy]
# for PyShark only
pip install pypcapkit[PyShark]
# and to install all the optional packages
pip install pypcapkit[all]
# or to do this explicitly
pip install pypcapkit dpkt scapy pyshark

 

Usage

Documentation

Interfaces

NAME DESCRIPTION
extract extract a PCAP file
analyse analyse application layer packets
reassemble reassemble fragmented datagrams
trace trace TCP packet flows

Macros

Formats
NAME DESCRIPTION
JSON JavaScript Object Notation (JSON) format
PLIST macOS Property List (PLIST) format
TREE Tree-View text format
PCAP PCAP format
Layers
NAME DESCRIPTION
RAW no specific layer
LINK data-link layer
INET internet layer
TRANS transport layer
APP application layer
Engines
NAME DESCRIPTION
PCAPKit the default engine
MPServer the multiprocessing engine with server process strategy
MPPipeline the multiprocessing engine with pipeline strategy
DPKT the DPKT engine
Scapy the Scapy engine
PyShark the PyShark engine

Protocols

NAME DESCRIPTION
NoPayload No-Payload
Raw Raw Packet Data
ARP Address Resolution Protocol
Ethernet Ethernet Protocol
L2TP Layer Two Tunnelling Protocol
OSPF Open Shortest Path First
RARP Reverse Address Resolution Protocol
VLAN 802.1Q Customer VLAN Tag Type
AH Authentication Header
HIP Host Identity Protocol
HOPOPT IPv6 Hop-by-Hop Options
IP Internet Protocol
IPsec Internet Protocol Security
IPv4 Internet Protocol version 4
IPv6 Internet Protocol version 6
IPv6_Frag Fragment Header for IPv6
IPv6_Opts Destination Options for IPv6
IPv6_Route Routing Header for IPv6
IPX Internetwork Packet Exchange
MH Mobility Header
TCP Transmission Control Protocol
UDP User Datagram Protocol
HTTP Hypertext Transfer Protocol

  Documentation can be found in submodules of pcapkit. Or, you may find usage sample in the test folder. For further information, please refer to the source code -- the docstrings should help you :)

ps: help function in Python should always help you out.

CLI Usage

The following part was originally described in jspcapy, which is now deprecated and merged into this repository.

  As it shows in the help manual, it is quite easy to use:

$ pcapkit --help
usage: pcapkit [-h] [-V] [-o file-name] [-f format] [-j] [-p] [-t] [-a] [-v]
               [-F] [-E PKG] [-P PROTOCOL] [-L LAYER]
               input-file-name

PCAP file extractor and formatted exporter

positional arguments:
  input-file-name       The name of input pcap file. If ".pcap" omits, it will
                        be automatically appended.

optional arguments:
  -h, --help            show this help message and exit
  -V, --version         show program's version number and exit
  -o file-name, --output file-name
                        The name of input pcap file. If format extension
                        omits, it will be automatically appended.
  -f format, --format format
                        Print a extraction report in the specified output
                        format. Available are all formats supported by
                        dictdumper, e.g.: json, plist, and tree.
  -j, --json            Display extraction report as json. This will yield
                        "raw" output that may be used by external tools. This
                        option overrides all other options.
  -p, --plist           Display extraction report as macOS Property List
                        (plist). This will yield "raw" output that may be used
                        by external tools. This option overrides all other
                        options.
  -t, --tree            Display extraction report as tree view text. This will
                        yield "raw" output that may be used by external tools.
                        This option overrides all other options.
  -a, --auto-extension  If output file extension omits, append automatically.
  -v, --verbose         Show more information.
  -F, --files           Split each frame into different files.
  -E PKG, --engine PKG  Indicate extraction engine. Note that except default
                        engine, all other engines need support of corresponding
                        packages.
  -P PROTOCOL, --protocol PROTOCOL
                        Indicate extraction stops after which protocol.
  -L LAYER, --layer LAYER
                        Indicate extract frames until which layer.

  Under most circumstances, you should indicate the name of input PCAP file (extension may omit) and at least, output format (json, plist, or tree). Once format unspecified, the name of output file must have proper extension (*.json, *.plist, or *.txt), otherwise FormatError will raise.

  As for verbose mode, detailed information will print while extraction (as following examples). And auto-extension flag works for the output file, to indicate whether extensions should be appended.

 

Samples

Usage Samples

  As described in test folder, pcapkit is quite easy to use, with simply three verbs as its main interface. Several scenarios are shown as below.

  • extract a PCAP file and dump the result to a specific file (with no reassembly)

    import pcapkit
    # dump to a PLIST file with no frame storage (property frame disabled)
    plist = pcapkit.extract(fin='in.pcap', fout='out.plist', format='plist', store=False)
    # dump to a JSON file with no extension auto-complete
    json = pcapkit.extract(fin='in.cap', fout='out.json', format='json', extension=False)
    # dump to a folder with each tree-view text file per frame
    tree = pcapkit.extract(fin='in.pcap', fout='out', format='tree', files=True)
    
  • extract a PCAP file and fetch IP packet (both IPv4 and IPv6) from a frame (with no output file)

    >>> import pcapkit
    >>> extraction = pcapkit.extract(fin='in.pcap', nofile=True)
    >>> frame0 = extraction.frame[0]
    # check if IP in this frame, otherwise ProtocolNotFound will be raised
    >>> flag = pcapkit.IP in frame0
    >>> tcp = frame0[pcapkit.IP] if flag else None
    
  • extract a PCAP file and reassemble TCP payload (with no output file nor frame storage)

    import pcapkit
    # set strict to make sure full reassembly
    extraction = pcapkit.extract(fin='in.pcap', store=False, nofile=True, tcp=True, strict=True)
    # print extracted packet if HTTP in reassembled payloads
    for packet in extraction.reassembly.tcp:
        for reassembly in packet.packets:
            if pcapkit.HTTP in reassembly.protochain:
                print(reassembly.info)
    

CLI Samples

  The CLI (command line interface) of pcapkit has two different access.

  • through console scripts -- use command name pcapkit [...] directly (as shown in samples)
  • through Python module -- python -m pypcapkit [...] works exactly the same as above

Here are some usage samples:

  • export to a macOS Property List (Xcode has special support for this format)
$ pcapkit in --format plist --verbose
🚨Loading file 'in.pcap'
 - Frame   1: Ethernet:IPv6:ICMPv6
 - Frame   2: Ethernet:IPv6:ICMPv6
 - Frame   3: Ethernet:IPv4:TCP
 - Frame   4: Ethernet:IPv4:TCP
 - Frame   5: Ethernet:IPv4:TCP
 - Frame   6: Ethernet:IPv4:UDP
🍺Report file stored in 'out.plist'
  • export to a JSON file (with no format specified)
$ pcapkit in --output out.json --verbose
🚨Loading file 'in.pcap'
 - Frame   1: Ethernet:IPv6:ICMPv6
 - Frame   2: Ethernet:IPv6:ICMPv6
 - Frame   3: Ethernet:IPv4:TCP
 - Frame   4: Ethernet:IPv4:TCP
 - Frame   5: Ethernet:IPv4:TCP
 - Frame   6: Ethernet:IPv4:UDP
🍺Report file stored in 'out.json'
  • export to a text tree view file (without extension autocorrect)
$ pcapkit in --output out --format tree --verbose
🚨Loading file 'in.pcap'
 - Frame   1: Ethernet:IPv6:ICMPv6
 - Frame   2: Ethernet:IPv6:ICMPv6
 - Frame   3: Ethernet:IPv4:TCP
 - Frame   4: Ethernet:IPv4:TCP
 - Frame   5: Ethernet:IPv4:TCP
 - Frame   6: Ethernet:IPv4:UDP
🍺Report file stored in 'out'

 

TODO

  • specify Raw packet
  • interface verbs
  • review docstrings
  • merge jspcapy
  • write documentation
  • implement IP and MAC address containers
  • implement option list extractors
  • implement more protocols

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pypcapkit-0.12.10.post2.tar.gz (175.0 kB view details)

Uploaded Source

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

pypcapkit-0.12.10.post2-py3.7.egg (508.1 kB view details)

Uploaded Egg

pypcapkit-0.12.10.post2-py3.6.egg (508.1 kB view details)

Uploaded Egg

pypcapkit-0.12.10.post2-py3.5.egg (382.0 kB view details)

Uploaded Egg

pypcapkit-0.12.10.post2-py3.4.egg (382.3 kB view details)

Uploaded Egg

pypcapkit-0.12.10.post2-pp35-none-macosx_10_14_x86_64.whl (256.6 kB view details)

Uploaded PyPymacOS 10.14+ x86-64

pypcapkit-0.12.10.post2-cp37-none-macosx_10_14_x86_64.whl (256.6 kB view details)

Uploaded CPython 3.7macOS 10.14+ x86-64

pypcapkit-0.12.10.post2-cp36-none-macosx_10_14_x86_64.whl (256.6 kB view details)

Uploaded CPython 3.6macOS 10.14+ x86-64

pypcapkit-0.12.10.post2-cp35-none-macosx_10_14_x86_64.whl (256.6 kB view details)

Uploaded CPython 3.5macOS 10.14+ x86-64

pypcapkit-0.12.10.post2-cp34-none-macosx_10_14_x86_64.whl (256.6 kB view details)

Uploaded CPython 3.4macOS 10.14+ x86-64

File details

Details for the file pypcapkit-0.12.10.post2.tar.gz.

File metadata

  • Download URL: pypcapkit-0.12.10.post2.tar.gz
  • Upload date:
  • Size: 175.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.12.1 pkginfo/1.4.2 requests/2.19.1 setuptools/40.4.3 requests-toolbelt/0.8.0 tqdm/4.26.0 CPython/3.7.1

File hashes

Hashes for pypcapkit-0.12.10.post2.tar.gz
Algorithm Hash digest
SHA256 0cea0bf3a1911ae24cdc4d092770ce750a1100bf38fc4597907d046c473d2354
MD5 b2691eab1c47af015570042c73c2bc6c
BLAKE2b-256 c0d0ea6d5c65a15f39a8a30df84c4ede77725a3ed10582a96a6a1ac15171cd19

See more details on using hashes here.

File details

Details for the file pypcapkit-0.12.10.post2-py3.7.egg.

File metadata

  • Download URL: pypcapkit-0.12.10.post2-py3.7.egg
  • Upload date:
  • Size: 508.1 kB
  • Tags: Egg
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.12.1 pkginfo/1.4.2 requests/2.19.1 setuptools/40.4.3 requests-toolbelt/0.8.0 tqdm/4.26.0 CPython/3.7.1

File hashes

Hashes for pypcapkit-0.12.10.post2-py3.7.egg
Algorithm Hash digest
SHA256 fa383b3189772fc9728a3b512473ec4aec14d4e1e21a8b1d7029c783ee12461e
MD5 34ee84e8fb5a88644af2ad457e2a7355
BLAKE2b-256 3a0eac023d144242974029546d16bf30565e7c3d7c503c44428cb3dc3cbcdca6

See more details on using hashes here.

File details

Details for the file pypcapkit-0.12.10.post2-py3.6.egg.

File metadata

  • Download URL: pypcapkit-0.12.10.post2-py3.6.egg
  • Upload date:
  • Size: 508.1 kB
  • Tags: Egg
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.12.1 pkginfo/1.4.2 requests/2.19.1 setuptools/40.4.3 requests-toolbelt/0.8.0 tqdm/4.26.0 CPython/3.7.1

File hashes

Hashes for pypcapkit-0.12.10.post2-py3.6.egg
Algorithm Hash digest
SHA256 60df715670ab66a6b08df0ff7a353fc965874f75860769b21553838823d41633
MD5 952165d03b1476162ff172d838183f2b
BLAKE2b-256 0389ea85bba55b064a21f7dffb26fd9a37335ccb04435fb762ed9b3e7fcd3c34

See more details on using hashes here.

File details

Details for the file pypcapkit-0.12.10.post2-py3.5.egg.

File metadata

  • Download URL: pypcapkit-0.12.10.post2-py3.5.egg
  • Upload date:
  • Size: 382.0 kB
  • Tags: Egg
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.12.1 pkginfo/1.4.2 requests/2.19.1 setuptools/40.4.3 requests-toolbelt/0.8.0 tqdm/4.26.0 CPython/3.7.1

File hashes

Hashes for pypcapkit-0.12.10.post2-py3.5.egg
Algorithm Hash digest
SHA256 4382ce5c7c07e69a59889ac894410b72d8fbd6b844f1292c60c4146f6b8ebcde
MD5 f5108ab9bc597fd3387a237411e2bc66
BLAKE2b-256 0c3c91d497f525181f7e0818f10408689ab2d2ff55f0355e8aee46f62a94a7d4

See more details on using hashes here.

File details

Details for the file pypcapkit-0.12.10.post2-py3.4.egg.

File metadata

  • Download URL: pypcapkit-0.12.10.post2-py3.4.egg
  • Upload date:
  • Size: 382.3 kB
  • Tags: Egg
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.12.1 pkginfo/1.4.2 requests/2.19.1 setuptools/40.4.3 requests-toolbelt/0.8.0 tqdm/4.26.0 CPython/3.7.1

File hashes

Hashes for pypcapkit-0.12.10.post2-py3.4.egg
Algorithm Hash digest
SHA256 2030c314aae805e7b3d2342402be4e52ee12843ca08b05339d824acb1ba0df8a
MD5 b3b03afd4e5f25f66ba9ff8d360e43bf
BLAKE2b-256 923c8cabd9091ef4111b19a26b52c689920a4edf54952340bc467be235f1429c

See more details on using hashes here.

File details

Details for the file pypcapkit-0.12.10.post2-pp35-none-macosx_10_14_x86_64.whl.

File metadata

  • Download URL: pypcapkit-0.12.10.post2-pp35-none-macosx_10_14_x86_64.whl
  • Upload date:
  • Size: 256.6 kB
  • Tags: PyPy, macOS 10.14+ x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.12.1 pkginfo/1.4.2 requests/2.19.1 setuptools/40.4.3 requests-toolbelt/0.8.0 tqdm/4.26.0 CPython/3.7.1

File hashes

Hashes for pypcapkit-0.12.10.post2-pp35-none-macosx_10_14_x86_64.whl
Algorithm Hash digest
SHA256 310cb0e4959ffbd624fe2b261dade89ddf2c937681138cac606e7d439e75752c
MD5 cb6206018d817b78b652f5edd07b456d
BLAKE2b-256 1a00a2a518394ab72e4a0c555af37058cfeb4fc5d71a26f4126fbd8870cc3011

See more details on using hashes here.

File details

Details for the file pypcapkit-0.12.10.post2-cp37-none-macosx_10_14_x86_64.whl.

File metadata

  • Download URL: pypcapkit-0.12.10.post2-cp37-none-macosx_10_14_x86_64.whl
  • Upload date:
  • Size: 256.6 kB
  • Tags: CPython 3.7, macOS 10.14+ x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.12.1 pkginfo/1.4.2 requests/2.19.1 setuptools/40.4.3 requests-toolbelt/0.8.0 tqdm/4.26.0 CPython/3.7.1

File hashes

Hashes for pypcapkit-0.12.10.post2-cp37-none-macosx_10_14_x86_64.whl
Algorithm Hash digest
SHA256 9c489fdc667447b571a0df18db5f330af5b9164d77e847ee1c5d4e9930863c1a
MD5 ec316d420f62253b338566a7fbff7b1e
BLAKE2b-256 36f18a958ccae03d7b2134c8c46f980a2a4731030c9104f02f062e300e3b9493

See more details on using hashes here.

File details

Details for the file pypcapkit-0.12.10.post2-cp36-none-macosx_10_14_x86_64.whl.

File metadata

  • Download URL: pypcapkit-0.12.10.post2-cp36-none-macosx_10_14_x86_64.whl
  • Upload date:
  • Size: 256.6 kB
  • Tags: CPython 3.6, macOS 10.14+ x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.12.1 pkginfo/1.4.2 requests/2.19.1 setuptools/40.4.3 requests-toolbelt/0.8.0 tqdm/4.26.0 CPython/3.7.1

File hashes

Hashes for pypcapkit-0.12.10.post2-cp36-none-macosx_10_14_x86_64.whl
Algorithm Hash digest
SHA256 9415cfd1d677cfb7c64a84a7b078a8a49f5da15b44d2600881cd5f0bc0054e17
MD5 0b6956bccbf9686e7335b21482c3eea5
BLAKE2b-256 c9fef67e13c19d6198b14dd6e5face4c1547fe6286088de462e4c23bb2aaca2f

See more details on using hashes here.

File details

Details for the file pypcapkit-0.12.10.post2-cp35-none-macosx_10_14_x86_64.whl.

File metadata

  • Download URL: pypcapkit-0.12.10.post2-cp35-none-macosx_10_14_x86_64.whl
  • Upload date:
  • Size: 256.6 kB
  • Tags: CPython 3.5, macOS 10.14+ x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.12.1 pkginfo/1.4.2 requests/2.19.1 setuptools/40.4.3 requests-toolbelt/0.8.0 tqdm/4.26.0 CPython/3.7.1

File hashes

Hashes for pypcapkit-0.12.10.post2-cp35-none-macosx_10_14_x86_64.whl
Algorithm Hash digest
SHA256 28972a3174d4be871465c124a9eeeb3f7a3782087de3b564a552d59a3dfecfe2
MD5 509c1114091ecba4922e7907c0c5a265
BLAKE2b-256 400f513bd0db9ed87df226d02141a0c27167dfd1345b82eb2906c4cc5485d59f

See more details on using hashes here.

File details

Details for the file pypcapkit-0.12.10.post2-cp34-none-macosx_10_14_x86_64.whl.

File metadata

  • Download URL: pypcapkit-0.12.10.post2-cp34-none-macosx_10_14_x86_64.whl
  • Upload date:
  • Size: 256.6 kB
  • Tags: CPython 3.4, macOS 10.14+ x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.12.1 pkginfo/1.4.2 requests/2.19.1 setuptools/40.4.3 requests-toolbelt/0.8.0 tqdm/4.26.0 CPython/3.7.1

File hashes

Hashes for pypcapkit-0.12.10.post2-cp34-none-macosx_10_14_x86_64.whl
Algorithm Hash digest
SHA256 685ab53845a27b3c8eb2eee0d3e43eb4a5a84aa91ab61339278f08b0cdf7d95e
MD5 a1a40d2d3041d59af767ef4e2a8475d2
BLAKE2b-256 46e60fdf229e2d4cfd82620fdbdb97b228e6d6423423527f363426cd870a836a

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page