Skip to main content

Library for core functionalities of IoT Inspector. This captures packets and stores it in a database for real-time network traffic analysis.

Project description

inspector-core-library

Ruff libinspector_test

Library for core functionalities of IoT Inspector

Installation

To install the libinspector module via pip, use the following command:

pip install libinspector

Usage

Running the Inspector

To run the Inspector, you need to activate the virtual environment first and then run the following command:

sudo $(which python) -m libinspector.core

If you're debugging, you can run the following to reset the database and start the Inspector:

sudo truncate -s 0 inspector.log; sudo rm -f debug_mem_db.db; sudo $(which $(which python)) -m libinspector.core

By default, the traffic is saved in an in-memory SQLite database, so you won't see the data directly. Also, none of the devices are inspected by default. For debugging purposes, you can have libinspector dump the internal SQLite database to disk and inspect (i.e., ARP-spoof) traffic for ALL devices by doing the following:

  1. Create a file libinspector_config.json in the same directory where you run sudo python -m libinspector.core (or where you import libinspector as a part of your package).

  2. Edit this json file to include the following text:

  {
      "use_in_memory_db": false,
      "inspect_every_device_by_default": true
  }
  1. Remove the libinspector_config.json config file, or flip the above values in production.

Embedding in Your Own Python Application

The preferred way to use libinspector is to embed it within your own Python application. You can do this by importing libinspector.core and calling the start_threads() method, which returns almost instantaneously. Your Python script will then need to read the in-memory SQLite database for information about the devices and the network traffic flows.

import time
import libinspector.core
import libinspector.global_state

# This method returns almost instantaneously
libinspector.core.start_threads()

# Make sure to sleep and/or do other work here, such as analyzing the in-memory SQLite database. For example, you can keep printing the device list from the `devices` table.
db_conn, rwlock = libinspector.global_state.db_conn_and_lock

while True:
    with rwlock:
        for device in db_conn.execute('SELECT mac_address, ip_address FROM devices').fetchall():
            print(f'MAC: {device["mac_address"]}, IP: {device["ip_address"]}')
    time.sleep(5)

If you want to add additional packet parsing capabilities, you can specific a custom callback when you start Inspector. Here's an example that prints out the summary of each captured packet:

import libinspector
libinspector.core.start_threads(
  custom_packet_callback_func=lambda pkt: print(f'Packet captured: {pkt.summary()}')
)

Data Schema

The data schema is defined in mem_db.py and includes the following tables:

  • devices: Stores information about devices on the network.

    • mac_address (TEXT, PRIMARY KEY): The MAC address of the device.
    • ip_address (TEXT, NOT NULL): The IP address assigned to the device.
    • is_inspected (INTEGER, DEFAULT 0): Indicates whether the device is being inspected (1) or not (0).
    • is_gateway (INTEGER, DEFAULT 0): Indicates whether the device is a gateway (1) or not (0).
    • updated_ts (INTEGER, DEFAULT 0): The timestamp of the last update.
    • metadata_json (TEXT, DEFAULT '{}'): Additional metadata in JSON format.
  • hostnames: Stores hostnames associated with IP addresses.

    • ip_address (TEXT, PRIMARY KEY): The IP address associated with the hostname.
    • hostname (TEXT, NOT NULL): The hostname of the device.
    • updated_ts (INTEGER, DEFAULT 0): The timestamp of the last update.
    • data_source (TEXT, NOT NULL): The source of the hostname data.
    • metadata_json (TEXT, DEFAULT '{}'): Additional metadata in JSON format.
  • network_flows: Stores information about network flows.

    • timestamp (INTEGER): The timestamp of the network flow.
    • src_ip_address (TEXT): The source IP address of the flow.
    • dest_ip_address (TEXT): The destination IP address of the flow.
    • src_hostname (TEXT): The source hostname of the flow.
    • dest_hostname (TEXT): The destination hostname of the flow.
    • src_mac_address (TEXT): The source MAC address of the flow.
    • dest_mac_address (TEXT): The destination MAC address of the flow.
    • src_port (TEXT): The source port of the flow.
    • dest_port (TEXT): The destination port of the flow.
    • protocol (TEXT): The protocol used in the flow.
    • byte_count (INTEGER, DEFAULT 0): The number of bytes transferred in the flow.
    • packet_count (INTEGER, DEFAULT 0): The number of packets transferred in the flow.
    • metadata_json (TEXT, DEFAULT '{}'): Additional metadata in JSON format.
    • PRIMARY KEY (timestamp, src_mac_address, dest_mac_address, src_ip_address, dest_ip_address, src_port, dest_port, protocol): The composite primary key for the table.

How libinspector Works

The libinspector module works by starting various threads to monitor and inspect network traffic. Here is a high-level overview of the start_threads function in core.py:

  1. Ensure Single Instance: The function first ensures that only one instance of the Inspector core is running.
  2. Initialize Database: It initializes the database by calling mem_db.initialize_db().
  3. Initialize Networking Variables: It enables IP forwarding and updates the network information.
  4. Start Threads: It starts several threads to perform various tasks:
    • Update network info from the OS every 60 seconds.
    • Discover devices on the network every 10 seconds.
    • Collect and process packets from the network.
    • Spoof internet traffic.
    • Start the mDNS and UPnP scanner threads.

Testing and Development

To test locally, run these commands:

python3 -m venv .venv
source .venv/bin/activate
pip install --upgrade pip
pip install .

Notes

TODO:

  • Create more test cases to obtain higher code coverage.

Contributing

Contributions are welcome! Please feel free to submit a pull request or open an issue.

License

This project is licensed under the MIT License. See the LICENSE file for details.

Contact

Ask Prof. Danny Y. Huang (dhuang@nyu.edu) or Andrew Quijano (andrew.quijano@nyu.edu).

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

libinspector-1.0.12-py3-none-any.whl (6.7 MB view details)

Uploaded Python 3

File details

Details for the file libinspector-1.0.12-py3-none-any.whl.

File metadata

  • Download URL: libinspector-1.0.12-py3-none-any.whl
  • Upload date:
  • Size: 6.7 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for libinspector-1.0.12-py3-none-any.whl
Algorithm Hash digest
SHA256 7592ad4f1bec60175fa45e2744845b9a439d2d4ffd026b116109ab32de9cb9a8
MD5 432711edcedad201202f5363324ec4fa
BLAKE2b-256 0d1f71dfe42da4ac0c74e700a0d417fc72ae10f2e76a7199b7829fd9068d9c95

See more details on using hashes here.

Provenance

The following attestation bundles were made for libinspector-1.0.12-py3-none-any.whl:

Publisher: deploy_wheel.yml on nyu-mlab/inspector-core-library

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page