Skip to main content

Gateway facilitating asynchronous communication between sensory data-emitting devices, InfluxDB and the user.

Project description

PyPI version Total Downloads PyPI - Downloads

async-httpd-data-collector

Note: This is an older university project and is not actively maintained. It works as-is, but is not extensively tested.

A Python library that acts as an asynchronous gateway between IoT sensor devices and InfluxDB. It fetches JSON readings from a device (like a NodeMCU/ESP8266) over HTTP, parses them, and stores them as time-series data. You can then query the data back as pandas DataFrames.

Started as a university project to get hands-on with async Python, asyncio, aiohttp, and time-series databases. The hardware side runs on an arduino-air-state-server - a NodeMCU board with air quality sensors (MQ135, BMP180, DHT22, DS18B20) that exposes readings at an HTTP endpoint.

Installation

pip install async-httpd-data-collector

Quick Start

from ahttpdc.read.database_interface import DatabaseInterface

# define which sensors and parameters to track
sensors = {
    'bmp180': ['altitude', 'pressure', 'seaLevelPressure', 'temperature'],
    'mq135': ['aceton', 'alcohol', 'co', 'co2', 'nh4', 'toulen'],
    'ds18b20': ['temperature'],
    'dht22': ['humidity', 'temperature'],
}

# connect to InfluxDB and the device
interface = DatabaseInterface(
    sensors,
    db_host='localhost',
    db_port=8086,
    db_token='your-influxdb-token',
    db_org='your-org',
    db_bucket='your-bucket',
    srv_ip='192.168.1.100',  # device IP
    srv_port=80,
    handle='circumstances',  # HTTP endpoint on the device
)

# start the background daemon - fetches and stores data continuously
interface.daemon.enable()

# query the last 30 days of data as a DataFrame
df = interface.query_historical('-30d')
print(df.head())

# stop the daemon when done
interface.daemon.disable()

How It Works

NodeMCU device          async-httpd-data-collector            InfluxDB
  (sensors)                                                  (storage)
     |                                                          |
     |--- HTTP GET /circumstances -->  AsyncFetcher             |
     |                                     |                    |
     |                              JSONInfluxParser            |
     |                                     |                    |
     |                              AsyncCollector  -------->   |
     |                                                          |
     |                              AsyncQuery     <--------    |
     |                                     |                    |
     |                               DataParser                 |
     |                                     |                    |
     |                              pandas DataFrame            |

The DatabaseInterface is the main entry point. It manages two things:

  • DataDaemon - a background process (via multiprocessing) that periodically fetches sensor data from the device and stores it in InfluxDB.
  • AsyncQuery - queries InfluxDB and returns results as pandas DataFrames with local timezone-adjusted timestamps.

Querying

# latest reading
df = interface.query_latest()

# last 3 hours
df = interface.query_historical('-3h')

# specific time range
df = interface.query_historical('2024-05-16T00:00:00Z', '2024-05-21T00:00:00Z')

# custom Flux query
df = interface.query_custom_sync('from(bucket:"my-bucket") |> range(start: -1d) |> last()')

Related Projects

  • arduino-air-state-server - the NodeMCU firmware that collects sensor readings and serves them over HTTP
  • air-quality-data-analysis - Jupyter notebooks with data analysis (heatmaps, correlations, anomaly detection) and SARIMAX time-series forecasting on the collected data

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

async_httpd_data_collector-2.0.4.tar.gz (49.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

async_httpd_data_collector-2.0.4-py3-none-any.whl (37.6 kB view details)

Uploaded Python 3

File details

Details for the file async_httpd_data_collector-2.0.4.tar.gz.

File metadata

File hashes

Hashes for async_httpd_data_collector-2.0.4.tar.gz
Algorithm Hash digest
SHA256 64d555cf508d77cdf7c3a151a2873fcc165b77fc9372e2d0c2f4140ec49738d9
MD5 6ae8c298b902dae82e82fffa3f2c1940
BLAKE2b-256 7afae073294f27ca6469a9f35af6b8c0dcb2c41981e43caa929e3cd1b7dd2ba8

See more details on using hashes here.

File details

Details for the file async_httpd_data_collector-2.0.4-py3-none-any.whl.

File metadata

File hashes

Hashes for async_httpd_data_collector-2.0.4-py3-none-any.whl
Algorithm Hash digest
SHA256 73220ad275e3421d6666f835b8754b65a53f166aaeae1c7a889df65a84ad7260
MD5 c91f6d9852711dac57366a81fd0cf5b3
BLAKE2b-256 cdffb1ccbcbca86e4cefde57201c95cd33f4757b26d4f7eb40f834d25f5ba03c

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page