Skip to main content

Extract individual fields from lines in Apache access logs

Project description

GitHub PyPI PyPI - Status GitHub last commit GitHub issues PyPI - Downloads GitHub repo size PyPI - Python Version


Dinobox Logo

Features

The centerpiece of the parser201 module is the LogParser class. The class initializer takes a single line from an Apache access log file and extracts the individual fields into attributes within an object.

Installation

pip3 install parser201

Usage

The most common use-case for parser201 is importing individual lines from an Apache access log file and creating LogParser objects, like this:

from parser201 import LogParser

with open('access.log', 'r') as f:
    for line in f:
        lp = LogParser(line)
        # Use lp as desired: add to List, Dictionary, etc.

Documentation

See: parser201 Documentation.

Version History

  • 1.5.1 (2024-09-23)
    • Migrated packaging and build system to uv, and code formatting and linting to ruff.
    • Improved exception handling for invalid date-time objects.
    • Migrated documentation generation to pdoc.
    • Code and documentation linting.

  • 1.5.0 (2024-01-27)
    • Cleaned up packaging for better PEP561 compliance.
    • Cleaned up type hints.
    • Dropped support for converting timestamps to local machine time. Processing local timezones across multiple architectures and operating systems is a bit of a hot mess in Python right now. There's just too much variability with regard to OS Settings, location, daylight savings time, etc. The performance of this feature was spotty at best. There is still support for the original timezone and converstion to UTC.

  • 1.4.1 (2023-06-22)
    • Migrated code formatter to black.

  • 1.4.0 (2023-04-30)
    • Strengthened regular expression parsing to handle log lines that contain a wider array of malicious attacks.
    • Added support for access logs that contain both IPv4 and IPv6 addresses.
    • Minimum supported Python version is now 3.8 (^3.8).
    • Miscellaneous optimizations.

  • 1.3.1 (2022-10-22)
    • Migrated dependency/build management to poetry.

  • 1.3.0 (2022-08-13)
    • Implemented __eq__ magic method for the LogParser class. You can now perform equality checks on two LogParser objects.
    • Added test cases for __eq__
    • Migrated task runner to make
    • Documentation cleanup
    • Code linting and cleanup

  • 1.2.0 (2022-07-17)
    • Implemented __eq__ magic methods in the FMT and TZ classes.
    • Documentation cleanup.
    • Testing improvements and pyproject.toml adjustments for better pytest compatability.
    • Code linting and cleanup.

  • 1.1.5 (2022-01-17)
    • Code linting and cleanup.

  • 1.1.4 (2021-12-23)
    • Documentation cleanup.

  • 1.1.3 (2021-12-19)
    • Make file tuning.
    • Documentation cleanup.
    • Added site logo to README.md.

  • 1.1.0 (2021-11-13)
    • Implemented selectable timestamp conversion options {original, local, UTC}.
    • Implemented selectable formatting options for timestamp attribute {string, date_obj}.
    • Migrated API reference to GitHub pages.
    • Code cleanup.

  • 1.0.2 (2021-11-05)
    • Documentation cleanup.

  • 1.0.0 (2021-11-04)
    • Stable production release.
    • Migrated to a new development framework.
    • Implemented more robust and compartmentalized test cases.
    • Code tuning.

  • 0.2.0 (2021-10-31)
    • Changed behavior to gracefully fail for any malformed input line. If an input line cannot be successfully parsed, all attributes of the returned object are set to None and no messages are printed.
    • Added additional pytest cases to verify failure behavior.

  • 0.1.9 (2021-09-15)
    • Code cleanup for pep8 compliance.
    • Cleaned up Makefiles and scripts to remove references to python (meaning python2) and replace it with python3.

  • 0.1.7 (2021-06-05)
    • Re-tooled testing scripts to use parameterized test data, and conduct more robust testing.

  • 0.1.6 (2020-12-19)
    • Addressed exception handling for initializer input not being a valid string data type.
    • Documentation cleanup.

  • 0.1.5 (2020-10-26)
    • Enabled automatic deployment of tagged releases to pypi from travis using encrypted token.
    • Converted references to the master branch in the git repository to main across the documentation set.
    • Documentation cleanup.

  • 0.1.4 (2020-10-24)
    • Initial pypi release.
    • Fixed test file filtering issue in .gitignore.
    • Dependency fix for travis tests.

  • 0.1.1 (2020-10-22)
    • Follow-on testing on test.pypi.org.

  • 0.1.0 (2020-10-18)
    • Initial testing on test.pypi.org.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

parser201-1.5.1.tar.gz (8.7 kB view details)

Uploaded Source

Built Distribution

parser201-1.5.1-py3-none-any.whl (8.7 kB view details)

Uploaded Python 3

File details

Details for the file parser201-1.5.1.tar.gz.

File metadata

  • Download URL: parser201-1.5.1.tar.gz
  • Upload date:
  • Size: 8.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.8.20

File hashes

Hashes for parser201-1.5.1.tar.gz
Algorithm Hash digest
SHA256 ca0a0503e46eca467c940a531d79f58709858a929478dfba31cd211c737033e6
MD5 3310be1a5b0a82b257df574ffa4dec00
BLAKE2b-256 3212546287ea624f2eb41e0cc280294b06de43c16ac789cc99b34e3b145ac177

See more details on using hashes here.

File details

Details for the file parser201-1.5.1-py3-none-any.whl.

File metadata

  • Download URL: parser201-1.5.1-py3-none-any.whl
  • Upload date:
  • Size: 8.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.8.20

File hashes

Hashes for parser201-1.5.1-py3-none-any.whl
Algorithm Hash digest
SHA256 e801a53828b73633fc5d30836403b993a346097649d83948b2a19d8ab31b26ff
MD5 66978cef9d10e810813286fd5c57795d
BLAKE2b-256 63afa05c17c3caeeaa99692142f20532ac8e8872545e606c95281dc4e126cd5f

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page