Skip to main content

Functions required by the access-logs-local-driver

Project description

# Access Logs Driver

Load the content of gzipped Apache HTTP log files Exclude bots, scrapers, etc., select URLs matching the provided regex(es), and generate a CSV of the relevant log entries.

Take postprocessed logs and strip out multiple hits in sessions, and resolve URLs to the chosen URI_SCHEME (e.g. info:doi).

We strip out entries where the same (IP address * user agent) pair has accessed a URL within the last SESSION_TIMEOUT (e.g. half-hour)

Additionally, we convert the URLs to ISBNs and collate request data by date, outputting a CSV for ingest via the stats system.

Release Notes: [0.0.5] - 2023-07-03

Changed:
  • Added start_date and end_date for searching in the log files

  • Added the measure_uri to the result

Release Notes: [0.0.4] - 2023-07-31

Changed:
  • Update file structure and name of the driver

Release Notes: [0.0.3] - 2023-07-25

Changed:
  • Update requirements

  • Update using a pyproject.toml file as well as the new deployment structure

[0.0.2] - 2023-07-11

Added:
  • Unittests

Changed:
  • Moved the files out of the package and get the file’s data as parameters and return the filtered data.

  • renamed the plugin to access-logs-local

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

access_logs_local-0.0.5.tar.gz (10.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

access_logs_local-0.0.5-py3-none-any.whl (7.0 kB view details)

Uploaded Python 3

File details

Details for the file access_logs_local-0.0.5.tar.gz.

File metadata

  • Download URL: access_logs_local-0.0.5.tar.gz
  • Upload date:
  • Size: 10.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.10.6

File hashes

Hashes for access_logs_local-0.0.5.tar.gz
Algorithm Hash digest
SHA256 e94ddb22bcd177af2d96c59dc342d2d9f842f3aba4689f34c4be3ab487028321
MD5 c8aa1ae41e8c1e9931a931aeef979dd4
BLAKE2b-256 71fe3e3e9e40e8c8c2c81f757def406da3db0b4c6585150ccac49f3b1092d0a0

See more details on using hashes here.

File details

Details for the file access_logs_local-0.0.5-py3-none-any.whl.

File metadata

File hashes

Hashes for access_logs_local-0.0.5-py3-none-any.whl
Algorithm Hash digest
SHA256 fe1c6acd5fc4304fe72d53fd91159d3d903620d458d489f5fc71de05d3020e25
MD5 174dfd1a54034072a829364f97b1e6eb
BLAKE2b-256 2fab0af0367e66bf57797c0651a19d8e866437c769e7f39c101de0b7a2c2bc07

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page