Functions required by the access-logs-local-driver
Project description
# Access Logs Driver
Load the content of gzipped Apache HTTP log files Exclude bots, scrapers, etc., select URLs matching the provided regex(es), and generate a CSV of the relevant log entries.
Take postprocessed logs and strip out multiple hits in sessions, and resolve URLs to the chosen URI_SCHEME (e.g. info:doi).
We strip out entries where the same (IP address * user agent) pair has accessed a URL within the last SESSION_TIMEOUT (e.g. half-hour)
Additionally, we convert the URLs to ISBNs and collate request data by date, outputting a CSV for ingest via the stats system.
Release Notes: [0.0.3] - 2023-07-25
- Changed:
Update requirements
Update using a pyproject.toml file as well as the new deployment structure
[0.0.2] - 2023-07-11
- Added:
Unittests
- Changed:
Moved the files out of the package and get the file’s data as parameters and return the filtered data.
renamed the plugin to access-logs-local
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for access_logs_local-0.0.3-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 32ba6298051ee1fd16d17f129577c38524a76312cc05d17e9382cf9005a1da43 |
|
MD5 | bb694c6158bd6b7fab28a865b0140ba8 |
|
BLAKE2b-256 | 7cae10c574f5287fc513f25034474554d9079d4235e3f3c05cefb82e292693b5 |