Skip to main content

Scraper for ALLRIS

Project description

Apache License 2.0 Python 3.7 and 3.8 version 0.5.8

This scraper offers both public and private scraping. The latter requires your username and password and performs the following tasks for you:

  • login

  • download of all agendas and motions related to upcoming meetings of committees and plenary sessions

    • Only considers meetings where you already have been invited formally through ALLRIS.

IMPORTANT: All districts are supported but official committee abbreviations will only work for Eimsbüttel as of now.

The public scraper ought to be used with care as it accesses all accessible pages of an entire month. Currently, June 2020 is hardcoded and it can only be used in a GUI environment.

Requirements

Initial setup

  1. Install ALLRIS scraper pip install twomartens.allrisscraper (you need Python 3.7+)

  2. Run ALLRIS scraper a first time tm-allrisscraper (creates config ini in your current working directory)

  3. Fill out the config file with your login credentials and an absolute path on your system to store PDFs of files

Configuration

[Default]
; possible values for district: Altona, Bergedorf, Eimsbüttel, Hamburg-Nord,
; Hamburg-Mitte, Harburg, Wandsbek
district = Eimsbüttel
; if you are not from Eimsbüttel your domain ending will differ
username = max.mustermann@eimsbuettel.de
; password is stored in clear text, therefore ini file should have most
; restrictive read permissions
password = VerySecurePassword
; location for storage of PDFs (trailing slash is IMPORTANT)
pdflocation = /path/to/storage/of/PDFs/
; location of the firefox binary
firefoxBinary = /Pfad/zur/firefox.exe
; location of the geckodriver binary
geckodriver = /Pfad/zum/geckodriver

Usage after initial setup

Run ALLRIS scraper: tm-allrisscraper (takes a few seconds to finish)

In the specified location for download you will find the following structure:

  • YYYY-MM-DD_Abbreviation of committee or plenary session/ (one directory for each meeting)

  • files inside the directory: Einladung.pdf (contains invitation), Mappe.pdf (contains all motions in one document), and Tagesordnung.pdf (agenda)

Packaging new versions

Activate virtual environment and run python -m build --sdist --wheel

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

twomartens.allrisscraper-0.5.8.tar.gz (21.0 kB view details)

Uploaded Source

Built Distribution

twomartens.allrisscraper-0.5.8-py3-none-any.whl (21.8 kB view details)

Uploaded Python 3

File details

Details for the file twomartens.allrisscraper-0.5.8.tar.gz.

File metadata

  • Download URL: twomartens.allrisscraper-0.5.8.tar.gz
  • Upload date:
  • Size: 21.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.3.0 pkginfo/1.7.0 requests/2.25.1 setuptools/44.0.0 requests-toolbelt/0.9.1 tqdm/4.56.0 CPython/3.8.5

File hashes

Hashes for twomartens.allrisscraper-0.5.8.tar.gz
Algorithm Hash digest
SHA256 48d5e28b85c56134967776537d93ad98b182a540cc5bbb6d82c844ea49acf0b9
MD5 a60130e658b82d9a380b9e8931e00d01
BLAKE2b-256 3e8ed412e566c3809525052673f543da093b4e4f54f86dc2112b610f5686df4e

See more details on using hashes here.

File details

Details for the file twomartens.allrisscraper-0.5.8-py3-none-any.whl.

File metadata

  • Download URL: twomartens.allrisscraper-0.5.8-py3-none-any.whl
  • Upload date:
  • Size: 21.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.3.0 pkginfo/1.7.0 requests/2.25.1 setuptools/44.0.0 requests-toolbelt/0.9.1 tqdm/4.56.0 CPython/3.8.5

File hashes

Hashes for twomartens.allrisscraper-0.5.8-py3-none-any.whl
Algorithm Hash digest
SHA256 82b949ec3fd34ab2994be64510eee7fb8ea863119cf2f95f8a5140bb7f45dea8
MD5 fe0791d0471a4f25c097126cc7866b0c
BLAKE2b-256 3faa34bb39b5eef36c1093ef28bcf4c002964c80ffbde14aa7843389fd3dc79f

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page