Skip to main content

Simple application for downloading financial statements from czech business register.

Project description

Task

Create terminal application for downloading financial statements from czech business register (https://or.justice.cz/) and extracting information from them and storing them in structured form.

Input is a file with ICOs, one ICO per line.

At first the application downloads pdfs with financial statements. In the next step it extracts information specified in config file and stores them in structured form.

The application must allow downloading documents of large number of companies, while respecting information system limits (https://or.justice.cz/ias/ui/podminky).

Installation

  • pip install financial-statements-downloader

Commands

  • import_icos <filepath> - imports ICOs from file

  • download - downloads information about subjects and documents

  • extract - extract information from financial statements

Config

[downloader]
base_url = https://or.justice.cz/ias/ui/  # information system url
search_url = rejstrik-$firma?jenPlatne=VSECHNY&ico=  #search url
limit_day = 3000  # download limit
download_extract = true  # download information from extract
download_documents = true  # download documents
documents_type = závěrka  # document type
documents_dir = documents  # directory for downloaded documents

[parser]  # example of specification extracted information
vlastni_kapital = Vlastní kapitál
cizi_zdroje = CizĂ­ zdroje

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

financial-statements-downloader-0.1.1.tar.gz (702.6 kB view details)

Uploaded Source

File details

Details for the file financial-statements-downloader-0.1.1.tar.gz.

File metadata

File hashes

Hashes for financial-statements-downloader-0.1.1.tar.gz
Algorithm Hash digest
SHA256 49f104a376c48dd5521a5b496b5dc6a5996f0c9434698d5b989867baeb4dc2fd
MD5 9167592bdaad709cc1554976e5e63aa5
BLAKE2b-256 c9ce06cea9f2b14597984c50f25ab9f96037cf77422ba7031e83545f6d722104

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page