Simple application for downloading financial statements from czech business register.
Project description
Task
Create terminal application for downloading financial statements from czech business register (https://or.justice.cz/) and extracting information from them and storing them in structured form.
Input is a file with ICOs, one ICO per line.
At first the application downloads pdfs with financial statements. In the next step it extracts information specified in config file and stores them in structured form.
The application must allow downloading documents of large number of companies, while respecting information system limits (https://or.justice.cz/ias/ui/podminky).
Installation
pip install financial-statements-downloader
Commands
import_icos <filepath> - imports ICOs from file
download - downloads information about subjects and documents
extract - extract information from financial statements
Config
[downloader] base_url = https://or.justice.cz/ias/ui/ # information system url search_url = rejstrik-$firma?jenPlatne=VSECHNY&ico= #search url limit_day = 3000 # download limit download_extract = true # download information from extract download_documents = true # download documents documents_type = závěrka # document type documents_dir = documents # directory for downloaded documents [parser] # example of specification extracted information vlastni_kapital = Vlastnà kapitál cizi_zdroje = Cizà zdroje
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
File details
Details for the file financial-statements-downloader-0.1.1.tar.gz
.
File metadata
- Download URL: financial-statements-downloader-0.1.1.tar.gz
- Upload date:
- Size: 702.6 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 49f104a376c48dd5521a5b496b5dc6a5996f0c9434698d5b989867baeb4dc2fd |
|
MD5 | 9167592bdaad709cc1554976e5e63aa5 |
|
BLAKE2b-256 | c9ce06cea9f2b14597984c50f25ab9f96037cf77422ba7031e83545f6d722104 |