PDF parsing for Singaporean banks
Project description
Monopoly is a Python library that converts Singapore bank statement PDFs to CSV using pdftotext
Supported banks:
- Citibank
- DBS
- HSBC
- OCBC
- Standard Chartered
Only credit card statements are supported (for now)
Install
Install dependencies using Homebrew
brew bundle
Clone the repo
git clone https://github.com/benjamin-awd/monopoly.git
Create a virtual environment and install Python dependencies
pyenv virtualenv 3.11.4 monopoly
pyenv shell monopoly
poetry install
Usage
Monopoly can be run as a Python package, allowing you to extract, transform and write bank statements to a CSV file.
To see how Monopoly works, you can run this example
python3 monopoly/examples/single_statement.py
If your PDF is encrypted, you'll have to add the password to a .env file in the root directory, which is automatically read by monopoly
You can use the .env.template and then update values in the .env file
cp .env.template .env
Features
- Monopoly can be run on Google Cloud as a scheduled Cloud Run job.
- PDFs can be unlocked using explicitly defined passwords and/or a masking pattern like ?d?d?d
Current cloud implementation:
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for monopoly_sg-0.1.1-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 0527a92df6f6bc8043f5a5cb4982fe3f11ffb96a0f629e48aa35c3e054a14b2b |
|
MD5 | 51c2678da123cfdcd632756f7eb752bb |
|
BLAKE2b-256 | 68eac834549874df7d0c47a4e11b7b56febf334bcb5d8422a20d6f91cc675b33 |