PDF parsing for Singaporean banks
Monopoly is a Python library that converts PDF statements from Singapore banks to CSV using pdftotext.
Supported banks:
- Citibank
- DBS
- HSBC
- OCBC
- Standard Chartered
Only credit card statements are supported for now.
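To illustrate the general idea of turning pdftotext output into rows, here is a minimal sketch. It is not Monopoly's actual parser: the function names, the `TRANSACTION` pattern and the line layout ("12 Jan  DESCRIPTION  12.34") are assumptions made for the example, and real statements differ from bank to bank. It also assumes the `pdftotext` command-line tool is on your PATH.

```python
import re
import subprocess

# Hypothetical line layout: "12 Jan  GRAB RIDE  12.34" (date, description, amount).
# Real statements vary per bank, so each bank would need its own pattern.
TRANSACTION = re.compile(
    r"^\s*(?P<date>\d{2} [A-Za-z]{3})\s+"
    r"(?P<description>.+?)\s+"
    r"(?P<amount>[\d,]+\.\d{2})\s*$"
)


def pdf_to_text(path: str) -> str:
    """Run the pdftotext CLI, keeping the page layout, and return its stdout."""
    result = subprocess.run(
        ["pdftotext", "-layout", path, "-"],  # "-" sends the text to stdout
        capture_output=True,
        text=True,
        check=True,
    )
    return result.stdout


def parse_transactions(text: str) -> list[dict[str, str]]:
    """Keep only the lines that look like transactions."""
    rows = []
    for line in text.splitlines():
        match = TRANSACTION.match(line)
        if match:
            rows.append(match.groupdict())
    return rows
```

Calling `parse_transactions(pdf_to_text("statement.pdf"))` would then give a list of dictionaries with `date`, `description` and `amount` keys, all still as strings.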
Install
Clone the repo and change into it
git clone https://github.com/benjamin-awd/monopoly.git
cd monopoly
Install dependencies using Homebrew (brew bundle reads the Brewfile in the current directory)
brew bundle
Create a virtual environment and install Python dependencies
pyenv virtualenv 3.11.4 monopoly
pyenv shell monopoly
poetry install
Usage
Monopoly can be run as a Python package, allowing you to extract, transform and write bank statements to a CSV file.
To see how Monopoly works, run this example:
python3 monopoly/examples/single_statement.py
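The transform and write steps mentioned above can be pictured with a small standard-library sketch. This is an illustration only, not the code in the example script: `transform`, `write_csv`, the field names and the input layout are all hypothetical, and the statement year is passed in because short dates such as "12 Jan" do not carry one. Month abbreviations are parsed with `%b`, which assumes an English locale.

```python
import csv
import io
from datetime import datetime
from decimal import Decimal

FIELDS = ["date", "description", "amount"]  # hypothetical column names


def transform(rows, year):
    """Normalise raw string fields into ISO dates and Decimal amounts."""
    cleaned = []
    for row in rows:
        date = datetime.strptime(f"{row['date']} {year}", "%d %b %Y").date()
        cleaned.append({
            "date": date.isoformat(),
            "description": row["description"].strip(),
            # Decimal avoids float rounding on currency values.
            "amount": Decimal(row["amount"].replace(",", "")),
        })
    return cleaned


def write_csv(rows, handle):
    """Write the cleaned rows, with a header, to any file-like object."""
    writer = csv.DictWriter(handle, fieldnames=FIELDS)
    writer.writeheader()
    writer.writerows(rows)


# Usage: write to an in-memory buffer instead of a real file.
raw = [{"date": "12 Jan", "description": " GRAB RIDE ", "amount": "1,204.50"}]
buffer = io.StringIO()
write_csv(transform(raw, year=2023), buffer)
print(buffer.getvalue())
```

Swapping `io.StringIO()` for `open("statement.csv", "w", newline="")` would write the same rows to disk.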
If your PDF is encrypted, add its password to a .env file in the root directory. Monopoly reads this file automatically.
Copy .env.template to .env, then update the values in the .env file:
cp .env.template .env
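For readers unfamiliar with .env files, the sketch below shows roughly how a KEY=VALUE file can be read with the standard library alone. It is not how Monopoly loads its settings, and the variable name `HSBC_PDF_PASSWORD` is a made-up example; check .env.template for the names the library actually expects.

```python
import os
from pathlib import Path


def parse_env(text):
    """Parse KEY=VALUE lines, skipping blank lines and # comments."""
    values = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        # Drop optional surrounding quotes around the value.
        values[key.strip()] = value.strip().strip("'\"")
    return values


def get_password(name, path=".env"):
    """Prefer a real environment variable, then fall back to the .env file."""
    if name in os.environ:
        return os.environ[name]
    env_file = Path(path)
    if env_file.exists():
        return parse_env(env_file.read_text()).get(name)
    return None


# Hypothetical variable name, for illustration only.
password = get_password("HSBC_PDF_PASSWORD")
```

Keeping the .env file out of version control is the usual reason for shipping a template next to it.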
Features
- Monopoly can be run on Google Cloud as a scheduled Cloud Run job.
- PDFs can be unlocked using explicitly defined passwords and/or a masking pattern like ?d?d?d
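The mask ?d?d?d resembles hashcat-style masks, in which ?d stands for a single digit. Assuming that reading, the sketch below shows how such a mask expands into candidate passwords. The `CHARSETS` table, `expand_mask` and `find_password` are hypothetical names for this example and do not describe Monopoly's own unlocking code; `is_correct` stands in for whatever check actually tries a password against the PDF.

```python
import itertools
import string

# Assumed mask alphabet, borrowed from hashcat-style masks.
CHARSETS = {
    "d": string.digits,
    "l": string.ascii_lowercase,
    "u": string.ascii_uppercase,
}


def expand_mask(mask):
    """Yield every candidate string for a mask such as 'S?d?d?d'."""
    slots = []
    i = 0
    while i < len(mask):
        if mask[i] == "?" and i + 1 < len(mask) and mask[i + 1] in CHARSETS:
            slots.append(CHARSETS[mask[i + 1]])  # placeholder: many choices
            i += 2
        else:
            slots.append(mask[i])  # literal character: one choice
            i += 1
    for combo in itertools.product(*slots):
        yield "".join(combo)


def find_password(mask, is_correct):
    """Return the first candidate accepted by is_correct, or None."""
    return next((c for c in expand_mask(mask) if is_correct(c)), None)
```

A mask of three digits yields only 1,000 candidates, so combining a known prefix with a short mask keeps the search small.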
[Figure: current cloud implementation]