Parses 1099 tax document (1099-B, 1099-INT, 1099-DIV) from PDF into CSV format and performs simple analysis
Project description
๐ 1099 Parser
Note: Dividend features are in beta. I haven't written tests yet. Use the
--analysis-report
flag to get more details about how decisions are made and double check my work.
This project converts standard 1099 tax documents (validated on Robinhood and Wealthfront documents so far) from PDF to CSV file. This tool will be helpful for those who need every transaction in a spreadsheet format for tax reporting purposes. After parsing the tax documents, it will perform some simple analysis on the holding period of securities which reported qualifying dividends.
Original Work
Copyright (c) 2023 Andrew Wells (ajwells@uchicago.com)
Original Author
Many kudos to original author Keun Park (kevin.park1217@gmail.com), whose work I used as a foundation upon which to expand.
๐ Running Locally
Prerequisites
Make sure you have the following installed on your computer.
- Latest Python 3
Must be version 3.6 or higher - [Windows Only] Build Tools for Visual Studio 2019
In the installer, select- C++ build tools
- the Windows 10 SDK
- the latest version of MSVC v142 x64/x86 build tools.
Installing
$ python -m pip install wheel
$ python -m pip install --upgrade parse_1099
Note: If commands above fail, try replacing python
with python3
Running
$ parse_1099
usage: parse_1099 [-h] --pdf FILE [--csv FILE] [--silent] [--validate] [--disable-dividend-analysis] [--analysis-report]
Example and Validation
Set the --validate
flag to print out total values for some columns. Make sure these values match with the PDF!
$ parse_1099 --pdf consolidated_1099.pdf --validate
Pages: 100%|โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ| 40/40 [00:03<00:00, 10.41it/s]
>>> Calculated Totals:
Make sure the values matches with the PDF totals!
proceeds: $77,521.03, cost: $80,902.05, wash_sales_loss: $3,733.41, gain_loss: $352.39
>>> Saved to output.csv
๐ Issues and Bugs
If you have any issues with the tool, please open a GitHub Issue with as much as detail as you can provide.
Development
Structure
This is the first python module I've worked on, so the structure may be a little goofy. I tried to structure the project into two parsers, one for 1099-B Proceeds from Broker and Barter Exchange Transactions and one for 1099-DIV Detail for Dividends and Distributions.
In each case, I designed for extensibility via versioning, in case the structure of the data or the presentation of the data on the PDF ever changes. Hypothetically the correct version of the subparser could be detected, but there's no sense writing that functionality until the scenario arises.
Building
The following will build the python wheel file into the dist
folder. Note that python
and pip
can be exchanged with python3
and pip3
depending on your environment configuration.
$ python -m build
The following will update the installed module requiring a version bump.
$ pip install dist/parse_1099-X.Y.Z-...whl --force-reinstall --no-deps
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file parse_1099-2.1.1.tar.gz
.
File metadata
- Download URL: parse_1099-2.1.1.tar.gz
- Upload date:
- Size: 33.3 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.8.16
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 151046ad26d2968947ff5aae1d54ae7f7e99e5bc8602957c520217ddf2af8822 |
|
MD5 | 54112de5dbffc31977bb3c3da6676265 |
|
BLAKE2b-256 | 3205fd5fc48c27c3393eb450dfda879de8454b99ada2fc9dd73bc5eeec386e3e |
File details
Details for the file parse_1099-2.1.1-py3-none-any.whl
.
File metadata
- Download URL: parse_1099-2.1.1-py3-none-any.whl
- Upload date:
- Size: 34.9 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.8.16
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | af0adde0629828a15afded3da7d2c1cf9dc95a9b9f08817e1882813db759d61d |
|
MD5 | 681ac03b75cae399cd58b5a6c22375ab |
|
BLAKE2b-256 | 998ae3e24bffb2bd364dc8e1d6eeb4f4edd11dad8907b78b104543f365d8d304 |