Skip to main content

TX Dallas Criminal Case Parser

Project description

TX Dallas County Felony and Misdemeanor Courts Parser

Dallas Screenshot

This is a Python package for parsing HTML pages retrieved from the Texas Dallas County Felony and Misdemeanor Courts Case Information. Since the county website strictly monitors server activity, this package is meant to be used only after the user has collected HTML files for their use. Put differently, I will not be sharing any code that can be used to collect these raw HTML files.

Output

The dallasparser package will generate a maximum of 15 .xlsx files for each data table found from the county website:

Table Name Output
Appeals appeals.xlsx
Bonds bonds.xlsx
Bond COMMENTS bond_comments.xlsx
Charges charges.xlsx
Competency Data competency_data.xlsx
Dispositions dispositions.xlsx
General Comments general_comments.xlsx
General Comments WS Date general_comments_ws_date.xlsx
Judicial Information judicial_information.xlsx
Motions motions.xlsx
Names names.xlsx
Payments payments.xlsx
Probation Revocation probation_revocation.xlsx
Reduced/Enhanced Charges reduced_enhanced_charges.xlsx
Sets and Passes sets_and_passes.xlsx

Installation

Source

$ git clone https://github.com/jaycatsby/tx_dallas_court_parser.git
$ cd tx_dallas_court_parser
$ python setup.py install

PyPI

$ pip install dallasparser

Usage

A. CLI

$ dallasparser [-h] [-i INPUT] [-o OUTPUT]

optional arguments:
  -h, --help            show this help message and exit
  -i INPUT, --input INPUT
                        absolute path of HTML folder
  -o OUTPUT, --output OUTPUT
                        absolute path of XLSX output files

B. Module

from dallasparser.parser import TXDallasParser
parser = TXDallasParser(html_path, xlsx_path)
parser.run()

Contributions

Have an idea to make it better? Go ahead! I will be happy to see a pull request from you! :blush:

While creating a PR, please update the Contributor(s) section below as well.

Contributor(s)

Jay C. @ Squalo Dev LLC

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

dallasparser-0.3.tar.gz (18.8 kB view details)

Uploaded Source

File details

Details for the file dallasparser-0.3.tar.gz.

File metadata

  • Download URL: dallasparser-0.3.tar.gz
  • Upload date:
  • Size: 18.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.22.0 setuptools/45.1.0 requests-toolbelt/0.9.1 tqdm/4.41.1 CPython/3.7.4

File hashes

Hashes for dallasparser-0.3.tar.gz
Algorithm Hash digest
SHA256 2a886e9a7dc3ec852a3c3773c24764c0db0ce57883debb13b6a4ef2d4d9b6343
MD5 16a1ed24f404c859fa065a40fd5159ce
BLAKE2b-256 59c6098af3b210d8a4294beebb559ffd1e81ed4476b068c06093a5a34de23626

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page