Skip to main content

Oklahoma State Courts Network case parsing utilities

Project description

OSCN utilities

A python library for scraping case information from the Oklahoma State Courts Network.

Update:

  • 04/2025: Completed migration to Selectolax. BeautifulSoup has been removed as a dependency. All parsers now use Selectolax.
  • 12/16/2024: Updated to use Selectolax instead of BeautifulSoup. Most parsing functions now use Selectolax which is faster and more memory efficient.

Contents

oscn > Python package source to provide an api for retrieving and parsing case records.

scripts > Python scripts showing use of the oscn package

  • example.py: demonstrates use of the request Case and Caselist
  • retrieve-counts.py: saves a list of all counts for a list of counties and years
  • find-counts.py: saves a list of counts passing a test for a list of counties and years
  • soup_test.py: a stub for testing parsing attempts using BeautifulSoup
  • parse_test.py: a stub for developing using saved examples

OSCN package

oscn

  • counties: Returns a list of counties.
  • courts: Same as counties but more a accurate description.
  • judges: Returns a list of objects formated as {'name': 'Bond, James', 'number': '007'}
  • types: returns a dict of case type codes and descriptons
  • type: function to return case type description. Usage:
    >>> oscn.type("AO")
    'CIVIL ADMINISTRATIVE'
    

oscn.request

  • Case: Returns a single case. Case can be saved as files using Case.save() and retrieved using Case.open().

  • CaseList: Returns an iterator for retrieving cases for a county and year. CaseLists can be filtered using .find(). See scripts/example.py for details

  • Party: Returns information on parties available on OSCN.

  • Docket: Returns docket of cases for specific judges and date

oscn.parse

Parsers accept the html of an OSCN page and return python objects.

Case Page Parsers

  • filed: returns a string of the filing date (e.g. 12/25/2017)
  • closed: returns a string of the date the case was closed. Return None if not closed.
  • counts: returns of list of count dicts found in a case. Keys include 'description' of the count. If available 'violation' and 'disposed' are added.
  • judge: returns a string of the judge's name
  • parties: returns a list of dicts with these keys: id, name, type
  • docket: returns a list of rows in a docket
  • events: returns a list of dicts with these keys: event, party, docket, reporter, date, description. The keys date and description are cleaner versions of the event text. The event key will be deprecated some day so use date and description if you are starting a project.
  • attorneys: returns a list of dicts with these keys: name, address, and representing
  • issues: returns a list of dicts with issue information. Each issues includes a list of dicts for each party

Party Page Parsers

  • name: returns 'Requested Party'
  • alias: returns 'Alias or Alternate Names'
  • profile: returns dict of values in 'Personal Profile'
  • birth_month: returns string of 'Birth Month and Year'
  • addresses: returns a list of dicts for each address

Docket Page Parsers

  • cases: returns a list of case indexes
  • tables: returns the html table for each case in the docket

oscn.find

  • CaseIndexes: returns an iterator of case indexes (e.g. tulsa-CF-2019-12).

Usage

Create a CaseIndexes list using these key word arguments:

  • county: defaults to all,
  • last_name: use this for company or organization names
  • first_name: optional
  • middle_name: optional
  • filed_after: More readable than FiledDateL
  • filed_before: More readable than FiledDateH
  • closed_after: More readable than ClosedDateL
  • closed_before: More readable than ClosedDateH

Notes

  • The % wild card is added to all words in name, first and middle
  • Date arguments use MM/DD/YYY strings.

OSCN search parameters

If you are familar with the OSCN search parameters you can initialize CaseIndexes using these as key word arguments: db, number, lname, fname, mname, DoBMin, DoBMax, partytype, apct, dcct, FiledDate, FiledDateH, ClosedDateL, ClosedDateH, iLC, iLCType, iYear, iNumber, and citation

Using this will override init keyword values such as first or filed_after.

Installation

To install the latest version of this package, run the following command: pip install oscn

For development, clone the repository and install the dependencies using poetry: git clone git@github.com:codefortulsa/oscn.git cd oscn poetry install

Usage

Script example:

import oscn

Request a single case:

oscn.request.Case(county='tulsa', year='2018', number=84)

or use case index notation:

oscn.request.Case('love-CF-2019-25')

To request a list of cases to iterate:

oscn.request.CaseList(county='adair', year='2016')

Run test scripts

  • poetry run pytest tests/

or with ipdb:

- `poetry run pytest -s tests/`

specify a test:

  • poetry run pytest -s tests/test_parse.py -k 'test_events'

parallel testing: -poetry run pytest -n auto

Deployment steps

To deploy a new version of the package, follow these steps:

  1. poetry version patch
  2. poetry build
  3. poetry publish

User Agent

In some cases a custom user agent is required in the header of requests. Setting an environmental varialbe called OSCN_USER_AGENT will override the default.

This project is open-source and contributions are welcome.


Release Notes

Version 0.0.98

  • Completed full migration from BeautifulSoup to Selectolax. All parsers (parties, attorneys, counts, issues, events, docket, cmids, party_addresses, party_profile, party_properties, docket_report, find/parse) now use Selectolax exclusively.
  • Removed beautifulsoup4 as a dependency.
  • Removed global SSL warning suppression that was masking urllib3 warnings.

Version 0.0.92

  • Added the .documents property to the Case object, which parses the case docket and returns a list of all linked documents. Each document is a dictionary with the following structure:

    {
      "id": 1234567,
      "title": "MOTION TO COMPEL",
      "url": "https://www.oscn.net/dockets/GetDocument.aspx?...",
      "date": "01-01-2024",
      "code": "MO",
      "party": "PLAINTIFF"
    }
    

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

oscn-0.0.98.tar.gz (30.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

oscn-0.0.98-py3-none-any.whl (37.0 kB view details)

Uploaded Python 3

File details

Details for the file oscn-0.0.98.tar.gz.

File metadata

  • Download URL: oscn-0.0.98.tar.gz
  • Upload date:
  • Size: 30.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/2.1.3 CPython/3.11.12 Darwin/25.4.0

File hashes

Hashes for oscn-0.0.98.tar.gz
Algorithm Hash digest
SHA256 03e16ecc3b14044874ad6b272179ce85f1d63f03d23569d56147b1903d0a3332
MD5 b5f6732241d565f6886a4533655c8d75
BLAKE2b-256 2336b57d6943306be385be8316b5e859b61826bf99224e2dd550659d4f61b2f5

See more details on using hashes here.

File details

Details for the file oscn-0.0.98-py3-none-any.whl.

File metadata

  • Download URL: oscn-0.0.98-py3-none-any.whl
  • Upload date:
  • Size: 37.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/2.1.3 CPython/3.11.12 Darwin/25.4.0

File hashes

Hashes for oscn-0.0.98-py3-none-any.whl
Algorithm Hash digest
SHA256 c6a57a98d5599dcf2818054c0a620ace8333d6fabe693e87df953e3ecb8b53db
MD5 bb8f8fb392b25339c564fbfa0de58e50
BLAKE2b-256 f71e0ef7b4a7576933f4db6fa2e40bffbd189027316178396ba5df0f6e7c0838

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page