
A package for downloading bulk files from CourtListener

Project description

Easy Bulk export, no cap

This repository provides scripts and notebooks that make it easy to export data in bulk from CourtListener's freely available downloads.

  • Create first version of notebook suitable for data scientists
    • Define appropriate dtypes to reduce pandas memory usage
    • Select only the necessary columns with usecols; for example, the 'created_by' date field, which records a database insert, isn't needed
    • Read opinions.csv (190+ GB) from disk one chunk at a time while converting it to JSON
  • Create a standalone script that can be piped to other tools
  • Improve speed by using Dask DataFrame
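
The notebook steps listed above (explicit dtypes, usecols, and chunked reading with JSON conversion) could be sketched roughly as follows. This is a minimal illustration using plain pandas, not the package's actual implementation: the function name `csv_to_json_lines` and the column names in `USECOLS` and `DTYPES` are assumptions made for the example, and the real opinions.csv schema may differ.

```python
import io
import json

import pandas as pd

# Hypothetical column selection and dtypes; adjust to the real opinions.csv schema.
USECOLS = ["id", "type", "plain_text"]
DTYPES = {"id": "int64", "type": "category"}


def csv_to_json_lines(source, sink, chunksize=1000):
    """Stream a CSV to newline-delimited JSON without loading it all into memory.

    `source` is a path or file-like object accepted by pandas.read_csv;
    `sink` is any object with a write() method. Returns the number of rows written.
    """
    rows = 0
    # With chunksize set, read_csv returns an iterator of DataFrames.
    reader = pd.read_csv(source, usecols=USECOLS, dtype=DTYPES, chunksize=chunksize)
    for chunk in reader:
        # lines=True emits one JSON object per row; normalise the trailing newline
        # ourselves so chunks concatenate cleanly across pandas versions.
        text = chunk.to_json(orient="records", lines=True).rstrip("\n")
        sink.write(text + "\n")
        rows += len(chunk)
    return rows


if __name__ == "__main__":
    # Tiny in-memory stand-in for the real file, including a column we drop.
    sample = io.StringIO(
        "id,type,plain_text,created_by\n"
        "1,lead,Hello,2020-01-01\n"
        "2,dissent,World,2020-01-02\n"
    )
    out = io.StringIO()
    csv_to_json_lines(sample, out, chunksize=1)
    for line in out.getvalue().splitlines():
        print(json.loads(line))
```

Because the function writes to any file-like sink, passing `sys.stdout` would let the output be piped to other tools, which is in the spirit of the standalone-script item above.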

Download files


Source Distribution

lil_nocap-0.5.4.tar.gz (11.3 kB)

Uploaded Source

Built Distribution

lil_nocap-0.5.4-py3-none-any.whl (13.7 kB)

Uploaded Python 3
