A package for downloading bulk files from CourtListener

Project description

Easy Bulk export, no cap

This repository provides scripts and notebooks that make it easy to export data in bulk from CourtListener's freely available downloads.

  • Create a first version of the notebook suitable for data scientists
    • Define appropriate dtypes to optimize pandas storage
    • Select only the necessary columns with usecols; for example, the 'created_by' date field, which only records a database insert, isn't needed
    • Read opinions.csv (190+ GB) from disk one chunk at a time while converting it to JSON
  • Create a standalone script that can be piped to other tools
  • Improve speed by using a Dask DataFrame
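
The notebook items above (explicit dtypes, usecols, and chunked reading with conversion to JSON) can be sketched roughly as follows. This is a minimal illustration using pandas, not the package's actual implementation: the function name csv_to_jsonl, the column names, and the dtypes are all assumptions, and the real opinions.csv schema may differ.

```python
import pandas as pd

# Hypothetical column selection and dtypes (assumptions for illustration only;
# check the real opinions.csv header before using them).
USECOLS = ["id", "cluster_id", "type", "plain_text"]
DTYPES = {
    "id": "int64",
    "cluster_id": "int64",
    "type": "category",      # few distinct values, so category saves memory
    "plain_text": "string",
}


def csv_to_jsonl(source, sink, chunksize=10_000):
    """Stream a CSV to JSON Lines one chunk at a time; return rows written.

    `source` is a path or file-like object, `sink` is any writable text stream.
    Only one chunk is held in memory at a time.
    """
    rows = 0
    reader = pd.read_csv(source, usecols=USECOLS, dtype=DTYPES, chunksize=chunksize)
    for chunk in reader:
        # orient="records" with lines=True emits one JSON object per row.
        text = chunk.to_json(orient="records", lines=True)
        sink.write(text)
        # Some pandas versions omit the trailing newline; add it if missing
        # so consecutive chunks do not run together.
        if text and not text.endswith("\n"):
            sink.write("\n")
        rows += len(chunk)
    return rows
```

Because the sink is just a text stream, calling csv_to_jsonl("opinions.csv", sys.stdout) from a small script would let the output be piped to other tools, in the spirit of the standalone-script item. The Dask step is not shown here.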

Download files

Source Distribution

lil_nocap-0.2.1.tar.gz (8.6 kB)

Uploaded Source

Built Distribution

lil_nocap-0.2.1-py3-none-any.whl (12.2 kB)

Uploaded Python 3

File details

Details for the file lil_nocap-0.2.1.tar.gz.

File metadata

  • Download URL: lil_nocap-0.2.1.tar.gz
  • Upload date:
  • Size: 8.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.3.2 CPython/3.10.10 Darwin/21.5.0

File hashes

Hashes for lil_nocap-0.2.1.tar.gz
Algorithm Hash digest
SHA256 034b7700c1f22d502b61f05d3a93a7883166ccb9bf2b38f0c9bc35b65c94b2ff
MD5 ce7e361af172bd083ea50775bb436516
BLAKE2b-256 ad182e05c5bbaebb3d1d9ad1a0d39917bcc958835e1fe8bc1ece009d51d6e66f

File details

Details for the file lil_nocap-0.2.1-py3-none-any.whl.

File metadata

  • Download URL: lil_nocap-0.2.1-py3-none-any.whl
  • Upload date:
  • Size: 12.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.3.2 CPython/3.10.10 Darwin/21.5.0

File hashes

Hashes for lil_nocap-0.2.1-py3-none-any.whl
Algorithm Hash digest
SHA256 0e94de2ab2a38cc7ed4f9d09176dce4d7ffe5211e0a5628578e2a5a2ce9efbb8
MD5 b5524e21ac8b02fcf0f6ddb54463aa06
BLAKE2b-256 df1a6164292729c9b72403e3b744edd8848de4c98ceacb22b06a56294d046a15
