Skip to main content

Mine and extract complete package lists from NPM registry

Project description

NPM Package Miner

A Python tool to mine and extract complete package lists from the NPM registry.

Features

  • Fetches all ~2-3 million NPM packages from the official registry
  • Retrieves package metadata including homepage and repository URLs
  • Parallel processing with 50 workers for efficient data collection
  • Progress tracking with visual feedback
  • Outputs standardized CSV format for cross-ecosystem analysis

Installation

pip install npm-miner

Quick Start

npm-miner

Or use as a Python module:

from npm_miner import mine_npm
mine_npm()

Output

Generates a CSV file with package information:

  • Package ID, Platform, Name
  • Homepage URL, Repository URL

Performance

  • Runtime: 10-20 hours for complete dataset
  • Uses 50 parallel workers
  • Network-dependent processing speed

Data Source

License

MIT License - see LICENSE file for details

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

npm_miner-1.0.2.tar.gz (8.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

npm_miner-1.0.2-py3-none-any.whl (8.4 kB view details)

Uploaded Python 3

File details

Details for the file npm_miner-1.0.2.tar.gz.

File metadata

  • Download URL: npm_miner-1.0.2.tar.gz
  • Upload date:
  • Size: 8.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.14.2

File hashes

Hashes for npm_miner-1.0.2.tar.gz
Algorithm Hash digest
SHA256 bd2d85dfa5f2bb5e37e832b372e039f6c8fd1db0a6ef694f8ed6e16214bdea67
MD5 b14948866a9191e931e3cf3ed76c8620
BLAKE2b-256 907ec3235468251c25a9108cba293ee821a9116b0c6af9a985859b757a07f44a

See more details on using hashes here.

File details

Details for the file npm_miner-1.0.2-py3-none-any.whl.

File metadata

  • Download URL: npm_miner-1.0.2-py3-none-any.whl
  • Upload date:
  • Size: 8.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.14.2

File hashes

Hashes for npm_miner-1.0.2-py3-none-any.whl
Algorithm Hash digest
SHA256 7b63f7c6109616d80943b94a72743ebadd68fa62fde34d8ec06e77503e7a51c6
MD5 3a7f84837996b936ae6bcdd9c76fc5c8
BLAKE2b-256 78e029c7893d678c5aa65df857aaed92548ee3dc6413bb033ce128f8b4ccda5d

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page