Mine and extract complete package lists from NPM registry
Project description
NPM Package Miner
A Python tool to mine and extract complete package lists from the NPM registry.
Features
- Fetches all ~2-3 million NPM packages from the official registry
- Retrieves package metadata including homepage and repository URLs
- Parallel processing with 50 workers for efficient data collection
- Progress tracking with visual feedback
- Outputs standardized CSV format for cross-ecosystem analysis
Installation
pip install npm-miner
Quick Start
npm-miner
Or use as a Python module:
from npm_miner import mine_npm
mine_npm()
Output
Generates a CSV file with package information:
- Package ID, Platform, Name
- Homepage URL, Repository URL
Performance
- Runtime: 10-20 hours for complete dataset
- Uses 50 parallel workers
- Network-dependent processing speed
Data Source
- NPM Registry: https://registry.npmjs.org/
- All packages: https://replicate.npmjs.com/_all_docs
License
MIT License - see LICENSE file for details
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
npm_miner-1.0.2.tar.gz
(8.7 kB
view details)
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file npm_miner-1.0.2.tar.gz.
File metadata
- Download URL: npm_miner-1.0.2.tar.gz
- Upload date:
- Size: 8.7 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.14.2
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
bd2d85dfa5f2bb5e37e832b372e039f6c8fd1db0a6ef694f8ed6e16214bdea67
|
|
| MD5 |
b14948866a9191e931e3cf3ed76c8620
|
|
| BLAKE2b-256 |
907ec3235468251c25a9108cba293ee821a9116b0c6af9a985859b757a07f44a
|
File details
Details for the file npm_miner-1.0.2-py3-none-any.whl.
File metadata
- Download URL: npm_miner-1.0.2-py3-none-any.whl
- Upload date:
- Size: 8.4 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.14.2
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
7b63f7c6109616d80943b94a72743ebadd68fa62fde34d8ec06e77503e7a51c6
|
|
| MD5 |
3a7f84837996b936ae6bcdd9c76fc5c8
|
|
| BLAKE2b-256 |
78e029c7893d678c5aa65df857aaed92548ee3dc6413bb033ce128f8b4ccda5d
|