A CLI to download, extract and convert Discogs data dumps
Project description
🎧 Discogs CLI — Data Processor Tool 💿
A modern command-line tool to download, extract, and convert Discogs data dumps into structured CSV files.
🚀 Features
- 🧠 Scrape latest available data dump list from Discogs S3
- ⬇️ Download
.gzfiles for artists, labels, releases, masters - 📦 Extract
.gzfiles to raw XML - ✂️ Chunk large XML into smaller files
- 📄 Convert XML to clean, flat CSV files
- 🗑 Delete selected or all files
- ⚙️ Set custom download folder
- 🧪 Easy to use from terminal with friendly UI
🧩 Installation
🐍 Install with PyPI
pip install DiscogsDataProcessorCLI
🍻 Install with Homebrew
brew tap ofurkancoban/discogs
brew install discogs
or
git clone https://github.com/ofurkancoban/DiscogsCLI.git
cd DiscogsCLI
pip install -e .
💻 Usage
discogs run # Auto: download → extract → convert
discogs show # List available Discogs data
discogs download # Just download selected files
discogs extract # Extract downloaded .gz files
discogs convert # Convert extracted XML to CSV
discogs delete # Delete files by selection or --all
discogs config # Set download folder
📁 Folder Structure
~/Downloads/Discogs/
├── .discogs_config.json
└── Datasets/
└── 2025-04/
├── discogs_20250401_artists.gz
├── discogs_20250401_artists ← .xml
└── discogs_20250401_artists.csv ← converted
🧠 Example Workflow
discogs show
# [1] 2025-04 | releases | 950 MB
# [2] 2025-04 | artists | 320 MB
discogs download
# Select 1,2
# Downloads only
discogs extract
# Select file to extract
discogs convert
# Select XML to convert
🧑💻 Author
- GitHub: github.com/ofurkancoban
- LinkedIn: linkedin.com/in/ofurkancoban
- Kaggle: kaggle.com/ofurkancoban
📜 License
MIT — use freely, mention when you do something cool 😎
Built with ❤️ by @ofurkancoban
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file discogsdataprocessorcli-1.5.9.tar.gz.
File metadata
- Download URL: discogsdataprocessorcli-1.5.9.tar.gz
- Upload date:
- Size: 14.2 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.13.1
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
9c9e1f07b57073b860357ddf8b32f9e4fbd5caf2cc9885d193844cdbc88d3337
|
|
| MD5 |
c6d1757253e25fc0ef39f38e7d86bacc
|
|
| BLAKE2b-256 |
a77badafe50be39e172d3cf8be5cdac0eb8e2a831104487297aad3e52eba07ce
|
File details
Details for the file discogsdataprocessorcli-1.5.9-py3-none-any.whl.
File metadata
- Download URL: discogsdataprocessorcli-1.5.9-py3-none-any.whl
- Upload date:
- Size: 19.0 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.13.1
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
53971e4f2b8baed62d01bc14a69ad8f7cf36c07f0b43502d32587912c398a85b
|
|
| MD5 |
7298ea6205a6820dc7e064847be9a86f
|
|
| BLAKE2b-256 |
db614903399526aada53feb26a9ced836972537b0a87e9ddc287d71d7d6a8c12
|