A CLI to download, extract and convert Discogs data dumps

🎧 Discogs CLI — Data Processor Tool 💿

A modern command-line tool to download, extract, and convert Discogs data dumps into structured CSV files.


🚀 Features

  • 🧠 Scrape the list of the latest available data dumps from Discogs S3
  • ⬇️ Download .gz files for artists, labels, releases, masters
  • 📦 Extract .gz files to raw XML
  • ✂️ Chunk large XML into smaller files
  • 📄 Convert XML to clean, flat CSV files
  • 🗑 Delete selected or all files
  • ⚙️ Set custom download folder
  • 🧪 Easy to use from terminal with friendly UI
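
The extract and convert steps listed above can be sketched with the Python standard library alone. This is a minimal illustration, not the tool's actual implementation: the function names `extract_gz` and `xml_to_csv`, the `record_tag` and `fields` parameters, and the assumption that every record is a direct child of the XML root are all hypothetical choices made for the example.

```python
import csv
import gzip
import shutil
import xml.etree.ElementTree as ET
from pathlib import Path


def extract_gz(gz_path: Path, xml_path: Path) -> Path:
    """Decompress a .gz dump to raw XML without loading it all into memory."""
    with gzip.open(gz_path, "rb") as src, open(xml_path, "wb") as dst:
        shutil.copyfileobj(src, dst)
    return xml_path


def xml_to_csv(xml_path: Path, csv_path: Path, record_tag: str, fields: list[str]) -> int:
    """Stream records and write the text of selected child elements as CSV rows.

    Assumption: each record is a direct child of the root element, and the
    columns of interest are direct children of the record.
    """
    rows = 0
    depth = 0
    root = None
    with open(csv_path, "w", newline="", encoding="utf-8") as out:
        writer = csv.writer(out)
        writer.writerow(fields)
        for event, elem in ET.iterparse(xml_path, events=("start", "end")):
            if event == "start":
                depth += 1
                if root is None:
                    root = elem  # remember the root so finished records can be dropped
            else:
                if depth == 2 and elem.tag == record_tag:
                    writer.writerow([elem.findtext(name, default="") for name in fields])
                    rows += 1
                    root.clear()  # free processed records to keep memory flat
                depth -= 1
    return rows
```

Streaming with `iterparse` and clearing the root after each record keeps memory use roughly constant, which matters for dumps that are hundreds of megabytes compressed. Nested data (for example, lists inside a record) would need its own flattening rules, which this sketch does not attempt.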

🧩 Installation

🐍 Install with PyPI

pip install DiscogsDataProcessorCLI

🍻 Install with Homebrew

brew tap ofurkancoban/discogs
brew install discogs

📂 Install from source

git clone https://github.com/ofurkancoban/DiscogsCLI.git
cd DiscogsCLI
pip install -e .

💻 Usage

discogs run        # Auto: download → extract → convert
discogs show       # List available Discogs data
discogs download   # Just download selected files
discogs extract    # Extract downloaded .gz files
discogs convert    # Convert extracted XML to CSV
discogs delete     # Delete files by selection or --all
discogs config     # Set download folder

📁 Folder Structure

~/Downloads/Discogs/
├── .discogs_config.json
└── Datasets/
    └── 2025-04/
        ├── discogs_20250401_artists.gz
        ├── discogs_20250401_artists     ← extracted XML
        └── discogs_20250401_artists.csv ← converted CSV
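
If you want to locate these files from your own scripts, the layout above can be expressed as a small path helper. This is a sketch under assumptions: `dataset_paths` is a hypothetical name, the month folder is assumed to be derived from the dump date, and the extracted XML file is assumed to have no extension, exactly as the tree shows it.

```python
from pathlib import Path


def dataset_paths(root: Path, dump_date: str, kind: str) -> dict[str, Path]:
    """Build the expected file paths for one dump, mirroring the tree above.

    dump_date is a YYYYMMDD string such as "20250401"; kind is one of
    "artists", "labels", "releases", "masters".
    """
    # Assumption: the folder name is the year and month of the dump date.
    folder = root / "Datasets" / f"{dump_date[:4]}-{dump_date[4:6]}"
    stem = f"discogs_{dump_date}_{kind}"
    return {
        "gz": folder / f"{stem}.gz",    # downloaded archive
        "xml": folder / stem,           # extracted XML (no extension in the tree)
        "csv": folder / f"{stem}.csv",  # converted output
    }


paths = dataset_paths(Path.home() / "Downloads" / "Discogs", "20250401", "artists")
print(paths["csv"])
```

If you changed the download folder with `discogs config`, pass that folder as `root` instead.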

🧠 Example Workflow

discogs show
# [1] 2025-04 | releases | 950 MB
# [2] 2025-04 | artists  | 320 MB

discogs download
# Select 1,2
# Downloads the selected files only (no extraction or conversion)

discogs extract
# Select file to extract

discogs convert
# Select XML to convert
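
Once the workflow has produced a CSV, a quick way to inspect it is to read the first few rows with Python's `csv` module. The helper name `preview_csv` is hypothetical, and the column names you see depend entirely on what the tool writes; nothing here assumes a particular header.

```python
import csv
from itertools import islice
from pathlib import Path


def preview_csv(csv_path: Path, limit: int = 5) -> list[dict[str, str]]:
    """Return the first `limit` rows of a CSV as dicts keyed by the header row."""
    with open(csv_path, newline="", encoding="utf-8") as handle:
        return list(islice(csv.DictReader(handle), limit))


# Example call (the path follows the folder structure shown in this README):
# rows = preview_csv(
#     Path.home() / "Downloads/Discogs/Datasets/2025-04/discogs_20250401_artists.csv"
# )
# for row in rows:
#     print(row)
```

Because `islice` stops after `limit` rows, this stays fast even on very large converted files.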

🧑‍💻 Author

@ofurkancoban

📜 License

MIT — use freely, mention when you do something cool 😎

Built with ❤️ by @ofurkancoban

Release files (version 1.5.9)

Source distribution: discogsdataprocessorcli-1.5.9.tar.gz (14.2 kB)

  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.1
  • SHA256: 9c9e1f07b57073b860357ddf8b32f9e4fbd5caf2cc9885d193844cdbc88d3337
  • MD5: c6d1757253e25fc0ef39f38e7d86bacc
  • BLAKE2b-256: a77badafe50be39e172d3cf8be5cdac0eb8e2a831104487297aad3e52eba07ce

Built distribution: discogsdataprocessorcli-1.5.9-py3-none-any.whl (19.0 kB, Python 3)

  • SHA256: 53971e4f2b8baed62d01bc14a69ad8f7cf36c07f0b43502d32587912c398a85b
  • MD5: 7298ea6205a6820dc7e064847be9a86f
  • BLAKE2b-256: db614903399526aada53feb26a9ced836972537b0a87e9ddc287d71d7d6a8c12
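
To check a downloaded release file against the SHA256 digests listed above, something like the following should work. It is a minimal sketch using only `hashlib`; the helper names `sha256_of` and `matches_published_hash` are hypothetical, and the file is assumed to be in the current directory.

```python
import hashlib
from pathlib import Path


def sha256_of(path: Path, chunk_size: int = 1 << 20) -> str:
    """Compute the SHA256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_hash(path: Path, expected_hex: str) -> bool:
    """Compare a file's digest with a published one, ignoring case and whitespace."""
    return sha256_of(path) == expected_hex.strip().lower()


# Example call with the source distribution digest from this page:
# ok = matches_published_hash(
#     Path("discogsdataprocessorcli-1.5.9.tar.gz"),
#     "9c9e1f07b57073b860357ddf8b32f9e4fbd5caf2cc9885d193844cdbc88d3337",
# )
```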
