Skip to main content

Extract public Spotify data - tracks, albums, artists, playlists, podcasts - without the official API.

Project description

SpotifyScraper

PyPI version Python versions CI Docs License: MIT

Extract public Spotify data — tracks, albums, artists, playlists, and podcasts — without the official API or any credentials.

SpotifyScraper bootstraps an anonymous token from Spotify's own public embed pages and reads the same JSON endpoints the web player uses, returning typed, immutable models. v3 is a ground-up rewrite focused on reliability and a clean, modern API.

Upgrading from v2? See the migration guide. The previous line lives on the v2.x branch.

Install

pip install spotifyscraper                 # core (only depends on httpx)
pip install "spotifyscraper[media]"        # + cover/preview embedding (mutagen)
pip install "spotifyscraper[browser]"      # + Playwright browser fallback
pip install "spotifyscraper[cli]"          # + the spotifyscraper command-line tool
pip install "spotifyscraper[all]"          # everything

Python 3.10+.

Quickstart

from spotify_scraper import SpotifyClient

with SpotifyClient() as client:
    track = client.get_track("https://open.spotify.com/track/4uLU6hMCjMI75M1A2tKUQC")
    print(track.name, "—", track.artists[0].name)
    print(track.duration_ms, "ms |", track.preview_url)

    print(track.to_dict())          # JSON-safe dict, if you prefer dicts

Every entity has its own method — get_track, get_album, get_artist, get_playlist, get_episode, get_show — each accepting a URL, URI, or bare ID.

Async

import asyncio
from spotify_scraper import AsyncSpotifyClient

async def main():
    async with AsyncSpotifyClient() as client:
        track, album = await asyncio.gather(
            client.get_track("4uLU6hMCjMI75M1A2tKUQC"),
            client.get_album("4aawyAB9vmqN3uQ7FjRGTy"),
        )
        print(track.name, "|", album.name)

asyncio.run(main())

Download a cover and preview

from spotify_scraper import SpotifyClient

with SpotifyClient() as client:
    track = client.get_track("4uLU6hMCjMI75M1A2tKUQC")
    client.download_cover(track, dest="covers/")
    client.download_preview(track, dest="previews/", embed_cover=True)  # needs [media]

Features

  • All core entities + podcasts — tracks, albums, artists, playlists, shows, episodes.
  • Sync & async clients sharing one sans-io core.
  • Typed, frozen models with JSON-safe to_dict() / from_dict().
  • Two-tier resilience — Spotify's GraphQL API with automatic fallback to the embed page.
  • One core dependency (httpx); media and browser support are optional extras.
  • Anti-ban built in — per-host rate limiting, retries with backoff, UA rotation, proxies.
  • Browser fallback via Playwright when you need a real browser.

Command line

With the cli extra installed, a spotifyscraper command is available:

spotifyscraper track 4uLU6hMCjMI75M1A2tKUQC          # entity metadata as JSON
spotifyscraper playlist <id> --max-tracks 50 --pretty
spotifyscraper download preview <id> -o ./previews --embed-cover

Every command emits JSON, so it composes with tools like jq. See the CLI guide.

Roadmap

Version Scope
3.0 The library: all entities, pagination, media downloads, browser fallback, docs
3.1 Command-line interface
next Cookie-authenticated lyrics

Documentation

Full docs, guides, and the API reference: https://spotifyscraper.readthedocs.io

Legal

SpotifyScraper is an unofficial, independent project, not affiliated with Spotify. It reads publicly available data and the ~30-second previews Spotify publishes; it does not download full tracks or circumvent DRM. Use it for educational and personal purposes, and in line with Spotify's Terms of Service. See the legal notice.

Contributing

Contributions are welcome — see CONTRIBUTING.md. The project is developed spec-first with OpenSpec; specs live in openspec/.

License

MIT © Ali Akhtari

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

spotifyscraper-3.1.0.tar.gz (285.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

spotifyscraper-3.1.0-py3-none-any.whl (59.4 kB view details)

Uploaded Python 3

File details

Details for the file spotifyscraper-3.1.0.tar.gz.

File metadata

  • Download URL: spotifyscraper-3.1.0.tar.gz
  • Upload date:
  • Size: 285.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for spotifyscraper-3.1.0.tar.gz
Algorithm Hash digest
SHA256 45c15f6a12ec705f87195b951add6e23f29b1f61f607662a117d61a3875cc0e2
MD5 3dd73be423362aff9573ca8cb70e5378
BLAKE2b-256 976edd85da754349348e1b190c353150bd911d3795b9b8aaf03ed9c51f9bef40

See more details on using hashes here.

File details

Details for the file spotifyscraper-3.1.0-py3-none-any.whl.

File metadata

  • Download URL: spotifyscraper-3.1.0-py3-none-any.whl
  • Upload date:
  • Size: 59.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for spotifyscraper-3.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 d35d9b0345525a994b80d29bd4d2f2505e2fdfeb219a28f8f9b3fc61fcad5616
MD5 81a46bae3dae20558193c1dc9ad61506
BLAKE2b-256 751c442bf7504cf74fcbb15c4b8b907c223599678f87f9ba71ea652df4decbd3

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page