ytmusic-to-spotify-sync
Automatically sync your YouTube Music playlists to Spotify — with smart track matching, diff-based updates, and full metadata enrichment.
Why?
YouTube Music and Spotify don't talk to each other. If you curate playlists on one platform you either listen on two platforms or manually rebuild them. This tool automates the bridge: it reads your YT Music playlist, finds the matching tracks on Spotify, and keeps the two in sync. Only changes since the last run are processed — no full re-scan, no duplicates.
Features
| Feature | Description |
|---|---|
| Live YT Music API | Fetches your playlist directly via ytmusicapi — no manual export required |
| 3-pass smart matching | ISRC exact match → normalised title + artist → fuzzy fallback with duration validation |
| Diff-based sync | JSON snapshots track playlist state; only added/removed tracks are touched each run |
| Spotify playlist management | Adds new matches and removes deleted tracks automatically |
| Audio features enrichment | Schema for danceability, energy, valence, tempo, key, and 7 more (populated only if your Spotify app has audio-features access; see Known limitations) |
| Metadata enrichment | Captures ISRC, explicit flag, album release date, album type, track number, and Spotify popularity |
| Last.fm artist tags | Pulls play counts, listeners, and dense genre tags via artist.getInfo (typical ~93 % coverage on niche libraries) |
| Local genre + mood classification | Derives primary_genre (17 buckets) and mood (13 labels) from the tag pool — no API calls |
| CSV fallback | Works from a CSV export if you prefer not to use the live API |
| Resume after rate limits | Match progress is cached every 25 tracks; re-running continues where you left off |
| Quota-friendly --limit | Cap new tracks per run to stay within Spotify's daily API quota |
| Interactive menu | Run without arguments for a guided, looping step-by-step experience |
| Dry-run mode | Preview every change before it is applied |
| Unmatched tracking | Saves failed matches to data/unmatched.csv for manual review or later retry |
Quick Start
1. Install
git clone https://github.com/davidmalko87/ytmusic-to-spotify-sync.git
cd ytmusic-to-spotify-sync
pip install -r requirements.txt
2. Configure
cp .env.example .env
Edit .env with your credentials:
| Variable | Where to get it |
|---|---|
| SPOTIPY_CLIENT_ID | Spotify Developer Dashboard → Create App |
| SPOTIPY_CLIENT_SECRET | Same app page, under "Settings" |
| SPOTIPY_REDIRECT_URI | Set to http://127.0.0.1:8888/callback and add it in the Spotify app settings |
| SPOTIFY_PLAYLIST_ID | Create a playlist on Spotify; the ID is the last segment of its URL |
| YTMUSIC_PLAYLIST_ID | From your YT Music playlist URL: ...playlist?list=<ID> |
3. Authenticate YT Music
python playlist_sync.py setup-ytmusic
One-time setup — paste request headers from browser DevTools. Auth is valid for ~2 years.
How to get the request headers
- Open music.youtube.com in your browser (logged in)
- Press F12 to open DevTools
- Go to the Network tab and type /browse in the filter bar
- Click on any playlist or page in YT Music to trigger a request
- Find a POST request to browse with status 200
- Click it → Headers tab → Request Headers
- Copy all the request headers and paste them into the terminal when prompted
Both the Chrome two-line format and the standard key: value format are accepted.
4. Run your first sync
# Preview what would happen (no changes made)
python playlist_sync.py sync --dry-run
# Run the actual sync
python playlist_sync.py sync
Configuration reference
All options are set via environment variables (.env file or shell environment).
| Variable | Required | Default | Description |
|---|---|---|---|
| SPOTIPY_CLIENT_ID | Yes | none | Spotify app client ID |
| SPOTIPY_CLIENT_SECRET | Yes | none | Spotify app client secret |
| SPOTIPY_REDIRECT_URI | No | http://127.0.0.1:8888/callback | OAuth redirect URI |
| SPOTIFY_PLAYLIST_ID | Yes | none | ID of the target Spotify playlist |
| YTMUSIC_PLAYLIST_ID | Yes* | none | ID of the source YT Music playlist (*not needed with --from-csv) |
| YTMUSIC_AUTH_FILE | No | browser.json | Path to the YT Music auth JSON file |
| SOURCE_CSV | No | none | Path to a CSV export to use instead of the live API |
| LASTFM_API_KEY | No | none | Free key from last.fm/api/account/create; enables play-count, listener, and artist-tag enrichment |
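The required/optional rules in the table can be checked before a run. The following is a minimal sketch, not the tool's own code: the function name `missing_settings` and the `from_csv` parameter are hypothetical, and only the variable names are taken from the table above.

```python
from typing import Mapping

# Variable names come from the configuration table; everything else is illustrative.
ALWAYS_REQUIRED = ("SPOTIPY_CLIENT_ID", "SPOTIPY_CLIENT_SECRET", "SPOTIFY_PLAYLIST_ID")


def missing_settings(env: Mapping[str, str], from_csv: bool = False) -> list[str]:
    """Return the names of required variables that are unset or blank."""
    required = list(ALWAYS_REQUIRED)
    if not from_csv:
        # The YT Music playlist ID is only needed when reading from the live API.
        required.append("YTMUSIC_PLAYLIST_ID")
    return [name for name in required if not env.get(name, "").strip()]
```

Calling `missing_settings(os.environ)` after loading `.env` would list anything still to be filled in.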
Usage
Interactive mode
python playlist_sync.py
==================================================
Playlist Sync: YT Music -> Spotify
==================================================
[1] Setup YT Music auth
[2] Import from CSV
[3] Snapshot YT Music playlist
[4] Show diff (changes)
[5] Full sync to Spotify
[6] Sync from CSV file
[7] Retry unmatched tracks
[8] Enrich with Last.fm
[9] Classify genre + mood from tags
[10] Sync Liked Songs (YTM <-> Spotify)
[11] Export enriched data to JSON
[12] Re-push to recreated Spotify playlist
[13] Show status
[0] Exit
Command-line mode
python playlist_sync.py setup-ytmusic # One-time browser auth setup
python playlist_sync.py snapshot # Save current playlist state
python playlist_sync.py diff # Show changes since last snapshot
python playlist_sync.py sync # Full sync (YT Music API → Spotify)
python playlist_sync.py sync --from-csv # Sync from CSV export instead
python playlist_sync.py sync --dry-run # Preview without making changes
python playlist_sync.py sync --limit 50 # Match at most 50 new tracks this run
python playlist_sync.py sync --retry-unmatched # Also retry tracks that previously failed
python playlist_sync.py retry-unmatched # Standalone retry of previously failed matches
python playlist_sync.py lastfm # Re-run Last.fm enrichment on the existing CSV
python playlist_sync.py classify # Re-derive primary_genre and mood from tags
python playlist_sync.py classify --force # Re-classify even rows that already have values
python playlist_sync.py sync-likes # Mirror YT Music liked songs to Spotify Liked Songs
python playlist_sync.py sync-likes --dry-run # Preview likes-sync changes
python playlist_sync.py export # Export enriched CSV as JSON (default: data/playlist_enriched.json)
python playlist_sync.py export -o my_data.json # Custom output path
python playlist_sync.py repush # Re-push all matched URIs (idempotent — only adds missing)
python playlist_sync.py repush --replace # Wipe the playlist first, then add (cleans duplicates)
python playlist_sync.py repush --dry-run # Preview without pushing
python playlist_sync.py status # Show sync statistics
All commands accept --verbose / -v for debug-level log output.
How it works
YT Music Playlist ──(API or CSV)──> Current State
│
Diff Engine ←── Previous Snapshot
╱ ╲
Added Tracks Removed Tracks
│ │
Matcher Spotify Remove
╱ ╲
Matched Unmatched
│ │
Spotify Add unmatched.csv
│
Enricher (audio features + metadata)
│
playlist_enriched.csv
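The diff step in the diagram can be sketched as a set comparison between two snapshots. This is an illustration under assumptions, not the contents of differ.py: it assumes each snapshot is a list of track dicts and that tracks are keyed on the YT Music `trackId`; the function name `diff_snapshots` is hypothetical.

```python
def diff_snapshots(previous: list[dict], current: list[dict]) -> tuple[list[dict], list[dict]]:
    """Return (added, removed) tracks, compared by trackId (an assumed key)."""
    prev_ids = {track["trackId"] for track in previous}
    curr_ids = {track["trackId"] for track in current}
    # Added: in the current playlist but not in the previous snapshot.
    added = [track for track in current if track["trackId"] not in prev_ids]
    # Removed: in the previous snapshot but no longer in the playlist.
    removed = [track for track in previous if track["trackId"] not in curr_ids]
    return added, removed
```

Only the `added` list would need matching, and only the `removed` list would need deleting on the Spotify side, which is why unchanged tracks cost no API calls.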
Track matching strategy
The matcher runs three passes in order of reliability:
- ISRC match: searches Spotify by isrc: query; highest accuracy, ~95 % confidence
- Title + Artist: normalised search (feat. stripped, HTML decoded), validated by duration ± 5 s
- Relaxed: title-only search with fuzzy artist matching; catches live versions and alternate releases
Unmatched tracks are written to data/unmatched.csv and can be retried later with retry-unmatched.
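The normalisation and duration check used by the second pass might look roughly like this. It is a sketch under assumptions: the helper names `normalise` and `duration_ok` and the exact regular expression are hypothetical, and the real matcher.py may treat edge cases differently (a title that starts with the word "feat", for example, would be truncated here).

```python
import html
import re

# Matches "(feat. X)", "[ft. X]", or a trailing "feat. X" credit.
_FEAT = re.compile(r"\s*[\(\[]?\s*\b(?:featuring|feat|ft)\b\.?\s+[^\)\]]*[\)\]]?", re.IGNORECASE)


def normalise(text: str) -> str:
    """Decode HTML entities, drop featured-artist credits, collapse whitespace, lowercase."""
    text = html.unescape(text)
    text = _FEAT.sub("", text)
    text = re.sub(r"\s+", " ", text)
    return text.strip().lower()


def duration_ok(source_seconds: float, spotify_ms: int, tolerance_s: float = 5.0) -> bool:
    """Accept a candidate only if its length is within the tolerance (± 5 s by default)."""
    return abs(source_seconds - spotify_ms / 1000) <= tolerance_s
```

The duration check is what keeps a normalised title from matching an extended mix or a live recording of the same song.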
Project structure
ytmusic-to-spotify-sync/
├── playlist_sync.py # Entry point
├── playlist_sync/
│ ├── __init__.py # Package version (canonical version source)
│ ├── cli.py # Commands and interactive menu
│ ├── config.py # Environment variables and paths
│ ├── models.py # Track, MatchResult, DiffResult dataclasses
│ ├── utils.py # Text normalisation and logging setup
│ ├── csv_manager.py # CSV I/O (BOM-aware)
│ ├── ytmusic_client.py # YT Music API wrapper
│ ├── spotify_client.py # Spotify API wrapper with rate-limit handling
│ ├── matcher.py # 3-pass track matching engine
│ ├── differ.py # Snapshot diff engine
│ ├── enricher.py # Metadata, audio features, classification
│ └── lastfm_client.py # Last.fm API wrapper (track + artist endpoints)
├── data/ # Created at runtime
│ ├── snapshots/ # JSON snapshots (latest.json + timestamped)
│ ├── playlist_enriched.csv # Full enriched output
│ └── unmatched.csv # Tracks that could not be matched
├── .env.example # Credential template
├── requirements.txt
├── CHANGELOG.md
├── CONTRIBUTING.md
└── SETUP.md # Detailed step-by-step setup guide
Output: enriched CSV
The sync produces data/playlist_enriched.csv with 50 columns:
| Column | Source |
|---|---|
| title, artist, album | YT Music |
| trackId, url, duration | YT Music |
| spotify_uri, spotify_url, spotify_duration_ms | Spotify match |
| isrc, isrc_enriched, explicit, album_release_date | Spotify metadata |
| popularity | Spotify track popularity (0–100); only with extended-access apps |
| artist_genres | Primary artist genre tags from Spotify /artists; only with extended-access apps |
| album_type | Album type (album / single / compilation) |
| track_number | Track position within the album |
| danceability, energy, valence | Spotify audio features; only with extended-access apps |
| tempo, key, mode, loudness | Spotify audio features |
| speechiness, acousticness | Spotify audio features |
| instrumentalness, liveness, time_signature | Spotify audio features |
| audio_features_fetched | Skip-flag: audio features endpoint already attempted |
| lastfm_playcount, lastfm_listeners, lastfm_tags | Last.fm track.getInfo |
| artist_tags | Last.fm artist.getInfo; much denser than track tags |
| tag_source | Which source filled artist_tags (lastfm_artist, …) |
| lastfm_attempted, lastfm_track_attempted | Skip-flags: Last.fm endpoints already attempted |
| spotify_metadata_attempted, spotify_genres_attempted | Skip-flags: Spotify endpoints already attempted |
| skip_reason | Why the matcher pre-filtered this track (e.g. no_album for YT Music tracks lacking album metadata); when set, no Spotify search was attempted |
| match_attempted | true once the matcher has run on this track. Tracks with match_attempted=true AND no spotify_uri are skipped on subsequent syncs unless --retry-unmatched is passed (or retry-unmatched is run standalone) |
| primary_genre | Single broad genre bucket (electronic, rock, soundtrack, …), derived locally from tags |
| mood | Multi-label mood (chill, epic, cinematic, …), derived locally from tags |
| match_method, match_confidence | Matching diagnostics |
| first_synced, last_synced | Sync timestamps |
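The interaction between skip_reason, match_attempted, and spotify_uri can be read as a single predicate. The sketch below is one interpretation of the column descriptions, not the tool's code: `should_search` and `_is_true` are hypothetical names, and it assumes that pre-filtered rows stay skipped even when a retry is requested.

```python
def _is_true(value) -> bool:
    """CSV cells arrive as strings, so accept 'true'/'True' as well as a real bool."""
    return str(value).strip().lower() == "true"


def should_search(row: dict, retry_unmatched: bool = False) -> bool:
    """Decide whether a row of the enriched CSV needs a Spotify search."""
    if row.get("skip_reason"):
        return False  # pre-filtered (e.g. no_album); assumed to stay skipped on retry
    if row.get("spotify_uri"):
        return False  # already matched
    if _is_true(row.get("match_attempted")) and not retry_unmatched:
        return False  # tried before and failed; wait for an explicit retry
    return True
```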
Genre & mood classification
primary_genre and mood are derived locally from the tag pool (artist_tags + lastfm_tags) — no API calls. This runs automatically at the end of every sync and lastfm command, and can be triggered standalone with python playlist_sync.py classify (or --force to re-bucket rows that already have values).
The classifier uses word-boundary tokenisation, so compound tags like deep house map to the electronic bucket via house, and post-rock correctly maps to rock (not soundtrack via ost).
Genre buckets, in priority order: soundtrack, classical, jazz, hip hop, metal, punk, country, blues, reggae, folk, electronic, ambient, rock, pop, rnb, indie, world.
Mood labels: chill, energetic, dark, sad, happy, epic, romantic, dreamy, aggressive, nostalgic, cinematic, ambient, instrumental.
Audio features reference
| Feature | Range | Meaning |
|---|---|---|
| danceability | 0.0 – 1.0 | Suitability for dancing (tempo, rhythm, beat strength) |
| energy | 0.0 – 1.0 | Intensity and activity (loud, fast, noisy = high) |
| valence | 0.0 – 1.0 | Musical positiveness (happy = high, sad/angry = low) |
| tempo | BPM | Estimated beats per minute |
| speechiness | 0.0 – 1.0 | Presence of spoken words |
| acousticness | 0.0 – 1.0 | Confidence the track is acoustic |
| instrumentalness | 0.0 – 1.0 | Likelihood of no vocal content |
| liveness | 0.0 – 1.0 | Presence of a live audience |
| loudness | dB | Overall loudness (typically −60 to 0) |
| key | 0 – 11 | Pitch class (0 = C, 1 = C♯, …, 11 = B) |
| mode | 0 or 1 | Modality (0 = minor, 1 = major) |
| time_signature | int | Estimated beats per bar |
Liked songs sync
python playlist_sync.py sync-likes
Mirrors your YT Music Liked Songs to Spotify's Liked Songs library (/me/tracks). Uses the same 3-pass matcher and reuses Spotify URIs already discovered during regular playlist sync, so a track present in both places is matched only once.
Maintains its own snapshot under data/snapshots/likes/ so likes-diff state never collides with playlist-diff state. Outputs:
- data/likes_enriched.csv: matched + unmatched likes with the same 49-column schema
- data/likes_unmatched.csv: likes that couldn't be matched
One-time re-authorization required on first launch after upgrading to 0.7.0 — the new scope (user-library-modify) needs your consent. spotipy refreshes the cached token automatically.
Recreated the Spotify playlist? Use repush
If you delete and recreate your Spotify playlist (new ID in .env), sync won't repopulate it — sync only pushes newly-matched tracks since the last snapshot, and an empty destination playlist isn't a "new match". Run:
python playlist_sync.py repush # idempotent — only adds URIs missing from the playlist
python playlist_sync.py repush --replace # wipe the playlist first, then add (cleans dupes)
python playlist_sync.py repush --dry-run # preview the push
Reads every spotify_uri from data/playlist_enriched.csv and adds the missing ones in batches of 100. No Spotify search calls — uses the URIs already on disk, so it's fast. Idempotent by default: running it twice does not duplicate tracks.
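The idempotent, batched behaviour described above can be sketched as a pure planning step. This is an assumption-laden illustration rather than the actual repush implementation: `plan_repush` is a hypothetical name, and the inputs stand in for the URIs read from the CSV and the URIs already in the playlist.

```python
def plan_repush(csv_uris: list[str], playlist_uris: list[str], batch_size: int = 100) -> list[list[str]]:
    """Return batches of URIs that are on disk but missing from the playlist."""
    existing = set(playlist_uris)
    seen: set[str] = set()
    missing: list[str] = []
    for uri in csv_uris:
        # Skip blank cells (unmatched rows), URIs already present, and repeats in the CSV.
        if uri and uri not in existing and uri not in seen:
            seen.add(uri)
            missing.append(uri)
    return [missing[i:i + batch_size] for i in range(0, len(missing), batch_size)]
```

A second run sees every URI in `playlist_uris`, so the plan is empty and nothing is duplicated.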
JSON export
python playlist_sync.py export # writes data/playlist_enriched.json
python playlist_sync.py export -o /tmp/my_data.json # custom path
Produces a structured JSON document — { exported_at, track_count, tracks: [{...}, ...] } — with all 49 enrichment fields per track. Easier to feed into jq, dashboards, or other programmatic tools than the CSV.
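As an example of consuming the export from Python, the sketch below loads the document and counts tracks per genre. The helper names `load_export` and `genre_counts` are hypothetical; the top-level `tracks` key and the `primary_genre` field are taken from the descriptions in this README.

```python
import json
from collections import Counter


def load_export(path: str = "data/playlist_enriched.json") -> dict:
    """Read the exported JSON document from disk."""
    with open(path, encoding="utf-8") as fh:
        return json.load(fh)


def genre_counts(document: dict) -> Counter:
    """Count tracks per primary_genre, grouping empty values under 'unknown'."""
    return Counter(track.get("primary_genre") or "unknown" for track in document["tracks"])
```

For instance, `genre_counts(load_export()).most_common(5)` would list the five largest genre buckets.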
Skip filter & debug stats
- Skip filter: YT Music tracks with no album metadata (typically YouTube uploads, fan edits, mixes) are filtered out before Spotify search and written to data/skipped.csv with skip_reason=no_album. Saves API time and keeps unmatched.csv focused on tracks that genuinely should match but didn't.
- Debug stats: every sync run writes data/debug/run_<timestamp>.json (and a rolling latest.json) with totals, diff deltas, match rate, method distribution, average confidence, skip-reason histogram, and a few unmatched/skipped examples. Useful for graphing sync quality over time or feeding into a monitoring dashboard.
Known limitations
- Spotify Developer Mode limits search to 10 results per request and imposes a daily quota. Use --limit N to spread large initial syncs over multiple days.
- /v1/audio-features, /v1/tracks, /v1/artists blocked: Spotify returns 403 on these endpoints for most standard (non-extended-access) app types since late 2024. Affected columns (danceability, energy, valence, tempo, popularity, artist_genres, …) stay empty unless your app passes Spotify's Extended Quota Mode review. The tool detects each 403, marks the relevant skip-flag, and stops retrying. Sync still works; Last.fm picks up the slack for genre/mood data.
- YT Music-exclusive tracks (unreleased, region-locked, user uploads) will not have Spotify matches; these are tracked in data/unmatched.csv.
- ytmusicapi OAuth is broken in v1.11.x; the tool uses browser-based authentication instead (stable, valid ~2 years).
- Last.fm tag coverage on niche music: track-level tags (lastfm_tags) are user-submitted and sparse for game OSTs, regional uploads, and remix edits. Artist-level tags (artist_tags) are far denser; the tool prefers them and falls back to track-level only for play-count/listener data.
Requirements
- Python 3.10+
- Spotify Developer account (free)
- YouTube Music account
pandas>=2.0.0
spotipy>=2.24.0
ytmusicapi>=1.8.0
python-dotenv>=1.0.0
tqdm>=4.66.0
Changelog & Contributing
- CHANGELOG.md — full version history
- CONTRIBUTING.md — development setup, semver policy, two-file update rule
License