Imports IMDB TSV files into a SQLite database
imdb-sqlite
Imports IMDB TSV files into a SQLite database.
It will fetch the files from IMDB unless you've already fetched them earlier.
The program relies on the following IMDB tab-separated files:
title.basics.tsv.gz
: Video titles such as movies, documentaries, TV series, episodes, etc.
name.basics.tsv.gz
: People in the entertainment business.
title.akas.tsv.gz
: Alternative names for titles, for different languages.
title.principals.tsv.gz
: Mapping of who participated in which title (movie / show).
title.episode.tsv.gz
: Season and episode numbers, for episodes of shows.
title.ratings.tsv.gz
: Current rating and vote count for the titles.
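To get a feel for what these files contain, they can be read directly from Python. The following is a minimal sketch and not the importer's own code: it assumes each file is gzip-compressed, has a header row, and marks missing values with the literal string `\N`. The function name `read_imdb_tsv` is hypothetical.

```python
import csv
import gzip
from typing import Dict, Iterator, Optional


def read_imdb_tsv(path: str) -> Iterator[Dict[str, Optional[str]]]:
    """Yield one dict per data row of a gzipped, tab-separated file.

    Hypothetical helper for illustration; not part of imdb-sqlite.
    """
    with gzip.open(path, mode="rt", encoding="utf-8", newline="") as handle:
        # QUOTE_NONE treats quote characters as ordinary text, so a title
        # containing a double quote is not mistaken for a quoted field.
        reader = csv.DictReader(handle, delimiter="\t", quoting=csv.QUOTE_NONE)
        for row in reader:
            # Assumption: the literal string \N stands for a missing value.
            yield {key: (None if value == r"\N" else value)
                   for key, value in row.items()}
```

Because the function is a generator, it reads one row at a time and does not load a whole file into memory, which matters for files with millions of records.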
Installation
pip install imdb-sqlite
Usage
usage: imdb-sqlite [-h] [--db DB] [--cache-dir CACHE_DIR] [--verbose]

Imports imdb tsv interface files into a new sqlite database. Fetches them from
imdb if not present on the machine.

optional arguments:
  -h, --help       show this help message and exit
  --db FILE        Connection URI for the database to import into. (default:
                   imdb.db)
  --cache-dir DIR  Download cache dir where the tsv files from imdb will be
                   stored before the import. (default: downloads)
  --verbose        Show database interaction (default: False)
Just run the program with no arguments, and you'll get a file named imdb.db
in the current working directory.
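Once the import has finished, the resulting file can be inspected with Python's built-in `sqlite3` module. The sketch below makes no assumption about the table names the importer creates. It asks SQLite which tables exist and counts the rows in each. The function name `table_row_counts` is hypothetical.

```python
import sqlite3
from typing import Dict


def table_row_counts(db_path: str = "imdb.db") -> Dict[str, int]:
    """Return a mapping of table name to row count for a SQLite file.

    Hypothetical helper for illustration; not part of imdb-sqlite.
    Note: sqlite3.connect creates an empty database if the file is missing.
    """
    connection = sqlite3.connect(db_path)
    try:
        names = [row[0] for row in connection.execute(
            "SELECT name FROM sqlite_master WHERE type = 'table' ORDER BY name"
        )]
        counts = {}
        for name in names:
            # Quote the identifier, since it cannot be passed as a parameter.
            quoted = '"' + name.replace('"', '""') + '"'
            counts[name] = connection.execute(
                f"SELECT COUNT(*) FROM {quoted}"
            ).fetchone()[0]
        return counts
    finally:
        connection.close()
```

Calling `table_row_counts()` in the directory where the import ran gives a quick check that every table was populated.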
Note: the import may take a long time, since there are millions of records to import.
Hints:
- Make sure the disk the database is written to has sufficient space. About 5 GiB is needed.
- Use an SSD to speed up the import.
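The free-space hint can be checked before starting a long import. This is a small sketch using the standard library. The 5 GiB threshold comes from the hint above, while the function name `has_enough_space` is hypothetical.

```python
import shutil

# About 5 GiB, per the hint above.
REQUIRED_BYTES = 5 * 1024 ** 3


def has_enough_space(directory: str = ".",
                     required_bytes: int = REQUIRED_BYTES) -> bool:
    """Return True if the disk holding `directory` has enough free space.

    Hypothetical helper for illustration; not part of imdb-sqlite.
    """
    return shutil.disk_usage(directory).free >= required_bytes


if __name__ == "__main__":
    if not has_enough_space():
        print("Warning: less than 5 GiB free in the current directory.")
```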