Skip to main content

A database builder for digital preservation information.

Project description

Preservation Status Database Builder

Returns the preservation status of a Crossref DOI matched against mainstream digital preservation platforms.

license activity Code style: black

Django Git GitHub Linux Python

This application allows you to build a database of digital preservation sources and then to match a DOI against common digital preservation systems.

Installation

The easiest install is via pip:

pip install preservation-database

Then add "preservationdatabase" (no hyphen) to your list of INSTALLED_APPS.

Usage

export DJANGO_SETTINGS_MODULE=import_settings.settings

Usage: python -m preservationdatabase.cli [OPTIONS] COMMAND [ARGS]...

Options:
  --help  Show this message and exit.

Commands:
    clear-cache                    Clear the import cache
    import-all                     Download and import all data (excluding...
    import-cariniana               Download and import data from Cariniana
    import-clockss                 Download and import data from CLOCKSS
    import-hathi                   Import data from Hathi (requires local...
    import-internet-archive        Import data from Internet Archive...
    import-internet-archive-items  Import item data from Internet Archive
    import-issnl                   Import ISSN-L mappings
    import-lockss                  Download and import data from LOCKSS
    import-ocul                    Import data from Ocul (requires local...
    import-pkp                     Download and import data from PKP's...
    import-portico                 Download and import data from Portico
    random-samples                 Return random samples that occur in and...
    show-archives                  Clear the import cache
    show-cache                     Show last fill date/times and cache status
    show-issn                      Show preservation items that match an ISSN
    show-preservation              Determine whether a DOI is preserved
    stamp-cache-today              Mark the latest imports as today

Features

  • Cariniana import.
  • CLOCKSS import.
  • HathiTrust import.
  • Internet Archive import.
  • Internet Archive item-level import.
  • LOCKSS import.
  • PKP PLN import.
  • Portico import.
  • Crossref DOI lookup.

First-Run Setup

First, copy example_settings.py to settings.py and check settings.py to ensure that the database you want to use is set correctly. The default is db.sqlite. You should carefully read and check all of settings.py.

DATABASES = {
    'default': {
        'ENGINE': 'django.db.backends.sqlite3',
        'NAME': BASE_DIR / 'db.sqlite3',
    }
}

Next, run the database build commands:

python3 manage.py makemigrations
python3 manage.py makemigrations preservation-database
python3 manage.py migrate 

You should then have a working database into which you can import new preservation data.

Archive Notes

Internet Archive

The Internet Archive gives a KBART file for the Keepers Registry that we use as a primary ingest source: https://archive.org/details/ia-keepers-registry-kbart. However, this source is not the total coverage of the Internet Archive. However, sadly, the Internet Archive snapshots do not contain external identifiers and the container-level snapshots do not present coverage extent. While it is possible to download the entire 217GB FATCAT database snapshot, this will not be viable for many users. We have therefore stuck with the KBART file that Keepers uses. Extent of coverage in the Internet Archive may, therefore, be under-reported.

Credits

© Crossref 2023

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

preservation-database-0.0.78.tar.gz (27.1 kB view details)

Uploaded Source

Built Distribution

preservation_database-0.0.78-py3-none-any.whl (29.8 kB view details)

Uploaded Python 3

File details

Details for the file preservation-database-0.0.78.tar.gz.

File metadata

File hashes

Hashes for preservation-database-0.0.78.tar.gz
Algorithm Hash digest
SHA256 87f5d25b1b8e9ea8e9c6ada4263225ad1b4a9b629779ab12e994e92df1f2fa4b
MD5 b9986d07b6f6842ff1c5e0d4015166d4
BLAKE2b-256 63b901934e4b07568b12aa7579b3fd76eab19e79249204d6bc3e6887d723a345

See more details on using hashes here.

File details

Details for the file preservation_database-0.0.78-py3-none-any.whl.

File metadata

File hashes

Hashes for preservation_database-0.0.78-py3-none-any.whl
Algorithm Hash digest
SHA256 737746c38ecb1a14affec5d7e6698a6e57fea6fa5cc61e75e0707b4a8a2ec75d
MD5 4aec9f596f5514f14217caed16615d4b
BLAKE2b-256 07c46f7c94483387a698cdacafb158f871277211a881ddb2c0f2faca5ad44fb4

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page