Skip to main content

Software Heritage Tarball Loader

Project description

SWH Tarball Loader

The Software Heritage Tarball Loader is in charge of ingesting the directory representation of the tarball into the Software Heritage archive.

Sample configuration

The loader's configuration will be taken from the default configuration file: ~/.config/swh/loader/tar.yml (you can choose a different path by setting the SWH_CONFIG_FILENAME environment variable).

This file holds information for the loader to work, including celery configuration:

working_dir: /home/storage/tmp/
storage:
  cls: remote
  args:
    url: http://localhost:5002/
celery:
task_modules:
    - swh.loader.tar.tasks
task_queues:
    - swh.loader.tar.tasks.LoadTarRepository

Local

Load local tarball directly from code or python3's toplevel:

# Fill in those
repo = '8sync.tar.gz'
tarpath = '/home/storage/tar/%s' % repo
origin = {'url': 'file://%s' % repo, 'type': 'tar'}
visit_date = 'Tue, 3 May 2017 17:16:32 +0200'
last_modified = 'Tue, 10 May 2016 16:16:32 +0200'
import logging
logging.basicConfig(level=logging.DEBUG)

from swh.loader.tar.tasks import load_tar
load_tar(origin=origin, visit_date=visit_date,
         last_modified=last_modified)

Remote

Load remote tarball is the same sample:

url = 'https://ftp.gnu.org/gnu/8sync/8sync-0.1.0.tar.gz'
origin = {'url': url, 'type': 'tar'}
visit_date = 'Tue, 3 May 2017 17:16:32 +0200'
last_modified = '2016-04-22 16:35'
import logging
logging.basicConfig(level=logging.DEBUG)

from swh.loader.tar.tasks import load_tar
load_tar(origin=origin, visit_date=visit_date,
         last_modified=last_modified)

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

swh.loader.tar-0.0.41.tar.gz (11.4 kB view details)

Uploaded Source

Built Distribution

swh.loader.tar-0.0.41-py3-none-any.whl (27.9 kB view details)

Uploaded Python 3

File details

Details for the file swh.loader.tar-0.0.41.tar.gz.

File metadata

  • Download URL: swh.loader.tar-0.0.41.tar.gz
  • Upload date:
  • Size: 11.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.15.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.2.0 requests-toolbelt/0.9.1 tqdm/4.36.1 CPython/3.5.3

File hashes

Hashes for swh.loader.tar-0.0.41.tar.gz
Algorithm Hash digest
SHA256 52472c3e3b6d3ef6130c28c17bfec2c65d956f1e43a63520cdbf2b8308c4a3f2
MD5 73acbbf8315a85c853560c08f61e0d7e
BLAKE2b-256 21ae72e84e81bb76154878153fc88ceb36022c2806a4b50a2234479f5ed15f6d

See more details on using hashes here.

File details

Details for the file swh.loader.tar-0.0.41-py3-none-any.whl.

File metadata

  • Download URL: swh.loader.tar-0.0.41-py3-none-any.whl
  • Upload date:
  • Size: 27.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.15.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.2.0 requests-toolbelt/0.9.1 tqdm/4.36.1 CPython/3.5.3

File hashes

Hashes for swh.loader.tar-0.0.41-py3-none-any.whl
Algorithm Hash digest
SHA256 31013770d2388c14c34c276bb0fa47e5167c3c6c86c3ab9b228e8c0d1e8429d6
MD5 68b1a8cf38b2e91310d581bf7934b538
BLAKE2b-256 68ba9d50c4c40f156a66ca21fff516930475e51efe7e646a2e1b51c183ae1fac

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page