Software Heritage git loader

swh-loader-git

The Software Heritage Git Loader is a tool and a library that walks a local Git repository and injects into the Software Heritage (SWH) dataset every contained file that was not already known.
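
The idea of injecting only files "that weren't known before" can be illustrated with a minimal sketch. It assumes that contents are identified by their git blob id (the SHA-1 of a "blob <size>" header followed by the data) and that the set of already-known ids is available in memory. The function names `git_blob_id` and `select_unknown` are hypothetical and are not part of swh-loader-git; the real loader reads objects through dulwich and talks to the storage service instead.

```python
import hashlib


def git_blob_id(data: bytes) -> str:
    """Return the git object id (hex SHA-1) of a blob holding ``data``."""
    # git hashes the header "blob <size>\0" followed by the raw content.
    header = b"blob %d\x00" % len(data)
    return hashlib.sha1(header + data).hexdigest()


def select_unknown(contents, known_ids):
    """Return {blob_id: data} for every content whose id is not in ``known_ids``.

    Hypothetical helper: a stand-in for asking the archive which objects
    are missing before sending them.
    """
    unknown = {}
    for data in contents:
        blob_id = git_blob_id(data)
        if blob_id not in known_ids:
            unknown[blob_id] = data
    return unknown


# Pretend the empty file is already archived; only the second content is new.
known = {git_blob_id(b"")}
new = select_unknown([b"", b"hello\n"], known)
print(sorted(new))
```

Because ids are derived from the content itself, two identical files found in different repositories map to the same id and only need to be stored once.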

The main entry points are:

  • :class:swh.loader.git.loader.GitLoader, the main loader, which can ingest the contents of either a local or a remote git repository. This is the main implementation deployed in production.

  • :class:swh.loader.git.from_disk.GitLoaderFromDisk, which ingests only a local git clone.

  • :class:swh.loader.git.loader.GitLoaderFromArchive, which ingests a git repository wrapped in an archive.

  • :class:swh.loader.git.directory.GitCheckoutLoader, which ingests a git tree at a specific commit, branch or tag.

License

This program is free software: you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation, either version 3 of the License, or (at your option) any later version.

This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details.

See the top-level LICENSE file for the full text of the GNU General Public License distributed with this program.

Dependencies

Runtime

  • python3
  • python3-dulwich
  • python3-retrying
  • python3-swh.core
  • python3-swh.model
  • python3-swh.storage
  • python3-swh.scheduler

Test

  • python3-nose

Requirements

  • implementation language: Python3
  • coding guidelines: conform to PEP8
  • Git access: via dulwich

CLI Run

You can run the loader directly against a remote origin (loader) or an origin on disk (from_disk) by calling:

swh loader -C <config-file> run git <git-repository-url>

To load an origin on disk, use "git_disk" in place of "git".
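
The command line above can also be assembled from a script. The following is a minimal sketch that only builds the argument list and takes the two loader names ("git" and "git_disk") from the text above. The helper `build_loader_command` and the example origin URL are hypothetical, and actually running the command assumes the `swh` CLI is installed.

```python
import shlex


def build_loader_command(config_file, origin, from_disk=False):
    """Build the argument list for the `swh loader ... run` command.

    Hypothetical helper: selects "git_disk" for an origin on disk and
    "git" otherwise, as described in the text above.
    """
    loader_type = "git_disk" if from_disk else "git"
    return ["swh", "loader", "-C", config_file, "run", loader_type, origin]


# Example origin URL is a placeholder, not a real repository.
cmd = build_loader_command("/tmp/git.yml", "https://example.org/repo.git")
print(" ".join(shlex.quote(arg) for arg in cmd))

# To execute it (requires the swh CLI to be installed):
#   import subprocess
#   subprocess.run(cmd, check=True)
```

Passing the arguments as a list rather than a single shell string avoids quoting problems when the origin contains spaces or other special characters.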

Configuration sample

/tmp/git.yml:

storage:
  cls: remote
  args:
    url: http://localhost:5002/
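
For scripted setups, the sample above can be generated rather than written by hand. This is a minimal sketch using only the standard library. The helper `write_loader_config` is hypothetical, and the only configuration keys it emits are the ones shown in the sample; the storage URL defaults to the sample's value.

```python
import tempfile
from pathlib import Path

# Template reproducing the sample configuration shown above.
CONFIG_TEMPLATE = """\
storage:
  cls: remote
  args:
    url: {storage_url}
"""


def write_loader_config(path, storage_url="http://localhost:5002/"):
    """Write the sample loader configuration to ``path`` and return its text."""
    text = CONFIG_TEMPLATE.format(storage_url=storage_url)
    Path(path).write_text(text)
    return text


# Write to a temporary directory so the example leaves nothing behind.
with tempfile.TemporaryDirectory() as tmp:
    config_path = Path(tmp) / "git.yml"
    text = write_loader_config(config_path)
    written = config_path.read_text()

print(written)
```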

This version

2.5.0
