Skip to main content

Software Heritage git loader

Project description

swh-loader-git

The Software Heritage Git Loader is a tool and a library to walk a local Git repository and inject into the SWH dataset all contained files that weren't known before.

License

This program is free software: you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation, either version 3 of the License, or (at your option) any later version.

This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details.

See top-level LICENSE file for the full text of the GNU General Public License along with this program.

Dependencies

Runtime

  • python3
  • python3-dulwich
  • python3-retrying
  • python3-swh.core
  • python3-swh.model
  • python3-swh.storage
  • python3-swh.scheduler

Test

  • python3-nose

Requirements

  • implementation language, Python3
  • coding guidelines: conform to PEP8
  • Git access: via dulwich

Configuration

You can run the loader from a remote origin (loader) or from an origin on disk (from_disk) directly by calling:

python3 -m swh.loader.git.{loader,from_disk}

Location

Both tools expect a configuration file.

Either one of the following location:

  • /etc/softwareheritage/
  • ~/.config/swh/
  • ~/.swh/

Note: Will call that location $SWH_CONFIG_PATH

Configuration sample

Respectively the loader from a remote (git.yml) and the loader from a disk (git-disk.yml), $SWH_CONFIG_PATH/loader/git{-disk}.yml:

storage:
  cls: remote
  args:
    url: http://localhost:5002/

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

swh.loader.git-0.0.55.tar.gz (33.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

swh.loader.git-0.0.55-py3-none-any.whl (49.6 kB view details)

Uploaded Python 3

File details

Details for the file swh.loader.git-0.0.55.tar.gz.

File metadata

  • Download URL: swh.loader.git-0.0.55.tar.gz
  • Upload date:
  • Size: 33.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.22.0 setuptools/42.0.2 requests-toolbelt/0.9.1 tqdm/4.40.0 CPython/3.7.3

File hashes

Hashes for swh.loader.git-0.0.55.tar.gz
Algorithm Hash digest
SHA256 4c1edd5984e0cd9a6ff115b10a8255c7776e3e769ddaf1041f644b4dcf03cca3
MD5 ae2e11f84cf42f91d932bfaf30451f23
BLAKE2b-256 118f71f4998ec13fdd74626f83cfa26680db7314e78cf2d6d9771c5a42f1cd78

See more details on using hashes here.

File details

Details for the file swh.loader.git-0.0.55-py3-none-any.whl.

File metadata

  • Download URL: swh.loader.git-0.0.55-py3-none-any.whl
  • Upload date:
  • Size: 49.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.22.0 setuptools/42.0.2 requests-toolbelt/0.9.1 tqdm/4.40.0 CPython/3.7.3

File hashes

Hashes for swh.loader.git-0.0.55-py3-none-any.whl
Algorithm Hash digest
SHA256 2179c26bad1f2c8707ed82fdf7afa71a1430b6c990778fe0965076d6a59184f2
MD5 f2ea6e50856832bd98d3343a932eeb1c
BLAKE2b-256 6a9e8dcbdd926a51e8b179b97abcb5129f0e04dd06d8058e98ab4e442cd76d6f

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page