
Software Heritage git loader

Project description


The Software Heritage Git Loader is a tool and a library to walk a local Git repository and inject into the SWH dataset all contained files that weren't known before.


This program is free software: you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation, either version 3 of the License, or (at your option) any later version.

This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details.

See the top-level LICENSE file for the full text of the GNU General Public License distributed along with this program.



Runtime dependencies:

  • python3
  • python3-dulwich
  • python3-retrying
  • python3-swh.core
  • python3-swh.model
  • python3-swh.scheduler


Test dependencies:

  • python3-nose


  • implementation language: Python 3
  • coding guidelines: conform to PEP 8
  • Git access: via dulwich


You can run the loader against a remote origin (loader) or against an origin on disk (from_disk) by calling the corresponding module directly:

python3 -m swh.loader.git.{loader,from_disk}
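
The brace notation above is shorthand for two alternative module names, not something to type literally. As a minimal sketch, assuming only the two module paths given above, the helper below builds the argument list for either variant. The name loader_command is hypothetical and not part of swh.loader.git, and any further command-line arguments the modules may accept are not covered here.

```python
import sys

# The two entry-point modules named in the text above.
LOADER_MODULES = {
    "loader": "swh.loader.git.loader",        # load from a remote origin
    "from_disk": "swh.loader.git.from_disk",  # load from an origin on disk
}


def loader_command(kind, python=sys.executable):
    """Return the argv list that runs the chosen loader module.

    `loader_command` is a hypothetical helper for illustration only.
    """
    if kind not in LOADER_MODULES:
        raise ValueError(f"unknown loader kind: {kind!r}")
    return [python, "-m", LOADER_MODULES[kind]]


if __name__ == "__main__":
    # Print both command lines, one per loader variant.
    for kind in LOADER_MODULES:
        print(" ".join(loader_command(kind, python="python3")))
```

The returned list can be passed to subprocess.run once a configuration file is in place.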


Both tools expect a configuration file, located in one of the following directories:

  • /etc/softwareheritage/
  • ~/.config/swh/
  • ~/.swh/

Note: this location is referred to as $SWH_CONFIG_PATH below.
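
To check which of these directories actually holds a given configuration file, a lookup along the following lines may help. This is only a sketch: find_config is a hypothetical helper, and the search order used here is an assumption, since this document does not say in which order the tools consult the directories.

```python
from pathlib import Path

# Candidate directories listed above. The order in which the real tools
# search them is not stated in this document; this order is an assumption.
CANDIDATE_DIRS = ["/etc/softwareheritage", "~/.config/swh", "~/.swh"]


def find_config(relative_name, candidate_dirs=CANDIDATE_DIRS):
    """Return the first existing config file as a Path, or None.

    `find_config` is a hypothetical helper for illustration only.
    """
    for directory in candidate_dirs:
        # expanduser() resolves a leading "~" to the home directory.
        candidate = Path(directory).expanduser() / relative_name
        if candidate.is_file():
            return candidate
    return None
```

For example, find_config("loader/git.yml") returns the path of the first matching file, or None when no directory contains one.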

Configuration sample

The loader from a remote origin reads $SWH_CONFIG_PATH/loader/git.yml and the loader from disk reads $SWH_CONFIG_PATH/loader/git-disk.yml:

  cls: remote
    url: http://localhost:5002/
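
The file name pattern git{-disk}.yml given above stands for two distinct files. As a small sketch, assuming the directory layout described above, the hypothetical helper loader_config_path (not part of swh.loader.git) maps each loader variant to its configuration file:

```python
from pathlib import Path


def loader_config_path(config_root, from_disk=False):
    """Return the config file path for the remote or the on-disk loader.

    `config_root` plays the role of $SWH_CONFIG_PATH; this helper is
    hypothetical and only illustrates the naming scheme.
    """
    name = "git-disk.yml" if from_disk else "git.yml"
    return Path(config_root) / "loader" / name
```

Calling loader_config_path("/etc/softwareheritage", from_disk=True) yields the path of git-disk.yml under the loader subdirectory.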

Download files

Files for swh.loader.git, version 0.0.58

Filename                                 Size      File type   Python version
swh.loader.git-0.0.58-py3-none-any.whl   50.0 kB   Wheel       py3
swh.loader.git-0.0.58.tar.gz             33.9 kB   Source      None
