Software Heritage git loader
Project description
swh-loader-git
The Software Heritage Git Loader is a tool and a library to walk a local Git repository and inject into the SWH dataset all contained files that weren't known before.
The main entry points are:
-
:class:
swh.loader.git.loader.GitLoaderfor the main loader which can ingest either local or remote git repository's contents. This is the main implementation deployed in production. -
:class:
swh.loader.git.from_disk.GitLoaderFromDiskwhich ingests only local git clone repository. -
:class:
swh.loader.git.loader.GitLoaderFromArchivewhich ingests a git repository wrapped in an archive. -
:class:
swh.loader.git.directory.GitCheckoutLoaderwhich ingests a git tree at a specific commit, branch or tag.
License
This program is free software: you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation, either version 3 of the License, or (at your option) any later version.
This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details.
See top-level LICENSE file for the full text of the GNU General Public License along with this program.
Dependencies
Runtime
- python3
- python3-dulwich
- python3-retrying
- python3-swh.core
- python3-swh.model
- python3-swh.storage
- python3-swh.scheduler
Test
- python3-nose
Requirements
- implementation language, Python3
- coding guidelines: conform to PEP8
- Git access: via dulwich
CLI Run
You can run the loader from a remote origin (loader) or from an origin on disk (from_disk) directly by calling:
swh loader -C <config-file> run git <git-repository-url>
or "git_disk".
Configuration sample
/tmp/git.yml:
storage:
cls: remote
args:
url: http://localhost:5002/
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file swh.loader.git-2.5.0.tar.gz.
File metadata
- Download URL: swh.loader.git-2.5.0.tar.gz
- Upload date:
- Size: 83.8 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.7.3
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
2a1c19d8f8167451aebd0bf87677bb05e53405eb9566d5084a242997ae0b7529
|
|
| MD5 |
ab5f0883c44b28d93862d9bc01317270
|
|
| BLAKE2b-256 |
f8c4273de38b19ef2da72e6b98bec9fdb2604ce3c144b54fbcd8515b61707731
|
File details
Details for the file swh.loader.git-2.5.0-py3-none-any.whl.
File metadata
- Download URL: swh.loader.git-2.5.0-py3-none-any.whl
- Upload date:
- Size: 82.1 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.7.3
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
183cc0ceef4c5e0f6dd473722fc8e1c7305f35e6cbcd08b41be0d487e523fd0f
|
|
| MD5 |
5397133333db6442251e287748e02fcb
|
|
| BLAKE2b-256 |
d0950accb6fb861c3b323ff9393d42135025e0c1b8e1d2e593f22f607436b7f8
|