Skip to main content

Software Heritage archive modification tools

Project description

It happens for Software Heritage to record content that either should not have been archived or should no longer be archived. swh-alter holds the tools necessary to prune or make inaccessible content from Software Heritage archive.

Usage

First, create a configuration file, for example in ~/.config/swh/alter.yml, like the following:

storage:
  cls: postgresql
  db: "service=…"
  objstorage:
    cls: memory

graph:
  cls: remote
  url: "http://granet.internal.softwareheritage.org:5009"

Then, the following command line will print a list of objects that can be safely removed from the archive to take down the given origins:

$ SWH_CONFIG_FILENAME=~/config/swh.alter.yml \
  swh alter remove --dry-run \
      https://gitlab.softwareheritage.org/swh/devel/swh-alter.git \
      https://gitlab.softwareheritage.org/swh/devel/swh-py-template.git

Only -dry-run is supported currently. Implementing actual removal is still pending.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

swh.alter-0.0.1.tar.gz (42.7 kB view hashes)

Uploaded Source

Built Distribution

swh.alter-0.0.1-py3-none-any.whl (37.9 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page