Skip to main content

Software Heritage provenance

Project description

Software Heritage - Provenance

This service provide a provenance query service for the Software Heritage Archive. Provenance is the ability to ask for a given object stored in the Archive: “where does it come from?”

This question generally does not have a simple and unambiguous answer. It can be, among other:

  • what it the oldest revision in which this object has been found?

  • what is the “better” origin in which this object can be found?

Answering this kind of question requires querying the Merkle DAG on which the Software Heritage Archive is built with complex queries, mostly from the bottom to the top (aka from Content to Origin objects).

The idea is to use both the compressed graph representation of the Archive (swh-graph) and a preprocessed provennce index to speed up some of the provenance queries.

API Description

For a single object:

Input: SWHID (core SWHID of an artifact found in the use code base)

Output: SWHID or origin URI where input SWHID was found + context information
    Context information, a subset of:
        snapshot (snp SWHID)
        release (rel)
        revision (rev)
        path (filesystem-style path)

Non-functional requirements: TODO something about the fact that both the
answer and the context information should be "as high as possible" in the
graph

Public API

GET /whereis/:swhid

GET /whereis_all/

POST /whereare/TODO
  :swhids

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

swh_provenance-0.1.0.tar.gz (28.8 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

swh.provenance-0.1.0-py3-none-any.whl (29.6 kB view details)

Uploaded Python 3

File details

Details for the file swh_provenance-0.1.0.tar.gz.

File metadata

  • Download URL: swh_provenance-0.1.0.tar.gz
  • Upload date:
  • Size: 28.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.11.7

File hashes

Hashes for swh_provenance-0.1.0.tar.gz
Algorithm Hash digest
SHA256 2894350fdba6d8ae42824592ddbc599985cccd08aafc73f9348965ee552b4b3c
MD5 33e6462f8c62dd5ecd95aba4f092d5de
BLAKE2b-256 d79096b19458fc83e78b96d69e8aecaa50819b908a4fae2c7e7f2ed801b6a61f

See more details on using hashes here.

File details

Details for the file swh.provenance-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: swh.provenance-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 29.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.11.7

File hashes

Hashes for swh.provenance-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 21e09272a53baabbf57e2d4d4289f867f609a2765f8710df1b1d90dd5eef52a6
MD5 778f2a69607aff76fd8624ff446cd78a
BLAKE2b-256 4a793ab578e39e9c0ab2b9db85bf79c5ff4fd6830c5a58e392fe5b345793d6f7

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page