A tool for uploading RDF data to SPARQL endpoints

RDF Uploader

When you work with RDF data across several triple stores, you often need to upload the same knowledge graph to each of them. Most stores claim to be standards-based, and two standards cover uploads: the SPARQL Graph Store Protocol and SPARQL Update. In practice, stores differ in their exact endpoint URLs, their handling of named graphs, and their authentication, which makes it a pain to juggle multiple proprietary tools.

rdf_uploader is a single tool that uploads RDF data to a variety of triple stores. It is easy to use and has no dependencies on RDFLib or any store-specific libraries: it talks to each store over plain HTTP, so you can avoid dealing with multiple tools and their quirks.

Features

Ingest RDF data into SPARQL endpoints using asynchronous operations
Support for multiple RDF stores (MarkLogic, Blazegraph, Neptune, RDFox, and Stardog)
Authentication support for secure endpoints
Content type detection and customization
Clear status outputs after each upload operation
Concurrent uploads with configurable limits

Installation

From PyPI

pip install rdf-uploader

Usage

Basic Usage

Upload a single RDF file to a SPARQL endpoint:

rdf-uploader path/to/file.ttl --endpoint http://localhost:3030/dataset/sparql

You can also omit the endpoint URL and use environment variables:

# Set the endpoint URL in an environment variable
export RDF_ENDPOINT=http://localhost:3030/dataset/sparql

# Then run without the --endpoint parameter
rdf-uploader path/to/file.ttl

Or specify the endpoint type to use a type-specific environment variable:

# Set endpoint-specific URL
export MARKLOGIC_ENDPOINT=http://marklogic-server:8000/v1/graphs

# Use the endpoint type to determine which environment variable to use
rdf-uploader path/to/file.ttl --type marklogic

Programmatic Usage

You can also use the library programmatically in your Python code:

import asyncio
from pathlib import Path

from rdf_uploader.endpoints import EndpointType
from rdf_uploader.uploader import upload_rdf_file


async def main() -> None:
    # The endpoint URL, username, and password can be provided directly
    # or read from environment variables if not specified
    await upload_rdf_file(
        file_path=Path("path/to/file.ttl"),
        endpoint="http://localhost:3030/dataset/sparql",
        endpoint_type=EndpointType.GENERIC,
        username="myuser",
        password="mypass",
    )

    # Using environment variables
    # export RDF_ENDPOINT=http://localhost:3030/dataset/sparql
    # export RDF_USERNAME=myuser
    # export RDF_PASSWORD=mypass
    await upload_rdf_file(
        file_path=Path("path/to/file.ttl"),
        endpoint_type=EndpointType.GENERIC,
    )


# upload_rdf_file is awaited, so it has to run inside an event loop
asyncio.run(main())

Multiple Files

Upload multiple RDF files:

rdf-uploader upload path/to/file1.ttl path/to/file2.n3 --endpoint http://localhost:3030/dataset/sparql

Specify Endpoint Type

rdf-uploader upload path/to/file.ttl --endpoint http://localhost:3030/dataset/sparql --type fuseki

Available endpoint types:

marklogic
neptune
blazegraph
rdfox
stardog

Specify Named Graph

rdf-uploader upload path/to/file.ttl --endpoint http://localhost:3030/dataset/sparql --graph http://example.org/graph
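
To make the named graph option more concrete, here is a minimal sketch of how the two standards mentioned in the introduction address a named graph at the HTTP level. It only builds the requests with the standard library and does not send them. The function names and endpoint URLs are hypothetical, and this is not necessarily how rdf_uploader constructs its requests internally.

```python
from urllib.parse import urlencode
from urllib.request import Request


def graph_store_request(endpoint: str, graph_uri: str, turtle: str) -> Request:
    """Graph Store Protocol: the target graph is a ?graph= query parameter
    and the request body is the RDF document itself."""
    url = f"{endpoint}?{urlencode({'graph': graph_uri})}"
    return Request(
        url,
        data=turtle.encode("utf-8"),
        headers={"Content-Type": "text/turtle"},
        method="POST",  # POST merges into the graph; PUT would replace it
    )


def sparql_update_request(endpoint: str, graph_uri: str, ntriples: str) -> Request:
    """SPARQL Update: the target graph is named inside the update string.
    Assumes the triples use full IRIs, so no prefix handling is needed."""
    update = f"INSERT DATA {{ GRAPH <{graph_uri}> {{ {ntriples} }} }}"
    return Request(
        endpoint,
        data=update.encode("utf-8"),
        headers={"Content-Type": "application/sparql-update"},
        method="POST",
    )


if __name__ == "__main__":
    triple = "<http://example.org/s> <http://example.org/p> <http://example.org/o> ."
    # Hypothetical endpoint URLs, for illustration only
    gsp = graph_store_request("http://localhost:3030/dataset/data", "http://example.org/graph", triple)
    upd = sparql_update_request("http://localhost:3030/dataset/update", "http://example.org/graph", triple)
    print(gsp.get_method(), gsp.full_url)
    print(upd.data.decode("utf-8"))
```

The difference in where the graph name travels (URL parameter versus request body) is one of the nuances that a single upload tool has to hide from the user.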

Authentication

For endpoints that require authentication:

rdf-uploader upload path/to/file.ttl --endpoint http://localhost:3030/dataset/sparql --username myuser --password mypass

You can also set authentication credentials using environment variables:

export RDF_USERNAME=myuser
export RDF_PASSWORD=mypass
rdf-uploader upload path/to/file.ttl --endpoint http://localhost:3030/dataset/sparql

For endpoint-specific credentials, use the endpoint type as a prefix:

export MARKLOGIC_USERNAME=mluser
export MARKLOGIC_PASSWORD=mlpass
rdf-uploader upload path/to/file.ttl --endpoint http://marklogic-server:8000/v1/graphs --type marklogic

Content Type

Specify the content type for the RDF data:

rdf-uploader upload path/to/file.ttl --endpoint http://localhost:3030/dataset/sparql --content-type "text/turtle"

If not specified, the content type is automatically detected based on the file extension:

.ttl, .turtle: text/turtle
.nt: application/n-triples
.n3: text/n3
.nq, .nquads: application/n-quads
.rdf, .xml: application/rdf+xml
.jsonld: application/ld+json
.json: application/rdf+json
.trig: application/trig
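
The mapping above can be sketched as a small lookup function. The extension table is taken from the list; the function name, the case-insensitive matching, and the error raised for unknown extensions are assumptions made for illustration and may differ from what rdf_uploader actually does.

```python
from pathlib import Path
from typing import Optional

# Extension-to-content-type table, copied from the list above
CONTENT_TYPES = {
    ".ttl": "text/turtle",
    ".turtle": "text/turtle",
    ".nt": "application/n-triples",
    ".n3": "text/n3",
    ".nq": "application/n-quads",
    ".nquads": "application/n-quads",
    ".rdf": "application/rdf+xml",
    ".xml": "application/rdf+xml",
    ".jsonld": "application/ld+json",
    ".json": "application/rdf+json",
    ".trig": "application/trig",
}


def guess_content_type(path: str, override: Optional[str] = None) -> str:
    """Return the explicit override if given, else look up the file suffix."""
    if override:
        return override
    # Assumption: suffixes are matched case-insensitively
    suffix = Path(path).suffix.lower()
    if suffix not in CONTENT_TYPES:
        # Assumption: unknown extensions are reported rather than guessed
        raise ValueError(f"Cannot detect content type for {path!r}")
    return CONTENT_TYPES[suffix]


if __name__ == "__main__":
    print(guess_content_type("path/to/file.ttl"))
    print(guess_content_type("path/to/data.json"))
    print(guess_content_type("path/to/file.txt", override="text/turtle"))
```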

Control Concurrency

Limit the number of concurrent uploads:

rdf-uploader upload path/to/*.ttl --endpoint http://localhost:3030/dataset/sparql --concurrent 10
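
For readers calling the library from their own async code, a concurrency limit like the one above can be sketched with asyncio.Semaphore. This is a generic pattern under stated assumptions, not a description of rdf_uploader's internals: upload_all and fake_upload are hypothetical names, and fake_upload stands in for a real upload coroutine so the sketch runs without a server.

```python
import asyncio
from typing import Awaitable, Callable, List, Sequence


async def upload_all(
    paths: Sequence[str],
    upload_one: Callable[[str], Awaitable[str]],
    limit: int = 5,
) -> List[str]:
    """Run upload_one for every path, with at most `limit` running at once."""
    semaphore = asyncio.Semaphore(limit)

    async def bounded(path: str) -> str:
        # Each task waits here until one of the `limit` slots is free
        async with semaphore:
            return await upload_one(path)

    # gather returns results in the same order as the input paths
    return await asyncio.gather(*(bounded(p) for p in paths))


async def demo():
    active = 0
    peak = 0

    async def fake_upload(path: str) -> str:
        # Stand-in for a real upload; records how many run concurrently
        nonlocal active, peak
        active += 1
        peak = max(peak, active)
        await asyncio.sleep(0.01)
        active -= 1
        return path

    paths = [f"file{i}.ttl" for i in range(20)]
    results = await upload_all(paths, fake_upload, limit=10)
    return results, peak


if __name__ == "__main__":
    results, peak = asyncio.run(demo())
    print(f"uploaded {len(results)} files, peak concurrency {peak}")
```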

Verbose Mode

Enable verbose output to see detailed information about each batch upload, including the number of triples per batch and server response codes:

rdf-uploader upload path/to/file.ttl --endpoint http://localhost:3030/dataset/sparql --verbose

Help

Get help on available commands and options:

rdf-uploader --help
rdf-uploader upload --help

Environment Variables

You can configure RDF Uploader using environment variables, which is especially useful for CI/CD pipelines or when working with multiple endpoints. If an environment variable is not set, the library also reads its value from a .envrc file in the current working directory.

Endpoint URLs

# Generic endpoint URL
export RDF_ENDPOINT=http://localhost:3030/dataset/sparql

# Endpoint-specific URLs
export MARKLOGIC_ENDPOINT=http://marklogic-server:8000/v1/graphs
export NEPTUNE_ENDPOINT=https://your-neptune-instance.amazonaws.com:8182/sparql
export BLAZEGRAPH_ENDPOINT=http://blazegraph-server:9999/blazegraph/sparql
export RDFOX_ENDPOINT=http://rdfox-server:12110/datastores/default/content
export STARDOG_ENDPOINT=https://your-stardog-instance:5820/database

Authentication

# Generic credentials
export RDF_USERNAME=myuser
export RDF_PASSWORD=mypass

# Endpoint-specific credentials
export MARKLOGIC_USERNAME=mluser
export MARKLOGIC_PASSWORD=mlpass
export NEPTUNE_USERNAME=neptuneuser
export NEPTUNE_PASSWORD=neptunepass
export BLAZEGRAPH_USERNAME=bguser
export BLAZEGRAPH_PASSWORD=bgpass
export RDFOX_USERNAME=rdfoxuser
export RDFOX_PASSWORD=rdfoxpass
export STARDOG_USERNAME=sduser
export STARDOG_PASSWORD=sdpass
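
The naming convention above (endpoint type as a prefix, RDF as the generic prefix) can be sketched as a lookup helper. The function name is hypothetical, and the precedence shown here, where a type-specific variable wins and the generic one is the fallback, is an assumption; the description does not spell out the order rdf_uploader uses.

```python
import os
from typing import Mapping, Optional


def resolve_setting(
    name: str,
    endpoint_type: Optional[str] = None,
    environ: Optional[Mapping[str, str]] = None,
) -> Optional[str]:
    """Look up e.g. MARKLOGIC_ENDPOINT first, then RDF_ENDPOINT.

    `name` is the variable suffix: ENDPOINT, USERNAME, or PASSWORD.
    """
    env = os.environ if environ is None else environ
    candidates = []
    if endpoint_type:
        # Endpoint-specific variable, such as MARKLOGIC_USERNAME
        candidates.append(f"{endpoint_type.upper()}_{name}")
    # Generic variable, such as RDF_USERNAME (assumed to be the fallback)
    candidates.append(f"RDF_{name}")
    for key in candidates:
        value = env.get(key)
        if value:
            return value
    return None


if __name__ == "__main__":
    env = {
        "RDF_USERNAME": "myuser",
        "MARKLOGIC_USERNAME": "mluser",
        "RDF_ENDPOINT": "http://localhost:3030/dataset/sparql",
    }
    print(resolve_setting("USERNAME", "marklogic", env))
    print(resolve_setting("ENDPOINT", "marklogic", env))
    print(resolve_setting("PASSWORD", "marklogic", env))
```

Passing the environment as a mapping keeps the helper easy to test without touching the real process environment.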

RDFox Store Name

export RDFOX_STORE_NAME=mystore

Test Configuration

Tests use a local SPARQL endpoint by default. You can configure the test endpoint by setting environment variables:

export TEST_ENDPOINT_URL=http://localhost:3030/test
export TEST_ENDPOINT_TYPE=fuseki

License

This project is licensed under the MIT License - see the LICENSE file for details.

Release history

0.18.8: Sep 6, 2025
0.18.7: Sep 6, 2025
0.18.6: May 17, 2025
0.18.5: May 10, 2025
0.18.3: May 9, 2025
0.18.2: Apr 19, 2025
0.18.0: Apr 19, 2025
0.17.5: Apr 19, 2025
0.17.4: Apr 19, 2025
0.17.3: Apr 18, 2025
0.17.2: Apr 18, 2025 (this version)
0.17.0: Apr 18, 2025
0.16.3: Apr 18, 2025
0.16.2: Apr 18, 2025
0.16.0: Apr 18, 2025
0.15.7: Apr 18, 2025
0.15.6: Apr 18, 2025
0.15.0: Apr 13, 2025
0.1.0: Mar 20, 2025

Download files

Source Distribution

rdf_uploader-0.17.2.tar.gz (43.4 kB)

Uploaded Apr 18, 2025 Source

Built Distribution

rdf_uploader-0.17.2-py3-none-any.whl (12.1 kB)

Uploaded Apr 18, 2025 Python 3

File details

Details for the file rdf_uploader-0.17.2.tar.gz.

File metadata

Download URL: rdf_uploader-0.17.2.tar.gz
Upload date: Apr 18, 2025
Size: 43.4 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: python-httpx/0.28.1

File hashes

Hashes for rdf_uploader-0.17.2.tar.gz
Algorithm	Hash digest
SHA256	`03f7a51af1d5c5f7f4775783565ba694577563c5bd828185b7c9cc93113ecd13`
MD5	`d85bfb26d8a55ca5e2ae8aed5bd94d3a`
BLAKE2b-256	`6a041b3d5b840b4139e806accc789a34b93ec1cde67a5bb341e8f9d60f33a3ae`

File details

Details for the file rdf_uploader-0.17.2-py3-none-any.whl.

File metadata

Download URL: rdf_uploader-0.17.2-py3-none-any.whl
Upload date: Apr 18, 2025
Size: 12.1 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: python-httpx/0.28.1

File hashes

Hashes for rdf_uploader-0.17.2-py3-none-any.whl
Algorithm	Hash digest
SHA256	`e6d4070d564b2d26bc3503c2ae5681295f47ecd4994aad1a800e10f3390a601e`
MD5	`2d837f835c946f5837af37939cd05281`
BLAKE2b-256	`889e3e94b4e145076fc75623afb363b1eee7ea53cdae1d4d295c0bb62235a8c2`
