CIM query utilities

These details have not been verified by PyPI

Project links

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Project description

CIMSPARQL Query CIM data using sparql

This Python package provides functionality for reading/parsing cim data from either xml files or GraphDB into Python memory as pandas dataframes.

The package provides a set of predefined functions/queries to load CIM data such generator or branch data, though the user can easiliy extend or define their own queries.

Usage

Load data using predefined functions/queries

>>> from cimsparql.graphdb import GraphDBClient
>>> from cimsparql.url import service
>>> gdbc = GraphDBClient(service(repo='<repo>', server=127.0.0.1:7200))
>>> ac_lines = gdbc.ac_lines(limit=3)
>>> print(ac_lines[['name', 'x', 'r', 'bch']])
         name       x       r       bch
0  <branch 1>  1.9900  0.8800  0.000010
1  <branch 2>  1.9900  0.8800  0.000010
2  <branch 3>  0.3514  0.1733  0.000198

In the example above the client will query repo "" in the default server GraphDB for AC line values.

Inspect/view predefined queries

To see the actual sparql use the dry_run option:

>>> from cimsparql.queries import ac_line_query
>>> print(ac_line_query(limit=3, dry_run=True))

The resulting string contains all the prefix's available in the Graphdb repo making it easier to copy and past to graphdb. Note that the prefixes are not required in the user specified quires described below.

The dry_run option is available for all the predefined queries.

Load data using user specified queries

>>> query = 'SELECT ?mrid where { ?mrid rdf:type cim:ACLineSegment } limit 2'
>>> query_result = gdbc.get_table(query)
>>> print(query_result)

List of available repos at the server

>>> from cimsparql.url import GraphDbConfig
>>> print(GraphDbConfig().repos)

Prefix and namespace

Available namespace for current graphdb client (gdbc in the examples above), which can be used in queries (such as rdf and cim) can by found by

>>> print(gdbc.ns)
{'wgs': 'http://www.w3.org/2003/01/geo/wgs84_pos#',
 'rdf': 'http://www.w3.org/1999/02/22-rdf-syntax-ns#',
 'owl': 'http://www.w3.org/2002/07/owl#',
 'cim': 'http://iec.ch/TC57/2010/CIM-schema-cim15#',
 'gn': 'http://www.geonames.org/ontology#',
 'xsd': 'http://www.w3.org/2001/XMLSchema#',
 'rdfs': 'http://www.w3.org/2000/01/rdf-schema#',
 'SN': 'http://www.statnett.no/CIM-schema-cim15-extension#',
 'ALG': 'http://www.alstom.com/grid/CIM-schema-cim15-extension#'}

Running Tests Against Docker Databases

Tests can be run agains RDF4J and/or BlazeGraph databases if a container with the correct images are available.

docker pull eclipse/rdf4j-workbench
docker pull openkbs/blazegraph

Launch one or both containers and specify the following environment variables

RDF4J_URL = "localhost:8080/rdf4j-server"
BLAZEGRAPH_URL = "localhost:9999/blazegraph/namespace

Note 1: The port numbers may differ depending on your local Docker configurations. Note 2: You don't have to install RDF4J or BlazeGraph. Tests requiring these will be skipped in case they are not available. They will in any case be run in the CI pipeline on GitHub (where both always are available).

Data Assumptions

CimSPARQL makes certain assumptions about the data which is required to be present for the queries to work. The script modify_xml should be able to modify the XML files such that they are compliant with CimSPARQL.

There is a valid xml:base attribute in the top-level rdf:RDF element. This is required for uploading files (at least for RDF4J which is used in the CI pipeline)
All items cimsparql.constants.CIM_TYPES_WITH_MRID has cim:IdentifiedObject:mRID
cim:Terminal.endNumber is of type xsd:integer

poetry run python scripts/modify.xml -h

usage: Program that modifies XML files to be compatible with cimsparql [-h] [--baseURI BASEURI] [--suffix SUFFIX] file

positional arguments:
  file               File or glob pattern for files to modify

optional arguments:
  -h, --help         show this help message and exit
  --baseURI BASEURI  Base URI to insert in all XML files. For example: http://iec.ch/TC57/2013/CIM-schema-cim16
  --suffix SUFFIX    Suffix to the filename after modifying them. If given as an empty string the original files will be overwritten. Default 'mod'

In order to use the script to convert XML files into a format that can be used with cimsparql

poetry run scripts/modify_xml.py "path/to/model/*.xml"

Ontology (for developers)

Ontologies for the CIM model can be found at (ENTSOE's webpages)[https://www.entsoe.eu/digital/common-information-model/cim-for-grid-models-exchange/]. For convenience and testing purposes the ontology are located under tests/data/ontology. CIM models used for testing purposes in Cimsparql should be stored in N-quads format. In case you have a model in XML format it can be converted to N-quads by launching a DB (for example RDF4J) and upload all the XML files and the ontology.

Execute

PREFIX cims: <http://iec.ch/TC57/1999/rdf-schema-extensions-19990926#>

DELETE {?s ?p ?o}
INSERT {?s ?p ?o_cast} WHERE {
  ?s ?p ?o .
  ?p cims:dataType ?_dtype .
  ?_dtype cims:stereotype ?stereotype .
  BIND(IF(?stereotype = "Primitive",
    URI(concat("http://www.w3.org/2001/XMLSchema#", lcase(strafter(str(?_dtype), "#")))),
    ?_dtype) as ?dtype)
  BIND(STRDT(?o, ?dtype) as ?o_cast)
}

and export as N-quads.

Note: Make sure the base URI is either specified in the XML-files or when you upload. It should be set to

<rdf:RDF xml:base="http://iec.ch/TC57/2013/CIM-schema-cim16">

Test models

micro_t1_nl: MicroGrid/Type1_T1/CGMES_v2.4.15_MicroGridTestConfiguration_T1_NL_Complete_v2

Rest APIs

CimSparql mainly uses SparqlWrapper to communicate with the databases. However, there are certain operations which are performed directly via REST calls. Since there are small differences between different APIs you may have to specify which API you are using. This can be done when initializing the ServiceCfg class or by specifying the SPARQL_REST_API environment variable. Currently, RDF4J and blazegraph is supported (if not given RDF4J is default).

export SPARQL_REST_API=RDF4J  # To use RDF4J
export SPARQL_REST_API=BLAZEGRAPH  # To use BlazeGraph

Project details

These details have not been verified by PyPI

Project links

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Release history Release notifications | RSS feed

3.2.1

Apr 12, 2024

3.2.0

Apr 8, 2024

3.1.0

Apr 4, 2024

3.0.0

Mar 19, 2024

2.17.0

Mar 7, 2024

2.16.0

Jan 16, 2024

2.15.0

Jan 9, 2024

2.14.0

Jan 5, 2024

2.13.0

Jan 5, 2024

2.12.1

Dec 7, 2023

2.12.0

Dec 5, 2023

2.11.0

Nov 27, 2023

2.10.2

Nov 8, 2023

2.10.1

Nov 2, 2023

2.10.0

Oct 2, 2023

2.9.0

Sep 18, 2023

2.8.0

Jun 28, 2023

2.7.3

May 31, 2023

2.7.2

Mar 30, 2023

2.7.1

Mar 21, 2023

2.7.0

Mar 13, 2023

2.6.7

Mar 3, 2023

2.6.6

Mar 3, 2023

2.6.5

Feb 28, 2023

2.6.4

Feb 28, 2023

2.6.3

Feb 20, 2023

2.6.2

Feb 17, 2023

2.6.1

Feb 16, 2023

2.6.0

Feb 13, 2023

2.5.1

Jan 19, 2023

2.5.0

Jan 16, 2023

2.4.2

Jan 4, 2023

2.4.1

Jan 2, 2023

2.4.0

Dec 15, 2022

2.3.3

Dec 12, 2022

2.3.2

Dec 9, 2022

2.3.1

Dec 7, 2022

2.3.0

Dec 7, 2022

2.2.2

Nov 11, 2022

2.2.1

Nov 10, 2022

2.1.1

Nov 2, 2022

2.1.0

Oct 25, 2022

2.0.1

Oct 23, 2022

1.12.1

Sep 5, 2022

This version

1.12.0

Aug 25, 2022

1.10.3

Jun 28, 2022

1.10.1

Jun 16, 2022

1.10.0

Jun 15, 2022

1.9.1

May 2, 2022

1.9.0

Feb 28, 2022

1.8.1

Jan 26, 2022

1.8.0

Jun 8, 2021

1.7.1

May 5, 2021

1.7.0

Apr 30, 2021

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

cimsparql-1.12.0.tar.gz (37.1 kB view hashes)

Uploaded Aug 25, 2022 Source

Built Distribution

cimsparql-1.12.0-py3-none-any.whl (39.8 kB view hashes)

Uploaded Aug 25, 2022 Python 3

Hashes for cimsparql-1.12.0.tar.gz

Hashes for cimsparql-1.12.0.tar.gz
Algorithm	Hash digest
SHA256	`07603e689a6c7a221087f2a7d510d1992104adc585d5d858dce83cda6b7c46cd`
MD5	`492b9e138d79702f6b6e61140ebe057d`
BLAKE2b-256	`5a6a14f2de49b9612f1fae1f9007a9afe71bd999fc52b87129d1eed1f38b0459`

Hashes for cimsparql-1.12.0-py3-none-any.whl

Hashes for cimsparql-1.12.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`d0137f91cc5939c0000fa373cc128214d49e50d06ac9e242c5211e1cdf1629f1`
MD5	`8131a0fc017260b13f0e1acfbfa1dbc9`
BLAKE2b-256	`7158bb5fb31e422b3c74f8ac3dcf1aec8f6c73de519aa25a1807bac5aab0e329`