Translates several CSV files with ontological terms and corresponding data into RDF triples. These RDF triples are stored in OWL and JSON-LD files, facilitating data accessibility, interoperability, and knowledge unification. The triples are also visualized in a graph saved as an SVG. The input CSVs must be formatted with a template from a public Google Sheet; see README or vignette for more information. This is a tool used by the SDLE Research Center at Case Western Reserve University.
title: "FAIRmaterials"
| Authors:
| Jonathan E. Gordon$^{1}$, Alexander Harding Bradley$^{1}$, Priyan Rajamohan$^{1}$, Nathaniel Hahn$^{1}$, Kiefer Lin$^{1}$,
| Arafath Nihar$^{1}$, Hayden Cadwell$^{1}$, Jiana Kambo$^{1}$, Jayvic Jimenez$^{1}$, Kristen J. Hernandez$^{1}$, Hein Htet Aung$^{1}$,
| Brian Giera$^{2}$, Weiqi Yu$^{1}$, Mohommad Redad Mehdi$^{1}$, Finley Holt$^{1}$, Quynh Tran$^{1}$, Gabriel Ponon$^{1}$,
| Dan Savage$^{3}$, Don Brown$^{3}$, Jarod Kaltenbaugh$^{4}$, Kush Havinal$^{4}$, Nicholas Gray$^{4}$, Max Ligget$^{1}$,
| Benjamin G. Pierce$^{1}$, Raymond Wieser$^{1}$, Yangxin Fan$^{1}$, Tommy Ciardi$^{1}$, Olatunde J. Akanbi$^{1}$, Hadiza Iawal$^{1}$,
| Will Oltjen$^{1}$, Maliesha Kalutotage$^{1}$, Antony Lino$^{1}$, Van Tran$^{1}$, Mingjian Lu$^{1}$, Xuanji Yu$^{1}$,
| Abhishek Daundkar$^{1}$, Hope Omodolor$^{1}$, Mirra Rasmussen$^{1}$, Sameera Nalin-Venkat$^{1}$, Tian Wang$^{1}$,
| Rounak Chawla$^{1}$, Liangyi Huang$^{1}$, Zelin Li$^{1}$, Leean Jo$^{1}$, Jeffrey M. Yarus$^{1}$, Mengjie Li$^{4}$,
| Kristopher O. Davis$^{4}$, Yinghui Wu$^{1}$, Pawan K. Tripathi$^{1}$, Laura S. Bruckman$^{1}$, Erika I. Barcelos$^{1}$,
| Roger H. French$^{1}$
|
| $^{1}$ Materials Data Science for Stockpile Stewardship Center of Excellence, Cleveland, OH 44106, USA
| $^{2}$ Lawrence Livermore National Laboratory, Livermore, CA 94551, USA
| $^{3}$ Los Alamos National Laboratory, Los Alamos, NM 87545, USA
| $^{4}$ University of Central Florida, Materials Science & Engineering, Orlando, FL 32816, USA
What is FAIRmaterials and what does it do?
'FAIRmaterials' is a bilingual package in R and Python that translates several CSV files, created from the template described below and containing ontological terms and corresponding data, into RDF triples. These RDF triples are then stored in OWL and JSON-LD files, facilitating data accessibility, interoperability, and knowledge unification. The triples are also visualized in a graph saved as an SVG, or as a PNG with the Python package. The Python package also produces a fourth output: HTML documentation for the output ontology.
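To make the idea of "CSV rows into RDF triples" concrete, here is a minimal sketch using only the Python standard library. It is not the package's implementation: the column names (`subject`, `predicate`, `object`) and the sample rows are hypothetical and do not reflect the actual FAIR CSV template, which spreads this information across several sheets.

```python
import csv
import io

# Hypothetical sheet contents. The column names and terms are illustrative
# only and are NOT taken from the FAIR CSV template.
SHEET = """subject,predicate,object
pv:Module,rdfs:subClassOf,pv:Device
pv:Module,rdfs:label,PV Module
"""


def rows_to_triples(csv_text):
    """Return one (subject, predicate, object) tuple per CSV row."""
    reader = csv.DictReader(io.StringIO(csv_text))
    return [(row["subject"], row["predicate"], row["object"]) for row in reader]


triples = rows_to_triples(SHEET)
for s, p, o in triples:
    print(s, p, o)
```

Each tuple corresponds to one statement in the resulting graph; serializing such statements to OWL or JSON-LD is what the package automates.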
Put the sheets for one or more ontologies, created from the template described below, in a folder. The 'FAIRmaterials' 'process_ontology_files' or 'FAIRSheetParser' function will then write a JSON-LD file, a Turtle/OWL file, an SVG/PNG visualization, and HTML documentation into the input folder for all the input ontologies merged. Note that a folder containing files for only one ontology gives results for just that ontology, and that specifying a base URI is required when merging ontologies. The package handles subdirectories by producing separate unmerged outputs for each subdirectory and a merged output from all files in the input path. This tool, developed for use by the SDLE Research Center at Case Western Reserve University, facilitates the creation and visualization of materials science ontologies.
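The subdirectory behavior described above can be pictured with a short sketch. The helpers `group_csvs` and `merged_inputs` are hypothetical names introduced here for illustration; they only show how CSV files might be grouped per subdirectory (one unmerged output each) and collected across the whole input path (the merged output). They are not how the package is known to do it.

```python
from pathlib import Path
import tempfile


def group_csvs(root):
    """Map each directory under root to the CSV files located directly in it."""
    root = Path(root)
    groups = {}
    for csv_path in sorted(root.rglob("*.csv")):
        folder = csv_path.parent.relative_to(root).as_posix()
        groups.setdefault(folder, []).append(csv_path.name)
    return groups


def merged_inputs(groups):
    """All CSV files under the root: the input set for a merged output."""
    return sorted(f"{folder}/{name}" for folder, names in groups.items() for name in names)


# Build a throwaway folder layout with one placeholder CSV per ontology.
with tempfile.TemporaryDirectory() as tmp:
    for folder, name in [("PV", "a.csv"), ("XRay", "b.csv")]:
        (Path(tmp) / folder).mkdir()
        (Path(tmp) / folder / name).write_text("placeholder\n")
    groups = group_csvs(tmp)
    print(groups)                 # one entry per subdirectory
    print(merged_inputs(groups))  # everything under the input path
```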
How to use FAIR CSV template:
To prepare your CSV files, fill in the appropriate ontological terms and corresponding data using this template: FAIR CSV Template
Example of XRay ontology FAIR CSV sheet
Example Name Space Sheet:
Example Ontology Information Sheet:
Example Variable Definitions Sheet:
Example Relationship Definitions Sheet:
Example Value Type Definitions Sheet:
The descriptions under the column names explain how to fill out each column. Once the Name Space, Value Type Definitions, Relationship Definitions, Variable Definitions, and Ontology Info sheets are filled out, 'FAIRmaterials' is ready to use. The visualizations generated by this package are a convenient way to check that the ontology matches what you intended to enter in the sheets.
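Before running the package, it can be handy to confirm that all five sheets are present in a folder. The sketch below is an assumption-heavy illustration: the sheet names come from the text above, but the file-name keywords and the example file names are hypothetical, since the naming the package actually expects is defined by the template and is not stated here.

```python
# Sheet names come from the template description; the file-name keywords
# are hypothetical and only serve this illustration.
REQUIRED_SHEETS = {
    "Name Space": "namespace",
    "Ontology Info": "ontologyinfo",
    "Variable Definitions": "variabledefinitions",
    "Relationship Definitions": "relationshipdefinitions",
    "Value Type Definitions": "valuetypedefinitions",
}


def normalize(name):
    """Lowercase a file name and drop everything except letters and digits."""
    return "".join(ch for ch in name.lower() if ch.isalnum())


def missing_sheets(filenames, required=REQUIRED_SHEETS):
    """Return the sheet names for which no file name contains the keyword."""
    normalized = [normalize(f) for f in filenames]
    return [
        sheet
        for sheet, keyword in required.items()
        if not any(keyword in n for n in normalized)
    ]


# Hypothetical file names for a partially prepared folder.
files = [
    "XRay - NameSpace.csv",
    "XRay - Ontology Info.csv",
    "XRay - Variable Definitions.csv",
]
print(missing_sheets(files))
```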
Install and load the package
Install in R:
install.packages("FAIRmaterials")
library(FAIRmaterials)
Install in Python:
pip install FAIRmaterials
Running the default 'process_ontology_files' function:
PV ontology sheets:
Running the package in R:
# Process the CSV files in the PV folder
example_folder1 <- system.file("extdata", "PV", package = "FAIRmaterials")
FAIRmaterials::process_ontology_files(example_folder1, add_external_onto_info = FALSE)
This visualization from the R package will be saved in the output folder as an SVG:
Running the package in Python:
FAIRmaterials --folder_path /path/to/csv/files --include_graph_valuetype --include_pylode_docs
This visualization from the Python package will be saved in the output folder as a PNG:
HTML output
The HTML output generated by the Python package for PV Module can be found here:
XRay Sample ontology sheets:
Running the package in R:
# Process the CSV files in the Xray folder
example_folder2 <- system.file("extdata", "XRay", package = "FAIRmaterials")
FAIRmaterials::process_ontology_files(example_folder2, add_external_onto_info = FALSE)
This visualization from the R package will be saved in the output folder as an SVG:
Running the package in Python:
# Process the CSV files in the Xray folder
FAIRmaterials --folder_path /path/to/csv/files --include_graph_valuetype --include_pylode_docs
This visualization from the Python package will be saved in the output folder as a PNG:
HTML output
The HTML output generated by the Python package for XRay Sample can be found here:
Removing values from visualization in R
Setting the include_graph_valuetype argument to FALSE excludes the value types from the graph. Compare the result with the PV graph above.
Removing values in R:
FAIRmaterials::process_ontology_files(example_folder1, include_graph_valuetype = FALSE, add_external_onto_info = FALSE)
Removing values from visualization in Python
Omitting the --include_graph_valuetype flag excludes the value types from the graph. Compare the result with the PV graph above.
Removing values in Python:
FAIRmaterials --folder_path /path/to/csv/files
This visualization from the Python package will be saved in the output folder as a PNG:
Attempting to add external ontology information in R:
Now we set the add_external_onto_info argument to TRUE. The package attempts to find additional information for every term in the XRay Sample and PV Module data and update the output.
For PV Module:
Adding external ontology information in R:
FAIRmaterials::process_ontology_files(example_folder1, include_graph_valuetype = TRUE, add_external_onto_info = TRUE)
For XRay Sample:
Adding external ontology information in R:
FAIRmaterials::process_ontology_files(example_folder2, include_graph_valuetype = TRUE, add_external_onto_info = TRUE)
Attempting to add external ontology information in Python:
Now we include the --add_external_onto_info flag when using the package. The package attempts to find additional information for every term in the XRay Sample and PV Module data and update the output.
For PV Module:
Adding external ontology information in Python:
FAIRmaterials --folder_path /path/to/csv/files --include_graph_valuetype --add_external_onto_info
For XRay Sample:
Adding external ontology information in Python:
FAIRmaterials --folder_path /path/to/csv/files --include_graph_valuetype --add_external_onto_info
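When the command-line tool is driven from a script, the flag combinations shown so far can be assembled programmatically. The sketch below uses only the flags that appear in this document (`--folder_path`, `--include_graph_valuetype`, `--include_pylode_docs`, `--add_external_onto_info`); the helper names `build_command` and `run_if_installed` are hypothetical, and the sketch assumes the `FAIRmaterials` executable is on the PATH after `pip install`.

```python
import shutil
import subprocess


def build_command(folder_path, include_graph_valuetype=False,
                  include_pylode_docs=False, add_external_onto_info=False):
    """Assemble the FAIRmaterials command line as an argument list."""
    cmd = ["FAIRmaterials", "--folder_path", str(folder_path)]
    if include_graph_valuetype:
        cmd.append("--include_graph_valuetype")
    if include_pylode_docs:
        cmd.append("--include_pylode_docs")
    if add_external_onto_info:
        cmd.append("--add_external_onto_info")
    return cmd


def run_if_installed(cmd):
    """Run the command only if the executable can be found; otherwise return None."""
    if shutil.which(cmd[0]) is None:
        return None
    return subprocess.run(cmd, check=True)


cmd = build_command("/path/to/csv/files",
                    include_graph_valuetype=True,
                    add_external_onto_info=True)
print(" ".join(cmd))
```

Passing the arguments as a list rather than a single string avoids shell quoting problems when the folder path contains spaces.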
Merging two ontologies and specifying some of the metadata:
Lastly we will merge the PV and XRay ontologies. This is accomplished by providing the function a folder path that contains both the PV and XRay ontology sheets. We can also specify some of the metadata included in the outputs.
Merging ontologies in R:
example_folder3 <- system.file("extdata", package = "FAIRmaterials")
FAIRmaterials::process_ontology_files(example_folder3, add_external_onto_info = FALSE, merge_title = "MergedPVandXRay", merge_base_uri = "https://cwrusdle.bitbucket.io/OntologyFilesOwl/Ontology/", merge_version = "1.0")
This is the visualization for the two ontologies merged. This visualization from the R package will be saved in the output folder as an SVG:
Merging ontologies in Python:
FAIRmaterials --folder_path /path/to/csv/files/ --merge_title MergedPVandXRay --merge_base_uri https://cwrusdle.bitbucket.io/OntologyFilesOwl/Ontology/ --merge_version 1.0
We are still working on adding visualization functionality for merged ontologies in Python.
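One common way to think about merging RDF data is as a set union of triples: statements that appear in both ontologies are kept once, and everything else is carried over. The sketch below illustrates only that general idea with made-up triples. It does not claim to reproduce how 'FAIRmaterials' merges ontologies or how it uses the base URI.

```python
# Made-up triples for two small ontologies; the prefixes and terms are
# illustrative and are not taken from the PV or XRay example sheets.
pv = {
    ("pv:Module", "rdf:type", "owl:Class"),
    ("pv:Module", "rdfs:label", "PV Module"),
}
xray = {
    ("xray:Sample", "rdf:type", "owl:Class"),
    ("pv:Module", "rdf:type", "owl:Class"),  # statement shared with the PV set
}

# Set union keeps each distinct statement exactly once.
merged = pv | xray

for triple in sorted(merged):
    print(triple)
```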
Acknowledgment:
This work was supported by the U.S. Department of Energy’s Office of Energy Efficiency and Renewable Energy (EERE) under Solar Energy Technologies Office (SETO) Agreement Numbers DE-EE0009353 and DE-EE0009347, the Department of Energy (National Nuclear Security Administration) under Award Number DE-NA0004104 and Contract Number B647887, and the U.S. National Science Foundation under Award Number 2133576.