A python tool to enhance the accessibility of SPARC scaffold visualisations and their analyses in accordance with FAIR principles.
Project description
SPARC Scaffolding in Python (SPARC-SPy)
A python tool to enhance the accessibility of SPARC dataset visualisations and their analyses in accordance with FAIR principles.
Table of contents
- About
- Introduction
- The problem
- Our solution - (SPARC-SPy)
- Impact
- Contributing
- Setting up SPARC-SPy
- Reporting issues
- Contributing
- Cite us
- FAIR practices
- License
- Team
- Acknowledgements
About
This is the repository of team SPARC-SPy (Team #3) of the 2024 SPARC Codeathon. Information about the 2024 SPARC Codeathon can be found here.
No work was done on this project prior to the Codeathon.
Introduction
The NIH Common fund program Stimulating Peripheral Activity to Relieve Conditions (SPARC) seeks to understand how electrical signals control internal organ function. In doing so it explores how therapeutic devices might modulate nerve activity to treat conditions like hypertension, heart failure, and gastrointestinal disorders. To this end, data have been compiled from 60+ research groups, involving 3900+ subjects across 8 species from 49 different anatomical structures.
The SPARC Portal offers a user-friendly interface to access and share resources from the SPARC community. It features well-curated, high-impact data, SPARC projects, and computational simulations, all available under the “Find Data” section.
The problem
In the current landscape of data science and research, visualizing data is crucial for analysis, interpretation, and communication. However, existing tools for reconstructing visualizations from datasets are limited in their accessibility and interoperability. The primary tool available is restricted to the Windows operating system, creating significant barriers for users on other platforms such as macOS and Linux. This limitation hinders the application of the FAIR principles (Findable, Accessible, Interoperable, and Reusable) to data visualization:
Limited Accessibility:
- Researchers and data scientists using non-Windows operating systems are unable to access the existing tool, leading to inefficiencies and potential data silos.
Poor Interoperability:
- The existing tool may not support integration with other widely-used data analysis tools or workflows, making it difficult to share and collaborate on visualizations across different platforms and software environments.
Challenges in Reusability:
- Without a standardized approach to creating and sharing visualizations, researchers may struggle to replicate or adapt visualizations for different datasets or research contexts.
Our solution - (SPARC-SPy)
We have developed a cross-platform Python visualisation tool called the SPARC Scaffolding in Python (SPARC-SPy) to run within o2S2PARC that can produce VTK visualisations from data scaffolds. This Python module enhancess the FAIRness of SPARC data by:
- Findability
- Enhanced Metadata: The tool can extract and attach metadata to visualizations, making it easier to locate specific datasets and their visual representations.
- Searchability: By tagging visualizations with relevant keywords and descriptions, users can quickly find the visual data they need.
- Accessibility
- User-Friendly Interface: A well-designed tool can provide an intuitive interface for accessing and generating visualizations, lowering the barrier for users with varying levels of technical expertise.
- Light weight: A universally implementable visualisation tool can be run within o2S2PARC while accessing visualisations of curated SDS datasets and their metadata (using the Pennsieve API).
- Open Access: If the tool is open-source or freely available, it ensures that a wider audience can access and use it without restrictions.
- Interoperability
- Standard Formats: The tool can support and export visualizations in standardized formats (e.g., JSON & VTK at present - can be expanded further), ensuring compatibility with other tools and platforms.
- APIs and Integration: By providing APIs and integration capabilities, the tool can work seamlessly with other data analysis and visualization workflows, promoting interoperability.
- Reusability
- Documentation and Templates: The tool includes comprehensive documentation and reusable templates for common visualization types, making it easier for users to replicate and adapt visualizations for their own datasets.
- Version Control: Implementing version control for visualizations ensures that users can track changes and reuse previous versions as needed.
Impact
Improve existing capabilities of SPARC tools with direct integration
The SPARC-SPy tool has been developed to integrate existing SPARC tools such as Pennsieve and sparc-me. This allows for a streamlined process within the SPARC ecosystem from downloading datasets to generating visualisations. By supporting standardised data formats this tool is highly interoperable with existing tools, improving the capabilities and experience of the SPARC platform. The capabilities of spark-spy extend further as it can query metadata and embeded within the visualisations to provide powerful analyses (e.g. scaffold volume). This tool is provided alongside comprehensive documentation to ensure a user-friendly experience, empowering researchers to integrate SPARC-SPy into their workflows for more consistent and reproducible visualisations.
Increase visibility of the value within SPARC's public data
Visualizations can make complex data more engaging and easier to communicate to a broader audience, including those without a technical background. Using SPARC-SPy for reconstructing visualizations, researchers can more effectively analyze and interpret SPARC’s public data, making it more accessible and understandable, which in turn increases its visibility and impact. The tool can help users discover new insights and patterns within SPARC’s datasets, potentially leading to new research questions and applications and the end goal of effective treatments.
Setting up SPARC-SPy
Pre-requisites
- Git
- Python versions:
- 3.9
Installing via PyPI
Here is the link to our project on PyPI
pip install sparc_spy
From source code
Downloading source code
Clone the SPARC-SPy repository from github, e.g.:
git clone git@github.com:SPARC-FAIR-codeathon/sparc-spy
Installing dependencies
pip install requirements.txt
Using SPARC-SPy
Included are guided tutorials covering some applications of SPARC-SPy:
| Tutorial | Description |
|---|---|
| Tutorial 1: | Getting started - In this tutorial we use SPARC-SPy to import a json scaffold file from a public dataset and visualise it within a jupyter notebook running on o2S2PARC. |
| Tutorial 2: | Finding scaffolds - In this tutorial we show how SPARC-SPy can be used to identify scaffolds within a given sparc dataset. |
| Tutorial 3: | Generating analytics - In this tutorial we show how SPARC-SPy can be use scaffolds and metadata to generate powerful analytics (such as volume, ?average temperature/direction?). |
| Tutorial 4: | New tags - In this tutorial we show how we can tag visualisations with key descriptors to enable users to quickly identify the data they need. |
| Tutorial 5: | Into the flow - In this tutorial we show how SPARC-SPy can be used with existing tools such as sparc-flow to simplify visualisation workflows. |
| Tutorial 6: | Mapping new data - In this tutorial we show how new experimental data can be imported to a scaffold. |
Reporting issues
To report an issue or suggest a new feature, please use the issues page. Please check existing issues before submitting a new one.
Contributing
To contribute: fork this repository and submit a pull request. Before submitting a pull request, please read our Contributing Guidelines and Code of Conduct. If you found this tool helpful, please add a GitHub Star to support further developments!
Project structure
/src/- Directory of SPARC-SPy python module./tutorials/- Directory of tutorials showcasing SPARC-SPy python module in action.
Cite us
If you use sparc-flow to make new discoveries or use the source code, please cite us as follows:
Michael Hoffman, Yun Gu, Mishaim Malik, Savindi Wijenayaka, Matthew French (2024). SPARC-SPy: v1.0.0 - A python tool to enhance the accessibility of SPARC dataset visualisations and their analyses in accordance with FAIR principles.
Zenodo. https://doi.org/XXXX/zenodo.XXXX.
FAIR practices
We have assessed the FAIRness of our sparc-flow tool against the FAIR Principles established for research software. The details are available in the following SPARC-SPy-Fairness
License
SPARC-SPy is open source and distributed under the Apache License 2.0. See LICENSE for more information.
Team
- Michael Hoffman (Writer)
- Yun Gu (Developer)
- Mishaim Malik (Developer)
- Savindi Wijenayaka (SysAdmin, Developer)
- Matthew French (Lead, Developer)
Acknowledgements
- We would like to thank the 2024 SPARC Codeathon organizers for their guidance and support during this Codeathon.
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file sparc_spy-0.1.5.tar.gz.
File metadata
- Download URL: sparc_spy-0.1.5.tar.gz
- Upload date:
- Size: 19.8 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.9.19
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
51c4ed56a3cc59214490e4c35fb4fa4a5230446e884bed2457e2d2c7b9b4c4e1
|
|
| MD5 |
c96cc322fea175bd5871fd460e8a8433
|
|
| BLAKE2b-256 |
cd322c8b4c2b9ebb6d6adc6eaddd3fa1a71b3c1af1fcd5b3c189b6965877dfb9
|
File details
Details for the file sparc_spy-0.1.5-py3-none-any.whl.
File metadata
- Download URL: sparc_spy-0.1.5-py3-none-any.whl
- Upload date:
- Size: 15.2 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.9.19
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
99593b88b88a71c43663ecd524d06b13ffbef564e0e971ad470e853e89a9149b
|
|
| MD5 |
8aaef3c9f300d0cc6709b1927da642b7
|
|
| BLAKE2b-256 |
60c1c0f2b994680b242ef017eb418ba330049669ac615a14ce3b45e9dcc450b7
|