The CIDC data model and tools for working with it.
cidc-schemas
This repository contains formal definitions of the CIDC metadata model using json-schema syntax and vocabulary.
View documentation at https://nci-cidc.github.io/cidc-schemas/
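As a rough illustration of what a json-schema definition looks like, the sketch below defines a small schema and checks two records against it. The schema, its field names (`sample_id`, `quantity`), and the `check_record` helper are hypothetical and are not taken from the CIDC data model. The helper only understands `required` and `type`; real validation would normally use a full JSON Schema library.

```python
import json

# Hypothetical schema fragment; the field names are illustrative only and
# are not part of the CIDC metadata model.
EXAMPLE_SCHEMA = {
    "$schema": "http://json-schema.org/draft-07/schema#",
    "title": "Example sample record",
    "type": "object",
    "properties": {
        "sample_id": {"type": "string"},
        "quantity": {"type": "number"},
    },
    "required": ["sample_id"],
}

# Map the json-schema type names used above to Python types.
JSON_TYPES = {"string": str, "number": (int, float), "object": dict}


def check_record(record, schema):
    """Return a list of error messages; covers only `required` and `type`."""
    errors = []
    for name in schema.get("required", []):
        if name not in record:
            errors.append(f"missing required property: {name}")
    for name, rule in schema.get("properties", {}).items():
        if name in record and not isinstance(record[name], JSON_TYPES[rule["type"]]):
            errors.append(f"{name} is not of type {rule['type']}")
    return errors


if __name__ == "__main__":
    print(json.dumps(EXAMPLE_SCHEMA, indent=2))
    print(check_record({"sample_id": "S1", "quantity": 2.5}, EXAMPLE_SCHEMA))
    print(check_record({"quantity": "a lot"}, EXAMPLE_SCHEMA))
```

The first record passes with an empty error list; the second is reported as missing `sample_id` and as having a non-numeric `quantity`.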
Installation
To install the latest released version, run:
pip install nci-cidc-schemas
Development
Project Structure
- cidc_schemas/ - a Python module for generating, validating, and reading manifest and assay templates.
  - schemas/ - JSON specifications defining the CIDC metadata model.
    - templates/ - schemas for generating and validating manifest and assay templates.
    - assays/ - schemas for defining assay data models.
    - artifacts/ - schemas for defining artifacts.
- docs/ - the most recent build of the data model documentation, along with templates and scripts for regenerating the documentation.
- template_examples/ - example populated Excel files for the template specifications in schemas/templates, plus .csv files auto-generated from those .xlsx files so that changes to them can be tracked transparently.
- tests/ - tests for the cidc_schemas module.
Developer Setup
Install necessary dependencies.
pip install -r requirements.dev.txt
Install and configure pre-commit hooks.
pre-commit install
Running tests
This repository has unit tests in the tests folder. After installing the dependencies, run the tests with:
pytest tests
Building documentation
Pre-commit hooks ensure that the documentation is automatically kept up to date. To build the documentation manually, run the following commands:
python setup.py install # install helpers from the cidc_schemas library
python docs/generate_docs.py
This writes the generated HTML documents to docs/docs. Once the updated docs are pushed and merged into master, they will be viewable at https://nci-cidc.github.io/cidc-schemas/.
Using the Command-Line Interface
This project comes with a command-line interface for validating schemas and generating/validating assay and manifest templates.
Install the CLI
Clone the repository and cd into it
git clone git@github.com:NCI-CIDC/cidc-schemas.git
cd cidc-schemas
Install the cidc_schemas package (this adds the cidc_schemas CLI to your console)
python setup.py install
Run cidc_schemas --help to see available options.
If you're making changes to the module and want those changes to be reflected in the CLI without reinstalling the cidc_schemas module every time, run
python3 -m cidc_schemas.cli [args]
Creating a new assay or analysis type
To create a new assay type, the most reliable approach is to find an existing assay and copy it.
Preferably, look at scrnaseq and copy exactly what it does. Then adapt the assay schema and template, and/or the analysis schema, to your particular assay.
Once you have made your changes and updated the version of this repo, update api-gae. Copying what scrnaseq did in api-gae should be enough for files to show up on the portal. Make sure to update the api-gae version, and then update the api-gae version used in cloud-functions.
Finally, make sure to update the cli tool to include the new assay.
There are a lot of gotchas and hidden parsing going on behind the scenes. Listing them all would be hard, so the practical advice is to follow an existing working template.
Be sure to regenerate the docs after creating your schema, so the new schema is added to the reference docs.
Generate templates
Create a template for a given template configuration.
cidc_schemas generate_template -m templates/manifests/pbmc_template.json -o pbmc.xlsx
Validate filled-out templates
Check that a populated template file is valid with respect to a template specification.
cidc_schemas validate_template -m templates/manifests/pbmc_template.json -x template_examples/pbmc_template.xlsx
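When generating and validating templates from a script rather than by hand, the two commands above can be wrapped as follows. This is a minimal sketch: the subcommands, flags, and paths are the ones shown above, while the helper names (`generate_argv`, `validate_argv`, `run_if_installed`) are hypothetical. The commands are only executed when the cidc_schemas CLI is found on the PATH.

```python
import shutil
import subprocess

# Template specification used in the commands above.
MANIFEST = "templates/manifests/pbmc_template.json"


def generate_argv(manifest, out_xlsx):
    """Argument list for creating an empty template from a specification."""
    return ["cidc_schemas", "generate_template", "-m", manifest, "-o", out_xlsx]


def validate_argv(manifest, xlsx):
    """Argument list for checking a populated template against a specification."""
    return ["cidc_schemas", "validate_template", "-m", manifest, "-x", xlsx]


def run_if_installed(argv):
    """Run argv and return its exit code, or None if the CLI is not on PATH."""
    if shutil.which(argv[0]) is None:
        return None
    return subprocess.run(argv).returncode


if __name__ == "__main__":
    commands = [
        generate_argv(MANIFEST, "pbmc.xlsx"),
        validate_argv(MANIFEST, "template_examples/pbmc_template.xlsx"),
    ]
    for argv in commands:
        print(" ".join(argv), "->", run_if_installed(argv))
```

Passing the arguments as a list avoids shell quoting problems when file names contain spaces.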
Validate JSON schemas
Check that a JSON schema conforms to the JSON Schema specifications.
cidc_schemas validate_schema -f shipping_core.json
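For intuition about what such a check involves, here is a sketch of a comparable check written directly in Python. It is not the CLI's implementation, which is not shown here. It assumes draft-07 of the specification and uses the third-party jsonschema package when it is installed; without that package it only verifies that the document is parseable JSON with an object or boolean at the top level. The function name `schema_problem` is hypothetical.

```python
import json

try:
    # Third-party and optional: pip install jsonschema
    from jsonschema import Draft7Validator
    from jsonschema.exceptions import SchemaError
except ImportError:
    Draft7Validator = None


def schema_problem(text):
    """Return a description of what is wrong with a schema document, or None."""
    try:
        schema = json.loads(text)
    except json.JSONDecodeError as exc:
        return f"not valid JSON: {exc.msg}"
    if not isinstance(schema, (dict, bool)):
        return "a JSON Schema must be an object or a boolean"
    if Draft7Validator is not None:
        try:
            # Validates the schema itself against the draft-07 metaschema.
            Draft7Validator.check_schema(schema)
        except SchemaError as exc:
            return f"not a valid draft-07 schema: {exc.message}"
    return None


if __name__ == "__main__":
    print(schema_problem('{"type": "object", "required": ["id"]}'))
    print(schema_problem('{"type": '))
    print(schema_problem("[1, 2]"))
```

The draft version is an assumption; if the schemas in this repository declare a different `$schema`, the matching validator class should be used instead.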