Skip to main content

ML visualization pipeline for caQTL evaluation

Project description

Data Pipeline

Processes inference models predictions and observed data, exploratory data analysis, data vizualization.

Configuration

Before running the pipelines, you need to configure them. Configuration files are located in the /config/ directory. For custom configurations:

This will create the following files that the user needs to fill out:

- `pipelines/data_pipeline/configs/direct_input_config.json`
- `pipelines/data_pipeline/configs/personal_config.json`
  1. Edit Config Files: Modify the configuration files to match your data and setup. These files contain the necessary parameters and paths required to run the pipelines successfully. Ensure that all paths, model checkpoints, and settings are correctly specified to match your environment.

Option 1: Default Repository Structure

Use this option if you're following the default setup as structured in the repository:

python generate_config.py --config_file configs/default_config.json

Option 2: Custom Configuration

Use this option if you need to specify custom paths and settings:

python generate_config.py --direct_input --config_file configs/direct_input_config.json
  1. Usage: Once the configuration is complete, you can run the pipeline.

Running the pipeline

Data Frame Generation

Exploratory Data Analysis(EDA)

Data Visualization

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

License

This project is licensed under the MIT License - see the LICENSE file for details

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

Built Distribution

File details

Details for the file data_pipeline_ml_caqtl_visualization-0.2.1.tar.gz.

File metadata

File hashes

Hashes for data_pipeline_ml_caqtl_visualization-0.2.1.tar.gz
Algorithm Hash digest
SHA256 b2c96e85283edff2f1daa2adc74ee3cf8d0aacae42d05cb3b9773e60b17e73b2
MD5 031bd50b7b40d682cb09040833ae3d8f
BLAKE2b-256 dc1564a08dbca1c0c0916329d7b781dfbafe3e70316ac3f1bfd920242b72fe24

See more details on using hashes here.

File details

Details for the file data_pipeline_ml_caqtl_visualization-0.2.1-py3-none-any.whl.

File metadata

File hashes

Hashes for data_pipeline_ml_caqtl_visualization-0.2.1-py3-none-any.whl
Algorithm Hash digest
SHA256 93b8f68f3d015972b321ef582d5405b3eae2e4a63da6a8e4d4e82c5c7a4c74e7
MD5 e88870ff06d25817dbc4acd7af7a3488
BLAKE2b-256 eeae617202e86b637aa21ce302128ce6ee120caf6ecdd59ac692bbc6825d11c9

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page