
ML visualization pipeline for caQTL evaluation


Data Pipeline

Processes inference-model predictions and observed data, then performs exploratory data analysis and data visualization.

Configuration

Before running the pipelines, you need to configure them. Configuration files are located in the /config/ directory. For custom configurations:

This will create the following files, which you need to fill out:

- `pipelines/data_pipeline/configs/direct_input_config.json`
- `pipelines/data_pipeline/configs/personal_config.json`

1. Edit Config Files: Modify the configuration files to match your data and setup. These files contain the parameters and paths required to run the pipelines. Make sure that all paths, model checkpoints, and settings match your environment.
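A quick sanity check before running anything can catch a mistyped path. The sketch below is not part of the package. It assumes the config is a (possibly nested) JSON object and treats any string value whose key ends in `_path` or `_dir` as a filesystem path. That key convention and the helper names `load_config` and `find_missing_paths` are hypothetical; adjust them to the keys your config files actually use.

```python
import json
import os


def load_config(config_file):
    """Read a JSON config file into a dictionary."""
    with open(config_file, encoding="utf-8") as handle:
        return json.load(handle)


def find_missing_paths(config, suffixes=("_path", "_dir"), prefix=""):
    """Return dotted keys whose path-like string values do not exist on disk.

    The suffix convention is an assumption for illustration only.
    """
    missing = []
    for key, value in config.items():
        dotted = f"{prefix}{key}"
        if isinstance(value, dict):
            # Recurse into nested sections, extending the dotted key.
            missing.extend(find_missing_paths(value, suffixes, dotted + "."))
        elif isinstance(value, str) and key.endswith(suffixes):
            if not os.path.exists(value):
                missing.append(dotted)
    return missing
```

Calling `find_missing_paths(load_config("pipelines/data_pipeline/configs/personal_config.json"))` would then list the entries that still point at nonexistent locations, if the file follows the assumed key convention.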

Option 1: Default Repository Structure

Use this option if you're following the default setup as structured in the repository:

`python generate_config.py --config_file configs/default_config.json`

Option 2: Custom Configuration

Use this option if you need to specify custom paths and settings:

`python generate_config.py --direct_input --config_file configs/direct_input_config.json`

2. Usage: Once the configuration is complete, you can run the pipeline.
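For scripted runs, the two options above can be wrapped in a small helper. This sketch only assembles the command lines already shown (`generate_config.py`, `--config_file`, `--direct_input`). The helper names are hypothetical, and it assumes you run it from the directory that contains `generate_config.py`.

```python
import subprocess
import sys


def build_generate_config_command(config_file, direct_input=False):
    """Assemble the generate_config.py command line as a list of arguments."""
    # sys.executable reuses the interpreter that is running this script.
    command = [sys.executable, "generate_config.py"]
    if direct_input:
        command.append("--direct_input")
    command += ["--config_file", config_file]
    return command


def run_generate_config(config_file, direct_input=False):
    """Run generate_config.py; raises CalledProcessError on a non-zero exit."""
    subprocess.run(
        build_generate_config_command(config_file, direct_input),
        check=True,
    )
```

For example, `run_generate_config("configs/direct_input_config.json", direct_input=True)` corresponds to Option 2.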

Running the pipeline

Data Frame Generation

Exploratory Data Analysis (EDA)

Data Visualization

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

License

This project is licensed under the MIT License. See the LICENSE file for details.

File details

Source distribution: `data_pipeline_ml_caqtl_visualization-0.1.3.tar.gz`

- SHA256: `b61f28141209da41560ae08bd13a169235b969678ce345dcf7633c13779fec0b`
- MD5: `6d108e166f7ab804c3bf1c7d262eeed3`
- BLAKE2b-256: `072ee47e6652511f1dcf87114c67c43b78e5d2c2a408e7fa33bb38478a8b57dc`

Built distribution: `data_pipeline_ml_caqtl_visualization-0.1.3-py3-none-any.whl`

- SHA256: `22ee95699a4251165219ac91687f27465d9823e160184de9da966b6936d6a0fd`
- MD5: `9d3f6d76ad1e4a1cceb5ba6d2b0b5841`
- BLAKE2b-256: `6e7204aeccd79837c37497f6a9fa6d6722c9a372982e8b6fee9e15dbc7088fd1`
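If you download one of these files manually, you can compare its SHA256 digest with the value listed above. The following is a minimal sketch using only the standard-library `hashlib` module; the function names are illustrative and the local file path is whatever you saved the download as.

```python
import hashlib


def sha256_of_file(path, chunk_size=1 << 20):
    """Compute the SHA256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        # iter() with a sentinel stops when read() returns empty bytes.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_sha256(path, expected_hex):
    """Return True if the file's SHA256 digest equals the expected value."""
    return sha256_of_file(path) == expected_hex.strip().lower()
```

A call such as `matches_sha256("data_pipeline_ml_caqtl_visualization-0.1.3.tar.gz", "b61f28141209da41560ae08bd13a169235b969678ce345dcf7633c13779fec0b")` should return `True` for an intact copy of the source distribution.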
