Skip to main content

VaRaPS : Variants Ratios from Pooled Sequencing

Project description

VaRaPS

VaRaPS (Variant Read Proportion System) is a specialized tool designed for calculating the proportions of each variant from sequencing data, primarily focusing on BAM and CRAM file formats. This package provides targeted functionalities to accurately determine variant distributions, essential for detailed genomic analysis and research.

Functionality

Mode 1

Mode 1 is focused on analyzing sequencing data from files in BAM or CRAM format. It is designed to process this data to calculate and report the proportions of each variant present in the sequencing. This mode is essential for the initial analysis and provides a foundation for more detailed variant proportion calculations.

Mode 2

Mode 2 is utilized for calculating proportions from the output generated by Mode 1. This mode involves parameters such as number of bootstraps, initial alpha values, and optimization flags. It offers a deeper level of analysis, building upon the foundational data prepared in Mode 1.

Mode 3

Mode 3 combines the functionalities of Modes 1 and 2. It starts by performing an analysis similar to Mode 1, creating a temporary directory for intermediate outputs. After processing the initial data, it then proceeds to Mode 2's detailed analysis, leveraging the intermediate data for comprehensive variant proportion calculations. This mode is particularly useful for extensive datasets requiring sequential processing stages.

These modes collectively provide a versatile toolset for genomic researchers to analyze sequencing data with precision and depth, specifically focusing on the proportions of various genetic variants.

Installation

You can easily install VaRaPS from PyPI using pip:

pip install VaRaPS

Usage

VaRaPS can be utilized in three distinct modes, each tailored for specific aspects of sequencing data analysis. Below are the detailed instructions for each mode:

Mode 1

Use the following command structure to analyze your BAM/CRAM files:

`varaps -m 1 --path [PATH_TO_BAM/CRAM] --ref [PATH_TO_REFERENCE] --output [OUTPUT_DIRECTORY] --filter_per [FILTER_PERCENTAGE] --filter_num [FILTER_NUMBER]` 

Replace the placeholders with your data:
* `[PATH_TO_BAM/CRAM]`: Path to the BAM/CRAM file or directory.
* `[PATH_TO_REFERENCE]`: Path to the reference genome file.
* `[OUTPUT_DIRECTORY]`: Directory for saving the output.
* `[FILTER_PERCENTAGE]`: Percentage for filtering (optional).
* `[FILTER_NUMBER]`: Number for filtering (optional).

Mode 2

To use VaRaPS in Mode 2 for calculating variant proportions from Mode 1 output:

  1. Prepare Mode 1 Output: Ensure that the output from Mode 1, typically in CSV format, is available.

  2. Run the analysis: Execute the following command:

    varaps -m 2 --path [PATH_TO_MODE1_OUTPUT] --M [VALUE] --output [OUTPUT_DIRECTORY] --NbBootstraps [BOOTSTRAPS] --alphaInit [ALPHA_INIT] --optibyAlpha [OPTIMIZE_BY_ALPHA] --deconv_method [DECONVOLUTION_METHOD]

    Replace the placeholders with appropriate values:

    • [PATH_TO_MODE1_OUTPUT]: Path to the folder or file containing Mode 1's output.
    • [VALUE]: Specify the M parameter value.
    • [OUTPUT_DIRECTORY]: Directory for saving the output.
    • [BOOTSTRAPS]: Number of bootstraps.
    • [ALPHA_INIT]: Initial alpha value.
    • [OPTIMIZE_BY_ALPHA]: Boolean flag to optimize by alpha (true/false).
    • [DECONVOLUTION_METHOD]: Method of deconvolution to use (numerical value).

Mode 3

For using VaRaPS in Mode 3, which combines the functionalities of Modes 1 and 2:

  1. Run the analysis: Execute the following command:

    varaps -m 3 --path [PATH_TO_DATA] --ref [REFERENCE_PATH] --output [OUTPUT_DIRECTORY] --filter_per [FILTER_PERCENTAGE] --filter_num [FILTER_NUMBER] --M [VALUE] --NbBootstraps [BOOTSTRAPS] --alphaInit [ALPHA_INIT] --optibyAlpha [OPTIMIZE_BY_ALPHA] --deconv_method [DECONVOLUTION_METHOD]

    The placeholders should be filled as follows:

    • [PATH_TO_DATA]: Path to the BAM/CRAM files or the directory containing them.
    • [REFERENCE_PATH]: Path to the reference file.
    • [OUTPUT_DIRECTORY]: Directory for saving the output.
    • [FILTER_PERCENTAGE] and [FILTER_NUMBER]: Filtering criteria for the initial analysis (Mode 1 part).
    • [VALUE], [BOOTSTRAPS], [ALPHA_INIT], [OPTIMIZE_BY_ALPHA], [DECONVOLUTION_METHOD]: Parameters for the detailed analysis (Mode 2 part).

Contributing

Contributions to VaRaPS are welcome. If you have suggestions or improvements, feel free to mail me at djaout[at]lpsm.paris

License

GNU General Public License v3 or later (GPLv3+)

Contact

For any questions or feedback regarding VaRaPS, feel free to reach out through by mail at djaout[at]lpsm.paris

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

varaps-0.7.6.tar.gz (36.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

varaps-0.7.6-py2.py3-none-any.whl (43.2 kB view details)

Uploaded Python 2Python 3

File details

Details for the file varaps-0.7.6.tar.gz.

File metadata

  • Download URL: varaps-0.7.6.tar.gz
  • Upload date:
  • Size: 36.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: python-requests/2.31.0

File hashes

Hashes for varaps-0.7.6.tar.gz
Algorithm Hash digest
SHA256 b792ae54698185e244271c3e5a36fc088512e13d4129a1371956b75e0dc47a46
MD5 09837bef2219e9213ce8f3dd1c13f870
BLAKE2b-256 66f0de583b40ea5829e7bacce75b811101a9fdb91af3297b8c7afe33e0f56560

See more details on using hashes here.

File details

Details for the file varaps-0.7.6-py2.py3-none-any.whl.

File metadata

  • Download URL: varaps-0.7.6-py2.py3-none-any.whl
  • Upload date:
  • Size: 43.2 kB
  • Tags: Python 2, Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: python-requests/2.31.0

File hashes

Hashes for varaps-0.7.6-py2.py3-none-any.whl
Algorithm Hash digest
SHA256 ce805f0585184e86e9b6668448538633ab7dc93a51a18fab68dc93beac2237bf
MD5 b6832e074d43d7e0401480c83867705b
BLAKE2b-256 133d17f9d9226ef73611e507f62000cbbaa0aa7f9ccc3f8dbb3fb27ff46aaccd

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page