Skip to main content

High-resolution copy number variant calling in single-cell whole-genome sequencing.

Project description

Logo generated by DALLE-3

HiScanner (HIgh-resolution Single-Cell Allelic copy Number callER)

PyPI version License: MIT

HiScanner is a lightweight python package for high-resolution single-cell copy number analysis.

Content

Installation

It is recommended to install HiScanner in a virtual environment. Here's how to create one using conda:

conda create -n hiscanner_env python=3.8
conda activate hiscanner_env

To install HiScanner, simply use pip:

pip install hiscanner

Requirements

HiScanner requires bcftools, which must be included in PATH. All other dependencies should be installed automatically with pip.

Quick Start

To get started with HiScanner, please refer to our tutorial. Here's a quick example:

import hiscanner
# define your json file path
path = "your/dir/to/json"
# preprocess
hiscanner.pp.preprocess(path)
# segment
hiscanner.tl.segment(path)
# infer copy number
hiscanner.tl.infer_copy_number(path)
# visualize 
hiscanner.pl.plot_whole_genome_track(path)

Command line interface

For advanced users interested in multi-sample segmentation (e.g., repurposing for bulk samples), HiScanner provides a command line interface. In bash, simply run:

hiscanner-segment -i {input_file} -l {LAMBDA} -o {output_file}

For more details, please refer to our documentation.

Required input

1) JSON

HiScanner requires a json file as input. Here's an example:

{
    "bin_path": "/Users/yifan/dev/scanner_tutorial/tests/data/bins",
    "phase_file": "/Users/yifan/dev/scanner_tutorial/tests/data/hsnps/samplename_phased_hsnps.vcf",
    "germline": "bulkname",
    "gatk_vcf": "/Users/yifan/dev/scanner_tutorial/tests/data/hsnps/samplename_gatk.vcf",
    "stem": "/Users/yifan/dev/scanner_tutorial/tests/data/output/",
    "j": 20,
    "singlecell":"cellA,cellB",
    "MAX_WGD": 1,
    "LAMBDA":200
}

2) BAM

The BAM file should be indexed and sorted.

3) Unphased VCF

output from GATK or SCAN-SNV; The VCF file should be indexed and sorted.

4) Phased VCF

output from Eagle or other phasing tools;

5) Bins

The bins file should be a bed file (output from BIC-seq2)

Operating System

HiScanner was tested in the following operating systems:

  • macOS Ventura 13.5.2
  • CentOS Linux 7.9
  • Windows 11

Documentation

For more detailed information and advanced usage, please refer to our documentation.

Support

HiScanner is currently under active development. For support or questions, please open an issue on our GitHub repository.

Contributing

Contributions to HiScanner are welcome. Please refer to our contribution guidelines for more information.

LICENSE

HiScanner is released under the MIT License, and is freely available for non-commercial use.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

hiscanner-0.2a0.tar.gz (109.4 kB view details)

Uploaded Source

Built Distribution

hiscanner-0.2a0-py3-none-any.whl (112.8 kB view details)

Uploaded Python 3

File details

Details for the file hiscanner-0.2a0.tar.gz.

File metadata

  • Download URL: hiscanner-0.2a0.tar.gz
  • Upload date:
  • Size: 109.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.8.17

File hashes

Hashes for hiscanner-0.2a0.tar.gz
Algorithm Hash digest
SHA256 4e4b9290fda7854a175cbd0312d1f81042ec110098f6aa199b87bec4819ec383
MD5 dd1c2bc29663f96c9bf8644eac540b9f
BLAKE2b-256 af2d514cc7fd5aa8663f20aa197b7866354040e3f22643224f98738e264c0f1e

See more details on using hashes here.

File details

Details for the file hiscanner-0.2a0-py3-none-any.whl.

File metadata

  • Download URL: hiscanner-0.2a0-py3-none-any.whl
  • Upload date:
  • Size: 112.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.8.17

File hashes

Hashes for hiscanner-0.2a0-py3-none-any.whl
Algorithm Hash digest
SHA256 4271b9eca120ac3b5bda1232b3912c1fac4dc89b57afabc015d94bf40f2c2166
MD5 d177125f90191bd9990e60b5c1b4057e
BLAKE2b-256 565b3862b37efbaea8ae54aee2bcd9892ec5b320612f21a8f68e47e5e9e9e638

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page