scHi-C analysis package

schic

schic is a Python package for analyzing single-cell Hi-C (scHi-C) data.

Overview

schic

Requirements

  1. Python 3.10+
  2. cutadapt 2.10+
  3. pairtools 2.2.3+
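
Before running the modules, it can help to confirm that the external tools listed above are on PATH and that the interpreter is recent enough. The following is a minimal sketch, not part of schic: the helper names missing_tools and python_is_supported are hypothetical, and the check only looks for the executables, it does not compare tool versions.

```python
import shutil
import sys

# External command-line tools named in the Requirements list.
REQUIRED_TOOLS = ["cutadapt", "pairtools"]


def missing_tools(tools):
    """Return the tools from the list that cannot be found on PATH."""
    return [tool for tool in tools if shutil.which(tool) is None]


def python_is_supported(version_info=sys.version_info):
    """Return True when the interpreter is Python 3.10 or newer."""
    return tuple(version_info[:2]) >= (3, 10)


# Report the state of the current environment.
print("Python 3.10+:", python_is_supported())
print("Missing tools:", missing_tools(REQUIRED_TOOLS))
```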

schic modules

schic provides the following modules for analyzing scHi-C data:

  • whitelist
  • extract
  • cutadapt
  • contacts
  • report

schic whitelist

Usage

cd 01_whitelist

bc_pattern='(?P<discard_1>ACATGGCTACGATCCGACTTTCTGCG)(?P<cell_1>.{10})(?P<discard_2>CCTTCC)(?P<cell_2>.{10})(?P<discard_3>TCGTCGGCAGCGTCAGATGTGTATA)(?P<umi_1>.{1}).*'

NUM=$(cat ../config/sample|wc -l)
cat ../config/sample|xargs -n 1 -P ${NUM} -i -t bash -e -c \
    """
        schic whitelist \
            --bc-pattern='${bc_pattern}' \
            --stdin ../input/{}/{}_R1.fastq.gz \
            --set-cell-number=5000 \
            --plot-prefix={}_whitelist \
            --stdout={}_whitelist.txt
    """

schic extract

Usage

cd 02_extract
bc_pattern='(?P<discard_1>ACATGGCTACGATCCGACTTTCTGCG)(?P<cell_1>.{10})(?P<discard_2>CCTTCC)(?P<cell_2>.{10})(?P<discard_3>TCGTCGGCAGCGTCAGATGTGTATA)(?P<umi_1>.{1}).*'

NUM=$(cat ../config/sample|wc -l)
cat ../config/sample|xargs -n 1 -P ${NUM} -i -t bash -e -c \
    """
    schic extract \
        --bc-pattern='${bc_pattern}' \
        --stdin ../input/{}/{}_R1.fastq.gz \
        --stdout {}_R1.extracted.fastq.gz \
        --read2-in ../input/{}/{}_R2.fastq.gz \
        --read2-out {}_R2.extracted.fastq.gz \
        --whitelist=../reference/barcodes/whitelist.txt
    """

schic cutadapt

Usage

cd 03_cutadapt

NUM=$(cat ../config/sample|wc -l)
cat ../config/sample|xargs -n 1 -P ${NUM} -i -t bash -e -c \
    """
    schic cutadapt \
        --read1 ../02_extract/{}_R1.extracted.fastq.gz \
        --read2 ../02_extract/{}_R2.extracted.fastq.gz \
        --read1-out {}_R1.trimmed.fastq.gz \
        --read2-out {}_R2.trimmed.fastq.gz \
        --params='-a CTGTCTCTTATACACATCT -A AGATGTGTATAAGAGACAG'
    """

schic contacts

Usage

cd 04_contacts

NUM=$(cat ../config/sample|wc -l)
cat ../config/sample|xargs -n 1 -P ${NUM} -i -t bash -e -c \
    """
    schic contacts \
        --read1 ../03_cutadapt/{}_R1.trimmed.fastq.gz \
        --read2 ../03_cutadapt/{}_R2.trimmed.fastq.gz \
        --reference ../reference/index/bwa/genome \
        --genome-size ../reference/index/bwa/genome.size \
        --prefix {} \
        --threads 8
    
    """
# test
schic contacts \
    --read1 ../03_cutadapt/T4-Bioth-long_R1.trimmed.fastq.gz \
    --read2 ../03_cutadapt/T4-Bioth-long_R2.trimmed.fastq.gz \
    --reference ../reference/index/bwa/genome \
    --genome-size ../reference/index/bwa/genome.size \
    --prefix T4-Bioth-long \
    --threads 16

Docker

If Docker is installed, the following command starts an interactive shell in the schic container, with /path/to/data mounted at /data:

docker run -v /path/to/data:/data -it schic/schic:latest /bin/bash

Conda Environment

To set up a conda environment for schic, follow these steps:

  1. Install conda
  2. Create a new conda environment: conda create -n schic python=3.10
  3. Activate the environment: conda activate schic
  4. Install the required packages: conda install -c bioconda samtools bedtools
  5. Install the required Python packages: pip install pandas numpy scipy scikit-learn matplotlib seaborn pysam
  6. Clone the schic repository: git clone https://github.com/epibiotek/schic.git

Cite schic

If you use schic in your research, please cite the following paper:

Download files

Source Distribution

schic-0.1.6.tar.gz (7.3 kB)

Uploaded Source

Built Distribution

schic-0.1.6-py3-none-any.whl (11.1 kB)

Uploaded Python 3

File details

Details for the file schic-0.1.6.tar.gz.

File metadata

  • Download URL: schic-0.1.6.tar.gz
  • Upload date:
  • Size: 7.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/2.0.1 CPython/3.12.8 Linux/5.15.0-151-generic

File hashes

Hashes for schic-0.1.6.tar.gz
Algorithm Hash digest
SHA256 238b133d7ca9dac6d03cc4ab4bc1160465a36d3e1ef4e09683c95e92cd961c6b
MD5 9170a3c0015e08d6b89aac3b83969ce4
BLAKE2b-256 78124ca90f73229f9c80986f32451e0f1621a880cc093e72e7617da8e1c4bc1b

File details

Details for the file schic-0.1.6-py3-none-any.whl.

File metadata

  • Download URL: schic-0.1.6-py3-none-any.whl
  • Upload date:
  • Size: 11.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/2.0.1 CPython/3.12.8 Linux/5.15.0-151-generic

File hashes

Hashes for schic-0.1.6-py3-none-any.whl
Algorithm Hash digest
SHA256 3cf125017628027d8b5b0cec14d296b39a0223c1782cda078ba4c4e9304a420d
MD5 fd669c4a5419271e938ee78435ff0475
BLAKE2b-256 b415839536c66951ab368c3361c0547b559a6408cbf08e5b238043f4a93d1c0c
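
A downloaded file can be checked against the SHA256 digests listed above. The following is a minimal sketch using Python's standard hashlib module; the helper names sha256_of and matches_published_digest are hypothetical, and the file must already be present locally.

```python
import hashlib


def sha256_of(path, chunk_size=1 << 20):
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_digest(path, expected_hex):
    """Return True when the file's SHA256 digest equals the published one."""
    return sha256_of(path) == expected_hex.strip().lower()


# Example call (requires the downloaded source distribution):
# matches_published_digest(
#     "schic-0.1.6.tar.gz",
#     "238b133d7ca9dac6d03cc4ab4bc1160465a36d3e1ef4e09683c95e92cd961c6b",
# )
```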
