Skip to main content

scHi-C analysis package

Project description

schic

schic is a Python package for analyzing sc-hic data.

Table of Contents

Overview

schic

Requirements

  1. Python 3.10+
  2. cutadapt 2.10+
  3. pairtools 2.2.3+

schic modules

schic provides a set of modules for analyzing sc-hic data. The modules are:

  • whitelist
  • extract
  • cutadapt
  • contacts
  • report

schic whitelist

Usage

cd 01_whitelist

bc_pattern='(?P<discard_1>ACATGGCTACGATCCGACTTTCTGCG)(?P<cell_1>.{10})(?P<discard_2>CCTTCC)(?P<cell_2>.{10})(?P<discard_3>TCGTCGGCAGCGTCAGATGTGTATA)(?P<umi_1>.{1}).*'

NUM=$(cat ../config/sample|wc -l)
cat ../config/sample|xargs -n 1 -P ${NUM} -i -t bash -e -c \
    """
        schic whitelist \
            --bc-pattern='${bc_pattern}' \
            --stdin ../input/{}/{}_R1.fastq.gz \
            --set-cell-number=5000 \
            --plot-prefix={}_whitelist \
            --stdout={}_whitelist.txt
    """

schic extract

Usage

cd 02_extract
bc_pattern='(?P<discard_1>ACATGGCTACGATCCGACTTTCTGCG)(?P<cell_1>.{10})(?P<discard_2>CCTTCC)(?P<cell_2>.{10})(?P<discard_3>TCGTCGGCAGCGTCAGATGTGTATA)(?P<umi_1>.{1}).*'

NUM=$(cat ../config/sample|wc -l)
cat ../config/sample|xargs -n 1 -P ${NUM} -i -t bash -e -c \
    """
    schic extract \
        --bc-pattern='${bc_pattern}' \
        --stdin ../input/{}/{}_R1.fastq.gz \
        --stdout {}_R1.extracted.fastq.gz \
        --read2-in ../input/{}/{}_R2.fastq.gz \
        --read2-out {}_R2.extracted.fastq.gz \
        --whitelist=../reference/barcodes/whitelist.txt
    """

schic cutadapt

Usage

cd 03_cutadapt

NUM=$(cat ../config/sample|wc -l)
cat ../config/sample|xargs -n 1 -P ${NUM} -i -t bash -e -c \
    """
    schic cutadapt \
        --read1 ../02_extract/{}_R1.extracted.fastq.gz \
        --read2 ../02_extract/{}_R2.extracted.fastq.gz \
        --read1-out {}_R1.trimmed.fastq.gz \
        --read2-out {}_R2.trimmed.fastq.gz \
        --params='-a CTGTCTCTTATACACATCT -A AGATGTGTATAAGAGACAG'
    """

schic contacts

Usage

cd 03_contacts

NUM=$(cat ../config/sample|wc -l)
cat ../config/sample|xargs -n 1 -P ${NUM} -i -t bash -e -c \
    """
    schic contacts \
        --read1 ../03_cutadapt/{}_R1.trimmed.fastq.gz \
        --read2 ../03_cutadapt/{}_R2.trimmed.fastq.gz \
        --reference ../reference/index/bwa/genome \
        --genome-size ../reference/index/bwa/genome.size \
        --prefix {} \
        --threads 8
    
    """
# test
schic contacts \
    --read1 ../03_cutadapt/T4-Bioth-long_R1.trimmed.fastq.gz \
    --read2 ../03_cutadapt/T4-Bioth-long_R2.trimmed.fastq.gz \
    --reference ../reference/index/bwa/genome \
    --genome-size ../reference/index/bwa/genome.size \
    --prefix T4-Bioth-long \
    --filter 3 \
    --threads 16

Docker

If the user has docker installed, the following command can be used to run the pipeline in a docker container:

docker run -v /path/to/data:/data -it schic/schic:latest /bin/bash

Conda Environment

If the user has conda installed, the following command can be used to create a conda environment for schic:

  1. Install conda
  2. Create a new conda environment: conda create -n schic python=3.10
  3. Activate the environment: conda activate schic
  4. Install the required packages: conda install -c bioconda samtools bedtools
  5. Install the required python packages: pip install pandas numpy scipy sklearn matplotlib seaborn pysam
  6. Clone the schic repository: git clone https://github.com/epibiotek/schic.git

Cite schic

If you use schic in your research, please cite the following paper:

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

schic-0.1.7.tar.gz (10.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

schic-0.1.7-py3-none-any.whl (14.1 kB view details)

Uploaded Python 3

File details

Details for the file schic-0.1.7.tar.gz.

File metadata

  • Download URL: schic-0.1.7.tar.gz
  • Upload date:
  • Size: 10.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/2.0.1 CPython/3.12.8 Linux/5.15.0-152-generic

File hashes

Hashes for schic-0.1.7.tar.gz
Algorithm Hash digest
SHA256 74c6a9626e215908167284a65f21a6f42410eea5d2b07739becf3649b1d51cc1
MD5 3a0aa80c81ab7a2681ec293bff061093
BLAKE2b-256 04a9eb17545f3a0ea8c92dff6ac785574ae3a92706f457f93ab6d59087498c03

See more details on using hashes here.

File details

Details for the file schic-0.1.7-py3-none-any.whl.

File metadata

  • Download URL: schic-0.1.7-py3-none-any.whl
  • Upload date:
  • Size: 14.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/2.0.1 CPython/3.12.8 Linux/5.15.0-152-generic

File hashes

Hashes for schic-0.1.7-py3-none-any.whl
Algorithm Hash digest
SHA256 88a881d26538b525806794dc8ea9bc7c17f25d2135318ae77c574dc9e920e2f5
MD5 6251664c9778a70b150c381ff84984c3
BLAKE2b-256 ee6715a0705fe60a5372a114701c5dd81f1a65af93584de82f83d38d9e52717e

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page