Tools to deal with one or more measurements from single cells

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

buysdb

These details have not been verified by PyPI

Development Status
- 3 - Alpha
Intended Audience
- Science/Research
License
- OSI Approved :: MIT License
Programming Language
- Python :: 3.6
- Python :: 3.7

Project description

Single cell multi omics

Single cell multi omics is a set of tools to deal with multiple measurements from the same cell. This package is maintained by Barbanson Biotech.

Installation

pip install singlecellmultiomics

For creating a virtual environment look here

Usage

demux.py: This tool demultiplexes raw fastq files, adapters, molecule and cell information are removed from the fastq records and encoded into the read name including their base-calling qualities. Additional stored inforation includes:

Assigned and raw illumina index
Library
Demultiplexing strategy used for demultiplexing (What kind of data the read is derived from)
Assigned barcode index

The next step is usually to trim and then map the demultiplexed reads.

For RNA seq data aligned to a transcriptome the step after this is to run featureCounts.

The mapped reads are encoded in a BAM file. This BAM file still contains the encoded data and this has to be decoded in order to get a useful BAM file. bamtagmultiome.py

Recodes the original read names and extracts all information previously encoded by the demultiplexer.
Adds allele information. (A VCF file is required for this)
Supports multiple protocols: RNA:CELSEQ1, CELSEQ2, VASA (with 8 and 6bp UMI), methylation digest sequencing:SC MSPJI ,
lineage tracing:SCARTRACE, DNA digest sequencing: NLAIII, Epigenetic modification sequencing: scCHIC, scCHIC+Transcriptome, DamID, DamID+T Single cell methylation : TAPs (in combination with any other supported protocol).
Assigns reads to molecules to allow for deduplication, adds duplication BAM flag
Assigns read groups
Splits libraries where multiple modalities are measured
Estimates consensus sequences of molecules

All SAM tags used and written by this package are listed in TAGS.MD

bamToCountTable.py Extracts a count table from a bam file, look here for examples.

libraryStatistics.py All statistics plots can be generated with a single script, look here for details.

Examples:

Demultiplex all fastq.gz files in the current directory using NLAIII barcodes

demux.py *.fastq.gz -use NLAIII384C8U3 --y

Demultiplex only the specified sequencing index (GTTTGA), and everything 1 hamming distance away from GTTTGA :

demux.py -si GTTTGA *.gz --y --hdi 1

API: Using SingleCellMultiOmics from python

All molecule and fragment information can be accessed using python Documentation for the classes in available here

scCHIC

For every fragment in input.bam find scCHIC seq fragments and deduplicate these. Fragments with the same cell barcode, umi, library and strand and starting within a range of 5 bp from each other are assigned as duplicate. The mnase cut site location is expected to be between the first base (Usually an A) this A is part of the sequencing adapter, and the second base (Usually a T). The cut site location is recorded into the DS tag. When alleles are specified using -alleles, the molecule assignment is split up by allele, this means that if two fragments map to the same location and share the same umi, but contain SNPs which indicate differing alleles, the reads are not assigned to the same molecule. For every fragment the ligation sequence is recorded into the RZ tag.

bamtagmultiome.py input.bam -method chic -o tagged.bam

Complete scCHIC data processing instructions from FastQ to count table here

NlaIII

For every fragment in input.bam find NLAIII seq fragments and deduplicate these. Fragments with the same cell barcode, umi, library and strand are assigned as duplicate. The NlaIII cut site location is recorded into the DS tag. When alleles are specified using -alleles, the molecule assignment is split up by allele, this means that if two fragments map to the same location and share the same UMI, but contain SNPs which indicate differing alleles, the reads are not assigned to the same molecule. For every fragment the sequenced part of the NlaIII cut site sequence is recorded into the RZ tag, this is usually CATG, but is allowed to be shifted 1 base to ATG. In the NlaIII protocol a reverse transcription (RT) is used, generally capturing more reverse transcription reactions will yield a more accurate molecule consensus sequence. For every fragment which support the molecule the reverse transcription reaction is recorded by storing the location of the random primer used for RT and the sequence of the random primer.

bamtagmultiome.py input.bam -method nla -o tagged.bam

Complete NlaIII data processing instructions from FastQ to count table here

Plate visualisation

Show relative abundance of reads and unique molecules across 384 well plate.

bamPlateVisualisation.py tagged.bam -o ./plate_plots

Creates the folder ./plate_plots containing

raw_reads_[LIBRARY_TYPE]_[LIBRARY_NAME].png # Distribution of total reads
usable_reads_[LIBRARY_TYPE]_[LIBRARY_NAME].png # Distribution of reads which can be assigned to a molecule
unique_reads_[LIBRARY_TYPE]_[LIBRARY_NAME].png # Distribution of unique reads

All statistics plots can be generated with a single script, look here for details.

Create a contig by sample matrix and divide counts when reads are multimapping. (Used for counting transcriptome mapped reads)

bamToCountTable.py -featureTags chrom -sampleTags SM --divideMultimapping --dedup ./tagged/STAR_mappedAligned.sortedByCoord.out.bam -o transcriptome_counts.csv

Obtain sample, chromosome, restrictionsite, read start, and read end from test.bam file:

bamTabulator.py -featureTags SM,reference_name,DS,reference_start,reference_end test.bam

List all available tags:

bamTabulator.py test.bam

You can additionaly use any of the pysam read attributes

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

buysdb

These details have not been verified by PyPI

Development Status
- 3 - Alpha
Intended Audience
- Science/Research
License
- OSI Approved :: MIT License
Programming Language
- Python :: 3.6
- Python :: 3.7

Release history Release notifications | RSS feed

This version

0.1.43

Aug 12, 2025

0.1.41

Jun 16, 2025

0.1.40

Oct 14, 2024

0.1.32

Apr 13, 2023

0.1.30

Sep 13, 2022

0.1.29

Aug 19, 2022

0.1.28

Jul 1, 2022

0.1.27

Jun 20, 2022

0.1.26

Mar 22, 2022

0.1.25

Jan 13, 2022

0.1.23

Aug 11, 2021

0.1.22

Jun 30, 2021

0.1.20

May 17, 2021

0.1.19

Mar 30, 2021

0.1.18

Jan 30, 2021

0.1.17

Dec 11, 2020

0.1.13

Aug 31, 2020

0.1.12

Jun 22, 2020

0.1.11

Jun 5, 2020

0.1.9

May 12, 2020

0.1.7

Nov 29, 2019

0.1.6

Nov 23, 2019

0.1.5

Oct 28, 2019

0.1.4

Sep 6, 2019

0.1.2b0 pre-release

Jul 17, 2019

0.1.1

Jul 16, 2019

0.0.3

Jul 16, 2019

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

singlecellmultiomics-0.1.43.tar.gz (19.9 MB view details)

Uploaded Aug 12, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

singlecellmultiomics-0.1.43-py3-none-any.whl (20.2 MB view details)

Uploaded Aug 12, 2025 Python 3

File details

Details for the file singlecellmultiomics-0.1.43.tar.gz.

File metadata

Download URL: singlecellmultiomics-0.1.43.tar.gz
Upload date: Aug 12, 2025
Size: 19.9 MB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for singlecellmultiomics-0.1.43.tar.gz
Algorithm	Hash digest
SHA256	`91e62532fc93fcdf63999a4daf08ef8f4aff7b81888181b66fc4d6e6b2b6923e`
MD5	`7cb535c71c0de3cb0b507b7d0cac3c94`
BLAKE2b-256	`cb8424a85bfea622a0ef8110f8db9f6ee3b3ca37f72d0fc8ea3edaad6e19ed56`

See more details on using hashes here.

Provenance

The following attestation bundles were made for singlecellmultiomics-0.1.43.tar.gz:

Publisher: pythonpublish.yml on BuysDB/SingleCellMultiOmics

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: singlecellmultiomics-0.1.43.tar.gz
- Subject digest: 91e62532fc93fcdf63999a4daf08ef8f4aff7b81888181b66fc4d6e6b2b6923e
- Sigstore transparency entry: 384146921
- Sigstore integration time: Aug 12, 2025
Source repository:
- Permalink: BuysDB/SingleCellMultiOmics@fae72593a8ddc162e32f4d397e395c9584f9c0a3
- Branch / Tag: refs/tags/v0.1.43
- Owner: https://github.com/BuysDB
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: pythonpublish.yml@fae72593a8ddc162e32f4d397e395c9584f9c0a3
- Trigger Event: release

File details

Details for the file singlecellmultiomics-0.1.43-py3-none-any.whl.

File metadata

Download URL: singlecellmultiomics-0.1.43-py3-none-any.whl
Upload date: Aug 12, 2025
Size: 20.2 MB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for singlecellmultiomics-0.1.43-py3-none-any.whl
Algorithm	Hash digest
SHA256	`0d76e862f1e23fe603af40b0cd4698bfdc2de9d36a79a2b1563fae47cd4f23db`
MD5	`c123887ed7484bb62162567e02c3a9f7`
BLAKE2b-256	`f6cd8120edc799eb6bf42faa60070a5b3eed4b966a49ee9b23683fd98fe490c5`

See more details on using hashes here.

Provenance

The following attestation bundles were made for singlecellmultiomics-0.1.43-py3-none-any.whl:

Publisher: pythonpublish.yml on BuysDB/SingleCellMultiOmics

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: singlecellmultiomics-0.1.43-py3-none-any.whl
- Subject digest: 0d76e862f1e23fe603af40b0cd4698bfdc2de9d36a79a2b1563fae47cd4f23db
- Sigstore transparency entry: 384146949
- Sigstore integration time: Aug 12, 2025
Source repository:
- Permalink: BuysDB/SingleCellMultiOmics@fae72593a8ddc162e32f4d397e395c9584f9c0a3
- Branch / Tag: refs/tags/v0.1.43
- Owner: https://github.com/BuysDB
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: pythonpublish.yml@fae72593a8ddc162e32f4d397e395c9584f9c0a3
- Trigger Event: release

singlecellmultiomics 0.1.43

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

Single cell multi omics

Installation

Usage

Examples:

API: Using SingleCellMultiOmics from python

scCHIC

NlaIII

Plate visualisation

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance