Skip to main content

Decline in transcriptional homeostasis defines aging

Project description

HDS Package

Age-dependent dysregulation of transcription regulatory machinery triggers modulations in the gene expression levels leading to the decline in cellular fitness. Tracking of these transcripts along the temporal axis in multiple species revealed a spectrum of evolutionarily conserved pathways, such as electron transport chain, translation regulation, DNA repair, etc. Recent shreds of evidence suggest that aging deteriorates the transcription machinery itself, indicating the hidden complexity of the aging transcriptomes. This reinforces the need for devising novel computational methods to view aging through the lens of transcriptomics. Here, we present Homeostatic Divergence Score (HDS) to quantify the extent of messenger RNA (mRNA) homeostasis by assessing the balance between spliced and unspliced mRNA repertoire in single cells. By tracking HDS across single-cell expression profiles of human pancreas we identified a core set of 171 transcripts undergoing an age-dependent homeostatic breakdown. Notably, many of these transcripts are well-studied in the context of organismal aging. Our preliminary analyses in this direction suggest that mRNA processing level information offered by single-cell RNA sequencing (scRNA-seq) data is a superior determinant of chronological age as compared to transcriptional noise.

Instructions

How to install?

  1. These are are required packages:

    scipy, numpy, pandas, velocyto, scanpy, anndata, matplotlib, seaborn, matplotlib_venn, leidenalg and scikit-learn

  2. To install these packages use below command

    !pip install scipy numpy pandas velocyto scanpy anndata matplotlib seaborn matplotlib_venn leidenalg scikit-learn

  3. Get latest version of HDS from the link given below:

    https://test.pypi.org/project/HDS-krishang

  4. Install it using below command.

    pip install -i https://test.pypi.org/simple/ HDS-krishang

How to make loom files from fastq files?

  1. Download fastq files from the link given below (or any other link):

    https://www.ebi.ac.uk/arrayexpress/experiments/E-MTAB-6687/samples/

  2. For 10x fastq files, use the 'cellranger count' command to generate bam files.

    For example:

    cellranger count --id=$sample --transcriptome=$transcriptome --fastqs=/sample.fastqs --sample=$sample --expect-cells=8000 --localcores=12

    FYI: Download transcriptome from the link given below:

    https://support.10xgenomics.com/single-cell-gene-expression/software/downloads/latest

  3. 'STAR' tool can also be used for alignment to reference genome and generate bam file.

    For example:

    STAR --runThreadN 12 --genomeDir /star_mouse/index --sjdbGTFfile /gencode.vM25.primary_assembly.annotation.gtf --readFilesIn $line1.fastq.gz $line2.fastq.gz --outFileNamePrefix $line.bam --readFilesCommand zcat --outSAMtype BAM SortedByCoordinate

    1. Create star index using standard parameters
    2. Download gtf file from the link given below:

    https://www.gencodegenes.org/human

  4. Generate the loom file using velocyto command.

    For example:

    For 10x data, use the command written below:

    velocyto run10x -m hg19_rmsk.gtf sample_folder/01 refdata-gex-GRCh38-2020-A/genes/genes.gtf

    1. Download gtf file from the link given below:

    https://www.gencodegenes.org/human/

    1. Download mask file from the link given below:

    https://genome.ucsc.edu/cgi-bin/hgTables?hgsid=611454127_NtvlaW6xBSIRYJEBI0iRDEWisITa&clade=mammal&org=Human&db=0&hgta_group=allTracks&hgta_track=rmsk&hgta_table=rmsk&hgta_regionType=genome&position=&hgta_outputType=gff&hgta_outFileName=mm10_rmsk.gtf

    1. For STAR generated bam files, use the command written below:

    velocyto run -b filtered_barcodes.tsv -o output_path -m repeat_msk_srt.gtf possorted_genome_bam.bam mm10_annotation.gtf

How to use?

  1. from HDS import HDS

    HDS("path of loom file")

    For example:

    HDS("/home/krishan/loom/abc.loom")

  2. Use 'clusters' parameter to pass cluster identity of cells if you have. Otherwise, HDS by default uses 'leiden' method with resolution = 1, inbuilt in scanpy package. Note: clusters labels should be in the same order as barcode in the loom file.

    For example:

    HDS(path1="path of loom file", clusters=[1,2,1,2,3,4,5])

  3. Use 'per' parameter to specify the X percentile genes with top HDS score. This could be important since HDS can return large negative valuesthereby distorting the overall distribution plots involving HDS scores.

  4. Use 'genes' parameter to pass speific genes for which you want to generate the phase portraits.

    For example:

    HDS(path1="path of loom file", genes=['GENE1','GENE2'])

  5. Notably default scanpy parameters are (you can change it):

    min_genes=200, min_cells=3, n_genes_by_counts=2500, pct_counts_mt=5

    To understand the relevance of these parameters check out:

  6. We have created a Google colab notebook with the code and loom file. Link is given below:

    https://drive.google.com/drive/folders/1Pq9IsjnCYaJngU8WQ0E1RjIqA9f-j3lY?usp=sharing

Output?

HDS function will return a pandas data frame cantaining HDS scores of genes across all clusters.

Rsquared (pandas data frame)

HDS score distribution for each supplied cluster

Rsquared (violin plot)

Example phase portraits of genes under homeostasis breakdown

portrait of rhomeostatis genes

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

HDS-krishangupta-0.0.1.tar.gz (8.1 kB view details)

Uploaded Source

Built Distribution

HDS_krishangupta-0.0.1-py3-none-any.whl (8.9 kB view details)

Uploaded Python 3

File details

Details for the file HDS-krishangupta-0.0.1.tar.gz.

File metadata

  • Download URL: HDS-krishangupta-0.0.1.tar.gz
  • Upload date:
  • Size: 8.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.3.0 pkginfo/1.6.1 requests/2.25.0 setuptools/51.3.1 requests-toolbelt/0.9.1 tqdm/4.51.0 CPython/3.6.9

File hashes

Hashes for HDS-krishangupta-0.0.1.tar.gz
Algorithm Hash digest
SHA256 fd75bd523c03ce717e3a3ca671fddb4446c31d571757a5286c611d09269270fb
MD5 bb8b77234abafad2c4a764f57074fb46
BLAKE2b-256 477ebcb4a466da08f324efec1f6bf648483fc9a5d4ccdb3fd33cdf7d0470a482

See more details on using hashes here.

File details

Details for the file HDS_krishangupta-0.0.1-py3-none-any.whl.

File metadata

  • Download URL: HDS_krishangupta-0.0.1-py3-none-any.whl
  • Upload date:
  • Size: 8.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.3.0 pkginfo/1.6.1 requests/2.25.0 setuptools/51.3.1 requests-toolbelt/0.9.1 tqdm/4.51.0 CPython/3.6.9

File hashes

Hashes for HDS_krishangupta-0.0.1-py3-none-any.whl
Algorithm Hash digest
SHA256 d8616720a66fe407572e5bd3554ccae8cd762abfbc9f28296cb49870d2106737
MD5 de5a5fe701f64bc635b9267186b8e3b2
BLAKE2b-256 1ee9d3765857511eb1f438a078851758a5f5481ae9d0c25f0f329bd8525597b3

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page