cytocoarsening-graph coarsening

📦 cytocoarsening.py

We want to identify cell types that are both coherent in phenotype (e.g., cell phenotype) and related to external information about the cells. Graph-based approaches for identifying these modules can struggle in the single-cell setting because an extremely large number of cells is profiled per sample, and multiple samples are often profiled across different experimental conditions or timepoints. The source code is available in the Cytocoarsening GitHub repository (https://github.com/ChenCookie/cytocoarsening).

Overview

Installation

To install from PyPI, run this line on your command line (Cytocoarsening PyPI):

pip install cytocoarsening
  • Alternatively, clone the git repository:
git clone https://github.com/ChenCookie/cytocoarsening.git
  • Once you've cloned the repository, change your working directory to that folder:
cd cytocoarsening

Data access

Taking the preeclampsia dataset as an example, the following lists all of the publicly available files for download:

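from lxml import html
import numpy as np
import requests

# Fetch the Zenodo record page for the preeclampsia dataset
r = requests.get('https://zenodo.org/record/6779483#.Yrygu-zMJhF')
content = html.fromstring(r.content)
# Collect every link on the page and keep only the direct download links
hrefs = content.xpath('//a/@href')
files = [i for i in hrefs if i.endswith('?download=1')]
files = np.unique(files)  # drop duplicate links
print(files)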
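
If lxml is not installed, the same link filtering can be sketched with only the Python standard library. This is a minimal illustration, not part of cytocoarsening: the names `DownloadLinkParser` and `list_download_links` are hypothetical, and the sample HTML is made up so that the snippet runs without network access. In practice the page text would come from the Zenodo record page fetched above.

```python
from html.parser import HTMLParser


class DownloadLinkParser(HTMLParser):
    """Hypothetical helper: collects <a href> values ending in '?download=1'."""

    def __init__(self):
        super().__init__()
        self.links = set()  # a set removes duplicate links

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        for name, value in attrs:
            if name == "href" and value and value.endswith("?download=1"):
                self.links.add(value)


def list_download_links(page_html):
    """Return the sorted, de-duplicated download links found in page_html."""
    parser = DownloadLinkParser()
    parser.feed(page_html)
    return sorted(parser.links)


# Made-up HTML used only to exercise the helper offline.
sample_html = (
    '<a href="/record/1/files/a.xlsx?download=1">a</a>'
    '<a href="/about">about</a>'
    '<a href="/record/1/files/a.xlsx?download=1">duplicate</a>'
)
links = list_download_links(sample_html)
print(links)
```

Sorting the set plays the same role as `np.unique` in the snippet above: each link is listed once, in a stable order.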

To download any file of the preeclampsia dataset from Zenodo, for example Han-FCS_file_list.xlsx:

curl 'https://zenodo.org/record/6779483/files/Han-FCS_file_list.xlsx?download=1' --output Han-FCS_file_list.xlsx

Parameter Explanation

The function can be executed in one line.

coarsening_group,group_edge,result_dicts=cytocoarsening(cell_data,cell_label,multipass,k_nearest_neighbors)

input

  • cell_data - numpy.ndarray. The single-cell data, with one row per cell and one column per feature. The shape of the ndarray is (number of cells, number of features). Default: cell_data = None.
  • cell_label - numpy.ndarray. The attribute of each cell. The shape of the ndarray is (number of cells,). Default: cell_label = None.
  • multipass - int. The number of coarsening passes used to reduce the size of the data. Default: multipass = 10.
  • k_nearest_neighbors - int. The number of neighbors in the initial graph of each pass. Default: k_nearest_neighbors = 5.
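
The shape requirements above can be checked before calling the function. The following is a minimal sketch, assuming only what the parameter list states: a 2-D `cell_data` and a 1-D `cell_label` with one entry per cell. `check_inputs` is a hypothetical helper and not part of the cytocoarsening package.

```python
import numpy as np


def check_inputs(cell_data, cell_label):
    """Hypothetical helper: validate shapes before calling cytocoarsening."""
    cell_data = np.asarray(cell_data, dtype=float)
    cell_label = np.asarray(cell_label)
    if cell_data.ndim != 2:
        raise ValueError("cell_data must have shape (cell number, features number)")
    if cell_label.shape != (cell_data.shape[0],):
        raise ValueError("cell_label must have shape (cell number,)")
    return cell_data, cell_label


# Three cells with two features each, and one label per cell.
data, label = check_inputs([[0.1, 0.2], [0.3, 0.4], [0.5, 0.6]], [0, 1, 0])
print(data.shape, label.shape)
```

A mismatch, such as two labels for three cells, raises a `ValueError` here instead of surfacing later inside the coarsening run.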

output

  • coarsening_group - dict. Maps each supernode (key) to the list of node numbers in its group (value) in the coarsened graph.
  • group_edge - numpy.ndarray. An array recording the edges that combine two nodes.
  • result_dicts - dict. Stores the result values, including accuracy, error rate, quadratic equation evaluation in feature and label, node number, edge number, runtime, and keypoint.
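
As an illustration of how the `coarsening_group` dictionary might be inspected, the sketch below summarizes each supernode by its size and majority label. It assumes that the node numbers in each value list index into the original `cell_label` array, which the description above suggests but does not state outright. `supernode_summary` is a hypothetical helper, and the toy dictionary is made up rather than real output of the package.

```python
from collections import Counter


def supernode_summary(coarsening_group, cell_label):
    """Hypothetical helper: size, majority label, and label purity per supernode."""
    summary = {}
    for supernode, members in coarsening_group.items():
        if len(members) == 0:
            continue  # nothing to summarize for an empty group
        counts = Counter(cell_label[i] for i in members)
        majority_label, majority_count = counts.most_common(1)[0]
        summary[supernode] = {
            "size": len(members),
            "majority_label": majority_label,
            "purity": majority_count / len(members),
        }
    return summary


# Made-up grouping: supernode 0 holds cells 0-2, supernode 1 holds cells 3-4.
toy_group = {0: [0, 1, 2], 1: [3, 4]}
toy_label = [0, 0, 1, 1, 1]
summary = supernode_summary(toy_group, toy_label)
print(summary)
```

A purity close to 1 means that the cells merged into a supernode mostly share the same label.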

Toy Example

from cytocoarsening.cytocoarsening import cytocoarsening
import numpy as np
import random

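# 4500 cells with 33 random features each
cell_data=[[random.random() for i in range(33)] for j in range(4500)]
cell_data=np.array(cell_data)

# One label per cell: 1000 cells labeled 0 and 3500 labeled 1, in random order
cell_label = np.array([0] * 1000 + [1] * (3500))
np.random.shuffle(cell_label)

# multipass=3 and k_nearest_neighbors=5
group,edge,result_dicts=cytocoarsening(cell_data,cell_label,3,5)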
