A thin, pythonic wrapper around geftool.
Project description
gefpy使用文档(gefpy is a thin, pythonic wrapper around geftools, which runs on Python 3 (3.7+).)
bgef_creater_cy模块
- 对geftools中creat bgef的python接口
- 相关接口
.. py:module:: bgef_creater_cy .. py:class:: BgefCreater(thcnt=8) .. py:method:: create_bgef(self, strin, bin, strmask, strout) Create tisscuecut bgef by bgef/gem and mask. :param strin: raw bgef or bgem :param bin: mask binsize :param strmask: mask path :param strout: out path .. py:method:: get_stereo_data(self, strin, bin, strmask) Get tisscuecut stereo data by bgef/gem and mask. :param strin: raw bgef or bgem :param bin: mask binsize :param strmask: mask path + uniq_cell is list that save all cell, each cell val (exp.x<<32 | exp.y). + gene_names is a list of gene names. + count is a list that save the midcnt of each expression. + cell_index is a list that save the cell idx of each expression. + gene_index is a list that records the gene serial number corresponding to each exp. :return: (uniq_cell, gene_names, count, cell_index, gene_index)
bgef_reader_cy模块
- 提供了对gef文件的读取接口
- 相关接口
.. py:module:: bgef_reader_cy .. py:class:: BgefR(filepath, bin_size, n_thread) .. py:method:: get_expression_num(self) Get the number of expression. .. py:method:: get_cell_num(self) Get the number of cell. .. py:method:: get_gene_num(self) Get the number of gene. .. py:method:: get_gene_names(self) Get a list of gene names. .. py:method:: get_cell_names(self) Get a list of cell ids, each item is (exp.x<<32 | exp.y) .. py:method:: get_gene_data(self) Get gene data. + gene_index is a list that records the gene serial number corresponding to each exp. + gene_names is a list of gene names. :return: (gene_index, gene_names) .. py:method:: get_expression(self) Get the all expression from bgef. + explist is a list, each item is (x, y, count, exon). :return: explist .. py:method:: get_exp_data(self) Get sparse matrix indexes of expression data. + uniq_cell is list that save all cell, each cell val (exp.x<<32 | exp.y). + cell_index is a list that save the cell idx of each expression. + count is a list that save the midcnt of each expression. :return: (uniq_cell, cell_index, count) .. py:method:: get_genedata_in_region(self, min_x, max_x, min_y, max_y, key) Get the explist by the specified gene name in the region. :param min_x: region minx :param max_x: region maxx :param min_y: region miny :param max_y: region maxy :param key: gene name :return: explist .. py:method:: get_offset(self) Get the offset in bgef. :return: (minx, miny) .. py:method:: get_exp_attr(self) Get the bgef attr. :return: (minx, miny, maxx, maxy, maxexp, resolution) .. py:method:: get_filtered_data(self, region, genelist) Get the filtered data from bgef by region or gene. :param region: rect region(minx,maxx,miny,maxy) :param genelist: gene name list + uniq_cell is list that save all cell, each cell val (exp.x<<32 | exp.y). + gene_names is a list of gene names. + count is a list that save the midcnt of each expression. + cell_index is a list that save the cell idx of each expression. + gene_index is a list that records the gene serial number corresponding to each exp. :return: (uniq_cell, gene_names, count, cell_index, gene_index)
bgef_writer_cy模块
- bgef文件的写入功能的python接口
- 相关接口
.. py:module:: bgef_writer_cy .. py:function:: generate_bgef(input_file, bgef_file, stromics="Transcriptomics", n_thread = 8, bin_sizes = None, region = None) Function to generate common bin GEF file(.bgef). :param input_file: The input file path of gem file or bin1 bgef. :param bgef_file: Output BGEF filepath. :param stromics: input the omics. :param n_thread: Number of thread, default 8 :param bin_sizes: A list of bin sizes, default: 1,10,20,50,100,200,500 :param region: A list of region (minX, maxX, minY, maxY) .. py:function:: gem2tif(gempath, tif_path) Function to generate tif file by GEM file(.gem & .gem.gz). :param gempath: The input file path of gem file. :param tif_path: Output tiff filepath.
cgef_adjust_cy模块
- 该模块提供了一些函数,包含lasso、mid count filter、渲染热图采样等功能,主要是stereo map使用
- 相关接口
.. py:module:: cgef_adjust_cy .. py:class:: CgefAdjust() .. py:method:: get_cell_data(self, bgef, cgef) Get raw cell data from cgef and bgef file. :param bgef: the bgef file path :param cgef: the cgef file path :returns: (genelist, vec_cell) .. py:method:: write_cgef_adjustdata(self, path, celldata, dnbdata) write the adjust cell data to cgef :param path: set the Output path :param celldata: input the cell data :param dandata: input the dandata .. py:method:: create_Region_Bgef(self, inpath, outpath, pos) generate spatial bin gef file by lasso region datas :param inpath: the bgef file path :param outpath: set the Output path :param pos: lasso region datas .. py:method:: create_Region_Cgef(self, inpath, outpath, pos) generate cell bin gef file by lasso region datas :param inpath: the cgef file path :param outpath: set the Output path :param pos: lasso region datas .. py:method:: get_regiondata_frombgef(self, inpath, bin, thcnt, pos) Get gene info from spatial bin gef file by lasso region datas :param inpath: the bgef file path :param bin: set bin size :param thcnt: thread counts :param pos: lasso region datas :returns vecdata: gene info{genecnt,midcnt,x,y} in region .. py:method:: get_regiondata_fromcgef(self, input_path, pos) Get cell statistical info from cell bin gef file by lasso region datas :param input_path: the cgef file path :param pos: lasso region datas :returns vecdata: statistical info{cell_count,total_area,average_gene_count,average_exp_count,average_dnb_count,average_area,median_gene_count,median_exp_count,median_dnb_count,median_area} in region .. py:method:: get_multilabel_regiondata_bgef(self, inpath, pos, bin=1, thcnt=4) The gene name and MIDcount of multiple labels are returned after the lasso :param inpath: the input bgef file path :param pos: lasso region datas(contain multi labels) :param bin: binsize :param thcnt: thread count :returns region_data, total_mid: region_data(gene_name, MIDcount), total midcount in region .. py:method:: get_multilabel_regiondata_cgef(self, inpath, pos) The gene name and MIDcount of multiple labels are returned after the lasso :param inpath: the input cgef file path :param pos: lasso region datas(contain multi labels) :returns vecdata, total_data: vecdata(cluster_id, mid_cnt, area, cell_id, x, y), total_data(cluster_id, mid_cnt, area, cell_id) .. py:method:: get_position_by_clusterid(self, inpath, clusterid) Get position value(x, y) by cluster id from h5ad file :param inpath: the input h5ad file :param clusterid: input cluster id need to get position :returns region_data: position value(x, y) .. py:method:: generate_filter_bgef_by_midcnt(self, inpath, outpath, binsize, filter_data, only_filter=False) generate complete bgef file by gene&protein mid count value :param inpath: input bgef file :param outpath: output bgef file :param binsize: current binsize :param only_filter: generate bgef only have filter gene&protein :param filter_data: filter gene&protein name and mid count :returns ret: generate result .. py:method:: get_filter_bgef_process_rate(self) Get generate process rate, Must be used in conjunction with the generate_filter_bgef_by_midcnt :returns ret: current process rate .. py:method:: generate_bgef_by_lasso(self, inpath, outpath, pos) generate complete bgef file by lasso region datas :param inpath: the input bgef file path :param outpath: the generate bgef file path :param pos: lasso region datas :returns ret: generate result .. py:method:: get_lasso_bgef_process_rate(self) Get generate process rate, Must be used in conjunction with the generate_bgef_by_lasso :returns ret: current process rate .. py:method:: generate_bgef_by_coordinate(self, inpath, outpath, cord, bin_size) generate bgef file by input coordinate in lasso region :param inpath: the bgef file path :param inpath: the bgef file path :param cord: lasso region datas :param bin_size: input binsize :returns ret: generate result .. py:method:: generate_cgef_by_coordinate(self, inpath, outpath, cord) generate cell bin gef file by coordinate in lasso region :param inpath: the input cgef file path :param outpath: the output cgef file path :param cord: lasso region datas :returns ret: generate result
cgef_reader模块
- 提供了对cgef文件的读取功能
- 相关接口
.. py:module:: cgef_reader_cy .. py:class:: CgefR(filepath) .. py:method:: get_expression_num(self) Get the number of expression. .. py:method:: get_cell_num(self) Get the number of cell. .. py:method:: get_gene_num(self) Get the number of gene. .. py:method:: get_gene_names(self) Get a list of gene names. The type of gene name is 32 chars. .. py:method:: get_cell_names(self) Get an array of cell ids. Each cell id is (cell.x <<32 | cell.y) .. py:method:: get_cells(self) Get cells, each cell include (id, x, y, offset, geneCount, expCount, dnbCount, area, cellTypeID, clusterID) .. py:method:: get_genes(self) Get genes, each gene include(geneName, offset, cellCount, expCount, maxMIDcount) .. py:method:: get_cellid_and_count(self) Get the count of each cell in each gene. :return: (cell_id, count) .. py:method:: get_geneid_and_count(self) Get the count of each gene in each cell. :return: (gene_id, count) .. py:method:: get_cellborders(self) Gets cell borders. :return: borders_list .. py:method:: get_filtered_data(self, region, genelist) Get the filtered data from cgef by region or gene. :param region: rect region(minx,maxx,miny,maxy) :param genelist: gene name list + uniq_cell is list that save all cell, each cell val (exp.x<<32 | exp.y). + gene_names is a list of gene names. + count is a list that save the midcnt of each expression. + cell_index is a list that save the cell idx of each expression. + gene_index is a list that records the gene serial number corresponding to each exp. :return: (uniq_cell, gene_names, count, cell_index, gene_index)
cgef_writer_cy模块
- 提供对cgef文件的写入
- 相关接口
.. py:module:: cgef_writer_cy .. py:function:: generate_cgef(cgef_file, bgef_file, mask_file, block_size: list) Generate cell bin GEF file from bgef + mask. :param cgef_file: Output CGEF filepath. :param bgef_file: Input BGEF filepath. :param mask_file: Input make filepath. :param block_size: Block size list, usually set to [256,256]. .. py:function:: cgem_to_cgef(cgem_file, outpath, block_size: list) Generate cell bin GEF file from cgem. :param cgem_file: Input cgem path. :param outpath: Output cgef path. :param block_size: Block size list, usually set to [256,256].
gef_to_gem_cy模块
- 提供gem文件转gef文件的功能
- 相关接口
.. py:module:: gef_to_gem_cy .. py:class:: GefToGem(strout, strsn, boutexon) .. py:method:: bgef2gem(self, strbgef, binsize) Create bgem file by bgef. :param strbgef: the bgef file path :param binsize: set the binsize .. py:method:: cgef2gem(self, strbgef, binsize) Create cgem file by cgef and bgef. :param strcgef: the cgef file path :param strbgef: the bgef file path .. py:method:: bgef2cgem(self, strmask, strbgef) Create cgem file by mask and bgef. :param strcgef: the mask file path :param strbgef: the bgef file path
API(also can see here)
See here.
Reporting bugs
Open a bug at https://github.com/STOmics/gefpy/issues.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distributions
Built Distributions
File details
Details for the file gefpy-1.2.3-cp310-cp310-win_amd64.whl
.
File metadata
- Download URL: gefpy-1.2.3-cp310-cp310-win_amd64.whl
- Upload date:
- Size: 17.8 MB
- Tags: CPython 3.10, Windows x86-64
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.9.19
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 2fddead8560f2397d15539c6a9250e3787593906735a5eb4053a40611478c0cd |
|
MD5 | 0c6fe49292da0b293332bfa604f275cc |
|
BLAKE2b-256 | e5b303552a2be24d39d03d60551a78055183618e7b4d3ffad7d620ed8e26b9a7 |
File details
Details for the file gefpy-1.2.3-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
.
File metadata
- Download URL: gefpy-1.2.3-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
- Upload date:
- Size: 24.6 MB
- Tags: CPython 3.10, manylinux: glibc 2.17+ x86-64
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.9.19
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 0b7036c3c381ba13cb7d2f8716ddb8a65b28f7d6ffeed184e20564aad3bfd8b5 |
|
MD5 | 4b2b0461f609b772bc4d1425abbdedcd |
|
BLAKE2b-256 | 71892f26f0d5cc2246f438d7a1146755aaad8bee64cabd09cab3f2626624c5d8 |
File details
Details for the file gefpy-1.2.3-cp310-cp310-macosx_11_0_arm64.whl
.
File metadata
- Download URL: gefpy-1.2.3-cp310-cp310-macosx_11_0_arm64.whl
- Upload date:
- Size: 6.3 MB
- Tags: CPython 3.10, macOS 11.0+ ARM64
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.9.19
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | ff7663a83a418fd2e4697e1867f382de2c9cd57c651ab56ce512b18e684491ee |
|
MD5 | 32a34c4a43ca47cce72e7dc060c6e516 |
|
BLAKE2b-256 | b4a20f1327689d89bdbedf623d117ae66c2601e9c7eafafed9f5c0bbdc9b974a |
File details
Details for the file gefpy-1.2.3-cp39-cp39-win_amd64.whl
.
File metadata
- Download URL: gefpy-1.2.3-cp39-cp39-win_amd64.whl
- Upload date:
- Size: 17.8 MB
- Tags: CPython 3.9, Windows x86-64
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.9.19
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 19fe03d4dc25a7c051f783466e6141ddf71cd9bcc805bfd644152e001c5e0f5d |
|
MD5 | be1a401199fdb0522e19f4a138ae880a |
|
BLAKE2b-256 | bd269db34190498823c5aa85a6ed4d6b2639d48b65b0b986699d16a491d330de |
File details
Details for the file gefpy-1.2.3-cp39-cp39-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
.
File metadata
- Download URL: gefpy-1.2.3-cp39-cp39-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
- Upload date:
- Size: 24.6 MB
- Tags: CPython 3.9, manylinux: glibc 2.17+ x86-64
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.9.19
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | af93a39948c915ec58b14e88320424048db69e53743be625e5a996a86b115082 |
|
MD5 | 5b778d4d836f2c4abcdd8d3f0c2a19a9 |
|
BLAKE2b-256 | 2e63e85f392b4e73be1402f946bc95f29163a27313acb6af6e7898ed1e2591f1 |
File details
Details for the file gefpy-1.2.3-cp39-cp39-macosx_11_0_arm64.whl
.
File metadata
- Download URL: gefpy-1.2.3-cp39-cp39-macosx_11_0_arm64.whl
- Upload date:
- Size: 6.3 MB
- Tags: CPython 3.9, macOS 11.0+ ARM64
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.9.19
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 31c8cc3309f579ae924db13bb8ed0e5d0afab08de60535d8eb7b83612d59688a |
|
MD5 | c94e20a96d6ab7fa18506cde0afd12b0 |
|
BLAKE2b-256 | f3d91e2f27355400a264cf3118a52a6cd426844d0b6ed616a0ded169d87e2e78 |
File details
Details for the file gefpy-1.2.3-cp38-cp38-win_amd64.whl
.
File metadata
- Download URL: gefpy-1.2.3-cp38-cp38-win_amd64.whl
- Upload date:
- Size: 17.8 MB
- Tags: CPython 3.8, Windows x86-64
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.9.19
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | e44c5ece0d41ed86d1c050bc7dd65db721eb1eae49c93caa964a0ade1b52b1ef |
|
MD5 | 3b2ab1c71815484708d808c16020a4f4 |
|
BLAKE2b-256 | d365bb44ac12705b55e73a97189b96c715c997ec0298321a9b093ce66c8e54c3 |
File details
Details for the file gefpy-1.2.3-cp38-cp38-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
.
File metadata
- Download URL: gefpy-1.2.3-cp38-cp38-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
- Upload date:
- Size: 24.8 MB
- Tags: CPython 3.8, manylinux: glibc 2.17+ x86-64
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.9.19
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | b7b3b968940456f35f61ce068858d89d87abb645673677e89bf75d4d93348e62 |
|
MD5 | 7966e272551d6fd0bce7c0ba4803991a |
|
BLAKE2b-256 | 3948dc3613222a5dc4450f4e6c74a84cc5b021fcb4148512093c207278c1c851 |
File details
Details for the file gefpy-1.2.3-cp38-cp38-macosx_11_0_arm64.whl
.
File metadata
- Download URL: gefpy-1.2.3-cp38-cp38-macosx_11_0_arm64.whl
- Upload date:
- Size: 6.3 MB
- Tags: CPython 3.8, macOS 11.0+ ARM64
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.9.19
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | df8d568714c0af1c528e2b6986443507c0d72abda9dbbfb61d1390da8e429992 |
|
MD5 | 77c45a23fc8e5d5bc936526a60175def |
|
BLAKE2b-256 | eb6684fb31cc501692271be53527c56b7d2f64cac26767d62c642ef4583e1102 |
File details
Details for the file gefpy-1.2.3-cp37-cp37m-win_amd64.whl
.
File metadata
- Download URL: gefpy-1.2.3-cp37-cp37m-win_amd64.whl
- Upload date:
- Size: 17.8 MB
- Tags: CPython 3.7m, Windows x86-64
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.9.19
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 9e5068407774cb718ef68d89621166c8def54dfb87aa9965bddcb27d1c339a4d |
|
MD5 | c297400c104f09d2fb4430ae99967277 |
|
BLAKE2b-256 | d85ebf554a4d2bc66353f6724b75961a23ed61d669cd07689f4c9517cb08da02 |
File details
Details for the file gefpy-1.2.3-cp37-cp37m-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
.
File metadata
- Download URL: gefpy-1.2.3-cp37-cp37m-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
- Upload date:
- Size: 24.7 MB
- Tags: CPython 3.7m, manylinux: glibc 2.17+ x86-64
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.9.19
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 2ed70c96f1a4678e2e5e39f1c587daa29dc12c3a4c87da3d4ca1bac8f786f17c |
|
MD5 | 847973347d01a5d4dcf78af3ba54b744 |
|
BLAKE2b-256 | 1c56b62f0f726fc097b6262ba8292346fea384463f2a68a969e89c5f2da89b50 |