Skip to main content

A thin, pythonic wrapper around geftool.

Project description

gefpy使用文档(gefpy is a thin, pythonic wrapper around geftools, which runs on Python 3 (3.7+).)

bgef_creater_cy模块

  • 对geftools中creat bgef的python接口
  • 相关接口
    .. py:module:: bgef_creater_cy
    
      .. py:class:: BgefCreater(thcnt=8)
    
      .. py:method:: create_bgef(self, strin, bin, strmask, strout)
    
          Create tisscuecut bgef by bgef/gem and mask.
    
          :param strin: raw bgef or bgem
          :param bin: mask binsize
          :param strmask: mask path
          :param strout: out path
    
      .. py:method:: get_stereo_data(self, strin, bin, strmask)
    
          Get tisscuecut stereo data by bgef/gem and mask. 
    
          :param strin: raw bgef or bgem
          :param bin: mask binsize
          :param strmask: mask path
    
          + uniq_cell is list that save all cell, each cell val (exp.x<<32 | exp.y).
          + gene_names is a list of gene names.
          + count is a list that save the midcnt of each expression.
          + cell_index is a list that save the cell idx of each expression.
          + gene_index is a list that records the gene serial number corresponding to each exp.
    
    
          :return: (uniq_cell, gene_names, count, cell_index, gene_index)
    

bgef_reader_cy模块

  • 提供了对gef文件的读取接口
  • 相关接口
      .. py:module:: bgef_reader_cy
    
      .. py:class:: BgefR(filepath, bin_size, n_thread)
    
      .. py:method:: get_expression_num(self)
    
      Get the number of expression.
    
      .. py:method:: get_cell_num(self)
    
      Get the number of cell.
    
      .. py:method:: get_gene_num(self)
    
      Get the number of gene.
    
      .. py:method:: get_gene_names(self)
    
      Get a list of gene names.
    
      .. py:method:: get_cell_names(self)
    
      Get a list of cell ids, each item is (exp.x<<32 | exp.y)
    
      .. py:method:: get_gene_data(self)
    
      Get gene data.
    
      + gene_index is a list that records the gene serial number corresponding to each exp.
      + gene_names is a list of gene names.
    
      :return: (gene_index, gene_names)
    
      .. py:method:: get_expression(self)
    
      Get the all expression from bgef. 
    
      + explist is a list, each item is (x, y, count, exon).
    
      :return: explist
    
      .. py:method:: get_exp_data(self)
    
      Get sparse matrix indexes of expression data.
    
      + uniq_cell is list that save all cell, each cell val (exp.x<<32 | exp.y).
      + cell_index is a list that save the cell idx of each expression.
      + count is a list that save the midcnt of each expression.
    
      :return: (uniq_cell, cell_index, count)
    
      .. py:method:: get_genedata_in_region(self, min_x, max_x, min_y, max_y, key)
    
      Get the explist by the specified gene name in the region.
    
      :param min_x: region minx
      :param max_x: region maxx
      :param min_y: region miny
      :param max_y: region maxy
      :param key: gene name
      :return: explist
    
      .. py:method:: get_offset(self)
    
      Get the offset in bgef.
    
      :return: (minx, miny)
    
      .. py:method:: get_exp_attr(self)
    
      Get the bgef attr.
    
      :return: (minx, miny, maxx, maxy, maxexp, resolution)
    
      .. py:method:: get_filtered_data(self, region, genelist)
    
      Get the filtered data from bgef by region or gene.
    
      :param region: rect region(minx,maxx,miny,maxy)
      :param genelist: gene name list
    
      + uniq_cell is list that save all cell, each cell val (exp.x<<32 | exp.y).
      + gene_names is a list of gene names.
      + count is a list that save the midcnt of each expression.
      + cell_index is a list that save the cell idx of each expression.
      + gene_index is a list that records the gene serial number corresponding to each exp.
    
      :return: (uniq_cell, gene_names, count, cell_index, gene_index)
    

bgef_writer_cy模块

  • bgef文件的写入功能的python接口
  • 相关接口
      .. py:module:: bgef_writer_cy
    
      .. py:function:: generate_bgef(input_file, bgef_file, stromics="Transcriptomics", n_thread = 8, bin_sizes = None, region = None)
    
      Function to generate common bin GEF file(.bgef).
    
      :param input_file:  The input file path of gem file or bin1 bgef.
      :param bgef_file:   Output BGEF filepath.
      :param stromics:    input the omics.
      :param n_thread:    Number of thread, default 8
      :param bin_sizes:   A list of bin sizes, default: 1,10,20,50,100,200,500
      :param region:      A list of region (minX, maxX, minY, maxY)
    
      .. py:function:: gem2tif(gempath, tif_path)
    
      Function to generate tif file by GEM file(.gem & .gem.gz).
    
      :param gempath:  The input file path of gem file.
      :param tif_path:   Output tiff filepath.
    

cgef_adjust_cy模块

  • 该模块提供了一些函数,包含lasso、mid count filter、渲染热图采样等功能,主要是stereo map使用
  • 相关接口
      .. py:module:: cgef_adjust_cy
    
      .. py:class:: CgefAdjust()
    
      .. py:method:: get_cell_data(self, bgef, cgef)
    
          Get raw cell data from cgef and bgef file.
    
          :param bgef: the bgef file path
          :param cgef: the cgef file path
          :returns: (genelist, vec_cell)
    
      .. py:method:: write_cgef_adjustdata(self, path, celldata, dnbdata)
    
          write the adjust cell data to cgef
    
          :param path: set the Output path
          :param celldata: input the cell data
          :param dandata: input the dandata
          
      .. py:method:: create_Region_Bgef(self, inpath, outpath, pos)
    
          generate spatial bin gef file by lasso region datas
    
          :param inpath: the bgef file path
          :param outpath: set the Output path
          :param pos: lasso region datas
    
      .. py:method:: create_Region_Cgef(self, inpath, outpath, pos)
    
          generate cell bin gef file by lasso region datas
    
          :param inpath: the cgef file path
          :param outpath: set the Output path
          :param pos: lasso region datas
    
      .. py:method:: get_regiondata_frombgef(self, inpath, bin, thcnt, pos)
    
          Get gene info from spatial bin gef file by lasso region datas
    
          :param inpath: the bgef file path
          :param bin: set bin size
          :param thcnt: thread counts
          :param pos: lasso region datas
          :returns vecdata: gene info{genecnt,midcnt,x,y} in region
    
      .. py:method:: get_regiondata_fromcgef(self, input_path, pos)
    
          Get cell statistical info from cell bin gef file by lasso region datas
    
          :param input_path: the cgef file path
          :param pos: lasso region datas
          :returns vecdata: statistical info{cell_count,total_area,average_gene_count,average_exp_count,average_dnb_count,average_area,median_gene_count,median_exp_count,median_dnb_count,median_area} in region
    
      .. py:method:: get_multilabel_regiondata_bgef(self, inpath, pos, bin=1, thcnt=4)
    
          The gene name and MIDcount of multiple labels are returned after the lasso
    
          :param inpath: the input bgef file path
          :param pos: lasso region datas(contain multi labels)
          :param bin: binsize
          :param thcnt: thread count
          :returns region_data, total_mid: region_data(gene_name, MIDcount), total midcount in region
    
      .. py:method:: get_multilabel_regiondata_cgef(self, inpath, pos)
    
          The gene name and MIDcount of multiple labels are returned after the lasso
    
          :param inpath: the input cgef file path
          :param pos: lasso region datas(contain multi labels)
          :returns vecdata, total_data: vecdata(cluster_id, mid_cnt, area, cell_id, x, y), total_data(cluster_id, mid_cnt, area, cell_id)
    
      .. py:method:: get_position_by_clusterid(self, inpath, clusterid)
    
          Get position value(x, y) by cluster id from h5ad file
    
          :param inpath: the input h5ad file
          :param clusterid: input cluster id need to get position
          :returns region_data: position value(x, y)
    
      .. py:method:: generate_filter_bgef_by_midcnt(self, inpath, outpath, binsize, filter_data, only_filter=False)
    
          generate complete bgef file by gene&protein mid count value
    
          :param inpath: input bgef file
          :param outpath: output bgef file
          :param binsize: current binsize
          :param only_filter: generate bgef only have filter gene&protein
          :param filter_data: filter gene&protein name and mid count
          :returns ret: generate result
    
      .. py:method:: get_filter_bgef_process_rate(self)
    
          Get generate process rate, Must be used in conjunction with the generate_filter_bgef_by_midcnt
    
          :returns ret: current process rate
          
      .. py:method:: generate_bgef_by_lasso(self, inpath, outpath, pos)
    
          generate complete bgef file by lasso region datas
    
          :param inpath: the input bgef file path
          :param outpath: the generate bgef file path
          :param pos: lasso region datas
          :returns ret: generate result
    
      .. py:method:: get_lasso_bgef_process_rate(self)
    
          Get generate process rate, Must be used in conjunction with the generate_bgef_by_lasso
    
          :returns ret: current process rate
    
      .. py:method:: generate_bgef_by_coordinate(self, inpath, outpath, cord, bin_size)
    
          generate bgef file by input coordinate in lasso region
    
          :param inpath: the bgef file path
          :param inpath: the bgef file path
          :param cord: lasso region datas
          :param bin_size: input binsize
          :returns ret: generate result
    
      .. py:method:: generate_cgef_by_coordinate(self, inpath, outpath, cord)
    
          generate cell bin gef file by coordinate in lasso region
    
          :param inpath: the input cgef file path
          :param outpath: the output cgef file path
          :param cord: lasso region datas
          :returns ret: generate result
    

cgef_reader模块

  • 提供了对cgef文件的读取功能
  • 相关接口
      .. py:module:: cgef_reader_cy
    
      .. py:class:: CgefR(filepath)
    
      .. py:method:: get_expression_num(self)
    
          Get the number of expression.
    
      .. py:method:: get_cell_num(self)
    
          Get the number of cell.
    
      .. py:method:: get_gene_num(self)
    
          Get the number of gene.
    
      .. py:method:: get_gene_names(self)
    
          Get a list of gene names. The type of gene name is 32 chars.
    
      .. py:method:: get_cell_names(self)
    
          Get an array of cell ids. Each cell id is (cell.x <<32 | cell.y)
    
      .. py:method:: get_cells(self)
    
          Get cells, each cell include (id, x, y, offset, geneCount, expCount, dnbCount, area, cellTypeID, clusterID)
    
      .. py:method:: get_genes(self)
    
          Get genes, each gene include(geneName, offset, cellCount, expCount, maxMIDcount)
    
      .. py:method:: get_cellid_and_count(self)
    
          Get the count of each cell in each gene.
    
          :return:  (cell_id, count)
    
      .. py:method:: get_geneid_and_count(self)
    
          Get the count of each gene in each cell.
          
          :return:  (gene_id, count)
    
      .. py:method:: get_cellborders(self)
    
          Gets cell borders.
          
          :return: borders_list
    
      .. py:method:: get_filtered_data(self, region, genelist)
    
          Get the filtered data from cgef by region or gene.
    
          :param region: rect region(minx,maxx,miny,maxy)
          :param genelist: gene name list
    
          + uniq_cell is list that save all cell, each cell val (exp.x<<32 | exp.y).
          + gene_names is a list of gene names.
          + count is a list that save the midcnt of each expression.
          + cell_index is a list that save the cell idx of each expression.
          + gene_index is a list that records the gene serial number corresponding to each exp.
    
          :return: (uniq_cell, gene_names, count, cell_index, gene_index)
    

cgef_writer_cy模块

  • 提供对cgef文件的写入
  • 相关接口
      .. py:module:: cgef_writer_cy
    
      .. py:function:: generate_cgef(cgef_file, bgef_file, mask_file, block_size: list)
    
      Generate cell bin GEF file from bgef + mask.
    
      :param cgef_file: Output CGEF filepath.
      :param bgef_file: Input BGEF filepath.
      :param mask_file: Input make filepath.
      :param block_size: Block size list, usually set to [256,256].
    
      .. py:function:: cgem_to_cgef(cgem_file, outpath, block_size: list)
    
      Generate cell bin GEF file from cgem.
    
      :param cgem_file: Input cgem path.
      :param outpath: Output cgef path.
      :param block_size: Block size list,  usually set to [256,256].
    

gef_to_gem_cy模块

  • 提供gem文件转gef文件的功能
  • 相关接口
      .. py:module:: gef_to_gem_cy
    
      .. py:class:: GefToGem(strout, strsn, boutexon)
    
      .. py:method:: bgef2gem(self, strbgef, binsize)
    
          Create bgem file by bgef.
    
          :param strbgef: the bgef file path
          :param binsize: set the binsize
    
      .. py:method:: cgef2gem(self, strbgef, binsize)
    
          Create cgem file by cgef and bgef.
    
          :param strcgef: the cgef file path
          :param strbgef: the bgef file path
    
      .. py:method:: bgef2cgem(self, strmask, strbgef)
    
          Create cgem file by mask and bgef.
    
          :param strcgef: the mask file path
          :param strbgef: the bgef file path
    

API(also can see here)

See here.

Reporting bugs

Open a bug at https://github.com/STOmics/gefpy/issues.

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

gefpy-1.2.6-cp310-cp310-win_amd64.whl (17.9 MB view details)

Uploaded CPython 3.10Windows x86-64

gefpy-1.2.6-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (25.2 MB view details)

Uploaded CPython 3.10manylinux: glibc 2.17+ x86-64

gefpy-1.2.6-cp310-cp310-macosx_11_0_arm64.whl (6.3 MB view details)

Uploaded CPython 3.10macOS 11.0+ ARM64

gefpy-1.2.6-cp39-cp39-win_amd64.whl (17.9 MB view details)

Uploaded CPython 3.9Windows x86-64

gefpy-1.2.6-cp39-cp39-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (25.3 MB view details)

Uploaded CPython 3.9manylinux: glibc 2.17+ x86-64

gefpy-1.2.6-cp39-cp39-macosx_11_0_arm64.whl (6.3 MB view details)

Uploaded CPython 3.9macOS 11.0+ ARM64

gefpy-1.2.6-cp38-cp38-win_amd64.whl (17.9 MB view details)

Uploaded CPython 3.8Windows x86-64

gefpy-1.2.6-cp38-cp38-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (25.4 MB view details)

Uploaded CPython 3.8manylinux: glibc 2.17+ x86-64

gefpy-1.2.6-cp38-cp38-macosx_11_0_arm64.whl (6.3 MB view details)

Uploaded CPython 3.8macOS 11.0+ ARM64

gefpy-1.2.6-cp37-cp37m-win_amd64.whl (17.9 MB view details)

Uploaded CPython 3.7mWindows x86-64

gefpy-1.2.6-cp37-cp37m-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (25.4 MB view details)

Uploaded CPython 3.7mmanylinux: glibc 2.17+ x86-64

File details

Details for the file gefpy-1.2.6-cp310-cp310-win_amd64.whl.

File metadata

  • Download URL: gefpy-1.2.6-cp310-cp310-win_amd64.whl
  • Upload date:
  • Size: 17.9 MB
  • Tags: CPython 3.10, Windows x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.8.20

File hashes

Hashes for gefpy-1.2.6-cp310-cp310-win_amd64.whl
Algorithm Hash digest
SHA256 13bd42d83feba95eeda9e8d08ebad2ce84ab9b25d9a69ceb0b4dacce60550b4b
MD5 bc2832c899e4b24f7fc1b31d6b24ff52
BLAKE2b-256 a63e32c38b4ae19514e2e95d1c3d041c399c21cc5fb5fd9d62c2aa924828e1aa

See more details on using hashes here.

File details

Details for the file gefpy-1.2.6-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for gefpy-1.2.6-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 7a3f5031811c889f4fccdd59435bc3a2eb1624d0345db10a1a99a24bcb7d994c
MD5 9214b93022b38d6adca63eb6beecd1a2
BLAKE2b-256 140c7bdbd1ac37522d7392ae43fa252fa373ac3cd4ec2d99dd417d761ff4276c

See more details on using hashes here.

File details

Details for the file gefpy-1.2.6-cp310-cp310-macosx_11_0_arm64.whl.

File metadata

File hashes

Hashes for gefpy-1.2.6-cp310-cp310-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 495d2c0940b755469ab3aa79438d38e33576da756fd1a56b41c1fe2f1f18e02b
MD5 caf2aaf32963cde1dd4752dd956de1b5
BLAKE2b-256 9dcd339ebe37cfbe138b12d587a7924fd045e684aa59b74d31113f4d5c74b0e7

See more details on using hashes here.

File details

Details for the file gefpy-1.2.6-cp39-cp39-win_amd64.whl.

File metadata

  • Download URL: gefpy-1.2.6-cp39-cp39-win_amd64.whl
  • Upload date:
  • Size: 17.9 MB
  • Tags: CPython 3.9, Windows x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.8.20

File hashes

Hashes for gefpy-1.2.6-cp39-cp39-win_amd64.whl
Algorithm Hash digest
SHA256 974dcadc6bb5947d4d5c27aee732e7cf9574be0a56c2482c377ea8660bf575c8
MD5 6ac199af55cf914fdc0f82c25551eb26
BLAKE2b-256 93833ccfa4946a3ff3abb75da3c799704d93c6e860ec76e4b58d9636e7c0c3b1

See more details on using hashes here.

File details

Details for the file gefpy-1.2.6-cp39-cp39-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for gefpy-1.2.6-cp39-cp39-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 8f6939cd038eba20630c9230d91f6db8407c27824c50414749ab945bb2d1b283
MD5 dc8c4a6aecf563c8111e1dd4ad2e923c
BLAKE2b-256 8c361063d065b20836debe363f92f84e3d288d5c9446e5b430237d4817b3a62d

See more details on using hashes here.

File details

Details for the file gefpy-1.2.6-cp39-cp39-macosx_11_0_arm64.whl.

File metadata

File hashes

Hashes for gefpy-1.2.6-cp39-cp39-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 7fb67fbd6e86f85bde4f2a33ef86fdc457e3f0618ec72ff40d65f678d34833f7
MD5 b3439515dc425526c8a01b4f2a4d6c02
BLAKE2b-256 c9223e3d9d3f3c1667543631c4096ad1f9ea976b20d1e20c7d10fe678e92a3c4

See more details on using hashes here.

File details

Details for the file gefpy-1.2.6-cp38-cp38-win_amd64.whl.

File metadata

  • Download URL: gefpy-1.2.6-cp38-cp38-win_amd64.whl
  • Upload date:
  • Size: 17.9 MB
  • Tags: CPython 3.8, Windows x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.8.20

File hashes

Hashes for gefpy-1.2.6-cp38-cp38-win_amd64.whl
Algorithm Hash digest
SHA256 df9b9ad445906917549bec05941d49d5f4ffdddbf1f31c42c35e105735bf547d
MD5 53fe0a3915de3658b8ec64fbfdf52dca
BLAKE2b-256 c77a8c46d8814a968ae1caa612fb6c35e32360f3f80318d1e80bc72462b8a484

See more details on using hashes here.

File details

Details for the file gefpy-1.2.6-cp38-cp38-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for gefpy-1.2.6-cp38-cp38-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 64dda524bda4ea0a9a446b00a6f4fff2bb31dfb20378d5e478daa23aeb4b1d2a
MD5 9beeb5c0ab60765ad0eba35195653f2c
BLAKE2b-256 b2f43e67a76acd7a218f30c32f9afaa0abefe01ec4eb4fe2c6618cc7bbcee4ac

See more details on using hashes here.

File details

Details for the file gefpy-1.2.6-cp38-cp38-macosx_11_0_arm64.whl.

File metadata

File hashes

Hashes for gefpy-1.2.6-cp38-cp38-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 26762321b6bfcee8f5ad2474ff20ee1c8bec237f7d667491c919b326ff2ba79a
MD5 510553763ab7359b0720345ab87f17a5
BLAKE2b-256 9b0782594a49d83fbf27869c4c28f6d93cd5ed41c6abb0b0c15fd3001662f2c4

See more details on using hashes here.

File details

Details for the file gefpy-1.2.6-cp37-cp37m-win_amd64.whl.

File metadata

  • Download URL: gefpy-1.2.6-cp37-cp37m-win_amd64.whl
  • Upload date:
  • Size: 17.9 MB
  • Tags: CPython 3.7m, Windows x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.8.20

File hashes

Hashes for gefpy-1.2.6-cp37-cp37m-win_amd64.whl
Algorithm Hash digest
SHA256 7843020b85141c129c626507c89b68adcb56e3254f94150a2cb24217138a20ae
MD5 0e67031c2c77057487bc5d6bbd866bb2
BLAKE2b-256 96c31372b468d84060c976fd0d5234d6b9eeeb42f63326d053da52b0bde1c9ba

See more details on using hashes here.

File details

Details for the file gefpy-1.2.6-cp37-cp37m-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for gefpy-1.2.6-cp37-cp37m-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 f756b9d907fd9517d06f9c4952e6e7d9b1f6279a386f3c01508b24c436dfa430
MD5 33922427dc1734bf8e13b6e9a2a18778
BLAKE2b-256 166e0642131dcd5cf5e5d37c1953c264e2ef60c868e5ccb5a4a3854ffd71d148

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page