Add your description here
Project description
hugo-unifier
This python package can unify gene symbols based on the HUGO database.
Installation
The package can be installed via pip, or any other Python package manager.
pip install hugo-unifier
Usage
The package can be used both as a command line tool and as a library.
Command Line Tool
Currently, the command line tool only supports unifying the entries of a column in an AnnData objects var attribute. The input file and column name must be passed as an argument. The tool will update the column in place and save the AnnData object to a new file.
Check the help message for more information:
hugo-unifier --help
Library
The package can be used as a library to unify gene symbols in a pandas DataFrame. The unify function takes a list of gene symbols and returns a list of unified gene symbols. The function can be used as follows:
from hugo_unifier import unify
gene_symbols = ["TP53", "BRCA1", "EGFR"]
unified_symbols = unify(gene_symbols)
print(unified_symbols)
How it works
Different datasets sometimes use different gene symbols for the same gene. Sometimes, the same gene symbol occurs
with slight modifications, such as dashes, underscores, or other characters. The hugo-unifier iteratively applies attempts to manipulate the gene symbols and check them against the HUGO database.
The following manipulations are applied in the following order:
identity: Use the gene symbol as is.dot-to-dash: Replace dots with dashes.discard-after-dot: Discard everything after the first dot.
More conservative manipulations are applied first. The first manipulation that returns a valid gene symbol is used.
Resolution of aliases
Documentation for this will be added soon.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file hugo_unifier-0.1.1.tar.gz.
File metadata
- Download URL: hugo_unifier-0.1.1.tar.gz
- Upload date:
- Size: 47.2 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.12.9
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
bb29abc80d67996d0ba3dd792ac42b7af003aa882015fb95a7c5d2dedca6e0bb
|
|
| MD5 |
e172ec36cec053ca4c15233c15a4eccf
|
|
| BLAKE2b-256 |
36c2b576ce16aac47bf56c8dccdbbaa7a2a23c3ebde1f5157fe1eca4a0c61d32
|
Provenance
The following attestation bundles were made for hugo_unifier-0.1.1.tar.gz:
Publisher:
ci.yml on Mye-InfoBank/hugo-unifier
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
hugo_unifier-0.1.1.tar.gz -
Subject digest:
bb29abc80d67996d0ba3dd792ac42b7af003aa882015fb95a7c5d2dedca6e0bb - Sigstore transparency entry: 193758178
- Sigstore integration time:
-
Permalink:
Mye-InfoBank/hugo-unifier@6905722c9914ac561e05f0568c830bf15eb7b492 -
Branch / Tag:
refs/tags/v0.1.1 - Owner: https://github.com/Mye-InfoBank
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
ci.yml@6905722c9914ac561e05f0568c830bf15eb7b492 -
Trigger Event:
release
-
Statement type:
File details
Details for the file hugo_unifier-0.1.1-py3-none-any.whl.
File metadata
- Download URL: hugo_unifier-0.1.1-py3-none-any.whl
- Upload date:
- Size: 6.9 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.12.9
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
b38f98f526a42df2bfee0715cdb101f40324fb656570df72a79e1205544f4c42
|
|
| MD5 |
7d4feadfc26f24250fa2beaad9a76932
|
|
| BLAKE2b-256 |
12854ff4f9fa7dd2d7908943716bd814586a35639c5235c81fb3c60707712b34
|
Provenance
The following attestation bundles were made for hugo_unifier-0.1.1-py3-none-any.whl:
Publisher:
ci.yml on Mye-InfoBank/hugo-unifier
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
hugo_unifier-0.1.1-py3-none-any.whl -
Subject digest:
b38f98f526a42df2bfee0715cdb101f40324fb656570df72a79e1205544f4c42 - Sigstore transparency entry: 193758180
- Sigstore integration time:
-
Permalink:
Mye-InfoBank/hugo-unifier@6905722c9914ac561e05f0568c830bf15eb7b492 -
Branch / Tag:
refs/tags/v0.1.1 - Owner: https://github.com/Mye-InfoBank
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
ci.yml@6905722c9914ac561e05f0568c830bf15eb7b492 -
Trigger Event:
release
-
Statement type: