Importing and querying HGNC data
Project description
PyHGNC is a Python package to access and query data provided by HGNC-approved gene nomenclature, gene families and associated resources including links to genomic, proteomic and phenotypic information.
Data are installed in a (local or remote) RDBMS enabling bioinformatic algorithms very fast response times to sophisticated queries and high flexibility by using SOLAlchemy database layer.
PyHGNC is developed by the Department of Bioinformatics at the Fraunhofer Institute for Algorithms and Scientific Computing SCAI For more in for information about PyHGNC go to the documentation.
This development is supported by following IMI projects:
Supported databases
PyHGNC uses SQLAlchemy to cover a wide spectrum of RDMSs (Relational database management system). For best performance MySQL or MariaDB is recommended. But if you have no possibility to install software on your system SQLite - which needs no further installation - also works. Following RDMSs are supported (by SQLAlchemy):
Getting Started
This is a quick start tutorial for impatient.
Installation
PyHGNC can be installed with pip.
pip install pyhgnc
If you fail because you have no rights to install use superuser (sudo on Linux before the commend) or …
pip install --user pyhgnc
If you want to make sure you are installing this under python3 use …
python3 -m pip install pyhgnc
SQLite
If you don’t know what all that means skip the section MySQL/MariaDB setup.
Don’t worry! You can always later change the configuration. For more information about changing database system later go to the subtitle Changing database configuration Changing database configuration in the documentation on readthedocs.
MySQL/MariaDB setup
Log in MySQL as root user and create a new database, create a user, assign the rights and flush privileges.
CREATE DATABASE pyhgnc CHARACTER SET utf8 COLLATE utf8_general_ci;
GRANT ALL PRIVILEGES ON pyhgnc.* TO 'pyhgnc_user'@'%' IDENTIFIED BY 'pyhgnc_passwd';
FLUSH PRIVILEGES;
There are two options to set the MySQL/MariaDB.
The simplest is to start the command line tool
pyhgnc mysql
You will be guided with input prompts. Accept the default value in squared brackets with RETURN. You will see something like this
server name/ IP address database is hosted [localhost]:
MySQL/MariaDB user [pyhgnc_user]:
MySQL/MariaDB password [pyhgnc_passwd]:
database name [pyhgnc]:
character set [utf8]:
Connection will be tested and in case of success return Connection was successful. Otherwise you will see following hint
Test was NOT successful
Please use one of the following connection schemas
MySQL/MariaDB (strongly recommended):
mysql+pymysql://user:passwd@localhost/database?charset=utf8
PostgreSQL:
postgresql://user:passwd@localhost/database
MsSQL (pyodbc needed):
mssql+pyodbc://user:passwd@database
SQLite (always works):
- Linux:
sqlite:////absolute/path/to/database.db
- Windows:
sqlite:///C:\absolute\path\to\database.db
Oracle:
oracle://user:passwd@localhost:1521/database
2. The second option is to start a python shell and set the MySQL configuration. If you have not changed anything in the SQL statements above …
import pyhgnc
pyhgnc.set_mysql_connection()
If you have used you own settings, please adapt the following command to you requirements.
import pyhgnc
pyhgnc.set_mysql_connection(host='localhost', user='pyhgnc_user', passwd='pyhgnc_passwd', db='pyhgnc')
Updating
The updating process will download the complete HGNC json file and the HCOP file.
import pyhgnc
pyhgnc.manager.database.update()
This will use either the default connection settings of PyHGNC or the settings defined by the user. It is also possible to run the update process from shell.
pyhgnc update
Quick start with query functions
Initialize the query object
query = pyhgnc.query()
Get all HGNC entries:
all_entries = query.hgnc()
More information
See the installation documentation for more advanced
instructions. Also, check the change log at CHANGELOG.rst
.
HGNC tools
HGNC provides also online tools .
Links
HUGO Gene Nomenclature Committee (HGNC)
PyHGNC
Documented on Read the Docs
Versioned on GitHub
Tested on Travis CI
Distributed by PyPI
Chat on Gitter
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distributions
File details
Details for the file PyHGNC-0.2.4.tar.gz
.
File metadata
- Download URL: PyHGNC-0.2.4.tar.gz
- Upload date:
- Size: 494.4 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | fc25abf467263b79e458263a3a638f8d401034ab85113fe1c0b68f47f49ce6be |
|
MD5 | 1b7b749210cbc1cad60bde98ca6d2e33 |
|
BLAKE2b-256 | c14b748d767fb133b016ce603987595279612d941e300516b4c489fb481c5a53 |
File details
Details for the file PyHGNC-0.2.4-py3-none-any.whl
.
File metadata
- Download URL: PyHGNC-0.2.4-py3-none-any.whl
- Upload date:
- Size: 80.8 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 39787827b3f88c1abbce2554f0e9667cc891b01a33861124cd2bce0432ba9135 |
|
MD5 | 934ec59aec17524e7732332c0f8d2de5 |
|
BLAKE2b-256 | 83ba3ed68badfbfb47f73a918d4d32b17acaca805e462fb662742ff749e92331 |
File details
Details for the file PyHGNC-0.2.4-py2-none-any.whl
.
File metadata
- Download URL: PyHGNC-0.2.4-py2-none-any.whl
- Upload date:
- Size: 80.8 kB
- Tags: Python 2
- Uploaded using Trusted Publishing? No
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | c50374df4100cf15d61f94e672a5077da6023b9cf9a16f114d0f10248b4160b5 |
|
MD5 | 4fb64ac1ede81f0d916d98dec6962099 |
|
BLAKE2b-256 | 00d365432064ab03d459243bf2b3ce0ae40c9d00bd8d59e8dd2368c69b90df27 |