Skip to main content

Kyoto Encylopedia of Genes and Genomes Markup Language File parser and converter

Project description

KNeXT downloads and parses Kyoto Encylopedia of Genes and Genomes (KEGG) markup language files (KGML). This tool employs NetworkX’s framework to create gene-only networks, but mixed (gene, compound, pathway) networks can also be generated. All output files are in TSV format. KNeXT also retrieves a TXT file of node x-y axis coordinates for use in NetworkX’s graph visualization library, and it is able to convert KEGG IDs into Uniprot and NCBI IDs. KNeXT also maximizes metadata information through preserving each edge’s information.

Usage

Primary line: knext get-kgml [SPECIES_NAME]

  KEGG NetworkX Topological (KNeXT) parser uses the KEGG
  API to gather all KGML files for a single species.
  Input species name in 3 to 4 letter KEGG organism code.

Options:
  --help,   shows options and website for KEGG organism codes
  -d/--d,   directory in which to save output

Primary line: knext genes [Input]

  KNeXT parser deploy's NetworkX's
  framework to create gene-only representations of KGML files.
  Genes between compounds are propagated before compounds are dropped.

Options:
  Input     KGML file or folder of KGML files to parse
  -r/--results      file or folder where output should be stored
  -g/--graphics     outputs TXT file of x-y axis coordinates
  -u/--unique       TSV file's genes have a terminal modifier
  --help    shows options and file types

Primary line: knext mixed [Input]

  KNeXT parser creates mixed (genes, compounds, pathways)
  representations of KGML files.

Options:
  Input     KGML file or folder of KGML files to parse
  -r/--results      file or folder where output should be stored
  -g/--graphics     outputs TXT file of x-y axis coordinates
  -u/--unique       TSV file's genes have a terminal modifier
  --help    shows options and file types

Primary line: knext convert [OPTIONS]

  KNeXT parser converts KEGG entry IDs in TSV output files into
  UniProt or NCBI IDs.

Options:
  file      PATH:   path to TSV file
  species   TEXT:   KEGG 3 to 4 letter organism code
  --uniprot optional flag for output:       use if UniProt IDs are the desired output
  --unique  optional flag for output:       use if the TSV file has terminal modifiers
  --graphics        PATH:   graphics file
  --help    optional flag:  shows options

Options:
  folder    PATH:   path to folder containing TSV files
  species   TEXT:   KEGG 3 to 4 letter organism code
  --uniprot optional flag for output:         use if UniProt IDs are the desired output
  --unique  optional flag for output:         use if the TSV file has terminal modifiers
  --graphics        PATH:       path to folder containing graphics files
  --help    optional flag:            shows options

For example, KNeXT can obtain all KGML files for Homo sapiens:

$ knext get-kgml hsa

The resulting output folder can be used to parse the files:

$ knext genes folder kgml_hsa --graphics

The resulting output folder can be used to convert the TSV files and graphics file:

$ knext convert folder kegg_gene_network_hsa hsa --graphics kegg_gene_network_hsa

Inputs

KNeXT only accepts KGML files downloaded from KEGG

The output of which can be used in successive commands. All input formats must be in TSV format. Column names are mandatory and should not be changed.

Data Frames

Example TSV file with KEGG ID’s

entry1

entry2

type

value

name

hsa:100271927-98

hsa:22800-12

PPrel

–>

activation

hsa:100271927-98

hsa:22808-12

PPrel

–>

activation

hsa:100271927-98

hsa:3265-12

PPrel

–>

activation

Example TSV file for uniprot conversion with –unique output

entry1

entry2

type

value

name

Q9Y243-23

O15111-59

PPrel

–>

activation

Q9Y243-23

Q6GYQ0-240

PPrel,PPrel

|,+p

inhibition,phosphorylation

Q9Y243-23

O14920-59

PPrel

–>

activation

Installation

The current release is v1.1.0 Installation is via pip:

$ pip install https://github.com/everest/knext/knext-1.0.0.tar.gz

Repo can be downloaded and installed through poetry:

$ git clone https://github.com/everest/knext.git
$ cd knext
$ poetry shell
$ poetry install
$ poetry run knext [get-kgml, genes, mixed, or convert]

Requirements

Requirements are (also see pyproject.toml):

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

knext-1.1.1.tar.gz (17.2 kB view details)

Uploaded Source

Built Distribution

knext-1.1.1-py3-none-any.whl (17.9 kB view details)

Uploaded Python 3

File details

Details for the file knext-1.1.1.tar.gz.

File metadata

  • Download URL: knext-1.1.1.tar.gz
  • Upload date:
  • Size: 17.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.3.2 CPython/3.9.12 Linux/5.15.0-75-generic

File hashes

Hashes for knext-1.1.1.tar.gz
Algorithm Hash digest
SHA256 9e608b2dc291a6a41f420e33468e465e90b801d44434b8e9515a3033a9f1a017
MD5 cb89695da5307c1e0188897ead5c3bec
BLAKE2b-256 ba2370f54aa7d7493cda8f4e8fb20241f0a225c70b9036673e250b9d685d81e1

See more details on using hashes here.

File details

Details for the file knext-1.1.1-py3-none-any.whl.

File metadata

  • Download URL: knext-1.1.1-py3-none-any.whl
  • Upload date:
  • Size: 17.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.3.2 CPython/3.9.12 Linux/5.15.0-75-generic

File hashes

Hashes for knext-1.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 f04391d90d59745b24e2f8bb5ea108669661f61636722f6dbea34ba20b42e549
MD5 93378b0963049dab62f8dec7531eee53
BLAKE2b-256 4dde0ad8c8696d3dc29efb815baad8bdc0baf164ca1d26a7acf87dd6f9dc7109

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page