Skip to main content

Incenp.org’s utilities for computational biology

Project description

Incenp.Bioutils - Incenp.org’s utilities for computational biology

Incenp.Bioutils is a set of command line utilities and helper Python modules to deal with some computational biology tasks.

Utilities

SeqTool

SeqTool is intended as a “Swiss Army knife” to perform various operations on biological sequence files.

Among other things, it provides wrappers for the blast and dotter alignment tools; the main interest of those wrappers is that they can be used with sequences in any format supported by Biopython’s SeqIO module, whereas the original tools only read files in the FASTA format.

SeqVault

SeqVault is a command-line interface to BioSQL databases.

SeqVault is intended to be used with a slightly modified version of the BioSQL database schema (provided in the biosql directory), where every biodatabase is associated with a 3-letters prefix. That prefix is then used to automatically assign accession numbers (of the form PRE_xxxxxx where PRE is the prefix) when importing sequences into the database. However SeqVault can also be used with pristine BioSQL databases.

Quickstart (example with PostgreSQL)

Create a new PostgreSQL user account and a new database::

# createuser <username>
# createdb -O owner <username> <dbname>

Initialize the newly created database by running the provided biosql/biosqldb-pg.sql script::

$ psql -h localhost -U <username> <dbname> < biosql/biosqldb-pg.sql

Next, create a plaintext file in $XDG_CONFIG_HOME/seqvault/seqvault.rc and fill it as follows::

[Server]
host: localhost
user: <username>
password: <PostgreSQL user password>
database: <dbname>

SeqVault is now ready to be used.

cc3d-runner

CC3d-Runner is a small wrapper around the command-line interface of CompuCell3D, to run headless simulations.

Helper modules

The main helper module is the incenp.bio.seq.usa module, which provides an implementation of the [Uniform Sequence Address format] (emboss.sourceforge.net/docs/themes/UniformSequenceAddress.html).

A Uniform Sequence Address or USA is a way to refer to a biological sequence and to fetch that sequence from its source.

The following example shows how to read a sequence specified by a USA::

from incenp.bio.seq import usa

records = usa.read_usa('genbank::file.gb:acc:X65923')

The returned records object is an array of Bio.SeqRecord.SeqRecord objects.

Prerequisites

The following Python packages are necessary:

ReportLab is required to run the plasmm subcommand of the seqtool utility.

The seqvault utility requires a SQL adapter for the database server you plan to use it with, such as psycopg2 for PostgreSQL.

Copying

Incenp.Bioutils is distributed under the terms of the GNU General Public License, version 3 or higher. The full license is included in the COPYING file of the source distribution.

Homepage and repository

The project is located at https://incenp.org/dvlpt/bioutils.html. The source code is available in a Git repository at https://git.incenp.org/damien/bioutils.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

incenp.bioutils-0.1.1.tar.gz (57.2 kB view details)

Uploaded Source

Built Distribution

incenp.bioutils-0.1.1-py3-none-any.whl (37.9 kB view details)

Uploaded Python 3

File details

Details for the file incenp.bioutils-0.1.1.tar.gz.

File metadata

  • Download URL: incenp.bioutils-0.1.1.tar.gz
  • Upload date:
  • Size: 57.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.23.0 setuptools/47.1.1 requests-toolbelt/0.9.1 tqdm/4.46.1 CPython/3.8.3

File hashes

Hashes for incenp.bioutils-0.1.1.tar.gz
Algorithm Hash digest
SHA256 99e5e05f7244f8f01fe6d788a95b2703612f23122203068f4b2e760dd2a8b5ed
MD5 b172c627b78c8465ef0a1fa1a0b7fde7
BLAKE2b-256 01fa29f20dc91d1b99a5c79d8ad643f7522a1cb0bd10875e61e9c3198039f2de

See more details on using hashes here.

Provenance

File details

Details for the file incenp.bioutils-0.1.1-py3-none-any.whl.

File metadata

  • Download URL: incenp.bioutils-0.1.1-py3-none-any.whl
  • Upload date:
  • Size: 37.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.23.0 setuptools/47.1.1 requests-toolbelt/0.9.1 tqdm/4.46.1 CPython/3.8.3

File hashes

Hashes for incenp.bioutils-0.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 177bbb61f8b4ff832c0596d7be1a14c076a86772354f01c03241a0dfd7dab3b6
MD5 c8123abda942766f8831a035d5840749
BLAKE2b-256 e96d42b4530cbd5c747d70c509e9c3fc8ff42c41e431327ee49291bd2d87a7ee

See more details on using hashes here.

Provenance

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page