Skip to main content

Nucleotide A.I.

Project description

ntai

ntai stands for nucleotide (nt) artificial intelligence (A.I.). ntai is a small python library for using fasta sequences with artificial intelligence (A.I.).

Currently there are two main modules that will be of use

  1. Codex, and
  2. bedtools

Codex

Codex is a class for hot-encoding fasta sequences into channels and back. Codex is useful because a character in a fasta sequences can encode multiple nucleotides or even random repeats.

bedtools

bedtools is a function exposing the bedtools library to python. This allows users to extract fasta sequences from a reference genome with writing to / reading from files.

fetch

fetch is a module for fetching the necessary data for using ntai. Currently fetch.utils supports the requests for:

  • fetch_hg38 acquires the fasta for hg38 gzipped

  • fetch_exons acquires Gencode Comprehensive v28 exons in bed format

  • fetch_introns acquires Gencode Comprehensive v28 introns in bed format

  • fetch_hg38_chrom_sizes acquires the chromosome sizes for hg38 in tsv format

  • decompress will decompress a .gz file

The function fetch.fetch_files will get all of these files and decompress hg38 in a specified directory.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ntai-0.0.14.tar.gz (11.1 kB view hashes)

Uploaded Source

Built Distribution

ntai-0.0.14-py3-none-any.whl (16.7 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page