Skip to main content

The aminocode library can be used to encode texts written in natural language in a format based on amino acids

Project description

The aminocode library can be used to encode texts written in natural language in a format based on amino acids. With coding is enable the application of various bioinformatics tools in text mining.

Stand alone tools based on library are available at link <https://sourceforge.net/projects/aminocode>.

Installation

To install aminocode through pip:

pip install aminocode

Tested Platforms

  • Python:

  • 3.7.4

  • Windows (64bits):

  • 10

  • Ubuntu (64bits)

  • 18.04.1 LTS

Required external libraries

  • numpy

  • unidecode

  • biopython

Functions

  • encodetext(text,detailing='')

  • text: natural language text string to be encoded;

  • detailing: details in coding. ‘d’ for details in digits. ‘p’ for details on the punctuation. ‘dp’ or ‘pd’ for both;

  • output: encode string.

  • decodetext(text,detailing='')

  • text: text string encoded using the encodefile function to be decode;

  • detailing: details used in the text to be decoded. ‘d’ for details in digits. ‘p’ for details on the punctuation. ‘dp’ or ‘pd’ for both;

  • output: decode string.

  • encodefile(input_file_name,output_file_name=None,detailing='',header_format='number+originaltext',verbose=False)

  • input_file_name: text file name or _io.TextIOWrapper variable. It can also be used the format that is imported by the Bio.SeqIO library of Biopython, in which case the function will automatically extract the headers to do the encoding;

  • output_file_name: the name for the output file. If not defined, the result will only be returned as a variable;

  • detailing: same as in the encodetext function;

  • header_format: format for the headers of the generated FASTA. It can be ‘number+originaltext’, ‘number’ or ‘originaltext’. ‘number’ is a count of the lines in the input file. Blank lines are considered in the count, but are not added to the FASTA file. ‘originaltext’ is the input text itself;

  • verbose: if True displays progress;

  • output: FASTA variable in Biopython format. If defined output_file_name a file will be saved.

  • decodefile(input_file_name,output_file_name=None,detailing='',verbose=False)

  • input_file_name: file name or variable in the format used by Biopython’s Bio.SeqIO library

  • output_file_name: the name for the output file. If not defined, the result will only be returned as a variable;

  • detailing: same as in the decodetext function;

  • verbose: if True displays progress;

  • output: string list. If defined output_file_name a file will be saved.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

aminocode-1.0.1.tar.gz (3.9 kB view details)

Uploaded Source

Built Distribution

aminocode-1.0.1-py3-none-any.whl (4.5 kB view details)

Uploaded Python 3

File details

Details for the file aminocode-1.0.1.tar.gz.

File metadata

  • Download URL: aminocode-1.0.1.tar.gz
  • Upload date:
  • Size: 3.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.4.0 requests-toolbelt/0.9.1 tqdm/4.36.1 CPython/3.7.4

File hashes

Hashes for aminocode-1.0.1.tar.gz
Algorithm Hash digest
SHA256 69b8272488dc3452b282e296dd3907feefe5ca008c66406bd17491fca04750dc
MD5 80ce176b9c46fc2431a813da14199957
BLAKE2b-256 183bd4e8a3647be003ea7d7afe413a22c4408585c6acf0e9f4ebf187a10146a8

See more details on using hashes here.

File details

Details for the file aminocode-1.0.1-py3-none-any.whl.

File metadata

  • Download URL: aminocode-1.0.1-py3-none-any.whl
  • Upload date:
  • Size: 4.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.4.0 requests-toolbelt/0.9.1 tqdm/4.36.1 CPython/3.7.4

File hashes

Hashes for aminocode-1.0.1-py3-none-any.whl
Algorithm Hash digest
SHA256 60aadf62e237a0344382a87a903d6dbfb37aa48b1e91e6824e8fc5e16255f85f
MD5 aff73447eba4338b0944f5991a3e42ce
BLAKE2b-256 1cb05f9265fb52cf03bea4ad65563b918c6a233d67eaf343b73bd678796c3db9

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page