Skip to main content

Corpus mini-framework allowing for memory-efficient slicing and provides a standardised base corpus structure for the collection of ATAP tools.

Project description

ATAP Corpus

Provides a standardised base Corpus structure for ATAP tools.

Different Corpus can be sliced into subcorpus based on different criterias and will always return an subclass instance of BaseCorpus. The slicing criteria is flexible, it accepts a user defined function and comes with convenience slicing operations layered on top of it out-of-the-box. Subcorpus maintains a parent-child relationship with original corpus in a tree internally.

Corpus can also be serialised and deserialised which can be used to carry across different ATAP analytics notebooks.

pip install atap_corpus

Tests

To run all the unit tests, there is a script you can execute.

./scripts/run_tests.sh

This repo originated from Juxtorpus and is a decoupling effort. Juxtorpus repo may be accessed here.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

atap_corpus-0.1.8.tar.gz (25.4 kB view details)

Uploaded Source

Built Distribution

atap_corpus-0.1.8-py3-none-any.whl (30.5 kB view details)

Uploaded Python 3

File details

Details for the file atap_corpus-0.1.8.tar.gz.

File metadata

  • Download URL: atap_corpus-0.1.8.tar.gz
  • Upload date:
  • Size: 25.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.7.1 CPython/3.10.11 Darwin/22.6.0

File hashes

Hashes for atap_corpus-0.1.8.tar.gz
Algorithm Hash digest
SHA256 16c8673f71511571fac2282692be539a207adbd76bfaba9541076449d65982d8
MD5 d738146f4968a4f59f5879525ce7faff
BLAKE2b-256 2664edcb52b6f5db5c6387ade93ec67fc8aa1b257bc2d8b81e12d6bb09c9cc4b

See more details on using hashes here.

File details

Details for the file atap_corpus-0.1.8-py3-none-any.whl.

File metadata

  • Download URL: atap_corpus-0.1.8-py3-none-any.whl
  • Upload date:
  • Size: 30.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.7.1 CPython/3.10.11 Darwin/22.6.0

File hashes

Hashes for atap_corpus-0.1.8-py3-none-any.whl
Algorithm Hash digest
SHA256 37c48bd63bc2fe562105ef2bb030deb8441d0d6017689bab6f63f8faf938e36d
MD5 9d1ce611cee7dbe4508260b10893df62
BLAKE2b-256 f48d204604766a8e0f68a091bfd5a0e2ba45f2159a1719dbc70e2178c5310759

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page