Skip to main content

TextBlob Parser for Swarmauri.

Project description

Swarmauri Logo

PyPI - Downloads Hits PyPI - Python Version PyPI - License PyPI - swarmauri_parser_textblob


Swarmauri Parser TextBlob

TextBlob-backed parsers for Swarmauri that split text into sentences or extract noun phrases. Ships two components: TextBlobSentenceParser and TextBlobNounParser.

Features

  • Sentence parser returns a Document per sentence with metadata identifying the parser.
  • Noun phrase parser returns the original text plus metadata['noun_phrases'] containing the phrases discovered by TextBlob.
  • Auto-downloads required NLTK corpora (punkt_tab) during initialization.

Prerequisites

  • Python 3.10 or newer.
  • TextBlob and its NLTK dependencies (installed automatically).
  • Internet access on first run so NLTK can download tokenizer data (or pre-download via python -m textblob.download_corpora).

Installation

# pip
pip install swarmauri_parser_textblob

# poetry
poetry add swarmauri_parser_textblob

# uv (pyproject-based projects)
uv add swarmauri_parser_textblob

Sentence Parsing

from swarmauri_parser_textblob import TextBlobSentenceParser

parser = TextBlobSentenceParser()
text = "One more large chapula please. It should be extra spicy!"

sentences = parser.parse(text)
for doc in sentences:
    print(doc.content)

Noun Phrase Extraction

from swarmauri_parser_textblob import TextBlobNounParser

parser = TextBlobNounParser()
docs = parser.parse("One more large chapula please.")

for doc in docs:
    print(doc.content)
    print(doc.metadata["noun_phrases"])

Tips

  • TextBlob uses simple heuristics—it works well for general English text but may struggle with domain-specific jargon.
  • Download corpora once in CI/CD or container builds (python -m textblob.download_corpora) to avoid runtime downloads.
  • Combine sentence and noun parsers to build structured representations of documents before vectorization or downstream NLP tasks.

Want to help?

If you want to contribute to swarmauri-sdk, read up on our guidelines for contributing that will help you get started.

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

swarmauri_parser_textblob-0.9.3.dev4.tar.gz (7.9 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

File details

Details for the file swarmauri_parser_textblob-0.9.3.dev4.tar.gz.

File metadata

  • Download URL: swarmauri_parser_textblob-0.9.3.dev4.tar.gz
  • Upload date:
  • Size: 7.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.10.3 {"installer":{"name":"uv","version":"0.10.3","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for swarmauri_parser_textblob-0.9.3.dev4.tar.gz
Algorithm Hash digest
SHA256 bca4afad24a0634ec6b0f67932cf8d17fee716499f7fae5f85ef2fcc8fa1c0bb
MD5 b99f747c07a5555f18a2aad49084ddc6
BLAKE2b-256 7e8444d4a902e9e2df78395698a0bf8ecf7cee57b68696b4a18c23c3184a6fa4

See more details on using hashes here.

File details

Details for the file swarmauri_parser_textblob-0.9.3.dev4-py3-none-any.whl.

File metadata

  • Download URL: swarmauri_parser_textblob-0.9.3.dev4-py3-none-any.whl
  • Upload date:
  • Size: 9.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.10.3 {"installer":{"name":"uv","version":"0.10.3","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for swarmauri_parser_textblob-0.9.3.dev4-py3-none-any.whl
Algorithm Hash digest
SHA256 1f26eb37e753067cf4784ca5a106f2fb98ea1e0a8e58e1202e4424d85fdd2b3e
MD5 6e3eaf49b7e2fe65dc79cabc0e9f50fc
BLAKE2b-256 b977adeaad6cecefcc720f2c43c377ee03594738d4aeb4230579a111c70d0f4e

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page