Skip to main content

TextBlob Parser for Swarmauri.

Project description

Swarmauri Logo

PyPI - Downloads Hits PyPI - Python Version PyPI - License PyPI - swarmauri_parser_textblob


Swarmauri Parser TextBlob

TextBlob-backed parsers for Swarmauri that split text into sentences or extract noun phrases. Ships two components: TextBlobSentenceParser and TextBlobNounParser.

Features

  • Sentence parser returns a Document per sentence with metadata identifying the parser.
  • Noun phrase parser returns the original text plus metadata['noun_phrases'] containing the phrases discovered by TextBlob.
  • Auto-downloads required NLTK corpora (punkt_tab) during initialization.

Prerequisites

  • Python 3.10 or newer.
  • TextBlob and its NLTK dependencies (installed automatically).
  • Internet access on first run so NLTK can download tokenizer data (or pre-download via python -m textblob.download_corpora).

Installation

# pip
pip install swarmauri_parser_textblob

# poetry
poetry add swarmauri_parser_textblob

# uv (pyproject-based projects)
uv add swarmauri_parser_textblob

Sentence Parsing

from swarmauri_parser_textblob import TextBlobSentenceParser

parser = TextBlobSentenceParser()
text = "One more large chapula please. It should be extra spicy!"

sentences = parser.parse(text)
for doc in sentences:
    print(doc.content)

Noun Phrase Extraction

from swarmauri_parser_textblob import TextBlobNounParser

parser = TextBlobNounParser()
docs = parser.parse("One more large chapula please.")

for doc in docs:
    print(doc.content)
    print(doc.metadata["noun_phrases"])

Tips

  • TextBlob uses simple heuristics—it works well for general English text but may struggle with domain-specific jargon.
  • Download corpora once in CI/CD or container builds (python -m textblob.download_corpora) to avoid runtime downloads.
  • Combine sentence and noun parsers to build structured representations of documents before vectorization or downstream NLP tasks.

Want to help?

If you want to contribute to swarmauri-sdk, read up on our guidelines for contributing that will help you get started.

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

swarmauri_parser_textblob-0.9.3.dev18.tar.gz (7.9 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

File details

Details for the file swarmauri_parser_textblob-0.9.3.dev18.tar.gz.

File metadata

  • Download URL: swarmauri_parser_textblob-0.9.3.dev18.tar.gz
  • Upload date:
  • Size: 7.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.10.12 {"installer":{"name":"uv","version":"0.10.12","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for swarmauri_parser_textblob-0.9.3.dev18.tar.gz
Algorithm Hash digest
SHA256 86af77bf9c5d4f4c8af56aa1a8561cd74230faad8ecc626e1c1d87686ce14a78
MD5 e0341429f4a1005941824d2c1912ea5d
BLAKE2b-256 eacd0e2a3fbdfb186a0435e1d9f8ccccfcfdd77f54f3b7bd1604582b851db7d0

See more details on using hashes here.

File details

Details for the file swarmauri_parser_textblob-0.9.3.dev18-py3-none-any.whl.

File metadata

  • Download URL: swarmauri_parser_textblob-0.9.3.dev18-py3-none-any.whl
  • Upload date:
  • Size: 9.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.10.12 {"installer":{"name":"uv","version":"0.10.12","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for swarmauri_parser_textblob-0.9.3.dev18-py3-none-any.whl
Algorithm Hash digest
SHA256 1bb4d390e29755c38299a59c56ebbc1087b465df7934fe4f258705c309fd92d9
MD5 a9b122d61fa4af01d7abcfc1f20c2c30
BLAKE2b-256 6418b5e2b29da483fc00b50aeca5ebb45d503861bde24ca2a3d88dff68585708

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page