Fumigate text

Project description

Fumigate

A small demo library that helps with one of NLP's most tedious tasks: text cleaning.

Installation

    pip install fumigate

# Get started

How to fumigate (clean) your text data

1. Import the Fumes class.
    from fumigate import Fumes

2. Instantiate a Fumes object
    fumes = Fumes()

3. Call the purge method to clean everything.
    result = fumes.purge(<text>)

4. Call the clean method to remove only specific kinds of content, chosen from the available methods.

    # methods is a list containing one or more of the available methods: "url", "sym", "num", "emo".
    result = fumes.clean(<text>, methods=["url", "num"], extract=False)

    # If extract=True, clean returns a tuple: the cleaned text and the removed characters/strings.
    result, garbage = fumes.clean(<text>, methods=["url", "num"], extract=True)

5. Call the stem method for stemming.
    result = fumes.stem(<text>)

6. Call the lemm method for lemmatizing.
    result = fumes.lemm(<text>)
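
To make the clean/extract behaviour described in step 4 more concrete, here is a minimal, self-contained sketch of how selective cleaning with an optional "garbage" return could work. It is not the library's implementation: the function name clean_sketch, the PATTERNS table, and the regular expressions are all assumptions chosen for illustration, and the real rules behind "url", "sym", "num", and "emo" may differ.

```python
import re

# Hypothetical patterns, keyed by the method names from step 4.
# These are illustrative guesses, not the rules used by fumigate.
PATTERNS = {
    "url": re.compile(r"https?://\S+|www\.\S+"),
    "num": re.compile(r"\d+"),
    "sym": re.compile(r"[^\w\s]"),
    "emo": re.compile("[\U0001F300-\U0001FAFF\u2600-\u27BF]"),
}


def clean_sketch(text, methods, extract=False):
    """Remove the selected kinds of content from text.

    With extract=True, also return the list of removed strings,
    in the order the methods were applied.
    """
    garbage = []
    for name in methods:
        pattern = PATTERNS[name]
        garbage.extend(pattern.findall(text))  # remember what is removed
        text = pattern.sub(" ", text)          # replace each match with a space
    text = " ".join(text.split())              # collapse leftover whitespace
    return (text, garbage) if extract else text


sample = "Visit https://example.com now, call 555 today!"
print(clean_sketch(sample, ["url", "num"]))
print(clean_sketch(sample, ["url", "num"], extract=True))
```

The order of the methods list matters in this sketch: running "sym" before "url" would strip the punctuation inside a URL and stop the URL pattern from matching it.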

Download files

Source Distribution

fumigate-0.1.8.tar.gz (3.2 kB)

Built Distribution

fumigate-0.1.8-py3-none-any.whl (3.2 kB, Python 3)