Skip to main content

Text pre-processing function for NLP

Project description

TextPreProcessing

Python package for preprocessing text for NLP models

INSTALLATION

Download PyPi Package , or

$ pip install text-ppf

To upgrade:

$ pip install text-ppf --upgrade

WHAT IT DOES

An all-in-one function which,

  • Removes punctuation
  • Splits the text into a list of words
  • Removes stopwords
  • Makes each word lowercase
  • Lemmatizes each word

USAGE

text_ppf

import pandas as pd
from text_ppf import text_ppf

#LOADING DATASET
df = pd.read_csv('filename.csv')
# print(df.head)

dfNew = df['heading'].apply(text_ppf.clean_up)
print(dfNew)

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

text_ppf-1.0.0.tar.gz (2.9 kB view hashes)

Uploaded Source

Built Distribution

text_ppf-1.0.0-py3-none-any.whl (3.1 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page