Skip to main content

Text pre-processing function for NLP

Project description

TextPreProcessing

Python package for preprocessing text for NLP models

INSTALLATION

Download PyPi Package , or

$ pip install text-ppf

To upgrade:

$ pip install text-ppf --upgrade

WHAT IT DOES

An all-in-one function which,

  • Removes punctuation
  • Splits the text into a list of words
  • Removes stopwords
  • Makes each word lowercase
  • Lemmatizes each word

USAGE

text_ppf

import pandas as pd
from text_ppf import text_ppf

#LOADING DATASET
df = pd.read_csv('filename.csv')
# print(df.head)

dfNew = df['heading'].apply(text_ppf.clean_up)
print(dfNew)

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

text_ppf-1.0.0.tar.gz (2.9 kB view details)

Uploaded Source

Built Distribution

text_ppf-1.0.0-py3-none-any.whl (3.1 kB view details)

Uploaded Python 3

File details

Details for the file text_ppf-1.0.0.tar.gz.

File metadata

  • Download URL: text_ppf-1.0.0.tar.gz
  • Upload date:
  • Size: 2.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.2 importlib_metadata/3.10.1 pkginfo/1.7.1 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.49.0 CPython/3.9.4

File hashes

Hashes for text_ppf-1.0.0.tar.gz
Algorithm Hash digest
SHA256 a70a5e57b7a7e1de622a7b22844e1e33a4065d152ca050544b65b4959ef4bdf1
MD5 86ef80a92f2359bf14e8e6dde0aa9618
BLAKE2b-256 2295abc87711eec83990703422e69ea299ca8afd0101ae030518cc44ff6876e2

See more details on using hashes here.

File details

Details for the file text_ppf-1.0.0-py3-none-any.whl.

File metadata

  • Download URL: text_ppf-1.0.0-py3-none-any.whl
  • Upload date:
  • Size: 3.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.2 importlib_metadata/3.10.1 pkginfo/1.7.1 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.49.0 CPython/3.9.4

File hashes

Hashes for text_ppf-1.0.0-py3-none-any.whl
Algorithm Hash digest
SHA256 50410f945a7d7778336c94db95bf7b998f01789f66d0c253161fd1e99ed03825
MD5 a250531ee62855eda71bded49094a84e
BLAKE2b-256 ccbc6eae24144aa0efa928d94b960088433b2547dddc9edc7fdcc9250928fd75

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page