A simple tokenizer function for NLP

Project description

This tokenizer function turns text into lowercase word tokens, removes English stopwords, lemmatizes the tokens, and replaces URLs with a placeholder.
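
As a rough illustration of that pipeline, here is a minimal sketch built on NLTK. The function name tokenize, the urlplaceholder token, and the use of NLTK itself are assumptions made for illustration; they are not taken from this package's documented API.

    import re

    import nltk
    from nltk.corpus import stopwords
    from nltk.stem import WordNetLemmatizer
    from nltk.tokenize import word_tokenize

    # One-time downloads of the NLTK resources the pipeline relies on.
    nltk.download("punkt", quiet=True)
    nltk.download("stopwords", quiet=True)
    nltk.download("wordnet", quiet=True)

    # Matches http(s) URLs and bare www. links.
    URL_PATTERN = re.compile(r"https?://\S+|www\.\S+")

    def tokenize(text):
        """Lowercase, swap URLs for a placeholder, drop stopwords, lemmatize."""
        text = URL_PATTERN.sub("urlplaceholder", text.lower())
        stop_words = set(stopwords.words("english"))
        lemmatizer = WordNetLemmatizer()
        return [
            lemmatizer.lemmatize(tok)
            for tok in word_tokenize(text)
            if tok not in stop_words
        ]

    print(tokenize("Check out https://example.com for the running dogs"))
    # -> ['check', 'urlplaceholder', 'running', 'dog']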

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
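
If you just want to use the package rather than build it from the source distribution, installing from PyPI with pip is the usual route; this assumes the project is published under the same name as the sdist below:

    pip install jf_tokenize_package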

Source Distribution

jf_tokenize_package-1.0.3.tar.gz (1.6 kB)

Uploaded Source

File details

Details for the file jf_tokenize_package-1.0.3.tar.gz.

File metadata

  • Download URL: jf_tokenize_package-1.0.3.tar.gz
  • Upload date:
  • Size: 1.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.9.7

File hashes

Hashes for jf_tokenize_package-1.0.3.tar.gz:

  • SHA256: 69d635bb16298b59d5b76d03c273a202b3e3440827e89657fa9dfc1c40858ced
  • MD5: 017fd0628869a7bb29c762ccc4b1c430
  • BLAKE2b-256: 4ef15b7ea57b0ab383a1208bf3c6a33825965990b1b32b48566a920e514f9b96

See the pip documentation for more details on using hashes.
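
For a quick local check, the SHA256 digest above can be compared against a downloaded copy of the file using Python's standard library; the file path below assumes the sdist sits in the current directory.

    import hashlib

    EXPECTED = "69d635bb16298b59d5b76d03c273a202b3e3440827e89657fa9dfc1c40858ced"

    # Hash the downloaded sdist in chunks to keep memory use flat.
    sha256 = hashlib.sha256()
    with open("jf_tokenize_package-1.0.3.tar.gz", "rb") as f:
        for chunk in iter(lambda: f.read(8192), b""):
            sha256.update(chunk)

    print("OK" if sha256.hexdigest() == EXPECTED else "MISMATCH")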
