Skip to main content

A Python package to get basic features from the text data.

Project description

Introduction: textfeatures

When we handle the text data, we always have concerns about the data features, pre-processing of data and more likely the predictions. To improve our model, it is important to understand the data and find the more interesting features in the data like hashtags, links and many more.

What is textfeatures?

It is a python package which helps you to extract the basic features from the text data such as hashtags, stopwords, numerics which will help you to understand the data and improve your model more effectively.

Function structure:

function_name(dataframe, ”text_column”, ”new_column”)

dataframe:- name of dataframe

text_column:- name of the column from which features are to be extracted.

new_column:- new column derived by feature extraction from text_column.

What will textfeatures serve you?

1. word_count():- give the total words count present in text data.

2. char_count():- give the characters count.

3. avg_word_length():- give the average word length.

4. stopwords_count():- give the stopwords count.

5. stopwords():- extract the stopwords from the text data.

6. hashtags_count():- give the hashtags count.

7. hashtags():- extract the hashtags from text data.

8. links_count():- give the embedded links count from text data.

9. links():- extract the links from the text data.

10. numeric_count():- give the numeric digits count.

11. user_mentions_count():- give the user mentions count from text data.

12. user_mentions():- extract the user mentions from text data.

13. clean():- give the pre-processed data after removal for unnecessary material in text data.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

textfeatures-0.0.2-py3-none-any.whl (3.4 kB view details)

Uploaded Python 3

File details

Details for the file textfeatures-0.0.2-py3-none-any.whl.

File metadata

  • Download URL: textfeatures-0.0.2-py3-none-any.whl
  • Upload date:
  • Size: 3.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.4.2 requests/2.23.0 setuptools/46.1.3 requests-toolbelt/0.9.1 tqdm/4.26.0 CPython/3.7.0

File hashes

Hashes for textfeatures-0.0.2-py3-none-any.whl
Algorithm Hash digest
SHA256 ce9574bd0b4e03a50615d05701cd30fc48f451389fc7694360348498da0fceef
MD5 1792aac19dad27137f1296f47a12866a
BLAKE2b-256 27252ead3ed336ed0082474e493c045aad3aa3cf3df96bcf5e78585f0a89f06b

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page