Skip to main content

A Python package to get basic features from the text data.

Project description

Introduction: textfeatures

When we handle the text data, we always have concerns about the data features, pre-processing of data and more likely the predictions. To improve our model, it is important to understand the data and find the more interesting features in the data like hashtags, links and many more.

What is textfeatures?

It is a python package which helps you to extract the basic features from the text data such as hashtags, stopwords, numerics which will help you to understand the data and improve your model more effectively.

Function structure:

function_name(dataframe, ”text_column”, ”new_column”)

dataframe:- name of dataframe

text_column:- name of the column from which features are to be extracted.

new_column:- new column derived by feature extraction from text_column.

What will textfeatures serve you?

1. word_count():- give the total words count present in text data.

2. char_count():- give the characters count.

3. avg_word_length():- give the average word length.

4. stopwords_count():- give the stopwords count.

5. stopwords():- extract the stopwords from the text data.

6. hashtags_count():- give the hashtags count.

7. hashtags():- extract the hashtags from text data.

8. links_count():- give the embedded links count from text data.

9. links():- extract the links from the text data.

10. numeric_count():- give the numeric digits count.

11. user_mentions_count():- give the user mentions count from text data.

12. user_mentions():- extract the user mentions from text data.

13. clean():- give the pre-processed data after removal for unnecessary material in text data.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

textfeatures-0.0.1.tar.gz (3.3 kB view details)

Uploaded Source

Built Distribution

textfeatures-0.0.1-py3-none-any.whl (3.4 kB view details)

Uploaded Python 3

File details

Details for the file textfeatures-0.0.1.tar.gz.

File metadata

  • Download URL: textfeatures-0.0.1.tar.gz
  • Upload date:
  • Size: 3.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.4.2 requests/2.23.0 setuptools/46.1.3 requests-toolbelt/0.9.1 tqdm/4.26.0 CPython/3.7.0

File hashes

Hashes for textfeatures-0.0.1.tar.gz
Algorithm Hash digest
SHA256 bcfcf08a611cdfae8b0dc98571585e8083939255de85c34c420295da33e74db9
MD5 68c0b9219f7151e5c85522268f45dc05
BLAKE2b-256 ea4ea772bedfa785beb40a18e78ed7bdb4d2850a2d7cf7569a60e6399ba00a1e

See more details on using hashes here.

File details

Details for the file textfeatures-0.0.1-py3-none-any.whl.

File metadata

  • Download URL: textfeatures-0.0.1-py3-none-any.whl
  • Upload date:
  • Size: 3.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.4.2 requests/2.23.0 setuptools/46.1.3 requests-toolbelt/0.9.1 tqdm/4.26.0 CPython/3.7.0

File hashes

Hashes for textfeatures-0.0.1-py3-none-any.whl
Algorithm Hash digest
SHA256 b1ca24edee1788bea8962357d3f80e05e3f6f1e1e90803dc63ddda81c1723eba
MD5 ed76231a11dfe4bd3e77c24bc247656f
BLAKE2b-256 072b799cbfa11b1be2f71f402e3fd70cda8015c5920ba4d095494806bbbb4321

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page