A package containing NLP programs

NGD - Natural Language Processing Programs

This package contains a collection of Natural Language Processing (NLP) programs that demonstrate various NLP techniques and concepts.

Installation

From the root of a source checkout, install the package in editable mode with pip:

pip install -e .

Available Programs

The package includes seven programs, each covering a different area of NLP:

  1. Natural Language Processing Text Analysis

    • Tokenization (sentence and word)
    • Stopword removal
    • Stemming
    • Lemmatization
  2. Named Entity Recognition with NLTK

    • Tokenization
    • Part-of-speech tagging
    • Named entity chunking
    • BIO tagging
  3. TF-IDF Implementation

    • Term Frequency calculation
    • Document Frequency calculation
    • Inverse Document Frequency calculation
    • TF-IDF scoring
    • Using scikit-learn's TfidfVectorizer
  4. N-grams Analysis

    • Unigram, bigram, and trigram generation
    • Frequency analysis
    • Text cleaning and preprocessing
  5. Sentiment Analysis

    • Using NLTK's VADER sentiment analyzer
    • Sentiment scoring
    • Detailed sentiment analysis with pandas
  6. Text Classification with Naive Bayes

    • Text preprocessing
    • Feature extraction
    • Model training and evaluation
    • Prediction on new texts
  7. Word Embeddings with Word2Vec

    • Word2Vec model training
    • Word vector visualization
    • Similar word finding
    • Word analogies
    • Word similarity calculation
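As an illustration of the steps listed under program 3, here is a minimal pure-Python sketch of TF-IDF scoring. It is not the code shipped in the package: the function names tokenize and tf_idf, the whitespace tokenizer, and the sample documents are all assumptions made for this example. It uses the plain idf = log(N / df) definition, so its numbers will differ from scikit-learn's TfidfVectorizer, which by default smooths the IDF and L2-normalizes each document vector.

```python
import math
from collections import Counter


def tokenize(text):
    """Lowercase and split on whitespace (a deliberately naive tokenizer)."""
    return text.lower().split()


def tf_idf(documents):
    """Return one {term: score} dict per document.

    TF  = count of the term in the document / number of tokens in the document
    DF  = number of documents that contain the term
    IDF = log(N / DF), where N is the number of documents
    """
    tokenized = [tokenize(doc) for doc in documents]
    n_docs = len(tokenized)

    # Document frequency: count each term at most once per document.
    df = Counter()
    for tokens in tokenized:
        df.update(set(tokens))

    idf = {term: math.log(n_docs / count) for term, count in df.items()}

    scores = []
    for tokens in tokenized:
        counts = Counter(tokens)
        total = len(tokens)
        scores.append({term: (c / total) * idf[term] for term, c in counts.items()})
    return scores


# Hypothetical sample corpus, used only for this illustration.
docs = [
    "the cat sat on the mat",
    "the dog sat on the log",
    "the cats and the dogs",
]
scores = tf_idf(docs)

# Print the terms of the first document, highest score first.
for term, score in sorted(scores[0].items(), key=lambda item: -item[1]):
    print(f"{term}: {score:.4f}")
```

In this sketch a term that occurs in every document (here "the") gets an IDF of log(1) = 0 and therefore a score of zero, while a term unique to one document (such as "cat") scores highest.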

Usage

To view the source code of a program, pass its number from the list above to the print_program function:

from ngd import print_program

# Print program 1
print_program(1)

# Print program 2
print_program(2)

# And so on...

Requirements

The package depends on the following libraries:

  • nltk
  • pandas
  • scikit-learn
  • requests
  • gensim
  • matplotlib
  • numpy
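Before running the programs, it can help to confirm that these libraries are importable in the current environment. The following is a small sketch using only the standard library. The REQUIREMENTS mapping and the missing_requirements helper are names made up for this example and are not part of the package. Note that scikit-learn is imported as sklearn, so the distribution name and the module name differ for that entry.

```python
from importlib.util import find_spec

# Distribution name -> importable module name.
# Only scikit-learn uses a different module name (sklearn).
REQUIREMENTS = {
    "nltk": "nltk",
    "pandas": "pandas",
    "scikit-learn": "sklearn",
    "requests": "requests",
    "gensim": "gensim",
    "matplotlib": "matplotlib",
    "numpy": "numpy",
}


def missing_requirements(requirements=REQUIREMENTS):
    """Return the distribution names whose modules cannot be found."""
    return [
        dist
        for dist, module in requirements.items()
        if find_spec(module) is None
    ]


missing = missing_requirements()
if missing:
    print("Missing dependencies:", ", ".join(missing))
else:
    print("All dependencies are available.")
```

Anything reported as missing can then be installed with pip under its distribution name.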

License

This project is licensed under the MIT License - see the LICENSE file for details.
