Package for end-to-end sentiment analysis using neural architectures

Project description

sentiment_analysis_csci_e89

This package enables its users to perform end-to-end sentiment analysis with state-of-the-art techniques.
The API assumes a common data model that is described in detail in the documentation. In short, the modules expect tabular datasets with the following fields for training data:

  1. text_id
  2. text
  3. label

and the following fields for live test data:

  1. text_id
  2. text
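
As a rough illustration of the data model above, the sketch below builds a tiny training table and a tiny live table in memory and checks that each carries the expected fields. Only the field names come from the description above; the sample rows, the 0/1 label encoding, and the helper `missing_fields` are made up for illustration and are not part of the package API.

```python
import csv
import io

# Field names taken from the data model described above.
TRAIN_FIELDS = ["text_id", "text", "label"]
LIVE_FIELDS = ["text_id", "text"]

# Made-up sample rows; the label encoding (0/1) is an assumption.
TRAIN_CSV = """text_id,text,label
1,I loved this movie,1
2,The plot was dull,0
"""

LIVE_CSV = """text_id,text
101,What a great soundtrack
"""


def read_table(raw):
    """Parse CSV text into a list of row dictionaries."""
    return list(csv.DictReader(io.StringIO(raw)))


def missing_fields(rows, required):
    """Hypothetical helper: return the required fields absent from the table."""
    present = set(rows[0].keys()) if rows else set()
    return [name for name in required if name not in present]


train_rows = read_table(TRAIN_CSV)
live_rows = read_table(LIVE_CSV)

print(missing_fields(train_rows, TRAIN_FIELDS))  # fields missing from training data
print(missing_fields(live_rows, LIVE_FIELDS))    # fields missing from live data
```

Running a check of this kind before handing a dataset to the modules can catch a misnamed column early.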

The API contains five main modules:

  1. data_cleaning: A class written to support a number of popular machine learning datasets. It cleans the raw data and structures it in a way that the other modules can use.
  2. pre_processing: A class that provides a number of high-level functions to perform sophisticated data transformations and cleaning. This class is responsible for preparing the raw text data for our neural architectures.
  3. modeling: A class that provides a number of methods, each dedicated to training a particular type of architecture. Refer to the documentation for the exact specification of each of the architectures provided.
  4. pretrained_embeddings: A class that provides methods to prepare popular word embeddings (GloVe and word2vec) in a format that our networks can work with. We require that the user download the raw data from the appropriate sources. Once again, details are included in the documentation.
  5. predict_newdata: A class that provides methods to use our trained networks to make predictions on live data. Live data, as I define it here, can be thought of as test data that is processed and prepared outside of the original efforts that processed the data our model was trained and validated against.
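
To give a sense of what preparing pretrained embeddings (item 4) involves, here is a minimal sketch that reads vectors in the plain-text layout used by the published GloVe files: one token per line, followed by its space-separated vector components. The sample vectors are made up, and `load_glove_lines` is a hypothetical helper, not a method of the pretrained_embeddings class; refer to the documentation for the package's actual interface.

```python
import io

# Made-up 3-dimensional vectors in GloVe's plain-text layout.
SAMPLE = """good 0.1 0.2 0.3
bad -0.1 -0.2 -0.3
"""


def load_glove_lines(handle):
    """Hypothetical helper: map each token to its vector as a list of floats."""
    embeddings = {}
    for line in handle:
        parts = line.rstrip().split(" ")
        if len(parts) < 2:
            continue  # skip blank or malformed lines
        token, values = parts[0], parts[1:]
        embeddings[token] = [float(v) for v in values]
    return embeddings


vectors = load_glove_lines(io.StringIO(SAMPLE))
print(len(vectors), len(vectors["good"]))
```

With a real download, the same function could be given an open file handle instead of the in-memory sample.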

A number of different neural architectures are provided with easy-to-call methods, allowing you to train sophisticated models with no more than a few lines of code. Some of the architectures implement transfer learning and require that certain files be downloaded locally.

Please refer to the documentation and the tutorial script.
The tutorial is a Jupyter notebook with a step-by-step implementation. Please find it here: https://github.com/stefano10p/-sentiment_analysis_csci_e89-/tree/master/tutorial

Installation

Run the following to install:

pip install sentiment-analysis-csci-e89

Please download the documentation from here:
https://github.com/stefano10p/-sentiment_analysis_csci_e89-/tree/master/docs/_build/html
Save each of the HTML files into a local directory on your machine.

You may also download this package from my GitHub: https://github.com/stefano10p/-sentiment_analysis_csci_e89-

You will find a requirements.txt file when you clone the repository. On your machine, create a virtual environment:
conda create --name sentiment_analysis
Activate the environment and use the requirements file to configure it with all the necessary dependencies:
conda activate sentiment_analysis
pip install -r requirements.txt
You are ready to use the package. Thank you!
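
One way to confirm that the installation worked is to ask Python for the installed version of the distribution. The sketch below assumes Python 3.8 or later (for `importlib.metadata`); `installed_version` is a hypothetical helper written for this example, not part of the package.

```python
from importlib import metadata


def installed_version(dist_name):
    """Return the installed version of a distribution, or None if absent."""
    try:
        return metadata.version(dist_name)
    except metadata.PackageNotFoundError:
        return None


version = installed_version("sentiment-analysis-csci-e89")
print(version if version else "package not found in this environment")
```

Run it inside the activated environment so that the lookup reflects that environment's packages.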

Download files

Source Distribution

sentiment_analysis_csci_e89-2.1.1.tar.gz (48.0 kB)

Uploaded Source

Built Distribution

sentiment_analysis_csci_e89-2.1.1-py3-none-any.whl (27.9 kB)

Uploaded Python 3

File details

Details for the file sentiment_analysis_csci_e89-2.1.1.tar.gz.

File metadata

  • Download URL: sentiment_analysis_csci_e89-2.1.1.tar.gz
  • Upload date:
  • Size: 48.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.23.0 setuptools/46.1.3.post20200330 requests-toolbelt/0.9.1 tqdm/4.45.0 CPython/3.7.6

File hashes

Hashes for sentiment_analysis_csci_e89-2.1.1.tar.gz
Algorithm Hash digest
SHA256 5f28b3aa0a0df058f9a212db8d7ac1034c8e771cedb40a0775bed9b58037a5cf
MD5 9df4a9a5cd47a64fd7cef621e2549946
BLAKE2b-256 df3ebf1a8a4004aea79353903195646aa448d0a8eaeb416a0f9ad8ed4a411b87
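
A downloaded file can be checked against the digests listed above. The following is a minimal sketch using Python's standard `hashlib`; the helper names `sha256_of_file` and `matches_expected` are introduced here for illustration, and the file path passed in is assumed to point at wherever the archive was saved.

```python
import hashlib

# SHA256 digest listed above for the source distribution.
EXPECTED_SHA256 = "5f28b3aa0a0df058f9a212db8d7ac1034c8e771cedb40a0775bed9b58037a5cf"


def sha256_of_file(path, chunk_size=65536):
    """Compute the SHA256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path, expected=EXPECTED_SHA256):
    """Return True when the file's digest equals the expected digest."""
    return sha256_of_file(path) == expected.lower()
```

For example, `matches_expected("sentiment_analysis_csci_e89-2.1.1.tar.gz")` would be the call to make if the archive sits in the current directory.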

File details

Details for the file sentiment_analysis_csci_e89-2.1.1-py3-none-any.whl.

File metadata

  • Download URL: sentiment_analysis_csci_e89-2.1.1-py3-none-any.whl
  • Upload date:
  • Size: 27.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.23.0 setuptools/46.1.3.post20200330 requests-toolbelt/0.9.1 tqdm/4.45.0 CPython/3.7.6

File hashes

Hashes for sentiment_analysis_csci_e89-2.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 8b0306135c1b864588d2276f7c77271d7baa897e5ac276b11b02b1afd68a0ec2
MD5 ef649b83e5d2b6655d787829059725b6
BLAKE2b-256 2a0e6fce7743eae699a41f6a9cce727ac79d9e8b4ef69c83a86b3eb7fc10172e
