SampleSa·PyPI

Sentiment analysis using RNN

Project description

Sentiment analysis (SA) is the use of natural language processing, statistics, and text analysis to identify the sentiment of a text and classify it as positive, negative, or neutral. The main objective of this project is to build a model that performs sentiment analysis on positive, negative, and sarcastic sentences using a recurrent neural network (RNN). The dataset is cleaned by removing stop words and HTML tags, and word vectors are then generated for it using GloVe and Word2Vec.
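The cleaning step could look roughly like the sketch below. It is only an illustration: the stop-word list, the regular expressions, and the function name clean_text are assumptions, since the package does not document how it cleans the data.

```python
import html
import re

# Tiny illustrative stop-word list (an assumption; the list the package
# actually uses is not documented).
STOP_WORDS = {"a", "an", "the", "is", "was", "of", "and", "to", "in", "it", "this"}

TAG_RE = re.compile(r"<[^>]+>")        # matches HTML tags such as <p> or </b>
TOKEN_RE = re.compile(r"[a-z0-9']+")   # matches lowercase word-like tokens


def clean_text(raw: str) -> list[str]:
    """Strip HTML tags, lowercase, tokenize, and drop stop words."""
    no_tags = TAG_RE.sub(" ", raw)
    unescaped = html.unescape(no_tags)  # turn entities such as &amp; into characters
    tokens = TOKEN_RE.findall(unescaped.lower())
    return [t for t in tokens if t not in STOP_WORDS]


print(clean_text("<p>This movie was <b>great</b> &amp; the cast is superb!</p>"))
# prints ['movie', 'great', 'cast', 'superb']
```

The resulting token list is what would then be mapped to GloVe or Word2Vec vectors.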

SA using Recurrent Neural Network (RNN).

An RNN is a class of artificial neural network in which connections between units form a directed cycle, which allows it to exhibit dynamic temporal behavior. The hidden layer of an RNN acts as storage (memory) for the network. An RNN differs from an ordinary feed-forward network in two main ways: its parameters (weights and biases) are global, that is, shared across all timesteps, and the network is temporal and dynamic, because the unrolled network varies in size with the length of the input while the same task is executed at each timestep on a different input.

The RNN works on temporal data: at each timestep one word is taken as input, and the next word is the target output of the network. This repeats until the end of the sentence. At the first timestep the first word is given and the network outputs the second word; at the second timestep the second word is given and the third word is produced. This is how the network is trained. A sentence of n words therefore needs (n-1) timesteps. At the last timestep, the hidden layer values are stored and then passed to a multi-layer perceptron (MLP) for classification. The training data was labelled manually.
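The forward pass described above can be sketched with NumPy as follows. This is a minimal sketch, not the code in RNN.py: the parameter names U, V, W, b1 and b2 come from the code details of this project, but their roles here (U for input-to-hidden, W for hidden-to-hidden, V for hidden-to-output), the tanh activation, the layer sizes, and the random inputs are all assumptions based on the usual simple (Elman) RNN formulation.

```python
import numpy as np


def rnn_forward(word_vectors, U, W, V, b1, b2):
    """Run a simple RNN over one sentence.

    word_vectors: array of shape (n, d), one embedding per word.
    Returns (S, O): the hidden states and the predicted next-word
    vectors, each with n - 1 rows (one per timestep).
    """
    n = word_vectors.shape[0]
    s_prev = np.zeros(W.shape[0])          # initial hidden state
    states, outputs = [], []
    for t in range(n - 1):                 # the last word is only ever a target
        # The same U, W, V, b1, b2 are reused at every timestep.
        s_t = np.tanh(U @ word_vectors[t] + W @ s_prev + b1)
        o_t = V @ s_t + b2                 # prediction for word t + 1
        states.append(s_t)
        outputs.append(o_t)
        s_prev = s_t
    return np.array(states), np.array(outputs)


# Illustrative sizes: 100-dimensional word vectors, 16 hidden units, 5 words.
rng = np.random.default_rng(0)
d, h, n = 100, 16, 5
U = rng.normal(scale=0.1, size=(h, d))
W = rng.normal(scale=0.1, size=(h, h))
V = rng.normal(scale=0.1, size=(d, h))
b1, b2 = np.zeros(h), np.zeros(d)

sentence = rng.normal(size=(n, d))         # stand-in for real word vectors
S, O = rnn_forward(sentence, U, W, V, b1, b2)
sentence_features = S[-1]                  # hidden state at the last timestep
print(S.shape, O.shape, sentence_features.shape)
# prints (4, 16) (4, 100) (16,)
```

A five-word sentence yields four timesteps, matching the (n-1) count above, and the last row of S is the vector that would be handed to the classifier. The sketch needs a sentence of at least two words.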

Usage:

Generate GloVe and Word2Vec vectors of the required dimension (e.g. 100, 200, or 300), or download pre-generated vectors of both.
Set the dimension parameter to match the dimensionality of the word vectors.
Set the appropriate file paths.
Run sa.py as shown below.

python ./sa.py -word_embedding W2V/GloVe/Both 'File_path that contains train and test folders'
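Because the dimension parameter has to agree with the vector file, a quick check before running can save a failed run. The sketch below reads vectors in the plain-text layout written by the Stanford GloVe tool (one word followed by its values on each line) and rejects any line whose length does not match. The function name load_glove and the in-memory sample are hypothetical; this is not the package's own loader.

```python
import io


def load_glove(handle, dimension):
    """Read GloVe-style text vectors: a word, then `dimension` floats per line."""
    vectors = {}
    for line_no, line in enumerate(handle, start=1):
        if not line.strip():
            continue                        # ignore blank lines
        parts = line.rstrip().split(" ")
        if len(parts) != dimension + 1:
            raise ValueError(
                f"line {line_no}: expected {dimension} values, "
                f"found {len(parts) - 1}"
            )
        vectors[parts[0]] = [float(x) for x in parts[1:]]
    return vectors


# In-memory stand-in for a real vector file, using 3-dimensional vectors.
sample = io.StringIO("good 0.1 0.2 0.3\nbad -0.1 -0.2 -0.3\n")
vectors = load_glove(sample, dimension=3)
print(vectors["good"])
# prints [0.1, 0.2, 0.3]
```

With a real file, the handle would come from open() on the vector file path, and dimension would be the value chosen when the vectors were generated.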

Code Details:

sa.py :: Main program; the entry point for running the code.
main.py :: Loads the GloVe vectors for each sentence, calls the RNN for the words in the sentence, and writes the S_t values to a CSV file.
demo.sh, eval and SRC :: The code that produces the GloVe vectors.
Main_GloVe.py :: Calls the GloVe code to generate the word vectors. The GloVe code is taken from the GitHub repository https://github.com/stanfordnlp/GloVe and produces the word vector file.
GloVe_Extraction.py :: Loads the vectors corresponding to the words in the sentences. Each call to the function returns the word vectors for one sentence.
Main_W2V.py :: Generates the Word2Vec vectors by calling the W2V code. This task is done using the NLTK tool.
W2VGenerate.py :: Produces the word vectors.
RNN.py :: Takes one word of the sequence at each timestep and outputs the immediately following word. U, V, W, b1 and b2 are parameters shared throughout the network. Returns the hidden layer values (S_t).
MLP.py :: Classifies the sentiment of the text. The features extracted by the RNN are given as input to this multi-layer perceptron.
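The last stage, in which the hidden layer values S_t are classified by the MLP, might be sketched like this. The single tanh hidden layer, the softmax output, the layer sizes, and the names mlp_predict, W1, c1, W2 and c2 are assumptions made for illustration, and the weights are random rather than trained, so the predicted label is meaningless here. Only the three classes come from the project description.

```python
import numpy as np

LABELS = ["positive", "negative", "sarcastic"]


def softmax(z):
    """Numerically stable softmax over a 1-D array."""
    e = np.exp(z - np.max(z))
    return e / e.sum()


def mlp_predict(features, W1, c1, W2, c2):
    """One-hidden-layer perceptron: features -> tanh layer -> softmax."""
    hidden = np.tanh(W1 @ features + c1)
    probs = softmax(W2 @ hidden + c2)
    return LABELS[int(np.argmax(probs))], probs


rng = np.random.default_rng(1)
features = rng.normal(size=16)              # stand-in for the final S_t vector
W1 = rng.normal(scale=0.1, size=(8, 16))    # untrained, illustrative weights
c1 = np.zeros(8)
W2 = rng.normal(scale=0.1, size=(len(LABELS), 8))
c2 = np.zeros(len(LABELS))

label, probs = mlp_predict(features, W1, c1, W2, c2)
print(label, probs.round(3))
```

In a real run, features would be read from the CSV file of S_t values, and the weights would be fitted to the manually labelled sentences.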

Project details

Release history

This version: 1.1.1 (Sep 22, 2018)

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

SampleSa-1.1.1.tar.gz (2.5 kB view details)

Uploaded Sep 22, 2018 Source

Built Distribution

SampleSa-1.1.1-py3-none-any.whl (2.8 kB view details)

Uploaded Sep 22, 2018 Python 3

File details

Details for the file SampleSa-1.1.1.tar.gz.

File metadata

Download URL: SampleSa-1.1.1.tar.gz
Upload date: Sep 22, 2018
Size: 2.5 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/1.11.0 pkginfo/1.4.2 requests/2.19.1 setuptools/39.0.1 requests-toolbelt/0.8.0 tqdm/4.26.0 CPython/3.7.0

File hashes

Hashes for SampleSa-1.1.1.tar.gz
Algorithm	Hash digest
SHA256	`eccb14734cb0b3bb7ee2e581a007205a13b668d17dbabd0d033d883448e20681`
MD5	`5ed55d4a3970f8c311d4500f248c2c9a`
BLAKE2b-256	`414fdfa6b79e5ddfdb2f66d7550b5a8f5716ccf728e821dcee746048a5c7a66c`

File details

Details for the file SampleSa-1.1.1-py3-none-any.whl.

File metadata

Download URL: SampleSa-1.1.1-py3-none-any.whl
Upload date: Sep 22, 2018
Size: 2.8 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/1.11.0 pkginfo/1.4.2 requests/2.19.1 setuptools/39.0.1 requests-toolbelt/0.8.0 tqdm/4.26.0 CPython/3.7.0

File hashes

Hashes for SampleSa-1.1.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`622f9fcb9de7cf684796e93dc2b294d5c5d2a5cf89bf9c281eae8a92e206d394`
MD5	`b1e65f0a07c45ed8cb1d1eb3e1030f61`
BLAKE2b-256	`da2d2a2ce03cb09724ee49305c2097b145ded4d85282c025329d87a7aad8c785`
