An ESA implementation in python.
Project description
Get the required resources
scp -r webis@webislab40.medien.uni-weimar.de:/home/weci2587/projects/args-topic-modeling/resources .
To run the ESA-script with all terms run:
For normal ESA:
./esa-all-terms.py --similarity cos
--matrix-path <path_to_resources>/resources/esa-plain/<debatepedia|strategic-intelligence|wikipedia>.mat
--model-path <path_to_resources>/resources/esa-w2v/GoogleNews-vectors-negative300.bin
--model-vocab <path_to_resources>/resources/esa-w2v/w2v-vocab.p
--input-path <path_to_input_file>
--output-path <path_to_output_file>
For word2vec-ESA:
./esa-all-terms.py --similarity max
--matrix-path <path_to_resources>/resources/esa-w2v/<debatepedia|strategic-intelligence|wikipedia>.mat
--model-path <path_to_resources>/resources/esa-w2v/GoogleNews-vectors-negative300.bin
--model-vocab <path_to_resources>/resources/esa-w2v/w2v-vocab.p
--input-path <path_to_input_file>
--output-path <path_to_output_file>
To run the word2vec-ESA with reduced terms run:
./esa-top-n-terms.py -n <number_of_terms>
--corpus-path <path_to_resources>/resources/corpora/<debatepedia|strategic-intelligence|wikipedia>.csv
--model-path <path_to_resources>/resources/esa-w2v/GoogleNews-vectors-negative300.bin
--model-vocab <path_to_resources>/resources/esa-w2v/w2v-vocab.p
--input-path <path_to_input_file>
--output-path <path_to_output_file>
The input document must be a csv file with "|" as the separator and must contain the column "document", which is used as the input text for the ESA.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Close
Hashes for argument_esa_model-3.0.18.linux-x86_64.tar.gz
Algorithm | Hash digest | |
---|---|---|
SHA256 | e729b6bde41d31628fa71b8422de7c137d7495b6b5516ba7ea35f84e6f026426 |
|
MD5 | 2286ecf9fc85f126c0762b476cff8a58 |
|
BLAKE2b-256 | b0a4bf202d9fdf98538eae470db8db812464b2b43b6d499a4ed828147aa57924 |
Close
Hashes for argument_esa_model-3.0.18-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 386713800907754d1e9116250e70b3d70104d12be5261cc7e61a6b6b315f3773 |
|
MD5 | 9f6b192d58d7274b7899d6c3e0ba0b54 |
|
BLAKE2b-256 | 6135ffc2f2532be4330923b3a95425e69af389e0b119d99537cc52e5f5137834 |