Skip to main content

An ESA implementation in python.

Project description

Get the required resources

scp -r webis@webislab40.medien.uni-weimar.de:/home/weci2587/projects/args-topic-modeling/resources .

To run the ESA-script with all terms run:

For normal ESA:

./esa-all-terms.py  --similarity cos
                    --matrix-path <path_to_resources>/resources/esa-plain/<debatepedia|strategic-intelligence|wikipedia>.mat
                    --model-path <path_to_resources>/resources/esa-w2v/GoogleNews-vectors-negative300.bin
                    --model-vocab <path_to_resources>/resources/esa-w2v/w2v-vocab.p
                    --input-path <path_to_input_file>
                    --output-path <path_to_output_file>

For word2vec-ESA:

./esa-all-terms.py  --similarity max
                    --matrix-path <path_to_resources>/resources/esa-w2v/<debatepedia|strategic-intelligence|wikipedia>.mat
                    --model-path <path_to_resources>/resources/esa-w2v/GoogleNews-vectors-negative300.bin
                    --model-vocab <path_to_resources>/resources/esa-w2v/w2v-vocab.p
                    --input-path <path_to_input_file>
                    --output-path <path_to_output_file>

To run the word2vec-ESA with reduced terms run:

./esa-top-n-terms.py    -n <number_of_terms> 
                        --corpus-path <path_to_resources>/resources/corpora/<debatepedia|strategic-intelligence|wikipedia>.csv
                        --model-path <path_to_resources>/resources/esa-w2v/GoogleNews-vectors-negative300.bin
                        --model-vocab <path_to_resources>/resources/esa-w2v/w2v-vocab.p
                        --input-path <path_to_input_file>
                        --output-path <path_to_output_file>

The input document must be a csv file with "|" as the separator and must contain the column "document", which is used as the input text for the ESA.

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

argument_esa_model-3.11.102.linux-x86_64.tar.gz (14.8 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

argument_esa_model-3.11.102-py3-none-any.whl (10.8 kB view details)

Uploaded Python 3

File details

Details for the file argument_esa_model-3.11.102.linux-x86_64.tar.gz.

File metadata

  • Download URL: argument_esa_model-3.11.102.linux-x86_64.tar.gz
  • Upload date:
  • Size: 14.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.22.0 setuptools/49.2.0 requests-toolbelt/0.9.1 tqdm/4.51.0 CPython/3.6.9

File hashes

Hashes for argument_esa_model-3.11.102.linux-x86_64.tar.gz
Algorithm Hash digest
SHA256 7465707375d275e50674e4c64f86b8a0ca80ef8658a7ddb5572ee58a04367eb8
MD5 276873ba1ff7be7148c69166af81173f
BLAKE2b-256 10dc695a2f3777670368a477f13f054b00499e9f1b5aee335639ed02413b804c

See more details on using hashes here.

File details

Details for the file argument_esa_model-3.11.102-py3-none-any.whl.

File metadata

  • Download URL: argument_esa_model-3.11.102-py3-none-any.whl
  • Upload date:
  • Size: 10.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.22.0 setuptools/49.2.0 requests-toolbelt/0.9.1 tqdm/4.51.0 CPython/3.6.9

File hashes

Hashes for argument_esa_model-3.11.102-py3-none-any.whl
Algorithm Hash digest
SHA256 1acf26b478bfee2615d786b47b20893f960bc6eeffb545e718ad4b606d45ed1f
MD5 41b136c80927b43e1b9f04ab62214167
BLAKE2b-256 59b6b8e0a5a0c3391dbcbe33c191ca438b13ce7cbdfc3c4c5caf8fffd1c81586

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page