Skip to main content

A package to run wikification

Project description

wiki_node_disambiguation - - -

What’s this ?

  • You can run “Wikification” as easy as possible.

    • According to wikipedia, Wikification is in computer science, entity linking with Wikipedia as the target knowledge base

  • You can get disambiguated result with its score.

Please visit Github page also. If you find any bugs and you report it to github issue, I’m glad. Any pull-requests are welcomed.

Requirement

  • Python3.x (checked under )

    • I recommend to use “Anaconda” distribution.

Setup

python setup.py install

Get wikipedia entity vector model

Go to this page and download model file from here. Or run download_model.sh

To those who uses interface.predict_japanese_wiki_names()

You’re supposed to have mysql somewhere.

The step until using it.

  1. start mysql server somewhere

  2. download latest mysql dump files

  3. initialize wikipedia database with mysql

To download wikipedia dump files, execute following commands

wget https://dumps.wikimedia.org/jawiki/latest/jawiki-latest-redirect.sql.gz
wget https://dumps.wikimedia.org/jawiki/latest/jawiki-latest-page.sql.gz
gunzip jawiki-latest-redirect.sql.gz
gunzip jawiki-latest-page.sql.gz

To initialize wikipedia database with mysql,

% CREATE DATABASE wikipedia;
% mysql -u [user_name] -p[password] wikipedia < jawiki-latest-redirect.sql
% mysql -u [user_name] -p[password] wikipedia < jawiki-latest-page.sql

Change logs

  • version0.1

    • released

    • It supports only Japanese wikipedia

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

word2vec_wikification_py-0.17.tar.gz (30.5 kB view details)

Uploaded Source

File details

Details for the file word2vec_wikification_py-0.17.tar.gz.

File metadata

File hashes

Hashes for word2vec_wikification_py-0.17.tar.gz
Algorithm Hash digest
SHA256 b8bc5ebf8eb42c44eacd6b457cc382b86b82a84015d9fe59962368370916bff0
MD5 4dd01250da920ce38b993b48d66994d3
BLAKE2b-256 2859830fe954b5f1022aed380a06aa8414808cced333c049e71b5a3b02e58510

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page