A package to run wikification
Project description
wiki_node_disambiguation - - -
What’s this ?
You can run “Wikification” as easy as possible.
According to wikipedia, Wikification is in computer science, entity linking with Wikipedia as the target knowledge base
You can get disambiguated result with its score.
Please visit Github page also. If you find any bugs and you report it to github issue, I’m glad. Any pull-requests are welcomed.
Requirement
Python3.x (checked under )
I recommend to use “Anaconda” distribution.
Setup
python setup.py install
Get wikipedia entity vector model
Go to this page and download model file from here. Or run download_model.sh
To those who uses interface.predict_japanese_wiki_names()
You’re supposed to have mysql somewhere.
The step until using it.
start mysql server somewhere
download latest mysql dump files
initialize wikipedia database with mysql
To download wikipedia dump files, execute following commands
wget https://dumps.wikimedia.org/jawiki/latest/jawiki-latest-redirect.sql.gz wget https://dumps.wikimedia.org/jawiki/latest/jawiki-latest-page.sql.gz gunzip jawiki-latest-redirect.sql.gz gunzip jawiki-latest-page.sql.gz
To initialize wikipedia database with mysql,
% CREATE DATABASE wikipedia; % mysql -u [user_name] -p[password] wikipedia < jawiki-latest-redirect.sql % mysql -u [user_name] -p[password] wikipedia < jawiki-latest-page.sql
Change logs
version0.1
released
It supports only Japanese wikipedia
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
File details
Details for the file word2vec_wikification_py-0.17.tar.gz
.
File metadata
- Download URL: word2vec_wikification_py-0.17.tar.gz
- Upload date:
- Size: 30.5 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | b8bc5ebf8eb42c44eacd6b457cc382b86b82a84015d9fe59962368370916bff0 |
|
MD5 | 4dd01250da920ce38b993b48d66994d3 |
|
BLAKE2b-256 | 2859830fe954b5f1022aed380a06aa8414808cced333c049e71b5a3b02e58510 |