Skip to main content

Script to extract words from Wikipedia dump files and create dictionaries.

Project description

extract_wiki_words - Extract words from Wikipedia dumps and create dictionaries

This script can extract words from Wikipedia XML dumps and create dictionaries.

Install extract_wiki_words on your system using :

pip install extract-wiki-words

Usage of the script to create a dictionary from a Wikipedia dump (https://dumps.wikimedia.org):

extract_wiki_words frwiki-20210520-pages-articles-multistream.xml

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

extract_wiki_words-0.1.1-py3-none-any.whl (6.6 kB view details)

Uploaded Python 3

File details

Details for the file extract_wiki_words-0.1.1-py3-none-any.whl.

File metadata

  • Download URL: extract_wiki_words-0.1.1-py3-none-any.whl
  • Upload date:
  • Size: 6.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.1 importlib_metadata/4.2.0 pkginfo/1.7.0 requests/2.25.1 requests-toolbelt/0.9.1 tqdm/4.61.0 CPython/3.9.4

File hashes

Hashes for extract_wiki_words-0.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 88cd08c341b5a43ec01be8720b738fa27b596dc79a6d4630dea153eb88e48096
MD5 207196c9b6c3e6bd58d9c36edf1a67b0
BLAKE2b-256 972a055cda7c434b510e3131d5326b5c42de467699b581e8f0f52ec48b4b265b

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page