Skip to main content

Japanese Wikipedia cleaner

Project description

Japanese Wikipedia Cleaner

Apply extracted wikipedia text by WikiExtractor.

$ jawiki-cleaner --input ./wiki.txt --output ./cleaned-wiki.txt
$ jawiki-cleaner -i ./wiki.txt -o ./cleaned-wiki.txt
$ jawiki-cleaner -i ./wiki.txt # output path will be `./cleaned-wiki.txt`

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

jawiki-cleaner-0.1.0.tar.gz (3.0 kB view details)

Uploaded Source

Built Distribution

jawiki_cleaner-0.1.0-py3-none-any.whl (4.0 kB view details)

Uploaded Python 3

File details

Details for the file jawiki-cleaner-0.1.0.tar.gz.

File metadata

  • Download URL: jawiki-cleaner-0.1.0.tar.gz
  • Upload date:
  • Size: 3.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.0.5 CPython/3.8.2 Darwin/19.6.0

File hashes

Hashes for jawiki-cleaner-0.1.0.tar.gz
Algorithm Hash digest
SHA256 6a757763c820ce30d6445d9ccabfcdfc34dd6d3b5c316bd98130c2d1d090e955
MD5 684903796807b1273f942cdc7cb55514
BLAKE2b-256 69eeb3431cecc4cd3389e9297dd99440cfbe25d7aa24e8c39569c7246913502b

See more details on using hashes here.

Provenance

File details

Details for the file jawiki_cleaner-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: jawiki_cleaner-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 4.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.0.5 CPython/3.8.2 Darwin/19.6.0

File hashes

Hashes for jawiki_cleaner-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 27d24340b87fc132db366c320971f470e017f95491b05a6ae4c7d8767e3516a7
MD5 8abd8c6dbabe1323e584394fa38ea56e
BLAKE2b-256 ac8b87a40026f1806c6494bf8d6332f84181f9a7fe3c4a3e046028dd98d9d80b

See more details on using hashes here.

Provenance

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page