Skip to main content

Japanese Wikipedia cleaner

Project description

Japanese Wikipedia Cleaner

Apply extracted wikipedia text by WikiExtractor.

$ jawiki-cleaner --input ./wiki.txt --output ./cleaned-wiki.txt
$ jawiki-cleaner -i ./wiki.txt -o ./cleaned-wiki.txt
$ jawiki-cleaner -i ./wiki.txt # output path will be `./cleaned-wiki.txt`

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

jawiki-cleaner-0.1.3.tar.gz (3.0 kB view details)

Uploaded Source

Built Distribution

jawiki_cleaner-0.1.3-py3-none-any.whl (4.0 kB view details)

Uploaded Python 3

File details

Details for the file jawiki-cleaner-0.1.3.tar.gz.

File metadata

  • Download URL: jawiki-cleaner-0.1.3.tar.gz
  • Upload date:
  • Size: 3.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.0.5 CPython/3.8.2 Darwin/19.6.0

File hashes

Hashes for jawiki-cleaner-0.1.3.tar.gz
Algorithm Hash digest
SHA256 aa5be408dc4090575297faf9f7f90d5e53f0cc021ebedbe8754aed5ecdf8696d
MD5 3d00b02f53ada922f85a5265ec333c58
BLAKE2b-256 6f361476f8224e0b4d1fe9421811ff420711013eacddb6c35f745eede697b5c7

See more details on using hashes here.

Provenance

File details

Details for the file jawiki_cleaner-0.1.3-py3-none-any.whl.

File metadata

  • Download URL: jawiki_cleaner-0.1.3-py3-none-any.whl
  • Upload date:
  • Size: 4.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.0.5 CPython/3.8.2 Darwin/19.6.0

File hashes

Hashes for jawiki_cleaner-0.1.3-py3-none-any.whl
Algorithm Hash digest
SHA256 8ad659b43e73c597c08e4baf023fc0b1f859a3291077f88c0f9e31be3975e84e
MD5 c5869f2aaf8f2b20e72fdfb39c5d1cb5
BLAKE2b-256 92147372bc90a11fc559e76a025ecfd29e5eebc59701bffaa2fb8722d3c37e73

See more details on using hashes here.

Provenance

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page