Japanese Wikipedia cleaner

Cleans Japanese Wikipedia text. Apply it to text that has already been extracted by WikiExtractor.

$ jawiki-cleaner --input ./wiki.txt --output ./cleaned-wiki.txt
$ jawiki-cleaner -i ./wiki.txt -o ./cleaned-wiki.txt
$ jawiki-cleaner -i ./wiki.txt # output path will be `./cleaned-wiki.txt`
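The same invocations can be driven from a Python script, for example when cleaning several extracted files in a row. The following is a minimal sketch, not part of the package: it assumes `jawiki-cleaner` is installed and on `PATH`, and the helper names `build_command` and `clean` are hypothetical. It only uses the `--input` and `--output` flags shown above.

```python
import subprocess
from typing import List, Optional


def build_command(input_path: str, output_path: Optional[str] = None) -> List[str]:
    """Build the argument list for one jawiki-cleaner run (hypothetical helper)."""
    cmd = ["jawiki-cleaner", "--input", input_path]
    if output_path is not None:
        # When --output is omitted, the tool chooses its own default output path.
        cmd += ["--output", output_path]
    return cmd


def clean(input_path: str, output_path: Optional[str] = None) -> None:
    """Run jawiki-cleaner; raises CalledProcessError on a non-zero exit status."""
    subprocess.run(build_command(input_path, output_path), check=True)
```

Calling `clean("./wiki.txt", "./cleaned-wiki.txt")` would then correspond to the first command above. Passing the arguments as a list avoids shell quoting problems with unusual file names.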

Download files

Source Distribution

jawiki-cleaner-0.1.1.tar.gz (3.0 kB)

Uploaded Source

Built Distribution

jawiki_cleaner-0.1.1-py3-none-any.whl (4.0 kB)

Uploaded Python 3

File details

Details for the file jawiki-cleaner-0.1.1.tar.gz.

File metadata

  • Download URL: jawiki-cleaner-0.1.1.tar.gz
  • Upload date:
  • Size: 3.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.0.5 CPython/3.8.2 Darwin/19.6.0

File hashes

Hashes for jawiki-cleaner-0.1.1.tar.gz
Algorithm Hash digest
SHA256 ceb5e811fa179f0ebe207d0146df9645a31e4a2d6e9d8ad769c93778d3085eba
MD5 94e0e66dd88ab2f6eeffbaedcb018af0
BLAKE2b-256 795552561335bf84daa57ab75250f6592be2915cfbd82ba0674364025899df2a
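A downloaded file can be checked against a published digest before installing it. This is a minimal sketch using only Python's standard `hashlib`; the function names `sha256_of_file` and `matches_published_hash` are hypothetical, and the local file path is whatever location you downloaded the archive to.

```python
import hashlib


def sha256_of_file(path: str, chunk_size: int = 65536) -> str:
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        # iter() with a sentinel stops when read() returns an empty bytes object.
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_hash(path: str, expected_hex: str) -> bool:
    """Compare the file's SHA256 digest with a published hex digest."""
    return sha256_of_file(path) == expected_hex.strip().lower()
```

For the source distribution, `expected_hex` would be the SHA256 value listed above; the same check applies to any other file with its own listed digest.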

File details

Details for the file jawiki_cleaner-0.1.1-py3-none-any.whl.

File metadata

  • Download URL: jawiki_cleaner-0.1.1-py3-none-any.whl
  • Upload date:
  • Size: 4.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.0.5 CPython/3.8.2 Darwin/19.6.0

File hashes

Hashes for jawiki_cleaner-0.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 b512c4ee34aca590eb9bef4055730f7d3398b3fefd2d8935fa41090c1fceb998
MD5 a4b8f35c17e3a5a623b393a2eb9013ed
BLAKE2b-256 73a75e512057f54096f1cdd3b41b530a5f74b2db956c7da72f6fb0b6c965846b
