Skip to main content

Japanese Wikipedia cleaner

Project description

Japanese Wikipedia Cleaner

Apply extracted wikipedia text by WikiExtractor.

$ jawiki-cleaner --input ./wiki.txt --output ./cleaned-wiki.txt
$ jawiki-cleaner -i ./wiki.txt -o ./cleaned-wiki.txt
$ jawiki-cleaner -i ./wiki.txt # output path will be `./cleaned-wiki.txt`

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

jawiki-cleaner-0.1.2.tar.gz (3.0 kB view details)

Uploaded Source

Built Distribution

jawiki_cleaner-0.1.2-py3-none-any.whl (4.0 kB view details)

Uploaded Python 3

File details

Details for the file jawiki-cleaner-0.1.2.tar.gz.

File metadata

  • Download URL: jawiki-cleaner-0.1.2.tar.gz
  • Upload date:
  • Size: 3.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.0.5 CPython/3.8.2 Darwin/19.6.0

File hashes

Hashes for jawiki-cleaner-0.1.2.tar.gz
Algorithm Hash digest
SHA256 d1661b424dbd7c319602ecec2413ff2aa9afbfbc1c50603d527c05b48a681439
MD5 eb4de9d4365996b934b58b812dcd7db6
BLAKE2b-256 eda4a458664b805900dce12ad75a3b3f6dd577641a274a602a8770b9987f7f0a

See more details on using hashes here.

Provenance

File details

Details for the file jawiki_cleaner-0.1.2-py3-none-any.whl.

File metadata

  • Download URL: jawiki_cleaner-0.1.2-py3-none-any.whl
  • Upload date:
  • Size: 4.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.0.5 CPython/3.8.2 Darwin/19.6.0

File hashes

Hashes for jawiki_cleaner-0.1.2-py3-none-any.whl
Algorithm Hash digest
SHA256 314463b4c9376342db9a8f84d280c0a71451e32bae9a0ea2232ae32af25950d6
MD5 fecd094c8883d28ee37101f06c803bac
BLAKE2b-256 85619a58b79b63514dae37e27d113a5fad0db006bdc0bb34f3e3cda91b367039

See more details on using hashes here.

Provenance

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page