8 projects
ja-entity-parser
Japanese entity parser library for company/corporate name normalization and extraction.
jpcorpreg
jpcorpreg is a Python library that downloads corporate registry which is published in the Corporate Number Publication Site as a data frame.
keibascraper
keibascraper is a simple scraping library for netkeiba.com
joyokanji
The joyokanji converts old-form kanji characters into new-form kanji characters.
cnparser
cnparser is a parser library of Corporate Number Publication Site data.
sagikoza
A Python library for crawling and retrieving all notices published under Japan’s Furikome Sagi Relief Act, with support for both full data extraction and incremental updates.
jpdatetime
The jpdatetime library extends Python's datetime to support Japanese eras (元号). It allows parsing and formatting dates in Japanese eras
nsloader
This script collects articles from Wall Street Journal and returns it in dict format.