A simple utility for crawling text from 2ch
Project description
much
A simple utility for crawling text from 2ch
Usage
The command pull
requires two attributes - url of the web page to fetch and path to output file with json
or txt
extension depending on required output file format. For example:
python -m much pull https://2ch.hk/b/arch/2018-08-22/res/181770037.html assets/stories.txt
Installation
To install dependencies and create conda environment:
conda env create -f environment.yml
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
much-0.0.1.tar.gz
(7.8 kB
view details)
File details
Details for the file much-0.0.1.tar.gz
.
File metadata
- Download URL: much-0.0.1.tar.gz
- Upload date:
- Size: 7.8 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.0 CPython/3.9.7
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | b0ad7851aef93fa055785c29aea2fd965a6f21ec1e13cd4d896d4ee7f3de17a3 |
|
MD5 | fcd3c43a5fcd038474713a270f46721b |
|
BLAKE2b-256 | 8226e76850d655c42cf20465fe1bdc829437cd290d134b0c86354f12f0546e72 |