Skip to main content

A simple utility for crawling text from 2ch

Project description

much

A simple utility for crawling text from 2ch

Usage

The command pull requires two attributes - url of the web page to fetch and path to output file with json or txt extension depending on required output file format. For example:

python -m much pull https://2ch.hk/b/arch/2018-08-22/res/181770037.html assets/stories.txt

Installation

To install dependencies and create conda environment:

conda env create -f environment.yml

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

much-0.0.1.tar.gz (7.8 kB view details)

Uploaded Source

File details

Details for the file much-0.0.1.tar.gz.

File metadata

  • Download URL: much-0.0.1.tar.gz
  • Upload date:
  • Size: 7.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.0 CPython/3.9.7

File hashes

Hashes for much-0.0.1.tar.gz
Algorithm Hash digest
SHA256 b0ad7851aef93fa055785c29aea2fd965a6f21ec1e13cd4d896d4ee7f3de17a3
MD5 fcd3c43a5fcd038474713a270f46721b
BLAKE2b-256 8226e76850d655c42cf20465fe1bdc829437cd290d134b0c86354f12f0546e72

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page