Project description

much

A simple utility for crawling text from 2ch

Usage

The command pull requires two attributes - url of the web page to fetch and path to output file with json or txt extension depending on required output file format. For example:

python -m much pull https://2ch.hk/b/arch/2018-08-22/res/181770037.html assets/stories.txt

To fetch archived threads on 17th page:

python -m much fetch 17

To list top 10 fetched threads by size (cumulative number of characters in messages longer than 100 symbols):

python -m much top 10

To star a thread (copy it to folder assets/starred with a given name):

python -m much star 263473351 discussion

Installation

To install through pip:

pip install much

To install dependencies and create conda environment:

conda env create -f environment.yml

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.0.7

Nov 10, 2023

0.0.6

Nov 9, 2023

0.0.5

Oct 25, 2023

This version

0.0.4

Oct 25, 2023

0.0.3

Oct 23, 2023

0.0.2

Oct 15, 2023

0.0.1

Oct 12, 2023

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

much-0.0.4.tar.gz (9.3 kB view details)

Uploaded Oct 25, 2023 Source

File details

Details for the file much-0.0.4.tar.gz.

File metadata

Download URL: much-0.0.4.tar.gz
Upload date: Oct 25, 2023
Size: 9.3 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/4.0.0 CPython/3.9.7

File hashes

Hashes for much-0.0.4.tar.gz
Algorithm	Hash digest
SHA256	`30e93bfc1a8a4cf43bf65a9d8f9d91115c7d831d40f8a1567647f43f1f5cf108`
MD5	`71174093f00d67bae58f0cbd3b5ebe10`
BLAKE2b-256	`707463764caec66621618f3f68d23a6c15cd4a23889457c38d47ded7038f52cb`