A lightweight tool for parsing and downloading 4chan threads.

Project description

Features

  • An API for programmatically analysing 4chan content.

  • Customise each file name using a lambda function.

  • Filter out images or videos.

  • Concurrent downloading, with parallelism linked to the number of cores.
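
The last two ideas in the list above, per-file naming via a lambda and a worker pool sized from the core count, can be sketched as follows. This is only an illustration using the standard library: `pool_size`, `download_all`, the `fetch` callable and the dict-shaped file records are hypothetical names invented for this example and are not chandl's API.

```python
import os
from concurrent.futures import ThreadPoolExecutor


def pool_size(per_core, cores=None):
    """Return the worker count for a per-core limit (never below 1)."""
    cores = cores or os.cpu_count() or 1
    return max(1, per_core * cores)


def download_all(files, fetch, name=lambda f: f["filename"], per_core=2):
    """Run `fetch` on every file concurrently.

    `files` is a list of dicts and `fetch` is any callable taking one
    file; both are stand-ins for illustration, not chandl types.
    Returns a dict mapping each chosen file name to its fetch result.
    """
    with ThreadPoolExecutor(max_workers=pool_size(per_core)) as pool:
        results = pool.map(fetch, files)  # results come back in input order
        return {name(f): r for f, r in zip(files, results)}


# Usage: the lambda decides how each file is named.
files = [{"id": 1, "filename": "a.png"}, {"id": 2, "filename": "b.webm"}]
saved = download_all(
    files,
    fetch=lambda f: len(f["filename"]),  # placeholder for a real download
    name=lambda f: f"{f['id']}_{f['filename']}",
)
```

Threads rather than processes suit this kind of work because downloads are I/O-bound, so scaling the pool by a per-core multiplier is a reasonable default.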

Installation

To install chandl, simply run:

$ pip install chandl

Usage

$ chandl -h
usage: chandl [-h] [-V] [-v] [-f [FILTER]] [-e [EXCLUDE]] [-o [OUTPUT_DIR]]
              [-t [THREAD_DIR]] [-n [NAME]] [-p PARALLELISM]
              url

A lightweight tool for parsing and downloading 4chan threads.

positional arguments:
  url                   the URL of the thread to download

optional arguments:
  -h, --help            show this help message and exit
  -V, --version         show program's version number and exit
  -v, --verbosity       increase output verbosity
  -f [FILTER], --filter [FILTER]
                        file types or extensions to download, value either
                        comma-separated or option passed multiple times
  -e [EXCLUDE], --exclude [EXCLUDE]
                        file names to exclude, value either comma-separated or
                        option passed multiple times
  -o [OUTPUT_DIR], --output-dir [OUTPUT_DIR]
                        the directory to create the `thread-dir` within
  -t [THREAD_DIR], --thread-dir [THREAD_DIR]
                        relative to the `output-dir`, this will contain
                        downloaded files
  -n [NAME], --name [NAME]
                        the format to use for downloaded file names
  -p PARALLELISM, --parallelism PARALLELISM
                        the maximum number of download threads to use per core

Roadmap

  • Implement tracking of threads until they are deleted

  • Improve test coverage

  • Pylint or flake8 integration

Download files

Source Distribution

chandl-0.2.0.tar.gz (13.4 kB view details)

Uploaded Source

File details

Details for the file chandl-0.2.0.tar.gz.

File metadata

  • Download URL: chandl-0.2.0.tar.gz
  • Size: 13.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for chandl-0.2.0.tar.gz
Algorithm    Hash digest
SHA256       48f7824877f8a41bad4068d221fc7ad1d01b329a6f4647cd3c865963b0fdecb2
MD5          f86ef4680f590ae5d97e96e0f85d2d17
BLAKE2b-256  4274bded8a33ff5c534f9ea348fadc95663de7e2c9bbc8e05ab84aed39dd8d9c