Skip to main content

An optionally asynchronous multithreaded downloader for python

Project description

multithread

Multithread is an optionally asynchronous Python library for downloading files using several threads.

Features

  • Lightweight: one file, a little over 100 lines of code excluding license
  • Extensive: the ability to pass your own sessions and your own arguments to each request means you don't need to wait for your desired feature to be implemented; anything you can do in aiohttp, you can do in multithread!
  • Fast: benefit from the speed of aiohttp and multithreaded downloading!

Installation

Requirements:

  • Python 3.5.3+
  • aiohttp
  • Tinche/aiofiles
  • optional: tqdm

Use the package manager pip to install multithread.
For support for progress bars, install multithread[progress].

pip3 install multithread

Usage

import multithread

download_object = multithread.Download("http://url.com/file", filename)
download_object.start()

# passing headers (you can pass any other arguments that aiohttp.ClientSession.request can take as well)
download_object = multithread.Download("http://url.com/file", filename, aiohttp_args={"headers": {"a": "b", "c": "d"}})
download_object.start()

Documentation

Downloader

Downloader(self, url, file, threads=4, session=None, progress_bar=True, aiohttp_args={'method': 'GET'}, create_dir=True)

An optionally asynchronous multi-threaded downloader class using aiohttp

Attributes:

- url (str): The URL to download
- file (str or path-like object): The filename to write the download to.
- threads (int): The number of threads to use to download
- session (aiohttp.ClientSession): An existing session to use with aiohttp
- new_session (bool): True if a session was not passed, and the downloader created a new one
- progress_bar (bool): Whether to output a progress bar or not
- aiohttp_args (dict): Arguments to be passed in each aiohttp request. If you supply a Range header using this, it will be overwritten in fetch()

__init__

Downloader.__init__(self, url, file, threads=4, session=None, progress_bar=True, aiohttp_args={'method': 'GET'}, create_dir=True)

Assigns arguments to self for when asyncstart() or start() calls download.

All arguments are assigned directly to self except for:

- session: if not passed, a ClientSession is created
- aiohttp_args: if the key "method" does not exist, it is set to "GET"
- create_dir: see parameter description

Parameters:

- url (str): The URL to download
- file (str or path-like object): The filename to write the download to.
- threads (int): The number of threads to use to download
- session (aiohttp.ClientSession): An existing session to use with aiohttp
- progress_bar (bool): Whether to output a progress bar or not
- aiohttp_args (dict): Arguments to be passed in each aiohttp request. If you supply a Range header using this, it will be overwritten in fetch()
- create_dir (bool): If true, the directories encompassing the file will be created if they do not exist already.

start

Downloader.start(self)

Calls asyncstart() synchronously

asyncstart

Downloader.asyncstart(self)

Re-initializes file and calls download() with it. Closes session if necessary

fetch

Downloader.fetch(self, progress=False, filerange=(0, ''))

Individual thread for fetching files.

Parameters:

- progress (bool or tqdm.Progress): the progress bar (or lack thereof) to update
- filerange (tuple): the range of the file to get

download

Downloader.download(self)

Generates ranges and calls fetch() with them.

Contributing

Any and all pull requests are welcome. As this is a small project, there are no strict standards, but please try to keep your code clean to a reasonable standard. Alternatively, if you would like to clean my code, that would be more than welcome!

License

MIT

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

multithread-1.0.1.tar.gz (3.8 kB view details)

Uploaded Source

Built Distribution

multithread-1.0.1-py3-none-any.whl (5.3 kB view details)

Uploaded Python 3

File details

Details for the file multithread-1.0.1.tar.gz.

File metadata

  • Download URL: multithread-1.0.1.tar.gz
  • Upload date:
  • Size: 3.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.2.0 requests-toolbelt/0.9.1 tqdm/4.35.0 CPython/3.7.3

File hashes

Hashes for multithread-1.0.1.tar.gz
Algorithm Hash digest
SHA256 4b21503d93a633b70ab75ec7c7d7afa3ecb1d1c5d36b2ae75be18e4b03026754
MD5 e5d067f9155036570e3df86884abdc06
BLAKE2b-256 19b0812b3b410c2703ce0c42cb8c28f79b1c43ea21ba0e4a26985845920f7a04

See more details on using hashes here.

File details

Details for the file multithread-1.0.1-py3-none-any.whl.

File metadata

  • Download URL: multithread-1.0.1-py3-none-any.whl
  • Upload date:
  • Size: 5.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.2.0 requests-toolbelt/0.9.1 tqdm/4.35.0 CPython/3.7.3

File hashes

Hashes for multithread-1.0.1-py3-none-any.whl
Algorithm Hash digest
SHA256 e9880543e885f77f1e34c2019742f639aaebee87c74d893dc20c0eb9098fbd0d
MD5 c54b3c1cccaecb2c451e55bb63a0306c
BLAKE2b-256 455264dc375bff64c23fe505bf6ab59a299883bdebfba888fb4f1a4cf1d6c090

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page