A general purpose data downloading library.

fastdownload

Easily download, verify, and extract archives

If you have datasets or other archives that you want to make available to your users, and ensure they always have the latest versions and that they are downloaded correctly, fastdownload can help.

Install

Using pip:

pip install fastdownload

...or using conda:

conda install -c fastai fastdownload

What's this about?

fastdownload is useful when you have one or more URLs pointing at archives you want to make available, and you want to ensure that your users download those archives correctly, have the latest version, and can access the information in those archives as easily as possible.

Your user calls a single method, FastDownload.get, passing the required URL, and the archive at that URL is downloaded and extracted to the directories you choose. The path to the extracted contents is returned. If that URL has already been downloaded, then the cached archive or contents are used automatically. However, if the size or hash of the archive differs from what it should be, the user is informed and a new version is downloaded.
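The size-and-hash check described above can be pictured roughly as follows. This is a minimal sketch, not fastdownload's own code: the helper name `archive_is_current`, the choice of SHA-256, and the chunked read are all assumptions made for illustration.

```python
import hashlib
from pathlib import Path


def archive_is_current(path, expected_size, expected_hash):
    """Return True if the file at `path` matches the expected size and hash.

    Hypothetical helper: SHA-256 is an assumption, not necessarily the
    algorithm fastdownload uses.
    """
    path = Path(path)
    if not path.is_file():
        return False
    # The size check is cheap, so do it before hashing.
    if path.stat().st_size != expected_size:
        return False
    digest = hashlib.sha256()
    with path.open("rb") as f:
        # Read in 1 MiB chunks so large archives are not loaded into memory at once.
        for chunk in iter(lambda: f.read(1 << 20), b""):
            digest.update(chunk)
    return digest.hexdigest() == expected_hash
```

A cached archive that fails a check like this would be treated as stale and fetched again.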

In the future, you may want to update one or more of your archives. When you do so, fastdownload will ensure your users have the latest version, by checking their downloaded archives against your updated file size and hash information.

For instance, fastai uses fastdownload to provide access to datasets for deep learning. fastai users can download and extract them with a single command, using the return value to access the files. The files are automatically placed in appropriate subdirectories of a .fastai folder in the user's home directory. If a dataset is updated, users are informed the next time they use the dataset, and the latest version is automatically downloaded and extracted for them.

Usage: downloading files

When your users download an archive, fastdownload will automatically save it to a directory, check that the size and hash match, and extract the contents. Minimal usage for downloading and extracting is:

from fastdownload import FastDownload
d = FastDownload()
path = d.get('https://...')

After this, path will contain the path where the extracted files are located. By default, archives are saved to {base}/archive, and extracted to {base}/data. {base} defaults to ~/.fastdownload. If there is more than one file or folder in the root of the downloaded archive, then a new folder is created in data for the contents.
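To make the default layout concrete, here is a small sketch of how the two locations relate to the base directory. It only mirrors the rules stated above; `resolve_dirs` is a hypothetical helper, not part of fastdownload, and how the library treats absolute archive or data values is not covered.

```python
from pathlib import Path


def resolve_dirs(base="~/.fastdownload", archive="archive", data="data"):
    """Return (archive_dir, data_dir) under `base`, following the defaults described above."""
    # Expand "~" to the current user's home directory.
    base_dir = Path(base).expanduser()
    return base_dir / archive, base_dir / data


archive_dir, data_dir = resolve_dirs()
print(archive_dir)  # where downloaded archives would be kept
print(data_dir)     # where extracted contents would be placed
```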

Instead of get, use download to download the URL without extracting it, or extract to extract the URL without downloading it (assuming it's already been downloaded to the archive directory). All of these methods accept a force parameter which will download/extract the archive even if it's already present.
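As an illustration of splitting the work into two steps, the sketch below downloads first and extracts afterwards. It relies only on what is stated above: that download and extract take the URL and accept force. The wrapper name `fetch_in_two_steps` is hypothetical, and nothing is assumed about the return values of download or extract.

```python
def fetch_in_two_steps(downloader, url, force=False):
    """Download `url` into the archive directory, then extract it as a separate step.

    `downloader` is expected to be a FastDownload instance (or any object
    exposing the same `download` and `extract` methods).
    """
    # Step 1: fetch the archive only; nothing is unpacked yet.
    downloader.download(url, force=force)
    # Step 2: unpack the archive that is now in the archive directory.
    downloader.extract(url, force=force)
```

With the library installed, this would be called as `fetch_in_two_steps(FastDownload(), 'https://...')`. Separating the steps can be handy when the download happens on one occasion and the extraction on another.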

You can change any or all of the base, archive, and data paths by passing them to FastDownload:

d = FastDownload(base='~/.mypath', archive='downloaded', data='extracted')

You can remove the cached archive file and/or the extracted contents with rm:

d.rm('https://...')

Usage: making archives available to download

fastdownload will add a file download_checks.py to your Python module which contains file sizes and hashes for your archives. The file is located in the same directory as a module you choose, e.g.:

d = FastDownload(module=fastai.some_module)

Then use update to create or update the size and hash for a URL:

d.update('https://...')

You will now find a file called download_checks.py in the same directory where fastai.some_module is located, which contains a Python dict with the URL, size, and hash for this file. If you've previously downloaded this file to your archive path, that copy will be used instead of downloading a new one. Use get(force=True) first to download a new copy even if you already have it in your archive.
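To give a feel for the kind of information update records, the sketch below computes a size and hash for a local file and stores them in a dict keyed by URL. The function name `record_check`, the `(size, hash)` tuple layout, and the use of SHA-256 are assumptions for illustration; the actual layout of download_checks.py may differ.

```python
import hashlib
from pathlib import Path


def record_check(checks, url, archive_path):
    """Store the size and hash of `archive_path` in `checks` under `url`.

    Hypothetical helper; the entry layout is illustrative only.
    """
    # Reads the whole file into memory, which is fine for a small illustration.
    data = Path(archive_path).read_bytes()
    checks[url] = (len(data), hashlib.sha256(data).hexdigest())
    return checks[url]
```

Re-running such a step after replacing an archive changes the stored entry, which is what lets clients notice that their cached copy is out of date.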

Config file

If there is a file called config.ini in your base directory, then keys archive and data will be used as the default values for FastDownload. The file should be in configparser format. Here's a sample config.ini:

[DEFAULT]
archive = downloaded
data = extracted

If there is no ini file present, one will be automatically created for you using the details you pass to FastDownload.

You can add any additional key/value pairs to the config file that you want. When you call FastDownload.get, pass extract_key to use a key other than data for choosing a location to extract to.
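Since the file is in configparser format, it can be inspected with the standard library. The sketch below parses the sample above plus one extra key; the key name `models` and the helper `read_locations` are hypothetical, chosen only to show how an additional key could sit alongside archive and data.

```python
import configparser

# Sample config text; `models` is a made-up extra key for illustration.
SAMPLE_INI = """\
[DEFAULT]
archive = downloaded
data = extracted
models = trained
"""


def read_locations(ini_text):
    """Parse config text and return the DEFAULT section as a plain dict."""
    parser = configparser.ConfigParser()
    parser.read_string(ini_text)
    return dict(parser.defaults())


locations = read_locations(SAMPLE_INI)
print(locations["archive"])
print(locations["models"])
```

With such a key in the config file, passing `extract_key='models'` to FastDownload.get would select that location for extraction, as described above.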

Release history

0.0.7 (Jul 7, 2022)
0.0.6 (May 20, 2022)
0.0.5 (Aug 8, 2021), this version
0.0.4 (Aug 8, 2021)
0.0.3 (Aug 1, 2021)
0.0.2 (Aug 1, 2021)
0.0.1 (Jul 31, 2021)

Download files

Source Distribution

fastdownload-0.0.5.tar.gz (30.7 kB), uploaded Aug 8, 2021

Built Distribution

fastdownload-0.0.5-py3-none-any.whl (13.0 kB), uploaded Aug 8, 2021, Python 3

File details

Details for the file fastdownload-0.0.5.tar.gz.

File metadata

Download URL: fastdownload-0.0.5.tar.gz
Upload date: Aug 8, 2021
Size: 30.7 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/3.4.1 importlib_metadata/3.10.0 pkginfo/1.7.1 requests/2.25.1 requests-toolbelt/0.9.1 tqdm/4.61.2 CPython/3.8.10

File hashes

Hashes for fastdownload-0.0.5.tar.gz
Algorithm	Hash digest
SHA256	`64e67af30690fa98ae1c8a1b52495769842f723565239a5430208ad05585af18`
MD5	`898c57fd31fb70d4fe997583bae30400`
BLAKE2b-256	`b65dcb08e45b5a08db1d4b19bf438168a3a55cfae27ab2f8ba5c8d5e26d83833`

File details

Details for the file fastdownload-0.0.5-py3-none-any.whl.

File metadata

Download URL: fastdownload-0.0.5-py3-none-any.whl
Upload date: Aug 8, 2021
Size: 13.0 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/3.4.1 importlib_metadata/3.10.0 pkginfo/1.7.1 requests/2.25.1 requests-toolbelt/0.9.1 tqdm/4.61.2 CPython/3.8.10

File hashes

Hashes for fastdownload-0.0.5-py3-none-any.whl
Algorithm	Hash digest
SHA256	`bf5c48fd415e01484307dd619ff01fc0747f22b0ee01cafd1b2b3b98d34f31f4`
MD5	`3589b383983bc847e000a422c9f9d80a`
BLAKE2b-256	`4a7dd352ae8f0aa2170f9e0ae4676148675a738cf9fd0c034bd024b82f7df8ed`
