
A concurrent Python download manager

Project description

pypdl

pypdl is a Python library for downloading files from the internet. It provides features such as multi-threaded downloads, automatic retries in case of failure, the option to continue a download from a different URL if necessary, progress tracking, pause/resume functionality, and more.

Installation

To install pypdl, run the following command:

pip install pypdl

Usage

Basic Usage

To download a file using pypdl, create a Downloader object and call its start method, passing the URL of the file to download and the path where it should be saved:

from pypdl import Downloader

dl = Downloader()
dl.start('http://example.com/file.txt', 'file.txt')

Advanced Usage

The Downloader object provides additional options for advanced usage:

dl.start(
    url='http://example.com/file.txt',  # URL of the file to download
    filepath='file.txt',  # path to save the downloaded file
    num_connections=10,  # number of connections to use for a multi-threaded download
    display=True,  # whether to display download progress
    multithread=True,  # whether to use multi-threaded download
    block=True,  # whether to block until the download is complete
    retries=0,  # number of times to retry the download in case of an error
    retry_func=None,  # function to call to get a new download URL in case of an error
)

The num_connections option specifies the number of threads to use for a multi-threaded download. The default value is 10.

The display option specifies whether to display download progress. The default value is True.

The multithread option specifies whether to use multi-threaded download. The default value is True.

The block option specifies whether to block until the download is complete. The default value is True.

The retries option specifies the number of times to retry the download in case of an error. The default value is 0.

The retry_func option specifies a function to call to get a new download URL in case of an error.
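
For instance, retries and retry_func can be combined to fall back to a different URL when a download fails. The sketch below assumes retry_func is called with no arguments and returns the replacement URL; the mirror address is a placeholder used only for illustration:

from pypdl import Downloader

def get_mirror_url():
    # hypothetical fallback mirror, used only for illustration
    return 'http://mirror.example.com/file.txt'

dl = Downloader()
dl.start(
    url='http://example.com/file.txt',
    filepath='file.txt',
    retries=2,                  # retry up to 2 times on error
    retry_func=get_mirror_url,  # called to obtain a new URL for the retry
)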

Example

Here is an example that demonstrates how to use the pypdl library to download a file from the internet:

from pypdl import Downloader

def main():
    # create a new downloader object
    dl = Downloader()

    # Use custom headers to set user-agent
    dl.headers = {"User-Agent":"Mozilla/5.0 (Windows NT 6.1; Win64; x64; rv:47.0) Gecko/20100101 Firefox/47.0"}
    # Use custom proxies
    dl.proxies = {
        "http": "http://10.10.1.10:3128",
        "https": "https://10.10.1.10:1080",
    }
    # Use authentication for proxy
    dl.auth = ("user","pass")

    # start the download
    dl.start(
        url='https://speed.hetzner.de/100MB.bin',
        filepath='100MB.bin',
        num_connections=10,
        display=True,
        multithread=True,
        block=True,
        retries=3,
        retry_func=None,
    )

if __name__ == '__main__':
    main()

This example downloads a large file from the internet using 10 threads and displays the download progress. If the download fails, it will retry up to 3 times. It also uses a custom header to set the user-agent, along with custom proxies and proxy authentication.

Another example of using a custom stop event and printing the progress to console:

from pypdl import Downloader
from threading import Event
import time

# create a custom stop event
stop = Event()

# create a downloader object
dl = Downloader(stop)

# start the download process
# block=False so we can print the progress
# display=False so we can print the progress ourselves
dl.start('https://example.com/file.zip', 'file.zip', num_connections=8, block=False, display=False)

# print the progress
while dl.progress < 70:
    print(dl.progress)
    time.sleep(1)

# stop the download process
stop.set()  # can also be done by calling dl.stop()

#do something
#...

# resume the download process
dl.start('https://example.com/file.zip', 'file.zip', num_connections=8, block=False, display=False)

# print rest of the progress
while dl.progress < 100:
    print(dl.progress)
    time.sleep(1)

In this example, we create a custom stop event and pass it to the Downloader object. We then start the download process and print the progress to the console, stop the download to do something else, and afterwards resume it and print the rest of the progress. This pattern can be used to implement pause/resume functionality.
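
As a rough sketch, the same pattern can be wrapped into small helper functions using only the documented start() and stop() methods (the URL and file name below are placeholders):

from pypdl import Downloader

dl = Downloader()
URL, PATH = 'https://example.com/file.zip', 'file.zip'

def start_download():
    # non-blocking start so the caller keeps control of the program flow
    dl.start(URL, PATH, num_connections=8, block=False, display=False)

def pause_download():
    # stop the workers; the download can be continued later
    dl.stop()

def resume_download():
    # calling start() again with the same URL and path resumes the download
    dl.start(URL, PATH, num_connections=8, block=False, display=False)

# usage: start_download(); ... pause_download(); ... resume_download()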

API Reference

Downloader()

The Downloader class represents a file downloader that can download a file from a given URL to a specified file path. The class supports both single-threaded and multi-threaded downloads, as well as features such as automatic retries in case of failure, the option to continue a download from a different URL if necessary, pause/resume functionality, and progress tracking.

Parameters

  • StopEvent: An optional parameter to set a custom stop event.
  • header: An optional parameter to set custom headers. (Note: never use a custom "range" header when using multithread=True.)
  • proxies: An optional parameter to set custom proxies.
  • auth: An optional parameter to set authentication for proxies.

Attributes

  • totalMB: The total size of the file to be downloaded, in MB.
  • progress: The download progress percentage.
  • speed: The download speed, in MB/s.
  • download_mode: The download mode: single-threaded or multi-threaded.
  • time_spent: The time spent downloading, in seconds.
  • doneMB: The amount of data downloaded so far, in MB.
  • eta: The estimated time remaining for download completion, in the format "HH:MM:SS".
  • remaining: The amount of data remaining to be downloaded, in MB.
  • Stop: An event that can be used to stop the download process.
  • headers: A dictionary containing user headers.
  • proxies: A dictionary containing user proxies.
  • auth: A tuple containing authentication for proxies.
  • Failed: A flag that indicates if the download failed.

Methods

  • start(url, filepath, num_connections=10, display=True, multithread=True, block=True, retries=0, retry_func=None): Starts the download process. Parameters:
    • url (str): The download URL.
    • filepath (str): The file path to save the download.
    • num_connections (int): The number of connections to use for a multi-threaded download.
    • display (bool): Whether to display download progress.
    • multithread (bool): Whether to use multi-threaded download.
    • block (bool): Whether to block until the download is complete.
    • retries (int): The number of times to retry the download in case of an error.
    • retry_func (function): A function to call to get a new download URL in case of an error.
  • stop(): Stops the download process.
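
As a quick illustration of the attributes and methods above, a non-blocking download can be monitored by polling the progress-related attributes. This is a minimal sketch that assumes Failed is a boolean flag as described above; the URL and file name are placeholders:

import time
from pypdl import Downloader

dl = Downloader()
dl.start('https://example.com/file.zip', 'file.zip', block=False, display=False)

# poll the documented attributes until the download finishes or fails
while dl.progress < 100 and not dl.Failed:
    print(f"{dl.progress}% done, {dl.speed} MB/s, ETA {dl.eta}, {dl.remaining} MB remaining")
    time.sleep(1)

if not dl.Failed:
    print(f"Downloaded {dl.doneMB} MB of {dl.totalMB} MB in {dl.time_spent} seconds")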

Helper Classes

Multidown()

The Multidown class represents a download worker that is responsible for downloading a specific part of a file in multiple chunks.

Parameters

  • dic: Dictionary that contains the download information.
  • id: ID of the download part.
  • stop: Stop event.
  • error: Error event.
  • headers: Custom headers.
  • proxies: Custom proxies.
  • auth: Authentication for proxies.
Attributes

  • curr: The current size of the downloaded file.
  • completed: Whether the download for this part is complete.
  • id: The ID of this download part.
  • dic: A dictionary containing download information for all parts.
  • stop: An event that can be used to stop the download process.
  • error: An event that can be used to signal an error.
  • headers: A dictionary containing user headers.
  • proxies: A dictionary containing user proxies.
  • auth: A tuple containing authentication for proxies.
Methods

  • getval(key): Gets the value of a key from the dictionary.
  • setval(key, val): Sets the value of a key in the dictionary.
  • worker(): Downloads a part of the file in multiple chunks.

Singledown()

The Singledown class represents a download worker that is responsible for downloading a whole file in a single chunk.

Parameters

  • url: URL of the file.
  • path: Path to save the file.
  • stop: Stop event.
  • error: Error event.
  • headers: User headers.
  • proxies: Custom proxies.
  • auth: Authentication for proxies.
Attributes

  • curr: The current size of the downloaded file.
  • completed: Whether the download is complete.
  • url: The URL of the file to download.
  • path: The path to save the downloaded file.
  • stop: Event to stop the download.
  • error: Event to indicate an error occurred.
  • headers: Custom user headers.
  • proxies: A dictionary containing user proxies.
  • auth: A tuple containing authentication for proxies.
Methods

  • worker(): Downloads a whole file in a single chunk.

License

The pypdl library is distributed under the MIT License. See the LICENSE file for more information.

Contribution

Contributions are welcome! If you encounter any issues or have suggestions for improvements, please open an issue on the GitHub repository.

Contact

For any inquiries or questions, you can reach out to the author via email at mjishnu@skiff.com.
