A Python package to scrape and download files from the Internet Archive.
Project description
Internet Archive Manager (iadl)
A Python package to scrape and download files from the Internet Archive.
Installation
You can install the package using pip:
pip install ia-manager
Usage
Basic Usage
To download all files from an Internet Archive collection:
iadl --url https://archive.org/details/some-collection --dest ./downloads
Filter by File Type
You can filter files by specific types using the following arguments:
-
Download only archive files (e.g.,
.zip,.rar):iadl --url https://archive.org/details/some-collection --dest ./downloads --archive
-
Download only video files (e.g.,
.mp4,.avi):iadl --url https://archive.org/details/some-collection --dest ./downloads --video
-
Download only audio files (e.g.,
.mp3,.flac):
iadl --url https://archive.org/details/some-collection --dest ./downloads --audio
Limit the Number of Files
To limit the number of files downloaded:
iadl --url https://archive.org/details/some-collection --dest ./downloads --limit 5
Show File Links
To display the direct file links of each file in the terminal:
iadl --url https://archive.org/details/some-collection --show-links
Combine Filters
You can combine multiple filters. For example, to download only video and audio files:
iadl --url https://archive.org/details/some-collection --dest ./downloads --video --audio
Simultaneous Downloads
To download multiple files at the same time through separate processes. Setting the number will determine how many files at any one given moment until its finished. Recommend 2-3, be nice to the servers.
iadl --url https://archive.org/details/some-collection --dest ./downloads --audio --concurrent 3
Help
For a full list of options, use the --help flag:
iadl --help
Uninstall
If you wish to remove the dependencies (must be first otherwise pip uninstall will remove the uninstaller, IF YOU WANT THE DEPENDENCIES TO STAY, just skip this command.):
iadl-cleanup
To remove the module:
pip uninstall ia-manager
IF you messed up and ran pip uninstall ia-manager first, and still want the dependencies removed, no problem just reinstall the package again pip install ia-manager and repeat the above 2 commands in order.
Install in Virtual Enviroment
For Windows
python -m venv env
env\Scripts\Activate.ps1
pip install ia-manager
When done:
deactivate
For Linux
python3 -m venv env
source env/Scripts/activate
pip install ia-manager
When done:
deactivate
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
File details
Details for the file ia_manager-1.0.5.tar.gz.
File metadata
- Download URL: ia_manager-1.0.5.tar.gz
- Upload date:
- Size: 9.1 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.13.2
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
57ee59323e3ee73061cc9b5875154145b112a9aa689a571d2d9962613a44d015
|
|
| MD5 |
beb4c38180cbeee0fa798aaf979a309e
|
|
| BLAKE2b-256 |
8b2958767a99a268f596ce71ad0af9e0af0615d234deceed0f8c2df4842de916
|