A multithreaded tool for searching and downloading images from popular search engines. It is straightforward to set up and run!
Project description
🕳️CygnusX1
Code by Trong-Dat Ngo.
Overviews
🕳️CygnusX1 is a multithreaded tool 🛠️, used to search and download images from popular search engines 🔎. It is straightforward to set up and run!
Key features
- 🥰 No knowledge is required to get up and to run.
- 🚀 Download image using customizable number of threads.
- ⛏️Crawl all possible images (search results and recommendations).
Installation
This repository is tested on Python 3.6+ and PyTorch selenium 3.141.0+, as well as it works fine on macOS, Windows, Linux.
You should setup and run 🕳️CygnusX1 in a virtual environment. If you're unfamiliar with Python virtual environments, check out the user guide here.
First, create a virtual environment with the version of Python you're going to use and activate it. (Can be omitted if you want to set up directly on the OS environment)
source venv/bin/activate
Pip Insstallation
Install 🕳️CygnusX1 by pip:
pip install CygnusX1
Manual Installation
Download 🕳️CygnusX1 from Github:
git clone https://github.com/dat821168/CygnusX1.git
Finally install dependencies in requirements.txt
:
pip install -r requirements.txt
Run
Use cygnusx1 command line:
cygnusx1 --keywords "keyword 1, keyword 2" --workers 8 --use_suggestions --headless
Use run.py
to start the script:
python run.py --keywords "keyword 1, keyword 2" --workers 8 --use_suggestions --headless
Argument details:
--keywords
: Indicate the keywords/keyphrases you want to search. For multiple keywords, separate them with commas.--out_dir
: Path where to save results. Default = './IMAGES'.--workers
: The maximum number of workers used to crawl image. Default = 2.--use_suggestions
: Crawl search engine suggestions/recommendations. Default = False.--headless
: Hide browser during scraping. Default = False.
Future Releases
- Suppor Google search engine.
- Support Bing search engine.
- Support Baidu search engine.
References
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file CygnusX1-1.0.2.tar.gz
.
File metadata
- Download URL: CygnusX1-1.0.2.tar.gz
- Upload date:
- Size: 6.5 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.1.1 pkginfo/1.4.2 requests/2.22.0 setuptools/45.2.0 requests-toolbelt/0.8.0 tqdm/4.64.0 CPython/3.8.10
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | f3c2b01acfde2cac49f7bb1e14569ca99bdf0e218bc433aa428dad73f9542e04 |
|
MD5 | fa4631044bec8b094bb66f048d2ae187 |
|
BLAKE2b-256 | 1b0a513d779f2a7730d507fcfd1c138f0399ae6b85622bc98e174cc0cd12a7a0 |
File details
Details for the file CygnusX1-1.0.2-py3-none-any.whl
.
File metadata
- Download URL: CygnusX1-1.0.2-py3-none-any.whl
- Upload date:
- Size: 8.2 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.1.1 pkginfo/1.4.2 requests/2.22.0 setuptools/45.2.0 requests-toolbelt/0.8.0 tqdm/4.64.0 CPython/3.8.10
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | b86d4b32567ecc1e06591a273c694eb69ebc3941e65c13ca361d2385736c772e |
|
MD5 | a1e8f98dbc82b44788cfeabd6a2468ec |
|
BLAKE2b-256 | e0e355930934681e8f40eae61a993f7fc297b504e3a83f41dce6dece80498c73 |