Yandex Image Scraper

A command-line tool to scrape and download images from Yandex search results.

Features

  • Search for images on Yandex
  • Scrape image URLs from search results
  • Auto-scrolling to load more images
  • Auto-clicks "Show more" button when available
  • High-performance parallel downloading (up to 25 images per batch)
  • Customize search parameters (content type, size, orientation)
  • Save image URLs to text file
  • Browser fingerprint spoofing to avoid detection

How It Works

  1. Opens Yandex Images in a browser window using Playwright via Camoufox
  2. Applies browser fingerprint spoofing to avoid detection
  3. Searches for your query and extracts image URLs from the results
  4. Scrolls automatically to load more results and clicks "Show more" when needed
  5. Downloads all found images in parallel, up to 25 per batch
  6. Saves the images to a folder named after your search query
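
Steps 5 and 6 can be sketched roughly as follows. This is a minimal illustration using only the standard library, not the tool's actual code: the names folder_for_query, fetch_bytes and download_in_batches, the folder-name rule, and the .jpg extension are all assumptions made for the example. Only the batch size of 25 comes from the description above.

```python
import re
from concurrent.futures import ThreadPoolExecutor
from http.client import HTTPException
from pathlib import Path
from typing import Callable, Iterable, List, Optional
from urllib.request import Request, urlopen

BATCH_SIZE = 25  # from step 5 above; the tool's real setting may differ


def folder_for_query(query: str, root: Path = Path(".")) -> Path:
    """Derive a filesystem-safe folder name from the query (hypothetical rule)."""
    safe = re.sub(r"[^\w\-]+", "_", query.strip()).strip("_")
    return root / (safe or "images")


def fetch_bytes(url: str, timeout: float = 15.0) -> Optional[bytes]:
    """Download one URL; return None on a network, HTTP, or URL error."""
    try:
        request = Request(url, headers={"User-Agent": "Mozilla/5.0"})
        with urlopen(request, timeout=timeout) as response:
            return response.read()
    except (OSError, ValueError, HTTPException):
        return None


def download_in_batches(
    urls: Iterable[str],
    dest: Path,
    fetch: Callable[[str], Optional[bytes]] = fetch_bytes,
    batch_size: int = BATCH_SIZE,
) -> List[Path]:
    """Fetch URLs one batch at a time, running each batch concurrently."""
    dest.mkdir(parents=True, exist_ok=True)
    url_list = list(urls)
    saved: List[Path] = []
    for start in range(0, len(url_list), batch_size):
        batch = url_list[start:start + batch_size]
        with ThreadPoolExecutor(max_workers=len(batch)) as pool:
            # map() yields results in input order, so numbering is stable.
            for offset, data in enumerate(pool.map(fetch, batch)):
                if data is None:
                    continue  # skip failed downloads
                # The .jpg extension is assumed; real images may differ.
                path = dest / f"image_{start + offset + 1:04d}.jpg"
                path.write_bytes(data)
                saved.append(path)
    return saved
```

Calling download_in_batches(urls, folder_for_query("red panda")) would write the files into a red_panda folder. Passing fetch as a parameter keeps the batching logic testable without network access.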

Usage

# Install from PyPI
pip install yandex-image-scraper

# Run the tool
yandex-image-scraper

# Follow the prompts to:
# - Enter your search query
# - Specify how many images to download
# - Select content type (photo, clipart, etc.)
# - Choose image size and orientation
# - Enable/disable image downloads and URL saving
# - Choose headless mode (hidden browser) or visible browser

Options

When running the tool, you can customize:

  • Search query: What to search for on Yandex Images
  • Number of images: How many images to find and download
  • Content type: photo, clipart, lineart, face, demotivator, or none
  • Image size: large, medium, small, or none
  • Orientation: horizontal, vertical, square, or none
  • Headless mode: Run without showing browser window (faster but may require manual CAPTCHA solving)
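
As an illustration of how the options above might translate into a search request, here is a small sketch that builds a Yandex Images URL. The endpoint and the query parameter names (text, type, isize, iorient) are assumptions based on how Yandex image search URLs commonly look, and build_search_url is a hypothetical helper, not part of this package.

```python
from typing import Optional
from urllib.parse import urlencode

BASE_URL = "https://yandex.com/images/search"  # assumed endpoint

# Allowed values, taken from the Options list above.
CONTENT_TYPES = {"photo", "clipart", "lineart", "face", "demotivator"}
SIZES = {"large", "medium", "small"}
ORIENTATIONS = {"horizontal", "vertical", "square"}


def build_search_url(
    query: str,
    content_type: Optional[str] = None,
    size: Optional[str] = None,
    orientation: Optional[str] = None,
) -> str:
    """Build a search URL; None or "none" leaves a filter unset."""
    params = {"text": query}
    # Parameter names below are assumptions, not confirmed by this project.
    filters = (
        ("type", content_type, CONTENT_TYPES),
        ("isize", size, SIZES),
        ("iorient", orientation, ORIENTATIONS),
    )
    for name, value, allowed in filters:
        if value is None or value == "none":
            continue
        if value not in allowed:
            raise ValueError(f"unsupported value for {name}: {value!r}")
        params[name] = value
    return f"{BASE_URL}?{urlencode(params)}"
```

For example, build_search_url("red panda", content_type="photo", size="large") yields a URL with the query and two filters, while an unknown value raises ValueError instead of silently producing an unfiltered search.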

Anti-Detection Technology

This tool uses advanced browser fingerprint spoofing techniques to avoid detection:

  • Randomized user agents, languages, and timezones
  • Human-like cursor movements
  • Spoofed screen and window dimensions
  • WebGL renderer spoofing
  • Audio context randomization
  • Battery status spoofing
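
For orientation, a browser session of this kind can be started through Camoufox's Playwright-compatible sync API roughly as shown below. This is a sketch under assumptions: which options the tool actually passes is not documented here, and launch_options and page_title are hypothetical helper names. Running it requires the camoufox package and its browser binary to be installed.

```python
from typing import Any, Dict


def launch_options(headless: bool) -> Dict[str, Any]:
    """Keyword arguments for Camoufox; the tool's real choices are unknown."""
    return {
        "headless": headless,  # hide or show the browser window
        "humanize": True,      # Camoufox's human-like cursor movement
    }


def page_title(url: str = "https://yandex.com/images/", headless: bool = True) -> str:
    """Open a page in a Camoufox-managed browser and return its title."""
    # Imported lazily so launch_options() works without Camoufox installed.
    from camoufox.sync_api import Camoufox

    with Camoufox(**launch_options(headless)) as browser:
        page = browser.new_page()  # a regular Playwright Page object
        page.goto(url)
        return page.title()
```

Camoufox generates the browser fingerprint itself, so a sketch like this needs no explicit user agent or screen size settings.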

Requirements

  • Python >= 3.8

License

GPL-3.0
