Skip to main content

A Firefox Selenium WebDriver that patches the browser to avoid detection. Bypasses services such as Cloudflare, Distil Networks, and more. Ideal for web scraping, automated testing, and bot development without getting detected.

Project description

Undetected GeckoDriver v1.0.5

A Python package that integrates with Firefox Selenium to bypass anti-bot detection mechanisms, ideal for web scraping, automated testing, and browser automation without being marked as a bot.

PyPI version License: MIT Downloads Downloads Downloads

Preview

Click to see the preview

With undetected-geckodriver:

undetected

Without undetected-geckodriver:

undetected

Overview

Note: Currently, this package supports only Linux due to recent issues. I am actively working on making it compatible with other operating systems.

Undetected GeckoDriver is a powerful Python package designed to work seamlessly with the Selenium browser automation framework. Selenium allows you to control web browsers through code, making it an essential tool for web scraping, automated testing, and browser automation. However, when browsers are controlled by scripts (often referred to as "puppet browsers"), they typically set specific properties that can be detected by anti-bot services like Cloudflare. For instance, properties such as navigator.webdriver can be checked using JavaScript, which may restrict access to content on sites protected by such services.

To address this issue, Undetected GeckoDriver acts as an interface between your code and Selenium, helping you bypass bot detection mechanisms. When you create a new WebDriver instance using the Firefox() class from the Undetected GeckoDriver package (as opposed to using Selenium directly), the following processes occur:

  1. The original Firefox binary is located, copied, and patched to prevent it from modifying properties such as navigator.webdriver while using Selenium.
  2. A Selenium WebDriver instance is created that uses the patched Firefox binary.

This makes it possible to interact with websites without being detected as a bot, allowing you to scrape data, automate tasks, and perform other browser-based operations without triggering bot detection mechanisms. Whether you're involved in web scraping, automated testing, or any other form of browser automation, Undetected GeckoDriver provides a reliable solution to avoid detection and ensure smooth operation.

Installation

You can install the package via pip:

pip install undetected-geckodriver

Or you can install it from source:

git clone https://github.com/ByteXenon/undetected_geckodriver
cd undetected_geckodriver
pip install .

Note: The last example is not supported, and you shouldn't use it unless you want to modify the package.

Usage

Since Undetected GeckoDriver acts as an interface for Selenium, you can use it the same way you would use Selenium.

You can integrate Undetected GeckoDriver into your existing Selenium code by simply replacing the selenium.webdriver.Firefox imports with undetected_geckodriver.Firefox.

Here are a couple of examples demonstrating how you can use this project:

  1. Creating a new undetected WebDriver instance and navigating to example.com:

    from undetected_geckodriver import Firefox
    
    driver = Firefox()
    driver.get("https://www.example.com")
    
  2. Searching for "Undetected Geckodriver 1337!" on Google:

    import time
    from undetected_geckodriver import Firefox
    from selenium.webdriver.common.by import By
    
    # Constants
    SEARCH_FOR = "Undetected Geckodriver 1337!"
    GOOGLE_URL = "https://www.google.com"
    
    # Initialize the undetected Firefox browser
    driver = Firefox()
    
    # Navigate to Google
    driver.get(GOOGLE_URL)
    
    # Locate the search box and perform the search
    search_box = driver.find_element(By.NAME, "q")
    search_box.send_keys(SEARCH_FOR)
    search_box.submit()
    
    # Wait for the page to load
    time.sleep(2)
    
    # Print the current URL after the search
    print("Current URL:", driver.current_url)
    
    # Wait for a while to observe the results
    time.sleep(15)
    
    # Ensure the browser is closed
    driver.quit() # Close the browser
    

For further information and advanced usage, you can take a look at the official Selenium documentation since Undetected GeckoDriver is built on top of Selenium.

Roadmap

Completed

  • Spoof navigator.webdriver property: Implement a method to spoof the navigator.webdriver property to prevent detection by services like Cloudflare. This helps in avoiding bot detection mechanisms.

In Progress

  • Multi-platform support: Extend the compatibility of the tool to work seamlessly across different operating systems other than Linux (Windows, macOS). This includes ensuring all dependencies and configurations are platform-independent.

Planned

  • Helper functions for passing CAPTCHAs and other security measures: Develop utility functions to automate the solving of CAPTCHAs and bypass other common security measures encountered during web scraping. (e.g. automatically passing Cloudflare's "I'm human" challenge)

  • Support for Selenium Wire: Integrate Selenium Wire to allow for more advanced network interactions, such as modifying requests and responses.

Requirements

  • Python 3.6+
  • Selenium 4.10.0+

Contributing

Contributions are welcome! Please feel free to open an issue or submit a pull request if you encounter any problems or have any suggestions, improvements, or new features you would like to see implemented.

License

This project is licensed under the MIT License - see the LICENSE file for details.

Acknowledgments

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

undetected_geckodriver-1.0.5.tar.gz (10.3 kB view details)

Uploaded Source

Built Distribution

undetected_geckodriver-1.0.5-py3-none-any.whl (8.5 kB view details)

Uploaded Python 3

File details

Details for the file undetected_geckodriver-1.0.5.tar.gz.

File metadata

File hashes

Hashes for undetected_geckodriver-1.0.5.tar.gz
Algorithm Hash digest
SHA256 0124a9716f975bccbff2622abcae1737df5e86f063b21809ce55629d36e2e622
MD5 79014b9fb6f5616cd2a783cb3feb9fa6
BLAKE2b-256 49aa72b29e3767d1031fbd0fffe7baa412f50a37da8cbc1376a6f96d4b61c6a3

See more details on using hashes here.

File details

Details for the file undetected_geckodriver-1.0.5-py3-none-any.whl.

File metadata

File hashes

Hashes for undetected_geckodriver-1.0.5-py3-none-any.whl
Algorithm Hash digest
SHA256 3c513738a77849d22d0be8f0d733be09ab3adf3c82f4a6150eb8dc36f472e4b2
MD5 cc8418e102694aa43e71e5a8563d7a5d
BLAKE2b-256 cfe7f4cb2914a61ee4a2a9b79b43b535d9b219666650ece2183e99d6cdbcbebe

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page