Skip to main content

Add your description here

Project description

FreePornVideos.xxx Scraper

A Python tool for scraping video URLs from FreePornVideos.xxx with anti-bot protection using Camoufox.

Features

  • Uses Camoufox for anti-fingerprinting and bot detection avoidance
  • Modular architecture for easy maintenance and extension
  • Typer-based CLI with rich error handling and progress indicators
  • Extracts direct video URLs (preferring 720p quality when available)
  • Follows redirects to get the final video URL
  • Saves results to an output file
  • Tracks and logs failed URLs

Installation

This project uses Rye for dependency management.

# Clone the repository
git clone https://github.com/yourusername/freepornvideos-scraper.git
cd freepornvideos-scraper

# Install dependencies with Rye
rye sync

Usage

Command Line Interface

The script provides a Typer-based CLI with various options:

# Basic usage (will prompt for input file)
python -m freepornvideos_scraper

# Specify input and output files
python -m freepornvideos_scraper --input urls.txt --output results.txt

# Short form options
python -m freepornvideos_scraper -i urls.txt -o results.txt

# Run in headless mode (no browser UI)
python -m freepornvideos_scraper -i urls.txt --headless

# Route traffic through Tor (requires Tor running on port 9050)
python -m freepornvideos_scraper -i urls.txt --tor

Input File Format

Create a text file with one FreePornVideos.xxx URL per line:

https://www.freepornvideos.xxx/videos/xxxxx/video-title-1/
https://www.freepornvideos.xxx/videos/yyyyy/video-title-2/
https://www.freepornvideos.xxx/videos/zzzzz/video-title-3/

How It Works

The script will:

  1. Load the URLs from the input file
  2. Visit each URL with Camoufox browser
  3. Click the play button on each page to load the video
  4. Extract the direct video URL (preferring 720p quality)
  5. Follow redirects to get the final video URL
  6. Save the results to the specified output file
  7. Create a separate file for any failed URLs

Project Structure

The project has a modular architecture:

src/freepornvideos_scraper/
├── __init__.py
├── __main__.py          # Package entry point
├── scraper.py           # Main Typer CLI implementation
├── test_import.py       # Test utility
└── lib/                 # Core components
    ├── __init__.py
    ├── base.py          # Abstract base scraper class
    ├── scrapers/        # Site-specific scrapers
    │   ├── __init__.py
    │   └── freevideos.py # FreePornVideos implementation
    └── utils/           # Utility classes
        ├── __init__.py
        ├── camoufox_handler.py # Camoufox browser wrapper
        └── spoofing_config.py  # Anti-fingerprinting settings

Requirements

  • Python 3.8+
  • camoufox (for browser automation with anti-fingerprinting)
  • prompt-toolkit (for path completion in the CLI)
  • requests (for following redirects)
  • typer (for CLI interface)

License

GPL-3.0

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

camoufox_scraper-0.2.0.tar.gz (22.8 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

camoufox_scraper-0.2.0-py3-none-any.whl (24.7 kB view details)

Uploaded Python 3

File details

Details for the file camoufox_scraper-0.2.0.tar.gz.

File metadata

  • Download URL: camoufox_scraper-0.2.0.tar.gz
  • Upload date:
  • Size: 22.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.12.8

File hashes

Hashes for camoufox_scraper-0.2.0.tar.gz
Algorithm Hash digest
SHA256 baf48d9629a50022c2f4c0cd6928f64327469d37f80b08e688bfe6da8e3ebfe7
MD5 5821d72f422ef0a7385b547e83c7efe2
BLAKE2b-256 5dfe836fba1cdfbcfaffd21559765cc665467ebb5e65b14bedaff896d661b464

See more details on using hashes here.

File details

Details for the file camoufox_scraper-0.2.0-py3-none-any.whl.

File metadata

File hashes

Hashes for camoufox_scraper-0.2.0-py3-none-any.whl
Algorithm Hash digest
SHA256 6463e78e3703061a61ad344afc5a3ccd61ebae13b8d3edf052cecb3b5f63e3fa
MD5 2f85f15a6f91c34b8d68c19aeae1ca35
BLAKE2b-256 ae1bfdd4176786a4d079f741bcc4edbaacef5e1494ff05ed5402ffbefba9382e

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page