A dark web link and image scraper
Project description
:dart: Usage :
Currently this is only designed to
- Scrape dark web for onion links
- Scrape images from dark web
Without tor browser
:sparkles: Unique Features :
- Instead of using TOR browser, Using tor proxy
- Randomized IP address for anonymity
- Spoofing user agent to avoid getting tracked
:wrench: Current Dependencies:
- Linux [ used debian based distro ]
:gear: Prerequisite :
Enable socks
- Update package lists
$ sudo apt update
- Install tor package
$ sudo apt install tor
- Start Tor service
$ sudo service tor start
- Verify installation status
$ sudo service tor status
:books: Tutorial :
Install using pip
$ pip install dark-web-scraper
1. Find onion urls from a dark web link
- Request :
find_onion_links( str ) - Response: links will be saved in
result.txt - Example :
# Main.py
from dark_web_scraper import find_onion_links
find_onion_links('http://random_url.onion')
2. Scrape images on a dark web link
- Request :
find_images_from_onion_link( str ) - Response: Images will be saved in
/static/images - Example :
# Main.py
from dark_web_scraper import find_images_from_onion_link
find_images_from_onion_link('http://random_url.onion')
:rocket: Features to be added :
-
Language Detection - Language translation
-
Onion link validator - Object detection in images
- Named entity recognition
- Search specific keywords in a list of urls
- Sentiment aAnalysis on the webpage contents
:warning: Disclaimer:
- I don't promote illegality.
- This project is just for educational purposes only
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distributions
No source distribution files available for this release.See tutorial on generating distribution archives.
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file dark_web_scraper-0.0.4-py3-none-any.whl.
File metadata
- Download URL: dark_web_scraper-0.0.4-py3-none-any.whl
- Upload date:
- Size: 4.6 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.10.2
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
2d4632cb6cd2ec57f371205b8823e18dc16b8d008cf4eafbc215ab54b849c2e5
|
|
| MD5 |
645e0ed41862e6a3fe13de79999bb54e
|
|
| BLAKE2b-256 |
570ecc49dd0e493c9899a801b552e5860db1702f6025784020633e896a1f0673
|