A dark web link and image scraper
Project description
:dart: Usage :
Currently this is only designed to
- Scrape dark web for onion links
- Scrape images from dark web
Without tor browser
:sparkles: Unique Features :
- Instead of using TOR browser, Using tor proxy
- Randomized IP address for anonymity
- Spoofing user agent to avoid getting tracked
:wrench: Current Dependencies:
- Linux [ used debian based distro ]
:gear: Prerequisite :
Enable socks
- Update package lists
$ sudo apt update
- Install tor package
$ sudo apt install tor
- Start Tor service
$ sudo service tor start
- Verify installation status
$ sudo service tor status
:books: Tutorial :
Install using pip
$ pip install dark-web-scraper
1. Find onion urls from a dark web link
- Request :
find_onion_links( str )
- Response: links will be saved in
result.txt
- Example :
# Main.py
from dark_web_scraper import find_onion_links
find_onion_links('http://random_url.onion')
2. Scrape images on a dark web link
- Request :
find_images_from_onion_link( str )
- Response: Images will be saved in
/static/images
- Example :
# Main.py
from dark_web_scraper import find_images_from_onion_link
find_images_from_onion_link('http://random_url.onion')
:rocket: Features to be added :
-
Language Detection - Language translation
-
Onion link validator - Object detection in images
- Named entity recognition
- Search specific keywords in a list of urls
- Sentiment aAnalysis on the webpage contents
:warning: Disclaimer:
- I don't promote illegality.
- This project is just for educational purposes only
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distributions
No source distribution files available for this release.See tutorial on generating distribution archives.
Built Distribution
Close
Hashes for dark_web_scraper-0.0.4-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 2d4632cb6cd2ec57f371205b8823e18dc16b8d008cf4eafbc215ab54b849c2e5 |
|
MD5 | 645e0ed41862e6a3fe13de79999bb54e |
|
BLAKE2b-256 | 570ecc49dd0e493c9899a801b552e5860db1702f6025784020633e896a1f0673 |