Skip to main content

A plugin driven package that scrapes sites for free proxies

Project description

FreeProxyScraper

This is a plugin driven web scraper meant to retrieve and test free proxies for use. Note that this package may be unstable and should not be used in a production environment.

Installation (This doesn't work yet)

Run the following to install:

pip install FreeProxyScraper

Usage

from FreeProxyScraper import ProxyQuery

pq = ProxyQuery()

for proxy in pq.find(limit=20):
    print(proxy)

List of sites implemented for scraping:

FAQ

  • When will this be published?

I'm not sure. This is mainly a project for personal use and demonstration of skill that I like to work on in my free time. If you have a particular interest in seeing this published, feel free to help development or

  • Why implement so many websites for scraping?

Websites are always changing, or going down, or banning ip's very quickly. In order to make sure this package stays reliable it is essential that it implements many websites

Development

to install FreeProxyScraper, along with the tools you need to develop, run the following in the directory containing this repo:

pip install -e .[dev]

If you'd like to contribute to development, right now the most needed thing is writing more plugins. In order to help, you need basic knowledge of BeautifulSoup4 and a little bit of patience with websites purposely making it hard for you to scrape information. Check out src/plugins/examplePlugin.py to see an example layout of a plugin file.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

FreeProxyScraper-0.0.2.tar.gz (3.5 kB view hashes)

Uploaded Source

Built Distribution

FreeProxyScraper-0.0.2-py3-none-any.whl (3.7 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page