A secure python library for fetching data with async, JS, and Tor support
Project description
PyGrab
PyGrab is a simple and performant Python library for making HTTP requests written in rust. It is designed to be fully interoperable with the popular requests
package in Python.
References
Features
- Simple: PyGrab's API is designed to be straightforward and easy to use. If you're familiar with
requests
, you'll feel right at home. - Flexible: Whether you need to make a simple GET request, post data to a server, or download a file, PyGrab has you covered.
- Performant: With its Rust-powered backend, PyGrab offers enhanced performance for CPU-bound tasks, providing faster data decompression, thread handling, and network handling.
- Asynchronous Support: PyGrab includes functions for making asynchronous HTTP requests, allowing you to efficiently grab data from multiple URLs at once.
- JavaScript Support: PyGrab can render JavaScript-enabled websites, allowing you to grab data from dynamic web pages.
- Interface with Tor Network: PyGrab includes built-in support for routing requests through the tor network.
- Automated IP Rotation: PyGrab includes built-in support for rotating connections to the Tor Network.
Limitations
- PyGrab is only supported on windows an linux currently.
Installation
You can install PyGrab using pip:
pip install pygrab
Usage
Here's a simple example of how to use PyGrab to make a GET request:
import pygrab
response = pygrab.get('http://example.com')
print(response.text)
In this example, response
is syntactically similar to the requests.Response
object. You can use it just like you would in requests
.
Here's an example of how to use PyGrab to make asynchronous GET requests through the Tor Network:
import pygrab
pygrab.Tor.start_tor()
urls = ['http://example.com', 'http://example.org', 'http://example.net']
responses = pygrab.get_batch(urls)
for response in responses.values():
print(response.text)
In this example, responses
is a dictionary of urls matched to their respective pygrab.HttpResponse
objects. Each response corresponds to the URL at the same index in the urls
list.
We can also use pygrab with python's async/await
syntax.
async def grab_data():
res = await pygrab.get_async('https://www.google.com')
return res
Contributing
Contributions are welcome. Please submit a pull request with any improvements.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file pygrab-3.1.4.tar.gz
.
File metadata
- Download URL: pygrab-3.1.4.tar.gz
- Upload date:
- Size: 13.1 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.12.2
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 795fad79ec4739d378b5977707dcad8d642e3528958401630b6fba1a8c78cb83 |
|
MD5 | ff2ebd921f3df6f2dd5607ffe153a6b7 |
|
BLAKE2b-256 | 430addd2e374d7217610169c48e0f6e99faf9c82553df7a9d1814fd1cc6fcdaf |
File details
Details for the file pygrab-3.1.4-py3-none-any.whl
.
File metadata
- Download URL: pygrab-3.1.4-py3-none-any.whl
- Upload date:
- Size: 14.1 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.12.2
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | c6c44db156a952b8670d6fac8a34cc51f241cd4c192cb47fb549c2c0d2c2999c |
|
MD5 | f754cc47ab37f1eb76592f9e98b76b1b |
|
BLAKE2b-256 | 87648123e6c9fcdbd2380b10c9412ef1dd4b83e11ad4c774cfdab214267ac46e |