Skip to main content

High level, performant HTTP library with out-of-the-box support for concurrency and Tor

Project description

PyGrab

PyGrab is a simple and performant Python library for making HTTP requests written in rust. It is designed to be fully interoperable with the popular requests package in Python.

References

Features

  • Simple: PyGrab's API is designed to be straightforward and easy to use. If you're familiar with requests, you'll feel right at home.
  • Flexible: Whether you need to make a simple GET request, post data to a server, or download a file, PyGrab has you covered.
  • Performant: With its Rust-powered backend, PyGrab offers enhanced performance for CPU-bound tasks, providing faster data decompression, thread handling, and network handling.
  • Asynchronous Support: PyGrab includes functions for making asynchronous HTTP requests, allowing you to efficiently grab data from multiple URLs at once.
  • JavaScript Support: PyGrab can render JavaScript-enabled websites, allowing you to grab data from dynamic web pages.
  • Interface with Tor Network: PyGrab includes built-in support for routing requests through the tor network.
  • Automated IP Rotation: PyGrab includes built-in support for rotating connections to the Tor Network.

Limitations

  • PyGrab is only supported on windows an linux currently.

Installation

You can install PyGrab using pip:

pip install pygrab

Usage

Here's a simple example of how to use PyGrab to make a GET request:

import pygrab

response = pygrab.get('http://example.com')
print(response.text)

In this example, response is syntactically similar to the requests.Response object. You can use it just like you would in requests.

Here's an example of how to use PyGrab to make asynchronous GET requests through the Tor Network:

import pygrab

pygrab.Tor.start_tor()

urls = ['http://example.com', 'http://example.org', 'http://example.net']
responses = pygrab.get_batch(urls)

for response in responses.values():
    print(response.text)

In this example, responses is a dictionary of urls matched to their respective pygrab.HttpResponse objects. Each response corresponds to the URL at the same index in the urls list.

We can also use pygrab with python's async/await syntax.

async def grab_data():
    res = await pygrab.get_async('https://www.google.com')
    return res

Contributing

Contributions are welcome. Please submit a pull request with any improvements.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pygrab-3.1.6.tar.gz (12.4 kB view details)

Uploaded Source

Built Distribution

pygrab-3.1.6-py3-none-any.whl (14.1 kB view details)

Uploaded Python 3

File details

Details for the file pygrab-3.1.6.tar.gz.

File metadata

  • Download URL: pygrab-3.1.6.tar.gz
  • Upload date:
  • Size: 12.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.9.20

File hashes

Hashes for pygrab-3.1.6.tar.gz
Algorithm Hash digest
SHA256 ed1857f99e8b92a5fa7d00233bc46169f12db1d42601950045917dfa1bcf1dd7
MD5 af02dd2a14ffdfeccfd2cdc1e915bf11
BLAKE2b-256 231101a2168570d62eaa0c303329935fe5f171d980cac397f7b3e4ac43518b77

See more details on using hashes here.

File details

Details for the file pygrab-3.1.6-py3-none-any.whl.

File metadata

  • Download URL: pygrab-3.1.6-py3-none-any.whl
  • Upload date:
  • Size: 14.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.9.20

File hashes

Hashes for pygrab-3.1.6-py3-none-any.whl
Algorithm Hash digest
SHA256 50e042d70826afb2bc2895e5afe2e270ef9163ed3e783e55de03f2ec162d5aa3
MD5 f0c7369fde99c44f4feb9df1a97fd66d
BLAKE2b-256 ff02b699d548cfd3c1ac5fbff9f8c524bedbed3f1f2308b7d8fbe8c78cfc1954

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page