libcurl ffi bindings for Python, with impersonation support
Project description
curl_cffi
Python binding for curl-impersonate via cffi.
Unlike other pure python http clients like httpx
or requests
, curl_cffi
can
impersonate browsers' TLS signatures or JA3 fingerprints. If you are blocked by some
website for no obvious reason, you can give this package a try.
Features
- Supports JA3/TLS and http2 fingerprints impersonation.
- Much faster than requests/httpx/tls_client, on par with aiohttp/pycurl, see benchmarks.
- Mimics requests API, no need to learn another one.
- Pre-compiled, so you don't have to compile on your machine.
- Supports
asyncio
with proxy rotation on each request. - Supports http 2.0, which requests does not.
Install
pip install curl_cffi --upgrade
This should work on Linux(x86_64/aarch64), macOS(Intel/Apple Silicon) and Windows(amd64).
If it does not work on you platform, you may need to compile and install curl-impersonate
first and set some environment variables like LD_LIBRARY_PATH
.
Usage
requests-like
from curl_cffi import requests
# Notice the impersonate parameter
r = requests.get("https://tls.browserleaks.com/json", impersonate="chrome110")
print(r.json())
# output: {..., "ja3n_hash": "aa56c057ad164ec4fdcb7a5a283be9fc", ...}
# the js3n fingerprint should be the same as target browser
# http/socks proxies are supported
proxies = {"https": "http://localhost:3128"}
r = requests.get("https://tls.browserleaks.com/json", impersonate="chrome110", proxies=proxies)
proxies = {"https": "socks://localhost:3128"}
r = requests.get("https://tls.browserleaks.com/json", impersonate="chrome110", proxies=proxies)
Sessions
# sessions are supported
s = requests.Session()
# httpbin is a http test website
s.get("https://httpbin.org/cookies/set/foo/bar")
print(s.cookies)
# <Cookies[<Cookie foo=bar for httpbin.org />]>
r = s.get("https://httpbin.org/cookies")
print(r.json())
# {'cookies': {'foo': 'bar'}}
Supported impersonate versions, as supported by curl-impersonate:
- chrome99
- chrome100
- chrome101
- chrome104
- chrome107
- chrome110
- chrome99_android
- edge99
- edge101
- safari15_3
- safari15_5
asyncio
from curl_cffi.requests import AsyncSession
async with AsyncSession() as s:
r = await s.get("https://example.com")
More concurrency:
import asyncio
from curl_cffi.requests import AsyncSession
urls = [
"https://googel.com/",
"https://facebook.com/",
"https://twitter.com/",
]
async with AsyncSession() as s:
tasks = []
for url in urls:
task = s.get("https://example.com")
tasks.append(task)
results = await asyncio.gather(*tasks)
curl-like
Alternatively, you can use the low-level curl-like API:
from curl_cffi import Curl, CurlOpt
from io import BytesIO
buffer = BytesIO()
c = Curl()
c.setopt(CurlOpt.URL, b'https://tls.browserleaks.com/json')
c.setopt(CurlOpt.WRITEDATA, buffer)
c.impersonate("chrome110")
c.perform()
c.close()
body = buffer.getvalue()
print(body.decode())
See the docs for more details.
Acknowledgement
- Originally forked from multippt/python_curl_cffi, which is under the MIT license.
- Headers/Cookies files are copied from httpx, which is under the BSD license.
- Asyncio support is inspired by Tornado's curl http client.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distributions
Hashes for curl_cffi-0.5.9b1-cp37-abi3-win_amd64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 89db264e6a9dc031f6d0ed6e65f52f9f0697164f9a75325680d602134bf25e18 |
|
MD5 | 8926c41287048f7a13f73fba0a5915b8 |
|
BLAKE2b-256 | 7148a61590e3fe8fb9a5c802417297514712005717e630114abaf43a842ccc34 |
Hashes for curl_cffi-0.5.9b1-cp37-abi3-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 7a67f6cc8c2c18f5b414466f3f41cd4de95f847122a83507050282db8b9c4e91 |
|
MD5 | 143b99b53d95917e17e0e8e12e028985 |
|
BLAKE2b-256 | 3eace43c18385538d4b02a32a3d106542e6f48a15fa310b3e467aedf06a91161 |
Hashes for curl_cffi-0.5.9b1-cp37-abi3-manylinux_2_17_aarch64.manylinux2014_aarch64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | f76c5ef35227c0a9da74b1bd239c2992b3f6ff3e4f45756d50faf772990d6d13 |
|
MD5 | 81576cff4f68ee2b301de888db51b82b |
|
BLAKE2b-256 | 5cac413a5779a41a889f6d3ea5b92bfdbe5e7f3d9ecc7f0bb94977a28d3f7e69 |
Hashes for curl_cffi-0.5.9b1-cp37-abi3-macosx_11_0_arm64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | b1e9c8d3d910f08b8871d0410df0ee24eb94bcc8edaa620230c78ad7fc5d511e |
|
MD5 | 66f537ffb58a50140799f8ca7a2286d1 |
|
BLAKE2b-256 | 7b7504eb126cb7092fc99f9bf417f0a051af5e0137ade119b2203a14f963c91f |
Hashes for curl_cffi-0.5.9b1-cp37-abi3-macosx_10_9_x86_64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 3c798eafc6603127e485fce3e5a20eb967ca863c4a5ab3866ec0c2e799cea368 |
|
MD5 | 9195365a7b869dd419b7c8a8f00bdb59 |
|
BLAKE2b-256 | 89b4b86e23eb73740cfad811d75e4f4bd2b501fe6f71d387571f506eb0a31718 |