WebScraping.AI

These details have not been verified by PyPI

Project links

Project description

webscraping-ai

WebScraping.AI scraping API provides LLM-powered tools with Chromium JavaScript rendering, rotating proxies, and built-in HTML parsing.

This Python package is automatically generated by the OpenAPI Generator project:

API version: 3.2.1
Package version: 3.2.1
Generator version: 7.22.0
Build package: org.openapitools.codegen.languages.PythonClientCodegen For more information, please visit https://webscraping.ai

Requirements.

Python 3.10+

Installation & Usage

pip install

If the python package is hosted on a repository, you can install directly using:

pip install git+https://github.com/webscraping-ai/webscraping-ai-python.git

(you may need to run pip with root permission: sudo pip install git+https://github.com/webscraping-ai/webscraping-ai-python.git)

Then import the package:

import webscraping_ai

Setuptools

Install via Setuptools.

python setup.py install --user

(or sudo python setup.py install to install the package for all users)

Then import the package:

import webscraping_ai

Tests

Execute pytest to run the tests.

Getting Started

Please follow the installation procedure and then run the following:

import webscraping_ai
from webscraping_ai.rest import ApiException
from pprint import pprint

# Defining the host is optional and defaults to https://api.webscraping.ai
# See configuration.py for a list of all supported configuration parameters.
configuration = webscraping_ai.Configuration(
    host = "https://api.webscraping.ai"
)

# The client must configure the authentication and authorization parameters
# in accordance with the API server security policy.
# Examples for each auth method are provided below, use the example that
# satisfies your auth use case.

# Configure API key authorization: api_key
configuration.api_key['api_key'] = os.environ["API_KEY"]

# Uncomment below to setup prefix (e.g. Bearer) for API key, if needed
# configuration.api_key_prefix['api_key'] = 'Bearer'


# Enter a context with an instance of the API client
with webscraping_ai.ApiClient(configuration) as api_client:
    # Create an instance of the API class
    api_instance = webscraping_ai.AIApi(api_client)
    url = 'https://example.com' # str | URL of the target page.
    fields = {'key': '{\"title\":\"Main product title\",\"price\":\"Current product price\",\"description\":\"Full product description\"}'} # Dict[str, str] | Object describing fields to extract from the page and their descriptions
    headers = {'key': '{\"Cookie\":\"session=some_id\"}'} # Dict[str, str] | HTTP headers to pass to the target page. Can be specified either via a nested query parameter (...&headers[One]=value1&headers=[Another]=value2) or as a JSON encoded object (...&headers={\"One\": \"value1\", \"Another\": \"value2\"}). (optional)
    timeout = 10000 # int | Maximum web page retrieval time in ms. Increase it in case of timeout errors (10000 by default, maximum is 30000). (optional) (default to 10000)
    js = True # bool | Execute on-page JavaScript using a headless browser (true by default). (optional) (default to True)
    js_timeout = 2000 # int | Maximum JavaScript rendering time in ms. Increase it in case if you see a loading indicator instead of data on the target page. (optional) (default to 2000)
    wait_for = 'wait_for_example' # str | CSS selector to wait for before returning the page content. Useful for pages with dynamic content loading. Overrides js_timeout. (optional)
    proxy = datacenter # str | Type of proxy. Use `residential` if your site restricts traffic from datacenters, or `stealth` for the most heavily protected sites with advanced anti-bot detection (`datacenter` by default). Residential and stealth proxy requests are more expensive than datacenter, see the pricing page for details. (optional) (default to datacenter)
    country = us # str | Country of the proxy to use (US by default). (optional) (default to us)
    custom_proxy = 'custom_proxy_example' # str | Your own proxy URL to use instead of our built-in proxy pool in \"http://user:password@host:port\" format (<a target=\"_blank\" href=\"https://webscraping.ai/proxies/smartproxy\">Smartproxy</a> for example). (optional)
    device = desktop # str | Type of device emulation. (optional) (default to desktop)
    error_on_404 = False # bool | Return error on 404 HTTP status on the target page (false by default). (optional) (default to False)
    error_on_redirect = False # bool | Return error on redirect on the target page (false by default). (optional) (default to False)
    js_script = 'document.querySelector(\'button\').click();' # str | Custom JavaScript code to execute on the target page. (optional)

    try:
        # Extract structured data fields from a web page
        api_response = api_instance.get_fields(url, fields, headers=headers, timeout=timeout, js=js, js_timeout=js_timeout, wait_for=wait_for, proxy=proxy, country=country, custom_proxy=custom_proxy, device=device, error_on_404=error_on_404, error_on_redirect=error_on_redirect, js_script=js_script)
        print("The response of AIApi->get_fields:\n")
        pprint(api_response)
    except ApiException as e:
        print("Exception when calling AIApi->get_fields: %s\n" % e)

Documentation for API Endpoints

All URIs are relative to https://api.webscraping.ai

Class	Method	HTTP request	Description
AIApi	get_fields	GET /ai/fields	Extract structured data fields from a web page
AIApi	get_question	GET /ai/question	Get an answer to a question about a given web page
AccountApi	account	GET /account	Information about your account calls quota
HTMLApi	get_html	GET /html	Page HTML by URL
SelectedHTMLApi	get_selected	GET /selected	HTML of a selected page area by URL and CSS selector
SelectedHTMLApi	get_selected_multiple	GET /selected-multiple	HTML of multiple page areas by URL and CSS selectors
TextApi	get_text	GET /text	Page text by URL

Documentation For Models

Documentation For Authorization

Authentication schemes defined for the API:

api_key

Type: API key
API key parameter name: api_key
Location: URL query string

Author

support@webscraping.ai

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

4.0.0

May 12, 2026

This version

3.2.1

May 12, 2026

3.2.0

Feb 9, 2025

3.1.3

Jan 4, 2024

3.1.2

Jan 3, 2024

2.0.2

Sep 24, 2020

2.0.1

Sep 23, 2020

2.0.0

Sep 20, 2020

1.0.0

Mar 27, 2020

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

webscraping_ai-3.2.1.tar.gz (34.4 kB view details)

Uploaded May 12, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

webscraping_ai-3.2.1-py3-none-any.whl (42.9 kB view details)

Uploaded May 12, 2026 Python 3

File details

Details for the file webscraping_ai-3.2.1.tar.gz.

File metadata

Download URL: webscraping_ai-3.2.1.tar.gz
Upload date: May 12, 2026
Size: 34.4 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/4.0.2 CPython/3.11.2

File hashes

Hashes for webscraping_ai-3.2.1.tar.gz
Algorithm	Hash digest
SHA256	`2c2f50ee7cf3b26063018ae0a802f9f254b9e834d520a4c982431d934aa40812`
MD5	`763a9216954adf164429ae63843d7dc1`
BLAKE2b-256	`fb71ebc2f85c745fe0dc1c317a2a8a43738377a6c66e6c115ed2e1ff2163dcb0`

See more details on using hashes here.

File details

Details for the file webscraping_ai-3.2.1-py3-none-any.whl.

File metadata

Download URL: webscraping_ai-3.2.1-py3-none-any.whl
Upload date: May 12, 2026
Size: 42.9 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/4.0.2 CPython/3.11.2

File hashes

Hashes for webscraping_ai-3.2.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`538fd3017449d6052b432691ee8015eea3b2282630710b26a43cfbf34ebeda0b`
MD5	`0abf3155c4d0dcbae187f105afd9221c`
BLAKE2b-256	`76e52dc27dd51392712043d314a23dbfae17e9b5005704c5a8323cfbbba38e32`

See more details on using hashes here.

webscraping-ai 3.2.1

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Project description

webscraping-ai

Requirements.

Installation & Usage

pip install

Setuptools

Tests

Getting Started

Documentation for API Endpoints

Documentation For Models

Documentation For Authorization

api_key

Author

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes