
Python SDK for Spider Cloud API

Project description

Spider Cloud Python SDK

The Spider Cloud Python SDK offers a toolkit for straightforward website scraping, crawling at scale, and other utilities such as extracting links and taking screenshots, enabling you to collect data formatted for use with large language models (LLMs). It features a user-friendly interface for seamless integration with the Spider Cloud API.

Installation

To install the Spider Cloud Python SDK, you can use pip:

pip install spider_client

Usage

  1. Get an API key from spider.cloud
  2. Set the API key as an environment variable named SPIDER_API_KEY or pass it as a parameter to the Spider class (both options are shown below).

Here's an example of how to use the SDK:

from spider import Spider

# Initialize the Spider with your API key
app = Spider(api_key='your_api_key')

# Scrape a single URL
url = 'https://spider.cloud'
scraped_data = app.scrape_url(url)

# Crawl a website
crawler_params = {
    'limit': 1,
    'proxy_enabled': True,
    'metadata': False,
    'request': 'http'
}
crawl_result = app.crawl_url(url, params=crawler_params)
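
As noted in step 2 above, the API key can also come from the environment rather than the constructor argument. A minimal sketch, assuming SPIDER_API_KEY is set before the client is created:

import os

os.environ['SPIDER_API_KEY'] = 'your_api_key'  # usually exported in your shell instead

# No api_key argument needed; the client reads SPIDER_API_KEY from the environment
app = Spider()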

Scraping a URL

To scrape data from a single URL:

url = 'https://example.com'
scraped_data = app.scrape_url(url)
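
The sketch below assumes scrape_url accepts the same params keyword used by crawl_url, reusing keys shown in other sections of this README ('return_format', 'metadata'):

url = 'https://example.com'
scrape_params = {
    'return_format': 'markdown',  # return LLM-friendly markdown instead of raw HTML
    'metadata': True,             # include page metadata in the response
}
scraped_data = app.scrape_url(url, params=scrape_params)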

Crawling a Website

To automate crawling a website:

url = 'https://example.com'
crawl_params = {
    'limit': 200,
    'request': 'smart_mode'
}
crawl_result = app.crawl_url(url, params=crawl_params)

Crawl Streaming

Stream the crawl in chunks to scale to larger sites:

def handle_json(json_obj: dict) -> None:
    assert json_obj["url"] is not None

url = 'https://example.com'
crawl_params = {
    'limit': 200,
}
response = app.crawl_url(
    url,
    params=crawl_params,
    stream=True,
    callback=handle_json,
)
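
Because the callback receives one page object at a time, it can also accumulate or persist results incrementally. A small sketch building on the handler above:

pages = []

def collect_page(json_obj: dict) -> None:
    # Append each streamed page object as it arrives
    pages.append(json_obj)

app.crawl_url(url, params=crawl_params, stream=True, callback=collect_page)
print(f"Received {len(pages)} pages")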

Search

Perform a search for websites to crawl or gather search results:

query = 'a sports website'
crawl_params = {
    'request': 'smart_mode',
    'search_limit': 5,
    'limit': 5,
    'fetch_page_content': True
}
crawl_result = app.search(query, params=crawl_params)
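
With 'fetch_page_content' enabled, a sketch for walking the results, assuming the call returns a list of page objects with a 'url' field like the streaming example:

for item in crawl_result:
    print(item['url'])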

Retrieving Links from a URL

Extract all links from a specified URL:

url = 'https://example.com'
links = app.links(url)

Transform

Transform HTML to markdown or text lightning fast:

data = [ { 'html': '<html><body><h1>Hello world</h1></body></html>' } ]
params = {
    'readability': False,
    'return_format': 'markdown',
}
result = app.transform(data, params=params)
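
The section above also mentions plain-text output; a sketch assuming 'text' is accepted as a 'return_format' value:

text_params = {
    'readability': False,
    'return_format': 'text',  # plain text instead of markdown (assumed value)
}
text_result = app.transform(data, params=text_params)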

Taking Screenshots of a URL

Capture a screenshot of a given URL:

url = 'https://example.com'
screenshot = app.screenshot(url)

Checking Available Credits

You can check the remaining credits on your account:

credits = app.get_credits()

Unblocker

Access blocked or protected content with anti-bot bypass:

url = 'https://protected-site.com'
result = app.unblocker(url)

Unblocker with AI Extraction

Unblock and extract structured data using AI:

url = 'https://protected-site.com/products'
result = app.unblocker(url, params={
    'custom_prompt': 'Extract all product names and prices as JSON'
})
# Extracted data is available in result[0]['metadata']['extracted_data']
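
To read the extracted payload from the first result, following the path in the comment above:

extracted = result[0]['metadata']['extracted_data']
print(extracted)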

Unblocker with JSON Schema Extraction

Use JSON Schema for structured, validated extraction output:

url = 'https://protected-site.com/products'
result = app.unblocker(url, params={
    'extraction_schema': {
        'name': 'products',
        'description': 'Product listing extraction',
        'schema': '''{
            "type": "object",
            "properties": {
                "products": {
                    "type": "array",
                    "items": {
                        "type": "object",
                        "properties": {
                            "name": {"type": "string"},
                            "price": {"type": "number"}
                        },
                        "required": ["name", "price"]
                    }
                }
            }
        }''',
        'strict': True
    }
})
# Extracted data conforms to the schema in result[0]['metadata']['extracted_data']
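
A sketch that walks the schema-shaped output, assuming 'extracted_data' comes back as a parsed dict matching the schema above:

extracted = result[0]['metadata']['extracted_data']
for product in extracted.get('products', []):
    print(product['name'], product['price'])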

AI Studio Methods

AI Studio methods require an active AI Studio subscription. See spider.cloud/ai/pricing for plans.

AI Crawl

AI-guided crawling using natural language prompts:

result = app.ai_crawl(
    url='https://example.com',
    prompt='Find all blog posts and extract titles and summaries'
)

AI Scrape

AI-guided scraping using natural language prompts:

result = app.ai_scrape(
    url='https://example.com/products',
    prompt='Extract all product names, prices, and descriptions'
)

AI Search

AI-enhanced web search using natural language:

result = app.ai_search(prompt='Find the best Python web scraping libraries')

AI Browser

AI-guided browser automation:

result = app.ai_browser(
    url='https://example.com/login',
    prompt='Click the sign in button and fill the email field with test@example.com'
)

AI Links

AI-guided link extraction and filtering:

result = app.ai_links(
    url='https://example.com',
    prompt='Find all links to product pages and documentation'
)

Streaming

If you need to stream the request, use the third parameter:

url = 'https://example.com'

crawler_params = {
    'limit': 1,
    'proxy_enabled': True,
    'metadata': False,
    'request': 'http'
}

links = app.links(url, crawler_params, True)

Content-Type

The following Content-Type headers are supported via the fourth parameter:

  1. application/json
  2. text/csv
  3. application/xml
  4. application/jsonl

url = 'https://example.com'

crawler_params = {
    'limit': 1,
    'proxy_enabled': True,
    'metadata': False,
    'request': 'http'
}

# stream json lines back to the client
crawl_result = app.crawl_url(url, crawler_params, True, "application/jsonl")
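
The other listed content types work the same way; for example, to stream CSV back instead:

# stream csv rows back to the client
crawl_result = app.crawl_url(url, crawler_params, True, "text/csv")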

Error Handling

The SDK handles errors returned by the Spider Cloud API and raises appropriate exceptions. If an error occurs during a request, an exception will be raised with a descriptive error message.
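
A minimal pattern for catching these errors; the broad Exception handler below is an assumption, since the README does not name specific exception classes:

try:
    scraped_data = app.scrape_url('https://example.com')
except Exception as exc:
    # The SDK raises with a descriptive message when the API returns an error
    print(f"Spider request failed: {exc}")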

Contributing

Contributions to the Spider Cloud Python SDK are welcome! If you find any issues or have suggestions for improvements, please open an issue or submit a pull request on the GitHub repository.

License

The Spider Cloud Python SDK is open-source and released under the MIT License.


Download files

Download the file for your platform.

Source Distribution

spider_client-0.1.88.tar.gz (19.0 kB)

Uploaded Source

Built Distribution


spider_client-0.1.88-py3-none-any.whl (16.8 kB)

Uploaded Python 3

File details

Details for the file spider_client-0.1.88.tar.gz.

File metadata

  • Download URL: spider_client-0.1.88.tar.gz
  • Upload date:
  • Size: 19.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.11.5

File hashes

Hashes for spider_client-0.1.88.tar.gz

  • SHA256: bd3246b6e4f68631936d15da997a479cd9a58f0503a35e6565b4c2e2b6d5bad0
  • MD5: 9f996046247012f6d26055409331be96
  • BLAKE2b-256: 0df62f613cff7f57f17a2f33651550b61bcddb189e29a5865522af84c444b7a6


File details

Details for the file spider_client-0.1.88-py3-none-any.whl.

File metadata

  • Download URL: spider_client-0.1.88-py3-none-any.whl
  • Upload date:
  • Size: 16.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.11.5

File hashes

Hashes for spider_client-0.1.88-py3-none-any.whl

  • SHA256: 5f72acfc979cf45223c4fec3a099ffaab28921dc1867abc965aeb62582768be5
  • MD5: 28aee149e4e3494f2503ab44bb23f5a4
  • BLAKE2b-256: 890f76a88ab646d57e64079830c73a183d55b030ba5b334276850837998ceb9f

