API for the internet

License
- OSI Approved :: MIT License
Operating System
- OS Independent
Programming Language
- Python :: 3

Project description

Desync Search — "API to the Internet"

Motto: The easiest way to scrape and retrieve web data without aggressive rate limits or heavy detection.

Key Features

No Rate Limiting: We allow you to scale concurrency without punishing usage. You can open many parallel searches; we’ll only throttle if the underlying cloud providers themselves are saturated.
Extremely Low Detection Rates: Our “stealth_search” uses advanced methods to make each page visit look human. We cannot guarantee 100% evasion, but most requests go undetected, and when CAPTCHAs do appear, a second pass often gets around them.
Competitive, Pay-as-You-Go Pricing: No forced subscriptions or large monthly minimums. You choose how much you spend. Our per-search cost is typically half of what large competitors charge, and those competitors often require $1,000+ per month.
First 1,000 Searches Free: Not convinced? Try it yourself, risk-free. We’ll spot you 1,000 searches when you sign up. Check out desync.ai for more info.

Installation

Install via PyPI using:

pip install desync_search

This library requires Python 3.6+ and the requests package (installed automatically).

Basic Usage

You’ll need a user API key (like "totallynotarealapikeywithactualcreditsonit").
A best practice is to store that key in an environment variable (e.g. DESYNC_API_KEY) to avoid embedding secrets in code:

export DESYNC_API_KEY="YOUR_ACTUAL_KEY"

Then in your Python code:

import os
from desync_search.core import DesyncClient

user_api_key = os.environ.get("DESYNC_API_KEY", "")
client = DesyncClient(user_api_key)
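
If the environment variable is unset, the snippet above silently passes an empty string to DesyncClient. A minimal sketch of a fail-fast alternative follows; load_api_key is a hypothetical helper written for this example and is not part of desync_search.

```python
import os


def load_api_key(var_name="DESYNC_API_KEY"):
    """Return the API key stored in the given environment variable.

    Hypothetical helper (not part of desync_search): raises early instead of
    handing an empty key to the client.
    """
    key = os.environ.get(var_name, "").strip()
    if not key:
        raise RuntimeError(f"Set {var_name} before creating the client")
    return key


# Intended use, following the snippet above:
#   client = DesyncClient(load_api_key())
```

Raising at startup keeps a missing key from showing up later as a confusing request failure.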

Here, the client automatically targets our production endpoint:

https://nycv5sx75joaxnzdkgvpx5mcme0butbo.lambda-url.us-east-1.on.aws/

Searching for Data

1) Performing a Search

By default, search(...) does a stealth search (cost: 10 credits). If you want a test search (cost: 1 credit), pass search_type="test_search".

# Stealth Search (default)
page_data = client.search("https://www.137ventures.com/portfolio")

print("URL:", page_data.url)
print("Text length:", len(page_data.text_content))

# Test Search
test_response = client.search(
    "https://www.python.org", 
    search_type="test_search"
)
print("Test search type:", test_response.search_type)

Both calls return a PageData object when the request succeeds (success=True). For a stealth search, you’ll typically see fields like .text_content, .internal_links, and .external_links. Example:

print(page_data)
# <PageData url=https://www.137ventures.com/portfolio search_type=stealth_search timestamp=... complete=True>

print(page_data.text_content[:200])  # first 200 chars of text
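
As a rough illustration of working with the link fields, the sketch below counts external links per host. It assumes .external_links is a list of absolute URL strings, which this README does not confirm; count_link_domains is a hypothetical helper, and a stand-in object replaces a real PageData so the example runs without spending credits.

```python
from collections import Counter
from types import SimpleNamespace
from urllib.parse import urlparse


def count_link_domains(links):
    """Count how many links point at each host name."""
    hosts = (urlparse(link).netloc.lower() for link in links)
    # Relative links have no host and are skipped
    return Counter(host for host in hosts if host)


# Stand-in for a real PageData (assumed shape: a list of URL strings)
page_data = SimpleNamespace(
    external_links=[
        "https://example.com/a",
        "https://example.com/b",
        "https://docs.python.org/3/",
    ]
)

for host, count in count_link_domains(page_data.external_links).most_common():
    print(host, count)
```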

You can pass scrape_full_html=True to get the entire HTML, or remove_link_duplicates=False to keep duplicates:

stealth_response = client.search(
    "https://www.137ventures.com/portfolio",
    scrape_full_html=True,
    remove_link_duplicates=False
)
print(len(stealth_response.html_content), "HTML chars")
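
Key Features states that parallel searches are not throttled per user. One way to take advantage of that is a thread pool, sketched below. search_many is a hypothetical helper, and the sketch assumes client.search can safely be called from several threads at once, which this README does not state.

```python
from concurrent.futures import ThreadPoolExecutor, as_completed


def search_many(search_fn, urls, max_workers=8):
    """Run search_fn(url) for every URL in parallel.

    Returns (results, errors): two dicts keyed by URL.
    """
    results, errors = {}, {}
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        futures = {pool.submit(search_fn, url): url for url in urls}
        for future in as_completed(futures):
            url = futures[future]
            try:
                results[url] = future.result()
            except Exception as exc:  # keep going if one URL fails
                errors[url] = exc
    return results, errors


# Intended use (assumes client.search is thread-safe):
#   pages, failures = search_many(
#       client.search,
#       ["https://www.python.org", "https://example.org"],
#   )
```

Collecting failures separately means one bad URL does not discard the pages that were fetched successfully. Each search still costs credits, so max_workers affects speed, not cost.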

Example: Visit All Internal Links

A simple spider approach: after you search a page, you gather internal links, check which ones you haven’t visited, and recursively fetch them. Example pseudo-code:

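visited = set()

def crawl(client, url, max_pages=50):
    """Recursively fetch internal links, stopping after max_pages pages."""
    # Skip repeat URLs, and stop once the page budget is spent
    # (every search costs credits)
    if url in visited or len(visited) >= max_pages:
        return
    visited.add(url)

    page_data = client.search(url)  # stealth by default
    print("Scraped:", url, "Found", len(page_data.internal_links), "internal links")

    # For each new internal link, crawl again
    for link in page_data.internal_links:
        if link not in visited:
            crawl(client, link, max_pages)

# Start from a seed URL
crawl(client, "https://www.137ventures.com/portfolio")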

Note: Keep an eye on your credit usage if you do large-scale crawling.
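
An iterative, breadth-first variant of the crawl above is sketched here. It avoids deep recursion and takes an explicit cap on the number of pages fetched. crawl_breadth_first and its fetch_links parameter are hypothetical names for this example; fetch_links is any callable that returns the internal links for a URL.

```python
from collections import deque


def crawl_breadth_first(fetch_links, seed_url, max_pages=25):
    """Visit pages breadth-first, fetching at most max_pages of them.

    fetch_links(url) must return an iterable of internal link URLs.
    Returns the URLs in the order they were fetched.
    """
    queued = {seed_url}          # every URL ever placed on the queue
    queue = deque([seed_url])
    fetched = []

    while queue and len(fetched) < max_pages:
        url = queue.popleft()
        fetched.append(url)
        for link in fetch_links(url):
            if link not in queued:
                queued.add(link)
                queue.append(link)
    return fetched


# Intended use with the client from Basic Usage:
#   crawl_breadth_first(
#       lambda u: client.search(u).internal_links,
#       "https://www.137ventures.com/portfolio",
#   )
```

With the default stealth search at 10 credits per page, max_pages=25 bounds a crawl at 250 credits.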

Retrieving Past Results

2) Listing Available Results

Use list_available() to get minimal data for each past search:

records = client.list_available()
for r in records:
    print(r.id, r.url, r.search_type, r.created_at)

Each r is a PageData with minimal fields (omitting large text/html for bandwidth savings).
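
If the same URL has been searched several times, the listing will contain one record per search. The sketch below keeps only the newest record for each URL. latest_per_url is a hypothetical helper, and it assumes created_at values can be compared with one another (for example numeric timestamps or ISO 8601 strings); the README does not say which type the library returns. Stand-in records are used so the example runs offline.

```python
from types import SimpleNamespace


def latest_per_url(records):
    """Keep only the most recent record for each URL."""
    latest = {}
    for record in records:
        current = latest.get(record.url)
        if current is None or record.created_at > current.created_at:
            latest[record.url] = record
    return latest


# Stand-in records with the fields printed above (id, url, created_at)
records = [
    SimpleNamespace(id=1, url="https://example.org", created_at=1700000000),
    SimpleNamespace(id=2, url="https://example.org", created_at=1700000500),
    SimpleNamespace(id=3, url="https://www.python.org", created_at=1700000100),
]

for url, record in latest_per_url(records).items():
    print(url, "-> record", record.id)
```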

3) Pulling Detailed Data

If you want all fields (including text, HTML, links, etc.), call pull_data(...).
For example, to pull a single record by its ID:

# e.g. pull by record_id
details = client.pull_data(record_id=10)
if details:
    first = details[0]
    print(first.url, len(first.text_content), "chars of text")

You can also filter by URL with url_filter, if your installed version of the library supports it:

details = client.pull_data(url_filter="https://example.org")

4) Checking Your Credits Balance

Retrieve your current credits balance:

balance_info = client.pull_credits_balance()
print(balance_info)
# e.g. { "success": true, "credits_balance": 240 }

We store the user’s credits on our server, so you can see how many searches remain.
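
To turn a balance into a number of remaining searches, divide by the per-search cost quoted in "Performing a Search" (10 credits for stealth, 1 for test). The sketch below assumes the balance response is a dict with a credits_balance key, as in the example comment above; searches_remaining and SEARCH_COSTS are hypothetical names for this example.

```python
# Credit costs quoted in "Performing a Search"
SEARCH_COSTS = {"stealth_search": 10, "test_search": 1}


def searches_remaining(credits_balance, search_type="stealth_search"):
    """Return how many whole searches of the given type the balance covers."""
    return credits_balance // SEARCH_COSTS[search_type]


# Using the example balance shown above (assumed response shape)
balance_info = {"success": True, "credits_balance": 240}
print(searches_remaining(balance_info["credits_balance"]))                 # 24
print(searches_remaining(balance_info["credits_balance"], "test_search"))  # 240
```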

Additional Notes

Attribution: This package relies on open-source libraries like requests.
Rate Limits: We do not impose user-level concurrency throttles, but large-scale usage could be slowed if the underlying cloud environment is heavily utilized.
Your First 1,000 Searches: On new accounts, we credit 1,000 searches automatically, so you can test stealth or test calls with zero upfront cost.
For more advanced usage (like admin ops, account creation, adding credits) see desync.ai or contact support.

License

This project is licensed under the MIT License.

Happy scraping with Desync Search—the next-level “API to the Internet”! We look forward to your feedback and contributions.

Release history

0.2.29 (Aug 28, 2025)
0.2.28 (Aug 28, 2025)
0.2.27 (Aug 28, 2025)
0.2.26 (Aug 28, 2025)
0.2.25 (Mar 12, 2025)
0.2.24 (Feb 11, 2025)
0.2.23 (Feb 11, 2025)
0.2.22 (Feb 11, 2025)
0.2.21 (Feb 7, 2025)
0.2.20 (Jan 31, 2025)
0.2.19 (Jan 31, 2025)
0.2.18 (Jan 29, 2025)
0.2.17 (Jan 25, 2025)
0.2.16 (Jan 25, 2025)
0.2.15 (Jan 25, 2025)
0.2.14 (Jan 25, 2025)
0.2.13 (Jan 25, 2025)
0.2.12 (Jan 23, 2025)
0.2.11 (Jan 23, 2025)
0.2.10 (Jan 23, 2025), this version
0.2.9 (Jan 23, 2025)
0.2.8 (Jan 22, 2025)
0.2.4 (Jan 20, 2025)
0.2.3 (Jan 20, 2025)
0.2.2 (Jan 20, 2025)
0.2.1 (Jan 20, 2025)
0.2.0 (Jan 20, 2025)
0.1.4 (Dec 5, 2024)
0.1.3 (Dec 5, 2024)
0.1.0 (Dec 3, 2024)

Download files

Source Distribution

desync_search-0.2.10.tar.gz (8.1 kB view details)

Uploaded Jan 23, 2025 Source

Built Distribution

desync_search-0.2.10-py3-none-any.whl (8.5 kB view details)

Uploaded Jan 23, 2025 Python 3

File details

Details for the file desync_search-0.2.10.tar.gz.

File metadata

Download URL: desync_search-0.2.10.tar.gz
Upload date: Jan 23, 2025
Size: 8.1 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.12.4

File hashes

Hashes for desync_search-0.2.10.tar.gz
Algorithm	Hash digest
SHA256	`2f34e3737d7620ba537a457790b7a8d2d03e961cd5a8ba8f4dbe5982704910e9`
MD5	`010278dd6d7ef81df6c3175631182074`
BLAKE2b-256	`34b139d5a4373a181a09bf2352865d6986c35ce0376acc64cd0a74a67f596a16`

File details

Details for the file desync_search-0.2.10-py3-none-any.whl.

File metadata

Download URL: desync_search-0.2.10-py3-none-any.whl
Upload date: Jan 23, 2025
Size: 8.5 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.12.4

File hashes

Hashes for desync_search-0.2.10-py3-none-any.whl
Algorithm	Hash digest
SHA256	`c4b2ab9e63e0af4fb11b42a8be08a9158412384c91ffdaa59cf148f1c74068ed`
MD5	`48e95116ac0e23fd204289e509879dc4`
BLAKE2b-256	`52f6f83a18a47ea95c471334df8288e19e38e44ea77144a046399a0a7e0cebec`
