No project description provided

These details have not been verified by PyPI

Project links

Repository

Project description

Newscatcher Python Library

The Newscatcher Python library provides convenient access to the Newscatcher APIs from Python.

Documentation
Installation
Reference
Usage
Environments
Async Client
Exception Handling
Retrieving More Articles
Query Validation
Advanced
Contributing

Documentation

API reference documentation is available here.

Installation

pip install newscatcher-sdk

Reference

A full reference for this library is available here.

Usage

Instantiate and use the client with the following:

from newscatcher import NewscatcherApi

client = NewscatcherApi(
    api_key="<value>",
)

client.search.post(
    q="\"supply chain\" AND Amazon NOT China",
    page_size=1,
)

Environments

This SDK allows you to configure different environments for API requests.

from newscatcher import NewscatcherApi
from newscatcher.environment import NewscatcherApiEnvironment

client = NewscatcherApi(
    environment=NewscatcherApiEnvironment.DEFAULT,
)

Async Client

The SDK also exports an async client so that you can make non-blocking calls to our API. Note that if you are constructing an Async httpx client class to pass into this client, use httpx.AsyncClient() instead of httpx.Client() (e.g. for the httpx_client parameter of this client).

import asyncio

from newscatcher import AsyncNewscatcherApi

client = AsyncNewscatcherApi(
    api_key="<value>",
)


async def main() -> None:
    await client.search.post(
        q="\"supply chain\" AND Amazon NOT China",
        page_size=1,
    )


asyncio.run(main())

Exception Handling

When the API returns a non-success status code (4xx or 5xx response), a subclass of the following error will be thrown.

from newscatcher.core.api_error import ApiError

try:
    client.search.post(...)
except ApiError as e:
    print(e.status_code)
    print(e.body)

Retrieving more articles

The standard News API endpoints have a limit of 10,000 articles per query. To retrieve more articles when needed, use these methods that automatically break down your request into smaller time chunks:

Get all articles

import datetime
from newscatcher import NewscatcherApi

client = NewscatcherApi(api_key="YOUR_API_KEY")

# Get articles about renewable energy from the past 10 days
articles = client.get_all_articles(
    q="renewable energy",
    from_="10d",  # Last 10 days
    time_chunk_size="1d",  # Split into 1-day chunks
    max_articles=50000,    # Limit to 50,000 articles
    show_progress=True     # Show progress indicator
)

print(f"Retrieved {len(articles)} articles")

Get all latest headlines

from newscatcher import NewscatcherApi

client = NewscatcherApi(api_key="YOUR_API_KEY")

# Get all technology headlines from the past week
articles = client.get_all_headlines(
    when="7d",
    time_chunk_size="1h",  # Split into 1-hour chunks
    show_progress=True
)

print(f"Retrieved {len(articles)} articles")

These methods handle pagination and deduplication automatically, giving you a seamless experience for retrieving large datasets.

You can also use async versions of these methods with the AsyncNewscatcherApi client.

Query validation

The SDK includes client-side query validation to help you catch syntax errors before making API calls:

from newscatcher import NewscatcherApi

client = NewscatcherApi(api_key="YOUR_API_KEY")

# Validate query syntax
is_valid, error_message = client.validate_query("machine learning")
if is_valid:
    print("Query is valid!")
else:
    print(f"Invalid query: {error_message}")

Automatic validation

Query validation is enabled by default in methods like get_all_articles() and will raise a ValueError for invalid queries. You can disable validation by setting validate_query=False:

# Enable validation (default)
articles = client.get_all_articles(
    q="AI OR \"artificial intelligence\"",  # Valid query
    validate_query=True,  # Optional, True by default
    from_="7d"
)

# Disable validation (not recommended)
articles = client.get_all_articles(
    q="some query",
    validate_query=False,  # Skip client-side validation
    from_="7d"
)

For complete validation rules, bulk validation techniques, and troubleshooting, see Validate queries with Python SDK.

Advanced

Access Raw Response Data

The SDK provides access to raw response data, including headers, through the .with_raw_response property. The .with_raw_response property returns a "raw" client that can be used to access the .headers and .data attributes.

from newscatcher import NewscatcherApi

client = NewscatcherApi(...)
response = client.search.with_raw_response.post(...)
print(response.headers)  # access the response headers
print(response.status_code)  # access the response status code
print(response.data)  # access the underlying object

Retries

The SDK is instrumented with automatic retries with exponential backoff. A request will be retried as long as the request is deemed retryable and the number of retry attempts has not grown larger than the configured retry limit (default: 2).

A request is deemed retryable when any of the following HTTP status codes is returned:

408 (Timeout)
429 (Too Many Requests)
5XX (Internal Server Errors)

Use the max_retries request option to configure this behavior.

client.search.post(..., request_options={
    "max_retries": 1
})

Timeouts

The SDK defaults to a 60 second timeout. You can configure this with a timeout option at the client or request level.

from newscatcher import NewscatcherApi

client = NewscatcherApi(..., timeout=20.0)

# Override timeout for a specific method
client.search.post(..., request_options={
    "timeout_in_seconds": 1
})

Custom Client

You can override the httpx client to customize it for your use-case. Some common use-cases include support for proxies and transports.

import httpx
from newscatcher import NewscatcherApi

client = NewscatcherApi(
    ...,
    httpx_client=httpx.Client(
        proxy="http://my.test.proxy.example.com",
        transport=httpx.HTTPTransport(local_address="0.0.0.0"),
    ),
)

Contributing

While we value open-source contributions to this SDK, this library is generated programmatically. Additions made directly to this library would have to be moved over to our generation code, otherwise they would be overwritten upon the next generated release. Feel free to open a PR as a proof of concept, but know that we will not be able to merge it as-is. We suggest opening an issue first to discuss with us!

On the other hand, contributions to the README are always very welcome!

Project details

These details have not been verified by PyPI

Project links

Repository

Release history Release notifications | RSS feed

2.1.0

Apr 23, 2026

This version

2.0.3

Apr 16, 2026

2.0.2

Mar 16, 2026

2.0.1

Mar 16, 2026

2.0.0

Mar 16, 2026

1.4.0

Aug 24, 2025

1.3.0

Aug 22, 2025

1.2.0

Mar 17, 2025

1.1.0

Jan 16, 2025

1.0.0

Dec 24, 2024

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

newscatcher_sdk-2.0.3.tar.gz (109.1 kB view details)

Uploaded Apr 16, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

newscatcher_sdk-2.0.3-py3-none-any.whl (178.9 kB view details)

Uploaded Apr 16, 2026 Python 3

File details

Details for the file newscatcher_sdk-2.0.3.tar.gz.

File metadata

Download URL: newscatcher_sdk-2.0.3.tar.gz
Upload date: Apr 16, 2026
Size: 109.1 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: poetry/1.5.1 CPython/3.10.20 Linux/6.17.0-1010-azure

File hashes

Hashes for newscatcher_sdk-2.0.3.tar.gz
Algorithm	Hash digest
SHA256	`b1badbd4183bc973477baf065c2b5b0d46bc532f39f42468494c1b44316be223`
MD5	`375be99bf7658a7bfc394a00a5ca6aed`
BLAKE2b-256	`7ebdd3c0adae14f5c0110ce00eb84bae4d362cc55cae67e139f229128c443cc0`

See more details on using hashes here.

File details

Details for the file newscatcher_sdk-2.0.3-py3-none-any.whl.

File metadata

Download URL: newscatcher_sdk-2.0.3-py3-none-any.whl
Upload date: Apr 16, 2026
Size: 178.9 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: poetry/1.5.1 CPython/3.10.20 Linux/6.17.0-1010-azure

File hashes

Hashes for newscatcher_sdk-2.0.3-py3-none-any.whl
Algorithm	Hash digest
SHA256	`d77073ab5bd70d41e2612540b07a05e6996758fa47a109772f8561b54702bd12`
MD5	`eb5f45df480b10a18e2461f3ba01c440`
BLAKE2b-256	`f7cb7b3a875a2dbcd02f6d44e77eb11bdc217787745601f6bbbc74e793ce359a`

See more details on using hashes here.

newscatcher-sdk 2.0.3

Navigation

Verified details

Owner

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Newscatcher Python Library

Table of Contents

Documentation

Installation

Reference

Usage

Environments

Async Client

Exception Handling

Retrieving more articles

Get all articles

Get all latest headlines

Query validation

Automatic validation

Advanced

Access Raw Response Data

Retries

Timeouts

Custom Client

Contributing

Project details

Verified details

Owner

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes