Skip to main content

Use the Article Search API to look up articles by keyword.

Project description

NytArticleSearch Python SDK 1.0.0

Welcome to the NytArticleSearch SDK documentation. This guide will help you get started with integrating and using the NytArticleSearch SDK in your project.

Versions

  • API version: 2.0.0
  • SDK version: 1.0.0

About the API

Use the Article Search API to look up articles by keyword. You can refine your search using filters and facets. /articlesearch.json?q={query}&fq={filter}

Example Call

https://api.nytimes.com/svc/search/v2/articlesearch.json?q=election&api-key=yourkey

FILTERING YOUR SEARCH

Use filters to narrow the scope of your search. You can specify the fields and the values that your query will be filtered on. The Article Search API uses Elasticsearch, so the filter query (fq) uses standard Lucene syntax. Separate the filter field name and value with a colon, and surround multiple values with parentheses. field-name:( value1 , value2 , ... value n ) The default connector for values in parentheses is OR. If you declare an explicit boolean value, it must be capitalized. You can filter on multiple values and fields. field-name-1:( value1 ) AND field-name-2:( value2 , value3 ) For a list of all fields you can filter on, see the Filter Query Fields table below. You can also filter by search text.

Pagination

The Article Search API returns a max of 10 results at a time. The meta node in the response contains the total number of matches ( hits ) and the current offset. Use the page query parameter to paginate thru results (page=0 for results 1-10, page=1 for 11-20, ...). You can paginate thru up to 100 pages (1,000 results). If you get too many results try filtering by date range.

Filter Query Examples

Restrict your search to articles with The New York Times as the source: fq=source:( The New York Times ) Restrict your search to articles from either the Sports or Foreign desk: fq=news_desk:( Sports , Foreign ) Restrict your search to articles that are about New York City and from the Sports desk: fq=news_desk:( Sports ) AND glocations:( NEW YORK CITY ) If you do not specify a field, the scope of the filter query will look for matches in the body, headline and byline. The example below will restrict your search to articles with The New York Times in the body, headline or byline: fq=The New York Times Find articles with the word Pokemon that were on the print paper's front page. fq=Pokemon AND print_page:1 AND (print_section:( A , 1 ) OR (!_exists_:print_section))

USING FACETS

Use facets to view the relative importance of certain fields to a search term, and gain insight about how to best refine your queries and filter your search results. The following fields can be used as facet fields: day_of_week, document_type, ingredients, news_desk, pub_month, pub_year, section_name, source, subsection_name, and type_of_material. Specify facets using the facet_fields parameter. Set facet=true and the response will contain an array with a count for the five terms that have the highest count for each facet. For example, including the following in your request will add a facet array with a count for the top five days of the week to the response. facet_fields=day_of_week&facet=true By default, facet counts ignore all filters and return the count for all results of a query. For the following queries, the facet count in each response will be identical, even though the results returned in one set is restricted to articles published in 2012. q=obama&facet_fields=source&facet=true&begin_date=20120101&end_date=20121231 You can force facet counts to respect filters by setting facet_filter=true. Facet counts will be restricted based on any filters you have specified (this includes both explicit filter queries set using the fq parameter and implicit filters like begin_date). Here is the facet array response to the query. javascript facets : { source : { _type : terms , missing : 524, total : 83121, other : 317, terms : [ { term : The New York Times , count : 68530 }, { term : AP , count : 7705 }, { term : Reuters , count : 4969 }, { term : International Herald Tribune , count : 1464 }, { term : , count : 136 } ] } } If you set facet_filter to true the facet array will only count facet occurences in 2012. javascript facets : { source : { _type : terms , missing : 192, total : 22596, other : 139, terms : [ { term : The New York Times , count : 14812 }, ...

Examples Requests

Search for documents containing 'new york times' and return results 20-29 with results sorted oldest first. https://api.nytimes.com/svc/search/v2/articlesearch.json?q=new+york+times&page=2&sort=oldest&api-key=your-api-key Search for all documents published on January 1, 2012 containing 'romney'. Facet count will be returned for 'day_of_week' and will be reflective of all documents (i.e., the date range filter and filter query do not affect facet counts). https://api.nytimes.com/svc/search/v2/articlesearch.json?fq=romney&facet_field=day_of_week&facet=true&begin_date=20120101&end_date=20120101&api-key=your-api-key

Example Response

Here is an partial example response. javascript { response : { meta : { hits : 25, time : 332, offset : 0 }, docs : [ { web_url : http://thecaucus.blogs.nytimes.com/2012/01/01/virginia-attorney-general-backs-off-ballot-proposal/ , snippet : Virginia's attorney general on Sunday backed off of a proposal to loosen the state's ballot access rules to allow more Republican presidential candidates to qualify. , lead_paragraph : DES MOINES -- Virginia's attorney general on Sunday backed off of a proposal to loosen the state's ballot access rules to allow more Republican presidential candidates to qualify. , ... } ], facets : { day_of_week : { _type : terms , missing : 1871790, total : 13098462, other : 3005891, terms : [ { term : Sunday , count : 3122347 }, ...

Limit

Fields in Response You can limit the number fields returned in the response with the fl parameter. fl=web_url

Table of Contents

Setup & Configuration

Supported Language Versions

This SDK is compatible with the following versions: Python >= 3.7

Installation

To get started with the SDK, we recommend installing using pip:

pip install nyt-article-search

Services

The SDK provides various services to interact with the API.

Below is a list of all available services with links to their detailed documentation:
Name
SearchService

Models

The SDK includes several models that represent the data structures used in API requests and responses. These models help in organizing and managing the data efficiently.

Below is a list of all available models with links to their detailed documentation:
Name Description
GetArticlesearchJsonOkResponse
Facet
FacetFields
FacetFilter
Sort
Response
Article
Meta
Multimedia
Headline
Keyword
Byline
Legacy
Person

License

This SDK is licensed under the MIT License.

See the LICENSE file for more details.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

nyt-article-search-1.0.0.tar.gz (25.2 kB view details)

Uploaded Source

Built Distribution

nyt_article_search-1.0.0-py3-none-any.whl (36.9 kB view details)

Uploaded Python 3

File details

Details for the file nyt-article-search-1.0.0.tar.gz.

File metadata

  • Download URL: nyt-article-search-1.0.0.tar.gz
  • Upload date:
  • Size: 25.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.7.17

File hashes

Hashes for nyt-article-search-1.0.0.tar.gz
Algorithm Hash digest
SHA256 c8fd63af0550b920633160976637712bcd6da49b67ef7d0afae84cdaad0c535d
MD5 5180df5ae4df76c3d477f9736de4dc23
BLAKE2b-256 59c052020ae7965cdeecd8a955f7eabe7c107a06af6b5a8b9d4921b429a47ba7

See more details on using hashes here.

File details

Details for the file nyt_article_search-1.0.0-py3-none-any.whl.

File metadata

File hashes

Hashes for nyt_article_search-1.0.0-py3-none-any.whl
Algorithm Hash digest
SHA256 d978ae8f071c1fb2e388971438506e90bfe2225285587074158781d0bebb5553
MD5 1c051482f2eed0c4e66b0ae6198d2b15
BLAKE2b-256 7573eeb4207ffd8a14c0514dccd51510ddbb498f7ee8b9b4ff5a26f0753e9a72

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page