Search for words, documents, images, news, maps and text translation using the DuckDuckGo.com search engine.
Project description
Duckduckgo_search
Search for words, documents, images, videos, news, maps and text translation using the DuckDuckGo.com search engine. Downloading files and images to a local hard drive.
⚠️ Warning: use AsyncDDGS in asynchronous code
Table of Contents
- Install
- CLI version
- Duckduckgo search operators
- Regions
- DDGS and AsyncDDGS classes
- Proxies
- Exceptions
- 1. text() - text search
- 2. answers() - instant answers
- 3. images() - image search
- 4. videos() - video search
- 5. news() - news search
- 6. maps() - map search
- 7. translate() - translation
- 8. suggestions() - suggestions
Install
pip install -U duckduckgo_search
CLI version
ddgs --help
or
python -m duckduckgo_search --help
CLI examples:
# text search
ddgs text -k "ayrton senna"
# text search via proxy (example: Tor Browser)
ddgs text -k "china is a global threat" -p socks5://localhost:9150
# find and download pdf files
ddgs text -k "russia filetype:pdf" -m 50 -d
# find in es-es region and download pdf files via proxy (example: Tor browser)
ddgs text -k "embajada a tamorlán filetype:pdf" -r es-es -m 50 -d -p socks5://localhost:9150
# find and download xls files from a specific site
ddgs text -k "sanctions filetype:xls site:gov.ua" -m 50 -d
# find and download any doc(x) files from a specific site
ddgs text -k "filetype:doc site:mos.ru" -m 50 -d
# find and download images
ddgs images -k "yuri kuklachev cat theatre" -m 500 -s off -d
# find in br-br region and download images via proxy (example: Tor browser) in 10 threads
ddgs images -k "rio carnival" -r br-br -s off -m 500 -d -th 10 -p socks5://localhost:9150
# get latest news
ddgs news -k "ukraine war" -s off -t d -m 10
# get last day's news and save it to a csv file
ddgs news -k "hubble telescope" -t d -m 50 -o csv
# get answers and save to a json file
ddgs answers -k holocaust -o json
Duckduckgo search operators
Keywords example | Result |
---|---|
cats dogs | Results about cats or dogs |
"cats and dogs" | Results for exact term "cats and dogs". If no results are found, related results are shown. |
cats -dogs | Fewer dogs in results |
cats +dogs | More dogs in results |
cats filetype:pdf | PDFs about cats. Supported file types: pdf, doc(x), xls(x), ppt(x), html |
dogs site:example.com | Pages about dogs from example.com |
cats -site:example.com | Pages about cats, excluding example.com |
intitle:dogs | Page title includes the word "dogs" |
inurl:cats | Page url includes the word "cats" |
Regions
expand
xa-ar for Arabia
xa-en for Arabia (en)
ar-es for Argentina
au-en for Australia
at-de for Austria
be-fr for Belgium (fr)
be-nl for Belgium (nl)
br-pt for Brazil
bg-bg for Bulgaria
ca-en for Canada
ca-fr for Canada (fr)
ct-ca for Catalan
cl-es for Chile
cn-zh for China
co-es for Colombia
hr-hr for Croatia
cz-cs for Czech Republic
dk-da for Denmark
ee-et for Estonia
fi-fi for Finland
fr-fr for France
de-de for Germany
gr-el for Greece
hk-tzh for Hong Kong
hu-hu for Hungary
in-en for India
id-id for Indonesia
id-en for Indonesia (en)
ie-en for Ireland
il-he for Israel
it-it for Italy
jp-jp for Japan
kr-kr for Korea
lv-lv for Latvia
lt-lt for Lithuania
xl-es for Latin America
my-ms for Malaysia
my-en for Malaysia (en)
mx-es for Mexico
nl-nl for Netherlands
nz-en for New Zealand
no-no for Norway
pe-es for Peru
ph-en for Philippines
ph-tl for Philippines (tl)
pl-pl for Poland
pt-pt for Portugal
ro-ro for Romania
ru-ru for Russia
sg-en for Singapore
sk-sk for Slovak Republic
sl-sl for Slovenia
za-en for South Africa
es-es for Spain
se-sv for Sweden
ch-de for Switzerland (de)
ch-fr for Switzerland (fr)
ch-it for Switzerland (it)
tw-tzh for Taiwan
th-th for Thailand
tr-tr for Turkey
ua-uk for Ukraine
uk-en for United Kingdom
us-en for United States
ue-es for United States (es)
ve-es for Venezuela
vn-vi for Vietnam
wt-wt for No region
DDGS and AsyncDDGS classes
The DDGS and AsyncDDGS classes are used to retrieve search results from DuckDuckGo.com. To use the AsyncDDGS class, you can perform asynchronous operations using Python's asyncio library. To initialize an instance of the DDGS or AsyncDDGS classes, you can provide the following optional arguments:
class DDGS:
"""DuckDuckgo_search class to get search results from duckduckgo.com
Args:
headers (dict, optional): Dictionary of headers for the HTTP client. Defaults to None.
proxies (Union[dict, str], optional): Proxies for the HTTP client (can be dict or str). Defaults to None.
timeout (int, optional): Timeout value for the HTTP client. Defaults to 10.
"""
Here is an example of initializing the DDGS class:
from duckduckgo_search import DDGS
with DDGS() as ddgs:
results = [r for r in ddgs.text("python programming", max_results=5)]
print(results)
Here is an example of initializing the AsyncDDGS class:
import asyncio
import logging
import sys
from itertools import chain
from random import shuffle
import requests
from duckduckgo_search import AsyncDDGS
# bypass curl-cffi NotImplementedError in windows https://curl-cffi.readthedocs.io/en/latest/faq/
if sys.platform.lower().startswith("win"):
asyncio.set_event_loop_policy(asyncio.WindowsSelectorEventLoopPolicy())
def get_words():
word_site = "https://www.mit.edu/~ecprice/wordlist.10000"
resp = requests.get(word_site)
words = resp.text.splitlines()
return words
async def aget_results(word):
async with AsyncDDGS(proxies=proxies) as ddgs:
results = [r async for r in ddgs.text(word, max_results=None)]
return results
async def main():
words = get_words()
shuffle(words)
tasks = []
for word in words[:10]:
tasks.append(aget_results(word))
results = await asyncio.gather(*tasks)
print(f"Done")
for r in chain.from_iterable(results):
print(r)
if __name__ == "__main__":
logging.basicConfig(level=logging.DEBUG)
asyncio.run(main())
It is important to note that the DDGS and AsyncDDGS classes should always be used as a context manager (with statement). This ensures proper resource management and cleanup, as the context manager will automatically handle opening and closing the HTTP client connection.
Proxies
Proxy can be specified as a dictionary or just a string
proxies = {"http": "socks5://localhost:9150", "https": "socks5://localhost:9150"}
proxies = "socks5://localhost:9150"
1. The easiest way. Launch the Tor Browser
from duckduckgo_search import DDGS
with DDGS(proxies="socks5://localhost:9150", timeout=20) as ddgs:
for r in ddgs.text("something you need", max_results=50):
print(r)
2. Use any proxy server (example with iproyal residential proxies)
from duckduckgo_search import DDGS
with DDGS(proxies="socks5://user:password@geo.iproyal.com:32325", timeout=20) as ddgs:
for r in ddgs.text("something you need", max_results=50):
print(r)
Exceptions
Exceptions:
DuckDuckGoSearchException
: Raised when there is a generic exception during the API request.
1. text() - text search by duckduckgo.com
def text(
keywords: str,
region: str = "wt-wt",
safesearch: str = "moderate",
timelimit: Optional[str] = None,
backend: str = "api",
max_results: Optional[int] = None,
) -> Iterator[Dict[str, Optional[str]]]:
"""DuckDuckGo text search generator. Query params: https://duckduckgo.com/params
Args:
keywords: keywords for query.
region: wt-wt, us-en, uk-en, ru-ru, etc. Defaults to "wt-wt".
safesearch: on, moderate, off. Defaults to "moderate".
timelimit: d, w, m, y. Defaults to None.
backend: api, html, lite. Defaults to api.
api - collect data from https://duckduckgo.com,
html - collect data from https://html.duckduckgo.com,
lite - collect data from https://lite.duckduckgo.com.
max_results: max number of results. If None, returns results only from the first response. Defaults to None.
Yields:
dict with search results.
"""
Example
from duckduckgo_search import DDGS
with DDGS() as ddgs:
for r in ddgs.text('live free or die', region='wt-wt', safesearch='off', timelimit='y', max_results=10):
print(r)
# Searching for pdf files
with DDGS() as ddgs:
for r in ddgs.text('russia filetype:pdf', region='wt-wt', safesearch='off', timelimit='y', max_results=10):
print(r)
2. answers() - instant answers by duckduckgo.com
def answers(keywords: str) -> Iterator[Dict[str, Optional[str]]]::
"""DuckDuckGo instant answers. Query params: https://duckduckgo.com/params
Args:
keywords: keywords for query.
Yields:
dict with instant answers results.
"""
Example
from duckduckgo_search import DDGS
with DDGS() as ddgs:
for r in ddgs.answers("sun"):
print(r)
3. images() - image search by duckduckgo.com
def images(
keywords: str,
region: str = "wt-wt",
safesearch: str = "moderate",
timelimit: Optional[str] = None,
size: Optional[str] = None,
color: Optional[str] = None,
type_image: Optional[str] = None,
layout: Optional[str] = None,
license_image: Optional[str] = None,
max_results: Optional[int] = None,
) -> Iterator[Dict[str, Optional[str]]]:
"""DuckDuckGo images search. Query params: https://duckduckgo.com/params
Args:
keywords: keywords for query.
region: wt-wt, us-en, uk-en, ru-ru, etc. Defaults to "wt-wt".
safesearch: on, moderate, off. Defaults to "moderate".
timelimit: Day, Week, Month, Year. Defaults to None.
size: Small, Medium, Large, Wallpaper. Defaults to None.
color: color, Monochrome, Red, Orange, Yellow, Green, Blue,
Purple, Pink, Brown, Black, Gray, Teal, White. Defaults to None.
type_image: photo, clipart, gif, transparent, line.
Defaults to None.
layout: Square, Tall, Wide. Defaults to None.
license_image: any (All Creative Commons), Public (PublicDomain),
Share (Free to Share and Use), ShareCommercially (Free to Share and Use Commercially),
Modify (Free to Modify, Share, and Use), ModifyCommercially (Free to Modify, Share, and
Use Commercially). Defaults to None.
max_results: max number of results. If None, returns results only from the first response. Defaults to None.
Yields:
dict with image search results.
"""
Example
from duckduckgo_search import DDGS
with DDGS() as ddgs:
keywords = 'butterfly'
ddgs_images_gen = ddgs.images(
keywords,
region="wt-wt",
safesearch="off",
size=None,
color="Monochrome",
type_image=None,
layout=None,
license_image=None,
max_results=100,
)
for r in ddgs_images_gen:
print(r)
4. videos() - video search by duckduckgo.com
def videos(
keywords: str,
region: str = "wt-wt",
safesearch: str = "moderate",
timelimit: Optional[str] = None,
resolution: Optional[str] = None,
duration: Optional[str] = None,
license_videos: Optional[str] = None,
max_results: Optional[int] = None,
) -> Iterator[Dict[str, Optional[str]]]:
"""DuckDuckGo videos search. Query params: https://duckduckgo.com/params
Args:
keywords: keywords for query.
region: wt-wt, us-en, uk-en, ru-ru, etc. Defaults to "wt-wt".
safesearch: on, moderate, off. Defaults to "moderate".
timelimit: d, w, m. Defaults to None.
resolution: high, standart. Defaults to None.
duration: short, medium, long. Defaults to None.
license_videos: creativeCommon, youtube. Defaults to None.
max_results: max number of results. If None, returns results only from the first response. Defaults to None.
Yields:
dict with videos search results
"""
Example
from duckduckgo_search import DDGS
with DDGS() as ddgs:
keywords = 'tesla'
ddgs_videos_gen = ddgs.videos(
keywords,
region="wt-wt",
safesearch="off",
timelimit="w",
resolution="high",
duration="medium",
max_results=100,
)
for r in ddgs_videos_gen:
print(r)
5. news() - news search by duckduckgo.com
def news(
keywords: str,
region: str = "wt-wt",
safesearch: str = "moderate",
timelimit: Optional[str] = None,
max_results: Optional[int] = None,
) -> Iterator[Dict[str, Optional[str]]]:
"""DuckDuckGo news search. Query params: https://duckduckgo.com/params
Args:
keywords: keywords for query.
region: wt-wt, us-en, uk-en, ru-ru, etc. Defaults to "wt-wt".
safesearch: on, moderate, off. Defaults to "moderate".
timelimit: d, w, m. Defaults to None.
max_results: max number of results. If None, returns results only from the first response. Defaults to None.
Yields:
dict with news search results.
"""
Example
from duckduckgo_search import DDGS
with DDGS() as ddgs:
keywords = 'holiday'
ddgs_news_gen = ddgs.news(
keywords,
region="wt-wt",
safesearch="off",
timelimit="m",
max_results=20
)
for r in ddgs_news_gen:
print(r)
6. maps() - map search by duckduckgo.com
def maps(
keywords,
place: Optional[str] = None,
street: Optional[str] = None,
city: Optional[str] = None,
county: Optional[str] = None,
state: Optional[str] = None,
country: Optional[str] = None,
postalcode: Optional[str] = None,
latitude: Optional[str] = None,
longitude: Optional[str] = None,
radius: int = 0,
max_results: Optional[int] = None,
) -> Iterator[Dict[str, Optional[str]]]:
"""DuckDuckGo maps search. Query params: https://duckduckgo.com/params
Args:
keywords: keywords for query
place: if set, the other parameters are not used. Defaults to None.
street: house number/street. Defaults to None.
city: city of search. Defaults to None.
county: county of search. Defaults to None.
state: state of search. Defaults to None.
country: country of search. Defaults to None.
postalcode: postalcode of search. Defaults to None.
latitude: geographic coordinate (north–south position). Defaults to None.
longitude: geographic coordinate (east–west position); if latitude and
longitude are set, the other parameters are not used. Defaults to None.
radius: expand the search square by the distance in kilometers. Defaults to 0.
max_results: max number of results. If None, returns results only from the first response. Defaults to None.
Yields:
dict with maps search results
"""
Example
from duckduckgo_search import DDGS
with DDGS() as ddgs:
for r in ddgs.maps("school", place="Uganda", max_results=50):
print(r)
7. translate() - translation by duckduckgo.com
def translate(
self,
keywords: str,
from_: Optional[str] = None,
to: str = "en",
) -> Optional[Dict[str, Optional[str]]]:
"""DuckDuckGo translate
Args:
keywords: string or a list of strings to translate
from_: translate from (defaults automatically). Defaults to None.
to: what language to translate. Defaults to "en".
Returns:
dict with translated keywords.
"""
Example
from duckduckgo_search import DDGS
with DDGS() as ddgs:
keywords = 'school'
r = ddgs.translate(keywords, to="de")
print(r)
8. suggestions() - suggestions by duckduckgo.com
def suggestions(
keywords,
region: str = "wt-wt",
) -> Iterator[Dict[str, Optional[str]]]:
"""DuckDuckGo suggestions. Query params: https://duckduckgo.com/params
Args:
keywords: keywords for query.
region: wt-wt, us-en, uk-en, ru-ru, etc. Defaults to "wt-wt".
Yields:
dict with suggestions results.
"""
Example
from duckduckgo_search import DDGS
with DDGS() as ddgs:
for r in ddgs.suggestions("fly"):
print(r)
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file duckduckgo_search-4.4.2.tar.gz
.
File metadata
- Download URL: duckduckgo_search-4.4.2.tar.gz
- Upload date:
- Size: 24.6 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.12.1
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 964b978550fcf6423b1503797620055076bb4d1e7613e0b7f535ee4a964d1bdf |
|
MD5 | 47fe6b1b0c2b8a99b34ea5d711e36736 |
|
BLAKE2b-256 | e543ba7ea7f5f5b23ddf1f1cb9f83299e17c4dc150d93379cecbfbcc76cb53f9 |
File details
Details for the file duckduckgo_search-4.4.2-py3-none-any.whl
.
File metadata
- Download URL: duckduckgo_search-4.4.2-py3-none-any.whl
- Upload date:
- Size: 20.7 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.12.1
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | ba5f817ca18efb5930784eea3a0c2c0cfdf8635ed63b0c42882f55bb31e9587f |
|
MD5 | 45c8bbe0f238ed021a4c54232e9d775b |
|
BLAKE2b-256 | e7212f1b427f4751f83bd21d6ffaecea1c1886038e88aefe6c241eccf2155260 |