Aysncio search engine scraping package

These details have not been verified by PyPI

Project links

Homepage

Intended Audience
- Developers
Operating System
Programming Language

Project description

searchit

Searchit is a library for async scraping of search engines. The library supports multiple search engines (currently Google, Yandex, Qwant and Bing) with support for other search engines to come.

Install

pip install searchit

Can be installed using pip, by running the above command.

Using Searchit

import asyncio

from searchit import GoogleScraper, YandexScraper, BingScraper
from searchit import ScrapeRequest

request = ScrapeRequest("watch movies online", 30)
google = GoogleScraper(max_results_per_page=10) # max_results = Number of results per page
yandex = YandexScraper(max_results_per_page=10)

loop = asyncio.get_event_loop()

results = loop.run_until_complete(google.scrape(request))
results = loop.run_until_complete(yandex.scrape(request))

To use Searchit users first create a ScrapeRequest object, with term and number of results as required fields. This object can then be passed to multiple different search engines and scraped asynchronously.

Scrape Request - Object

term - Required str - the term to be searched for
count - Required int - the total number of results
domain - Optional[str] - the domain to search i.e. .com or .com
sleep - Optional[int] - time to wait betweeen paginating pages - important to prevent getting blocked
proxy - Optional[str] - proxy to be used to make request - default none
language - Optional[str] - language to conduct search in (only Google atm)
geo - Optional[str] - Geo location to conduct search from Yandex, and Qwant

Roadmap

Add additional search engines
Tests
Blocking non-async scrape method
Add support for page rendering (Selenium and Puppeteer)

Project details

These details have not been verified by PyPI

Project links

Homepage

Intended Audience
- Developers
Operating System
Programming Language

Release history Release notifications | RSS feed

This version

2023.2.5.1

Feb 5, 2023

2019.12.30.2

Dec 30, 2019

2019.12.30.1

Dec 30, 2019

2019.12.29.1

Dec 29, 2019

2019.12.29.0

Dec 29, 2019

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

searchit-2023.2.5.1.macosx-10.9-x86_64.tar.gz (11.8 kB view details)

Uploaded Feb 5, 2023 Source

File details

Details for the file searchit-2023.2.5.1.macosx-10.9-x86_64.tar.gz.

File metadata

Download URL: searchit-2023.2.5.1.macosx-10.9-x86_64.tar.gz
Upload date: Feb 5, 2023
Size: 11.8 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/4.0.2 CPython/3.8.2

File hashes

Hashes for searchit-2023.2.5.1.macosx-10.9-x86_64.tar.gz
Algorithm	Hash digest
SHA256	`e5a84f4f74d8f759040da02352667bd86f0ea00176f04689c0603a4e24c6a09b`
MD5	`3c1d88e5659c0d877cc628a5bad11943`
BLAKE2b-256	`80492cdb1b7b0aef146a93dbf0d46a36982e44e29c74159800012ef0650cc429`

See more details on using hashes here.

searchit 2023.2.5.1

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

searchit

Install

Using Searchit

Scrape Request - Object

Roadmap

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

File details

File metadata

File hashes