A Python library for SERP and web scraping with multiple provider integration

surfhub

A Python library for surfing and crawling websites.

This library provides two basic components for running a Google search and retrieving the results:

  • A Serper is an API that returns structured data from Google Search. Several providers offer this, such as ValueSerp and Serper.
  • A Scraper is an API that extracts HTML from a website. You can run it on your own laptop, but it is better to use a provider such as Zyte or Browserless.

To start, you can visit Serper to get a free account.

from surfhub import get_serper

s = get_serper("serper", api_key="yourkey")
print(s.serp("hello world").items)

Supported SERP providers:

Example of using a scraper

from surfhub import get_scraper

s = get_scraper("browserless", api_key="yourkey")
s.scrape("https://webscraper.io/test-sites/e-commerce/allinone")
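The two components can be chained: search first, then scrape the pages that come back. The following is only a sketch under stated assumptions, not part of surfhub. The helper name search_and_scrape and its parameters are hypothetical, and it relies only on the two calls shown in the examples (serp(query).items and scrape(url)). Because the shape of a result item is not described here, the caller supplies url_of to say how a URL is read from an item.

```python
from typing import Any, Callable, List, Tuple


def search_and_scrape(
    serper: Any,
    scraper: Any,
    query: str,
    url_of: Callable[[Any], str],
    limit: int = 3,
) -> List[Tuple[str, Any]]:
    """Hypothetical helper: search for `query`, then scrape the top results.

    `serper` must offer serp(query).items and `scraper` must offer
    scrape(url), as in the examples above. `url_of` is an assumption:
    it tells the helper how to read a URL from one result item.
    """
    results = serper.serp(query).items
    pages = []
    for item in list(results)[:limit]:
        url = url_of(item)  # the caller decides where the URL lives
        pages.append((url, scraper.scrape(url)))
    return pages
```

With real providers, serper and scraper would be the objects returned by get_serper and get_scraper. Check what a result item actually looks like before writing url_of, since its fields are not documented in this description.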

Supported scraper providers

  • Local (run on your laptop) with proxy support
  • Browserless
  • Zyte
  • Crawlbase

TODO

  • Support ScrapingBee
  • Support SerpAPI
  • Support DuckDuckGo API
