Skip to main content

Simple Google parser

Project description

GOOgle spiDER

Google search engine parser on python3

Instruction

Requirement python 3.10+

pip install gooder

from gooder import Gooder

gooder = Gooder()
# Make request on google.com/search?q=Hello+World
parsed = gooder.parse(query="Hello World")

# Print only result links
print(gooder.get_links())

# Print only result titles
print(gooder.get_titles())

# Print all results list[tuple[link,title]]
print(gooder.raw_results)

# If TRUE = parsed, else = captcha/rate limit
if (parsed)
    # Save urls to json file
    gooder.save_to_file(only_urls=True, to_json=True, override=True, file="results.json")

Methods & Fields

Method/Field Args Example Result
Gooder.parse query: str, page: int=0, ignore_google: bool=True, clear_old: bool=True gooder.parse("hello", clear_old=False) True | False
Gooder.raw_results Field Field [[link, title], ...]
Gooder.get_links repeats: bool = False gooder.get_links() [unique_link, ...]
Gooder.get_titles None gooder.get_titles() [title, title, ...]
Gooder.save_to_file only_urls: bool = True override: bool = True to_json: bool = False file: str = "urls.txt" gooder.save_to_file() New file with urls
Gooder.get_hostname links: str | list gooder.get_hostname(https://google.com/) google.com
Gooder.get_captcha_url None gooder.get_captcha_url() None | google.com/sorry/...
Gooder.get_headers None gooder.get_headers() None | HTTPHeaderDict({...})

Todo:

  • Add proxy manager
  • Replace raw_results: list(list()) to dict()

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

gooder-0.3.2.tar.gz (3.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

gooder-0.3.2-py3-none-any.whl (18.3 kB view details)

Uploaded Python 3

File details

Details for the file gooder-0.3.2.tar.gz.

File metadata

  • Download URL: gooder-0.3.2.tar.gz
  • Upload date:
  • Size: 3.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.1.13 CPython/3.10.3 Windows/10

File hashes

Hashes for gooder-0.3.2.tar.gz
Algorithm Hash digest
SHA256 904758e1446fbb9837953f5ff11735a10cf4ea5aa6fcaf74ed6858593837a1ee
MD5 600ff983f82f962b5144855fd276a6bc
BLAKE2b-256 14419d68019b0a0e7661d64e2a7a955299de1d01d7e444d7c2c5a2e5118cd39a

See more details on using hashes here.

File details

Details for the file gooder-0.3.2-py3-none-any.whl.

File metadata

  • Download URL: gooder-0.3.2-py3-none-any.whl
  • Upload date:
  • Size: 18.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.1.13 CPython/3.10.3 Windows/10

File hashes

Hashes for gooder-0.3.2-py3-none-any.whl
Algorithm Hash digest
SHA256 51b3fd2af314edc58d19636f3baaeceeb09f0fcf76a4cd6bf3d8ff6458458c34
MD5 da1dcbf224175e60caa1cf1d27cf5c10
BLAKE2b-256 61ba286d24108dfd40688fa57a7d653f93cffd49a2506b916cb06e4c5a5b7b39

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page