Poogle is a Google scraping library and command line utility for Python.
Currently, it can only scrape Google search results. Planned functionality includes retrieving image links and similar search queries, as well as direct answers to questions you can ask Google.
Installation
To install Poogle, simply:
$ pip install poogle
Usage
Poogle provides both a library for use in your own Python applications and a CLI utility for executing search queries from the command line.
Library
google_search()
Using the Poogle search library is pretty straightforward. If all you want is a set number of Google search results returned in list format, use the google_search function:
from poogle import google_search

results = google_search('Python', results=2)
for result in results:
    print(result.title)
    print(result.url.as_string() + '\n')
The above code will result in output like the following:
Welcome to Python.org
https://www.python.org/

Python (programming language) - Wikipedia, the free encyclopedia
https://en.wikipedia.org/wiki/Python_(programming_language)
URLs are returned as Yurl objects out of the box; call as_string() on one to get a plain string.
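Because as_string() yields an ordinary string, results can be post-processed with the standard library alone. The following is a minimal sketch, not part of Poogle: group_titles_by_host, FakeUrl, and FakeResult are hypothetical names introduced here, and the only assumption about real results is that they expose title and url.as_string() as shown above. The stand-in classes let the sketch run without a network request.

```python
from collections import defaultdict
from urllib.parse import urlsplit


def group_titles_by_host(results):
    """Map each hostname to the titles of the results served from it.

    Assumes every item exposes `.title` and `.url.as_string()`;
    nothing else about the result objects is relied on.
    """
    grouped = defaultdict(list)
    for result in results:
        host = urlsplit(result.url.as_string()).hostname
        grouped[host].append(result.title)
    return dict(grouped)


# Hypothetical stand-ins for Poogle results, so the sketch runs offline.
class FakeUrl:
    def __init__(self, text):
        self._text = text

    def as_string(self):
        return self._text


class FakeResult:
    def __init__(self, title, url):
        self.title = title
        self.url = FakeUrl(url)


sample = [
    FakeResult('Welcome to Python.org', 'https://www.python.org/'),
    FakeResult('Python (programming language) - Wikipedia, the free encyclopedia',
               'https://en.wikipedia.org/wiki/Python_(programming_language)'),
]
grouped = group_titles_by_host(sample)
for host, titles in grouped.items():
    print(host, titles)
```

With a real search, the list returned by google_search() could presumably be passed in place of sample.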
Poogle
For more advanced usage, you will want to work with the Poogle class object directly.
from poogle import Poogle
search = Poogle('Python', per_page=10, start_page=1, lazy=True)
print(search.total_results) # Estimated number of total search results reported by Google
print(search.results)
print(search.next_page()) # Returns an object containing the search results for the next page only
print(search.results)
The above code will result in output like the following:
159000000
[<PoogleResult Container: "u'Welcome to Python.org'">, <PoogleResult Container: "u'Python (programming language) - Wikipedia, the free encyclopedia'">, <PoogleResult Container: "u'Python | Codecademy'">, <PoogleResult Container: "u'Python tutorial - TutorialsPoint'">, <PoogleResult Container: "u'Python - Reddit'">, <PoogleResult Container: "u'Learn Python The Hard Way'">]
<PoogleResultsPage Container: Page 2>
[<PoogleResult Container: "u'Welcome to Python.org'">, <PoogleResult Container: "u'Python (programming language) - Wikipedia, the free encyclopedia'">, <PoogleResult Container: "u'Python | Codecademy'">, <PoogleResult Container: "u'Python tutorial - TutorialsPoint'">, <PoogleResult Container: "u'Python - Reddit'">, <PoogleResult Container: "u'Learn Python The Hard Way'">, <PoogleResult Container: "u'Python (programming language) - Wikipedia, the free encyclopedia'">, <PoogleResult Container: "u'Python | Codecademy'">, <PoogleResult Container: "u'Python tutorial - TutorialsPoint'">, <PoogleResult Container: "u'Python - Reddit'">, <PoogleResult Container: "u'Learn Python The Hard Way'">, <PoogleResult Container: "u'Learn Python'">, <PoogleResult Container: "u"Newest 'python' Questions - Stack Overflow"">, <PoogleResult Container: "u'Programming for Everybody (Getting Started with Python ... - Coursera'">, <PoogleResult Container: "u"Python - Programming - Books & Videos - O'Reilly Media"">]
At the moment, the only major difference between the google_search() function and the Poogle class is that the class can fetch more search result pages after a query has already been executed.
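Fetching several pages in a row can be wrapped in a small helper. This is a sketch under stated assumptions rather than Poogle's own API: collect_pages and FakeSearch are hypothetical names, and the helper assumes only what the example above shows, namely that the first page is already loaded, that next_page() fetches one more page, and that results accumulates everything fetched so far. How Poogle behaves once Google runs out of pages is not covered here.

```python
def collect_pages(search, pages):
    """Load pages until `pages` have been fetched, then return all results.

    Assumes `search` behaves like the Poogle object shown earlier: page one
    is already loaded, `next_page()` fetches one more page, and `results`
    holds every result fetched so far.
    """
    for _ in range(pages - 1):
        search.next_page()
    return list(search.results)


class FakeSearch:
    """Hypothetical stand-in for a Poogle search so the sketch runs offline."""

    def __init__(self, per_page):
        self.per_page = per_page
        self.page = 1
        self.results = self._page_items(1)

    def _page_items(self, page):
        return ['result %d-%d' % (page, i) for i in range(1, self.per_page + 1)]

    def next_page(self):
        self.page += 1
        items = self._page_items(self.page)
        self.results.extend(items)
        return items


search = FakeSearch(per_page=3)
all_results = collect_pages(search, pages=3)
print(len(all_results))
```

Returning a copy of results keeps the caller's list stable if the search object later fetches more pages.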
CLI
For documentation on how to use the Poogle command line utility, run poogle --help.
The only command available at the moment is search:
Usage: poogle search [OPTIONS] QUERY

  Execute a Google search query and display the results

Options:
  -r, --results INTEGER  The number of search results to retrieve
  --plain                Disables bolding and keyword highlighting
  --help                 Show this message and exit.
Using it is pretty straightforward. Just be sure to quote your search query if it contains more than one word.
$ poogle search -r 2 "Python"
Executing search query for Python
Welcome to Python.org
==============================
https://www.python.org/
Python (programming language) - Wikipedia, the free encyclopedia
==============================
https://en.wikipedia.org/wiki/Python_(programming_language)
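When the command is assembled from a script, the quoting can be left to the standard library. The sketch below is illustrative only: build_search_command is a hypothetical helper, and it uses just the -r and --plain options listed in the help text above. Keeping the query as a single list item means a multi-word query stays one argument, and shlex.join (Python 3.8+) produces the shell-quoted form.

```python
import shlex


def build_search_command(query, results=None, plain=False):
    """Return the argument list for a `poogle search` invocation."""
    args = ['poogle', 'search']
    if results is not None:
        args += ['-r', str(results)]
    if plain:
        args.append('--plain')
    args.append(query)  # one list item, so spaces in the query are safe
    return args


command = build_search_command('learn python', results=2, plain=True)
print(command)
print(shlex.join(command))  # shell-quoted form, suitable for copy and paste
```

The list form can be handed to subprocess.run(command) as is, assuming the poogle executable is on the PATH.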