Skip to main content

An Awesome App Store Review Scraper 🧹

Project description

build PRs Welcome PyPI downloads license code style

Quickstart

Install:

pip3 install python-app-store-scraper

Scrape reviews for an app:

from app_store_scraper import AppStore
from pprint import pprint

facebook = AppStore(country="us", app_name="facebook")
facebook.review(how_many=20)

pprint(facebook.reviews)
pprint(facebook.reviews_count)

Scrape reviews for a podcast:

from app_store_scraper import Podcast
from pprint import pprint

sysk = Podcast(country="us", app_name="stuff you should know")
sysk.review(how_many=20)

pprint(sysk.reviews)
pprint(sysk.reviews_count)

Instantiation

There are two required and one positional parameters:

  • country (required)
  • app_name (required)
    • name of an iOS application to fetch reviews for
    • also used by search_id() method to search for app_id internally
  • app_id (positional)
    • can be passed directly
    • or ignored to be obtained by search_id method internally

Once instantiated, the object can be examined:

>>> facebook
AppStore(country='us', app_name='facebook', app_id=284882215)
>>> print(app)
     Country | us
        Name | facebook
          ID | 284882215
         URL | https://apps.apple.com/us/app/facebook/id284882215
Review count | 0

Other optional parameters are:

  • log_format
    • passed directly to logging.basicConfig(format=log_format)
    • default is "%(asctime)s [%(levelname)s] %(name)s - %(message)s"
  • log_level
    • passed directly to logging.basicConfig(level=log_level)
    • default is "INFO"
  • log_interval
    • log is produced every 5 seconds (by default) as a "heartbeat" (useful for a long scraping session)
    • default is 5

Fetching Review

The maximum number of reviews fetched per request is 20. To minimise the number of calls, the limit of 20 is hardcoded. This means the review() method will always grab more than the how_many argument supplied with an increment of 20.

>>> facebook.review(how_many=33)
>>> facebook.reviews_count
40

If how_many is not provided, review() will terminate after all reviews are fetched.

NOTE the review count seen on the landing page differs from the actual number of reviews fetched. This is simply because only some users who rated the app also leave reviews.

Optional Parameters

  • after
    • a datetime object to filter older reviews
  • sleep
    • an int to specify seconds to sleep between each call

Review Data

The fetched review data are loaded in memory and live inside reviews attribute as a list of dict.

>>> facebook.reviews
[{'userName': 'someone', 'rating': 5, 'date': datetime.datetime(...

Each review dictionary has the following schema:

{
    "date": datetime.datetime,
    "isEdited": bool,
    "rating": int,
    "review": str,
    "title": str,
    "userName": str
 }

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

python-app-store-scraper-1.1.1.tar.gz (6.6 kB view details)

Uploaded Source

Built Distribution

File details

Details for the file python-app-store-scraper-1.1.1.tar.gz.

File metadata

File hashes

Hashes for python-app-store-scraper-1.1.1.tar.gz
Algorithm Hash digest
SHA256 6339dc7b179e705a5218c95d4fcacac191fa034c5760f4130e9364d395ab34c0
MD5 f9a62bed3d920351e2efa1d09f1b7262
BLAKE2b-256 2515102773c533534a53889f2157f8cdfdebba4b392907c87991b1ad029fe438

See more details on using hashes here.

File details

Details for the file python_app_store_scraper-1.1.1-py3-none-any.whl.

File metadata

File hashes

Hashes for python_app_store_scraper-1.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 99ce30fca8fd40779c63818dccbb87ea918655e523e8a93eb29a5df9071835c1
MD5 da5945e9ebd3f5fdc29930c569d5bf41
BLAKE2b-256 ee92a58f5e3ab8d3ceef2cfbb9b391ad71fadc00a4a8fd28f5fa3d71d8cbaec9

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page