An Awesome App Store Review Scraper 🧹
Project description
Quickstart
Install:
pip3 install python-app-store-scraper
Scrape reviews for an app:
from app_store_scraper import AppStore
from pprint import pprint
facebook = AppStore(country="us", app_name="facebook")
facebook.review(how_many=20)
pprint(facebook.reviews)
pprint(facebook.reviews_count)
Scrape reviews for a podcast:
from app_store_scraper import Podcast
from pprint import pprint
sysk = Podcast(country="us", app_name="stuff you should know")
sysk.review(how_many=20)
pprint(sysk.reviews)
pprint(sysk.reviews_count)
Instantiation
There are two required and one positional parameters:
country
(required)- two-letter country code of ISO 3166-1 alpha-2 standard
app_name
(required)- name of an iOS application to fetch reviews for
- also used by
search_id()
method to search forapp_id
internally
app_id
(positional)- can be passed directly
- or ignored to be obtained by
search_id
method internally
Once instantiated, the object can be examined:
>>> facebook
AppStore(country='us', app_name='facebook', app_id=284882215)
>>> print(app)
Country | us
Name | facebook
ID | 284882215
URL | https://apps.apple.com/us/app/facebook/id284882215
Review count | 0
Other optional parameters are:
log_format
- passed directly to
logging.basicConfig(format=log_format)
- default is
"%(asctime)s [%(levelname)s] %(name)s - %(message)s"
- passed directly to
log_level
- passed directly to
logging.basicConfig(level=log_level)
- default is
"INFO"
- passed directly to
log_interval
- log is produced every 5 seconds (by default) as a "heartbeat" (useful for a long scraping session)
- default is
5
Fetching Review
The maximum number of reviews fetched per request is 20. To minimise the number of calls, the limit of 20 is hardcoded. This means the review()
method will always grab more than the how_many
argument supplied with an increment of 20.
>>> facebook.review(how_many=33)
>>> facebook.reviews_count
40
If how_many
is not provided, review()
will terminate after all reviews are fetched.
NOTE the review count seen on the landing page differs from the actual number of reviews fetched. This is simply because only some users who rated the app also leave reviews.
Optional Parameters
after
- a
datetime
object to filter older reviews
- a
sleep
- an
int
to specify seconds to sleep between each call
- an
Review Data
The fetched review data are loaded in memory and live inside reviews
attribute as a list of dict.
>>> facebook.reviews
[{'userName': 'someone', 'rating': 5, 'date': datetime.datetime(...
Each review dictionary has the following schema:
{
"date": datetime.datetime,
"isEdited": bool,
"rating": int,
"review": str,
"title": str,
"userName": str
}
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file python-app-store-scraper-1.1.1.tar.gz
.
File metadata
- Download URL: python-app-store-scraper-1.1.1.tar.gz
- Upload date:
- Size: 6.6 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.10.12
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 6339dc7b179e705a5218c95d4fcacac191fa034c5760f4130e9364d395ab34c0 |
|
MD5 | f9a62bed3d920351e2efa1d09f1b7262 |
|
BLAKE2b-256 | 2515102773c533534a53889f2157f8cdfdebba4b392907c87991b1ad029fe438 |
File details
Details for the file python_app_store_scraper-1.1.1-py3-none-any.whl
.
File metadata
- Download URL: python_app_store_scraper-1.1.1-py3-none-any.whl
- Upload date:
- Size: 8.1 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.10.12
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 99ce30fca8fd40779c63818dccbb87ea918655e523e8a93eb29a5df9071835c1 |
|
MD5 | da5945e9ebd3f5fdc29930c569d5bf41 |
|
BLAKE2b-256 | ee92a58f5e3ab8d3ceef2cfbb9b391ad71fadc00a4a8fd28f5fa3d71d8cbaec9 |