Single API ☝ App Store Review Scraper 🧹
Project description
___ _____ _ _____
/ _ \ / ___| | / ___|
/ /_\ \_ __ _ __ \ `--.| |_ ___ _ __ ___ \ `--. ___ _ __ __ _ _ __ ___ _ __
| _ | '_ \| '_ \ `--. \ __/ _ \| '__/ _ \ `--. \/ __| '__/ _` | '_ \ / _ \ '__|
| | | | |_) | |_) | /\__/ / || (_) | | | __/ /\__/ / (__| | | (_| | |_) | __/ |
\_| |_/ .__/| .__/ \____/ \__\___/|_| \___| \____/ \___|_| \__,_| .__/ \___|_|
| | | | | |
|_| |_| |_|
Quickstart
Install:
pip3 install app-store-scraper
Scrape reviews for an app:
from app_store_scraper import AppStore
from pprint import pprint
minecraft = AppStore(country="nz", app_name="minecraft")
minecraft.review(how_many=20)
pprint(minecraft.reviews)
pprint(minecraft.reviews_count)
Scrape reviews for a podcast:
from app_store_scraper import Podcast
from pprint import pprint
sysk = Podcast(country="nz", app_name="stuff you should know")
sysk.review(how_many=20)
pprint(sysk.reviews)
pprint(sysk.reviews_count)
Extra Details
Let's continue from the code example used in Quickstart.
Instantiation
There are two required and one positional parameters:
country
(required)- two-letter country code of ISO 3166-1 alpha-2 standard
app_name
(required)- name of an iOS application to fetch reviews for
- also used by
search_id()
method to search forapp_id
internally
app_id
(positional)- can be passed directly
- or ignored to be obtained by
search_id
method internally
Once instantiated, the object can be examined:
>>> minecraft
AppStore(country='nz', app_name='minecraft', app_id=479516143)
>>> print(app)
Country | nz
Name | minecraft
ID | 479516143
URL | https://apps.apple.com/nz/app/minecraft/id479516143
Review count | 0
Other optional parameters are:
log_format
- passed directly to
logging.basicConfig(format=log_format)
- default is
"%(asctime)s [%(levelname)s] %(name)s - %(message)s"
- passed directly to
log_level
- passed directly to
logging.basicConfig(level=log_level)
- default is
"INFO"
- passed directly to
log_interval
- log is produced every 5 seconds (by default) as a "heartbeat" (useful for a long scraping session)
- default is
5
Fetching Review
The maximum number of reviews fetched per request is 20. To minimise the number of calls, the limit of 20 is hardcoded. This means the review()
method will always grab more than the how_many
argument supplied with an increment of 20.
>>> minecraft.review(how_many=33)
>>> minecraft.reviews_count
40
If how_many
is not provided, review()
will terminate after all reviews are fetched.
NOTE the review count seen on the landing page differs from the actual number of reviews fetched. This is simply because only some users who rated the app also leave reviews.
Review Data
The fetched review data are loaded in memory and live inside reviews
attribute as a list of dict.
>>> minecraft.reviews
[{'userName': 'someone', 'rating': 5, 'date': datetime.datetime(...
Each review dictionary has the following schema:
{
"date": datetime.datetime,
"isEdited": bool,
"rating": int,
"review": str,
"title": str,
"userName": str
}
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file app-store-scraper-0.3.3.tar.gz
.
File metadata
- Download URL: app-store-scraper-0.3.3.tar.gz
- Upload date:
- Size: 7.0 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/47.1.0 requests-toolbelt/0.9.1 tqdm/4.49.0 CPython/3.8.5
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 75f61323f8e9550763dcea98c3020722e63f82bf3ea9fa130a820af18f038db4 |
|
MD5 | bebf5d6a87c7ee0a8fcf5cc5e8624449 |
|
BLAKE2b-256 | 44766a1a6d58c4d07d1c261e8f73c6344c26181f68b230f7bbe53467148f7b45 |
File details
Details for the file app_store_scraper-0.3.3-py3-none-any.whl
.
File metadata
- Download URL: app_store_scraper-0.3.3-py3-none-any.whl
- Upload date:
- Size: 8.2 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/47.1.0 requests-toolbelt/0.9.1 tqdm/4.49.0 CPython/3.8.5
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | fb7dfc2598bbe76b5c2b362f32dfdced4f1dab1ca74d516c3ef26e4e8ffc74ba |
|
MD5 | 0ab497e4aa9e9d091a1d7f811fc27839 |
|
BLAKE2b-256 | 8f70d412d4620fd31ec26d995e7f399f13d92c9e5d917414e71e4aa4f3d37b77 |