Datasette plugin for working with Apple's binary plist format
Project description
datasette-bplist
Datasette plugin for working with Apple's binary plist format.
This plugin adds two features: a display hook and a SQL function.
The display hook will detect any database values that are encoded using the binary plist format. It will decode them, convert them into JSON and display them pretty-printed in the Datasette UI.
The SQL function bplist_to_json(value)
can be used inside a SQL query to convert a binary plist value into a JSON string. This can then be used with SQLite's json_extract()
function or with the datasette-jq plugin to further analyze that data as part of a SQL query.
Install this plugin in the same environment as Datasette to enable this new functionality:
pip install datasette-bplist
Trying it out
If you use a Mac you already have plenty of SQLite databases that contain binary plist data.
One example is the database that powers the Apple Photos app.
This database tends to be locked, so you will need to create a copy of the database in order to run queries against it:
cp ~/Pictures/Photos\ Library.photoslibrary/database/photos.db /tmp/photos.db
The database also makes use of custom SQLite extensions which prevent it from opening in Datasette.
You can work around this by exporting the data that you want to experiment with into a new SQLite file.
I recommend trying this plugin against the RKMaster_dataNote
table, which contains plist-encoded EXIF metadata about the photos you have taken.
You can export that table into a fresh database like so:
sqlite3 /tmp/photos.db ".dump RKMaster_dataNote" | sqlite3 /tmp/exif.db
Now run datasette /tmp/exif.db
and you can start trying out the plugin.
Using the bplist_to_json() SQL function
Once you have the exif.db
demo working, you can try the bplist_to_json()
SQL function.
Here's a query that shows the camera lenses you have used the most often to take photos:
select
json_extract(
bplist_to_json(value),
"$.{Exif}.LensModel"
) as lens,
count(*) as n
from RKMaster_dataNote
group by lens
order by n desc;
If you have a large number of photos this query can take a long time to execute, so you may need to increase the SQL time limit enforced by Datasette like so:
$ datasette /tmp/exif.db \
--config sql_time_limit_ms:10000
Here's another query, showing the time at which you took every photo in your library which is classified as as screenshot:
select
attachedToId,
json_extract(
bplist_to_json(value),
"$.{Exif}.DateTimeOriginal"
)
from RKMaster_dataNote
where
json_extract(
bplist_to_json(value),
"$.{Exif}.UserComment"
) = "Screenshot"
And if you install the datasette-cluster-map plugin, this query will show you a map of your most recent 1000 photos:
select
*,
json_extract(
bplist_to_json(value),
"$.{GPS}.Latitude"
) as latitude,
-json_extract(
bplist_to_json(value),
"$.{GPS}.Longitude"
) as longitude,
json_extract(
bplist_to_json(value),
"$.{Exif}.DateTimeOriginal"
) as datetime
from
RKMaster_dataNote
where
latitude is not null
order by
attachedToId desc
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distributions
Built Distribution
File details
Details for the file datasette_bplist-0.1-py3-none-any.whl
.
File metadata
- Download URL: datasette_bplist-0.1-py3-none-any.whl
- Upload date:
- Size: 7.6 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/40.6.2 requests-toolbelt/0.9.1 tqdm/4.32.1 CPython/3.6.8
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 3befe172d5621287634020e933af9b531567782058a9c1e0f969bbb72488d192 |
|
MD5 | 198af42a127b8c2d953489b72fd0d6ec |
|
BLAKE2b-256 | 4714269ee90f057a4672a837a976f4cd96b8622469b2945ddcd62f46dec577db |