Self tracking your online life!
Project description
nostalgia_chrome
Cross-platform Chrome History Analysis
Self tracking
There is a movement of self tracking. Monitoring pulse, heartbeat and so on. But the most important is not being tracked: our online behavior.
Making sure we can self document, we need the following things.
-
Chrome only keeps its history for a max of 90 days, so we need to start saving history.
-
We need to collect HTML data from the pages we visit.
-
We need to extract and analyze data from the HTML, such as code snippets, links, microdata, images, events.. anything really. This is done in Nostalgia Core.
-
Allow plugins (and make them configurable, please contribute). The first example is that it will additionally track which videos you watch.
What can you expect (Data overview)
In ~/nostalgia_data/meta.jsonl
an index will be saved per visit:
{
"path":"/home/pascal/nostalgia_data/html/1576317113.7_httpsgithubcomnostalgiadevnostalgia_chrome.html.gz",
"url": "https://github.com/nostalgia-dev/nostalgia_chrome",
"time":"1576317113.75019"
}
In ~/nostalgia_data/html
the source HTML will be stored as .html.gz
(reaching about 8x compression).
In ~/nostalgia_data/videos_watched.jsonl
the data for events on HTML5 video elements will be stored (on stop playing/close tab):
{
"playingSince": 1576273573.08,
"seekTime": 0,
"playingUntil": 1576273599.977,
"duration": 26.8970000744,
"totalClipDuration": 3510.301,
"pageLoadTime": 1576266470.316,
"loc": "https://www.youtube.com/watch?v=Zz-bhLjVS5o",
"title": "Lost Frequencies | Tomorrowland Mainstage 2019 (Full Set) - YouTube",
"likes": 24137,
"dislikes": 946
}
Installation
-
Clone this repository:
git clone git@github.com:nostalgia-dev/nostalgia_chrome.git
-
In Chrome click the settings button and click "More tools" and navigate to "Extensions". Click "Load unpacked". Navigate to the
chromePlugin
folder and click "Open". -
pip install nostalgia_chrome
-
To test it out, run
nostalgia_chrome run_server
. This will run the web server in the foreground so you can see that it works. -
Visit a (non-file / localhost) URL so that you can verify it works. The data will be stored in
~/nostalgia_data/meta.jsonl
,~/nostalgia_data/html
. -
To make sure
nostalgia_chrome
gets automatically run on boot:
On Linux (systemctl based):
pip install sysdm
sysdm create "nostalgia_chrome run_server" --extensions ""
On Windows awaiting contribution for how to do this https://github.com/nostalgia-dev/nostalgia_chrome/issues/2
On OSX awaiting contribution for how to do this https://github.com/nostalgia-dev/nostalgia_chrome/issues/1
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for nostalgia_chrome-0.0.24-py2.py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | def84146606b61bff893511f5485be6d48fc87894e67a97b038b3bcb60a6a541 |
|
MD5 | 93f5b404f6438f19cf6b162acf12658a |
|
BLAKE2b-256 | 3a5d5eb5a00ff1dea920641d86d7123b82b6880daae6135601ab8e5daca144bb |