Self tracking your online life!
nostalgia_chrome
Cross-platform Chrome History Analysis
Self tracking
There is a growing self-tracking movement: people monitor their pulse, heartbeat and so on. But the most important thing is not being tracked: our online behavior.
To be able to document our own online behavior, we need the following things.
- Chrome only keeps its history for a maximum of 90 days, so we need to start saving history.
- We need to collect HTML data from the pages we visit.
- We need to extract and analyze data from the HTML, such as code snippets, links, microdata, images, events... anything really. This is done in Nostalgia Core.
- Allow plugins (and make them configurable; contributions are welcome). The first example plugin additionally tracks which videos you watch.
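To give an idea of the kind of extraction meant in the third point, here is a minimal sketch that pulls links out of a saved page using only the Python standard library. It does not use Nostalgia Core's API; the names LinkCollector and extract_links are hypothetical and exist only for this illustration.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class LinkCollector(HTMLParser):
    """Collect link targets from <a href="..."> tags (illustrative helper)."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        for name, value in attrs:
            if name == "href" and value:
                # Resolve relative links against the URL of the visited page.
                self.links.append(urljoin(self.base_url, value))


def extract_links(html, base_url):
    """Return all link targets found in `html`, in document order."""
    parser = LinkCollector(base_url)
    parser.feed(html)
    parser.close()
    return parser.links
```

The same pattern (subclass HTMLParser, react to the tags of interest) would work for images or other elements; a real extractor would likely need a more forgiving parser for messy pages.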
What can you expect (Data overview)
In ~/.nostalgia/meta.jsonl an index will be saved per visit:
{
"path":"/home/pascal/.nostalgia/html/1576317113.7_httpsgithubcomnostalgiadevnostalgia_chrome.html.gz",
"url": "https://github.com/nostalgia-dev/nostalgia_chrome",
"time":"1576317113.75019"
}
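A minimal sketch of reading this index from Python. It assumes the file is in JSON Lines form (one JSON object per line, as the .jsonl extension suggests) with the three fields shown above; read_index and the added visited_at key are hypothetical names for this example only.

```python
import json
from datetime import datetime, timezone


def read_index(meta_path):
    """Yield one dict per visit from a meta.jsonl-style file."""
    with open(meta_path, encoding="utf-8") as f:
        for line in f:
            line = line.strip()
            if not line:
                continue  # tolerate blank lines
            record = json.loads(line)
            # "time" is a string of epoch seconds in the example record,
            # so convert it to float before building a datetime.
            record["visited_at"] = datetime.fromtimestamp(
                float(record["time"]), tz=timezone.utc
            )
            yield record
```

It could be used by passing the index path, for example read_index(os.path.expanduser("~/.nostalgia/meta.jsonl")), and looping over the resulting records.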
In ~/.nostalgia/html the source HTML will be stored as .html.gz (reaching about 8x compression).
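The stored pages can be read back with Python's gzip module. The sketch below assumes the files are UTF-8 text (undecodable bytes are replaced rather than raising); load_html and compression_ratio are hypothetical helper names, and the ratio you measure will vary per page.

```python
import gzip
import os


def load_html(path):
    """Return the decompressed HTML of one stored .html.gz file as a string."""
    # Assumption: pages are UTF-8 encoded; bad bytes are replaced.
    with gzip.open(path, "rt", encoding="utf-8", errors="replace") as f:
        return f.read()


def compression_ratio(path):
    """Return uncompressed size divided by on-disk size for one file."""
    compressed_size = os.path.getsize(path)
    with gzip.open(path, "rb") as f:
        raw_size = len(f.read())
    return raw_size / compressed_size
```

The path argument would typically be the "path" field of a record in meta.jsonl.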
In ~/.nostalgia/videos_watched.jsonl the data for events on HTML5 video elements will be stored (on stop playing/close tab):
{
"playingSince": 1576273573.08,
"seekTime": 0,
"playingUntil": 1576273599.977,
"duration": 26.8970000744,
"totalClipDuration": 3510.301,
"pageLoadTime": 1576266470.316,
"loc": "https://www.youtube.com/watch?v=Zz-bhLjVS5o",
"title": "Lost Frequencies | Tomorrowland Mainstage 2019 (Full Set) - YouTube",
"likes": 24137,
"dislikes": 946
}
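As an illustration of what can be done with these events, the following sketch sums the watched seconds per page. It assumes one JSON object per line and that "duration" is the length of the playing interval in seconds (in the example above it equals playingUntil minus playingSince); watch_time_per_page is a hypothetical name.

```python
import json
from collections import defaultdict


def watch_time_per_page(path):
    """Return a dict mapping page URL ("loc") to total watched seconds."""
    totals = defaultdict(float)
    with open(path, encoding="utf-8") as f:
        for line in f:
            if not line.strip():
                continue  # tolerate blank lines
            event = json.loads(line)
            # Each event covers one uninterrupted stretch of playback.
            totals[event["loc"]] += event["duration"]
    return dict(totals)
```

Sorting the result by value would give a rough "most watched" list.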
Installation
- Clone this repository: git clone git@github.com:nostalgia-dev/nostalgia_chrome.git
- In Chrome click the settings button, click "More tools" and navigate to "Extensions". Click "Load unpacked". Navigate to the chromePlugin folder and click "Open".
- pip install nostalgia_chrome
- To test it out, run nostalgia_chrome serve. This will run the web server in the foreground so you can see that it works.
- Visit a URL (anything other than a file or localhost address) so that you can verify it works. The data will be stored in ~/.nostalgia/meta.jsonl and ~/.nostalgia/html.
- To make sure nostalgia_chrome runs automatically on boot, look at the boot_as_service folder for how to run nostalgia_chrome as a service on boot.
Note: contributions of service files are welcome; see the corresponding Windows issue and OSX issue.
Hashes for nostalgia_chrome-0.0.14-py2.py3-none-any.whl
Algorithm | Hash digest
---|---
SHA256 | b146af2b2913d92c851463eeb6b81ded829748c42bc0a5c6c61248c215031771
MD5 | d663f2de0eca33f457a3fcede1793002
BLAKE2b-256 | d1b7c3c98b0fd9797c47912e04c6b8643f99849e8910b877ead40500612ef980