protagonist

A tagsystem. Organises your files with non-hierarchical tags.

These details have not been verified by PyPI

Project links

Homepage

Development Status
- 3 - Alpha
Intended Audience
- Developers
License
- DFSG approved
- OSI Approved :: GNU General Public License (GPL)
Natural Language
- English
Operating System
- Unix
Programming Language
Topic
- Software Development :: Libraries

Project description

Protagonist implements a tagsystem: it is used for tagging files non-hierarchically, so that they can be found with boolean queries. Protagonist interfaces a particular filesystem structure (described below) that represents a tagsystem through the use of special directories and links.

(For usage examples, see EXAMPLES.rst.)

A major design constraint of this project is to provide seamless compatibility with Tahoe-LAFS backup storage. Tahoe-LAFS backup will take a directory and recursively back up the files in every subdirectory. It does not have a native representation of symbolic links.

Structure

The tagsystem structure is defined in a special directory named “.protagonist/”.

There is a subdirectory named “.protagonist/tags”, and a subdirectory “.protagonist/tags/t” for every existing tag, t. Any file which is tagged with t is given a unique identifier and a hard link in the directory t.

More generally, an intended future feature is that the tagsystem supports key-value pairs, with the directories “.protagonist/key/value/”

Why hard links?

Protagonist could be implemented with symbolic links, but hard links were chosen for the following reasons:

For compatibility with Tahoe-LAFS backup storage.
To enable reference counting and tracking logic.
To reduce layers of indirection.

Unique identifiers

For unique file identification, Protagonist uses the (20 byte) BLAKE2 hash of the contents at the time of tagging, with the same file extension as the original file. The potential advantages of content-based hashing are:

Recovery from file movement.
Discovery of untagged files (could also be done with inodes).
Identification of multiple copies of the same file.
Enabling integrity checking of immutable files.

Using content-based hashing on mutable files will not have these advantages, though it could be used to help clarify intent, by, for example, allowing the user to specifically list files that should stay unchanged.

Using hard links has repercussions for deleting files that have been tagged. With symlinks, a deleted or moved file will leave broken links. With hard links, there needs to be special logic for determining whether the file exists outside of the tagsystem. This is implemented in the command line interface, which wraps rm and mv.

Truenames

A problem with using unique IDs is that now the result of a tagls is not recognisable to the user. This motivates the truenames index.

Part of the goals of this project is to use the directory structure of the underlying filesystem as a primitive that can be used as a data structure. Therefore, I wish to avoid symlinks and databases in favour of filesystem mechanisms.

The design I have chosen is to make a special directory called “truenames”, where there is a file named with the file ID, the contents of which are the true pathname. This is eerily like implementing symlinks. It essentially is symlinks, but stealth symlinks, that aren’t flagged as such in the inode tables, and therefore aren’t followed by commands that normally follow symlinks.

Methods

The tagsystem supports:

addition and deletion of tags
tagging and untagging files
querying with boolean combinations
cleanly removing and moving files from the filesystem even if they have been tagged.

creation of a tagsystem

The tagsystem is automatically created with the first tagging of a file. Tagsystem creation is idempotent. If there is already a tagsystem there, nothing is changed.

untagging files

When we wish to remove a tag, t, from a file, f:

If f was the only file with tag t, then tag t should also be removed from the tagsystem.
Because files with identical content have the same file id, a request to untag f, when f is not tagged, but an identical file f’ is tagged could result in untagging the wrong file copy. Therefore care must be taken to assure that f is the correct link in the file. For this we will use inodes instead of content hashing.

deleting tags

Tag deletion can be done even if some files have the tag. Those links just go away, and if I file is left with no tags, it is removed from the truenames index.

Dependencies

pyblake2

Project details

These details have not been verified by PyPI

Project links

Homepage

Development Status
- 3 - Alpha
Intended Audience
- Developers
License
- DFSG approved
- OSI Approved :: GNU General Public License (GPL)
Natural Language
- English
Operating System
- Unix
Programming Language
Topic
- Software Development :: Libraries

Release history Release notifications | RSS feed

This version

0.1.12

Jul 9, 2014

0.1.11

Jul 5, 2014

0.1.9

Jul 5, 2014

0.1.8

Jul 4, 2014

0.1.7

Jul 4, 2014

0.1.6

Jul 3, 2014

0.1.5

Jul 3, 2014

0.1.4

Jul 3, 2014

0.1.3

Jul 2, 2014

0.1.2

Jul 2, 2014

0.1.1

Jul 2, 2014

0.1.0-dirty

Jul 2, 2014

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

protagonist-0.1.12.tar.gz (25.9 kB view details)

Uploaded Jul 9, 2014 Source

File details

Details for the file protagonist-0.1.12.tar.gz.

File metadata

Download URL: protagonist-0.1.12.tar.gz
Upload date: Jul 9, 2014
Size: 25.9 kB
Tags: Source
Uploaded using Trusted Publishing? No

File hashes

Hashes for protagonist-0.1.12.tar.gz
Algorithm	Hash digest
SHA256	`d9bc204d35ea7bad4024f7eff85a0ed3a9667931a2a12dc3de411a17c5a61d0c`
MD5	`a778b9c3b04bf488d9440c86a67f9053`
BLAKE2b-256	`6899652c1e51d9e870fcab6fef69d26664852544666ea219b2d016501c86acc5`

See more details on using hashes here.

protagonist 0.1.12

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Structure

Why hard links?

Unique identifiers

Truenames

Methods

creation of a tagsystem

untagging files

deleting tags

Dependencies

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

File details

File metadata

File hashes