Skip to main content

A tool for managing virtual file attributes

Project description

Viat

Tests PyPI Package AUR Package

A tool for managing virtual file attributes.

Viat allows recording file attributes in a plain text file. The main unit of operation is a vault, which is determined by .viat subdirectory. In the simplest case, this subdirectory contains config.toml, storage.toml and possibly schema.json.

In short, in an empty vault, the command

viat set file.pdf --raw attr value

puts the following into storage.toml:

["file.pdf"]
attr = "value"

Determining which files are tracked by Viat is done via tracker providers, while storing the attributes is done via storage providers. Both protocols are very general and new providers can easily be added.

Table of contents

See also the documentation on ReadTheDocs

Usage

We give a usage tutorial here; refer to the online documentation or to the man page viat(1) for more details.

Command-line usage

First, a vault must be initialized:

viat init

The vault is determined by a .viat subfolder that contains config.toml and storage.toml files (JSON is also supported for both). We can immediately set attributes for any file on the file system:

$ viat update tractatus.pdf '{"author": "Ludwig Wittgenstein", "year": 1921}'
Warning: File 'tractatus.pdf' is not being tracked.
{"author": "Ludwig Wittgenstein", "year": 1921}

All stored attributes for the file get printed; in this case the only stored attributes are those we have just added. We also get a warning saying that the vault's tracker does not know about this file.

The role of the tracker is to enumerate the files that are explicitly tracked by the vault. The default glob-based tracking provider requires explicit patterns. We can track all PDF files in the root of the vault using the following configuration:

[tracker.glob]
patterns = ["*.pdf"]

With this, we can add new properties without warnings:

$ viat set tractatus.pdf rating 4
{"author": "Ludwig Wittgenstein", "year": 1921, "rating": 4}

The above worked because "true" is a valid JSON value; if we were to set a string instead, we would have to escape it in quotes, which is inconvenient. Instead, we can treat the value as a string by passing the --raw flag:

$ viat set --raw tractatus.pdf publisher 'Annalen der Naturphilosophie'
...

It makes sense to utilize JSON schemas. Let us add the following to .viat/schema.json:

{
  "type":"object",
  "properties": {
    "year": {"type": "number"}
  }
}

Now we can no longer set the year to anything that is not a number:

$ viat set tractatus.pdf --raw year string
Error: Validation error for 'tractatus.pdf': data.year must be number.

The essence of the tool is that the attributes are stored in plain text formats that can be edited committed to version control. For example, .viat/storage.toml should now look as follows:

["tractatus.pdf"]
author = "Ludwig Wittgenstein"
year = 1921
rating = 4
publisher = "Annalen der Naturphilosophie"

If we manually change the year to "string", we will get a warning when loading the vault:

$ viat get tractatus.pdf rating
Warning: Validation error in stored data for 'tractatus.pdf': data.year must be number.
4

If we move tractatus.pdf to book.pdf, viat will no longer know about it:

$ viat get book.pdf rating
Warning: File 'book.pdf' is not being tracked.
Error: Attribute 'rating' has not been set for 'book.pdf'.

Such discrepancies can be determined relatively easily:

$ viat stale
tractatus.pdf
$ viat tracked --no-data
book.pdf

For such cases, we provide the helpers viat mv and viat rm, but otherwise avoid being too clever.

Programmatic usage

The programmatic usage is straightforward enough because of the API reference. Here is a brief continuation of the above example:

vault = autoload_vault()

assert vault.tracker.is_tracked('tractatus.pdf')

# This context manager validates and writes the file upon exiting.
# If no mutators have been used, no validation and writing is performed.
with vault.storage as conn:
    # The inner lock-based context managers allow either read-only or read-write operations.

    # Readers are Mapping instances.
    with conn.get_reader('tractatus.pdf') as reader:
        print(mut['year'])

    # Mutators are MutableMapping instances.
    with conn.get_mutator('tractatus.pdf') as mut:
        mut['year'] = 1921

Installation

The viat PyPI package contains the core programmatic API.

To install the viat executable for the current user, you can use pipx or uv:

pipx install viat
uv tool install viat

The git tracker requires the git extra.

To install from GitHub, you must use the following:

uv tool install viat --from git+https://github.com/v--/viat

Sometimes a particular feature branch need to be tested. For installing a fixed revision (i.e. common/branch/tag), the following should work (if extra-name is needed, use viat@rev[extra-name]):

uv tool install viat --from git+https://github.com/v--/viat@rev

To install viat from a cloned repository, you can use the following:

uv sync
uv build --wheel
# Once built, we can install using uv
uv tool install viat --from dist/*.whl
# or pipx
pipx install --include-deps dist/*.whl

Tasks inside the repository like linting and testing use are summarized in poe.toml (configuration for poethepoet). For example, building the documentation requires some hacks, but can be done using a single command:

uv run poe docs-build

[!TIP] An AUR package is available for reference, as well as a GitHub Action. If you are packaging this for some other package manager, consider using PEP-517 tools as shown in this PKGBUILD file.

Motivation

When managing lots of files, there comes a point when metadata needs to be attached to them somehow.

  • Different file systems offer extended file attributes. Unfortunately, poor software support reduces their utility. For example, curl --xattr <url> will record some attributes, but they will be lost on copy (with GNU cp at least) and will not be tracked by git.

  • git attributes are obviously supported by git, but other tools have to consult git in order to use them. Furthermore, there is no convenient mechanism for setting git attributes.

  • XMP (extensible metadata platform) files are designed to be used by arbitrary tools and can be easily tracked using version control, but are cumbersome to manage.

Perhaps I am missing some other approaches, but at this point it should be clear that there is no convenient way to manage file metadata. A long time ago I wrote a small script that tracked "virtual" attributes across a directory by putting them into a single JSON file. At some point I decided to refine the script, and so Viat was born.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

viat-0.9.6.post1.tar.gz (23.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

viat-0.9.6.post1-py3-none-any.whl (45.3 kB view details)

Uploaded Python 3

File details

Details for the file viat-0.9.6.post1.tar.gz.

File metadata

  • Download URL: viat-0.9.6.post1.tar.gz
  • Upload date:
  • Size: 23.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: uv/0.11.13 {"installer":{"name":"uv","version":"0.11.13","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for viat-0.9.6.post1.tar.gz
Algorithm Hash digest
SHA256 63a784cd78aa2872bcf626a06882f0df9361bc65a68eb34403be878f110323c3
MD5 0addd66f6e43991030f9e02e520b29f6
BLAKE2b-256 686783406a1ebd097a52e86b147560e1ca15c95933f422127cf6e98db0fdc97b

See more details on using hashes here.

File details

Details for the file viat-0.9.6.post1-py3-none-any.whl.

File metadata

  • Download URL: viat-0.9.6.post1-py3-none-any.whl
  • Upload date:
  • Size: 45.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: uv/0.11.13 {"installer":{"name":"uv","version":"0.11.13","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for viat-0.9.6.post1-py3-none-any.whl
Algorithm Hash digest
SHA256 2793e8b37149345218cae8c5693f82b39f3b2ef12f8ded9238dc53fb6a631d7e
MD5 ef57efc789cfa822f803b5c270834239
BLAKE2b-256 c402a154bb0d192bef6cd5dd834b109fa98bfe91e25e43a9a6cbb29082524182

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page