Knowledge management app to interface with quantum chemical calculations
Project description
Quantum Chemistry Database
Python app to build a knowledge database from a directory structure of ORCA calculations.
The database takes the form of a directory structure of markdown files, representing structure and information on the calculations. This allows establishing relationships between calculations and forms a lab notebook type structure.
Installation
There is nothing to install - given you're using uv.
The QCDB tool is on PyPI, so you can use it asynchroniously with uv tool run (uvx for short):
uvx qcdb --root /path/to/project/root --vault-dir /path/to/vault
To inspect the vault, download and use Obsidian and use it to inspect the vault.
Architecture
The scraper can be updated in the future to
- support more QC codes
- extract more metadata
- automatically prepare dashboards for e.g. SCF or optimization convergence, IR spectra, ..
- be parallel
The idea is that we can scrape arbitrary directory structures and "load them into the database" - except here we use a managed directory structure (the "vault") with plain Markdown.
This "vault" is the database.
We don't store the actual calculation's data (too big), but represent it in our knowledge graph as a Markdown (.md) note with
the automatically extracted metadata and links to other notes that e.g. use the same geometry.
All the metadata is stored in the YAML frontmatter of each note:
---
charge: 0
mult: 1
date: '2024-06-05'
geometry: '47 ...'
input: '! D4 TPSSh ..'
..
---
# Notes
Experimenting with increased grid size to converge [[link_to_previous, failed calc]]
...
The beauty is that we retain the information from failed experiments as well, without needing to hold onto the raw data itself. We only keep the knowledge we gained from the experiment.
By (automatically) creating provenance trees (i.e. calculation "timelines", what led to what etc) we can, in the future, pick a specific result and export its provenance tree only. This way we can fully reproduce a given result with minimal needed experiments.
Because all the notes are plain .md files, the whole vault can be versioned (and shared?) with git.
The links are also important for e.g. selecting a bunch of calculations, and then exporting them as a fully self-contained vault, or building a citation bibliography from them, or compiling geometry tables for the Supporting Information, or archiving them for long-term storage, ...
By defining custom code-fences, one can create new experiments direclty in the vault and then have a runner run them. Imagine a code block like
! D4 TPSSh OPT FREQ
and linking a geometry. Then run e.g. uvx qcdb --run /path/to/vault to detect and run such experiments automatically.
One also gets to use the whole Obsidian ecosystem to directly work with the knowledge graph:
- use regular markdown notes to link e.g. meeting notes or papers with experiments
- write short notes on insights gained, and link them - create a network of knowledge, and how it came to be (reproducibly!)
- use obsidian bases or the dataview plugin to create tables straight from the notes
- use excalidraw plugin for visual note taking and experiment design
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file qcdb-0.1.1.tar.gz.
File metadata
- Download URL: qcdb-0.1.1.tar.gz
- Upload date:
- Size: 53.7 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.7.9
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
5f074bd4bf6e60a65fec60ca89b28f7845f87bedcbb08021a36f80eedec7e298
|
|
| MD5 |
d660660201f7dba30db1644534ea2ea3
|
|
| BLAKE2b-256 |
d521f9b5398584cc15144146844fbf6ae5c32218c45e3e2e2eca886ab2b68401
|
File details
Details for the file qcdb-0.1.1-py3-none-any.whl.
File metadata
- Download URL: qcdb-0.1.1-py3-none-any.whl
- Upload date:
- Size: 5.6 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.7.9
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
2665a00f0df4c2311aafde34df8c25b86c85b0e2ed9336b37ef8fb4dc9e740bd
|
|
| MD5 |
7d3f4e61d602ce54e22bc8478598924f
|
|
| BLAKE2b-256 |
7995c783dc7fdb4b46d79b20d368fd3a117c2f5668773d024ded85b509c418c3
|