Skip to main content

KBKit: Kirkwood-Buff Analysis Toolkit

Project description

KBKit: Kirkwood-Buff Analysis Toolkit

License PyPI version Powered by: Pixi Code style: ruff Coverage Status docs python 3.12

KBKit is a Python package for automated Kirkwood-Buff (KB) analysis of molecular simulation data. It provides tools to parse simulation outputs, compute Kirkwood-Buff integrals, and extract thermodynamic properties for binary and multicomponent systems. KBKit supports flexible workflows, including:

  • Parsing and processing of simulation data (e.g., RDFs, densities)
  • Calculation of KB integrals and related thermodynamic quantities
  • Integration of activity coefficient derivatives (numerical or polynomial)
  • Automated pipelines for batch analysis
  • Calculation of static structure factor and X-ray intensities in the limit of q → 0
  • Visualization tools for KB integrals, thermodynamic properties, and static structure factors

KBKit is designed for researchers in computational chemistry, soft matter, and statistical mechanics who need robust, reproducible KB analysis from simulation data. The package is modular, extensible, and integrates easily with Jupyter notebooks and Python scripts.

Installation

Quick install via PyPI

pip install kbkit

Developer install (recommended for contributors or conda users)

Clone the GitHub repository and use the provided Makefile to set up your development environment:

git clone https://github.com/aperoutka/kbkit.git
cd kbkit
make setup-dev

This one-liner creates the kbkit-dev conda environment, installs kbkit in editable mode, and runs the test suite.

To install without running tests:

make dev-install

To build and install the package into a clean user environment:

make setup-user

For a full list of available commands:

make help

File Organization

For running kbkit.Pipeline or its dependencies, the following file structure is required: a structured directory layout that separates mixed systems from pure components. This organization enables automated parsing, reproducible KB integrals, and scalable analysis across chemical systems.

  • NOTE: KBKit currently only supports parsing for GROMACS files.

An example of file structure:

kbi_dir/
├── project/
   └── system/
       ├── rdf_dir/
          ├── mol1_mol1.xvg
          ├── mol1_mol2.xvg
          └── mol1_mol2.xvg
       ├── system_npt.edr
       ├── system_npt.gro
       └── system.top
└── pure_components/
    └── molecule1/
        ├── molecule1_npt.edr
        └── molecule1.top

Requirements:

  • Each system to be analyzed must include:
    • rdf_dir/ containing .xvg RDF files for all pairwise interactions
      • Both molecule IDs in RDF calculation MUST BE in filename
    • either .top topology file or .gro structure file (.gro is recommended)
    • .edr energy file
  • Each pure component must include:
    • either .top topology file or .gro structure file (.gro is recommended)
    • .edr energy file
    • all other files (optional)

Examples

Below are several examples on various ways to implement KBKit. See examples for a more complete example on the ethanol/water binary system.

Calculating Kirkwood-Buff integrals on a single RDF

import os
from kbkit.analysis import KBIntegrator
from kbkit.systems import SystemProperties

syspath = "./examples/test_data/ethanol_water_26C/sys_405"
rdf_path = os.path.join(sys_path, "kbi_rdf_files_gmx25", "rdf_ETHOL_SPCEW.xvg")

# create integrator object from single RDF file
integrator = KBIntegrator(
    rdf_file=rdf_path,
    system_properties=SystemProperties(sys_path),
    use_fixed_rmin=False,
)

# calculate KBI in thermodynamic limit
kbi = integrator.kbi_limit(mol_j="SPCEW")

Run an automated pipeline for batch analysis

from kbkit import Pipeline

# Set up and run the pipeline
pipe = Pipeline(
    base_path="./examples/test_data/ethanol_water_26C", # directory with system data
    pure_path="./examples/test_data/pure_components",   # directory with pure component data
    pure_systems=["ETHOL_300", "SPCEW_300"],            # list of pure systems
    ensemble="npt",                                     # ensemble type: npt or nvt
    gamma_integration_type="numerical",                 # integration method
    verbose=False                                       # logging verbosity
)

# run kbkit pipeline
pipe.run()

# Access the results properties
# stored in dataclass (ThermoProperty); attributes: name, value, units
# example for excess energy
ge_obj = pipe.get("ge")
print("GE summary: ", ge_array.shape)

# Convert units from kJ/mol -> kcal/mol
# default units will be those from GROMACS
pipe.convert_units("ge", "kcal/mol")

# make figures for select thermodynamic properties
pipe.plot(
    molecule_map={"ETHOL": "ethanol", "SPCEW": "water"}, # dictionary mapping MD names to label names
    x_mol="ETHOL"                                        # MD name of molecule for x-axis
)

Parse GROMACS files

import os
from kbkit.parsers import TopFileParser, EdrFileParser, GroFileParser

syspath = "./examples/test_data/ethanol_water_26C/sys_405"
top_file = os.path.join(syspath, "sys_405.top")
gro_file = os.path.join(syspath, "sys_405.gro")
edr_file = os.path.join(syspath, "sys_405_npt.edr")

# determines molecules present in simulation and their counts
top_parser = TopFileParser(top_file)
print("molecule dict: ", top_parser.molecule_counts)
print("molecule names: ", top_parser.molecules)
print("total molecule number: ", top_parser.total_molecules)

# determines electron count for each molecule type
gro_parser = GroFileParser(gro_file)
print("electron dict: ", gro_parser.electron_count)
print("box volume: ", gro_parser.compute_box_volume())

# computes energy properties by calling gmx energy
edr_parser = EdrFileParser(edr_file)
print("List of available properties: ", edr_parser.available_properties())
print("Density array over simulation time: ", edr_parser.extract_timeseries("density"))
print("Average density with std deviation: ", edr_parser.average_property("density", return_std=True))

Credits

This package was created with Cookiecutter and the jevandezande/pixi-cookiecutter project template.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

kbkit-1.0.13.tar.gz (64.6 kB view details)

Uploaded Source

File details

Details for the file kbkit-1.0.13.tar.gz.

File metadata

  • Download URL: kbkit-1.0.13.tar.gz
  • Upload date:
  • Size: 64.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.7

File hashes

Hashes for kbkit-1.0.13.tar.gz
Algorithm Hash digest
SHA256 e947ebca9fcccf2ffb7308bac3bbbacb212fc37a425ac817ab9a1e37029a76b2
MD5 73923daa0f7bf243d2921f3745179166
BLAKE2b-256 6d4c3fbbb2d2e8982b7a7acc0553e6d259033687be6d565d44b40585f7bc6358

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page