pymatgen-db

Pymatgen-db is a database add-on for the Python Materials Genomics (pymatgen) materials analysis library.

These details have not been verified by PyPI

Project links

Homepage

Project description

Pymatgen-db is a database add-on for the Python Materials Genomics (pymatgen) materials analysis library. It enables the creation of Materials Project-style MongoDB databases for management of materials data. A query engine is also provided to enable the easy translation of MongoDB docs to useful pymatgen objects for analysis purposes.

Major change

From v2021.5.13, pymatgen-db is now a proper namespace add-on to pymatgen. In other words, you no longer import from matgendb but rather pymatgen.db.

Getting pymatgen-db

Stable version

The easiest way to install pymatgen-db on any system is to use pip, as follows:

pip install pymatgen-db

Requirements

All required python dependencies should be automatically taken care of if you install pymatgen-db using easy_install or pip. Otherwise, these packages should be available on PyPI.

Python 3.7+ required.
Pymatgen 2022+, including all dependencies associated with it. Please refer to the pymatgen docs for detailed installation instructions.
Pymongo 3.3+: For interfacing with MongoDb.
MongoDB 2.2+: Get it at the MongoDB website.

Usage

A powerful command-line script (mgdb) provides most of the access to many of the features in pymatgen-db, including db initialization, insertion of data, running the materials genomics ui, etc. To see all options available, type:

mgdb --help

Initial setup

The first step is to install and setup MongoDB on a server of your choice. The MongoDB manual is an excellent place to start. For the purposes of testing out the tools here, you may simply download the binary distributions corresponding to your OS from the MongoDB website, and then running the following commands:

# For Mac and Linux OS.
mkdir test_db && mongod --dbpath test_db

This will create a test database and start the Mongo daemon. Once you are done with testing, you can simply press Ctrl-C to stop the server and delete the “test_db” folder. Running a Mongo server this way is insecure as Mongo does not enable authentication by default. Please refer to the MongoDB manual when setting up your production database.

After your server is up, you should create a database config file by running the following command:

mgdb init -c db.json

This will prompt you for a few parameters to create a database config file, which will make it much easier to use mgdb in future. Note that the config file name can be anything of your choice, but using “db.json” will allow you to use mgdb without explicitly specifying the filename in future. If you are just testing using the test database, simply hit Enter to accept the defaults for all settings.

For more advanced use of the “db.json” config file (e.g., specifying aliases, defaults, etc., please refer to the following sample.

Inserting calculations

To insert an entire directory of runs (where the topmost directory is “dir_name”) into the database, use the following command:

# Note that "-c db.json" may be omitted if the config filename is the
# current directory under the default filename of db.json.

mgdb insert -c db.json dir_name

A sample run has been provided for download for testing purposes. Unzip the file and run the above command in the directory.

Querying a database

Sometimes, more fine-grained querying is needed (e.g., for subsequent postprocessing and analysis).

The mgdb script allows you to make simple queries from the command line:

# Query for the task id and energy per atom of all calculations with
# formula Li2O. Note that the criteria has to be specified in the form of
# a json string. Note that "-c db.json" may be omitted if the config
# filename is the current directory under the default filename of db.json.

mgdb query -c db.json --crit '{"pretty_formula": "Li2O"}' --props task_id energy_per_atom

For more advanced queries, you can use the QueryEngine class for which an alias is provided at the root package. Some examples are as follows:

>>> from pymatgen.db import QueryEngine
# Depending on your db.json, you may need to supply keyword args below
# for `port`, `database`, `collection`, etc.
>>> qe = QueryEngine()

#Print the task id and formula of all entries in the database.
>>> for r in qe.query(properties=["pretty_formula", "task_id"]):
...     print "{task_id} - {pretty_formula}".format(**r)
...
12 - Li2O

# Get a pymatgen Structure from the task_id.
>>> structure = qe.get_structure_from_id(12)

# Get pymatgen ComputedEntries using a criteria.
>>> entries = qe.get_entries({})

The language follows very closely to pymongo/MongoDB syntax, except that QueryEngine provides useful aliases for commonly used fields as well as translation to commonly used pymatgen objects like Structure and ComputedEntries.

Extending pymatgen-db

Currently, pymatgen-db is written with standard VASP runs in mind. However, it is perfectly extensible to any kind of data, e.g., other kinds of VASP runs (bandstructure, NEB, etc.) or just any form of data in general. Developers looking to adapt pymatgen-db for other purposes should look at the VaspToDbTaskDrone class as an example and write similar drones for their needs. The QueryEngine can generally be applied to any Mongo collection, with suitable specification of aliases if desired.

How to cite pymatgen-db

If you use pymatgen and pymatgen-db in your research, please consider citing the following work:

Shyue Ping Ong, William Davidson Richards, Anubhav Jain, Geoffroy Hautier, Michael Kocher, Shreyas Cholia, Dan Gunter, Vincent Chevrier, Kristin A. Persson, Gerbrand Ceder. Python Materials Genomics (pymatgen) : A Robust, Open-Source Python Library for Materials Analysis. Computational Materials Science, 2013, 68, 314-319. doi:10.1016/j.commatsci.2012.10.028

Project details

These details have not been verified by PyPI

Project links

Homepage

Release history Release notifications | RSS feed

This version

2023.7.18

Jul 18, 2023

2023.2.23

Feb 23, 2023

2022.5.20

May 20, 2022

2022.5.2

May 2, 2022

2022.3.17

Mar 17, 2022

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pymatgen-db-2023.7.18.tar.gz (63.3 kB view details)

Uploaded Jul 18, 2023 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

pymatgen_db-2023.7.18-py3-none-any.whl (87.8 kB view details)

Uploaded Jul 18, 2023 Python 3

File details

Details for the file pymatgen-db-2023.7.18.tar.gz.

File metadata

Download URL: pymatgen-db-2023.7.18.tar.gz
Upload date: Jul 18, 2023
Size: 63.3 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/4.0.2 CPython/3.9.17

File hashes

Hashes for pymatgen-db-2023.7.18.tar.gz
Algorithm	Hash digest
SHA256	`ca0acab590c1cc1000d745b8001ddd2e26ee56612257754ce21d96db69b0f8b2`
MD5	`b53c5fed6d64f22a2d3985d59d718346`
BLAKE2b-256	`f807d728b338f2ecdd609d9f84f377021aeab87a13f1ea24a656e26f7622d27e`

See more details on using hashes here.

File details

Details for the file pymatgen_db-2023.7.18-py3-none-any.whl.

File metadata

Download URL: pymatgen_db-2023.7.18-py3-none-any.whl
Upload date: Jul 18, 2023
Size: 87.8 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/4.0.2 CPython/3.9.17

File hashes

Hashes for pymatgen_db-2023.7.18-py3-none-any.whl
Algorithm	Hash digest
SHA256	`7f9079ecc118cd5a7a4f942e6fd1e8611995210031a03a714ffee08d13fe5c74`
MD5	`7022fe76f67605226e49a2d0634b5517`
BLAKE2b-256	`97a44659cb9c8ed21cd80c36514d1fc84d818734d9d9a24e6d5528245c916393`

See more details on using hashes here.

pymatgen-db 2023.7.18

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Major change

Getting pymatgen-db

Stable version

Requirements

Usage

Initial setup

Inserting calculations

Querying a database

Extending pymatgen-db

How to cite pymatgen-db

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes