Machine Learning using atomic-scale calculations.

These details have not been verified by PyPI

Project links

Homepage

Development Status
- 4 - Beta
Programming Language

Project description

CatLearn

An environment for atomistic machine learning in Python for applications in catalysis.

Utilities for building and testing atomic machine learning models. Gaussian Processes (GP) regression machine learning routines are implemented. These will take any numpy array of training and test feature matrices along with a vector of target values.

In general, any data prepared in this fashion can be fed to the GP routines, a number of additional functions have been added that interface with ASE. This integration allows for the manipulation of atoms objects through GP predictions, as well as dynamic generation of descriptors through use of the many ASE functions.

Please see the tutorials for a detailed overview of what the code can do and the conventions used in setting up the predictive models. For an overview of all the functionality available, please read the documentation.

Installation
Usage
Tutorials
Functionality
Contribution

Installation

(Back to top)

The easiest way to install the code is with:

$ pip install catlearn

This will automatically install the code as well as the dependencies. Alternatively, you can clone the repository to a local directory with:

$ git clone https://github.com/SUNCAT-Center/CatLearn.git

And then put the <install_dir>/ into your $PYTHONPATH environment variable.

Be sure to install dependencies in with:

$ pip install -r requirements.txt

Docker

To use the docker image, it is necessary to have docker installed and running. After cloning the project, build and run the image as follows:

$ docker build -t catlearn .

Then it is possible to use the image in two ways. It is possible to run the docker image as a bash environment in which CatLearn can be used will all dependencies in place.

$ docker run -it catlearn bash

Or python can be run from the docker image.

$ docker run -it catlearn python2 [file.py]
$ docker run -it catlearn python3 [file.py]

Use Ctrl + d to exit the docker image when done.

Optional Dependencies

The tutorial scripts will generally output some graphical representations of the results etc. For these scripts, it is advisable to have at least matplotlib installed:

$ pip install matplotlib seaborn

Usage

(Back to top)

In the most basic form, it is possible to set up a GP model and make some predictions using the following lines of code:

import numpy as np
from catlearn.regression import GaussianProcess

# Define some input data.
train_features = np.arange(200).reshape(50, 4)
target = np.random.random_sample((50,))
test_features = np.arange(100).reshape(25, 4)

# Setup the kernel.
kernel = [{'type': 'gaussian', 'width': 0.5}]

# Train the GP model.
gp = GaussianProcess(kernel_list=kernel, regularization=1e-3,
                     train_fp=train_features, train_target=target,
                     optimize_hyperparameters=True)

# Get the predictions.
prediction = gp.predict(test_fp=test_features)

Tutorials

(Back to top)

The above sample of code will train a GP with the squared exponential kernel, fitting some random function. Of course, this isn't so useful, more helpful examples and test scripts are present for most features; primarily, please see the tutorials.

Functionality

(Back to top)

There is much functionality in CatLearn to assist in handling atom data and building optimal models. This includes:

API to other codes:
- Atomic simulation environment API
- Magpie API
- NetworkX API
Fingerprint generators:
- Bulk systems
- Support/slab systems
- Discrete systems
Preprocessing routines:
- Data cleaning
- Feature elimination
- Feature engineering
- Feature extraction
- Feature scaling
Regression methods:
- Regularized ridge regression
- Gaussian processes regression
Cross-validation:
- K-fold cv
- Ensemble k-fold cv
Machine Learning Algorithms
- Machine Learning Nudged Elastic Band (ML-NEB) algorithm.
General utilities:
- K-means clustering
- Neighborlist generators
- Penalty functions
- SQLite db storage

Contribution

(Back to top)

Anyone is welcome to contribute to the project. Please see the contribution guide for help setting up a local copy of the code. There are some TODO items in the README files for the various modules that give suggestions on parts of the code that could be improved.

Project details

These details have not been verified by PyPI

Project links

Homepage

Development Status
- 4 - Beta
Programming Language

Release history Release notifications | RSS feed

0.6.2

Mar 27, 2020

0.6.1

Apr 29, 2019

This version

0.6.0

Mar 21, 2019

0.6.0.dev3 pre-release

Feb 5, 2019

0.6.0.dev2 pre-release

Dec 4, 2018

0.6.0.dev1 pre-release

Nov 22, 2018

0.5.0

Oct 17, 2018

0.5.0.dev1 pre-release

Aug 24, 2018

0.4.4.post1

Aug 17, 2018

0.4.4

Aug 15, 2018

0.4.4.dev5 pre-release

Jul 31, 2018

0.4.4.dev4 pre-release

Jul 24, 2018

0.4.4.dev3 pre-release

Jul 24, 2018

0.4.4.dev2 pre-release

Jun 14, 2018

0.4.4.dev1 pre-release

Jun 8, 2018

0.4.3

May 30, 2018

0.4.2

May 17, 2018

0.4.2.dev3 pre-release

May 11, 2018

0.4.2.dev2 pre-release

May 10, 2018

0.4.2.dev1 pre-release

May 2, 2018

0.4.1.post1

Apr 26, 2018

0.4.1

Apr 26, 2018

0.4.0.dev3 pre-release

Apr 23, 2018

0.4.0.dev2 pre-release

Apr 23, 2018

0.4.0.dev1 pre-release

Apr 23, 2018

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

CatLearn-0.6.0.tar.gz (17.4 MB view details)

Uploaded Mar 21, 2019 Source

File details

Details for the file CatLearn-0.6.0.tar.gz.

File metadata

Download URL: CatLearn-0.6.0.tar.gz
Upload date: Mar 21, 2019
Size: 17.4 MB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/1.12.1 pkginfo/1.4.2 requests/2.21.0 setuptools/40.6.3 requests-toolbelt/0.9.1 tqdm/4.28.1 CPython/3.7.1

File hashes

Hashes for CatLearn-0.6.0.tar.gz
Algorithm	Hash digest
SHA256	`c71f6395e984fec3d081ac8fb169188126ac4bd7b3e61620df6272cf0b18af04`
MD5	`b60a07b042ecc51edea46d1ae517ca54`
BLAKE2b-256	`f3709fc605eef30287ecc53cffa0310932fdbc38574e628f4513276f2531340e`

See more details on using hashes here.

CatLearn 0.6.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

CatLearn

Table of contents

Installation

Docker

Optional Dependencies

Usage

Tutorials

Functionality

Contribution

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

File details

File metadata

File hashes