A Machine Learning and Informatics Program Suite for the Chemical and Materials Sciences
Project description
[![Build Status](https://travis-ci.org/hachmannlab/chemml.svg?branch=master)](https://travis-ci.org/hachmannlab/chemml)
[![codecov](https://codecov.io/gh/hachmannlab/chemml/branch/master/graph/badge.svg)](https://codecov.io/gh/hachmannlab/chemml)
# ChemML
ChemML is a machine learning and informatics program suite for the analysis, mining, and modeling of chemical and materials data.
## Code Design:
ChemML is developed in the Python 3 programming language and makes use of a host of data analysis and ML libraries(accessible through the Anaconda distribution), as well as domain-specific libraries.
The development follows a strictly modular and object-oriented design to make the overall code as flexible and versatile as possible.
The format of library is similar to the well known libraries like Scikit-learn. ChemML is also available
via graphical user interface provided by [ChemEco](https://github.com/hachmannlab/chemeco).
ChemEco is a general-purpose framework for data mining without coding. It also interfaces with many of the libraries that supply methods for the
representation, preprocessing, analysis, mining, and modeling of large-scale chemical data sets.
## Latest Version:
- Program Version: 0.4.4
- Release Date: March 25, 2019
## Installation and Dependencies:
You can download ChemML from Python Package Index (PyPI) via pip.
pip install chemml --user -U
Here is a list of external libraries that will be installed with chemml:
- numpy
- pandas
- tensorflow
- keras
- scikit-learn
- matplotlib
- seaborn
- lxml
Since conda installation is not available for ChemML yet, we recommend installing rdkit and openbabel in a conda virtual environment prior to
installing ChemML. For doing so, you need to follow the conda installer:
conda create --name my_chemml_env python=3.6
source activate my_chemml_env
conda install -c openbabel openbabel
conda install -c rdkit rdkit
pip install chemml
## Python 2 Fans
The library was initially compatible with Python 2 and 3, and we still develop compatible codes. However, since the Python community
and some of the dependencies are phasing out support for Python 2, we also removed it from the classifier languages.
However, you should still be able to clone the ChemML repository from github and install the older versions of some of the dependencies that
support Python 2 prior to installing ChemML locally via `pip install -e.`.
## Citation:
Please cite the use of ChemML as:
- M. Haghighatlari, J. Hachmann, ChemML – A Machine Learning and Informatics Program Suite for the Analysis, Mining, and Modeling of Chemical and Materials Data, in preparation (2018).
- M. Haghighatlari, J. Hachmann, A Machine Learning and Informatics Program Suite for Chemical and Materials Data Mining. Available from: https://hachmannlab.github.io/chemml.
- J. Hachmann, M.A.F. Afzal, M. Haghighatlari, Y. Pal, Building and Deploying a Cyberinfrastructure for the Data-Driven Design of Chemical Systems and the Exploration of Chemical Space, Mol. Simul. 44 (2018), 921-929. DOI: 10.1080/08927022.2018.1471692
## License:
ChemML is copyright (C) 2014-2018 Johannes Hachmann and Mojtaba Haghighatlari, all rights reserved.
ChemML is distributed under 3-Clause BSD License (https://opensource.org/licenses/BSD-3-Clause).
## About us:
### Maintainers:
- Johannes Hachmann, hachmann@buffalo.edu
- Mojtaba Haghighatlari
University at Buffalo - The State University of New York (UB)
### Contributors:
- Doaa Altarawy (MolSSI): software scientist and mentor
- Ramachandran Subramanian (UB): Magpie descriptor library port
- Gaurav Vishwakarma (UB): automated model optimization
- Bhargava Urala Kota (UB): library database
- Aditya Sonpal (UB): debugging
- Srirangaraj Setlur (UB): scientific advice
- Venugopal Govindaraju (UB): scientific advice
- Krishna Rajan (UB): scientific advice
- We encourage any contributions and feedback. Feel free to fork and make pull-request to the "development" branch.
### Acknowledgements:
- ChemML is based upon work supported by the U.S. National Science Foundation under grant #OAC-1751161 and in part by #OAC-1640867.
- ChemML was also supported by start-up funds provided by UB's School of Engineering and Applied Science and UB's Department of Chemical and Biological Engineering, the New York State Center of Excellence in Materials Informatics through seed grant #1140384-8-75163, and the U.S. Department of Energy under grant #DE-SC0017193.
- Mojtaba Haghighatlari received a 2018 Phase-I Software Fellowship by the Molecular Sciences Software Institute (MolSSI) for his work on ChemML.
[![codecov](https://codecov.io/gh/hachmannlab/chemml/branch/master/graph/badge.svg)](https://codecov.io/gh/hachmannlab/chemml)
# ChemML
ChemML is a machine learning and informatics program suite for the analysis, mining, and modeling of chemical and materials data.
## Code Design:
ChemML is developed in the Python 3 programming language and makes use of a host of data analysis and ML libraries(accessible through the Anaconda distribution), as well as domain-specific libraries.
The development follows a strictly modular and object-oriented design to make the overall code as flexible and versatile as possible.
The format of library is similar to the well known libraries like Scikit-learn. ChemML is also available
via graphical user interface provided by [ChemEco](https://github.com/hachmannlab/chemeco).
ChemEco is a general-purpose framework for data mining without coding. It also interfaces with many of the libraries that supply methods for the
representation, preprocessing, analysis, mining, and modeling of large-scale chemical data sets.
## Latest Version:
- Program Version: 0.4.4
- Release Date: March 25, 2019
## Installation and Dependencies:
You can download ChemML from Python Package Index (PyPI) via pip.
pip install chemml --user -U
Here is a list of external libraries that will be installed with chemml:
- numpy
- pandas
- tensorflow
- keras
- scikit-learn
- matplotlib
- seaborn
- lxml
Since conda installation is not available for ChemML yet, we recommend installing rdkit and openbabel in a conda virtual environment prior to
installing ChemML. For doing so, you need to follow the conda installer:
conda create --name my_chemml_env python=3.6
source activate my_chemml_env
conda install -c openbabel openbabel
conda install -c rdkit rdkit
pip install chemml
## Python 2 Fans
The library was initially compatible with Python 2 and 3, and we still develop compatible codes. However, since the Python community
and some of the dependencies are phasing out support for Python 2, we also removed it from the classifier languages.
However, you should still be able to clone the ChemML repository from github and install the older versions of some of the dependencies that
support Python 2 prior to installing ChemML locally via `pip install -e.`.
## Citation:
Please cite the use of ChemML as:
- M. Haghighatlari, J. Hachmann, ChemML – A Machine Learning and Informatics Program Suite for the Analysis, Mining, and Modeling of Chemical and Materials Data, in preparation (2018).
- M. Haghighatlari, J. Hachmann, A Machine Learning and Informatics Program Suite for Chemical and Materials Data Mining. Available from: https://hachmannlab.github.io/chemml.
- J. Hachmann, M.A.F. Afzal, M. Haghighatlari, Y. Pal, Building and Deploying a Cyberinfrastructure for the Data-Driven Design of Chemical Systems and the Exploration of Chemical Space, Mol. Simul. 44 (2018), 921-929. DOI: 10.1080/08927022.2018.1471692
## License:
ChemML is copyright (C) 2014-2018 Johannes Hachmann and Mojtaba Haghighatlari, all rights reserved.
ChemML is distributed under 3-Clause BSD License (https://opensource.org/licenses/BSD-3-Clause).
## About us:
### Maintainers:
- Johannes Hachmann, hachmann@buffalo.edu
- Mojtaba Haghighatlari
University at Buffalo - The State University of New York (UB)
### Contributors:
- Doaa Altarawy (MolSSI): software scientist and mentor
- Ramachandran Subramanian (UB): Magpie descriptor library port
- Gaurav Vishwakarma (UB): automated model optimization
- Bhargava Urala Kota (UB): library database
- Aditya Sonpal (UB): debugging
- Srirangaraj Setlur (UB): scientific advice
- Venugopal Govindaraju (UB): scientific advice
- Krishna Rajan (UB): scientific advice
- We encourage any contributions and feedback. Feel free to fork and make pull-request to the "development" branch.
### Acknowledgements:
- ChemML is based upon work supported by the U.S. National Science Foundation under grant #OAC-1751161 and in part by #OAC-1640867.
- ChemML was also supported by start-up funds provided by UB's School of Engineering and Applied Science and UB's Department of Chemical and Biological Engineering, the New York State Center of Excellence in Materials Informatics through seed grant #1140384-8-75163, and the U.S. Department of Energy under grant #DE-SC0017193.
- Mojtaba Haghighatlari received a 2018 Phase-I Software Fellowship by the Molecular Sciences Software Institute (MolSSI) for his work on ChemML.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
chemml-0.5.0.tar.gz
(146.0 kB
view details)
Built Distribution
chemml-0.5.0-py3-none-any.whl
(196.5 kB
view details)
File details
Details for the file chemml-0.5.0.tar.gz
.
File metadata
- Download URL: chemml-0.5.0.tar.gz
- Upload date:
- Size: 146.0 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/1.11.0 pkginfo/1.4.2 requests/2.18.4 setuptools/39.0.1 requests-toolbelt/0.8.0 tqdm/4.23.4 CPython/2.7.15
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 91164a1d6ae06c77176fb940ad01a9e9190624c77288e3d2361170b6c24e629a |
|
MD5 | e5d91cc51c9fc3ce83d3c6aaa1483591 |
|
BLAKE2b-256 | 6686a2ede5e0fa1c736c25786fcdb7b61b61f3893a1f82d094c1209c58f712e4 |
File details
Details for the file chemml-0.5.0-py3-none-any.whl
.
File metadata
- Download URL: chemml-0.5.0-py3-none-any.whl
- Upload date:
- Size: 196.5 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/1.11.0 pkginfo/1.4.2 requests/2.18.4 setuptools/39.0.1 requests-toolbelt/0.8.0 tqdm/4.23.4 CPython/2.7.15
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 0b19f1b322ff0acead2a41afe5ef0eab77ab845948825cba56f01cb2f4dabde4 |
|
MD5 | de847ef85a67c9f5e9e12707bc0440cb |
|
BLAKE2b-256 | a42e782e440cb264f5dffc61816c8aa3718e1ef07bef73dec436502e63ecac20 |