Python toolkit for Albert Invent
Project description
Albert Invent Data Science Library
The Albert Invent Data Science Library is a set of wrappers and helper functions which can be used to build data science applications on top of the albert platform.
Non-Python Dependencies
Docker
If you want to use the docker image for doing development you will need to install the docker runtime (or docker desktop for windows/macos). If you are going to be running the docker image on an ubuntu system with a GPU you will need to additionally install the nvidia docker runtime -- instructions for which can be found at https://docs.nvidia.com/datacenter/cloud-native/container-toolkit/install-guide.html
Getting Started
After installing the albert-toolkit package, you can setup your environment configuration by running the command albert init
and following the prompts. Any credentials that you do not have (e.g. if you do not have data warehouse credentials) you can simply leave them blank.
If you will be using clearML integrations then be sure to have your clearml pip install command ready as you will be prompted for it as part of the init process. If you have already setup clearml in your virtual environment prior to installing albert-toolkit then you can simply answer no to the prompt and it will leave your current configuration untouched.
Running Unit Tests
NOTE: All the following commands should be run from within a virtual environment. And they will all require that your albert API token is setup correctly
albert-toolkit utilizes pytest as its testing framework -- to run the full suite of unit tests use the helper script and the ta
command:
$ ./al.sh ta
Alternatively if you want to run the tests in a fresh venv (i.e. test the build/install/unit test workflow) run the command tb
-- This is useful for checking that you have correctly supplied all dependencies in the package setup.cfg.
$ ./al.sh tb
Note that running the above command will look for lib-jwt-python at the relative path ../lib-jwt-python
if it is not located there it will attempt to clone the repo using ssh and will fail if you don't have your github ssh keys setup correct.
Package Installation
Until we get a pypi package setup you will need to clone this repository and perform a pip install locally.
Choosing an Install Environment
Depending on the application you may not need or want the full development stack associated with this library. You can therefore install different dependencies using the square bracket syntax e.g. albert-toolkit[viz]
if you only want to install a certain set of dependencies. This is particularly useful if you are building out an application or microservice which only depends upon a subset of the full albert-toolkit library, and you do not want to install unused dependencies.
If you specify no environment tag then you will just the get base code for albert-toolkit installed into your environment and most of it will not function without the required dependencies -- so be sure to choose one of the following stacks when installing the library.
Note the use of quotes in the package install commands below e.g. "albert-toolkit[dev]"
-- the quotes are required and you will get an error if you forget to include them.
Full Development Stack
The full development stack can be installed using the [dev]
or [all]
tags. The development environment contains support for juptyerlab and all the associated ipython dependencies. It should not be installed in a deployed application
$ pip install "albert-toolkit[dev]" #This should be run from within your python virtual environment
Visualization Stack
The visualization stack includes only those dependencies which are needed to utilize the visualization components of the albert-toolkit library. It can be installed with the [viz]
tag
$ pip install "albert-toolkit[viz]" #This should be run from within your python virtual environment
Metrics Stack
The metrics stack includes only those depenencies which are necessary to compute metrics (for example: if you have a microservice architecture and you are running lots of predictions and storing those in a database, you may want to have a microservice which is dedicated to computing performance/accuracy/etc... metrics and hence that service should not require any of the other stacks)
$ pip install "albert-toolkit[metrics]" #This should be run from within your python virtual environment
NLP Stack
The NLP Stack includes only those dependencies which are necessary to compute NLP transforms or embeddings, including helper functions for transforming or analyzing text data
$ pip install "albert-toolkit[nlp]" #This should be run from within your python virtual environment
Models Stack
The models stack includes all the dependencies necessary to build or run models. Installing the Models stack will also install the metrics stack
$ pip install "albert-toolkit[models]" #This should be run from within your python virtual environment
Chemistry Stack
The chemistry stacks includes all the dependencies needed to run the chemistry modules -- the proper way to install the chemistry stack is throught he use of the albert
cli tool. Simply run the following to install the chemistry dependencies after installing the main albert toolkit
$ albert init-chem
This will require cmake
and CUDA to be installed.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file albert_toolkit-0.1.1.tar.gz
.
File metadata
- Download URL: albert_toolkit-0.1.1.tar.gz
- Upload date:
- Size: 286.4 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/1.4.0 CPython/3.10.9 Darwin/22.3.0
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 271b614d6dbc4023c17f856ac7e0161f6277a1951f7194923de3db440a236123 |
|
MD5 | 2b157ab5508e87098f14199515c00d5d |
|
BLAKE2b-256 | fc076139712d836e3a5f04b5fe688e19d0b3ca082dfd96c876c6849097c10f9d |
File details
Details for the file albert_toolkit-0.1.1-py3-none-any.whl
.
File metadata
- Download URL: albert_toolkit-0.1.1-py3-none-any.whl
- Upload date:
- Size: 739.7 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/1.4.0 CPython/3.10.9 Darwin/22.3.0
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | aa787078746abaf2385a17766d770855c7d955525579977eaeb0a4c619b18338 |
|
MD5 | 75d1688128188046428dc502d6094c15 |
|
BLAKE2b-256 | 2ec9ccb69fb10c77dda06dac2774c6fa24b6d25fc3de90e27c8e6d3f20b81e7c |