A library for deploying Numer.ai Prediction Nodes.
Project description
numerai-cli
Welcome to the Numerai CLI for the Numerai Tournament. Numerai CLI (aka Compute Heavy) is a tool to help you set up cloud automation for your Numerai predictions. This README is designed to have EVERYTHING you need to setup and maintain a Numerai Compute Heavy node.
This CLI runs on your local computer to configure a Numerai Prediction Node in the cloud. This solution is architected to cost less than $5/mo on average, but actual costs may vary. It has been tested on MacOS/OSX, Windows 8/10, and Ubuntu 18/20, but should theoretically work anywhere that Docker and Python 3 are available.
Contents
- Getting Started
- Troubleshooting and Feedback
- Billing Alerts
- Prediction Node Architecture
- Special Thanks
Getting Started
To use this tool you need:
- Numerai API keys
- Paid cloud provider account
- Python
- Docker
-
Sign up a Numerai Account, get your Numerai API Keys, and your first Model:
- Sign up at https://numer.ai/signup and log in to your new account
- Go to https://numer.ai/account > "Your API keys" section > click "Add"
- Name your key and check all boxes under "Scope this key will have"
- Enter your password and click "Confirm"
- Copy your secret public key and secret key somewhere safe
-
Choose your cloud provider and follow the corresponding guide:
-
Install Docker and Python for your Operating System (if you encounter errors or your OS is not supported, please see Troubleshooting and Feedback):
-
Mac Terminal (cmd + space, type
terminal
, selectterminal.app
):curl https://raw.githubusercontent.com/numerai/numerai-cli/master/scripts/setup-mac.sh | bash
-
Ubuntu 18/20 Terminal (ctrl + alt + t):
sudo apt update && sudo apt install -y libcurl4 curl && sudo curl https://raw.githubusercontent.com/numerai/numerai-cli/master/scripts/setup-ubu.sh | sudo bash
-
Windows 10 Command Prompt (windows key, type
cmd
, select Command Prompt):powershell -command "$Script = Invoke-WebRequest 'https://raw.githubusercontent.com/numerai/numerai-cli/master/scripts/setup-win10.ps1'; $ScriptBlock = [ScriptBlock]::Create($Script.Content); Invoke-Command -ScriptBlock $ScriptBlock"
-
-
After the setup script confirms Python and Docker, install
numerai-cli
via:pip3 install --upgrade numerai-cli --user
NOTES:
- This command will also work to update to new versions of the package in the future.
- If you are using python venv then drop the --user option. If you don't know what that is, disregard this note.
-
Run these commands on your personal computer (not a temporary instance) to get an example node running in minutes:
For AWS run:
numerai setup --provider aws
and enter your user's keys when prompted.
For Azure users run:
numerai setup --provider azure
and enter your application's keys when prompted.
For GCP users run:
numerai setup --provider gcp
and enter the complete path to your downloaded key when prompted.
numerai node config --example tournament-python3 numerai node deploy numerai node test
If you want to use larger instances to generate your predictions first run
numerai list-constants
to list the vCPU/mem presets available, then you can configure a node to use one of the presets via:numerai node config -s mem-lg
Your compute node is now setup and ready to run! When you make changes to your code or re-train your model, simply deploy and test your node again:
numerai node deploy numerai node test
NOTES:
- These commands have stored configuration files in
$USER_HOME/.numerai/
. DO NOT LOSE THIS FILE! or else you will have to manually delete every cloud resource by hand. - The example node trains a model in the cloud, which is bad. You should train locally, pickle the trained model, deploy your node, then unpickle your model to do the live predictions
- The default example does not make stake changes; please refer to the numerapi docs for the methods you must call to do this.
- You can view resources and logs in the AWS Console (region us-east-1) for your ECS Cluster and other resources
- These commands have stored configuration files in
List of Commands
Use the --help
option on any command or sub-command to get a full description of it:
numerai
numerai --help
numerai [command] --help
numerai [command] [sub-command] --help
Each command or sub-command takes its own options, for example if you want to copy the numerai signals example and configure a node for a signals model with large memory requirements you'd use something like this (replacing [MODEL NAME] with the relevant signals model name):
numerai node -m [MODEL NAME] -s config -s mem-lg -e signals-python3
Here, the node
command takes a model name with -m
and a flag -s
to detect if it's
a signals model or numerai model. The config
sub-command also takes a -s
option to
specify the size of the node to configure.
Upgrading
Upgrading numerai-cli will always require you to update the package itself using pip:
pip install --upgrade numerai-cli --user
numerai upgrade
0.1/0.2 to 0.3
CLI 0.3 uses a new configuration format that is incompatible with versions 0.1 and 0.2,
but a command to migrate you configuration is provided for you. Run this in the directory
you ran numerai setup
from the previous version:
numerai upgrade
0.3 to 0.4
CLI 0.4 introduces a new provider option (Microsoft Azure) and moves the default aws
terraform into a subdirectory. You'll need to run upgrade
:
numerai upgrade
If you want to use azure, follow the setup guide for azure then run:
numerai setup --provider azure
numerai node config --provider azure
0.4 to 1.0
CLI 1.0 is a big change for AWS, you'll need to:
- update the permissions for the role you created in the AWS Setup Guide
- update the terraform with
numerai setup
- replace your aws nodes by running:
numerai node -m <model_name> destroy --preserve-node-config
numerai node -m <model_name> config
numerai node -m <model_name> deploy
Beyond
Some updates will make changes to configuration files used by Numerai CLI. These will require you to re-run some commands to upgrade your nodes to the newest versions:
numerai setup
will copy over changes to files in the$HOME/.numerai
directorynumerai node config
will apply those changes to a node
Uninstalling
numerai uninstall
Troubleshooting and Feedback
Please review this entire section and check github issues before asking questions. If you've exhausted this document, then join us on Discord
If you still cannot find a solution or answer, please join us on Discord and include the following information with your issue/message:
- The commands you ran that caused the error (even previous commands)
- Version information from running:
pip3 show numerai-cli
python -V
docker -v
- System Information from running
- Mac:
system_profiler SPSoftwareDataType && system_profiler SPHardwareDataType
- Linux:
lsb_release -a && uname -a
- Windows:
powershell -command "Get-ComputerInfo"
- Mac:
Python
If the environment setup script fails to install Python3 for you, report the error to Numerai and then install it manually with one of the following options:
- Download Python3 directly
- install it from your system's package manager
- Use conda to install and manage python for you
Docker
If the environment setup script fails to install Docker for you, report the error to Numerai then read the following to get a working installation on your machine. For PCs, you may need to activate virtualization in your BIOS before installing.
MacOS and Windows 10
Just install Docker Desktop to get it running. You should also increase the RAM allocated to the VM:
- Open the Docker Desktop GUI then
- Click Gear in top right corner
- Select Resources > Advanced in left sidebar
- Use slider to allocate more memory (leave a few gigs for your OS and background applications, otherwise your computer might crash)
Linux
Check here for instructions for your distribution: https://docs.docker.com/engine/install/
Older PCs: Docker Toolbox
If your machine is older and/or doesn't have Hyper-V enabled, then you will have to follow these steps to install docker toolbox on your machine:
- Install Oracle VirtualBox for your Operating System
- Restart your computer
- Install Docker Toolbox
- Restart your computer
- After it's installed, open the "Docker QuickStart Terminal" and run the following to increase its RAM:
docker-machine rm default
docker-machine create -d virtualbox --virtualbox-cpu-count=2 --virtualbox-memory=4096 --virtualbox-disk-size=50000 default
docker-machine restart default
NOTE: your code must live somewhere under your User directory (ie. C:\Users\USER_NAME\ANY_FOLDER). This is a restriction of docker toolbox not sharing paths correctly otherwise.
Azure
If you just made your Azure account there's a chance your account provisioning could take some time, potentially up to 24 hours.
When configuring your node for the first time the numerai-cli may hang as it tries to provision infrastructure in your account. If running numerai node config --provider azure
shows no log output for more than 5 minutes, your account is likely in the stuck provisioning state. While we investigate this issue, the best course of action is to wait until the following day and run the command again as there is no way to skip this Azure provisioning step.
Common Errors
Error:
subprocess.CalledProcessError: Command 'docker run --rm -it -v /home/jason/tmp/.numerai:/opt/plan -w /opt/plan hashicorp/terraform:light init' returned non-zero exit status 127.
Reason:
Docker is not installed.
Solutions:
If you're certain that docker is installed, make sure that your user can execute docker, ie. try to run `docker ps`.
If that's the issue, then depending on your system, you can do the following:
- Windows/OSX
- Make sure the Docker Desktop is running and finished booting up.
It can take a few minutes to be completely ready.
When clicking on the docker tray icon, it should say "Docker Desktop is Running".
- If you're using Docker Toolbox on Windows, then make sure you've opened the "Docker Quickstart Terminal".
- Linux
- Run `sudo usermod -aG docker $USER`
- Then reboot or logout/login for this to take effect.
Error:
docker: Error response from daemon: Drive has not been shared
Solutions:
- You need to [share your drive](https://docs.docker.com/docker-for-windows/#shared-drives).
Error:
numerai: command not found
Solutions:
- osx/linux: Try and run `~/.local/bin/numerai`
- Windows: `%LOCALAPPDATA%\Programs\Python\Python37-32\Scripts\numerai.exe`
- Alternatively, exit then re-open your terminal/command prompt.
Error:
error calling sts:GetCallerIdentity: InvalidClientTokenId: The security token included in the request is invalid.
...
Command 'docker run -e "AWS_ACCESS_KEY_ID=..." -e "AWS_SECRET_ACCESS_KEY=..." --rm -it -v /home/jason/tmp/.numerai:/opt/plan -w /opt/planhashicorp/terraform:light apply -auto-approve' returned non-zero exit status 1.
Solutions:
- Run `numerai configure` to re-write your API Keys
Error:
ERROR numerapi.base_api: Error received from your webhook server: {"tasks":[],"failures":[{"reason":"The requested CPU configuration is above your limit"}]}
Solution:
1. Go to the [Quota Dashboard for EC2](console.aws.amazon.com/servicequotas/home/services/ec2/quotas)
2. Search for "On-Demand", this will list all instance types and their limits for your account.
3. Click the bubble next to "Running On-Demand Standard (A, C, D, H, I, M, R, T, Z) instances"
4. click "Request quota increase" in the top right
5. Input a higher value than the currently applied quota and finish the request
- If after AWS increases your quota, you still get this error, try again 1-2 more times
- You may have to complete a quota request for other types of instances too depending on how resource intensive your setup is
Billing Alerts
There's no automated way to setup billing alerts, so you'll need to configure one manually. We estimate costs to be less than $5 per month unless your compute takes more than 12 hours a week, but increasing the RAM/CPU will increase your costs.
Prediction Node Architecture
A Prediction Node represents a cloud-based model that Numerai can trigger for predictions; it is designed to be reliable, resource efficient, and easy to configure and debug. Prediction Nodes use a few important components like a Dockerfile
, a Trigger
, a Container
, and a Compute Cluster
, all of which can be created using one of the following examples.
Python Example
numerai-python3
├── Dockerfile
├── .dockerignore
├── predict.py
├── requirements.txt
└── train.py
-
Dockerfile
: Used duringnumerai node deploy
to build a Docker image that's used to run your code in the cloud. It copies all files in its directory, installs Python requirements for requirements.txt, and runspython predict.py
by default. -
.dockerignore
: This file uses regex to match files that should not be included in the Docker image. -
train.py
: This is an extra entry point specifically for training, it's used when runningnumerai node test --local --command "python train.py"
-
requirements.txt
: Defines python packages required to run the code. -
predict.py
: Gets run by default locally and in the cloud when runningnumerai test
without the--command|-c
option.
RLang Example
numerai-rlang
├── Dockerfile
├── .dockerignore
├── install_packages.R
└── main.R
-
Dockerfile
: Used duringnumerai node deploy
to build a Docker image that's used to run your code in the cloud. It copies all files in its directory, installs Rlang requirements from install_packages.R, and runs main.R by default. -
.dockerignore
: This file uses regex to match files that should not be included in the Docker image. -
install_packages.R
: Installs dependencies necessary for running the example. -
main.R
: Ran by default locally and in the cloud and when runningnumerai test
without the--command|-c
option.
The Dockerfile
This is the most important component of deploying a Prediction Node. It is a program (much like a bash script), that packages up your code as an image
; this image contains everything your code needs to run in the cloud. The most typical case of a Dockerfile is demonstrated in Numerai Examples, if you're not sure how to use a Dockerfile, first copy an example with numerai copy-example
, then read the documentation in the Dockerfile to learn the basics.
These files are very flexible, the default Dockerfile will just copy everything in whatever directory it is in, but this can be customized if you'd like to share code between models. For example, if you have a python project setup like so:
numerai_models
├── common
├──── __init__.py
├──── data.py
├──── setup.py
├── model_1
├──── Dockerfile
├──── .dockerignore
├──── predict.py
├──── requirements.txt
└──── train.py
Where common
is an installable python package you want to use in multiple models, you can add this line to model_1/Dockerfile: RUN pip install ../common/
. Finally, run numerai node deploy
from the numerai_models
directory to install the package in the image, making it available to your model code.
If you want to learn more about how to customize this file [checkout the Dockerfile reference] (https://docs.docker.com/engine/reference/builder/).
Cloud Components
The CLI uses Terraform to provision cloud resources. Each component and the related cloud resource(s) are listed below. The links will take you to the AWS console where you can monitor any of these resources for a given node; just visit the link and select the resource with the same name as the node you want to monitor (further directions are given for each resource below).
-
Trigger
: A small function that schedules a "task" on yourCompute Cluster
. This "task" handles pulling the image that was created by theDockerfile
and running it as aContainer
on yourCompute Cluster
. This is handled by two resources:- API Gateway: Hosts the webhook (HTTP endpoint) that Numerai calls to trigger your nodes. After clicking the link and selecting the resource, use the left sidebar to access metrics and logging.
- Lambda: Schedules your compute job when you call your Webhook URL. After clicking the link and selecting the resource, use the "Monitor" tab below the "Function Overview" section.
-
Container
: The thing that actually contains and runs your code on a computer provisioned by theCompute Cluster
. The--size
(or-s
) flag on thenumerai node config
sets the CPU and Memory limits for aContainer
. This is stored in one place:- ECR (Elastic Container Repository):
Used for storing docker images. This is the location to which
numerai docker deploy
will push your image. There is not much monitoring here, but you can view your images and when they were uploaded.
- ECR (Elastic Container Repository):
Used for storing docker images. This is the location to which
-
Compute Cluster
: A handler that accepts scheduled "tasks" and spins up and down computers to runContainers
. This is handled by ECS:- ECS (Elastic Container Service): This is where your containers will actually run and where you'll want to look if your containers don't seem to be scheduled/running. After clicking the link, you'll be able to scroll and monitor the top-level metrics of each cluster. After selecting a specific cluster, you can use the various tabs to view different components of the cluster (tasks are the runnable jobs that the Lambda schedules, instances are the computers the tasks run on, and metrics will show cluster-wide information)
Special Thanks
- Thanks to uuazed for their work on numerapi
- Thanks to hellno for starting the Signals ticker map
- Thanks to tit_BTCQASH (numerai profile and twitter profile) for debugging the environment setup process on Windows 8
- Thanks to eses-wk for implementing Azure support
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file numerai-cli-1.1.3.tar.gz
.
File metadata
- Download URL: numerai-cli-1.1.3.tar.gz
- Upload date:
- Size: 64.9 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.7.9
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 81a124780565fc993f6fbeaa04ed2d75e42775cb0a5360e49a1ebe4c6c1a80b8 |
|
MD5 | abca76e71983cf59d716821f172481b5 |
|
BLAKE2b-256 | a34b1e677d700844649efc216105de43c4d5927b4632e915e5c0f39a752d123c |
File details
Details for the file numerai_cli-1.1.3-py3-none-any.whl
.
File metadata
- Download URL: numerai_cli-1.1.3-py3-none-any.whl
- Upload date:
- Size: 82.2 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.7.9
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 9bf4c22f7c5b9a38c7c52707141742135f67b95940b94ae872b9ff1cedb04b3c |
|
MD5 | 4158fce36d4cf27ffcda8e9813b5f4f7 |
|
BLAKE2b-256 | df877edad19a727d2a3cbda11b9ee5079230991ad8291765b441cbe23a203461 |