datarobot-drum

DRUM - develop, test and deploy custom models

These details have not been verified by PyPI

Project links

Project description

About

The DataRobot Model Runner (DRUM) is a tool that allows you to work locally with Python, R, and Java custom models. It can be used to verify that a custom model can run and make predictions before it is uploaded to DataRobot. However, this testing is only for development purposes. DataRobot recommends that any model you wish to deploy should also be tested in the Custom Model Workshop.

DRUM can also:

run performance and memory usage testing for models.
perform model validation tests, e.g., checking model functionality on corner cases, like null values imputation.
run models in a Docker container.

DataRobot DRUM is only tested to support Linux/macOS operating systems. To run DRUM in Windows 10 please use WSL (Windows Subsystem for Linux).

Communication

open an issue in the DRUM GitHub repository.

Custom inference models quickstart guide

View examples here.

Custom tasks

View examples here.

Installation

Prerequisites:

All models:

Install the dependencies needed to run your code.
If you are using a drop-in environment found in this repo, you must pip install these dependencies.

Python models:

Check https://pypi.org/project/datarobot-drum/ for supported Python versions.

Java models:

JRE >= 11.

R models:

Python >= 3.6.
The R framework must be installed.
DRUM uses the rpy2 package (by default the latest version is installed) to run R. You may need to adjust the rpy2 and pandas versions for compatibility.

To install DRUM with Python/Java models support:
pip install datarobot-drum

To install DRUM with R support:
pip install datarobot-drum[R]

Autocompletion

DRUM supports autocompletion based on the argcomplete package. Additional configuration is required to use it:

run activate-global-python-argcomplete --user; this should create a file: ~/.bash_completion.d/python-argcomplete,
source created file source ~/.bash_completion.d/python-argcomplete in your ~/.bashrc or another profile-related file according to your system.

If global completion is not completing your script, bash may have registered a default completion function:

run complete | grep drum; if there is an output complete -F _minimal <some_line_containing_drum> do
complete -r <some_line_containing_drum>

For more information and troubleshooting visit the argcomplete information page.

Usage

Help:
**drum** -help

Code Directory --code-dir

The --code-dir (code directory) argument is required in all commands and should point to a folder which contains your model artifacts and any other code needed for DRUM to run your model. For example, if you're running DRUM from testdir with a test input file at the root and your model in a subdirectory called model, you would enter:

drum score --code-dir ./model/ --input ./testfile.csv

Additional model code dependencies

Code dir may contain a requirements.txt file, listing dependency packages which are required by code. Only Python and R models are supported.

Format of requirements.txt file

for Python: pip requrements file format
for R: a package per line

DRUM will attempt to install dependencies only when running with --docker option.

Model template generation

DRUM can help you to generate a code folder template with the custom file described above.
drum new model --code-dir ~/user_code_dir/ --language r
This command creates a folder with a custom.py/R file and a short description: README.md.

Batch scoring mode

Run a binary classification custom model

Make batch predictions with a binary classification model. Optionally, specify an output file. Otherwise, predictions are returned to the command line:
drum score --code-dir ~/user_code_dir/ --input 10k.csv --target-type binary --positive-class-label yes --negative-class-label no --output 10k-results.csv --verbose

Run a regression custom model

Make batch predictions with a regression model:
drum score --code-dir ~/user_code_dir/ --input fast-iron.csv --target-type regression --verbose

Testing model performance

You can test how the model performs and get its latency times and memory usage.
In this mode, the model is started with a prediction server. Different request combinations are submitted to it. After it completes, it returns a report.
drum perf-test --code-dir ~/user_code_dir/ --input 10k.csv --target-type binary --positive-class-label yes --negative-class-label no
Report example:

samples   iters    min     avg     max    used (MB)   total (MB)
============================================================================
Test case         1     100   0.028   0.030   0.054     306.934    31442.840
Test case        10     100   0.030   0.034   0.069     307.375    31442.840
Test case       100      10   0.036   0.038   0.045     307.512    31442.840
Test case      1000      10   0.042   0.047   0.058     308.258    31442.840
Test case    100000       1   0.674   0.674   0.674     330.902    31442.840
50MB file    838861       1   5.206   5.206   5.206     453.121    31442.840

For more feature options, see: drum perf-test --help

Model validation checks

You can validate the model on a set of various checks. It is highly recommended running these checks, as they are performed in DataRobot before the model can be deployed.

List of checks:

null values imputation: each feature of the provided dataset is set to missing and fed to the model.

To run: drum validation --code-dir ~/user_code_dir/ --input 10k.csv --target-type binary --positive-class-label yes --negative-class-label no Sample report:

Validation check results
Test case         Status
==============================
Null value imputation   PASSED

In case of check failure more information will be provided.

Runtime Parameters

Runtime parameters are created by the user via the custom model's version create routes (.e.g DataRobot WEB UI, DataRobot client, etc.). These runtime parameters can then be loaded into the custom model using the RuntimeParameters class, as follows:

from datarobot_drum import RuntimeParameters

def load_model(code_dir):
    url = RuntimeParameters.get("URL_PARAM_1")
    aws_credential = RuntimeParameters.get("AWS_CREDENTIAL_PARAM_1")
    ...

During testing and debugging in a local development environment, the user can write the runtime parameter values into a YAML file and provide it as an input to the drum utility. The YAML file can have any name ending with .yaml and should follow the example layout below:

URL_PARAM_1: http://any-desired-location/
AWS_CRED_PARAM_1:
    # See the REST API documentation for details on all supported credential types:
    #     https://docs.datarobot.com/en/docs/api/reference/public-api/credentials.html#properties_3
    credentialType: s3
    awsAccessKeyId: ABDEFGHIJK...
    awsSecretAccessKey: asdjDFSDJafslkjsdDLKGDSDlkjlkj...
    awsSessionToken: null

For credential type parameters, the value matches the Credentials REST API payload. For a complete example, see the following model template with runtime parameters.

And here is how you use it when running the drum utility:

drum score --runtime-params-file <filepath> --code-dir ~/user_code_dir/ --target-type <target type> --input dataset.csv

Prediction server mode

DRUM can also run as a prediction server. To do so, provide a server address argument:
drum server --code-dir ~/user_code_dir --target-type regression --address localhost:6789

The DRUM prediction server provides the following routes. You may provide the environment variable URL_PREFIX. Note that URLs must end with /.
For complete API specification in Openapi 3.0 format check here drum_server_api.yaml, you can also open it rendered in the Swagger Editor.

Status routes:
A GET URL_PREFIX/ and URL_PREFIX/ping/ routes, shows server status - if the server is alive.
Example: GET http://localhost:6789/
Response:

   {"message": "OK"}

Health route:
A GET URL_PREFIX/health/ route, shows functional health. E.g. model is loaded and functioning properly.
Example: GET http://localhost:6789/health/
Response:

Success:
```
{"message": "OK"}
```

Error:

{
  "message": "ERROR: \n\nRunning environment language: Python.\n Failed loading hooks from [/tmp/model/python3_sklearn/custom.py] : No module named 'andas'"
}

Info route:
A GET URL_PREFIX/info/ route, shows information about running model (metadata, paths, predictor type, etc.).
Example: GET http://localhost:6789/info/
Response:

{
   "codeDir": "/tmp/model/python3_sklearn",
   "drumServer": "flask",
   "drumVersion": "1.5.3",
   "language": "python",
   "modelMetadata": {
     "environmentID": "5e8c889607389fe0f466c72d",
     "inferenceModel": {
       "targetName": "Grade 2014"
     },
     "modelID": "5f1f15a4d6111f01cb7f91fd",
     "name": "regression model",
     "targetType": "regression",
     "type": "inference",
     "validation": {
       "input": "../../../tests/testdata/juniors_3_year_stats_regression.csv"
     }
   },
   "predictor": "scikit-learn",
   "targetType": "regression"
}
      ```

Statistics route:
A GET URL_PREFIX/stats/ route, shows running model statistics (memory).
Example: GET http://localhost:6789/stats/
mem_info::drum_rss represent a sum of drum_info::mem values.
Response:

{
  "drum_info": [{
      "cmdline": [
          "/tmp/drum_tests_virtual_environment/bin/python3",
          "/tmp/drum_tests_virtual_environment/bin/drum",
          "server",
          "--code-dir",
          "/tmp/model/python3_sklearn",
          "--target-type",
          "regression",
          "--address",
          "localhost:6789",
          "--with-error-server",
          "--show-perf"
      ],
      "mem": 256.71484375,
      "pid": 342391
  }],
  "mem_info": {
      "avail": 17670.828125,
      "container_limit": null,
      "container_max_used": null,
      "container_used": null,
      "drum_rss": 256.71484375,
      "free": 312.33203125,
      "total": 31442.73046875
  },
  "time_info": {
    "run_predictor_total": {
      "avg": 0.0165,
      "max": 0.023,
      "min": 0.013
    }
  }
}

Capabilities route:
A GET URL_PREFIX/capabilities/ route, shows payload formats and methods supported by the running model.
Example: GET http://localhost:6789/capabilities/
Structured predictions routes:
A POST URL_PREFIX/predict/ and URL_PREFIX/predictions/ routes, which returns predictions on data.
Example: POST http://localhost:6789/predict/; POST http://localhost:6789/predictions/
For these routes data can be posted in two ways:
- as form data parameter with a key:value pair, where:
  key = X
  value = filename of the csv/mtx format, that contains the inference data.
- as binary data; in case of mtx format, mimetype text/mtx must be set.
Structured transform route (for Python predictor only):
A POST URL_PREFIX/transform/ route, which returns transformed data.
Example: POST http://localhost:6789/transform/;
For this route data can be posted in two ways:
- as form data parameter with a key:value pair, where:
  key = X.
  value = filename of the csv/mtx format, that contains the inference data.
  
  optionally a second key, y, can be passed with value = a second filename containing target data.
  
  if y is passed, the route will return both X.transformed and y.transformed keys, along with out.format indicating the format of the transformed X output. This will take a value of csv or sparse. y.transformed is never sparse.
- as binary data; in case of mtx format, mimetype text/mtx must be set.
Unstructured predictions routes:
A POST URL_PREFIX/predictUnstructured/ and URL_PREFIX/predictionsUnstructured/ routes, which returns predictions on data.
Example: POST http://localhost:6789/predictUnstructured/; POST http://localhost:6789/predictionsUnstructured/
For these routes data is posted as binary data. Provide mimetype and charset to properly handle the data. For more detailed information please go here.

[DEPRECATED] Starting drum as prediction server in production mode.

DRUM prediction server can be started in production mode, which means that the Flask server will run in multi-process mode. This provides better stability and scalability - depending on how many CPUs are available several workers will be started to serve predictions.
--max-workers parameter is required in order to specify the number of workers. E.g. drum server --code-dir ~/user_code_dir --address localhost:6789 --production --max-workers 2

Fit mode

Note: Running fit inside of DataRobot is currently in alpha. Check back soon for the opportunity to test out this functionality yourself.

DRUM can run your training model to make sure it can produce a trained model artifact before adding the training model into DataRobot.

You can try this out on our sklearn classifier model template this this command:

drum fit --code-dir task_templates/3_pipelines/python3_sklearn_binary --target-type binary --target Species --input \
tests/testdata/iris_binary_training.csv --output . --positive-class-label Iris-setosa \
--negative-class-label Iris-versicolor

Note: If you don't provide class label, DataRobot tries to autodetect the labels for you.

You can also use DRUM on regression datasets, and soon you will also be able to provide row weights. Checkout the drum fit --help output for further details.

Running inside a docker container

In every mode, DRUM can be run inside a docker container by providing the option --docker <image_name/directory_path>. The container should implement an environment required to perform desired action. DRUM must be installed as a part of this environment.
The following is an example on how to run DRUM inside of container:
drum score --code-dir ~/user_code_dir/ --target-type <target type> --input dataset.csv --docker <container_name>
drum perf-test --code-dir ~/user_code_dir/ --target-type <target type> --input dataset.csv --docker <container_name>

Alternatively, the argument passed through the --docker flag may be a directory containing the unbuilt contents of an image. The DRUM tool will then attempt to build an image using this directory and run your model inside the newly built image.

If the argument passed to --docker is a docker context directory, and code dir contains dependencies file requirements.txt, DRUM will try to install the packages during the image build.
To skip dependencies installation you can use --skip-deps-install flag.

Drum Push

Starting in version 1.1.4, drum includes a new verb called push. When the user writes drum push -cd /dirtopush/ the contents of that directory will be submitted as a custom model to DataRobot. However, for this to work, you must create two types of configuration.

DataRobot client configuration push relies on correct global configuration of the client to access a DataRobot server. There are two options for supplying this configuration, through environment variables or through a config file which is read by the DataRobot client. Both of these options will include an endpoint and an API token to authenticate the requests.

Option 1: Environment variables. Example:

export DATAROBOT_ENDPOINT=https://app.datarobot.com/api/v2
export DATAROBOT_API_TOKEN=<yourtoken>

Option 2: Create this file, which we check for: ~/.config/datarobot/drconfig.yaml
Example:
```
endpoint: https://app.datarobot.com/api/v2
token: <yourtoken>
```

Model Metadata push also relies on a metadata file, which is parsed on DRUM to create the correct sort of model in DataRobot. This metadata file includes quite a few options. You can read about those options or see an example.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

1.17.18

Jun 4, 2026

1.17.17

May 18, 2026

1.17.16

May 15, 2026

1.17.15

Apr 15, 2026

1.17.14

Apr 3, 2026

1.17.13

Feb 28, 2026

1.17.12.post1

Feb 28, 2026

1.17.12

Jan 27, 2026

1.17.11.post1

Feb 27, 2026

1.17.11

Dec 26, 2025

1.17.10

Dec 18, 2025

1.17.9

Dec 10, 2025

1.17.8

Nov 21, 2025

1.17.7

Nov 20, 2025

1.17.6

Nov 3, 2025

1.17.5

Oct 9, 2025

1.17.4

Oct 7, 2025

1.17.3

Oct 1, 2025

1.17.3a1 pre-release

Oct 1, 2025

1.17.2

Sep 23, 2025

1.17.1

Sep 12, 2025

1.17.0

Sep 11, 2025

1.16.26

Sep 4, 2025

1.16.25

Aug 29, 2025

1.16.24

Aug 25, 2025

1.16.23

Aug 19, 2025

1.16.22

Aug 13, 2025

1.16.20

Jul 3, 2025

1.16.19

Jun 26, 2025

1.16.18

Jun 25, 2025

1.16.17

Jun 9, 2025

1.16.16

Jun 4, 2025

1.16.15

May 30, 2025

1.16.14

May 27, 2025

1.16.13

May 12, 2025

1.16.12

Apr 24, 2025

1.16.11

Mar 31, 2025

1.16.10.post1

Jun 26, 2025

1.16.10

Mar 20, 2025

1.16.9

Mar 18, 2025

1.16.8

Mar 14, 2025

1.16.7

Mar 12, 2025

1.16.6

Feb 26, 2025

1.16.5

Feb 18, 2025

1.16.4

Jan 31, 2025

1.16.3

Jan 28, 2025

1.16.2

Jan 24, 2025

1.16.1

Jan 22, 2025

1.16.0

Jan 9, 2025

1.15.0

Nov 27, 2024

1.14.3

Nov 13, 2024

1.14.2

Nov 12, 2024

1.14.1

Nov 1, 2024

1.14.0

Oct 31, 2024

1.13.0

Oct 22, 2024

1.13.0b2 pre-release

Oct 11, 2024

1.13.0b1 pre-release

Oct 10, 2024

1.12.2

Oct 8, 2024

1.12.2a1 pre-release

Oct 4, 2024

1.12.1

Sep 25, 2024

1.12.0

Sep 4, 2024

1.11.6

Aug 30, 2024

1.11.5

Jun 18, 2024

1.11.4

May 30, 2024

1.11.3

May 24, 2024

1.11.2

May 23, 2024

1.11.1

May 14, 2024

1.11.0

May 2, 2024

1.10.21

Mar 25, 2024

1.10.20

Feb 6, 2024

1.10.19

Jan 29, 2024

1.10.18

Jan 16, 2024

1.10.17

Jan 4, 2024

1.10.16

Dec 12, 2023

1.10.16.dev1 pre-release

Jan 2, 2024

1.10.15

Dec 5, 2023

1.10.14

Nov 23, 2023

1.10.13

Nov 14, 2023

1.10.12

Nov 6, 2023

1.10.11

Nov 1, 2023

1.10.10

Aug 14, 2023

1.10.10.dev1 pre-release

Oct 31, 2023

1.10.9

Aug 8, 2023

1.10.8

Aug 2, 2023

1.10.8rc3 pre-release

Aug 1, 2023

1.10.8rc2 pre-release

Jul 31, 2023

1.10.8rc1 pre-release

Jul 29, 2023

1.10.7

Jun 30, 2023

1.10.6

Jun 14, 2023

1.10.5

Jun 5, 2023

1.10.4 yanked

Jun 2, 2023

1.10.3

May 17, 2023

1.10.2

Apr 25, 2023

1.10.2.dev4 pre-release

May 2, 2023

1.10.2.dev3 pre-release

May 2, 2023

1.10.2.dev2 pre-release

May 2, 2023

1.10.2.dev1 pre-release

May 2, 2023

1.10.1

Mar 23, 2023

1.10.1rc4 pre-release

Apr 12, 2023

1.10.1rc3 pre-release

Mar 16, 2023

1.10.1rc2 pre-release

Mar 16, 2023

1.10.1rc1 pre-release

Mar 10, 2023

1.10.0

Feb 10, 2023

1.10.dev2 pre-release

Oct 31, 2023

1.9.15.dev2 pre-release

Jan 31, 2023

1.9.15.dev1 pre-release

Jan 30, 2023

1.9.14

Jan 11, 2023

1.9.13

Nov 8, 2022

1.9.12

Nov 1, 2022

1.9.11

Oct 24, 2022

1.9.11.dev1 pre-release

Oct 18, 2022

1.9.10

Oct 5, 2022

1.9.9

Sep 29, 2022

1.9.9.dev2 pre-release

Sep 1, 2022

1.9.9.dev1 pre-release

Aug 30, 2022

1.9.8

Aug 8, 2022

1.9.7

Aug 2, 2022

1.9.6

Jul 21, 2022

1.9.5

Jul 6, 2022

1.9.5.dev1 pre-release

Jul 6, 2022

1.9.4.post2

Oct 2, 2022

1.9.4.post1

Sep 30, 2022

1.9.4

Jun 13, 2022

1.9.4rc1 pre-release

Jun 7, 2022

1.9.3

Apr 11, 2022

1.9.2

Apr 1, 2022

1.9.1

Mar 31, 2022

1.9.0

Mar 22, 2022

1.8.0

Feb 25, 2022

1.7.2.dev1 pre-release

Feb 17, 2022

1.7.1

Feb 10, 2022

1.7.0

Feb 3, 2022

1.6.7

Jan 24, 2022

1.6.6

Dec 17, 2021

1.6.5

Dec 11, 2021

1.6.4

Dec 3, 2021

1.6.3

Nov 16, 2021

1.6.2.post6

Jan 26, 2022

1.6.2.post2

Dec 16, 2021

1.6.2.post1

Dec 14, 2021

1.6.2

Nov 5, 2021

1.6.1

Oct 22, 2021

1.6.0

Oct 8, 2021

1.5.16

Sep 30, 2021

1.5.15

Sep 24, 2021

1.5.14 yanked

Sep 20, 2021

Reason this release was yanked:

same as 1.5.13

1.5.13

Sep 17, 2021

1.5.12

Sep 16, 2021

1.5.11

Aug 20, 2021

1.5.10

Jul 30, 2021

1.5.9

Jul 19, 2021

1.5.8

Jul 14, 2021

1.5.7

Jun 23, 2021

1.5.6.post1

May 24, 2021

1.5.6

May 19, 2021

1.5.5

Apr 29, 2021

1.5.4

Apr 14, 2021

1.5.3

Apr 6, 2021

1.5.2

Mar 19, 2021

1.5.1

Mar 9, 2021

1.5.0

Feb 27, 2021

1.4.16.post4

Feb 24, 2021

1.4.16.post3

Feb 24, 2021

1.4.16.post2

Feb 24, 2021

1.4.16.post1

Feb 24, 2021

1.4.16

Feb 19, 2021

1.4.15

Feb 12, 2021

1.4.14

Feb 8, 2021

1.4.13

Feb 1, 2021

1.4.12

Jan 19, 2021

1.4.11

Jan 15, 2021

1.4.10

Jan 12, 2021

1.4.9

Dec 30, 2020

1.4.8

Dec 14, 2020

1.4.8.dev3 pre-release

Dec 12, 2020

1.4.8.dev2 pre-release

Dec 12, 2020

1.4.8.dev1 pre-release

Dec 11, 2020

1.4.7

Dec 11, 2020

1.4.7.dev1 pre-release

Dec 11, 2020

1.4.6

Dec 8, 2020

1.4.5

Dec 2, 2020

1.4.4

Nov 29, 2020

1.4.3

Nov 19, 2020

1.4.3rc2 pre-release

Nov 18, 2020

1.4.3rc1 pre-release

Nov 17, 2020

1.4.2

Nov 14, 2020

1.4.1

Oct 29, 2020

1.4.0

Oct 24, 2020

1.3.0

Oct 15, 2020

1.2.0

Aug 28, 2020

1.1.4

Aug 4, 2020

1.1.3

Jul 18, 2020

1.1.2

Jun 18, 2020

1.1.1

Jun 10, 2020

1.1.0

May 29, 2020

1.0.20.post1

May 28, 2020

1.0.20

May 28, 2020

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

datarobot_drum-1.17.18-py3-none-any.whl (10.8 MB view details)

Uploaded Jun 4, 2026 Python 3

File details

Details for the file datarobot_drum-1.17.18-py3-none-any.whl.

File metadata

Download URL: datarobot_drum-1.17.18-py3-none-any.whl
Upload date: Jun 4, 2026
Size: 10.8 MB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.3

File hashes

Hashes for datarobot_drum-1.17.18-py3-none-any.whl
Algorithm	Hash digest
SHA256	`b6b310e3aaceddb1ccdce60c432cf0648946826a63d1d2239e3d2e7f1657d51b`
MD5	`60f79078f708f33da9f9682162413779`
BLAKE2b-256	`e3b797784d051ed0d4fa92552f27b677e6352e2b26d0249b6e12f7fab75681a6`

See more details on using hashes here.

datarobot-drum 1.17.18

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

About

Communication

Custom inference models quickstart guide

Custom tasks

Installation

Prerequisites:

Autocompletion

Usage

Operations

Code Directory --code-dir

Additional model code dependencies

Model template generation

Batch scoring mode

Run a binary classification custom model

Run a regression custom model

Testing model performance

Model validation checks

Runtime Parameters

Prediction server mode

[DEPRECATED] Starting drum as prediction server in production mode.

Fit mode

Running inside a docker container

Drum Push

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distributions

Built Distribution

File details

File metadata

File hashes