Client to connect to the MDML

These details have not been verified by PyPI

Project links

Homepage

License
- OSI Approved :: MIT License
Operating System
- OS Independent
Programming Language
- Python :: 3

Project description

MDML Client

Create a client to easily access the features of the Manufacturing Data & Machine Learning Layer (MDML).

Installation

    pip install mdml_client

Usage

The MDML client uses a class named experiment that provides methods for connecting to the MDML message broker, starting an experiment, publishing data, running analyses, terminating an experiment, and receiving event notifications. Below is an example of a standard use case for the MDML.

  import mdml_client as mdml

  # Create an MDML experiment
  My_MDML_Exp = mdml.experiment("EXPERIMENT_ID", "USERNAME", "PASSWORD", "HOST.IP.ADDRESS")

  # Start the debugger - prints messages from the MDML about your experiment
  My_MDML_Exp.start_debugger()

  # Log in to allow FuncX usage. A link will be printed in the console window for authentication.
  My_MDML_Exp.globus_login()

  # Validate and locally add a configuration to your experiment
  My_MDML_Exp.add_config("YOUR_CONFIG_FILE.json", "optional_run_id")

  # Send the configuration to the MDML
  My_MDML_Exp.send_config()

  # Publishing data - do this as much and as often as required by your experiment
  My_MDML_Exp.publish_data(device_id, data, data_delimiter, influxDB)

  # Analyze data
  My_MDML_Exp.publish_analysis(queries, function_id, endpoint_id)

  # Make sure to reset the MDML to end your experiment!
  My_MDML_Exp.reset()

Documentation

My_MDML_Exp = mdml.experiment(experiment_id, username, passwd, host)

Parameters:

experiment_id (str) - MDML experiment ID, given to you by an MDML admin
username (str) - MDML username
passwd (str) - MDML password
host (str) - IP address of the MDML host, given to you by an MDML admin

Returns - experiment object

This is the first step in interacting with the MDML. mdml.experiment() creates an experiment object through which methods for interacting with the MDML are accessed. This line also creates a connection to the MDML that will be used later. All input parameters specified here will be given to you by an MDML admin. From here on out, all methods should be called on the variable created by mdml.experiment() - in this case My_MDML_Exp. Since it is possible that your experiment may need to send data from multiple different places, multiple connections to the MDML can be made with this line of code.

My_MDML_Exp.start_debugger()

Starting the debugger allows you to receive event notifications from the MDML. These notifications will be automatically printed to the console window.

My_MDML_Exp.globus_login()

This method logs the user in using Globus authentication. It is only required if FuncX analyses will be run. When the method runs, a link is printed in the console window. Clicking it opens a web browser where you log in to your Globus account and are given a token. Copy and paste this token into the console window to finish the login.

My_MDML_Exp.add_config(config, experiment_run_id)

Parameters:

config (str | dict) - string to a file path or a dict containing the configuration (syntax below)
experiment_run_id (str) - string of only letters and underscores to identify the experiment run

This method adds your configuration file to your experiment object - it has not been sent to the MDML yet. The config parameter is explained in detail below. The second parameter is the run ID for the experiment about to be started. A valid run ID can only contain letters and underscores. Reusing a previous run ID will treat the data as if it came from the past experiment regardless of the time elapsed - data files will be appended to where they left off.
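
The run ID rule above can be checked locally before calling .add_config(). The following is a minimal sketch, not part of the client: the helper name is_valid_run_id is hypothetical, and it assumes that "letters" means the ASCII letters A-Z and a-z.

```python
import re

# Assumption: "only letters and underscores" means ASCII A-Z, a-z and "_".
RUN_ID_PATTERN = re.compile(r"[A-Za-z_]+")


def is_valid_run_id(run_id):
    """Return True if run_id is a non-empty string of letters and underscores."""
    return isinstance(run_id, str) and RUN_ID_PATTERN.fullmatch(run_id) is not None


print(is_valid_run_id("calibration_run"))  # True
print(is_valid_run_id("run_2"))            # False, digits are not allowed
```

Checking the ID up front avoids accidentally reusing or mistyping a run ID, which matters because data sent under a reused ID is appended to the earlier run.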

My_MDML_Exp.send_config()

This method sends the configuration you added with .add_config() to the MDML. If a debugger has been started, you should see a message regarding the configuration.

My_MDML_Exp.publish_data(device_id, data, data_delimiter='null', influxDB = False)

Parameters:

device_id (str) - device id string that corresponds to a device in the configuration file
data (str) - delimited string of data
data_delimiter (str) - delimiter used in the data parameter
influxDB (bool) - True if data should be stored in InfluxDB, False otherwise
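
As an illustration of the data parameter, the sketch below joins one reading into a delimited string in header order. The helper build_data_row, the device ID SENSOR_A, and the sample values are hypothetical and not part of the client; the check that rejects values containing the delimiter is an extra safeguard assumed here, not documented MDML behavior.

```python
def build_data_row(values, delimiter="\t"):
    """Join values into one delimited string, in the same order as the headers."""
    parts = [str(value) for value in values]
    for part in parts:
        # A value containing the delimiter would shift every later column.
        if delimiter in part:
            raise ValueError(f"value {part!r} contains the delimiter")
    return delimiter.join(parts)


# Hypothetical reading for a device with headers ["time", "temperature", "pressure"].
row = build_data_row([1573000000000000000, 21.5, 1.01], delimiter=",")
print(row)  # 1573000000000000000,21.5,1.01

# With a connected experiment object, the row could then be sent as:
# My_MDML_Exp.publish_data("SENSOR_A", row, ",", True)
```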

My_MDML_Exp.publish_image(device_id, img_byte_string, timestamp = 0)

Parameters:

device_id (str) - device id string that corresponds to a device in the configuration file
img_byte_string (str) - string of bytes for the image (output from mdml_client.read_image())
timestamp (str) - Unix time in nanoseconds when the photo was taken

My_MDML_Exp.publish_analysis(queries, function_id, endpoint_id)

Parameters:

queries (str) - Description of the data to send to the FuncX function using the syntax below
function_id (str) - From FuncX, id of the function to run
endpoint_id (str) - From FuncX, id of the endpoint to run on

My_MDML_Exp.reset()

This method must be called in order to end an experiment. A message is sent to the MDML backend that finishes sending data messages and begins archiving all data files for storage.
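
Because reset() must always be called, one option is to wrap the publishing loop in try/finally so the experiment is ended even if publishing raises an error. This is a sketch of that pattern only: StubExperiment is a hypothetical stand-in that records calls instead of talking to the MDML, and SENSOR_A is a made-up device ID.

```python
class StubExperiment:
    """Stand-in for an experiment object; it only records the calls made on it."""

    def __init__(self):
        self.calls = []

    def send_config(self):
        self.calls.append("send_config")

    def publish_data(self, device_id, data, data_delimiter="null", influxDB=False):
        self.calls.append("publish_data")

    def reset(self):
        self.calls.append("reset")


def run_experiment(exp, rows):
    """Publish every row, then call reset() whether or not publishing failed."""
    exp.send_config()
    try:
        for row in rows:
            exp.publish_data("SENSOR_A", row, "\t", True)
    finally:
        exp.reset()


exp = StubExperiment()
run_experiment(exp, ["1\t2", "3\t4"])
print(exp.calls)  # ['send_config', 'publish_data', 'publish_data', 'reset']
```

With a real experiment object, the same run_experiment function could be passed My_MDML_Exp in place of the stub.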

MDML Configuration Syntax

Configuration Documentation

Every experiment run through the MDML first needs a configuration file. This gives the MDML context for your data and provides meaningful metadata for your experiments, processes, and data-generating devices. Information in the configuration file should answer questions that the data itself does not: what units the data are in, what kind of device generated the data, or whether an analysis was done before the data was sent to the MDML. Providing as much information as possible not only increases the data's value for scientific purposes but also minimizes future confusion when you or another researcher want to use the data.

The configuration of an experiment serves as metadata for each device/sensor generating data and for the experiment itself. The configuration also allows the MDML to warn you and prevent any bad data from being published. We highly recommend taking the time to craft a detailed configuration so that if used in the future, any researcher would be able to understand your experiment and data.

The configuration file must be a valid JSON file. It consists of two parts, an experiment section and a devices section. The experiment section is for general experiment notes and the list of devices that will generate data. The devices section contains an entry for each device listed in the experiment section. In each section, there are required fields and optional fields that control the MDML's behavior while streaming data. Furthermore, you can create any additional fields you wish as long as the field's name is not already used by a required or optional field. Below is an in-depth description of the configuration file.

Experiment Section

Required Fields:

experiment_id
- Experiment ID provided by the MDML administrators
experiment_notes
- Any important notes about your experiment that you would like to remain with the data
experiment_devices
- A list of devices that will be generating and sending data. These will be described in the Devices section

Optional Fields:

experiment_run_id
- Experiment run ID. Defaults to 1 and increases for each new experiment. It is added automatically if the second parameter of .add_config() is specified. This is different from experiment_id.

Devices Section

Required Fields:

device_id
- Identification string for the device. MUST match a device listed in the experiment section
device_name
- Full name of the device
device_output
- Explanation of what data the device is outputting
device_output_rate
- The rate (in hertz) that this sensor will be generating data (If the rate during your experiment may vary, please use the fastest rate)
device_data_type
- Type of data being generated. Must be "text/numeric", "vector", or "image"
device_notes
- Any other relevant information to provide that has not been listed
headers
- A list of headers to describe the data that will be sent
data_types
- A list of data types for each value (MUST correspond to the headers field)
data_units
- A list of the units for each value (MUST correspond to the headers field)

Optional Fields:

melt_data - Contains more data on how to melt the data (see the melting data section below)
- keep
  - List of variables to keep the same (must have been listed in the headers field)
- var_name
  - Name of the new variable that is created with all the values from headers that are not included in keep
- var_val
  - Name of the new variable that is created with the values corresponding to the original headers
influx_tags
- List of variables that should be used as tags - MUST correspond to values in the headers field (Tags are specific to InfluxDB. See the Software Stack section below for details.)
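
The required fields listed above lend themselves to a quick local check before a configuration is added. The sketch below is one possible pre-check, not the validation that the client or the MDML performs; the function name check_config and the wording of its messages are hypothetical.

```python
EXPERIMENT_REQUIRED = ["experiment_id", "experiment_notes", "experiment_devices"]
DEVICE_REQUIRED = [
    "device_id", "device_name", "device_output", "device_output_rate",
    "device_data_type", "device_notes", "headers", "data_types", "data_units",
]
DEVICE_DATA_TYPES = {"text/numeric", "vector", "image"}


def check_config(config):
    """Return a list of problems found in a configuration dict (empty if none)."""
    problems = []
    experiment = config.get("experiment", {})
    for field in EXPERIMENT_REQUIRED:
        if field not in experiment:
            problems.append(f"experiment: missing {field}")
    listed = set(experiment.get("experiment_devices", []))
    described = set()
    for device in config.get("devices", []):
        name = device.get("device_id", "<unknown>")
        described.add(name)
        for field in DEVICE_REQUIRED:
            if field not in device:
                problems.append(f"{name}: missing {field}")
        if name not in listed:
            problems.append(f"{name}: not listed in experiment_devices")
        if device.get("device_data_type", "image") not in DEVICE_DATA_TYPES:
            problems.append(f"{name}: invalid device_data_type")
        # data_types and data_units must line up with headers one-to-one.
        n_headers = len(device.get("headers", []))
        for field in ("data_types", "data_units"):
            if field in device and len(device[field]) != n_headers:
                problems.append(f"{name}: {field} length does not match headers")
    for name in sorted(listed - described):
        problems.append(f"{name}: listed in experiment_devices but not described")
    return problems


config = {
    "experiment": {
        "experiment_id": "FSP",
        "experiment_notes": "Flame Spray Pyrolysis Experiment",
        "experiment_devices": ["PLIF", "OES"],
    },
    "devices": [
        {
            "device_id": "PLIF",
            "device_name": "Planar Laser Induced Fluorescence",
            "device_output": "Image of flames showing specific excited species.",
            "device_output_rate": 10,
            "device_data_type": "image",
            "device_notes": "Points down, directly at the flame",
            "headers": ["PLIF"],
            "data_types": ["image"],
            "data_units": ["image"],
        }
    ],
}
for problem in check_config(config):
    print(problem)  # OES: listed in experiment_devices but not described
```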

Experiment Configuration Example

  {
    "experiment": {
      "experiment_id": "FSP",
      "experiment_notes": "Flame Spray Pyrolysis Experiment",
      "experiment_devices": [
        "OES",
        "DATA_LOG",
        "PLIF"
      ]
    },
    "devices": [
      {
        "device_id": "OES",
        "device_name": "ANDOR Kymera328",
        "device_output": "2048 intensity values in the 250-700nm wavelength range",
        "device_output_rate": 0.01,
        "device_data_type": "text/numeric",
        "device_notes": "Points directly at the flame in 8 different locations",
        "headers": [
          "time",
          "Date",
          "Channel",
          "188.06",
          "188.53"
        ],
        "data_types": [
          "time",
          "date",
          "numeric",
          "numeric",
          "numeric"
        ],
        "data_units": [
          "nanoseconds",
          "date",
          "number",
          "dBm/nm",
          "dBm/nm"
        ],
        "melt_data": {
          "keep": [
            "time",
            "Date",
            "Channel"
          ],
          "var_name": "wavelength",
          "var_val": "intensity"
        },
        "influx_tags": ["Channel", "wavelength"]
      },
      {
        "device_id": "DATA_LOG",
        "device_name": "ANDOR Kymera328",
        "device_output": "2048 intensity values in the 250-700nm wavelength range",
        "device_output_rate": 0.9,
        "device_data_type": "text/numeric",
        "device_notes": "Points directly at the flame in 8 different locations",
        "headers": [
          "time",
          "Sample #",
          "Date",
          "SOL#",
          "Vol remaining [ml]",
          "Exhaust Flow",
          "Pressure"
        ],
        "data_types": [
          "time",
          "numeric",
          "date",
          "numeric",
          "numeric",
          "numeric",
          "numeric"
        ],
        "data_units": [
          "nanoseconds",
          "number",
          "date",
          "number",
          "milliliters",
          "liters/hour",
          "atm"
        ]
      },
      {
        "device_id": "PLIF",
        "device_name": "Planar Laser Induced Fluorescence",
        "device_output": "Image of flames showing specific excited species.",
        "device_output_rate": 10,
        "device_data_type": "image",
        "device_notes": "Points down, directly at the flame",
        "headers": [
          "PLIF"
        ],
        "data_types": [
          "image"
        ],
        "data_units": [
          "image"
        ]
      }
    ]
  }
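
To make the melt_data fields in the OES device more concrete, the sketch below reshapes one wide row into long rows using the keep, var_name, and var_val settings from the example. It assumes that melting works like a standard wide-to-long reshape (as in pandas.melt); the MDML's actual output format may differ. The function melt_row and the sample values are hypothetical.

```python
def melt_row(headers, values, keep, var_name, var_val):
    """Reshape one wide row into long rows, one per header not listed in keep."""
    row = dict(zip(headers, values))
    kept = {header: row[header] for header in keep}
    melted = []
    for header in headers:
        if header in keep:
            continue
        long_row = dict(kept)
        long_row[var_name] = header      # the original header becomes a value
        long_row[var_val] = row[header]  # the measurement under that header
        melted.append(long_row)
    return melted


headers = ["time", "Date", "Channel", "188.06", "188.53"]
values = [1573000000000000000, "2019-11-06", 1, -41.2, -40.7]  # made-up sample values
melted_rows = melt_row(headers, values, ["time", "Date", "Channel"], "wavelength", "intensity")
for long_row in melted_rows:
    print(long_row)
```

Under this assumption, each of the two wavelength columns becomes its own row carrying the kept time, Date, and Channel values, which is why "wavelength" can appear in influx_tags even though it is not one of the original headers.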

MDML Query Syntax

The query syntax is used to specify what data should be sent to a FuncX function. Using this syntax, the MDML builds and executes queries for InfluxDB to gather all data that needs to be sent to the FuncX function. For each device to be queried, a dictionary should be created with the following three keys:

device - value is the device ID specified in the configuration
variables - value is a list of variables to be queried; an empty list will grab all variables for the given device
last - value is the number of lines to return (most recent lines)

Below is an example of the syntax.

[
  {
    "device": "OES_VECTOR",
    "variables": ["intensity", "wavelength"],
    "last": 1
  },
  {
    "device": "DEVICE_J",
    "variables": [],
    "last" : 2
  }
]
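
The same structure can be built in Python rather than written by hand. This is a minimal sketch: the helper make_query is hypothetical, and the check that last is a positive integer is an assumption based on its description as a number of lines. Whether publish_analysis expects the list itself or a JSON string is not stated here, so the sketch only builds and prints the structure.

```python
import json


def make_query(device, variables=None, last=1):
    """Build one query entry with the three keys described above."""
    # Assumption: "last" counts lines, so it should be a positive integer.
    if not isinstance(last, int) or last < 1:
        raise ValueError("last must be a positive integer")
    return {"device": device, "variables": list(variables or []), "last": last}


queries = [
    make_query("OES_VECTOR", ["intensity", "wavelength"], last=1),
    make_query("DEVICE_J", last=2),  # empty list: all variables for the device
]
print(json.dumps(queries, indent=2))
```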

Helper Functions

The following functions are imported with the MDML Python client.

mdml_client.unix_time(ret_int = False)

Parameters:

ret_int (bool) - True to return an int, False to return a string

This function returns the current Unix time in nanoseconds as either an int or a string. It can be used to add a timestamp to your data before publishing. If the corresponding header in the configuration file is "time", InfluxDB (MDML's time-series database) will use this value as the official timestamp for that data entry. Without a "time" variable, InfluxDB will use the time at which the data was inserted, which is not ideal because it does not reflect when the data was actually created.
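
For readers who want to see what such a timestamp looks like, the sketch below produces an equivalent value with the standard library. It is an illustrative stand-in, not the client's implementation: the name unix_time_ns is hypothetical, and time.time_ns() requires Python 3.7 or later.

```python
import time


def unix_time_ns(ret_int=False):
    """Current Unix time in nanoseconds, as an int or a string (illustrative stand-in)."""
    now = time.time_ns()
    return now if ret_int else str(now)


# Prepend the timestamp to a reading whose first header is "time".
stamp = unix_time_ns()
reading = "\t".join([stamp, "21.5", "1.01"])  # made-up sample values
print(reading)
```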

mdml_client.read_image(file_name, resize_x = 0, resize_y = 0)

Parameters:

file_name (str) - file path to the image file to be read
resize_x (int) - resize width
resize_y (int) - resize height

Returns - a string of bytes for the image that can be passed directly to the publish_image method in experiment objects

Examples

Examples of the MDML client can be found on GitHub in the examples folder.

Time

This package includes a helper function "unix_time()" which outputs the current unix time in nanoseconds. This can be used to append a timestamp to your data - like in the example above. In the experiment's configuration, the corresponding data header must be "time" which ensures that InfluxDB (MDML's time-series database) will use it properly. Without it, the timestamp will be created by InfluxDB and represent when the data was stored, not when the data was actually generated.

Release history

1.2.15

Jul 27, 2022

1.2.14

Jan 28, 2022

1.2.13

Jan 28, 2022

1.2.12

Jan 20, 2022

1.2.11

Jan 10, 2022

1.2.10

Jan 10, 2022

1.2.8

Nov 2, 2021

1.2.7

Oct 25, 2021

1.2.6

Oct 21, 2021

1.2.5

Oct 21, 2021

1.2.4

Oct 12, 2021

1.2.3

Oct 7, 2021

1.2.2

Oct 5, 2021

1.2.1

Oct 1, 2021

1.2.0

Oct 1, 2021

1.1.108

Sep 22, 2021

1.1.107

Sep 21, 2021

1.1.106

Sep 21, 2021

1.1.105

Sep 20, 2021

1.1.104

Sep 16, 2021

1.1.103

Sep 16, 2021

1.1.102

Aug 27, 2021

1.1.101

Aug 26, 2021

1.1.100

Aug 23, 2021

1.1.99

Aug 12, 2021

1.1.98

Aug 12, 2021

1.1.97

Aug 12, 2021

1.1.96

Aug 12, 2021

1.1.95

Aug 12, 2021

1.1.94

Aug 11, 2021

1.1.93

Aug 11, 2021

1.1.92

Aug 9, 2021

1.1.91

Aug 9, 2021

1.1.90

Aug 9, 2021

1.1.89

Jul 27, 2021

1.1.88

Jul 26, 2021

1.1.87

Jul 23, 2021

1.1.86

Jul 22, 2021

1.1.85

Jul 22, 2021

1.1.84

Jun 28, 2021

1.1.83

Jun 28, 2021

1.1.82

Jun 24, 2021

1.1.81

Jun 14, 2021

1.1.80

Jun 14, 2021

1.1.79

Jun 8, 2021

1.1.78

Jun 7, 2021

1.1.77

Jun 7, 2021

1.1.76

Jun 7, 2021

1.1.75

Jun 1, 2021

1.1.74

May 20, 2021

1.1.73

May 20, 2021

1.1.72

May 20, 2021

1.1.71

May 20, 2021

1.1.70

May 20, 2021

1.1.69

May 19, 2021

1.1.68

May 18, 2021

1.1.67

May 14, 2021

1.1.66

May 13, 2021

1.1.65

Mar 24, 2021

1.1.64

Mar 24, 2021

1.1.63

Mar 17, 2021

1.1.62

Mar 17, 2021

1.1.61

Mar 17, 2021

1.1.60

Mar 17, 2021

1.1.59

Mar 10, 2021

1.1.58

Mar 10, 2021

1.1.57

Feb 26, 2021

1.1.56

Feb 26, 2021

1.1.55

Feb 26, 2021

1.1.54

Jan 11, 2021

1.1.53

Jan 11, 2021

1.1.52

Jan 11, 2021

1.1.51

Jan 6, 2021

1.1.50

Jan 6, 2021

1.1.49

Oct 9, 2020

1.1.47

Sep 23, 2020

1.1.46

Sep 18, 2020

1.1.45

Sep 14, 2020

1.1.44

Sep 14, 2020

1.1.43

Sep 14, 2020

1.1.42

Aug 27, 2020

1.1.41

Aug 25, 2020

1.1.40

Aug 20, 2020

1.1.38

Jun 26, 2020

1.1.37

Jun 19, 2020

1.1.36

Jun 19, 2020

1.1.35

Jun 19, 2020

1.1.34

Jun 19, 2020

1.1.32

Jun 15, 2020

1.1.31

Jun 12, 2020

1.1.30

Jun 12, 2020

1.1.29

Jun 12, 2020

1.1.28

Jun 10, 2020

1.1.27

Jun 10, 2020

1.1.26

Jun 5, 2020

1.1.25

Jun 5, 2020

1.1.24

Jun 5, 2020

1.1.23

Jun 4, 2020

1.1.22

May 14, 2020

1.1.21

Apr 28, 2020

1.1.20

Apr 28, 2020

1.1.19

Apr 24, 2020

1.1.18

Apr 23, 2020

1.1.17

Apr 22, 2020

1.1.16

Apr 22, 2020

1.1.15

Apr 22, 2020

1.1.14

Apr 21, 2020

1.1.13

Apr 21, 2020

1.1.12

Apr 21, 2020

1.1.11

Apr 14, 2020

1.1.10

Mar 3, 2020

1.1.9

Mar 3, 2020

1.1.8

Feb 25, 2020

1.1.7

Feb 25, 2020

1.1.6

Feb 19, 2020

1.1.5

Feb 19, 2020

1.1.4

Jan 31, 2020

1.1.3

Jan 31, 2020

1.1.2

Jan 27, 2020

1.1.1

Jan 24, 2020

1.1.0

Jan 22, 2020

1.0.9

Jan 13, 2020

1.0.8

Jan 10, 2020

1.0.7

Jan 8, 2020

1.0.6

Dec 20, 2019

1.0.5

Dec 13, 2019

1.0.4

Dec 13, 2019

1.0.3

Dec 13, 2019

1.0.1

Dec 11, 2019

1.0.0

Dec 11, 2019

0.9.0

Nov 11, 2019

0.8.9

Nov 11, 2019

0.8.8

Nov 11, 2019

This version

0.8.7

Nov 6, 2019

0.8.6

Nov 6, 2019

0.8.5

Nov 6, 2019

0.8.4

Nov 6, 2019

0.8.3

Nov 6, 2019

0.8.2

Nov 6, 2019

0.8.1

Nov 6, 2019

0.8.0

Nov 6, 2019

0.7.9

Nov 6, 2019

0.7.8

Nov 5, 2019

0.7.7

Nov 5, 2019

0.7.6

Nov 5, 2019

0.7.5

Nov 4, 2019

0.7.4

Nov 4, 2019

0.7.2

Nov 1, 2019

0.6.6

Oct 29, 2019

0.6.4

Oct 24, 2019

0.6.2

Oct 22, 2019

0.6.1

Oct 22, 2019

0.6.0

Oct 18, 2019

0.5.9

Oct 17, 2019

Download files

Source Distribution

mdml_client-0.8.7.tar.gz (12.4 kB)

Uploaded Nov 6, 2019 Source

Built Distribution

mdml_client-0.8.7-py3-none-any.whl (14.4 kB)

Uploaded Nov 6, 2019 Python 3

File details

Details for the file mdml_client-0.8.7.tar.gz.

File metadata

Download URL: mdml_client-0.8.7.tar.gz
Upload date: Nov 6, 2019
Size: 12.4 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/2.0.0 pkginfo/1.5.0.1 requests/2.20.0 setuptools/39.0.1 requests-toolbelt/0.9.1 tqdm/4.36.1 CPython/3.6.7

File hashes

Hashes for mdml_client-0.8.7.tar.gz
Algorithm	Hash digest
SHA256	`0f3058fa4084d0b4b5fde85aaaceeb34079b28d506f3076af998ccddad7e6e7d`
MD5	`d93d3fd855cf4248a52d05e1610f59b7`
BLAKE2b-256	`3af0730d42a60942e298a22f4f434a9b9cad0cee54983781ddaf649f88569f44`

File details

Details for the file mdml_client-0.8.7-py3-none-any.whl.

File metadata

Download URL: mdml_client-0.8.7-py3-none-any.whl
Upload date: Nov 6, 2019
Size: 14.4 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/2.0.0 pkginfo/1.5.0.1 requests/2.20.0 setuptools/39.0.1 requests-toolbelt/0.9.1 tqdm/4.36.1 CPython/3.6.7

File hashes

Hashes for mdml_client-0.8.7-py3-none-any.whl
Algorithm	Hash digest
SHA256	`194cfa9a9bfb29f877a769fde62bf0d93feabd2b8778c67a4bc7fd5c63ad30dc`
MD5	`96882142585311f5fa3edd9e8c20cbaa`
BLAKE2b-256	`655b40678f66c0008080fb435aa29a5756a3d503cc4deaead56872d01acb8077`

mdml-client 0.8.7
