Combinatorial h+/e- Sample Investigation using Voltaic Measurements
Project description
COHESIVM: Combinatorial h+/e- Sample Investigation using Voltaic Measurements
Introduction
The COHESIVM Python package provides a versatile framework for conducting combinatorial voltaic measurements in
scientific research and development. In order to enable a broad range of electrical and electrochemical analysis
methods, COHESIVM uses a generalized design of the main components which facilitates the extension and adaptation to
different measurement devices and routines. These components are cohesively put together in an
Experiment class which runs compatibility checks and executes the actual measurements.
Key Features:
- Modular Design: COHESIVM adopts a module-oriented approach where components such as measurement devices
(
Device), contacting interfaces (Interface), and measurement routines (Measurement) are abstracted into interchangeable units. This modular architecture enhances flexibility in experimental setups and makes it easy to add new component implementations. - Combinatorial Flexibility: By abstracting the class for the contacting interface, COHESIVM enables diverse configurations for sample investigation. Researchers can simply implement their combinatorial sample design or even interface a robotic contacting system.
- Data Handling: Collected data is stored in a structured HDF5 database
format using the
Databaseclass, ensuring efficient data management and accessibility.Metadatais collected based on the DCMI standard which is extended by COHESIVM-specific metadata terms. - Analysis and GUIs: Alongside the measurement routines, analysis functions and plots can be implemented, extending
the
Analysisbase class. Together with the graphical user interface (also available for conducting experiments and reviewing the database contents), initial screening of the data is facilitated.
Table of Contents
Getting Started
Installation
Using pip
To install the core COHESIVM package from the Python Package Index (PyPI), simply run:
pip install cohesivm
This command will download and install the latest stable version of COHESIVM and its core dependencies.
If you want to use the Graphical User Interfaces inside your Jupyter environment, make sure to
specify the gui extra:
pip install cohesivm[gui]
Further extras exist for the implemented devices and for developers (refer to the pyproject.toml in the source
files to get a listing of all available optional dependencies).
Cloning from GitHub
If you want to install the development version of the package from the GitHub repository, follow these steps:
- Clone the repository to your local machine:
git clone https://github.com/mxwalbert/cohesivm.git - Navigate into the cloned directory:
cd cohesivm - Install the package and its dependencies:
pip install .
Configuration
A config.ini file should be placed in the root of your project to configure the hardware ports/addresses of the
contact interfaces and measurement devices. Some DCMI metadata terms also need to be defined there. COHESIVM implements
a config parser which allows to access these values, e.g.:
>>> import cohesivm
>>> cohesivm.config.get_option('DCMI', 'creator')
Dow, John
A working file with the implemented interfaces and devices can be copied from the GitHub repository, or you can create your own from this template:
# This file is used to configure the project as well as the devices and interfaces (e.g., COM ports, addresses, ...).
# METADATA ------------------------------------------------------------------------------------------------------------
[DCMI]
# The following options correspond to the terms defined by the Dublin Core Metadata Initiative.
# See https://purl.org/dc/terms/ for detailed descriptions.
publisher = "Your Company Ltd."
creator = "Dow, John"
rights = <https://link.to/licence>
subject = "modular design"; "combinatorial flexibility"; "data handling"; "analysis and gui"
# ---------------------------------------------------------------------------------------------------------------------
# INTERFACES ----------------------------------------------------------------------------------------------------------
[NAME_OF_USB_INTERFACE]
com_port = 42
# ---------------------------------------------------------------------------------------------------------------------
# DEVICES -------------------------------------------------------------------------------------------------------------
[NAME_OF_NETWORK_DEVICE]
address = localhost
port = 8888
timeout = 0.1
# ---------------------------------------------------------------------------------------------------------------------
Basic Usage
With working implementations of the main components (Device,
Interface, Measurement), setting up and running an
experiment only takes a few lines of code:
from cohesivm import config
from cohesivm.database import Database, Dimensions
from cohesivm.devices.agilent import Agilent4156C
from cohesivm.measurements.iv import CurrentVoltageCharacteristic
from cohesivm.interfaces import MA8X8
from cohesivm.experiment import Experiment
from cohesivm.progressbar import ProgressBar
# Create a new or load an existing database
db = Database('Test.h5')
# Configure the components
smu = Agilent4156C.SweepVoltageSMUChannel()
device = Agilent4156C.Agilent4156C(channels=[smu], **config.get_section('Agilent4156C'))
interface = MA8X8(com_port=config.get_option('MA8X8', 'com_port'), pixel_dimensions=Dimensions.Circle(radius=0.425))
measurement = CurrentVoltageCharacteristic(start_voltage=-2.0, end_voltage=2.0, voltage_step=0.01, illuminated=True)
# Combine the components in an experiment
experiment = Experiment(
database=db,
device=device,
interface=interface,
measurement=measurement,
sample_id='test_sample_42',
selected_contacts=None
)
# Optionally set up a progressbar
pbar = ProgressBar(experiment)
# Run the experiment
with pbar.show():
experiment.quickstart()
If you want to change the measurement device to a different one, you only need to adjust the lines for the
Channel and the Device accordingly:
from cohesivm.devices.ossila import OssilaX200
smu = OssilaX200.VoltageSMUChannel()
device = OssilaX200.OssilaX200(channels=[smu], **config.get_section('OssilaX200'))
Graphical User Interfaces
If you work with Jupyter, you may use the Graphical User Interfaces (GUIs) which are implemented in the form of Jupyter Widgets. Currently, three GUIs are available:
Experiment GUI
On the left panel "Control", you see the current
ExperimentState, followed by a
representation of the Interface and the control buttons at the bottom. The circles are
annotated with the contact_ids and the colors correspond to their current state.
On the right panel "Plot", the currently running Measurement is displayed. The plot is
automatically updated as soon as new measurement data arrives in the
data_stream of the Experiment object.
Database GUI
This GUI enables to display and filter the measurement data which is stored in an HDF5 file. At the top, you select to
display the data grouped in terms of the
Measurement or by the
sample_name of the Experiment object. If you
choose the former one, you may additionally filter the data by means of measurement parameters. The button to the very
right of each data row enables you to copy the dataset path, to access it in the Database.
Analysis GUI
Similar to the Experiment GUI, the "Interface" panel represents the contacts with their respective IDs. They can be
clicked to display the measured data in the "Plot" panel to the right. There, the arrows can be used to switch between
functions that are defined in the Analysis class. The
results of the functions, which are also implemented there, are shown in the table
below.
Detailed guides to work with the GUIs can be found in the documentation.
Examples
Run an Experiment
Follow the Basic Usage example to set up and run an
Experiment. If you do not have all components ready yet, follow these tutorials:
To follow the other examples you may just run the code from the Basic Usage example even if you do not have access to the hardware. This will fail but create an HDF5 file and store an empty dataset entry.
Manage the Data
After you collected some data and stored it in an HDF5 file, you can use the Database class
to work with it. First, initialise the Database object and list the samples which are stored
there:
>>> from cohesivm.database import Database
>>> db = Database('Test.h5')
>>> db.get_sample_ids()
['test_sample_42']
This is exactly the sample_id which was specified when the
Experiment was configured, and it can be used to retrieve the actual
dataset path in the Database object:
>>> db.filter_by_sample_id('test_sample_42')
['/CurrentVoltageCharacteristic/55d96687ee75aa11:26464063430fe52f:a69a946e7a02e547:c8965a35118ce6fc:67a8bfb44702cfc7:8131a44cea4d4bb8/2024-07-01T10:44:59.033161-test_sample_42']
The resulting list contains the path strings for all experiments with the specified
sample_id (currently only one entry). These strings get quite long because they
contain the name of the Measurement
procedure, followed by a hashed representation of the settings dictionary,
and finally the datetime combined with the sample_id. With this
dataset path, you may retrieve some information from the
Metadata object which got created by the Experiment:
>>> dataset = db.filter_by_sample_id('test_sample_42')[0]
>>> metadata = db.load_metadata(dataset)
>>> metadata.sample_id, metadata.device, metadata.interface, metadata.measurement
('test_sample_42', 'Agilent4156C', 'MA8X8', 'CurrentVoltageCharacteristic')
Storing a new dataset is less trivial because you need a fully qualified Metadata object,
which asks for a large number of arguments. Anyway, this is usually handled by the
Experiment class because it guarantees that the specified components are compatible. For
testing, the Metadata object from above may be used to initialize a new dataset:
>>> db.initialize_dataset(metadata)
'/CurrentVoltageCharacteristic/55d96687ee75aa11:26464063430fe52f:a69a946e7a02e547:c8965a35118ce6fc:67a8bfb44702cfc7:8131a44cea4d4bb8/2024-07-01T10:46:05.910371-test_sample_42'
This yields practically the same dataset path as before, only the datetime is different. Adding data entries, on
the other hand, is fairly simple because you only need to specify the dataset and the contact_id (alongside
the data of course):
>>> db.save_data(np.array([1]), dataset)
>>> db.save_data(np.array([42]), dataset, '1')
Finally, you may load a data entry by specifying the contact_id (or a list of several) or load an entire dataset,
including the Metadata:
>>> db.load_data(dataset, '0')
[array([1])]
>>> db.load_data(dataset, ['0', '1'])
[array([1]), array([42])]
>>> db.load_dataset(dataset)
({'0': array([1]), '1': array([42])}, Metadata(CurrentVoltageCharacteristic, Agilent4156C, MA8X8))
The Database class also implements methods for filtering datasets based on
settings of the Measurement. Check out the
documentation of the filter_by_settings and
filter_by_settings_batch to learn more.
Analyse the Results
The Analysis is tightly bound with the Measurement because
this will determine how the data is shaped and which features you want to extract from it. Therefore, the base class
should be extended as explained in this tutorial:
However, in the following example, the base class will be used to show the basic functionality.
Since the MA8X8 interface was used in the previous examples, the dataset should be filled
with data accordingly. If you already have an HDF5 file from following the basic usage example ("Test.h5"), then
this script should do the job:
import numpy as np
from cohesivm.database import Database
# load existing data and corresponding metadata
db = Database('Test.h5')
dataset = db.filter_by_sample_id('test_sample_42')[0]
metadata = db.load_metadata(dataset)
# create a new data to not interfere with previous examples
dataset = db.initialize_dataset(metadata)
# iterate over contact_ids and save data arrays
for contact_id in metadata.contact_ids:
db.save_data(np.array(range(10), dtype=[('Voltage (V)', float)]), dataset, contact_id)
# load the data
data, metadata = db.load_dataset(dataset)
This time, the save_data method was used correctly (contrary to the previous
examples) because the provided data should always be
a structured array.
Next, functions and plots must be defined:
>>> def maximum(contact_id: str) -> float:
... return max(data[contact_id]['Voltage (V)'])
>>> functions = {'Maximum': maximum}
>>> plots = {} # will be spared for simplicity (refer to the tutorial instead)
This approach seems too complex for what the function does, but it makes sense if you consider that this should be
implemented in a separate Analysis class. There, the data is stored as a property and the
functions (i.e., methods) have direct access to it. Due to the use of structured
arrays (which facilitate to store the quantity and the unit alongside the data), the label also needs to be stated
explicitly. But, again, this will normally be available as a property.
In the following, the class is initialized with and without using the Metadata from the
dataset. The former approach has the advantage that all available fields could be accessed by the
functions, e.g., values that are stored in the
measurement_settings.
>>> from cohesivm.analysis import Analysis
# without metadata, the contact_position_dict must be provided
>>> analysis = Analysis(functions, plots, data, metadata.contact_position_dict)
# with metadata, additional metadata fields can be used in the analysis
>>> analysis = Analysis(functions, plots, (data, metadata))
>>> analysis.metadata.measurement_settings['illuminated']
True
The main usage of the Analysis, besides providing the framework for
the Analysis GUI, is to quickly generate maps of analysis results:
>>> analysis.generate_result_maps('Maximum')[0]
array([[9., 9., 9., 9., 9., 9., 9., 9.],
[9., 9., 9., 9., 9., 9., 9., 9.],
[9., 9., 9., 9., 9., 9., 9., 9.],
[9., 9., 9., 9., 9., 9., 9., 9.],
[9., 9., 9., 9., 9., 9., 9., 9.],
[9., 9., 9., 9., 9., 9., 9., 9.],
[9., 9., 9., 9., 9., 9., 9., 9.],
[9., 9., 9., 9., 9., 9., 9., 9.]])
As expected, the maximum value of the generated data is placed in a 2D numpy array on locations corresponding to
the contact_positions.
Package Reference
The package reference can be found in the documentation.
Contributing
We welcome contributions from the community to make COHESIVM better! If you'd like to contribute an implementation
of a Device, an Interface,
a Measurement or an Analysis, please follow these steps:
- Fork the repository to your own GitHub account.
- Clone your forked repository to your local machine.
- Create a new branch for your new component: git checkout -b my-new-component.
- Make your changes and ensure the code passes existing tests.
- Add new tests for your changes, if applicable.
- Commit your changes with clear and concise messages.
- Push your branch to your forked repository: git push origin my-new-feature.
- Open a pull request to the main repository, describing the changes and why they should be merged.
Please make sure to follow the project's structure. The best way to start is to have a look at the tutorials given in
the documentation. Also, don't forget to write tests for your newly implemented feature which may requires a new custom
marker in the conftest.py (e.g., if you implement a physical device for which the tests will always fail if it is
not connected).
You may also contribute by submitting feature requests, bugs and other issues over GitHub.
Thank you for your contributions!
License
The source code of this project is licensed under the MIT license, and the hardware design and schematics are licensed under the CERN OHL v2 Permissive license.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.