openBIS connection and interaction, optimized for use with Jupyter


Welcome to pyBIS!

pyBIS is a Python module for interacting with openBIS. pyBIS is designed to be most useful in a Jupyter Notebook or IPython environment, especially if you are developing Python scripts for automation. Jupyter Notebooks offer a kind of IDE for openBIS, supporting TAB completion and immediate data checks, which should make a researcher's life easier.

Dependencies and Requirements

pyBIS relies on the openBIS API v3
openBIS version 16.05.2 or newer is required
19.06.5 or later is recommended
pyBIS uses Python 3.6 or newer and the Pandas module

Installation

pip install --upgrade pybis

That command will download and install pyBIS and all its dependencies. If pyBIS is already installed, it will be upgraded to the latest version.

If you haven't done so yet, install Jupyter and/or Jupyter Lab (the next generation of Jupyter):

pip install jupyter
pip install jupyterlab

General Usage

TAB completion and other hints in Jupyter / IPython

In a Jupyter Notebook or IPython environment, pyBIS helps you to enter commands.
After every dot . you can hit the TAB key to see the available commands.
If you are unsure what parameters a method takes, add a question mark right after the method and hit SHIFT+ENTER.
Jupyter will then look up the signature of the method and show its docstring.
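
Outside Jupyter, the information that the question mark displays can be obtained with Python's standard inspect module. The following is a minimal sketch; describe and example_method are hypothetical names used for illustration and are not part of pyBIS.

```python
import inspect


def describe(func):
    """Return the signature and docstring of a callable as one string."""
    signature = inspect.signature(func)
    doc = inspect.getdoc(func) or "(no docstring)"
    return f"{func.__name__}{signature}\n{doc}"


def example_method(code, description=None):
    """Stand-in for any method; pass e.g. o.get_sample instead."""
    return code, description


print(describe(example_method))
```

The same helper should work on bound methods of a connected Openbis object, since it only relies on the method's signature and docstring.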

Checking input

When working with properties of entities, they might use a controlled vocabulary or be of a specific property type.
Add an underscore _ character right after the property and hit SHIFT+ENTER to show the valid values.
When a property only accepts a controlled vocabulary, you will be shown the valid terms in a nicely formatted table.
If you try to assign an invalid value to a property, you'll receive an error immediately.

Glossary

spaces: used for authorisation, e.g. to separate two working groups. If you have permissions in a space, you can see everything in that space, but not necessarily in another space (unless you have the permission).
projects: a space consists of many projects.
experiments / collections: a project contains many experiments. Experiments can have properties.
samples / objects: an experiment contains many samples. Samples can have properties.
dataSet: a dataSet contains the actual data files, either physical (stored in the openBIS dataStore) or linked.
attributes: every entity above has a number of attributes. They are the same across all instances of openBIS and independent of the entity type.
properties: Additional specific key-value pairs, available for these entities:
- experiments
- samples
- dataSets
every single instance of an entity must be of a specific entity type (see below). The type defines the set of properties.
experiment type / collection type: a type for experiments which specifies its properties
sample type / object type: a type for samples / objects which specifies its properties
dataSet type: a type for dataSets which specifies its properties
property type: a single property, as defined in the entity types above. It can be of a classic data type (e.g. INTEGER, VARCHAR, BOOLEAN) or its values can be controlled (CONTROLLEDVOCABULARY).
plugin: a script written in Jython which allows property values to be checked in an even more detailed fashion

connect to openBIS

login

In an interactive session e.g. inside a Jupyter notebook, you can use getpass to enter your password safely:

from pybis import Openbis
o = Openbis('https://example.com')

import getpass
password = getpass.getpass()

o.login('username', password, save_token=True)   # save the session token in ~/.pybis/example.com.token

In a script you would rather use two environment variables to provide username and password:

import os

from pybis import Openbis
o = Openbis(os.environ['OPENBIS_HOST'])

o.login(os.environ['OPENBIS_USERNAME'], os.environ['OPENBIS_PASSWORD'])
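
A script that reads os.environ directly fails with a bare KeyError when a variable is not set. The sketch below collects the three variables used above and reports all missing ones at once. The helper name read_openbis_credentials is hypothetical and not part of pyBIS.

```python
import os


def read_openbis_credentials(env=os.environ):
    """Return (host, username, password) or raise if a variable is missing."""
    names = ("OPENBIS_HOST", "OPENBIS_USERNAME", "OPENBIS_PASSWORD")
    # an unset or empty variable counts as missing
    missing = [name for name in names if not env.get(name)]
    if missing:
        raise RuntimeError("missing environment variables: " + ", ".join(missing))
    return tuple(env[name] for name in names)
```

The returned values can then be passed on as before: host to Openbis(...), username and password to o.login(...).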

Verify certificate

By default, your SSL certificate is verified. If you have a test instance with a self-signed certificate, you'll need to turn off this verification explicitly:

from pybis import Openbis
o = Openbis('https://test-openbis-instance.com', verify_certificates=False)

Check session token, logout()

Check whether your session, i.e. the session token, is still valid, and log out:

print(f"Session is active: {o.is_session_active()} and token is {o.token}")
o.logout()
print(f"Session is active: {o.is_session_active()}")

Caching

With pyBIS 1.17.0, a lot of caching has been introduced to speed up lookups of objects that do not change often. If you encounter any problems, you can turn it off like this:

o = Openbis('https://example.com', use_cache=False)

# or later in the script
o.use_cache = False
o.clear_cache()
o.clear_cache('sampleType')

Mount openBIS dataStore server

Prerequisites: FUSE / SSHFS

Mounting an openBIS dataStore server requires FUSE / SSHFS to be installed (requires root privileges). The mounting itself requires no root privileges.

Mac OS X

Follow the installation instructions on https://osxfuse.github.io

Linux (CentOS 7)

$ sudo yum install epel-release
$ sudo yum --enablerepo=epel -y install fuse-sshfs
$ user="$(whoami)"
$ sudo usermod -a -G fuse "$user"

After the installation, an sshfs command should be available.

Mount dataStore server with pyBIS

Because the mount/unmount procedure differs from platform to platform, pyBIS offers a few simple methods:

o.mount()
o.mount(username, password, hostname, mountpoint, volname)
o.is_mounted()
o.unmount()
o.get_mountpoint()

Currently, mounting is supported for Linux and Mac OS X only.

All arguments that are not provided are reused from a previous login() command. If no mountpoint is provided, the default mountpoint will be ~/hostname. If this directory does not exist, it will be created. The directory must be empty before mounting.
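
The mountpoint rules can be illustrated with a few lines of standard-library Python. This is only a sketch of the behaviour described here, not pyBIS's own implementation; the function prepare_mountpoint and its home parameter are hypothetical.

```python
from pathlib import Path


def prepare_mountpoint(hostname, mountpoint=None, home=None):
    """Return an existing, empty directory that can be used as a mountpoint."""
    home = Path(home) if home is not None else Path.home()
    # default mountpoint is ~/hostname
    path = Path(mountpoint) if mountpoint is not None else home / hostname
    path.mkdir(parents=True, exist_ok=True)   # create it if it does not exist
    if any(path.iterdir()):                   # the directory must be empty
        raise ValueError(f"mountpoint {path} is not empty")
    return path
```

Running such a check before calling o.mount() gives an early, readable error if the target directory already contains files.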

Masterdata

openBIS stores quite a lot of meta-data along with your dataSets. The collection of data that describes this meta-data (i.e. meta-meta-data) is called masterdata. It consists of:

sample types
dataSet types
material types
experiment types
property types
vocabularies
vocabulary terms
plugins (jython scripts that allow complex data checks)
tags
semantic annotations

browse masterdata

sample_types = o.get_sample_types()  # get a list of sample types 
sample_types.df                      # DataFrame object
st = o.get_sample_types()[3]         # get 4th element of that list
st = o.get_sample_type('YEAST')
st.code
st.generatedCodePrefix
st.attrs.all()                       # get all attributes as a dict
st.get_validationPlugin()            # returns a plugin object

st.get_property_assignments()        # show the list of properties
                                     # for that sample type
o.get_material_types()
o.get_dataset_types()
o.get_experiment_types()
o.get_collection_types()

o.get_property_types()
pt = o.get_property_type('BARCODE_COMPLEXITY_CHECKER')
pt.attrs.all()

o.get_plugins()
pl = o.get_plugin('Diff_time')
pl.script  # the Jython script that processes this property

o.get_vocabularies()
o.get_vocabulary('BACTERIAL_ANTIBIOTIC_RESISTANCE')
o.get_terms(vocabulary='STORAGE')
o.get_tags()

create property types

Samples (objects), experiments (collections) and dataSets contain type-specific properties. When you create a new sample, experiment or dataSet of a given type, the set of properties is well defined. Also, the values of these properties are type-checked.

The first step in creating a new entity type is to create a so-called property type:

pt = o.new_property_type(
    code        = 'MY_NEW_PROPERTY_TYPE', 
    label       = 'yet another property type', 
    description = 'my first property',
    dataType    = 'VARCHAR',
)

pt_int = o.new_property_type(
    code        = '$DEFAULT_OBJECT_TYPE', 
    label       = 'default object type for ELN-LIMS', 
    dataType    = 'VARCHAR',
    managedInternally = True,
)

pt_voc = o.new_property_type(
    code        = 'MY_CONTROLLED_VOCABULARY', 
    label       = 'label me', 
    description = 'give me a description',
    dataType    = 'CONTROLLEDVOCABULARY',
    vocabulary  = 'STORAGE',
)

The dataType attribute can contain any of these values:

INTEGER
VARCHAR
MULTILINE_VARCHAR
REAL
TIMESTAMP
BOOLEAN
HYPERLINK
XML
CONTROLLEDVOCABULARY
MATERIAL

When choosing CONTROLLEDVOCABULARY, you must specify a vocabulary attribute (see example). Likewise, when choosing MATERIAL, a materialType attribute must be provided. PropertyTypes that start with a $ are by definition managedInternally and therefore this attribute must be set to True.
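
The three rules above can be checked locally before anything is sent to the server. The following sketch encodes them in plain Python; check_property_type_args is a hypothetical helper (not a pyBIS function) whose parameter names mirror the arguments of new_property_type() shown earlier.

```python
DATA_TYPES = {
    "INTEGER", "VARCHAR", "MULTILINE_VARCHAR", "REAL", "TIMESTAMP",
    "BOOLEAN", "HYPERLINK", "XML", "CONTROLLEDVOCABULARY", "MATERIAL",
}


def check_property_type_args(code, dataType, vocabulary=None,
                             materialType=None, managedInternally=False):
    """Return a list of problems with the given arguments (empty if none)."""
    problems = []
    if dataType not in DATA_TYPES:
        problems.append(f"unknown dataType: {dataType}")
    if dataType == "CONTROLLEDVOCABULARY" and not vocabulary:
        problems.append("CONTROLLEDVOCABULARY requires a vocabulary")
    if dataType == "MATERIAL" and not materialType:
        problems.append("MATERIAL requires a materialType")
    if code.startswith("$") and not managedInternally:
        problems.append("codes starting with $ require managedInternally=True")
    return problems


print(check_property_type_args("$DEFAULT_OBJECT_TYPE", "VARCHAR"))
```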

create sample types / object types

The second step (after creating a property type, see above) is to create the sample type. The new name for sample is object. You can use both methods interchangeably:

new_sample_type() == new_object_type()

sample_type = o.new_sample_type(
    code                = 'my_own_sample_type',  # mandatory
    generatedCodePrefix = 'S',                   # mandatory
    description         = '',
    autoGeneratedCode   = True,
    subcodeUnique       = False,
    listable            = True,
    showContainer       = False,
    showParents         = True,
    showParentMetadata  = False,
    validationPlugin    = 'Has_Parents'          # see plugins below
)
sample_type.save()

assign and revoke properties to sample type / object type

The third step, after saving the sample type, is to assign properties to (or revoke them from) the newly created sample type. This assignment procedure applies to all entity types (dataset type, experiment type).

sample_type.assign_property(
    prop                 = 'diff_time',           # mandatory
    section              = '',
    ordinal              = 5,
    mandatory            = True,
    initialValueForExistingEntities = 'initial value',
    showInEditView       = True,
    showRawValueInForms  = True
)
sample_type.revoke_property('diff_time')
sample_type.get_property_assignments()

create a dataset type

The second step (after creating a property type, see above) is to create the dataset type. The third step is to assign or revoke the properties to the newly created dataset type.

dataset_type = o.new_dataset_type(
    code                = 'my_dataset_type',       # mandatory
    description         = None,
    mainDataSetPattern  = None,
    mainDataSetPath     = None,
    disallowDeletion    = False,
    validationPlugin    = None,
)
dataset_type.save()
dataset_type.assign_property('property_name')
dataset_type.revoke_property('property_name')
dataset_type.get_property_assignments()

create an experiment / collection type

The second step (after creating a property type, see above) is to create the experiment type.

The new name for experiment is collection. You can use both methods interchangeably:

new_experiment_type() == new_collection_type()

experiment_type = o.new_experiment_type(
    code             = 'my_experiment_type',   # mandatory
    description      = None,
    validationPlugin = None,
)
experiment_type.save()
experiment_type.assign_property('property_name')
experiment_type.revoke_property('property_name')
experiment_type.get_property_assignments()

create material types

Materials and material types are deprecated in newer versions of openBIS.

material_type = o.new_material_type(
    code             = 'my_material_type',     # mandatory
    description      = None,
    validationPlugin = None,
)
material_type.save()
material_type.assign_property('property_name')
material_type.revoke_property('property_name')
material_type.get_property_assignments()

create plugins

Plugins are Jython scripts that can perform more complex data checks than ordinary types and vocabularies allow. They are assigned to entity types (dataset type, sample type etc.). Documentation and examples can be found in the openBIS documentation.

pl = o.new_plugin(
    name       ='my_new_entry_validation_plugin',
    pluginType ='ENTITY_VALIDATION',       # or 'DYNAMIC_PROPERTY' or 'MANAGED_PROPERTY',
    entityKind = None,                     # or 'SAMPLE', 'MATERIAL', 'EXPERIMENT', 'DATA_SET'
    script     = 'def calculate(): pass'   # a JYTHON script
)
pl.save()

Users, Groups and RoleAssignments

o.get_groups()
group = o.new_group(code='group_name', description='...')
group = o.get_group('group_name')
group.save()
group.assign_role(role='ADMIN', space='DEFAULT')
group.get_roles() 
group.revoke_role(role='ADMIN', space='DEFAULT')

group.add_members(['admin'])
group.get_members()
group.del_members(['admin'])
group.delete()

o.get_persons()
person = o.new_person(userId='username')
person.space = 'USER_SPACE'
person.save()
# person.delete() is currently not possible.

person.assign_role(role='ADMIN', space='MY_SPACE')
person.assign_role(role='OBSERVER')
person.get_roles()
person.revoke_role(role='ADMIN', space='MY_SPACE')
person.revoke_role(role='OBSERVER')

o.get_role_assignments()
o.get_role_assignments(space='MY_SPACE')
o.get_role_assignments(group='MY_GROUP')
ra = o.get_role_assignment(techId)
ra.delete()

Spaces

space = o.new_space(code='space_name', description='')
space.save()
o.get_spaces(
    start_with = 0,                   # start_with and count
    count      = 10,                  # enable paging
)
space = o.get_space('MY_SPACE')

# get individual attributes
space.code
space.description
space.registrator
space.registrationDate
space.modifier
space.modificationDate

# set individual attribute
# most of the attributes above are set automatically and cannot be modified.
space.description = '...'

# get all attributes as a dictionary
space.attrs.all()

space.delete('reason for deletion')

Projects

project = o.new_project(
    space       = space, 
    code        = 'project_name',
    description = 'some project description'
)
project = space.new_project(
    code        = 'project_code',
    description = 'project description'
)
project.save()

o.get_projects(
    space       = 'MY_SPACE',         # show only projects in MY_SPACE
    start_with  = 0,                  # start_with and count
    count       = 10,                 # enable paging
)
o.get_projects(space='MY_SPACE')
space.get_projects()

project.get_experiments()
project.get_attachments()
project.add_attachment(fileName='testfile', description='another file', title='one more attachment')
project.download_attachments()

# get individual attributes
project.code
project.description

# set individual attribute
project.description = '...'

# get all attributes as a dictionary
project.attrs.all()

project.freeze = True
project.freezeForExperiments = True
project.freezeForSamples = True

Samples / Objects

The new name for sample is object. You can use both names interchangeably:

get_sample() = get_object()
new_sample() = new_object()
get_samples() = get_objects()

etc.

sample = o.new_sample(
    type       = 'YEAST', 
    space      = 'MY_SPACE',
    experiment = '/MY_SPACE/MY_PROJECT/EXPERIMENT_1',
    parents    = [parent_sample, '/MY_SPACE/YEA66'], 
    children   = [child_sample],
    props      = {"name": "some name", "description": "something interesting"}
)
sample = space.new_sample( type='YEAST' )
sample.save()

sample = o.get_sample('/MY_SPACE/MY_SAMPLE_CODE')
sample = o.get_sample('20170518112808649-52')
samples= o.get_samples(type='UNKNOWN')    # search for samples, see below

# get individual attributes
sample.space
sample.code
sample.permId
sample.identifier
sample.type  # once the sample type is defined, you cannot modify it

# set attribute
sample.space = 'MY_OTHER_SPACE'

sample.experiment    # a sample can belong to one experiment only
sample.experiment = '/MY_SPACE/MY_PROJECT/MY_EXPERIMENT'

sample.project
sample.project = '/MY_SPACE/MY_PROJECT'  # only works if project samples are enabled

sample.tags
sample.tags = ['guten_tag', 'zahl_tag' ]

sample.attrs.all()                    # returns all attributes as a dict
sample.props.all()                    # returns all properties as a dict

sample.get_attachments()
sample.download_attachments()
sample.add_attachment('testfile.xls')

sample.delete('deleted for some reason')

create many samples in a transaction

Creating a sample takes some time. If you need to create many samples, you might want to create them in one transaction. This will transfer all your sample data at once. The upside is the gain in speed. The downside is that this is an all-or-nothing operation: either all samples will be registered or none (if any error occurs).

You can mix the creation of new samples and the update of existing samples within the same transaction.

sample1 = o.new_sample(...)
sample2 = o.new_sample(...)
sample3 = o.new_sample(...)

trans = o.new_transaction()
trans.add(sample1)
trans.add(sample2)
trans.add(sample3)

trans.commit()
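
For a longer list of samples, the pattern above can be wrapped in a small function. This is a sketch that uses only the new_transaction(), add() and commit() calls shown here; the name register_all is hypothetical.

```python
def register_all(o, samples):
    """Add every sample to one transaction and commit it in a single call."""
    trans = o.new_transaction()
    for sample in samples:
        trans.add(sample)
    # all-or-nothing: either every sample is registered or none
    trans.commit()
    return trans
```

Called as register_all(o, [sample1, sample2, sample3]), it behaves like the explicit version above.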

parents, children, components and container

sample.get_parents()
sample.set_parents(['/MY_SPACE/PARENT_SAMPLE_NAME'])
sample.add_parents('/MY_SPACE/PARENT_SAMPLE_NAME')
sample.del_parents('/MY_SPACE/PARENT_SAMPLE_NAME')

sample.get_children()
sample.set_children('/MY_SPACE/CHILD_SAMPLE_NAME')
sample.add_children('/MY_SPACE/CHILD_SAMPLE_NAME')
sample.del_children('/MY_SPACE/CHILD_SAMPLE_NAME')

# A Sample may belong to another Sample, which acts as a container.
# As opposed to DataSets, a Sample may only belong to one container.
sample.container    # returns a sample object
sample.container = '/MY_SPACE/CONTAINER_SAMPLE_NAME'   # watch out, this will change the identifier of the sample to:
                                                       # /MY_SPACE/CONTAINER_SAMPLE_NAME:SAMPLE_NAME
sample.container = ''                                  # this will remove the container. 

# A Sample may contain other Samples, in order to act like a container (see above)
# The Sample-objects inside that Sample are called «components» or «contained Samples»
# You may also use the xxx_contained() functions, which are just aliases.
sample.get_components()
sample.set_components('/MY_SPACE/COMPONENT_NAME')
sample.add_components('/MY_SPACE/COMPONENT_NAME')
sample.del_components('/MY_SPACE/COMPONENT_NAME')

sample tags

sample.get_tags()
sample.set_tags('tag1')
sample.add_tags(['tag2','tag3'])
sample.del_tags('tag1')

useful tricks when dealing with properties, using Jupyter or IPython

sample.p + TAB                        # in IPython or Jupyter: show list of available properties
sample.p.my_property_ + TAB           # in IPython or Jupyter: show datatype or controlled vocabulary
sample.p['my-weird.property-name']    # accessing properties containing a dash or a dot

sample.set_props({ ... })             # set properties by providing a dict
sample.p                              # same thing as .props
sample.p.my_property = "some value"   # set the value of a property
                                      # value is checked (type/vocabulary)
sample.save()                         # update the sample in openBIS

search for samples / objects

The result of a search is always a list, even when no items are found. The .df attribute returns the Pandas DataFrame of the results.

samples = o.get_samples(
    space      ='MY_SPACE',
    type       ='YEAST',
    tags       =['*'],                # only samples with existing tags
    start_with = 0,                   # start_with and count
    count      = 10,                  # enable paging
    NAME       = 'some name',         # properties are always uppercase 
                                      # to distinguish them from attributes
    **{ "SOME.WEIRD:PROP": "value"},  # property name contains a dot or a
                                      # colon: cannot be passed as an argument
    registrationDate = "2020-01-01",  # date format: YYYY-MM-DD
    modificationDate = "<2020-12-31", # use > or < to search for specified date and later / earlier
    attrs=[                           # show these attributes in the dataFrame
        'sample.code',
        'registrator.email',
        'type.generatedCodePrefix'
    ],
    props=['$NAME', 'MATING_TYPE']    # show these properties in the result
)

sample = samples[9]                   # get the 10th sample
                                      # of the search results
sample = samples['/SPACE/AABC']       # same, fetched by identifier
for sample in samples:                # iterate over the
   print(sample.code)                 # search results


samples.df                            # returns a Pandas DataFrame object

samples = o.get_samples(props="*")    # retrieve all properties of all samples
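
The start_with and count arguments return one page of results per call. To walk through all pages, the calls can be wrapped in a generator. The sketch below assumes that start_with is a zero-based offset and that a page shorter than count marks the end of the results; iter_pages and fetch_page are hypothetical names.

```python
def iter_pages(fetch_page, page_size=10):
    """Yield items page by page until a page comes back short or empty.

    fetch_page(start_with, count) must return an iterable, for example:
    lambda start_with, count: o.get_samples(
        space='MY_SPACE', start_with=start_with, count=count)
    """
    start = 0
    while True:
        page = list(fetch_page(start, page_size))
        yield from page
        if len(page) < page_size:   # last (possibly empty) page reached
            break
        start += page_size
```

Because it is a generator, only one page is held in memory at a time, and iteration can be stopped early without fetching the remaining pages.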

freezing samples

sample.freeze = True
sample.freezeForComponents = True
sample.freezeForChildren = True
sample.freezeForParents = True
sample.freezeForDataSets = True

Experiments / Collections

The new name for experiment is collection. You can use both names interchangeably:

get_experiment() = get_collection()
new_experiment() = new_collection()
get_experiments() = get_collections()

exp = o.new_experiment(
    type='DEFAULT_EXPERIMENT',
    space='MY_SPACE',
    project='YEASTS'
)
exp.save()

experiments = o.get_experiments(
    project       = 'YEASTS',
    space         = 'MY_SPACE', 
    type          = 'DEFAULT_EXPERIMENT',
    tags          = '*', 
    finished_flag = False,
    props         = ['name', 'finished_flag']
)
experiments = project.get_experiments()
experiment = experiments[0]        # get first experiment of result list
for experiment in experiments:     # iterate over search results
    print(experiment.props.all())
dataframe = experiments.df         # get Pandas DataFrame of result list
    
exp = o.get_experiment('/MY_SPACE/MY_PROJECT/MY_EXPERIMENT')

exp.set_props({ key: value})
exp.props
exp.p                              # same thing as .props
exp.p.finished_flag=True
exp.p.my_property = "some value"   # set the value of a property (value is checked)
exp.p + TAB                        # in IPython or Jupyter: show list of available properties
exp.p.my_property_ + TAB           # in IPython or Jupyter: show datatype or controlled vocabulary
exp.p['my-weird.property-name']    # accessing properties containing a dash or a dot

exp.attrs.all()                    # returns all attributes as a dict
exp.props.all()                    # returns all properties as a dict

exp.attrs.tags = ['some', 'tags']
exp.tags = ['some', 'tags']        # same thing
exp.save()

exp.code
exp.description
exp.registrator
exp.registrationDate
exp.modifier
exp.modificationDate

exp.freeze = True
exp.freezeForDataSets = True
exp.freezeForSamples = True

Datasets

working with existing dataSets

# search for datasets, see more search examples below
datasets = sample.get_datasets(type='SCANS', start_with=0, count=10)

for dataset in datasets:
    print(dataset.props.all())
    print(dataset.file_list)
    dataset.download()
dataset = datasets[0]

ds = o.get_dataset('20160719143426517-259')
ds.get_parents()
ds.get_children()
ds.sample
ds.experiment
ds.physicalData
ds.status                         # AVAILABLE LOCKED ARCHIVED 
                                  # ARCHIVE_PENDING UNARCHIVE_PENDING
                                  # BACKUP_PENDING
ds.archive()
ds.unarchive()

ds.attrs.all()                    # returns all attributes as a dict
ds.props.all()                    # returns all properties as a dict

ds.add_attachment()               # attachments usually contain meta-data
ds.get_attachments()              # about the dataSet, not the data itself.
ds.download_attachments()

download dataSets

o.download_prefix                  # used for download() and symlink() method.
                                   # Is set to data/hostname by default, but can be changed.
ds.get_files(start_folder="/")     # get file list as Pandas dataFrame
ds.file_list                       # get file list as array

ds.download()                      # simply download all files to data/hostname/permId/
ds.download(
    destination = 'my_data',        # download files to folder my_data/
    create_default_folders = False, # ignore the /original/DEFAULT folders made by openBIS
    wait_until_finished = False,    # download in background, continue immediately
    workers = 10                    # 10 parallel downloads (default)
)
ds.is_physical()                   # TRUE if dataset has been physically downloaded

link dataSets

Instead of downloading a dataSet, you can create a symbolic link to a dataSet in the openBIS dataStore. To do that, the openBIS dataStore needs to be mounted first (see mount method above). Note: Symbolic links and the mount() feature currently do not work with Windows.

o.download_prefix                  # used for download() and symlink() method.
                                   # Is set to data/hostname by default, but can be changed.
ds.symlink()                       # creates a symlink for this dataset: data/hostname/permId
                                   # tries to mount openBIS instance 
                                   # in case it is not mounted yet
ds.symlink(
   target_dir = 'data/dataset_1/', # default target_dir is: data/hostname/permId
   replace_if_symlink_exists=True
)
ds.is_symlink()

dataSet attributes and properties

ds.set_props({ key: value})
ds.props
ds.p                              # same thing as .props
ds.p.my_property = "some value"   # set the value of a property
ds.p + TAB                        # show list of available properties
ds.p.my_property_ + TAB           # show datatype or controlled vocabulary
ds.p['my-weird.property-name']    # accessing properties containing a dash or a dot

ds.attrs.all()                    # returns all attributes as a dict
ds.props.all()                    # returns all properties as a dict

search for dataSets

The result of a search is always a list, even when no items are found.
The .df attribute returns the Pandas DataFrame of the results.
Properties must be in UPPERCASE to distinguish them from attributes.

datasets = o.get_datasets(
    type  ='MY_DATASET_TYPE',
    NAME  = 'some name',              # properties are always uppercase 
                                      # to distinguish them from attributes
    **{ "SOME.WEIRD:PROP": "value"},  # property name contains a dot or a
                                      # colon: cannot be passed as an argument 
    start_with = 0,                   # start_with and count
    count      = 10,                  # enable paging
    registrationDate = "2020-01-01",  # date format: YYYY-MM-DD
    modificationDate = "<2020-12-31", # use > or < to search for specified date and later / earlier
    attrs=[                           # show these attributes in the dataFrame
        'sample.code',
        'registrator.email',
        'type.generatedCodePrefix'
    ],
    props=['$NAME', 'MATING_TYPE']    # show these properties in the result
)
datasets = o.get_datasets(props="*")  # retrieve all properties of all dataSets
dataset = datasets[0]                 # get the first dataset in the search result
for dataset in datasets:              # iterate over the datasets
    ...
df = datasets.df                      # returns a Pandas dataFrame object of the search results
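
Since property names must be uppercase and may contain characters that are not valid in Python keyword arguments, it can be convenient to build the search filters as a dictionary first. The following is a sketch; search_kwargs is a hypothetical helper, not part of pyBIS.

```python
def search_kwargs(properties=None, **attributes):
    """Merge attribute filters with property filters (property names uppercased)."""
    kwargs = dict(attributes)
    for name, value in (properties or {}).items():
        key = name.upper()          # properties are always uppercase
        if key in kwargs:
            raise ValueError(f"duplicate filter: {key}")
        kwargs[key] = value
    return kwargs


filters = search_kwargs(
    properties={"name": "some name", "some.weird:prop": "value"},
    type="MY_DATASET_TYPE",
    registrationDate=">2020-01-01",
)
print(filters)
```

The resulting dictionary can be unpacked into a search call, for example o.get_datasets(**filters).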

In some cases, you might want to retrieve a precise subset of datasets. This can be achieved by method chaining (but be aware that it might not perform well):

datasets = o.get_experiments(project='YEASTS')\
           .get_samples(type='FLY')\
           .get_datasets(
               type='ANALYZED_DATA',
               props=['MY_PROPERTY'],
               MY_PROPERTY='some analyzed data'
           )

another example:

datasets = o.get_experiment('/MY_NEW_SPACE/MY_PROJECT/MY_EXPERIMENT4')\
           .get_samples(type='UNKNOWN')\
           .get_parents()\
           .get_datasets(type='RAW_DATA')

freeze dataSets

Once a dataSet has been frozen, it can no longer be changed by anyone, so be careful!

ds.freeze = True
ds.freezeForChildren = True
ds.freezeForParents = True
ds.freezeForComponents = True
ds.freezeForContainers = True
ds.save()

create a new dataSet

ds_new = o.new_dataset(
    type       = 'ANALYZED_DATA', 
    experiment = '/SPACE/PROJECT/EXP1', 
    sample     = '/SPACE/SAMP1',
    files      = ['my_analyzed_data.dat'], 
    props      = {'name': 'some good name', 'description': '...' }
)
ds_new.save()

create dataSet with zipfile

DataSet containing one zipfile which will be unzipped in openBIS:

ds_new = o.new_dataset(
    type       = 'RAW_DATA', 
    sample     = '/SPACE/SAMP1',
    zipfile    = 'my_zipped_folder.zip', 
)
ds_new.save()

create dataSet with mixed content

Mixed content means that both folders and files are provided.
A folder given by a relative path (and all its content) will end up in the root, while keeping its structure:
- ../measurements/ --> /measurements/
- some/folder/somewhere/ --> /somewhere/
Files given by a relative path will also end up in the root:
- my_file.txt --> /my_file.txt
- ../somwhere/else/my_other_file.txt --> /my_other_file.txt
- some/folder/file.txt --> /file.txt
This is useful if the DataSet contains files and folders.
The content of a folder will be zipped (on-the-fly) and uploaded to openBIS.
openBIS will keep the folder structure intact.
A relative path will be shortened to its basename. For example:

| local | openBIS |
|---|---|
| `../../myData/` | `myData/` |
| `some/experiment/results/` | `results/` |

ds_new = o.new_dataset(
    type       = 'RAW_DATA', 
    sample     = '/SPACE/SAMP1',
    files     = ['../measurements/', 'my_analyis.ipynb', 'results/'] 
)
ds_new.save()
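
The path rules for mixed content described above can be reproduced locally to predict where a file or folder will end up. The sketch below follows the examples given in this section; dataset_path is a hypothetical helper, and it assumes that folders are written with a trailing slash.

```python
import posixpath


def dataset_path(local_path):
    """Return the path inside the dataSet for a relative local path."""
    # assumption: a trailing slash marks a folder, anything else is a file
    is_folder = local_path.endswith("/")
    name = posixpath.basename(local_path.rstrip("/"))
    return "/" + name + ("/" if is_folder else "")


for path in ["../measurements/", "my_analyis.ipynb", "results/"]:
    print(path, "-->", dataset_path(path))
```

Two local paths that share the same basename would map to the same location, so it is worth checking for such collisions before uploading.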

create dataSet container

A DataSet of kind=CONTAINER contains other DataSets, but no files:

ds_new = o.new_dataset(
    type       = 'ANALYZED_DATA', 
    experiment = '/SPACE/PROJECT/EXP1', 
    sample     = '/SPACE/SAMP1',
    kind       = 'CONTAINER',
    props      = {'name': 'some good name', 'description': '...' }
)
ds_new.save()

get, set, add and remove parent datasets

dataset.get_parents()
dataset.set_parents(['20170115220259155-412'])
dataset.add_parents(['20170115220259155-412'])
dataset.del_parents(['20170115220259155-412'])

get, set, add and remove child datasets

dataset.get_children()
dataset.set_children(['20170115220259155-412'])
dataset.add_children(['20170115220259155-412'])
dataset.del_children(['20170115220259155-412'])

dataSet containers

A DataSet may belong to other DataSets, which must be of kind=CONTAINER.
As opposed to Samples, a DataSet may be contained in more than one DataSet container.

dataset.get_containers()
dataset.set_containers(['20170115220259155-412'])
dataset.add_containers(['20170115220259155-412'])
dataset.del_containers(['20170115220259155-412'])

A DataSet of kind=CONTAINER may contain other DataSets, to act like a folder (see above).
The DataSet objects inside that DataSet are called components or contained DataSets.
You may also use the xxx_contained() functions, which are just aliases.

dataset.get_components()
dataset.set_components(['20170115220259155-412'])
dataset.add_components(['20170115220259155-412'])
dataset.del_components(['20170115220259155-412'])

Semantic Annotations

Create semantic annotation for sample type 'UNKNOWN':


sa = o.new_semantic_annotation(
    entityType = 'UNKNOWN',
    predicateOntologyId = 'po_id',
    predicateOntologyVersion = 'po_version',
    predicateAccessionId = 'pa_id',
    descriptorOntologyId = 'do_id',
    descriptorOntologyVersion = 'do_version',
    descriptorAccessionId = 'da_id'
)
sa.save()

Create semantic annotation for property type (predicate and descriptor values omitted for brevity)

sa = o.new_semantic_annotation(propertyType = 'DESCRIPTION', ...)
sa.save()

Create semantic annotation for sample property assignment (predicate and descriptor values omitted for brevity)

sa = o.new_semantic_annotation(
    entityType = 'UNKNOWN',
    propertyType = 'DESCRIPTION',
    ...
)
sa.save()

Create a semantic annotation directly from a sample type. Will also create sample property assignment annotations when propertyType is given:

st = o.get_sample_type("ORDER")
st.new_semantic_annotation(...)

Get all semantic annotations

o.get_semantic_annotations()

Get semantic annotation by perm id

sa = o.get_semantic_annotation("20171015135637955-30")

Update semantic annotation

sa.predicateOntologyId = 'new_po_id'
sa.descriptorOntologyId = 'new_do_id'
sa.save()

Delete semantic annotation

sa.delete('reason')

Vocabulary and VocabularyTerms

An entity such as Sample (Object), Experiment (Collection), Material or DataSet can be of a specific entity type:

Sample Type (Object Type)
Experiment Type (Collection Type)
DataSet Type
Material Type

Every type defines which Properties its entities may have. Properties act like Attributes, but they are type-specific. Properties can contain all sorts of information, such as free text, XML, Hyperlink, Boolean and also Controlled Vocabulary. Such a Controlled Vocabulary consists of VocabularyTerms, which restrict the values that may be entered in a Property field.

For example, suppose you want to add a property called Animal to a Sample and control which terms may be entered in this Property field. This takes a few steps:

create a new vocabulary AnimalVocabulary
add terms to that vocabulary: Cat, Dog, Mouse
create a new PropertyType (e.g. Animal) of DataType CONTROLLEDVOCABULARY and assign the AnimalVocabulary to it
create a new SampleType (e.g. Pet) and assign the created PropertyType to that Sample type

If you now create a new Sample of type Pet, you will be able to add a property Animal to it which only accepts the terms Cat, Dog or Mouse.
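
The four steps above could be sketched with pyBIS roughly as follows. This is a minimal sketch, not a verified recipe: it assumes o is a logged-in Openbis object, the codes ANIMAL_VOCABULARY, ANIMAL and PET are made up for illustration, and the parameters passed to new_property_type(), new_sample_type() and assign_property() follow the masterdata sections of this README and should be checked against your pyBIS version. The function create_pet_sample_type() itself is hypothetical.

```python
def create_pet_sample_type(o):
    """Sketch of the four steps; `o` is assumed to be a logged-in Openbis object."""
    # Steps 1 and 2: create the vocabulary together with its terms
    voc = o.new_vocabulary(
        code='ANIMAL_VOCABULARY',
        description='animals that may be kept as pets',
        terms=[
            {"code": 'CAT', "label": 'Cat', "description": 'a cat'},
            {"code": 'DOG', "label": 'Dog', "description": 'a dog'},
            {"code": 'MOUSE', "label": 'Mouse', "description": 'a mouse'},
        ],
    )
    voc.save()

    # Step 3: a property type whose values are restricted to that vocabulary
    pt = o.new_property_type(
        code='ANIMAL',
        label='Animal',
        description='kind of animal',
        dataType='CONTROLLEDVOCABULARY',
        vocabulary='ANIMAL_VOCABULARY',
    )
    pt.save()

    # Step 4: a sample type that carries the new property
    st = o.new_sample_type(code='PET', generatedCodePrefix='PET')
    st.save()
    st.assign_property(prop='ANIMAL')
    return st
```

Calling create_pet_sample_type(o) once against a test instance should be enough; running it a second time would presumably fail because the codes already exist.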

create new Vocabulary with three VocabularyTerms

voc = o.new_vocabulary(
    code = 'BBB',
    description = 'description of vocabulary BBB',
    urlTemplate = 'https://ethz.ch',
    terms = [
        { "code": 'term_code1', "label": "term_label1", "description": "term_description1"},
        { "code": 'term_code2', "label": "term_label2", "description": "term_description2"},
        { "code": 'term_code3', "label": "term_label3", "description": "term_description3"}
    ]   
)
voc.save()

create additional VocabularyTerms

term = o.new_term(
	code='TERM_CODE_XXX', 
	vocabularyCode='BBB', 
	label='here comes a label',
	description='here might appear a meaningful description'
)
term.save()

update VocabularyTerms

To change the ordinal of a term, it has to be moved either to the top with the .move_to_top() method or after another term using the .move_after_term('TERM_BEFORE') method.

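voc = o.get_vocabulary('STORAGE')
term = voc.get_terms()['RT']
term.label = "Room Temperature"
term.official = True
term.move_to_top()            # either: make this the first term
term.move_after_term('-40')   # or: place it right after the term '-40'
term.save()                   # store the changes
term.delete()                 # only if the term should be removed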
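
To make the effect of the two move methods on the term order concrete, here is a plain-Python model of the reordering. It is only an illustration of the intended ordering, not pyBIS code: the functions move_to_top() and move_after() below are stand-ins operating on a list of term codes, and the codes '-80' and '4' are made up.

```python
def move_to_top(codes, code):
    """Return a new list with `code` moved to the first position."""
    rest = [c for c in codes if c != code]
    return [code] + rest


def move_after(codes, code, term_before):
    """Return a new list with `code` placed directly after `term_before`."""
    rest = [c for c in codes if c != code]
    i = rest.index(term_before)
    return rest[:i + 1] + [code] + rest[i + 1:]


terms = ['-80', '-40', '4', 'RT']
print(move_to_top(terms, 'RT'))        # ['RT', '-80', '-40', '4']
print(move_after(terms, 'RT', '-40'))  # ['-80', '-40', 'RT', '4']
```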

Download files

Source Distribution

PyBIS-1.17.1.tar.gz (112.8 kB), uploaded Jan 11, 2021

Hashes for PyBIS-1.17.1.tar.gz
Algorithm	Hash digest
SHA256	`f3a9546dd4b96c48f503c704fd5445d567a942d9188935dea1d0e63ba9a50e52`
MD5	`635e787d8ee912a047fe481f1e9b194b`
BLAKE2b-256	`4860a4ebbbe65fa1416fa43dca4aaf97412af710774eb00f80beccda62f1e89c`
