Accompanying library to the AURSAD dataset
AURSAD
A Python library for the AURSAD dataset. A detailed description of the dataset and a download link are available separately.
The library provides several useful utilities for preprocessing the dataset for ML applications:
- Creating numpy training and test datasets from sampled data
- Creating Keras TimeSeries generators for sliding-window data
- Filtering the dataset
- Removing undesired columns as outlined in the paper
- 3 different types of labeling
- Full sample labeling, where loosening and tightening motions are labeled together
- Separate sample labeling, where the loosening motion is given its own label
- 'Tighten' sample labeling, where only the tightening parts of the whole process are labeled as normal/anomalous, and the loosening and movement parts of the motion get their own separate labels
- Subsampling the data
- Dimensionality reduction using PCA or ANOVA F-values
- One-hot label encoding
- Zero padding the samples to equalise their length
- Z-score standardisation
- Taking data only from screwdriver sensors
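As a rough illustration of what three of these preprocessing options amount to (a simplified numpy sketch, not the library's actual implementation):

```python
import numpy as np

def subsample(data, freq=2):
    # Keep every freq-th row, e.g. freq=2 halves the 100 Hz sampling rate
    return data[::freq]

def zscore(data):
    # Z-score standardisation: zero mean and unit variance per column
    return (data - data.mean(axis=0)) / data.std(axis=0)

def zero_pad(samples):
    # Pad variable-length samples with trailing zeros to the longest length
    max_len = max(len(s) for s in samples)
    return np.stack([
        np.pad(s, ((0, max_len - len(s)), (0, 0))) for s in samples
    ])

short = np.ones((3, 4))
long = np.ones((5, 4))
padded = zero_pad([short, long])
print(padded.shape)  # (2, 5, 4)
```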
Dataset
The dataset contains 2045 samples in total. The robot was sampled at a frequency of 100 Hz.
| Type | Label | Samples | % |
|---|---|---|---|
| Normal operation | 0 | 1420 | 69 |
| Damaged screw | 1 | 221 | 11 |
| Extra assembly component | 2 | 183 | 9 |
| Missing screw | 3 | 218 | 11 |
| Damaged thread | 4 | 3 | 0 |
Additionally, there are 2049 supplementary samples describing the loosening/screw picking motion, labeled 5.
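The class distribution above is heavily imbalanced, so inverse-frequency class weights are often useful when training on it. A short sketch using the counts copied from the table (the weighting scheme is an illustration, not part of the library):

```python
# Sample counts per class, copied from the table above
counts = {
    'Normal operation': 1420,
    'Damaged screw': 221,
    'Extra assembly component': 183,
    'Missing screw': 218,
    'Damaged thread': 3,
}
total = sum(counts.values())  # 2045 samples in total
# Inverse-frequency weights: rarer classes get proportionally larger weights
weights = {name: total / (len(counts) * n) for name, n in counts.items()}
print(round(weights['Normal operation'], 2))  # 0.29
```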
Installation
AURSAD has been tested on Windows 10 and Python 3.8.
PIP installation
To install from pip with the required dependencies, use:
pip install aursad
Source installation
To install the latest version from GitHub, clone the source from the project repository and install with setup.py:
git clone https://github.com/CptPirx/robo-package
cd robo-package
python setup.py install --user
Instructions
The package exposes two methods to the user: get_dataset_numpy() and get_dataset_generator().
Sampling
def get_dataset_numpy(path, onehot_labels=True, reduce_dimensionality=False, reduce_method='PCA', n_dimensions=60,
subsample_data=True, subsample_freq=2, train_size=0.7, random_state=42, normal_samples=1,
damaged_samples=1, assembly_samples=1, missing_samples=1, damaged_thread_samples=0,
loosening_samples=1, move_samples=1, drop_extra_columns=True, pad_data=True,
                      label_type='partial', binary_labels=False, standardize=False, screwdriver_only=False):
"""
    Create a numpy dataset from the input h5 file
:param path: path to the data
:param label_type: string,
'full', 'partial' or 'tighten'
:param drop_extra_columns: bool,
drop the extra columns as outlined in the paper
:param missing_samples: float,
percentage of missing samples to take
:param assembly_samples: float,
percentage of extra assembly samples to take
:param damaged_samples: float,
percentage of damaged samples to take
:param normal_samples: float,
percentage of normal samples to take
:param loosening_samples: float,
percentage of loosening samples to take
:param move_samples: float,
percentage of movement samples to take
:param damaged_thread_samples: float,
percentage of damaged thread samples to take
:param random_state: int,
random state for train_test split
:param train_size: float,
percentage of data as training data
:param subsample_freq: int,
the frequency of subsampling
:param subsample_data: bool,
reduce number of events by taking every subsample_freq event
:param reduce_dimensionality: bool,
reduce dimensionality of the dataset
:param reduce_method: string,
dimensionality reduction method to be used
:param n_dimensions: int,
the target number of dimensions
:param onehot_labels: bool,
output onehot encoded labels
:param binary_labels: bool,
if True all anomalies are labeled the same
:param standardize: bool,
if True apply z-score standardisation
:param pad_data: bool,
if True pad data to equal length samples, if False return data in continuous form
:param screwdriver_only: bool,
take only the 4 dimensions from the screwdriver sensors
:return: 4 np arrays,
train and test data & labels
"""
Sample usage:
import aursad
data_path = 'C:/Users/my_path/robot_data.h5'
train_x, train_y, test_x, test_y = aursad.get_dataset_numpy(data_path)
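The train_size, random_state and onehot_labels parameters behave as their names suggest; a minimal numpy sketch of the split and one-hot encoding they imply (illustrative only, not the library's internals):

```python
import numpy as np

def train_test_split(data, labels, train_size=0.7, random_state=42):
    # Shuffle indices reproducibly, then split at the train_size boundary
    rng = np.random.default_rng(random_state)
    idx = rng.permutation(len(data))
    cut = int(round(train_size * len(data)))
    return data[idx[:cut]], labels[idx[:cut]], data[idx[cut:]], labels[idx[cut:]]

def one_hot(labels, n_classes):
    # Map integer labels to one-hot rows, e.g. 2 -> [0, 0, 1, 0]
    return np.eye(n_classes)[labels]

x = np.arange(20).reshape(10, 2)
y = np.array([0, 1, 2, 3, 0, 1, 2, 3, 0, 1])
train_x, train_y, test_x, test_y = train_test_split(x, y)
print(train_x.shape, test_x.shape)  # (7, 2) (3, 2)
```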
Sliding window
def get_dataset_generator(path, window_size=100, reduce_dimensionality=False, reduce_method='PCA', n_dimensions=60,
subsample_data=True, subsample_freq=2, train_size=0.7, random_state=42, normal_samples=1,
damaged_samples=1, assembly_samples=1, missing_samples=1, damaged_thread_samples=0,
loosening_samples=1, drop_loosen=True, drop_movement=False, drop_extra_columns=True,
label_type='partial', batch_size=256, binary_labels=False, standardize=False,
screwdriver_only=False):
"""
Create Keras sliding window generator from input h5 file
:param drop_movement: bool,
        drop the movement samples
:param path: path to the data
:param label_type: string,
'full', 'partial' or 'tighten'
:param drop_extra_columns: bool,
drop the extra columns as outlined in the paper
:param drop_loosen: bool,
drop the loosening columns
:param missing_samples: float,
percentage of missing samples to take
:param assembly_samples: float,
percentage of extra assembly samples to take
:param damaged_samples: float,
percentage of damaged samples to take
:param normal_samples: float,
percentage of normal samples to take
:param loosening_samples: float,
percentage of loosening samples to take
:param damaged_thread_samples: float,
percentage of damaged thread samples to take
:param random_state: int,
random state for train_test split
:param train_size: float,
percentage of data as training data
:param subsample_freq: int,
the frequency of subsampling
:param subsample_data: bool,
reduce number of events by taking every subsample_freq event
:param reduce_dimensionality: bool,
reduce dimensionality of the dataset
:param reduce_method: string,
dimensionality reduction method to be used
:param n_dimensions: int,
the target number of dimensions
:param window_size: int,
size of the sliding window
:param batch_size: int,
batch size for the sliding window generator
:param binary_labels: bool,
if True all anomalies are labeled the same
:param standardize: bool,
if True apply z-score standardisation
:param screwdriver_only: bool,
take only the 4 dimensions from the screwdriver sensors
:return: 4 np arrays,
train and test data & labels
:return: keras TimeSeries generators,
train and test generators
"""
Sample usage:
import aursad
data_path = 'C:/Users/my_path/robot_data.h5'
train_x, train_y, test_x, test_y, train_generator, test_generator = aursad.get_dataset_generator(data_path)
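The generators pair each window of window_size consecutive timesteps with the label of the step that follows it, in the manner of Keras' TimeseriesGenerator. A minimal numpy sketch of that windowing (an illustration, not the library's code):

```python
import numpy as np

def sliding_windows(data, labels, window_size=100):
    # Each window covers steps [i, i + window_size) and is paired with
    # the label at step i + window_size
    n = len(data) - window_size
    windows = np.stack([data[i:i + window_size] for i in range(n)])
    targets = labels[window_size:]
    return windows, targets

data = np.random.default_rng(0).normal(size=(500, 4))
labels = np.zeros(500, dtype=int)
windows, targets = sliding_windows(data, labels, window_size=100)
print(windows.shape, targets.shape)  # (400, 100, 4) (400,)
```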