Unsupervised task detection in lifelong learning systems using autoencoders for task similarity detection and statistical analysis.

These details have not been verified by PyPI

Project description

LLL Task Manager

This is a Python package that implements the Task Manager for the Lifelong Learning Loop (LLL).
The Task Manager is designed to handle task detection and management in lifelong learning systems, utilizing autoencoders for reconstruction-based task similarity detection.

The code is derived from the paper:
Omid Gheibi and Danny Weyns. Lifelong self-adaptation: self-adaptation meets lifelong machine learning.
In Proceedings of the 17th Symposium on Software Engineering for Adaptive and Self-Managing Systems (SEAMS '22).
https://doi.org/10.1145/3524844.3528052

This is an efficient and packaged version of the task manager code that is available here:
GitHub: dimoibiehg / lifelong_self-adaptation

Contributions by Ferdinand Koenig, 2025:

Adjusted data structures, algorithm choice, and adjusted Holm’s correction for improved efficiency and correctness.
Added functionality for task deletion.
Developed an interface for adding tasks triggered externally.
Implemented a workaround for a bug related to tuner directory removal on Windows OS and OneDrive environments.
Packaged the code into a Python module for easier distribution and usage.
Enhanced the documentation for better usability and clarity.

Features

Task Detection: Uses reconstruction errors to detect whether a new task is introduced or an existing task can be assigned.
Autoencoder-based Learning: Tasks are managed with autoencoders that detect similarities and update their reconstruction errors accordingly.
Efficient Data Structures: Data structures are optimized for task management efficiency.
Holm’s Correction: Ensures statistically robust task similarity detection using Holm’s method.
Extensible Interface: The package provides functions to interface with external data and manage tasks.

Installation

You can install the package from the Python Package Index via

pip install lll_taskmanager

Key Methods

detect(X): Detects whether the given data X introduces a new task or can be assigned to an existing task.
- Returns: task_is_new, task_id, pvalues
- task_is_new: Boolean flag indicating whether the task is new.
- task_id: ID of the task.
- pvalues: List of p-values for statistical tests.
add_new_task(X): Manually adds a new task by creating a new autoencoder and training it on X.
delete_task(task_id): Deletes a task based on its task_id.

Example and Discussion

import numpy as np
from lll_taskmanager import TaskManager

task_manager = TaskManager()

# Test data
# Normal Distribution with mean = 0,  std = 1
X0_1 = np.random.normal(loc=0, scale=1, size=(100, 10))
X0_2 = np.random.normal(loc=0, scale=1, size=(100, 10))
X0_3 = np.random.normal(loc=0, scale=1, size=(70, 10))
# Normal Distribution with mean = 5,  std = 1
X1_1 = np.random.normal(loc=5, scale=1, size=(100, 10))
X1_2 = np.random.normal(loc=5, scale=1, size=(100, 10))
# Uniform Distribution
X2_1 = np.random.uniform(low=-5, high=5, size=(100, 10))
X2_2 = np.random.uniform(low=-5, high=5, size=(80, 10))

print(task_manager.detect(X0_1))
# (True, 0, {0: 1.0})
# New task with id 0
print(task_manager.detect(X0_1))
# (False, 0, {0: 0.968813007996528})
# Task 0 with significance of ~97%
print(task_manager.detect(X0_2))
# (True, 1, {0: 1.4362702261304395e-08, 1: 1.0})
# False Positive of new task.
print(task_manager.detect(X0_3))
# (True, 2, {0: 1.4917703144233305e-05, 1: 1.312398816087575e-05, 2: 1.0})
# False Positive. This is due to the different sample size.
# Under the hood, a Mann-Whitney U test is being used that depends on same sample sizes.
# Make sure that your batches have the same size for accurate results

resultX1_1 = task_manager.detect(X1_1)
print(resultX1_1)
# (True, 3, {0: 2.5495805520903834e-20, 1: 5.170804387535538e-31, 2: 4.60666565667063e-27, 3: 1.0})
# Correct assignment to new task (3)
print(task_manager.detect(X1_2))
# (False, 3, {0: 0.0006248683334076798, 1: 3.4916777010682185e-19, 2: 5.33830704127903e-25, 3: 0.008729918349922575})
# Correct assignment

print(task_manager.detect(X2_1))
# (True, 4, {0: 1.1996356111739001e-43, 1: 5.254326993995392e-33, 2: 8.018213024078718e-27, 3: 8.457238847010412e-44, 4: 1.0})
# Correct assignment to new task (4)
print(task_manager.detect(X2_2))
# (False, 4, {0: 2.776922809508148e-37, 1: 2.4074810570805218e-29, 2: 1.6242946841985236e-24, 3: 1.7855840454361994e-37, 4: 0.718388260985277})
# Correct assignment to task 4

task_manager.delete_task(resultX1_1[1])
# Delete task of X1_1: Expected Behavior: Do not detect it anymore. When task reoccurs, assign new ID
print(task_manager.detect(X1_2))
# (True, 5, {0: 1.6014167268343994e-40, 1: 7.64579055060309e-33, 2: 3.553684842364269e-27, 4: 8.688713292514744e-20, 5: 1.0})
# Correctly assigned to new task, as old task (3) was deleted.
print(task_manager.detect(X1_1))
# (True, 6, {0: 1.3943531255201782e-40, 1: 5.254326993995392e-33, 2: 3.553684842364269e-27, 4: 5.743734298732715e-24, 5: 0.00019656616032667692, 6: 1.0})
# False Positive. Should have been assigned to task 5.

The use of the Mann-Whitney U test to compare reconstruction errors between training data and inference data appears to be a novel approach introduced by Gheibi and Weyns, and it has not been applied previously. Consequently, extensive testing and evaluation are necessary to assess its effectiveness.

In some cases, the Autoencoders may not be able to accurately capture the underlying distribution, as observed in the first and last False Positive cases. To address this limitation, future work could explore the use of Variational Autoencoders (VAEs) to better approximate the probability density functions and improve task detection accuracy.

Supported Versions and Compatibility

Currently, this package supports Python versions 3.9 and 3.10 due to TensorFlow's compatibility constraints.
Therefore, the dependencies are pinned to TensorFlow 2.11.

If you'd like to use this package with other versions of Python or TensorFlow, you're welcome to contribute by forking the repository and submitting a pull request with the updated versions in pytoml.

Tested and Confirmed Platforms:

Ubuntu: Python 3.9 and 3.10
Windows: Python 3.9 and 3.10
macOS: Python 3.9

License

This project is licensed under the CC BY 4.0 license, which allows you to share and adapt the material, as long as you provide appropriate credit.

CC BY 4.0 License

References

Omid Gheibi and Danny Weyns. Lifelong self-adaptation: self-adaptation meets lifelong machine learning.
In Proceedings of the 17th Symposium on Software Engineering for Adaptive and Self-Managing Systems (SEAMS '22).
doi.org/10.1145/3524844.3528052
GitHub: dimoibiehg / lifelong_self-adaptation

Contact

For any questions or feedback, feel free to contact me:

Ferdinand Koenig
Email: ferdinand (-at-) koenix.de

Citing

Please give appropriate credit when using this code by citing both the reference paper and this Python package:

@software{omid_lll_taskmanager_2025,
    author = {Gheibi, Omid and Weyns, Danny and Koenig, Ferdinand},
    title = {LLL TaskManager: A Python Package for Lifelong Unsupervised Task Management in Machine Learning},
    month = jan,
    year = 2025,
    url = {https://github.com/ferdinand-koenig/llltaskmanager},
    license = {CC BY 4.0}
}

@inproceedings{10.1145/3524844.3528052,
    author = {Gheibi, Omid and Weyns, Danny},
    title = {Lifelong self-adaptation: self-adaptation meets lifelong machine learning},
    year = {2022},
    isbn = {9781450393058},
    publisher = {Association for Computing Machinery},
    address = {New York, NY, USA},
    url = {https://doi.org/10.1145/3524844.3528052},
    doi = {10.1145/3524844.3528052},
    abstract = {In the past years, machine learning (ML) has become a popular approach to support self-adaptation. While ML techniques enable dealing with several problems in self-adaptation, such as scalable decision-making, they are also subject to inherent challenges. In this paper, we focus on one such challenge that is particularly important for self-adaptation: ML techniques are designed to deal with a set of predefined tasks associated with an operational domain; they have problems to deal with new emerging tasks, such as concept shift in input data that is used for learning. To tackle this challenge, we present lifelong self-adaptation: a novel approach to self-adaptation that enhances self-adaptive systems that use ML techniques with a lifelong ML layer. The lifelong ML layer tracks the running system and its environment, associates this knowledge with the current tasks, identifies new tasks based on differentiations, and updates the learning models of the self-adaptive system accordingly. We present a reusable architecture for lifelong self-adaptation and apply it to the case of concept drift caused by unforeseen changes of the input data of a learning model that is used for decision-making in self-adaptation. We validate lifelong self-adaptation for two types of concept drift using two cases.},
    booktitle = {Proceedings of the 17th Symposium on Software Engineering for Adaptive and Self-Managing Systems},
    pages = {1–12},
    numpages = {12},
    location = {Pittsburgh, Pennsylvania},
    series = {SEAMS '22}
}

Project details

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.1.5rc6 pre-release

Mar 23, 2025

0.1.5rc5 pre-release

Mar 23, 2025

0.1.5rc4 pre-release

Mar 23, 2025

0.1.5rc3 pre-release

Mar 23, 2025

0.1.5rc2 pre-release

Mar 23, 2025

0.1.5rc1 pre-release

Mar 23, 2025

0.1.4

Mar 23, 2025

This version

0.1.3

Mar 23, 2025

0.1.2

Jan 27, 2025

0.1.1

Jan 26, 2025

0.1.1rc4 pre-release

Jan 26, 2025

0.1.1rc3 pre-release

Jan 26, 2025

0.1.1rc2 pre-release

Jan 26, 2025

0.1.1rc1 pre-release

Jan 26, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

lll_taskmanager-0.1.3.tar.gz (14.7 kB view details)

Uploaded Mar 23, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

lll_taskmanager-0.1.3-py3-none-any.whl (16.9 kB view details)

Uploaded Mar 23, 2025 Python 3

File details

Details for the file lll_taskmanager-0.1.3.tar.gz.

File metadata

Download URL: lll_taskmanager-0.1.3.tar.gz
Upload date: Mar 23, 2025
Size: 14.7 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for lll_taskmanager-0.1.3.tar.gz
Algorithm	Hash digest
SHA256	`a9f80af05a288d719314ee12df73f9eade9554158816320ab311d712abea63cd`
MD5	`1bf765cd63b57576adf152b18bb20700`
BLAKE2b-256	`3a991009a7b1d9939d68c53f55b9251d43e9dc8c47dc9cfc6bcb65c0c5d956be`

See more details on using hashes here.

Provenance

The following attestation bundles were made for lll_taskmanager-0.1.3.tar.gz:

Publisher: python-package-release.yml on ferdinand-koenig/llltaskmanager

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: lll_taskmanager-0.1.3.tar.gz
- Subject digest: a9f80af05a288d719314ee12df73f9eade9554158816320ab311d712abea63cd
- Sigstore transparency entry: 186888601
- Sigstore integration time: Mar 23, 2025
Source repository:
- Permalink: ferdinand-koenig/llltaskmanager@a5b7f26a8189916f8e9a7c22b52096e1e8fa4d4e
- Branch / Tag: refs/heads/release
- Owner: https://github.com/ferdinand-koenig
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: python-package-release.yml@a5b7f26a8189916f8e9a7c22b52096e1e8fa4d4e
- Trigger Event: push

File details

Details for the file lll_taskmanager-0.1.3-py3-none-any.whl.

File metadata

Download URL: lll_taskmanager-0.1.3-py3-none-any.whl
Upload date: Mar 23, 2025
Size: 16.9 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for lll_taskmanager-0.1.3-py3-none-any.whl
Algorithm	Hash digest
SHA256	`729df019883c37fac1d2b9ec62767d285d240184a2d51e13af9c3e2adc721d2e`
MD5	`1f8b2cea9dbfef1201c46788233f88f8`
BLAKE2b-256	`78664a10f3269d2489fa321c9b2dfa80f7fa689160e80410f34139f70511168b`

See more details on using hashes here.

Provenance

The following attestation bundles were made for lll_taskmanager-0.1.3-py3-none-any.whl:

Publisher: python-package-release.yml on ferdinand-koenig/llltaskmanager

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: lll_taskmanager-0.1.3-py3-none-any.whl
- Subject digest: 729df019883c37fac1d2b9ec62767d285d240184a2d51e13af9c3e2adc721d2e
- Sigstore transparency entry: 186888602
- Sigstore integration time: Mar 23, 2025
Source repository:
- Permalink: ferdinand-koenig/llltaskmanager@a5b7f26a8189916f8e9a7c22b52096e1e8fa4d4e
- Branch / Tag: refs/heads/release
- Owner: https://github.com/ferdinand-koenig
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: python-package-release.yml@a5b7f26a8189916f8e9a7c22b52096e1e8fa4d4e
- Trigger Event: push

lll-taskmanager 0.1.3

Navigation

Verified details

Maintainers

Unverified details

Meta

Classifiers

Project description

LLL Task Manager

Features

Installation

Key Methods

Example and Discussion

Supported Versions and Compatibility

Tested and Confirmed Platforms:

License

References

Links

Contact

Citing

Project details

Verified details

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance