Skip to main content

Scikit-longitudinal is an open-source Python library for longitudinal data analysis, building on Scikit-learn's foundation with tools tailored for repeated measures data.

Project description


Scikit-longitudinal
Scikit-longitudinal

A specialised Python library for longitudinal data analysis built on Scikit-learn


💡 About The Project

Scikit-longitudinal (Sklong) is a machine learning library designed to analyse longitudinal data (Classification tasks focussed as of today). It offers tools and models for processing, analysing, and predicting longitudinal data, with a user-friendly interface that integrates with the Scikit-learn ecosystem.

For more details, visit the official documentation.


🛠️ Installation

[!NOTE] Want to be using Jupyter Notebook, Marimo, Google Colab, or JupyterLab? Head to the Getting Started section of the documentation, we explain it all! 🎉

To install Scikit-longitudinal:

  1. ✅ Install the latest version:

    pip install Scikit-longitudinal
    

    To install a specific version:

    pip install Scikit-longitudinal==0.1.0
    

[!CAUTION] Scikit-longitudinal is currently compatible with Python versions 3.9 only. Ensure you have one of these versions installed before proceeding with the installation.

Now, while we understand that this is a limitation, we are tied for the time being because of Deep Forest. Deep Forest is a dependency of Scikit-longitudinal that is not compatible with Python versions greater than 3.9. Deep Forest helps us with the Deep Forest algorithm, to which we have made some modifications to welcome Lexicographical Deep Forest.

To follow up on this discussion, please refer to this github issue.

If you encounter any errors, feel free to explore further the installation section in the Getting Started of the documentation. If it still doesn't work, please open an issue on GitHub.


🚀 Getting Started

Here's how to analyse longitudinal data with Scikit-longitudinal:

from scikit_longitudinal.data_preparation import LongitudinalDataset
from scikit_longitudinal.estimators.ensemble.lexicographical.lexico_gradient_boosting import LexicoGradientBoostingClassifier

dataset = LongitudinalDataset('./stroke.csv') # Note this is a fictional dataset. Use yours!
dataset.load_data_target_train_test_split(
  target_column="class_stroke_wave_4",
)

# Pre-set or manually set your temporal dependencies 
dataset.setup_features_group(input_data="elsa")

model = LexicoGradientBoostingClassifier(
  features_group=dataset.feature_groups(),
  threshold_gain=0.00015 # Refer to the API for more hyper-parameters and their meaning
)

model.fit(dataset.X_train, dataset.y_train)
y_pred = model.predict(dataset.X_test)

# Classification report
print(classification_report(y_test, y_pred))

📝 How to Cite

We are currently cooking a JOSS submission, wait a bit for it! Meanwhile, click on Cite This Repository on the top right corner of this page to get a BibTeX entry.


🔐 License

Scikit-longitudinal is licensed under the MIT License.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

scikit_longitudinal-0.0.10.tar.gz (14.6 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

scikit_longitudinal-0.0.10-py3-none-any.whl (120.6 kB view details)

Uploaded Python 3

File details

Details for the file scikit_longitudinal-0.0.10.tar.gz.

File metadata

File hashes

Hashes for scikit_longitudinal-0.0.10.tar.gz
Algorithm Hash digest
SHA256 84dd920fe5ee6f18be578b169f57e5ca47519d6354c1f00685fa6ed14664ea6b
MD5 23b89f00a31f3c3a593006255ab4b6f4
BLAKE2b-256 2c0a8d5b7fd3fe1f00e3f0da8841848fe7aab7f1e930bb211ace1f5a3be44442

See more details on using hashes here.

File details

Details for the file scikit_longitudinal-0.0.10-py3-none-any.whl.

File metadata

File hashes

Hashes for scikit_longitudinal-0.0.10-py3-none-any.whl
Algorithm Hash digest
SHA256 5f6654cb3b038b10456142a3aa4fa5b368aae956bf33447c59ab11be16367168
MD5 ca185f8159d5aad297bbeb032925a0b4
BLAKE2b-256 6f57d9186830908af02da23816f659660cb0347b292ccdb5373edfbf583866ab

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page