Scikit-longitudinal, an open-source Python lib for longitudinal data analysis, builds on Scikit-learn's foundation. It offers specialized tools to tackle challenges of repeated measures data, ideal for researchers, data scientists, & analysts.

Project description

Scikit-longitudinal

A specialised Python library for longitudinal data analysis built on Scikit-learn

⚙️ Project Status

☎️ Contacts

🌟 Exciting Update: We're delighted to introduce the brand new v0.1 documentation for Scikit-longitudinal! For a deep dive into the library's capabilities and features, please visit here.

💡 About The Project

Scikit-longitudinal is a machine learning library designed to analyse longitudinal data (Classification tasks focussed as of today). It offers tools and models for processing, analysing, and predicting longitudinal data, with a user-friendly interface that integrates with the Scikit-learn ecosystem.

Please for further information, visit the official documentation.

🛠️ Installation

ON-HOLD until the first public release

Note that for developers, you should follow up onto the Contributing tab of the official documentation.

🚀 Getting Started

To perform longitudinal analysis with Scikit-Longitudinal, use the LongitudinalDataset class to prepare the dataset. To analyse your data, use the LexicoGradientBoostingClassifier (i.e. Gradient Boosting variant for Longitudinal Data) or another available estimator/preprocessor.

Following that, you can apply the popular fit, predict, prodict_proba, or transform methods in the same way that Scikit-learn does, as shown in the example below.

from scikit_longitudinal.data_preparation import LongitudinalDataset
from scikit_longitudinal.estimators.ensemble.lexicographical.lexico_gradient_boosting import LexicoGradientBoostingClassifier

dataset = LongitudinalDataset('./stroke_4_years.csv')
dataset.load_data_target_train_test_split(
  target_column="class_stroke_wave_4",
)

# Pre-set or manually set your temporal dependencies 
dataset.setup_features_group(input_data="Elsa")

model = LexicoGradientBoostingClassifier(
  features_group=dataset.feature_groups(),
  threshold_gain=0.00015
)

model.fit(dataset.X_train, dataset.y_train)
y_pred = model.predict(dataset.X_test)

📝 How to Cite?

Paper's citation information will be added here once published. Currently, it has been submitted to a conference. In the meantime, for the repository, utilise the button top right corner of the repository "How to cite?". Or open the following citation file: CITATION.cff.

🔐 License

MIT License

Project details

Release history Release notifications | RSS feed

0.0.6

Aug 1, 2024

0.0.5

Jul 11, 2024

0.0.4

Jul 4, 2024

This version

0.0.2

Jul 2, 2024

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

scikit_longitudinal-0.0.2.tar.gz (5.5 kB view hashes)

Uploaded Jul 2, 2024 Source

Built Distribution

scikit_longitudinal-0.0.2-py3-none-any.whl (4.5 kB view hashes)

Uploaded Jul 2, 2024 Python 3

Hashes for scikit_longitudinal-0.0.2.tar.gz

Hashes for scikit_longitudinal-0.0.2.tar.gz
Algorithm	Hash digest
SHA256	`92f75d1fbca965d23c4e0bc8a397903c5f35ef214cf9f13156a16651ca20734e`
MD5	`00da4dca8a0756fbd05aa205771da031`
BLAKE2b-256	`5d5b2d0da74e60978fda3235eb6e80bb8d8d19c6090a6104306e2b586e8cab13`

Hashes for scikit_longitudinal-0.0.2-py3-none-any.whl

Hashes for scikit_longitudinal-0.0.2-py3-none-any.whl
Algorithm	Hash digest
SHA256	`955d481cce704a2a1be3b0ae45b59e86d9800748e376b21e8444d21e7b972c0b`
MD5	`a3ef6758681007d4e63664c45be28959`
BLAKE2b-256	`f2f55be0d005102b529279fe7f2c523b5ac4b8ae36e0330c41da015cd929890b`