Scikit-longitudinal, an open-source Python lib for longitudinal data analysis, builds on Scikit-learn's foundation. It offers specialized tools to tackle challenges of repeated measures data, ideal for researchers, data scientists, & analysts.
Project description
Scikit-longitudinal
A specialised Python library for longitudinal data analysis built on Scikit-learn
⚙️ Project Status |
☎️ Contacts |
|||||
|
|
🌟 Exciting Update: We're delighted to introduce the brand new v0.1 documentation for Scikit-longitudinal! For a deep dive into the library's capabilities and features, please visit here.
💡 About The Project
Scikit-longitudinal
is a machine learning library designed to analyse
longitudinal data (Classification tasks focussed as of today). It offers tools and models for processing, analysing,
and predicting longitudinal data, with a user-friendly interface that
integrates with the Scikit-learn
ecosystem.
Please for further information, visit the official documentation.
🛠️ Installation
ON-HOLD until the first public release
Note that for developers, you should follow up onto the Contributing
tab
of the official documentation.
🚀 Getting Started
To perform longitudinal analysis with Scikit-Longitudinal
, use the
LongitudinalDataset
class to prepare the dataset. To analyse your
data, use the LexicoGradientBoostingClassifier
(i.e. Gradient Boosting variant for Longitudinal Data) or another
available
estimator/preprocessor.
Following that, you can apply the popular fit, predict, prodict_proba, or transform
methods in the same way that Scikit-learn
does, as shown in the example below.
from scikit_longitudinal.data_preparation import LongitudinalDataset
from scikit_longitudinal.estimators.ensemble.lexicographical.lexico_gradient_boosting import LexicoGradientBoostingClassifier
dataset = LongitudinalDataset('./stroke_4_years.csv')
dataset.load_data_target_train_test_split(
target_column="class_stroke_wave_4",
)
# Pre-set or manually set your temporal dependencies
dataset.setup_features_group(input_data="Elsa")
model = LexicoGradientBoostingClassifier(
features_group=dataset.feature_groups(),
threshold_gain=0.00015
)
model.fit(dataset.X_train, dataset.y_train)
y_pred = model.predict(dataset.X_test)
📝 How to Cite?
Paper's citation information will be added here once published. Currently, it has been submitted to a conference. In the meantime, for the repository, utilise the button top right corner of the repository "How to cite?". Or open the following citation file: CITATION.cff.
🔐 License
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for scikit_longitudinal-0.0.2.tar.gz
Algorithm | Hash digest | |
---|---|---|
SHA256 | 92f75d1fbca965d23c4e0bc8a397903c5f35ef214cf9f13156a16651ca20734e |
|
MD5 | 00da4dca8a0756fbd05aa205771da031 |
|
BLAKE2b-256 | 5d5b2d0da74e60978fda3235eb6e80bb8d8d19c6090a6104306e2b586e8cab13 |
Hashes for scikit_longitudinal-0.0.2-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 955d481cce704a2a1be3b0ae45b59e86d9800748e376b21e8444d21e7b972c0b |
|
MD5 | a3ef6758681007d4e63664c45be28959 |
|
BLAKE2b-256 | f2f55be0d005102b529279fe7f2c523b5ac4b8ae36e0330c41da015cd929890b |