Machine Learning dataset splitting for life sciences.
Project description
Splito - Dataset splitting for life sciences
Splito is a machine learning dataset splitting library for life sciences.
Installation
You can install splito
using pip:
pip install splito
You can use conda/mamba. Ask @maclandrol for credentials to the conda forge or for a token
mamba install -c conda-forge splito
Documentation
Find the documentation at https://splito-docs.datamol.io/.
Development lifecycle
Setup dev environment
micromamba create -n splito -f env.yml
micromamba activate splito
pip install --no-deps -e .
Tests
You can run tests locally with:
pytest
Code style
We use ruff
as a linter and formatter.
ruff check
ruff format
Documentation
You can build and run documentation server with:
mkdocs serve
License
Under the Apache-2.0 license. See LICENSE.
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
splito-0.1.4.tar.gz
(1.0 MB
view hashes)
Built Distribution
splito-0.1.4-py3-none-any.whl
(39.5 kB
view hashes)