9 data sets for semi-supervised learning
Project description
sslbookdata
This python module provides functions to load the 9 data sets published in the book "Semi-Supervised Learning".
They are converted from the matlab files as found on Olivier Chapelle's web page http://olivier.chapelle.cc/ssl-book/benchmarks.html
Detailed description of the data
Each data set comes with a 10 or 12 different splits, and users can choose the number of labeled points their training gets to see.
Labels are provided for all points (for benchmarking), but the benchmarks suggest to use a fixed number of labels (10 or 100 for most sets).
Full details about the benchmarks are provided in chapter 23 of the book (online here: http://olivier.chapelle.cc/ssl-book/benchmarks.pdf)
This code
-
This code (c) by Oliver Obst o.obst@westernsydney.edu.au has been released under MIT License (see the LICENSE file).
-
If you use these data sets in your research, you can cite the SSL book:
@Book{ChaSchZie06,
editor = {O. Chapelle and B. Sch{\"o}lkopf and A. Zien},
title = {Semi-Supervised Learning},
publisher = {MIT Press},
year = 2006,
url = {http://olivier.chapelle.cc/ssl-book/index.html},
address = {Cambridge, MA}
}
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for sslbookdata-0.1-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | eab2604b6331d822b44bd9ce51772605e2ead71ca41bcfe5e95cc5d77e4de7bb |
|
MD5 | 12a74a1a34c9ca5801b0b724d85104af |
|
BLAKE2b-256 | bdd392dd2c8036301e472e24c62866876ee022e5d165b47d1252f6d56af857eb |