Data Sets for Machine Learning in Python
Skdata is a library of datasets for empirical computer science. Lots of disciplines such as machine learning, natural language processing, and computer vision have data sets. This module makes the most popular and standard datasets (even the big awkward ones) easy to access from Python programs.
This work was supported in part by the National Science Foundation (IIS-0963668).
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.