Datasets Summary Viewer. Enables the exploration of dataset search results in Jupyter Notebooks
Project description
DatasetsSummarizer
Datasets Summarizer is compatible with Jupyter Notebooks. Need the x and y values based on any similarity metric to generated the similarity plot between datasets. Supports the metadata format generated by datamart-profiler library to generate the Detail View to explore each dataset.
( Click one dataset from the list of results to open the Detail View.)
Demo
Live demo (Google Colab):
In Jupyter Notebook:
import DatasetsSummarizer
data = DatasetsSummarizer.get_taxi_data()
DatasetsSummarizer.plot_datasets_summary(data)
Install
Option 1: install via pip:
pip install datasets-summarizer
Custom similarity metric
Use a subset or add a new entry (x
and y
values ) based on a different similatiry metric. For example, here we added x
and y
values based on a similarity metric using a modified version of the titles. Note that modif_title_x
and modif_title_y
must be included in the dataframe.
new_similarity_metrics = [{'name': 'Title', 'x': 'title_x', 'y': 'title_y'},
{'name': 'ModifiedTitle', 'x': 'modif_title_x', 'y': 'modif_title_y'}
]
Then, we can pass this new similarity metrics as a parameter of our visualization
DatasetsSummarizer.plot_datasets_summary(dataframe, new_similarity_metrics)
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for datasets-summarizer-0.1.2.tar.gz
Algorithm | Hash digest | |
---|---|---|
SHA256 | d0cce2b615d235bc33989fe2abb2b16dbdba565f08735bf35d83108a81e5f81a |
|
MD5 | a2df634397621a7cd35b0a3ea2d10d02 |
|
BLAKE2b-256 | 1a967ab6390dcb6a4258e4446e7455c8faa3c0759906e2122182507e233466f6 |
Hashes for datasets_summarizer-0.1.2-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | c3bf08bcce779a26681fbf8cfd4d5bd8656d23c3a7aae61efabe1bff7145b70a |
|
MD5 | d4c114fff4e72d415c0f1769b52e2cdb |
|
BLAKE2b-256 | 575ee52b5a43a37808ad56168386ea357abbd93eb6033de81aeaf32a1fe6f1d1 |