Skip to main content

Automatically profile your pandas dataframes in jupyter lab.

Project description

PyPi Binder Lite

Profile your Pandas Dataframes! Autoprofiler will automatically visualize your Pandas dataframes after every execution, no extra code necessary.

Autoprofiler allows you to spend less time specifying charts and more time interacting with your data by automatically showing you profiling information like:

  • Distribution of each column
  • Sample values
  • Summary statistics

Updates profiles as your data updates

screenshot of Autoprofiler

Autoprofiler reads your current Jupyter notebook and produces profiles for the Pandas Dataframes in your memory as they change.

https://user-images.githubusercontent.com/13400543/199877605-ba50f9c8-87e5-46c9-8207-1c6496bb3b18.mov

Install

To instally locally use pip and then open jupyter lab and the extension will be running.

pip install -U digautoprofiler

Please note, AutoProfiler only works in JupyterLab with version >=3.x, < 4.0.0.

Try it out

To try out Autoprofiler in a hosted notebook, use one of the options below

Jupyter Lite Binder
Lite Binder

Browser support: AutoProfiler has been developed and tested with Chrome.

Development Install

For development install instructions, see CONTRIBUTING.md.

If you're having install issues, see TROUBLESHOOTING.md.

Acknowledgements

Big thanks to the Rill Data team! Much of our profiler UI code is adapted from Rill Developer.

Citation

Please reference our VIS'23 paper:

@article{epperson23autoprofiler,
  title={Dead or Alive: Continuous Data Profiling for Interactive Data Science},
  author={Will Epperson and Vaishnavi Goranla and Dominik Moritz and Adam Perer},
  journal={IEEE Transactions on Visualization and Computer Graphics},
  year={2023},
  url={https://arxiv.org/abs/2308.03964}
}

Let us know what you think! 📢

We would love to hear your feedback on how you are using AutoProfiler! Please fill out this form or email Will at willepp@cmu.edu.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

digautoprofiler-3.0.1.tar.gz (1.9 MB view details)

Uploaded Source

Built Distribution

digautoprofiler-3.0.1-py3-none-any.whl (3.1 MB view details)

Uploaded Python 3

File details

Details for the file digautoprofiler-3.0.1.tar.gz.

File metadata

  • Download URL: digautoprofiler-3.0.1.tar.gz
  • Upload date:
  • Size: 1.9 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.0 CPython/3.9.10

File hashes

Hashes for digautoprofiler-3.0.1.tar.gz
Algorithm Hash digest
SHA256 49e30287eb9e21207f80cfd6766ef57bb3cf514f540289fe85364a6fe8dd3d0f
MD5 3e01cfbbd2de19f3411dc060f9256798
BLAKE2b-256 4b1ed216983d0e0619ffa17af0594fac609f7772b0ff1fc87bb6a6d117aaea08

See more details on using hashes here.

File details

Details for the file digautoprofiler-3.0.1-py3-none-any.whl.

File metadata

File hashes

Hashes for digautoprofiler-3.0.1-py3-none-any.whl
Algorithm Hash digest
SHA256 18e1fd07511575c9161c1bd56a2032bac0021e18052024950d3beb7bd9551316
MD5 195101c6d43647eaa5592893af35478b
BLAKE2b-256 a181568185df840ff1d28903c92e47800d2a5c2c2b9cf94cabbdbd8f6930c4f2

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page