Skip to main content

Share and Edit Pandas Dataframes with a Link!

Project description

image

share-df: Instantly Share and Modify Dataframes With a Web Interface From Anywhere

PyPI Downloads        PyPI Latest Release        Demos and Source Code

Goal

This package enables cross-collaboration between nontechnical and technical contributors by allowing developers to generate a URL for free with one line of code that they can then send to nontechnical contributors enabling them to modify the dataframe with a web app. Then, they can send it back to the developer, directly generating the modified dataframe, maintaining code continuity, and removing the burden of file transfer and conversion to other file formats.

Technical Contributor Features

  • pip install share-df
  • one function call to generate a link to send, accessible anywhere
  • changes made by the client are received back as a dataframe for seamless development

Nontechnical Contributor Features

  • Easy Google OAuth login
  • Seamless UI to modify the dataframe
    • Change column names
    • Drag around columns
    • Change all values
    • Rename columns
    • Add new columns and rows
  • Send the results back with the click of a button
  • Work with large amounts of data quickly

How to Run

  1. pip install share-df
  2. If you do not already have one, generate an auth token for free in less than a minute with ngrok
  3. Create a .env file in your directory with NGROK_AUTHTOKEN=
  4. import and call the function on any df!

Example Code

import pandas as pd
from share_df import pandaBear

df = pd.DataFrame({
    'Name': ['John', 'Alice', 'Bob', 'Carol'],
    'City': ['New York', 'London', 'Paris', 'Tokyo'],
    'Salary': [50000, 60000, 75000, 65000]
})

df = pandaBear(df)
print(df)

Handling Big Data

  • As per the demo, currently, the site takes 6 seconds to load a million rows.
  • After loading, it can handle cell changes, row additions, column sorting, new columns, fast scrolling, and sending the data back frictionlessly.
  • That being said given interest I can improve this experience.

Google Colab

  • This code works by creating a localhost and then tunneling traffic to make it accessible to other people.
  • Thereby, since Google Colab code runs on a VM this is an interesting challenge to handle.
  • As of 0.1.7 the package offers support for creating a Google-generated link for DFs but this link is not shareable.
  • For Google Colab instead of using a .env I recommend putting your NGROK_AUTHTOKEN into the Google Colab secrets manager (key icon on the left side of the screen). That way your secrets also can be synced to other notebooks and you don't have to repeat the .env uploading each time.
  • I initially aimed for full functionality (link sharing) with Google Colab however it seems impossible as Colab locks it to Colab session authentification.
  • Google has also stated that they may deprecate their serve_kernel_port_as_window function in the future in which case it will be swapped to serve_kernel_port_as_iframe and the same functionality will remain except it will be in the IFrame.
  • For now, there is an optional parameter that allows you to use the editor in IFrame mode.
  • Check out a demo notebook here.

Future Features

  • Better Dataframe handling (pagination, lazy loading, better frontend for big data)
  • Better Security (input sanitization, CSRF protection, configurable endpoint rate limiting)
  • Better UI (search, dark mode, export option)
  • IFrame Usage Option in Google Colab
  • True Asynchronicity with ipyparallel
  • Code Recreation (instead of overwriting the df just solve the code needed)
  • Multiple authenticated users

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

share_df-1.0.4.tar.gz (14.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

share_df-1.0.4-py3-none-any.whl (13.5 kB view details)

Uploaded Python 3

File details

Details for the file share_df-1.0.4.tar.gz.

File metadata

  • Download URL: share_df-1.0.4.tar.gz
  • Upload date:
  • Size: 14.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.8.4 CPython/3.13.0 Darwin/23.5.0

File hashes

Hashes for share_df-1.0.4.tar.gz
Algorithm Hash digest
SHA256 4bcdae4596b42d6fa2841ec7040dbbad97d8a9c82c4bd9bf5f675e0e1206f4c5
MD5 d1030c90f49fe1261611eb726a35c7e3
BLAKE2b-256 129ab4312c9fc4df4e894a81b4716e804fdf2b1f6dcc84c3ca053bb731f8503a

See more details on using hashes here.

File details

Details for the file share_df-1.0.4-py3-none-any.whl.

File metadata

  • Download URL: share_df-1.0.4-py3-none-any.whl
  • Upload date:
  • Size: 13.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.8.4 CPython/3.13.0 Darwin/23.5.0

File hashes

Hashes for share_df-1.0.4-py3-none-any.whl
Algorithm Hash digest
SHA256 60d8cfbc9d5783bd1aa68ecd80c89f72fd285f1d4c5f356d4da4bec31fddd674
MD5 55b2fa3479882da14a8f74f648f22635
BLAKE2b-256 a3c381aa0248ae72902a9aafd6b5630c8c2560f9c6100e25f3a7af55ac685a01

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page