df2tables: Pandas DataFrames to Interactive DataTables

df2tables is a Python utility for exporting pandas.DataFrame objects to interactive HTML tables using DataTables, a JavaScript library for table functionality. It generates standalone .html files that can be viewed in any browser without Jupyter notebooks, servers, or frameworks.

It is useful for data inspection and feature engineering workflows, especially with large datasets that need interactive exploration.

Features

Converts pandas.DataFrame objects to interactive standalone HTML tables
Browse large datasets with filtering and sorting
DataTables Column Control integration: uses the DataTables Column Control extension for automatic dropdown filters and search, loaded programmatically via JavaScript
Self-contained HTML files with embedded data; no server or Python runtime is needed to view them (the JavaScript and CSS libraries are loaded from CDNs by default)
Works independently of Jupyter and web servers: viewable in any browser, portable, and easy to share
Color-coded formatting for numeric columns
Useful for training dataset inspection and feature engineering: quickly browse large datasets and spot outliers and data quality issues interactively
Easily customizable HTML template
Smart column detection: automatically identifies categorical columns (≤4 unique values) for dropdown filtering

Screenshots

A standalone HTML file that contains a JavaScript array as the data source for DataTables has several advantages. For example, you can browse quite large datasets locally, which is not something you would usually do on a server. The Column Control extension provides dropdown filters for categorical data and search for text columns. By default, filtering is enabled for all non-numeric columns. The project screenshot shows an example with 100k rows and additional HTML rendering.
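To make the "JavaScript array as data source" idea concrete, here is a minimal sketch of how table rows could be embedded directly in an HTML document. It is not the df2tables implementation: the function name build_standalone_html, the element id, and the page layout are all assumptions chosen for illustration.

```python
import html
import json


def build_standalone_html(title, columns, rows):
    """Return a full HTML document with the table data embedded as a JS array."""
    # JSON output is also a valid JavaScript literal. Escaping "</" keeps a cell
    # value such as "</script>" from terminating the script element early.
    data_js = json.dumps(rows).replace("</", "<\\/")
    columns_js = json.dumps([{"title": name} for name in columns]).replace("</", "<\\/")
    return (
        "<!DOCTYPE html>\n"
        f"<html><head><meta charset='utf-8'><title>{html.escape(title)}</title></head>\n"
        "<body><table id='tbl'></table>\n"
        "<script>\n"
        f"const columns = {columns_js};\n"
        f"const data = {data_js};\n"
        "// A DataTables call such as new DataTable('#tbl', {data, columns}) would go here.\n"
        "</script></body></html>\n"
    )


# Hypothetical sample data, not taken from the library
page = build_standalone_html("Demo", ["Name", "Score"], [["Alice", 92.5], ["Bob", -78.3]])
```

Because the data travels inside the file, the page needs no server to fetch rows from; only the table library itself still has to be loaded.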

Installation

pip install df2tables

Quick Start

import pandas as pd
import df2tables as df2t

df = pd.DataFrame({
    "Name": ["Alice", "Bob", "Carol"],
    "Score": [92.5, -78.3, 85.0],
    "Joined": pd.to_datetime(["2021-01-05", "2021-02-10", "2021-03-15"])
})

# Basic usage with color-coded numeric columns
df2t.render(
    df,
    title="User Scores",
    precision=1,
    num_html=["Score"],
    to_file="output.html",
    startfile=True
)

Main Function

render

df2t.render(
    df: pd.DataFrame,
    title: str = "Title",
    precision: int = 2,
    num_html: List[str] = [],
    to_file: Optional[str] = None,
    startfile: bool = True,
    templ_path: str = TEMPLATE_PATH,
    load_column_control: bool = True
) -> Union[str, file_object]

Parameters:

df: Input pandas DataFrame
title: Title for the HTML table
precision: Number of decimal places for floating-point numbers
num_html: List of numeric column names to render with color-coded HTML formatting (negative values in red)
to_file: Output HTML file path. If None, returns HTML string instead of writing file
startfile: If True, automatically opens the generated HTML file in default browser
templ_path: Path to custom HTML template (uses default if not specified)
load_column_control: If True, loads the DataTables Column Control extension programmatically for enhanced filtering and search (default: True)
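The effect of precision and num_html can be pictured with a small sketch. This is an illustration only: format_number is a hypothetical helper, and df2tables may apply the coloring in the template's JavaScript rather than in Python.

```python
def format_number(value, precision=2, color_negative=True):
    """Format a number to `precision` decimals; wrap negative values in a red span."""
    text = f"{value:.{precision}f}"
    if color_negative and value < 0:
        # Assumed markup; the real template may use a CSS class instead
        return f'<span style="color:red">{text}</span>'
    return text


# Values from the Quick Start example, rendered with precision=1
cells = [format_number(v, precision=1) for v in (92.5, -78.3, 85.0)]
```

Only the negative score ends up wrapped in markup; the other two values stay plain text.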

Returns:

HTML string if to_file=None
File object if to_file is specified

DataTables Column Control Extension Integration

The load_column_control parameter enables integration with the DataTables Column Control extension, which adds per-column filtering to the generated tables:

Categorical columns (≤4 unique values): dropdown select filters
Text/numeric columns: search dropdowns and ordering controls
Automatic detection: the module identifies column types and applies the matching Column Control features
Dynamic loading: the Column Control extension is loaded dynamically via JavaScript

# Enable smart integration with DataTables Column Control extension (default)
df2t.render(df, load_column_control=True, to_file="enhanced_table.html")

# Disable Column Control for simpler tables
df2t.render(df, load_column_control=False, to_file="simple_table.html")

sample_df

Generates and renders a built-in example DataFrame for testing:

html_string = df2t.sample_df()

Fast Dataset Browsing

One of the key strengths of df2tables is that it quickly generates interactive HTML tables for dataset exploration. The combination of standalone HTML files and the DataTables Column Control extension makes it fast to browse through multiple datasets.

Bulk Dataset Processing

For exploratory data analysis across multiple datasets, you can generate tables programmatically. The example below uses the vega_datasets package, which provides easy access to a variety of sample datasets commonly used in data visualization and analysis.

Note: Install vega_datasets with pip install vega_datasets to run this example.

import df2tables as df2t
from vega_datasets import data

# Note: with the default startfile=True, this loop would open one browser tab
# per dataset. startfile=False is passed below to prevent that.

for dataset_name in sorted(dir(data)):
    dataset_func = getattr(data, dataset_name)
    try:
        df = dataset_func()
        print(f"{dataset_name}: {len(df.index)} rows")
        
        # df2tables can handle datasets above 100k rows, but we limit to smaller datasets 
        # for this demo to avoid generating too many large files
        if len(df.index) < 100_000:
            df2t.render(
                df, 
                title=f'Dataset: {dataset_name}',
                to_file=f'{dataset_name}.html',
                startfile=False  # Prevent opening all files automatically
            )
    except Exception as e:
        print(f'Error processing {dataset_name}: {e}')

print("Generated HTML files. Open them manually to browse datasets.")

⚠️ Important Note: When startfile=True (default), each generated HTML file opens automatically in your default browser. For bulk processing, set startfile=False to avoid opening dozens of browser tabs simultaneously.

Benefits for Fast Browsing

Instant loading: HTML files with embedded data open directly in the browser without a server
Interactive filtering: the DataTables Column Control extension enables quick data exploration
Offline browsing: generated files work offline once the template references local copies of the libraries (by default they are loaded from CDNs)
Portable: share HTML files easily with colleagues for collaborative data exploration
No Python memory use after generation: unlike a Jupyter notebook, viewing the files does not require a running Python process

Requirements

Python 3.7+
pandas
numpy

Technical Details

DataTables Column Control Extension Integration

The module integrates with the DataTables Column Control extension as follows:

Select columns: columns with ≤4 unique values get dropdown filters (searchList) via Column Control
Search columns: other columns get Column Control's search functionality and ordering controls
Dynamic loading: the Column Control JavaScript libraries are loaded programmatically to keep the templates clean
Fallback: if the Column Control extension cannot be loaded, tables fall back to standard DataTables functionality
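The select/search split described above can be sketched as a simple distinct-value count. This is an assumption about the general approach, not the module's actual code: classify_columns and the "select"/"search" labels are hypothetical, and the sketch works on plain lists so it runs without pandas. With a DataFrame, df[name].nunique() gives a comparable count, since it also ignores missing values by default.

```python
def classify_columns(table, max_unique=4):
    """Map each column name to "select" (few distinct values) or "search"."""
    kinds = {}
    for name, values in table.items():
        # Ignore missing values when counting distinct entries
        distinct = {v for v in values if v is not None}
        kinds[name] = "select" if len(distinct) <= max_unique else "search"
    return kinds


# Hypothetical sample data: "Grade" has 3 distinct values, "Name" has 6
table = {
    "Grade": ["A", "B", "A", "C", "B", None],
    "Name": ["Alice", "Bob", "Carol", "Dan", "Eve", "Frank"],
}
kinds = classify_columns(table)
```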

TODO

Support rendering a minimal HTML snippet (instead of full document) suitable for inclusion in Flask or Jinja2 templates:
- The resulting string would only contain the table markup and JS data bindings.
- All external dependencies (jQuery, DataTables, ColumnControl, styles) would be loaded dynamically via JavaScript, as is already supported by load_column_control=True.
- Ideal for embedding data previews or interactive tables directly into existing web apps.

Error Handling

The module includes robust error handling for:

JSON serialization: Custom encoder handles complex pandas data types
Column compatibility: Automatically converts problematic column types to string representation
Missing columns: Validates num_html column names against DataFrame columns
Script loading: Graceful fallback if the DataTables Column Control extension cannot be loaded
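As an illustration of the JSON serialization point, the sketch below shows the general pattern of a custom encoder with a string fallback. The class name TableEncoder and the set of handled types are assumptions; the encoder shipped with df2tables may cover different types.

```python
import datetime
import decimal
import json


class TableEncoder(json.JSONEncoder):
    """Encoder that converts a few non-JSON types instead of raising TypeError."""

    def default(self, obj):
        # pandas.Timestamp subclasses datetime.datetime, so it takes this branch too
        if isinstance(obj, (datetime.datetime, datetime.date)):
            return obj.isoformat()
        if isinstance(obj, decimal.Decimal):
            return float(obj)
        # Last resort: fall back to the string representation
        return str(obj)


# Hypothetical row mixing types that the default encoder rejects
row = ["Alice", datetime.date(2021, 1, 5), decimal.Decimal("92.5"), complex(1, 2)]
encoded = json.dumps(row, cls=TableEncoder)
```

The string fallback mirrors the "convert problematic column types to string representation" behaviour listed above: a value that cannot be encoded still shows up in the table rather than aborting the export.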

License

MIT License
© Tomasz Sługocki

Appendix: Template Customization

Offline Usage

Note: The generated files load DataTables, jQuery, PureCSS, and the DataTables Column Control extension from CDNs, so viewing them requires an internet connection by default. For truly offline use, modify the template to reference local copies of these libraries instead of the CDN links.
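One way to prepare such a template is to rewrite the CDN URLs to relative paths. The sketch below is a hypothetical helper, not part of df2tables: localize_urls and the vendor directory name are assumptions, and the library files still have to be downloaded into that directory separately.

```python
import posixpath
from urllib.parse import urlparse


def localize_urls(template_text, urls, local_dir="vendor"):
    """Replace each CDN URL in the template text with a relative local path."""
    for url in urls:
        filename = posixpath.basename(urlparse(url).path)
        template_text = template_text.replace(url, f"{local_dir}/{filename}")
    return template_text


# Column Control URLs as listed in this README
cdn_urls = [
    "https://cdn.datatables.net/columncontrol/1.0.6/js/dataTables.columnControl.js",
    "https://cdn.datatables.net/columncontrol/1.0.6/css/columnControl.dataTables.css",
]
snippet = f'<script src="{cdn_urls[0]}"></script>'
localized = localize_urls(snippet, cdn_urls)
```

The rewritten text can be saved as a new template file and passed to render via templ_path.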

Templates use comnt, a minimal markup system based on HTML/JS comments.

<!--[title-->
My Table Title
<!--title]-->

const data = /*[tab_data*/ [...] /*tab_data]*/;
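The marker pairs shown above can be filled with a small regular-expression substitution. The following is a minimal sketch of the idea, not the comnt implementation: the function names are hypothetical, and keeping the markers in the output is an assumption made so that the result remains a usable template.

```python
import re


def _fill(template, open_marker, close_marker, content):
    """Replace everything between a marker pair, keeping the markers themselves."""
    pattern = re.compile(
        re.escape(open_marker) + r".*?" + re.escape(close_marker), re.DOTALL
    )
    # A function replacement avoids backslash processing in `content`
    return pattern.sub(lambda m: open_marker + content + close_marker, template)


def fill_html_block(template, name, content):
    return _fill(template, f"<!--[{name}-->", f"<!--{name}]-->", content)


def fill_js_block(template, name, content):
    return _fill(template, f"/*[{name}*/", f"/*{name}]*/", content)


template = (
    "<h1><!--[title-->\nMy Table Title\n<!--title]--></h1>\n"
    "<script>const data = /*[tab_data*/ [] /*tab_data]*/;</script>"
)
filled = fill_html_block(template, "title", "User Scores")
filled = fill_js_block(filled, "tab_data", '[["Alice", 92.5]]')
```

Since the markers are ordinary HTML and JavaScript comments, the unfilled template is itself a valid page with placeholder content.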

The default HTML template includes:

PureCSS (CDN) for responsive styling
DataTables 2.3.2 (CDN) for table interactivity
jQuery 3.7.1 (CDN)
DataTables Column Control Extension (CDN), loaded programmatically when enabled
JavaScript enhancements for sorting HTML-formatted numbers and coloring negative values

DataTables Column Control Extension CDN Resources

When load_column_control=True, the following DataTables Column Control resources are loaded dynamically:

// JavaScript libraries loaded programmatically
const columncontrol_js = [
    "https://cdn.datatables.net/columncontrol/1.0.6/js/dataTables.columnControl.js",
    "https://cdn.datatables.net/columncontrol/1.0.6/js/columnControl.dataTables.js"
];

// CSS loaded after JavaScript initialization
const columncontrol_css = 
    "https://cdn.datatables.net/columncontrol/1.0.6/css/columnControl.dataTables.css";

comnt keeps the HTML template working as a standalone file (and avoids JSON.parse), but you can also use other templating systems such as Jinja2 by rendering the final content afterwards.

Custom Templates

Copy and modify datatable_templ.html to apply custom styling or libraries, then pass the new template path to templ_path.

Customization

# Return HTML string for further processing
html_content = df2t.render(df, to_file=None)

# Use custom template
df2t.render(
    df,
    to_file="custom_output.html",
    templ_path="my_custom_template.html"
)

# Disable DataTables Column Control extension for custom implementations
df2t.render(
    df,
    to_file="basic_table.html",
    load_column_control=False
)

# Handle MultiIndex columns (experimental)
# MultiIndex columns are automatically flattened with underscore separation
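For reference, underscore flattening of MultiIndex columns can be sketched as follows. This is an illustration under assumptions, not the library's code: flatten_columns is a hypothetical helper, and skipping empty levels is a choice made here. Iterating over a pandas MultiIndex yields tuples, so with a DataFrame the helper could be applied as df.columns = flatten_columns(df.columns).

```python
def flatten_columns(columns, sep="_"):
    """Join the levels of tuple column labels into single strings."""
    flat = []
    for col in columns:
        if isinstance(col, tuple):
            # Skip empty levels so ("Name", "") becomes "Name", not "Name_"
            parts = [str(level) for level in col if str(level) != ""]
            flat.append(sep.join(parts))
        else:
            flat.append(str(col))
    return flat


# Hypothetical column labels, as produced by a groupby aggregation
columns = [("Score", "mean"), ("Score", "max"), ("Name", "")]
flat = flatten_columns(columns)
```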
