Skip to main content

Sphinx extension to render JSON data as tables

Project description

sphinxcontrib-jsontable

Tests Coverage Python

Languages: English | 日本語

A powerful Sphinx extension that renders JSON data (from files or inline content) as beautifully formatted reStructuredText tables. Perfect for documentation that needs to display structured data, API examples, configuration references, and data-driven content.

Background / Motivation

In recent years, there has been an increasing trend of using documents as data sources for Retrieval Augmented Generation (RAG). However, tabular data within documents often loses its structural relevance during the process of being ingested by RAG systems. This presented a challenge where the original value of the structured data could not be fully leveraged.

Against this backdrop, sphinxcontrib-jsontable was developed to directly embed structured data, such as JSON, as meaningful tables in Sphinx-generated documents, with the objective to ensure that readability and the data's value as a source effectively coexist.

Features

Flexible Data Sources

  • Load JSON from files within your Sphinx project
  • Embed JSON directly inline in your documentation
  • Support for relative file paths with safe path resolution

📊 Multiple Data Formats

  • JSON objects (single or arrays)
  • 2D arrays with optional headers
  • Mixed data types with automatic string conversion
  • Nested data structures (flattened appropriately)

🎛️ Customizable Output

  • Optional header rows with automatic key extraction
  • Row limiting for large datasets
  • Custom file encoding support
  • Responsive table formatting

🔒 Robust & Safe

  • Path traversal protection
  • Comprehensive error handling
  • Encoding validation
  • Detailed logging for debugging

Installation

From PyPI

pip install sphinxcontrib-jsontable

From Source

git clone https://github.com/sasakama-code/sphinxcontrib-jsontable.git
cd sphinxcontrib-jsontable
pip install -e .

Quick Start

1. Enable the Extension

Add to your conf.py:

extensions = [
    # ... your other extensions
    'sphinxcontrib.jsontable',
]

2. Create Sample Data

Create data/users.json:

[
  {
    "id": 1,
    "name": "Alice Johnson",
    "email": "alice@example.com",
    "department": "Engineering",
    "active": true
  },
  {
    "id": 2,
    "name": "Bob Smith",
    "email": "bob@example.com", 
    "department": "Marketing",
    "active": false
  }
]

3. Add to Your Documentation

In reStructuredText (.rst):

User Database
=============

.. jsontable:: data/users.json
   :header:
   :limit: 10

In Markdown (with myst-parser):

# User Database

```{jsontable} data/users.json
:header:
:limit: 10
```

4. Build Your Documentation

sphinx-build -b html docs/ build/html/

Comprehensive Usage Guide

Data Format Support

Array of Objects (Most Common)

Perfect for database records, API responses, configuration lists:

[
  {"name": "Redis", "port": 6379, "ssl": false},
  {"name": "PostgreSQL", "port": 5432, "ssl": true},
  {"name": "MongoDB", "port": 27017, "ssl": true}
]
.. jsontable:: data/services.json
   :header:

Output: Automatically generates headers from object keys (name, port, ssl).

2D Arrays with Headers

Great for CSV-like data, reports, matrices:

[
  ["Service", "Port", "Protocol", "Status"],
  ["HTTP", 80, "TCP", "Active"],
  ["HTTPS", 443, "TCP", "Active"],
  ["SSH", 22, "TCP", "Inactive"]
]
.. jsontable:: data/ports.json
   :header:

Output: First row becomes the table header.

2D Arrays without Headers

Simple tabular data:

[
  ["Monday", "Sunny", "75°F"],
  ["Tuesday", "Cloudy", "68°F"],
  ["Wednesday", "Rainy", "62°F"]
]
.. jsontable:: data/weather.json

Output: All rows treated as data (no headers).

Single Object

Configuration objects, settings, metadata:

{
  "database_host": "localhost",
  "database_port": 5432,
  "debug_mode": true,
  "max_connections": 100
}
.. jsontable:: data/config.json
   :header:

Output: Keys become one column, values become another.

Directive Options Reference

Option Type Default Description Example
header flag off Include first row as table header :header:
encoding string utf-8 File encoding for JSON files :encoding: utf-16
limit positive int unlimited Maximum rows to display :limit: 50

Advanced Examples

Large Dataset with Pagination

For performance and readability with large datasets:

.. jsontable:: data/large_dataset.json
   :header:
   :limit: 20

*Showing first 20 of 1000+ records. See full dataset in source file.*

Non-UTF8 Encoding

Working with legacy systems or specific character encodings:

.. jsontable:: data/legacy_data.json
   :encoding: iso-8859-1
   :header:

Inline JSON for Examples

Perfect for API documentation, examples, tutorials:

API Response Format
==================

The user endpoint returns data in this format:

.. jsontable::

   {
     "user_id": 12345,
     "username": "john_doe",
     "email": "john@example.com",
     "created_at": "2024-01-15T10:30:00Z",
     "is_verified": true,
     "profile": {
       "first_name": "John",
       "last_name": "Doe",
       "avatar_url": "https://example.com/avatar.jpg"
     }
   }

Complex Nested Data

For nested JSON, the extension flattens appropriately:

.. jsontable::

   [
     {
       "id": 1,
       "name": "Product A",
       "category": {"name": "Electronics", "id": 10},
       "tags": ["popular", "sale"],
       "price": 99.99
     }
   ]

Note: Objects and arrays in values are converted to string representations.

Integration Examples

With Sphinx Tabs

Combine with sphinx-tabs for multi-format documentation:

.. tabs::

   .. tab:: JSON Data

      .. jsontable:: data/api_response.json
         :header:

   .. tab:: Raw JSON

      .. literalinclude:: data/api_response.json
         :language: json

With Code Blocks

Document API endpoints with request/response examples:

Get Users Endpoint
==================

**Request:**

.. code-block:: http

   GET /api/v1/users HTTP/1.1
   Host: api.example.com
   Authorization: Bearer <token>

**Response:**

.. jsontable::

   [
     {
       "id": 1,
       "username": "alice",
       "email": "alice@example.com",
       "status": "active"
     },
     {
       "id": 2, 
       "username": "bob",
       "email": "bob@example.com",
       "status": "inactive"
     }
   ]

In MyST Markdown

Full MyST Markdown support for modern documentation workflows:

# Configuration Reference

## Database Settings

```{jsontable} config/database.json
:header:
:encoding: utf-8
```

## Feature Flags

```{jsontable}
[
  {"feature": "dark_mode", "enabled": true, "rollout": "100%"},
  {"feature": "new_dashboard", "enabled": false, "rollout": "0%"},
  {"feature": "advanced_search", "enabled": true, "rollout": "50%"}
]
```

File Organization Best Practices

Recommended Directory Structure

docs/
├── conf.py
├── index.rst
├── data/
│   ├── users.json
│   ├── products.json
│   ├── config/
│   │   ├── database.json
│   │   └── features.json
│   └── examples/
│       ├── api_responses.json
│       └── error_codes.json
└── api/
    └── endpoints.rst

Naming Conventions

  • Use descriptive filenames: user_permissions.json not data1.json
  • Group related data in subdirectories: config/, examples/, test_data/
  • Include version or date when appropriate: api_v2_responses.json

Performance Considerations

Large Files

For files over 1000 rows, consider:

.. jsontable:: data/large_dataset.json
   :header:
   :limit: 100

.. note::
   This table shows the first 100 entries. Download the complete dataset: 
   :download:`large_dataset.json <data/large_dataset.json>`

Build Time Optimization

  • Use :limit: for very large datasets
  • Consider splitting large files into smaller chunks
  • Cache processed JSON when possible

Memory Usage

The extension loads entire JSON files into memory. For extremely large files (>100MB), consider:

  • Preprocessing data into smaller chunks
  • Using database views instead of static JSON
  • Implementing custom pagination

Troubleshooting

Common Issues

Error: "No JSON data source provided"

# ❌ Missing file path or content
.. jsontable::

# ✅ Provide file path or inline content  
.. jsontable:: data/example.json

Error: "JSON file not found"

  • Check file path relative to source directory
  • Verify file exists and has correct permissions
  • Ensure no typos in filename

Error: "Invalid inline JSON"

  • Validate JSON syntax using online validator
  • Check for trailing commas, unquoted keys
  • Ensure proper escaping of special characters

Encoding Issues

# For non-UTF8 files
.. jsontable:: data/legacy.json
   :encoding: iso-8859-1

Empty Tables

  • Check if JSON file is empty or null
  • Verify JSON structure (must be array or object)
  • Use :limit: option to check if data exists beyond limit

Debug Mode

Enable detailed logging in conf.py:

import logging
logging.basicConfig(level=logging.DEBUG)

# For sphinx-specific logs
extensions = ['sphinxcontrib.jsontable']

Testing Configuration

Create a simple test file to verify setup:

[{"test": "success", "status": "ok"}]
.. jsontable:: test.json
   :header:

Security Considerations

Path Traversal Protection

The extension automatically prevents directory traversal attacks:

# ❌ This will be blocked
.. jsontable:: ../../etc/passwd

# ✅ Safe relative paths only
.. jsontable:: data/safe_file.json

File Access

  • Only files within the Sphinx source directory are accessible
  • No network URLs or absolute system paths allowed
  • File permissions respected by the system

Migration Guide

From Other Extensions

From sphinx-jsonschema:

  • Replace .. jsonschema:: with .. jsontable::
  • Remove schema validation options
  • Add :header: option if needed

From Custom Solutions:

  • Export your data to JSON format
  • Replace custom table generation with .. jsontable::
  • Update file paths to be relative to source directory

Version Compatibility

  • Sphinx: 3.0+ (recommended: 4.0+)
  • Python: 3.10+ (recommended: 3.11+)
  • Docutils: 0.14+

API Reference

Core Classes

JsonTableDirective

  • Main Sphinx directive class
  • Handles option parsing and execution
  • Coordinates data loading, conversion, and rendering

JsonDataLoader

  • Loads JSON from files or inline content
  • Validates encoding and file paths
  • Provides secure file access

TableConverter

  • Transforms JSON structures into 2D table data
  • Handles different data formats (objects, arrays, mixed)
  • Manages header extraction and row limiting

TableBuilder

  • Generates Docutils table nodes
  • Creates proper table structure with headers/body
  • Handles cell formatting and padding

Error Handling

All errors inherit from JsonTableError:

  • File access errors
  • JSON parsing errors
  • Invalid data structure errors
  • Path traversal attempts

Contributing

We welcome contributions! See CONTRIBUTING.md for:

  • Development setup
  • Code style guidelines
  • Testing procedures
  • Pull request process

Development Setup

git clone https://github.com/sasakama-code/sphinxcontrib-jsontable.git
cd sphinxcontrib-jsontable
pip install -e ".[dev]"
pytest

Running Tests

# Run all tests
pytest

# Run with coverage
pytest --cov=sphinxcontrib.jsontable

# Run specific test
pytest tests/test_directives.py::test_json_table_basic

Examples Repository

See the examples/ directory for:

  • Complete Sphinx project setup
  • Various data format examples
  • Integration with other extensions
  • Advanced configuration examples
cd examples/
sphinx-build -b html . _build/html/

Changelog

See CHANGELOG.md for detailed version history and release notes.

License

This project is licensed under the MIT License.

Support

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

sphinxcontrib_jsontable-0.1.0.tar.gz (49.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

sphinxcontrib_jsontable-0.1.0-py3-none-any.whl (13.5 kB view details)

Uploaded Python 3

File details

Details for the file sphinxcontrib_jsontable-0.1.0.tar.gz.

File metadata

  • Download URL: sphinxcontrib_jsontable-0.1.0.tar.gz
  • Upload date:
  • Size: 49.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for sphinxcontrib_jsontable-0.1.0.tar.gz
Algorithm Hash digest
SHA256 30841168bea982c0f6cd608fcfe47c560f4b29c1f323cba9024f238edd01e98b
MD5 28b95852c8e4098685e6bf437fe47e32
BLAKE2b-256 19dfbcda8a97d23b320a32db8f10fb8271f70dda3ea0ccd6f62312b562264425

See more details on using hashes here.

Provenance

The following attestation bundles were made for sphinxcontrib_jsontable-0.1.0.tar.gz:

Publisher: release.yml on sasakama-code/sphinxcontrib-jsontable

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file sphinxcontrib_jsontable-0.1.0-py3-none-any.whl.

File metadata

File hashes

Hashes for sphinxcontrib_jsontable-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 09e62d5d12dd8de1b037e2cc280c619c4c4e3c6cf7470aa296a3f2f71c1d7d07
MD5 aa5144bd1a1f0503b201f1f57e4b6f8e
BLAKE2b-256 eb9f2714df0c336bd3b6eeb76a73e8cb91bb59e648a7f44a68a9ed2d459870b8

See more details on using hashes here.

Provenance

The following attestation bundles were made for sphinxcontrib_jsontable-0.1.0-py3-none-any.whl:

Publisher: release.yml on sasakama-code/sphinxcontrib-jsontable

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page