a basic wrapper kernel for DuckDB

These details have not been verified by PyPI

Project links

License
- OSI Approved :: MIT License
Operating System
- OS Independent
Programming Language
- Python :: 3

Project description

DuckDB Kernel for Jupyter

This is a simple DuckDB wrapper kernel which accepts SQL as input, executes it using a previously loaded DuckDB instance and formats the output as a table. There are some magic commands that make teaching easier with this kernel.

Setup
- Using pip
- Using Docker
Usage

Setup

Using pip

Run pip to install the corresponding package from pypi after Jupyter is already installed.

pip install jupyter-duckdb

jupyter kernelspec install <path to the site-packages directory>/duckdb_kernel

Now start Jupyter the usual way and the kernel should be available.

If DuckDB cannot be installed on your system, you can use SQLite as a backend instead. To do this, set the environment variable SQLITE when running pip:

SQLITE=1 pip install jupyter-duckdb

Using Docker

Execute the following command to pull and run a prepared image.

docker run -p 8888:8888 troebs/jupyter-duckdb

There is also a second image. It contains an additional instance of PostgreSQL:

docker run -p 8888:8888 troebs/jupyter-duckdb:postgresql

This image can also be used with JupyterHub and the DockerSpawner / SwarmSpawner and probably with the kubespawner. You can also build your own image using the Dockerfile in the repository.

Usage

A detailed example can be found in the repository. The rest of this section describes the magic commands.

A Note on Magic Commands

Many Jupyter kernels make a difference between magic commands for a single line starting with one percent sign and others for a whole cell starting with two percent signs. The upcoming magic commands always apply to a whole cell. Therefore, it does not matter whether you use a single or two percent signs. However, the magic commands must always be used at the beginning of a cell.

It is also possible to use more than one magic command per cell.

Load a Database

To load the database two magic commands are available.

CREATE creates a new database and therefore overwrites files with the same name without prompting. Using the optional parameter OF you can either provide another DuckDB file or a file with SQL statements. In the first case the included tables will be copied to the new database, while in the second case the SQL statements are just executed. We find this feature very useful to work in a temporary copy of the data and therefore be able to restart at any time. The optional parameter NAME may be used to name a connection and reference it later by using the magic command USE.

%CREATE data.duckdb OF my_statements.sql

LOAD on the other hand loads an existing database and returns an error if it does not exist. (That is why OF cannot be used with LOAD! NAME on the other hand is available also with this magic command.)

%LOAD data.duckdb

Multiple databases can be open at any time. If a new database with the same name is created or loaded, the current one is closed first and saved to disk if necessary.

Please note that :memory: is also a valid file path for DuckDB. The data is then stored exclusively in the main memory. In combination with CREATE and OF this makes it possible to work on a temporary copy in memory.

Although the name suggests otherwise, the kernel can also be used with other databases:

SQLite is automatically used as a fallback if the DuckDB dependency is missing.
To connect to a PostgreSQL instance, you need to specify a database URI starting with (postgresql|postgres|pgsql|psql|pg)://.

Schema Diagrams

The magic command SCHEMA can be used to create a simple schema diagram of the loaded database, showing all created tables, their columns and data types, but without any views. Primary keys are printed in bold and unique keys are underlined. Foreign keys are also highlighted and the dependencies between the tables are shown by arrows.

The optional flag TD can be set to force a vertical layout. This representation requires more space, but can improve readability.

%SCHEMA TD

The optional argument ONLY, followed by one or more table names separated by a comma, can be used to display only the named tables and all those connected with a foreign key.

Graphviz (dot in PATH) is required to render schema diagrams.

Number of Rows

By default, only 20 rows are shown. All further lines are replaced by three dots. When hovering over the three dots using the cursor, the number of omitted lines is displayed. Of course, the number of lines displayed can be changed.

The magic command ALL_ROWS and its short form ALL can be used to display * all* rows of the query in the same cell. Caution: With large result sets this can lead to a frozen Jupyter instance.

%ALL_ROWS
SELECT *
FROM foo
-- all rows

The magic command QUERY_MAX_ROWS followed by an integer can be used to change the number of displayed rows for the current cell.

%QUERY_MAX_ROWS 50
SELECT *
FROM foo
-- 50 rows

The magic command MAX_ROWS followed by an integer can be used to change the number of displayed rows for all future queries including the current cell.

%MAX_ROWS 30
SELECT *
FROM foo
-- 30 rows

SELECT *
FROM bar
-- 30 rows

Ship Tests With Your Notebooks

Simple tests can be loaded from json files with the help of magic command LOAD_TESTS. These tests are stored as a JSON file. Each test is assigned a unique name, a result set and whether the test should check the order of the result. A very simple test file looks like the following JSON object:

{
  "task1": {
    "ordered": false,
    "equals": [
      [
        1,
        "Name 1"
      ],
      [
        2,
        "Name 2"
      ]
    ]
  }
}

To bind a test to a cell, use the magic command TEST in combination with a name. After the cell is executed, the result is evaluated and then displayed below the query result.

%TEST task1
SELECT 2, 'Name 2'
UNION
SELECT 1, 'Name 1'

By default, failed tests will display an explanation, but the notebook will continue to run. Set the DUCKDB_TESTS_RAISE_EXCEPTION environment variable to true to raise an exception when a test fails. This can be useful for automated testing in CI environments.

Disclaimer: The integrated testing is work-in-progress and thus subject to potentially incompatible changes and enhancements.

Relational Algebra

An interpreter for relational algebra queries is integrated in this kernel. The magic command RA activates the relational algebra mode for a single cell:

%RA
π [a, b] (σ [c = 1] (R))

The supported operations are:

Projection π
Selection σ
Rename β
Union ∪
Intersection ∩
Difference \
Natural Join ⋈
Cross Product ×
Division ÷

The optional flag ANALYZE can be used to add an execution diagram to the output.

You can also add comments to queries using -- or /* */, just like in SQL.

The Dockerfile also installs the Jupyter Lab plugin jupyter-ra-extension. It adds the symbols mentioned above and some other supported symbols to the toolbar for insertion on click.

Domain Calculus

An interpreter for domain calculus queries is integrated in this kernel. The magic command DC activates the domain calculus mode for a single cell:

%DC
{ a, b | R(a, b, c) ∧ c = 1 }

Automated Parser Selection

%ALL_RA or %ALL_DC enables the corresponding parser for all subsequently executed cells.

If the magic command %AUTO_PARSER is added to a cell, a parser is automatically selected. If %GUESS_PARSER is executed, the parser is automatically selected for all subsequent cells.

Project details

These details have not been verified by PyPI

Project links

License
- OSI Approved :: MIT License
Operating System
- OS Independent
Programming Language
- Python :: 3

Release history Release notifications | RSS feed

1.5.200

Apr 14, 2026

1.5.100

Mar 25, 2026

1.4.402

Feb 3, 2026

1.4.401

Jan 29, 2026

1.4.400

Jan 29, 2026

1.4.201

Nov 14, 2025

1.4.200

Nov 13, 2025

1.4.114

Nov 10, 2025

1.4.112

Nov 10, 2025

1.4.111

Nov 8, 2025

1.4.109

Nov 7, 2025

This version

1.4.108

Nov 7, 2025

1.4.107

Nov 6, 2025

1.4.106

Nov 5, 2025

1.4.105

Nov 4, 2025

1.4.100

Oct 8, 2025

1.4.0

Sep 17, 2025

1.3.200

Jul 9, 2025

1.3.100

Jun 17, 2025

1.3.0

May 23, 2025

1.2.200

Apr 8, 2025

1.2.105

Mar 25, 2025

1.2.104

Mar 25, 2025

1.2.103

Mar 24, 2025

1.2.102

Mar 18, 2025

1.2.101

Mar 14, 2025

1.2.100

Mar 6, 2025

1.2.7

Mar 4, 2025

1.2.0.6

Feb 27, 2025

1.2.0.5

Feb 21, 2025

1.2.0.4

Feb 18, 2025

1.2.0.3

Feb 13, 2025

1.2.0.2

Feb 12, 2025

1.2.0.1

Feb 12, 2025

1.2.0.0

Feb 7, 2025

1.1.3.4

Jan 30, 2025

1.1.3.2

Nov 13, 2024

1.1.3.1

Nov 5, 2024

1.1.2.3

Nov 4, 2024

1.1.2.2

Oct 25, 2024

1.1.2.1

Oct 23, 2024

0.10.2.1

Apr 28, 2024

0.10.1.1

Apr 12, 2024

0.9.2.6

Jan 29, 2024

0.9.2.5

Jan 23, 2024

0.9.2.4

Jan 23, 2024

0.9.2.3

Jan 22, 2024

0.9.2.2

Jan 18, 2024

0.9.2.1

Nov 20, 2023

0.9.1.8

Nov 7, 2023

0.9.1.6

Nov 7, 2023

0.9.1.5

Nov 7, 2023

0.9.1.4

Nov 2, 2023

0.9.1.3

Oct 26, 2023

0.9.1.2

Oct 26, 2023

0.9.1.1

Oct 18, 2023

0.8.1.5

Oct 12, 2023

0.8.1.4

Sep 18, 2023

0.4.1

Sep 5, 2023

0.3.3

Jan 5, 2023

0.3.2

Jan 5, 2023

0.3.1

Dec 22, 2022

0.3

Dec 5, 2022

0.2

Dec 1, 2022

0.1

Dec 1, 2022

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

jupyter_duckdb-1.4.108.tar.gz (1.5 MB view details)

Uploaded Nov 7, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

jupyter_duckdb-1.4.108-py3-none-any.whl (1.5 MB view details)

Uploaded Nov 7, 2025 Python 3

File details

Details for the file jupyter_duckdb-1.4.108.tar.gz.

File metadata

Download URL: jupyter_duckdb-1.4.108.tar.gz
Upload date: Nov 7, 2025
Size: 1.5 MB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.14.0

File hashes

Hashes for jupyter_duckdb-1.4.108.tar.gz
Algorithm	Hash digest
SHA256	`4ca39258ea1b785b1c8140b46532ee9a8a33d4c7b0fe32537e25386bbe074875`
MD5	`6612c79716185166943ac12764f40ba3`
BLAKE2b-256	`e5f059ee053d27e688f582c98d1f4d0b9994c41a76d9bdc1196e320b37510de2`

See more details on using hashes here.

File details

Details for the file jupyter_duckdb-1.4.108-py3-none-any.whl.

File metadata

Download URL: jupyter_duckdb-1.4.108-py3-none-any.whl
Upload date: Nov 7, 2025
Size: 1.5 MB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.14.0

File hashes

Hashes for jupyter_duckdb-1.4.108-py3-none-any.whl
Algorithm	Hash digest
SHA256	`00da08e9ae72976c6171398c93b3fcf8839549d12b2a1427e87c783f68c9eeab`
MD5	`04b4c8000dc3287cd60943ff5e7caf6d`
BLAKE2b-256	`5961af0e126d8647eebaee11f99fcebeb2a6ed6a0b2fec8d85cbfe8a3c73d162`

See more details on using hashes here.

jupyter-duckdb 1.4.108

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

DuckDB Kernel for Jupyter

Table of Contents

Setup

Using pip

Using Docker

Usage

A Note on Magic Commands

Load a Database

Schema Diagrams

Number of Rows

Ship Tests With Your Notebooks

Relational Algebra

Domain Calculus

Automated Parser Selection

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes