Skip to main content

Tool to manage notebooks and clean output cells.

Project description

databricks-workspace-tool

dwt is a tool to clear run cells from notebooks, for example where there might be concern about data held in run cells, or as preparation for commit to source control.

You can also use it to import/export multiple notebooks with this capability, in use cases where dbc export may not be possible due to volume limits.

Commands

Command Sub-Command Parameters Description
list notebooks List all notebooks in workspace.
list libraries List all libraries in workspace.
export notebooks path: location to output zip of notebooks Exports all notebooks from a workspace as base64 source code. The process will remove annotations for run cells
import notebooks path: location of notebooks.zip
import_prefix: folder to import into (default: IMPORT)
Import notebooks into workspace.
clean folders Delete all empty folders in workspace.
clean notebooks Remove annotations for run cells from all notebooks in workspace.

Installation

In a python 3.7 environment install this repository, e.g:
pip install git+https://github.com/frogrammer/fire-commands.git
The tool can be installed to an azure cloud shell.

Databricks Workspace Login

The dwt CLI is built using the databricks CLI sdk https://github.com/databricks/databricks-cli, and uses its authentication mechanism to login to a workspace.
To login to an azure databricks workspace using a user token:

echo MY_TOKEN >> token.txt
databricks configure --host MY_HOST -f token.txt
rm token.txt 

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

databricks-workspace-tool-0.1.4.tar.gz (4.8 kB view hashes)

Uploaded Source

Built Distribution

databricks_workspace_tool-0.1.4-py3-none-any.whl (10.9 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page