Skip to main content

Tool to manage notebooks and clean output cells.

Project description

databricks-workspace-tool

dwt is a tool to clear run cells from notebooks, for example where there might be concern about data held in run cells, or as preparation for commit to source control.

You can also use it to import/export multiple notebooks with this capability, in use cases where dbc export may not be possible due to volume limits.

Commands

Command Sub-Command Parameters Description
list notebooks List all notebooks in workspace.
list libraries List all libraries in workspace.
export notebooks path: location to output zip of notebooks Exports all notebooks from a workspace as base64 source code. The process will remove annotations for run cells
import notebooks path: location of notebooks.zip
import_prefix: folder to import into (default: IMPORT)
Import notebooks into workspace.
clean folders Delete all empty folders in workspace.
clean notebooks Remove annotations for run cells from all notebooks in workspace.

Installation

In a python 3.7 environment install this repository, e.g:
pip install git+https://github.com/frogrammer/fire-commands.git
The tool can be installed to an azure cloud shell.

Databricks Workspace Login

The dwt CLI is built using the databricks CLI sdk https://github.com/databricks/databricks-cli, and uses its authentication mechanism to login to a workspace.
To login to an azure databricks workspace using a user token:

echo MY_TOKEN >> token.txt
databricks configure --host MY_HOST -f token.txt
rm token.txt 

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

databricks-workspace-tool-0.1.4.tar.gz (4.8 kB view details)

Uploaded Source

Built Distribution

databricks_workspace_tool-0.1.4-py3-none-any.whl (10.9 kB view details)

Uploaded Python 3

File details

Details for the file databricks-workspace-tool-0.1.4.tar.gz.

File metadata

  • Download URL: databricks-workspace-tool-0.1.4.tar.gz
  • Upload date:
  • Size: 4.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.3.0 pkginfo/1.7.0 requests/2.24.0 setuptools/50.3.1.post20201107 requests-toolbelt/0.9.1 tqdm/4.50.2 CPython/3.8.5

File hashes

Hashes for databricks-workspace-tool-0.1.4.tar.gz
Algorithm Hash digest
SHA256 8777951aedebb41324dbb43a64f89c0f8a267b6a1628bf7a475c5e210fffa059
MD5 52eec63fe6e8b9c9a4c3a622dc9b069b
BLAKE2b-256 8cd08620cccd92a16daf0cc9c6790c9891552d60a2840b12f17dc1eec46d9b4c

See more details on using hashes here.

File details

Details for the file databricks_workspace_tool-0.1.4-py3-none-any.whl.

File metadata

  • Download URL: databricks_workspace_tool-0.1.4-py3-none-any.whl
  • Upload date:
  • Size: 10.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.3.0 pkginfo/1.7.0 requests/2.24.0 setuptools/50.3.1.post20201107 requests-toolbelt/0.9.1 tqdm/4.50.2 CPython/3.8.5

File hashes

Hashes for databricks_workspace_tool-0.1.4-py3-none-any.whl
Algorithm Hash digest
SHA256 247a2c39cc2949ee318c159932e6256776b2eec8b95f33bbfae05d53f6194585
MD5 acc913644288e8cebd42a978570f50e7
BLAKE2b-256 655ef887da39cb0ee37ddfd4bd0247091f5c7e88649b39a7b7c54ff17637f624

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page