arrowcm

Arrowcm is a Jupyter contents manager that lets you use any pyarrow-supported filesystem as the storage backend for your notebooks and files.

Arrowcm has been tested with JupyterLab 4, but it might also work with previous versions.

As Arrowcm relies on pyarrow, any pyarrow-supported filesystem should be available, and fsspec filesystems should work too.

It is built on the contents manager interface of jupyter-server and will not work with the older contents manager interface of the classic notebook package.

It is available on PyPI:

pip install arrowcm

After installation, you need to configure the storage that you want to use.

This is done in your Jupyter config file, where you assign an instance of a PyArrow filesystem to the filesystem property of ArrowContentsManager and set root_dir to the path that should act as the root of the file browser.

from pyarrow import fs

c = get_config()  # noqa: F821 (provided by Jupyter when it loads the config file)

c.ServerApp.contents_manager_class = "arrowcm.ArrowContentsManager"

# S3 example
c.ArrowContentsManager.filesystem = fs.S3FileSystem(
    access_key="myaccesskey",
    secret_key="mysecretkey",
    region=fs.resolve_s3_region("mybucket"),
)
c.ArrowContentsManager.root_dir = "mybucket/notebooks"

# HDFS example (an alternative to S3: configure only one filesystem,
# because a later assignment overrides an earlier one)
# c.ArrowContentsManager.filesystem = fs.HadoopFileSystem("myhdfshost")
# c.ArrowContentsManager.root_dir = "/user/jupyter/notebooks"

Arrowcm aims to be as compatible as possible with Jupyter's default file contents manager, so most of that manager's configuration options should also work with ArrowContentsManager.
