Jupyter Notebook Contents Manager for AWS S3
Project description
Jupyter S3
Jupyter Notebook Contents Manager for AWS S3.
Installation
pip install jupyters3
Configuration
To configure Jupyter Notebook to use JupterS3, you can add the following to your notebook config file.
from jupters3 import JupyterS3, JupyterS3SecretAccessKeyAuthentication
c = get_config()
c.NotebookApp.contents_manager_class = JupyterS3
and must also set the following settings on c.JupyterS3
in your config file.
Setting | Description | Example |
---|---|---|
aws_region |
The AWS region in which the bucket is located | 'eu-west-1' |
aws_s3_bucket |
The name of the S3 bucket. | 'my-example-bucket' |
aws_s3_host |
The hostname of the AWS S3 API. Typically, this is of the form s3-<aws_region>.amazonaws.com . |
's3-eu-west-1.amazonaws.com' |
prefix |
The prefix to all keys used to store notebooks and checkpoints. This can be the empty string '' . If non-empty, typically this would end in a forward slash / . |
'some-prefix/ ' |
You must also, either, authenticate using a secret key, in which case you must have the following configuration
from jupters3 import JupyterS3SecretAccessKeyAuthentication
c.JupyterS3.authentication_class = JupyterS3SecretAccessKeyAuthentication
and the following settings on c.JupyterS3SecretAccessKeyAuthentication
Setting | Description | Example |
---|---|---|
aws_access_key_id |
The ID of the AWS access key used to sign the requests to the AWS S3 API. | ommitted |
aws_secret_access_key |
The secret part of the AWS access key used to sign the requests to the AWS S3 API. | ommitted |
or authenticate using a role in an ECS container, in which case you must have the following configuration
from jupters3 import JupyterS3ECSRoleAuthentication
c.JupyterS3.authentication_class = JupyterS3ECSRoleAuthentication
where JupyterS3ECSRoleAuthentication does not have configurable options.
Differences from S3Contents
-
There are no extra dependencies over those already required for Jupyter Notebook. Specifically, there is no virtual filesystem library such as S3FS used, boto3 is not used, and Tornado is used as the HTTP client.
-
Checkpoints are also saved to S3, under the key
<file_name>/.checkpoints/
. -
Multiple checkpoints are saved.
-
The event loop is mostly not blocked during requests to S3. There are some exceptions due to Jupyter Notebook expecting certain requests to block.
-
Uploading arbitrary files, such as JPEGs, and viewing them in Jupyter or downloading them, works.
-
Copying and renaming files don't download or re-upload object data from or to S3. "PUT Object - Copy" is used instead.
-
Authentication from EC2 instances using roles is not supported, but authentication using roles from ECS containers is supported.
-
Folders are created using a 0 byte object with key suffix
/
(forward slash). A single forward slash suffix is consistent with both the AWS Console and AWS AppStream.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for jupyters3-0.0.35-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | ad2d66e38c03db103aac7fd99149dde618a09bd628646268cd1e4f06df082cc0 |
|
MD5 | ce7e66069a80a23186d9428ec76e0020 |
|
BLAKE2b-256 | aa21d2915b6dc9d119cf4080b69ac71c1d4535e4197567c8bfa1c6dc55b38b50 |