
A package that adds metadata tags to objects stored in S3


AWS S3 Metadata Tagger

The S3 Metadata Tagger adds information to files stored in S3 in the form of object metadata.

To do this, the central handler takes a file location and a metadata-extracting function. It first checks, via a HEAD request, whether the object already carries the requested metadata. If it does not, the handler downloads the file, invokes the extracting function, and writes the resulting metadata to the S3 object with an in-place COPY operation using MetadataDirective="REPLACE".
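The flow described above can be sketched roughly as follows. This is an illustrative sketch, not the package's actual code: the function name `tag_object`, its parameters, and the `required_keys` argument are assumptions. The client is passed in so that any object exposing the boto3 S3 client methods `head_object`, `download_fileobj`, and `copy_object` can be used.

```python
import io
from typing import Any, Callable, Dict, List


def tag_object(
    s3_client: Any,
    bucket: str,
    key: str,
    extract: Callable[[bytes], Dict[str, str]],
    required_keys: List[str],
) -> Dict[str, str]:
    """Hypothetical handler: add extracted metadata to an S3 object if missing."""
    # 1. HEAD request: read the metadata already stored on the object.
    head = s3_client.head_object(Bucket=bucket, Key=key)
    existing = head.get("Metadata", {})
    if all(name in existing for name in required_keys):
        return existing  # already tagged, nothing to do

    # 2. Download the object into memory and run the extracting function.
    buffer = io.BytesIO()
    s3_client.download_fileobj(bucket, key, buffer)
    metadata = {**existing, **extract(buffer.getvalue())}

    # 3. Copy the object onto itself, replacing its metadata.
    #    REPLACE also resets the content type, so it is passed along again.
    s3_client.copy_object(
        Bucket=bucket,
        Key=key,
        CopySource={"Bucket": bucket, "Key": key},
        Metadata=metadata,
        MetadataDirective="REPLACE",
        ContentType=head.get("ContentType", "binary/octet-stream"),
    )
    return metadata
```

With boto3 installed, `s3_client` would typically be `boto3.client("s3")`. S3 stores user metadata values as strings, so the extracting function in this sketch is expected to return a dictionary of strings.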

This package comes with two optional variants for metadata extraction:

  • PDF: for determining the number of pages in a PDF
  • PICTURE: for determining the dimensions of an image

Structure

object_tagger

contains the higher-level orchestration:

  • object_tagger.py contains all the logic for checking whether the file has already been tagged, downloading it, invoking the metadata script, creating the tag object, and adding it to the S3 object.

The metadata scripts are stored in their respective folders:

pdf_tagger

The PDF tagger uses PyPDF2 to determine the number of pages in a PDF.
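A page-counting extractor along these lines might look like the sketch below. The function name `count_pdf_pages` and the metadata key `pages` are assumptions made for illustration, and the code assumes a PyPDF2 release that provides `PdfReader` (older releases name the class `PdfFileReader`).

```python
import io
from typing import Dict


def count_pdf_pages(data: bytes) -> Dict[str, str]:
    """Hypothetical extractor: return the page count of a PDF as metadata."""
    # Imported lazily so PyPDF2 is only required when this variant is used.
    from PyPDF2 import PdfReader

    reader = PdfReader(io.BytesIO(data))
    # S3 metadata values are strings, so the count is converted.
    return {"pages": str(len(reader.pages))}
```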

picture_tagger

Using Pillow, the script gets the width and height of the passed image.
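As a rough sketch of such an extractor, assuming the hypothetical function name `image_dimensions` and the metadata keys `width` and `height` (neither is confirmed by the package):

```python
import io
from typing import Dict


def image_dimensions(data: bytes) -> Dict[str, str]:
    """Hypothetical extractor: return an image's pixel dimensions as metadata."""
    # Imported lazily so Pillow is only required when this variant is used.
    from PIL import Image

    with Image.open(io.BytesIO(data)) as img:
        width, height = img.size  # Pillow reports size as (width, height)
    # S3 metadata values are strings, so both numbers are converted.
    return {"width": str(width), "height": str(height)}
```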


