Skip to main content

Splitter de fichiers pdfs sur S3

Project description

S3PdfSplitter Build Status Coverage Status PyPI version PyPI - Python Version

Python aws-s3 pdf spliter

Usage

basic usage :

from PdfSplitter import Splitter

spliter = Splitter("config.json")
spliter.split(data)

exemple config.json :

{
    "aws" : {
        "access_key_id" : "aws-acces-key",
        "secret_access_key" : "aws secret",
    },
    "s3" : {
        "bucket" : "bucket"
    }
}

Note that the config is managed with ConfigEnv so you can provide an .ini file or overide the config with environement variable ( AWS_S3_BUCKET, AWS_ACCESS_KEY_ID and AWS_SECRET_ACCESS_KEY )

example data :

{
    "input" : [
        "firstFile.pdf",
        "secondFile.pdf"
    ],
    "output": [
        {
            "s3Key": "output1.pdf",
            "pages": [
                { "index": 0, "pages": [0,1] },
                { "index": 1, "pages": [0,1] }
            ]
        },{
            "s3Key": "output2.pdf",
            "pages": [
                { "index": 0, "pages": [0] },
                { "index": 1, "pages": [0] },
                { "index": 0, "pages": [1] },
                { "index": 1, "pages": [1] }
            ]
        }
    ]
}

this will produce 2 pdfs in your s3:

  • the first, output1.pdf, with page 0 and 1 from firstFile and page 0 and 1 from secondFile
  • the second, output2.pdf, with page 0 from firstFile, page 0 from secondFile, page 1 from firstFile and page 1 from secondFile

Developpement guide

installation

with virtualenv :

# create virtualenv
virtualenv -p python3 .venv

# activate venv
source .venv/bin/activate

# install dependancies
pip install -r requirements.txt
pip install -r requirements-dev.txt

testing

with unittest :

# if your test config is setup :
python -m unittest

# if you want to overide your test config :
S3_BUCKET=<your bucket> AWS_ACCESS_KEY_ID=<your key id> AWS_SECRET_ACCESS_KEY=<your key secret> python -m unittest

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Files for S3PdfSplitter, version 1.0.8
Filename, size File type Python version Upload date Hashes
Filename, size S3PdfSplitter-1.0.8.tar.gz (2.9 kB) File type Source Python version None Upload date Hashes View hashes

Supported by

Elastic Elastic Search Pingdom Pingdom Monitoring Google Google BigQuery Sentry Sentry Error logging AWS AWS Cloud computing DataDog DataDog Monitoring Fastly Fastly CDN DigiCert DigiCert EV certificate StatusPage StatusPage Status page