Splitter de fichiers pdfs sur S3
Project description
S3PdfSplitter
Python aws-s3 pdf spliter
Usage
basic usage :
from PdfSplitter import Splitter
spliter = Splitter("config.json")
spliter.split(data)
exemple config.json :
{
"aws" : {
"access_key_id" : "aws-acces-key",
"secret_access_key" : "aws secret",
},
"s3" : {
"bucket" : "bucket"
}
}
Note that the config is managed with ConfigEnv so you can provide an .ini file or overide the config with environement variable ( AWS_S3_BUCKET, AWS_ACCESS_KEY_ID and AWS_SECRET_ACCESS_KEY )
example data :
{
"input" : [
"firstFile.pdf",
"secondFile.pdf",
"http://urlFile.pdf"
],
"output": [
{
"s3Key": "output1.pdf",
"pages": [
{ "index": 0, "pages": [0,1] },
{ "index": 1, "pages": [0,1] }
]
},{
"s3Key": "output2.pdf",
"pages": [
{ "index": 0, "pages": [0] },
{ "index": 1, "pages": [0] },
{ "index": 0, "pages": [1] },
{ "index": 1, "pages": [1] }
]
}
]
}
this will produce 2 pdfs in your s3:
- the first, output1.pdf, with page 0 and 1 from firstFile and page 0 and 1 from secondFile
- the second, output2.pdf, with page 0 from firstFile, page 0 from secondFile, page 1 from firstFile and page 1 from secondFile
Developpement guide
installation
with virtualenv :
# create virtualenv
virtualenv -p python3 .venv
# activate venv
source .venv/bin/activate
# install dependancies
pip install -r requirements.txt
pip install -r requirements-dev.txt
testing
with unittest :
# if your test config is setup :
python -m unittest
# if you want to overide your test config :
S3_BUCKET=<your bucket> AWS_ACCESS_KEY_ID=<your key id> AWS_SECRET_ACCESS_KEY=<your key secret> python -m unittest
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
S3PdfSplitter-1.2.0.tar.gz
(3.0 kB
view details)
File details
Details for the file S3PdfSplitter-1.2.0.tar.gz
.
File metadata
- Download URL: S3PdfSplitter-1.2.0.tar.gz
- Upload date:
- Size: 3.0 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/1.15.0 pkginfo/1.5.0.1 requests/2.20.0 setuptools/47.3.0 requests-toolbelt/0.9.1 tqdm/4.46.1 CPython/3.5.6
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 67949c6ff2a73b1665076cf9a29e8e1d4d97b9f1c38f2c93f506729e0b9e02ba |
|
MD5 | d7b5b1c4f730020c786ee93f25d578c9 |
|
BLAKE2b-256 | cec5196f42ee7dc0f5d9fb6066284edfef754211c3b34e82c750cb8d397a7925 |