PDF file splitter for S3
Project description
S3PdfSplitter
Python AWS S3 PDF splitter
Usage
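Basic usage:
from PdfSplitter import Splitter
# load the AWS credentials and bucket name from the config file
splitter = Splitter("config.json")
# data describes the input files and the outputs to build (see the example data further down)
splitter.split(data)
Example config.json: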
{
"aws" : {
"access_key_id" : "aws-access-key",
"secret_access_key" : "aws secret"
},
"s3" : {
"bucket" : "bucket"
}
}
Note that the config is managed with ConfigEnv, so you can provide an .ini file instead, or override the config with environment variables (AWS_S3_BUCKET, AWS_ACCESS_KEY_ID and AWS_SECRET_ACCESS_KEY).
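The override order described in the note can be illustrated without ConfigEnv itself. The following is a minimal sketch and not ConfigEnv's actual implementation: the load_config helper and the ENV_OVERRIDES mapping are hypothetical, and only the environment variable names come from the note.

```python
import json
import os

# Hypothetical mapping from environment variable to (section, key) in the config.
# The variable names come from the note; the mapping itself is an assumption.
ENV_OVERRIDES = {
    "AWS_ACCESS_KEY_ID": ("aws", "access_key_id"),
    "AWS_SECRET_ACCESS_KEY": ("aws", "secret_access_key"),
    "AWS_S3_BUCKET": ("s3", "bucket"),
}


def load_config(json_text, environ=None):
    """Parse a JSON config and let environment variables override its values."""
    environ = os.environ if environ is None else environ
    config = json.loads(json_text)
    for var, (section, key) in ENV_OVERRIDES.items():
        if var in environ:
            # Create the section if the file did not define it, then override the key.
            config.setdefault(section, {})[key] = environ[var]
    return config


example = (
    '{"aws": {"access_key_id": "from-file", "secret_access_key": "from-file"},'
    ' "s3": {"bucket": "file-bucket"}}'
)
# Pass an explicit mapping instead of os.environ to keep the example reproducible.
config = load_config(example, environ={"AWS_S3_BUCKET": "env-bucket"})
print(config["s3"]["bucket"])
print(config["aws"]["access_key_id"])
```

In this sketch, values present in the environment win over the file, and anything not set in the environment keeps its value from the file.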
Example data:
{
"input" : [
"firstFile.pdf",
"secondFile.pdf"
],
"output": [
{
"s3Key": "output1.pdf",
"pages": [
{ "index": 0, "pages": [0,1] },
{ "index": 1, "pages": [0,1] }
]
},{
"s3Key": "output2.pdf",
"pages": [
{ "index": 0, "pages": [0] },
{ "index": 1, "pages": [0] },
{ "index": 0, "pages": [1] },
{ "index": 1, "pages": [1] }
]
}
]
}
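Each "index" refers to a position in the "input" list, and "pages" lists the pages to take from that file. This example produces two PDFs in your S3 bucket:
- the first, output1.pdf, contains pages 0 and 1 of firstFile.pdf followed by pages 0 and 1 of secondFile.pdf
- the second, output2.pdf, contains page 0 of firstFile.pdf, page 0 of secondFile.pdf, page 1 of firstFile.pdf and page 1 of secondFile.pdf, in that order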
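To make the page ordering concrete, here is a minimal sketch that expands the example data into an ordered list of (input file, page) pairs per output. The resolve_pages function is hypothetical and is not part of S3PdfSplitter; it only interprets the data structure as described here and does not touch S3 or any PDF.

```python
def resolve_pages(data):
    """Map each output s3Key to its ordered list of (input file, page) pairs."""
    inputs = data["input"]
    plan = {}
    for output in data["output"]:
        sequence = []
        for entry in output["pages"]:
            source = inputs[entry["index"]]  # "index" selects a file in "input"
            for page in entry["pages"]:  # page numbers as written in the data
                sequence.append((source, page))
        plan[output["s3Key"]] = sequence
    return plan


data = {
    "input": ["firstFile.pdf", "secondFile.pdf"],
    "output": [
        {
            "s3Key": "output1.pdf",
            "pages": [
                {"index": 0, "pages": [0, 1]},
                {"index": 1, "pages": [0, 1]},
            ],
        },
        {
            "s3Key": "output2.pdf",
            "pages": [
                {"index": 0, "pages": [0]},
                {"index": 1, "pages": [0]},
                {"index": 0, "pages": [1]},
                {"index": 1, "pages": [1]},
            ],
        },
    ],
}

plan = resolve_pages(data)
for key, sequence in plan.items():
    print(key, sequence)
```

The order of the entries in "pages" determines the page order of the output, which is why output2.pdf interleaves the two input files.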
Development guide
Installation
With virtualenv:
# create virtualenv
virtualenv -p python3 .venv
# activate venv
source .venv/bin/activate
# install dependencies
pip install -r requirements.txt
pip install -r requirements-dev.txt
Testing
With unittest:
# if your test config is set up:
python -m unittest
# if you want to override your test config:
S3_BUCKET=<your bucket> AWS_ACCESS_KEY_ID=<your key id> AWS_SECRET_ACCESS_KEY=<your key secret> python -m unittest