Splitter de fichiers pdfs sur S3
Project description
S3PdfSplitter
Python aws-s3 pdf spliter
Usage
basic usage :
from PdfSplitter import Splitter
spliter = Splitter("config.json")
spliter.split(data)
exemple config.json :
{
"aws" : {
"access_key_id" : "aws-acces-key",
"secret_access_key" : "aws secret",
"s3" : {
"bucket" : "bucket"
}
}
}
Note that the config is managed with ConfigEnv so you can provide an .ini file or overide the config with environement variable ( AWS_S3_BUCKET, AWS_ACCESS_KEY_ID and AWS_SECRET_ACCESS_KEY )
example data :
{
"input" : [
"firstFile.pdf",
"secondFile.pdf"
],
"output": [
{
"s3Key": "output1.pdf",
"pages": [
{ "index": 0, "pages": [0,1] },
{ "index": 1, "pages": [0,1] }
]
},{
"s3Key": "output2.pdf",
"pages": [
{ "index": 0, "pages": [0] },
{ "index": 1, "pages": [0] },
{ "index": 0, "pages": [1] },
{ "index": 1, "pages": [1] }
]
}
]
}
this will produce 2 pdfs in your s3:
- the first, output1.pdf, with page 0 and 1 from firstFile and page 0 and 1 from secondFile
- the second, output2.pdf, with page 0 from firstFile, page 0 from secondFile, page 1 from firstFile and page 1 from secondFile
Developpement guide
installation
with virtualenv :
# create virtualenv
virtualenv -p python3 .venv
# activate venv
source .venv/bin/activate
# install dependancies
pip install -r requirements.txt
pip install -r requirements-dev.txt
testing
with unittest :
# if your test config is setup :
python -m unittest
# if you want to overide your test config :
AWS_S3_BUCKET=<your bucket> AWS_ACCESS_KEY_ID=<your key id> AWS_SECRET_ACCESS_KEY=<your key secret> python -m unittest
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
S3PdfSplitter-1.0.4.tar.gz
(2.9 kB
view hashes)
Built Distribution
Close
Hashes for S3PdfSplitter-1.0.4-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 8f99decbc69c05f5fc4a2f6894d8fba02f13befc800cbb3005a93122e2927e61 |
|
MD5 | 33694cae6e44193c39bbab66a4c19931 |
|
BLAKE2b-256 | 8b3d4d7f26d40d07183ab65f5ea8eab0b8284bc3ba7507642d6d8b6c3f031656 |