
Split a CoreHR PDF application pack into multiple individual PDF applications.


CoreHR Application Pack PDF Splitter


A Python package that extracts individual applications from a combined PDF file, such as an Oxford HR application pack.

Installation and Usage

Option 1: Using uvx (Recommended)

If you have uv installed, you can run the tool directly without installing it:

uvx corehr-pdf-split --input-pdf applicationspack.pdf --output-dir output

Option 2: Install from PyPI

Install the package globally or in a virtual environment:

pip install corehr-pdf-split

Then run:

corehr-pdf-split --input-pdf applicationspack.pdf --output-dir output

Option 3: Install as a uv tool (for local use)

uv tool install corehr-pdf-split

Then run:

corehr-pdf-split --input-pdf applicationspack.pdf --output-dir output

How it works

The tool reads the input PDF file and writes each individual application as a separate PDF in the specified output directory. The output directory is created if it does not already exist. Each applicant's PDF is saved using the filename format LastName,FirstName [ApplicantID].pdf.
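The naming scheme above can be illustrated with a small Python sketch. This is not the tool's own code: the function names, the regular expression, and the sample applicant are hypothetical, and the sketch assumes that last names contain no commas and that applicant IDs contain no closing brackets.

```python
import re
from pathlib import Path

# Hypothetical pattern for "LastName,FirstName [ApplicantID].pdf".
# Assumes no comma in the last name and no "]" in the applicant ID.
_FILENAME_RE = re.compile(
    r"^(?P<last>[^,]+),(?P<first>.+) \[(?P<id>[^\]]+)\]\.pdf$"
)


def application_filename(last_name: str, first_name: str, applicant_id: str) -> str:
    """Build a filename in the documented format."""
    return f"{last_name},{first_name} [{applicant_id}].pdf"


def parse_application_filename(filename: str):
    """Split a filename back into (last, first, id), or return None if it does not match."""
    match = _FILENAME_RE.match(filename)
    if match is None:
        return None
    return match["last"], match["first"], match["id"]


def output_path(output_dir, last_name: str, first_name: str, applicant_id: str) -> Path:
    """Create the output directory if needed and return the target path for one applicant."""
    directory = Path(output_dir)
    directory.mkdir(parents=True, exist_ok=True)
    return directory / application_filename(last_name, first_name, applicant_id)
```

For a made-up applicant, `application_filename("Doe", "Jane", "12345")` gives `Doe,Jane [12345].pdf`, and `parse_application_filename` recovers the three parts from that name, which can be handy when post-processing the output directory.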

Example

uvx corehr-pdf-split --input-pdf applicationspack.pdf --output-dir output

This processes applicationspack.pdf and saves one PDF per application in the output directory.
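If the tool needs to be called from a script, the same command can be wrapped in Python. The following is a minimal sketch, assuming `corehr-pdf-split` (or `uvx`) is available on the PATH. The helper names `build_command` and `split_pack` are hypothetical, and only the `--input-pdf` and `--output-dir` options shown above are used.

```python
import subprocess
from pathlib import Path


def build_command(input_pdf, output_dir, use_uvx: bool = False) -> list:
    """Assemble the command line shown in the example above."""
    command = ["corehr-pdf-split", "--input-pdf", str(input_pdf), "--output-dir", str(output_dir)]
    # With use_uvx=True the tool is run through uvx instead of an installed entry point.
    return ["uvx", *command] if use_uvx else command


def split_pack(input_pdf, output_dir, use_uvx: bool = False) -> list:
    """Run the CLI and return the PDF files found in the output directory afterwards."""
    # check=True raises CalledProcessError if the command exits with a non-zero status.
    subprocess.run(build_command(input_pdf, output_dir, use_uvx), check=True)
    return sorted(Path(output_dir).glob("*.pdf"))
```

Calling `split_pack("applicationspack.pdf", "output", use_uvx=True)` would run the example command and return the resulting paths. Note that the returned list includes any PDFs that were already in the output directory before the run.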

Development

If you want to contribute to or modify this project:

Prerequisites

  • uv for dependency management

Setup

  1. Clone this repository:

    git clone https://github.com/synthetic-society/corehr-pdf-split.git
    cd corehr-pdf-split
    
  2. Install dependencies:

    uv sync
    
  3. Run the tool in development mode:

    uv run corehr-pdf-split --input-pdf <path_to_input_pdf> --output-dir <path_to_output_directory>
    

Building and Publishing

To build the package:

uv build

To publish to PyPI (maintainers only):

uv publish

License

This project is available under the MIT License.

Contributing

Contributions, issues, and feature requests are welcome.
