Skip to main content

Split a CoreHR PDF application pack into multiple individual PDF applications.

Project description

CoreHR Application Pack PDF Splitter

CI Python License PyPI version

A Python package to extract individual applications from a combined PDF file, such as for Oxford HR application packs.

Installation and Usage

Option 1: Using uvx (Recommended)

If you have uv installed, you can run the tool directly without installing it:

uvx corehr-pdf-split --input-pdf applicationspack.pdf --output-dir output

Option 2: Install from PyPI

Install the package globally or in a virtual environment:

pip install corehr-pdf-split

Then run:

corehr-pdf-split --input-pdf applicationspack.pdf --output-dir output

Option 3: Using uv (for development or local use)

uv tool install corehr-pdf-split

Then run:

corehr-pdf-split --input-pdf applicationspack.pdf --output-dir output

How it works

The tool processes the input PDF file and saves individual applications in the specified output directory. The output folder will be created if it does not exist yet. Each applicant's PDF is saved with a filename format: LastName,FirstName [ApplicantID].pdf.

Example

uvx corehr-pdf-split --input-pdf applicationspack.pdf --output-dir output

This will process the applicationspack.pdf file and save individual applications in the output directory.

Development

If you want to contribute to or modify this project:

Prerequisites

  • uv for dependency management

Setup

  1. Clone this repository:

    git clone https://github.com/synthetic-society/corehr-pdf-split.git
    cd corehr-pdf-split
    
  2. Install dependencies:

    uv sync
    
  3. Run the tool in development mode:

    uv run corehr-pdf-split --input-pdf <path_to_input_pdf> --output-dir <path_to_output_directory>
    

Building and Publishing

To build the package:

uv build

To publish to PyPI (maintainers only):

uv publish

License

This project is available under the MIT License.

Contributing

Contributions, issues, and feature requests are welcome.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

corehr_pdf_split-1.0.3.tar.gz (30.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

corehr_pdf_split-1.0.3-py3-none-any.whl (5.1 kB view details)

Uploaded Python 3

File details

Details for the file corehr_pdf_split-1.0.3.tar.gz.

File metadata

  • Download URL: corehr_pdf_split-1.0.3.tar.gz
  • Upload date:
  • Size: 30.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.7.13

File hashes

Hashes for corehr_pdf_split-1.0.3.tar.gz
Algorithm Hash digest
SHA256 7aaca848ebd0d521ac8a0d32a85fb2740973058418569fc9e021863888452168
MD5 f0d3e22ca976c7c87e58fd478f00fb09
BLAKE2b-256 e64e81ecb6812395fe891e93da408eb4cdc67faddda6de93f1a2aa921b2b544d

See more details on using hashes here.

File details

Details for the file corehr_pdf_split-1.0.3-py3-none-any.whl.

File metadata

File hashes

Hashes for corehr_pdf_split-1.0.3-py3-none-any.whl
Algorithm Hash digest
SHA256 09b952bb73e3bea9689e50eb5fb10023f7b664fe0afee70b120929d4d1420c28
MD5 79161f4252a889b91fd8ef3a7195bb7f
BLAKE2b-256 54c49d106f5cbd35e292cbf21321b46fd542a53de6d6bfa8a1e7d0258c897cbd

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page