Split a CoreHR PDF application pack into multiple individual PDF applications.

CoreHR Application Pack PDF Splitter

A Python package that extracts individual applications from a combined PDF file, such as an Oxford HR application pack.

Installation and Usage

Option 1: Using uvx (Recommended)

If you have uv installed, you can run the tool directly without installing it:

uvx corehr-pdf-split --input-pdf applicationspack.pdf --output-dir output

Option 2: Install from PyPI

Install the package globally or in a virtual environment:

pip install corehr-pdf-split

Then run:

corehr-pdf-split --input-pdf applicationspack.pdf --output-dir output

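Option 3: Install with uv (persistent local install)

Install the tool into an isolated environment managed by uv, so the command stays available between runs: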

uv tool install corehr-pdf-split

Then run:

corehr-pdf-split --input-pdf applicationspack.pdf --output-dir output

How it works

The tool processes the input PDF file and saves each application as a separate PDF in the specified output directory. The output directory is created if it does not already exist. Each applicant's PDF is saved with a filename in the format LastName,FirstName [ApplicantID].pdf.
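The naming convention above can be sketched in Python, which may help when post-processing the output files. This is a minimal illustration, not the package's own implementation: the helper names (applicant_filename, parse_applicant_filename, output_path) and the sample values are hypothetical, and the real tool may treat unusual characters in names differently.

```python
from __future__ import annotations

import re
from pathlib import Path

# Pattern mirroring the documented format: LastName,FirstName [ApplicantID].pdf
FILENAME_RE = re.compile(
    r"^(?P<last>[^,]+),(?P<first>.+) \[(?P<applicant_id>[^\]]+)\]\.pdf$"
)


def applicant_filename(last_name: str, first_name: str, applicant_id: str) -> str:
    """Build a filename in the documented format (hypothetical helper)."""
    return f"{last_name},{first_name} [{applicant_id}].pdf"


def parse_applicant_filename(filename: str) -> dict[str, str] | None:
    """Recover the name parts and ID from a filename, or None if it does not match."""
    match = FILENAME_RE.match(filename)
    return match.groupdict() if match else None


def output_path(output_dir: str, filename: str) -> Path:
    """Create the output directory if it is missing and return the target path."""
    directory = Path(output_dir)
    directory.mkdir(parents=True, exist_ok=True)
    return directory / filename
```

For example, parse_applicant_filename can be applied to each name returned by Path("output").iterdir() to build an index of applicants from an already split pack.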

Example

uvx corehr-pdf-split --input-pdf applicationspack.pdf --output-dir output

This processes applicationspack.pdf and writes one PDF per applicant to the output directory.
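When several packs need splitting, the same command can be driven from a short Python script. The sketch below assumes corehr-pdf-split is already installed and on the PATH (Option 2 or 3); the function names build_command and split_packs, and the choice of one output subdirectory per pack, are illustrative assumptions rather than features of the tool. Only the --input-pdf and --output-dir flags shown above are used.

```python
from __future__ import annotations

import subprocess
from pathlib import Path


def build_command(input_pdf: str, output_dir: str) -> list[str]:
    """Assemble the documented CLI invocation as an argument list."""
    return [
        "corehr-pdf-split",
        "--input-pdf", str(input_pdf),
        "--output-dir", str(output_dir),
    ]


def split_packs(input_pdfs: list[str], output_root: str) -> None:
    """Run the CLI once per pack, writing each pack to its own subdirectory."""
    for pdf in input_pdfs:
        # Assumption: name the subdirectory after the pack file (without .pdf).
        target = Path(output_root) / Path(pdf).stem
        # check=True raises CalledProcessError if the tool exits with an error.
        subprocess.run(build_command(pdf, str(target)), check=True)
```

Passing the arguments as a list avoids shell quoting problems with file names that contain spaces.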

Development

If you want to contribute to or modify this project:

Prerequisites

  • uv for dependency management

Setup

  1. Clone this repository:

    git clone https://github.com/synthetic-society/corehr-pdf-split.git
    cd corehr-pdf-split
    
  2. Install dependencies:

    uv sync
    
  3. Run the tool in development mode:

    uv run corehr-pdf-split --input-pdf <path_to_input_pdf> --output-dir <path_to_output_directory>
    

Building and Publishing

To build the package:

uv build

To publish to PyPI (maintainers only):

uv publish

License

This project is available under the MIT License.

Contributing

Contributions, issues, and feature requests are welcome.
