Skip to main content

A Command Line tool for parsing PDFs.

Project description

SaralGyaan PDF Parser

SaralGyaan PDF Parser is a command-line PDF parsing tool which allows you to:-

  1. Delete Pages from a PDF
  2. Merge PDFs
  3. Sort Pages of a PDF
  4. Split a PDF

Installation

You can install SaralGyaan PDF Parser from PyPI: pip install sg-pdfparser The PDF Parser supports Python 3.6 and above.

How to use?

The SaralGyaan PDF Parser is a command line application, named pdfparser. To start it you can simply open the terminal, go to the folder containing the PDF file(s) to be parsed and call the program:-

$ pdfparser
Welcome to PDF Parser

What do you want to do?

1. Delete Pages from a PDF
2. Merge PDFs
3. Sort Pages of a PDF
4. Split a PDF

Press ctrl + C to exit.
Enter your choice (1-4):

Delete Pages from a PDF

If you select the option to delete pages, it will ask for the filename followed by '.pdf'

Enter your choice (1-4): 1
Delete Pages from a PDF
Enter the name of file with extension(.pdf)

Make sure the pdf file exist at the location and once you provide a valid pdf file, it will give you two options:-

What do you want to do?
 1. Delete specific pages
 2. Keep Specific pages

Both the options accept comma separated values of pages or page ranges or both e.g. 1, 2, 3-5 or 1-2 or 2-3, 4-6. One, thing you need to know that if you use Keep specific pages and change the order like 1-2, 6-4, then it will re-arrange the pages too.

Merge PDFs

This options accepts comma separated file names and it will merge the files in the order, it is provided as an input.

Sort pages of a PDF

Sort pages, give you three options

What do you want to do?
1. Reverse order of all the pages
2. Swap Pages
3. Move certain pages to a specific index
  1. The first one, will simply reverse the order of the pages.
  2. The second one will swap two pages, you can input multiple or single swaps e.g. 1-3, 2-7, 8-9, etc.
  3. The third one will move the page to a certain page number (not index). It also accepts comma separated values. So 21-2 will move page number 21 to 2 and hence shifting the rest of the page to right.

Split a PDF

This gives you two options:-

What do you want to do?
 1. Split all the pages
 2. Split specific pages
  1. The first option will make n splits for n-paged PDF file.
  2. The second one will split the pdf into the ranges or pages as provided. e.g. 1, 3, 9-22, will give three split files first page, third page and pages from nine to twenty two.

Video Usage Guide

You can also check out our Usage Guide on Youtube.

License

© 2020 Udit Vashisht

This repository is licensed under the MIT license. See LICENSE for details.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

sg-pdfparser-1.0.2.tar.gz (10.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

sg_pdfparser-1.0.2-py3-none-any.whl (11.9 kB view details)

Uploaded Python 3

File details

Details for the file sg-pdfparser-1.0.2.tar.gz.

File metadata

  • Download URL: sg-pdfparser-1.0.2.tar.gz
  • Upload date:
  • Size: 10.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.23.0 setuptools/47.1.1 requests-toolbelt/0.9.1 tqdm/4.46.0 CPython/3.8.2

File hashes

Hashes for sg-pdfparser-1.0.2.tar.gz
Algorithm Hash digest
SHA256 c2b12d4a599095c8b53b3f360f8c8173f6c38362b5ed6af77e441faf3f379479
MD5 7b2c870a16b230c3ff9812a5877b4aed
BLAKE2b-256 3613d2b877b0193237f20ca033ad03c3d64e8c075b66928afc3b58703c12d48f

See more details on using hashes here.

File details

Details for the file sg_pdfparser-1.0.2-py3-none-any.whl.

File metadata

  • Download URL: sg_pdfparser-1.0.2-py3-none-any.whl
  • Upload date:
  • Size: 11.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.23.0 setuptools/47.1.1 requests-toolbelt/0.9.1 tqdm/4.46.0 CPython/3.8.2

File hashes

Hashes for sg_pdfparser-1.0.2-py3-none-any.whl
Algorithm Hash digest
SHA256 f560d65b42cb87cf1e5532bddaa9e5668c0981ce9fd860f7cb810205b5b7fadc
MD5 46576d9af564291734732b5b83c331af
BLAKE2b-256 48167a9ae85783b8802ee1e4f7295e160a92dff3b123170876f5ef7172b1de09

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page