A 100% Standalone Python API for Processing PDF Files
Project description
Comprehensive PDF Manipulation Class Library for Python
Product Page | Documentation | Examples | Forum | Temporary License | Customized Demo
Spire.PDF for Python is a robust and versatile PDF manipulation library designed to simplify PDF reading, creation, editing, conversion, and management tasks within Python applications. Developed by E-iceblue, this reliable package offers a complete suite of tools enabling developers to effortlessly build solutions for working with PDF documents in Python programs.
The library provides a rich set of APIs for integrating enterprise-level PDF processing into document workflows. It supports a wide array of advanced features without any external dependencies, which helps keep compatibility and performance consistent.
Core Features & Functionality
Extensive PDF Version Support
- PDF 1.2 to PDF 1.7
- PDF/A-1b
- PDF/X-1a
- PDF/A-1
- PDF/A-2
- PDF/A-3
- PDF/UA
Flexible PDF Document Conversions
- Convert PDF to images (PNG, JPG, BMP, etc.) for wide distribution
- Convert PDF to Word documents for easy content editing
- Convert PDF to Excel workbooks for convenient data processing
- Convert PDF to SVG and many other formats effortlessly
Solid PDF Security Measures
- Protect PDF documents with passwords for secure sharing
- Set PDF security permissions to control access and usage
- Sign PDF documents with digital signatures for certified trust
Efficient PDF Construction & Manipulation
- Create new PDF documents with customizable attributes and layouts
- Merge multiple PDFs into one PDF or split a PDF into separate PDFs
Effortless PDF Content Extraction
- Extract text from PDFs with simple Python code
- Extract images from PDFs for further editing and utilization
Code Examples
Create PDF Documents
from spire.pdf.common import *
from spire.pdf import *
outputFile = "HelloWorld.pdf"
# Create a PDF document
doc = PdfDocument()
# Add one page
page = doc.Pages.Add()
s = "Hello, World"
x = 10.0
y = 10.0
font = PdfFont(PdfFontFamily.Helvetica, 30.0)
color = PdfRGBColor(Color.get_Black())
textBrush = PdfSolidBrush(color)
# Draw the text
page.Canvas.DrawString(s, font, textBrush, x, y)
# Save the document
doc.SaveToFile(outputFile)
doc.Close()
Convert PDFs to Word Documents
from spire.pdf.common import *
from spire.pdf import *
outputFile = "ToDocx.docx"
inputFile = "./Demos/Data/ToDocx.pdf"
# Load a PDF document
doc = PdfDocument()
doc.LoadFromFile(inputFile)
# Convert to a DOCX file
doc.SaveToFile(outputFile, FileFormat.DOCX)
doc.Close()
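Convert PDF Pages to Images
The feature list above also mentions PDF-to-image conversion. The following is a minimal sketch rather than a definitive implementation. It assumes that `PdfDocument.SaveAsImage(pageIndex)` returns a stream object that can be used in a `with` block and exposes `Save(fileName)`. The helper names `page_image_name` and `pdf_to_png`, the file naming scheme, and the paths are hypothetical. The Spire import sits inside the function so the naming helper can be used without the package installed.

```python
import os


def page_image_name(output_dir: str, page_index: int) -> str:
    """Build a zero-padded PNG path for a page (hypothetical naming scheme)."""
    return os.path.join(output_dir, "page-{0:03d}.png".format(page_index + 1))


def pdf_to_png(input_file: str, output_dir: str) -> list:
    """Render every page of input_file to a PNG and return the paths written."""
    # Imported here so this module loads even where Spire.PDF is not installed.
    from spire.pdf import PdfDocument

    os.makedirs(output_dir, exist_ok=True)
    doc = PdfDocument()
    doc.LoadFromFile(input_file)
    written = []
    try:
        for i in range(doc.Pages.Count):
            target = page_image_name(output_dir, i)
            # Assumption: SaveAsImage(i) yields a stream exposing Save(path).
            with doc.SaveAsImage(i) as image_stream:
                image_stream.Save(target)
            written.append(target)
    finally:
        # Release the document even if rendering a page fails.
        doc.Close()
    return written
```

A call such as `pdf_to_png("input.pdf", "images")` would then be expected to write one PNG per page into the `images` directory.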
Encrypt PDF Documents
from spire.pdf.common import *
from spire.pdf import *
# Create a PdfDocument object
doc = PdfDocument()
# Load a sample PDF file
doc.LoadFromFile("input.pdf")
# Encrypt the PDF file with an open password and a permission password
doc.Security.Encrypt("openPsd", "permissionPsd", PdfPermissionsFlags.FillFields, PdfEncryptionKeySize.Key128Bit)
# Save the result file
doc.SaveToFile("output/Encrypted.pdf", FileFormat.PDF)
doc.Close()
Extract Text from PDF Documents
from spire.pdf.common import *
from spire.pdf import *
def WriteAllText(fname: str, text: List[str]):
    # Write each string in text to fname; the file is closed even on error
    with open(fname, "w") as fp:
        for s in text:
            fp.write(s)
inputFile = "./Demos/Data/PDFTemplate-Az.pdf"
outputFile = "ExtractTextFromParticularPage_out.txt"
doc = PdfDocument()
# Read a pdf file
doc.LoadFromFile(inputFile)
# Get the first page
page = doc.Pages[0]
# Extract text from page keeping white space
text = page.ExtractText(True)
# Write the extracted text to the file
WriteAllText(outputFile, text)
doc.Close()
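Merge PDF Documents
Merging is listed among the features but has no example here. The sketch below is one possible way to do it, not the vendor's reference code. It assumes that `PdfDocument.MergeFiles` accepts a list of file paths and returns a document object offering `Save(path)` and `Close()`. The function name `merge_pdfs` and all paths are hypothetical. Input paths are checked first so that a missing file is reported before the library is invoked.

```python
import os


def merge_pdfs(input_files: list, output_file: str) -> None:
    """Merge input_files, in the given order, into output_file."""
    # Fail early with a clear message if any input is missing.
    missing = [p for p in input_files if not os.path.isfile(p)]
    if missing:
        raise FileNotFoundError("Missing input PDFs: " + ", ".join(missing))

    # Imported here so the validation above works without Spire.PDF installed.
    from spire.pdf import PdfDocument

    # Assumption: MergeFiles takes a list of paths and returns a document
    # object that supports Save(path) and Close().
    merged = PdfDocument.MergeFiles(input_files)
    try:
        merged.Save(output_file)
    finally:
        merged.Close()
```

For example, `merge_pdfs(["part1.pdf", "part2.pdf"], "combined.pdf")` would be expected to produce a single file containing the pages of both inputs.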
Download files
Built Distributions
Hashes for Spire.Pdf-10.6.1-py3-none-win_amd64.whl
Algorithm | Hash digest
---|---
SHA256 | 2b40d81b697ccd5ea5f95adb20f13e2f42a3cbf57809c37daa62cb94b1a289a9
MD5 | 0d4ebb19afb6de5651abb1da96cba169
BLAKE2b-256 | ce0ea4af4e3d97e163505b39f22d558712b8297fb1035f203ea7b8f84e0726e8
Hashes for Spire.Pdf-10.6.1-py3-none-manylinux1_x86_64.whl
Algorithm | Hash digest
---|---
SHA256 | d8e9929735abf5832e9459072417b0939ffa569948eb9c14cdf34f5773b53bc0
MD5 | 2c67973537a5e895e4df0025b904893f
BLAKE2b-256 | 9fa2979ac0515cfb9f6d38364d1758e61396167f5c658fc8cd2aea89b527a9df
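Verify a Downloaded Wheel
The published digests can be used to confirm that a downloaded wheel is intact. The sketch below uses only Python's standard `hashlib` module. The function names `sha256_of_file` and `matches_published_hash` are hypothetical helpers, and the local file path is whatever location the wheel was saved to.

```python
import hashlib


def sha256_of_file(path: str, chunk_size: int = 1 << 20) -> str:
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        # Read fixed-size chunks until read() returns empty bytes at EOF.
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_hash(path: str, expected_hex: str) -> bool:
    """Compare a file's SHA256 digest with a published hex digest."""
    return sha256_of_file(path) == expected_hex.strip().lower()
```

For the Windows wheel, the check would be `matches_published_hash("Spire.Pdf-10.6.1-py3-none-win_amd64.whl", "2b40d81b697ccd5ea5f95adb20f13e2f42a3cbf57809c37daa62cb94b1a289a9")`, which returns `True` only if the local file matches the SHA256 value listed above.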