A 100% standalone Pdf Python API for Processing Pdf Files
Project description
Comprehensive PDF Manipulation Class Library for Python
Product Page | Documentation | Examples | Forum | Temporary License | Customized Demo
Spire.PDF for Python is a robust and versatile PDF manipulation library designed to simplify PDF reading, creation, editing, conversion, and management tasks within Python applications. Developed by E-iceblue, this reliable package offers a complete suite of tools enabling developers to effortlessly build solutions for working with PDF documents in Python programs.
This Python library empowers users to streamline document workflows by providing a rich set of APIs that facilitate seamless integration with enterprise-level PDF processing needs. It supports a wide array of advanced features without the need for any external dependencies, ensuring compatibility and performance consistency.
Core Features & Functionality
Extensive PDF Version Support
- PDF 1.2 to PDF 1.7
- PDF/A - 1b
- PDF/x1a
- PDF/A1
- PDF/A2
- PDF/A3
- PDF/UA
Flexible PDF Document Conversions
- Convert PDF to images (PNG, JPG, BMP, etc.) for wide distribution
- Convert PDF to Word documents for easy content editing
- Convert PDF to Excel workbooks for convenient data processing
- Convert PDF to SVG and many other formats effortlessly
Solid PDF Security Measures
- Protect PDF documents with passwords for secure sharing
- Set PDF security permissions to control access and usage
- Sign PDF documents with digital signature for certified trust
Efficient PDF Construction & Manipulation
- Create new PDF documents with customizable attributes and layouts
- Merge multiple PDFs into one PDF or split a PDF into separate PDFs
Effortless PDF Content Extraction
- Extract text from PDFs with simple Python code
- Extract images from PDFs for further editing and utilization
Code Examples
Create PDF Documents
from spire.pdf.common import *
from spire.pdf import *
outputFile = "HelloWorld.pdf"
# Create a pdf document
doc= PdfDocument()
# Create one page
page = doc.Pages.Add()
s = "Hello, World"
x = 10.0
y = 10.0
font = PdfFont(PdfFontFamily.Helvetica ,30.0)
color = PdfRGBColor(Color.get_Black())
textBrush = PdfSolidBrush(color)
# Draw the text
page.Canvas.DrawString(s, font, textBrush, x, y)
# Save the document
doc.SaveToFile(outputFile)
doc.Close()
Convert PDFs to Word Documents
from spire.pdf.common import *
from spire.pdf import *
outputFile = "ToDocx.docx"
inputFile = "./Demos/Data/ToDocx.pdf"
# Load a pdf document
inputfile = inputFile
doc = PdfDocument()
doc.LoadFromFile(inputfile)
# Convert to doc file.
doc.SaveToFile(outputFile, FileFormat.DOCX)
doc.Close()
Encrypt PDF Documents
from spire.pdf.common import *
from spire.pdf import *
# Create a PdfDocument object
doc = PdfDocument()
# Load a sample PDF file
doc.LoadFromFile("input.pdf")
# Encrypt the PDF file with an open password and a permission password
doc.Security.Encrypt("openPsd", "permissionPsd", PdfPermissionsFlags.FillFields, PdfEncryptionKeySize.Key128Bit)
# Save the result file
doc.SaveToFile("output/Encrypted.pdf", FileFormat.PDF)
Extract Text from PDF Documents
from spire.pdf.common import *
from spire.pdf import *
def WriteAllText(fname:str,text:List[str]):
fp = open(fname,"w")
for s in text:
fp.write(s)
fp.close()
inputFile = "./Demos/Data/PDFTemplate-Az.pdf"
outputFile = "ExtractTextFromParticularPage_out.txt"
doc = PdfDocument()
# Read a pdf file
doc.LoadFromFile(inputFile)
# Get the first page
page = doc.Pages[0]
# Extract text from page keeping white space
text = page.ExtractText(True)
# Write a line of text to the file
WriteAllText(outputFile, text)
doc.Close()
Product Page | Documentation | Examples | Forum | Temporary License | Customized Demo
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distributions
Built Distributions
Hashes for Spire.Pdf-10.8.1-py3-none-win_amd64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 7b711f6764e808f8e8d2a77754778a4a4057e6b823a97643543266bec092684e |
|
MD5 | 74face99cdc409f341cd65e2665a7baa |
|
BLAKE2b-256 | ccd145539e59722b46b53ea872e1c3b00ba17896a807c15ab23d88beb988c840 |
Hashes for Spire.Pdf-10.8.1-py3-none-manylinux1_x86_64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | f52cd21c6fc08ace3a9662a169d0bfc2b1e9a5c4419c2784d2d1122fe5be92fc |
|
MD5 | fca6f3b9440e784c8578b48eac94422c |
|
BLAKE2b-256 | 7c3b4c77c51a8c664cc76cfa9792b5bb237cd4400481d27cb17b5c37b6a5eab5 |