The fastest Python PDF library: 0.8ms mean, 5× faster than PyMuPDF. Text extraction, markdown conversion, PDF creation. 100% pass rate on 3,830 PDFs.

These details have not been verified by PyPI

Project links

Project description

PDFOxide - The Fastest PDF Toolkit for 20 Languages — Python, Rust, Go, JS/TS, C#, Java, Kotlin, Swift, C++ & more, plus CLI & AI

New in v0.3.69 — eleven new language bindings. PDFOxide now ships idiomatic bindings for C++, Swift, Kotlin, Dart, R, Julia, Zig, Scala, Clojure, Objective-C, and Elixir, each built over the stable C ABI with its own CI workflow, api-coverage tests, and runnable examples. That brings the toolkit to 20 languages (Rust core + 19 bindings). Want another language? Open an issue and tell us.

The fastest PDF library for text extraction, image extraction, and markdown conversion. A Rust core with bindings for 19 languages — Python, Go, JavaScript / TypeScript, C# / .NET, Java, Kotlin, Scala, Clojure, Ruby, PHP, C++, Objective-C, Swift, Dart, R, Julia, Zig, Elixir, and WASM — plus a CLI tool and MCP server for AI assistants. 0.8ms mean per document, 5× faster than PyMuPDF, 15× faster than pypdf. 100% pass rate on 3,830 real-world PDFs. MIT licensed.

Quick Start

Python

from pdf_oxide import PdfDocument

with PdfDocument("paper.pdf") as doc:
    print(len(doc))                          # number of pages
    for page in doc:
        text = page.text                     # lazy property
        chars = page.chars                   # lazy property
        md = page.markdown(detect_headings=True)

# Direct page access by index
doc = PdfDocument("paper.pdf")
page = doc[0]
text = page.text

pip install pdf_oxide

Rust

use pdf_oxide::PdfDocument;

let mut doc = PdfDocument::open("paper.pdf")?;
let text = doc.extract_text(0)?;
let images = doc.extract_images(0)?;
let markdown = doc.to_markdown(0, Default::default())?;

[dependencies]
pdf_oxide = "0.3"

CLI

pdf-oxide text document.pdf
pdf-oxide markdown document.pdf -o output.md
pdf-oxide search document.pdf "pattern"
pdf-oxide merge a.pdf b.pdf -o combined.pdf

brew install yfedoseev/tap/pdf-oxide

MCP Server (for AI assistants)

# Install
brew install yfedoseev/tap/pdf-oxide   # includes pdf-oxide-mcp

# Configure in Claude Desktop / Claude Code / Cursor
{
  "mcpServers": {
    "pdf-oxide": { "command": "crgx", "args": ["pdf_oxide_mcp@latest"] }
  }
}

Why PDFOxide?

Fast — 0.8ms mean per document, 5× faster than PyMuPDF, 15× faster than pypdf, 29× faster than pdfplumber
Reliable — 100% pass rate on 3,830 test PDFs, zero panics, zero timeouts
Complete — Text extraction, image extraction, PDF creation, and editing in one library
Multi-platform — 20 languages (Rust core + 19 bindings: Python, Go, JS/TS, C#/.NET, Java, Kotlin, Scala, Clojure, Ruby, PHP, C++, Objective-C, Swift, Dart, R, Julia, Zig, Elixir, WASM), plus a CLI and MCP server for AI assistants
Permissive license — MIT / Apache-2.0 — use freely in commercial and open-source projects

Performance

Benchmarked on 3,830 PDFs from three independent public test suites (veraPDF, Mozilla pdf.js, DARPA SafeDocs). Text extraction libraries only (no OCR). Single-thread, 60s timeout, no warm-up.

Python Libraries

Library	Mean	p99	Pass Rate	License
PDFOxide	0.8ms	9ms	100%	MIT
PyMuPDF	4.6ms	28ms	99.3%	AGPL-3.0
pypdfium2	4.1ms	42ms	99.2%	Apache-2.0
pymupdf4llm	55.5ms	280ms	99.1%	AGPL-3.0
pdftext	7.3ms	82ms	99.0%	GPL-3.0
pdfminer	16.8ms	124ms	98.8%	MIT
pdfplumber	23.2ms	189ms	98.8%	MIT
markitdown	108.8ms	378ms	98.6%	MIT
pypdf	12.1ms	97ms	98.4%	BSD-3

Rust Libraries

Library	Mean	p99	Pass Rate	Text Extraction
PDFOxide	0.8ms	9ms	100%	Built-in
oxidize_pdf	13.5ms	11ms	99.1%	Basic
unpdf	2.8ms	10ms	95.1%	Basic
pdf_extract	4.08ms	37ms	91.5%	Basic
lopdf	0.3ms	2ms	80.2%	No built-in extraction

Text Quality

99.5% text parity vs PyMuPDF and pypdfium2 across the full corpus. PDFOxide extracts text from 7–10× more "hard" files than it misses vs any competitor.

Corpus

Suite	PDFs	Pass Rate
veraPDF (PDF/A compliance)	2,907	100%
Mozilla pdf.js	897	99.2%
SafeDocs (targeted edge cases)	26	100%
Total	3,830	100%

100% pass rate on all valid PDFs — the 7 non-passing files across the corpus are intentionally broken test fixtures (missing PDF header, fuzz-corrupted catalogs, invalid xref streams).

Features

Extract	Create	Edit
Text & Layout	Documents	Annotations
Images	Tables	Form Fields
Forms	Graphics	Bookmarks
Annotations	Templates	Links
Bookmarks	Images	Content

Python API

Page-oriented API

from pdf_oxide import PdfDocument

with PdfDocument("report.pdf") as doc:
    print(len(doc))          # page count
    print(doc.version())

    # Iterate or index pages
    for page in doc:
        text   = page.text                      # str, lazy
        chars  = page.chars                     # list[TextChar], lazy
        words  = page.words                     # list[Word], lazy
        lines  = page.lines                     # list[TextLine], lazy
        tables = page.tables                    # list[Table], lazy
        images = page.images                    # list[Image], lazy
        md     = page.markdown(detect_headings=True)
        html   = page.html()
        print(f"Page {page.index}: {page.width:.0f}×{page.height:.0f} pts")

    # Direct index access (supports negative indices)
    first = doc[0]
    last  = doc[-1]

Scoped extraction

# Extract from a region: (x, y, width, height) in PDF points
header = doc.within(0, (0, 700, 612, 92)).extract_text()
region = doc.within(0, (50, 400, 500, 200))
region_words  = region.extract_words()
region_images = region.extract_images()

Extraction profiles

from pdf_oxide import ExtractionProfile

# Pre-tuned profiles for different document types
words = doc.extract_words(0, profile=ExtractionProfile.form())
lines = doc.extract_text_lines(0, profile=ExtractionProfile.academic())

# Override adaptive thresholds (in PDF points)
words = doc.extract_words(0, word_gap_threshold=2.5)
lines = doc.extract_text_lines(0, word_gap_threshold=2.5, line_gap_threshold=4.0)
params = doc.page_layout_params(0)
print(f"word gap: {params.word_gap_threshold:.1f}")

Form Fields

# Extract form fields
fields = doc.get_form_fields()
for f in fields:
    print(f"{f.name} ({f.field_type}) = {f.value}")

# Fill and save
doc.set_form_field_value("employee_name", "Jane Doe")
doc.set_form_field_value("wages", "85000.00")
doc.save("filled.pdf")

Rust API

use pdf_oxide::PdfDocument;

fn main() -> Result<(), Box<dyn std::error::Error>> {
    let mut doc = PdfDocument::open("paper.pdf")?;

    // Extract text
    let text = doc.extract_text(0)?;

    // Character-level extraction
    let chars = doc.extract_chars(0)?;

    // Extract images
    let images = doc.extract_images(0)?;

    // Vector graphics
    let paths = doc.extract_paths(0)?;

    Ok(())
}

Form Fields (Rust)

use pdf_oxide::editor::{DocumentEditor, EditableDocument, SaveOptions};
use pdf_oxide::editor::form_fields::FormFieldValue;

let mut editor = DocumentEditor::open("w2.pdf")?;
editor.set_form_field_value("employee_name", FormFieldValue::Text("Jane Doe".into()))?;
editor.save_with_options("filled.pdf", SaveOptions::incremental())?;

Installation

Python

pip install pdf_oxide

Wheels available for Linux, macOS, and Windows. Python 3.8–3.14.

Rust

[dependencies]
pdf_oxide = "0.3"

JavaScript/WASM

npm install pdf-oxide-wasm

const { WasmPdfDocument } = require("pdf-oxide-wasm");

CLI

brew install yfedoseev/tap/pdf-oxide    # Homebrew (macOS/Linux)
cargo install pdf_oxide_cli             # Cargo
cargo binstall pdf_oxide_cli            # Pre-built binary via cargo-binstall

MCP Server

brew install yfedoseev/tap/pdf-oxide    # Included with CLI in Homebrew
cargo install pdf_oxide_mcp             # Cargo

Other languages

Established bindings:

Go — go get github.com/yfedoseev/pdf_oxide/go — see go/README.md
JavaScript / TypeScript (Node.js) — npm install pdf-oxide — see js/README.md
C# / .NET — dotnet add package PdfOxide — see csharp/README.md
Java (JDK 11+) — Maven coords fyi.oxide:pdf-oxide:0.3.70 — see java/README.md
Ruby — gem install pdf_oxide — see ruby/README.md
PHP — composer require oxide/pdf-oxide — see php/README.md

New in v0.3.69 (all over the stable C ABI):

C++ (header-only, CMake / Conan) — see cpp/README.md
Swift (SwiftPM) — see swift/README.md
Kotlin (fyi.oxide:pdf-oxide-kotlin:0.3.70) — see kotlin/README.md
Scala (fyi.oxide %% pdf-oxide-scala) — see scala/README.md
Clojure (fyi.oxide/pdf-oxide-clojure on Clojars) — see clojure/README.md
Dart / Flutter (dart pub add pdf_oxide) — see dart/README.md
R (install.packages("pdfoxide")) — see r/README.md
Julia (Pkg.add("PdfOxide")) — see julia/README.md
Zig (build.zig.zon) — see zig/README.md
Objective-C (CocoaPods) — see objc/README.md
Elixir ({:pdf_oxide, "~> 0.3.70"} on Hex) — see elixir/README.md

<!-- Java (Maven) -->
<dependency>
  <groupId>fyi.oxide</groupId>
  <artifactId>pdf-oxide</artifactId>
  <version>0.3.70</version>
</dependency>

// Kotlin (Gradle, Kotlin DSL)
implementation("fyi.oxide:pdf-oxide-kotlin:0.3.70")

Every binding shares the same Rust core, so a bug fix in one lands in all of them — everything you read in this README applies, just with each language's native naming conventions. Publishing details for each registry are in docs/RELEASING-bindings.md.

CLI

22 commands for PDF processing directly from your terminal:

pdf-oxide text report.pdf                      # Extract text
pdf-oxide markdown report.pdf -o report.md     # Convert to Markdown
pdf-oxide html report.pdf -o report.html       # Convert to HTML
pdf-oxide info report.pdf                      # Show metadata
pdf-oxide search report.pdf "neural.?network"  # Search (regex)
pdf-oxide images report.pdf -o ./images/       # Extract images
pdf-oxide merge a.pdf b.pdf -o combined.pdf    # Merge PDFs
pdf-oxide split report.pdf -o ./pages/         # Split into pages
pdf-oxide watermark doc.pdf "DRAFT"            # Add watermark
pdf-oxide forms w2.pdf --fill "name=Jane"      # Fill form fields

Run pdf-oxide with no arguments for interactive REPL mode. Use --pages 1-5 to process specific pages, --json for machine-readable output.

MCP Server

pdf-oxide-mcp lets AI assistants (Claude, Cursor, etc.) extract content from PDFs locally via the Model Context Protocol.

Add to your MCP client configuration:

{
  "mcpServers": {
    "pdf-oxide": { "command": "crgx", "args": ["pdf_oxide_mcp@latest"] }
  }
}

The server exposes an extract tool that supports text, markdown, and HTML output formats with optional page ranges and image extraction. All processing runs locally — no files leave your machine.

Building from Source

# Clone and build
git clone https://github.com/yfedoseev/pdf_oxide
cd pdf_oxide
cargo build --release

# Run tests
cargo test

# Build Python bindings
maturin develop

# Build the shared library for Go, JS/TS, and C# bindings
cargo build --release --lib
# Output: target/release/libpdf_oxide.{so,dylib} or pdf_oxide.dll

Documentation

Full Documentation — Complete documentation site
Getting Started (Rust) — Rust guide
Getting Started (Python) — Python guide
Getting Started (Go) — Go guide
Getting Started (JavaScript / TypeScript) — Node.js guide
Getting Started (C# / .NET) — .NET guide
Getting Started (WASM) — Browser and Node.js WASM guide
API Docs — Full Rust API reference
Performance Benchmarks — Full benchmark methodology and results

Use Cases

RAG / LLM pipelines — Convert PDFs to clean Markdown for retrieval-augmented generation with LangChain, LlamaIndex, or any framework
AI assistants — Give Claude, Cursor, or any MCP-compatible tool direct PDF access via the MCP server
Document processing at scale — Extract text, images, and metadata from thousands of PDFs in seconds
Data extraction — Pull structured data from forms, tables, and layouts
Academic research — Parse papers, extract citations, and process large corpora
PDF generation — Create invoices, reports, certificates, and templated documents programmatically
PyMuPDF alternative — MIT licensed, 5× faster, no AGPL restrictions

Why I built this

I needed PyMuPDF's speed without its AGPL license, and I needed it in more than one language. Nothing existed that ticked all three boxes — fast, MIT, multi-language — so I wrote it. The Rust core is what does the real work; the bindings for Python, Go, JS/TS, C#, and WASM are thin shells around the same code, so a bug fix in one lands in all of them. It now passes 100% of the veraPDF + Mozilla pdf.js + DARPA SafeDocs test corpora (3,830 PDFs) on every platform I've tested.

If it's useful to you, a star on GitHub genuinely helps. If something's broken or missing, open an issue — I read all of them.

— Yury

License

Dual-licensed under MIT or Apache-2.0 at your option. Unlike AGPL-licensed alternatives, PDFOxide can be used freely in any project — commercial or open-source — with no copyleft restrictions.

Contributing

We welcome contributions! See CONTRIBUTING.md for guidelines.

cargo build && cargo test && cargo fmt && cargo clippy -- -D warnings

Citation

@software{pdf_oxide,
  title = {PDFOxide: Fast Multi-Language PDF Toolkit (Rust core, 19 language bindings)},
  author = {Yury Fedoseev},
  year = {2025},
  url = {https://github.com/yfedoseev/pdf_oxide}
}

20 languages (Rust + Python + Go + JS/TS + C# + Java + Kotlin + Scala + Clojure + Ruby + PHP + C++ + Objective-C + Swift + Dart + R + Julia + Zig + Elixir + WASM) + CLI + MCP | MIT/Apache-2.0 | 100% pass rate on 3,830 PDFs | 0.8ms mean | 5× faster than the industry leaders

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

0.3.74

Jul 14, 2026

0.3.73

Jul 6, 2026

0.3.72

Jul 5, 2026

0.3.71

Jul 4, 2026

0.3.70

Jul 2, 2026

0.3.69

Jun 27, 2026

0.3.68

Jun 24, 2026

0.3.67

Jun 20, 2026

0.3.66

Jun 18, 2026

0.3.65

Jun 16, 2026

0.3.64

Jun 12, 2026

0.3.63

Jun 10, 2026

0.3.61

Jun 7, 2026

0.3.60

Jun 4, 2026

0.3.59

Jun 1, 2026

0.3.58

May 31, 2026

0.3.57

May 30, 2026

0.3.56

May 29, 2026

0.3.55

May 25, 2026

0.3.54

May 24, 2026

0.3.53

May 23, 2026

0.3.52

May 20, 2026

0.3.51

May 19, 2026

0.3.50

May 17, 2026

0.3.49

May 16, 2026

0.3.48

May 15, 2026

0.3.47

May 13, 2026

0.3.46

May 11, 2026

0.3.45

May 7, 2026

0.3.44

May 6, 2026

0.3.43

May 4, 2026

0.3.42

May 3, 2026

0.3.41

May 1, 2026

0.3.40

Apr 29, 2026

0.3.39

Apr 27, 2026

0.3.38

Apr 23, 2026

0.3.37

Apr 21, 2026

0.3.36

Apr 20, 2026

0.3.35

Apr 19, 2026

0.3.34

Apr 18, 2026

0.3.33

Apr 17, 2026

0.3.32

Apr 16, 2026

0.3.30

Apr 12, 2026

0.3.29

Apr 12, 2026

0.3.28

Apr 12, 2026

0.3.27

Apr 12, 2026

0.3.24

Apr 11, 2026

0.3.23

Apr 10, 2026

0.3.22

Apr 9, 2026

0.3.21

Apr 6, 2026

0.3.20

Apr 4, 2026

0.3.19

Apr 3, 2026

0.3.18

Apr 2, 2026

0.3.17

Mar 9, 2026

0.3.16

Mar 8, 2026

0.3.15

Mar 6, 2026

0.3.14

Mar 4, 2026

0.3.13

Mar 3, 2026

0.3.12

Mar 2, 2026

0.3.11

Mar 1, 2026

0.3.10

Feb 28, 2026

0.3.9

Feb 25, 2026

0.3.8

Feb 21, 2026

0.3.7

Feb 20, 2026

0.3.6

Feb 16, 2026

0.3.5

Feb 16, 2026

0.3.4

Feb 13, 2026

0.3.1

Jan 14, 2026

0.3.0

Jan 12, 2026

0.2.5

Jan 10, 2026

0.2.4

Jan 10, 2026

0.2.3

Jan 7, 2026

0.2.2

Dec 15, 2025

0.2.1

Dec 15, 2025

0.1.4

Dec 12, 2025

0.1.3

Dec 12, 2025

0.1.2

Nov 27, 2025

0.1.1

Nov 26, 2025

0.1.0

Nov 5, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pdf_oxide-0.3.74.tar.gz (6.7 MB view details)

Uploaded Jul 14, 2026 Source

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

pdf_oxide-0.3.74-cp38-abi3-win_arm64.whl (10.4 MB view details)

Uploaded Jul 14, 2026 CPython 3.8+Windows ARM64

pdf_oxide-0.3.74-cp38-abi3-win_amd64.whl (11.1 MB view details)

Uploaded Jul 14, 2026 CPython 3.8+Windows x86-64

pdf_oxide-0.3.74-cp38-abi3-musllinux_1_2_x86_64.whl (11.2 MB view details)

Uploaded Jul 14, 2026 CPython 3.8+musllinux: musl 1.2+ x86-64

pdf_oxide-0.3.74-cp38-abi3-musllinux_1_2_aarch64.whl (10.6 MB view details)

Uploaded Jul 14, 2026 CPython 3.8+musllinux: musl 1.2+ ARM64

pdf_oxide-0.3.74-cp38-abi3-manylinux_2_28_x86_64.whl (11.0 MB view details)

Uploaded Jul 14, 2026 CPython 3.8+manylinux: glibc 2.28+ x86-64

pdf_oxide-0.3.74-cp38-abi3-manylinux_2_28_aarch64.whl (10.4 MB view details)

Uploaded Jul 14, 2026 CPython 3.8+manylinux: glibc 2.28+ ARM64

pdf_oxide-0.3.74-cp38-abi3-macosx_11_0_arm64.whl (10.1 MB view details)

Uploaded Jul 14, 2026 CPython 3.8+macOS 11.0+ ARM64

pdf_oxide-0.3.74-cp38-abi3-macosx_10_12_x86_64.whl (10.7 MB view details)

Uploaded Jul 14, 2026 CPython 3.8+macOS 10.12+ x86-64

File details

Details for the file pdf_oxide-0.3.74.tar.gz.

File metadata

Download URL: pdf_oxide-0.3.74.tar.gz
Upload date: Jul 14, 2026
Size: 6.7 MB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.13.13

File hashes

Hashes for pdf_oxide-0.3.74.tar.gz
Algorithm	Hash digest
SHA256	`1dee9fb9e534dfbd5f2d355c7c9531f562477bb048f3673e1657a4f778e9ffc2`
MD5	`9bf6bd1088d7d623b930b7f3cdb00053`
BLAKE2b-256	`763fce4bfd761cf4de338ad5582a8d0488d60e1f91cbdf91b04929c723b33c11`

See more details on using hashes here.

File details

Details for the file pdf_oxide-0.3.74-cp38-abi3-win_arm64.whl.

File metadata

Download URL: pdf_oxide-0.3.74-cp38-abi3-win_arm64.whl
Upload date: Jul 14, 2026
Size: 10.4 MB
Tags: CPython 3.8+, Windows ARM64
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.13.13

File hashes

Hashes for pdf_oxide-0.3.74-cp38-abi3-win_arm64.whl
Algorithm	Hash digest
SHA256	`eafa8c4b0f8fc8c46e61dec4c869ab1743e244cf99b1708d429ac1df4019675c`
MD5	`3fe5e686b167e9461b9ccabd1d93ff91`
BLAKE2b-256	`54f524424d3f7951cfeb690af7998d8ca753dcdfc69ed447952779f120196c42`

See more details on using hashes here.

File details

Details for the file pdf_oxide-0.3.74-cp38-abi3-win_amd64.whl.

File metadata

Download URL: pdf_oxide-0.3.74-cp38-abi3-win_amd64.whl
Upload date: Jul 14, 2026
Size: 11.1 MB
Tags: CPython 3.8+, Windows x86-64
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.13.13

File hashes

Hashes for pdf_oxide-0.3.74-cp38-abi3-win_amd64.whl
Algorithm	Hash digest
SHA256	`a6a1492116971b20b9607d8ed11606f455d0d94296b32efbcaf110b7905e9e40`
MD5	`b8e960acfbd86ffc8e8d4cadd9f748b0`
BLAKE2b-256	`6db17628aff63753e0c4fbb04df222a1ffab57349ee7283c3490171ea0e50f6c`

See more details on using hashes here.

File details

Details for the file pdf_oxide-0.3.74-cp38-abi3-musllinux_1_2_x86_64.whl.

File metadata

Download URL: pdf_oxide-0.3.74-cp38-abi3-musllinux_1_2_x86_64.whl
Upload date: Jul 14, 2026
Size: 11.2 MB
Tags: CPython 3.8+, musllinux: musl 1.2+ x86-64
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.13.13

File hashes

Hashes for pdf_oxide-0.3.74-cp38-abi3-musllinux_1_2_x86_64.whl
Algorithm	Hash digest
SHA256	`e327f935c97827e424417cd4373aab7747afc960549a07809f305847ade7f144`
MD5	`69fe63b44a6727f252337f5622fb9945`
BLAKE2b-256	`4af765f0098a6d41e28dcaaf35034fbfea7f2d82ee024947de904e1becc394e2`

See more details on using hashes here.

File details

Details for the file pdf_oxide-0.3.74-cp38-abi3-musllinux_1_2_aarch64.whl.

File metadata

Download URL: pdf_oxide-0.3.74-cp38-abi3-musllinux_1_2_aarch64.whl
Upload date: Jul 14, 2026
Size: 10.6 MB
Tags: CPython 3.8+, musllinux: musl 1.2+ ARM64
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.13.13

File hashes

Hashes for pdf_oxide-0.3.74-cp38-abi3-musllinux_1_2_aarch64.whl
Algorithm	Hash digest
SHA256	`0034f26294a3ba0003f65ad6df0243144c89dcd1bc5e65a898ead12432768594`
MD5	`847e468c3db7efacc313cf715d957cdf`
BLAKE2b-256	`4a84d5a08188c1dd0758dfa844c7fc56851750df248e328a371bad0f5dec3574`

See more details on using hashes here.

File details

Details for the file pdf_oxide-0.3.74-cp38-abi3-manylinux_2_28_x86_64.whl.

File metadata

Download URL: pdf_oxide-0.3.74-cp38-abi3-manylinux_2_28_x86_64.whl
Upload date: Jul 14, 2026
Size: 11.0 MB
Tags: CPython 3.8+, manylinux: glibc 2.28+ x86-64
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.13.13

File hashes

Hashes for pdf_oxide-0.3.74-cp38-abi3-manylinux_2_28_x86_64.whl
Algorithm	Hash digest
SHA256	`5644bdacc0b7f81facc656fef6732f413386f114534253cb807278df8afb588f`
MD5	`caae05019dfc9dd7e430d404a7aea643`
BLAKE2b-256	`7b4ea8ee8b9b19ef5e2970cd74091fa3350604aed8deb39053ac60b77fa813cb`

See more details on using hashes here.

File details

Details for the file pdf_oxide-0.3.74-cp38-abi3-manylinux_2_28_aarch64.whl.

File metadata

Download URL: pdf_oxide-0.3.74-cp38-abi3-manylinux_2_28_aarch64.whl
Upload date: Jul 14, 2026
Size: 10.4 MB
Tags: CPython 3.8+, manylinux: glibc 2.28+ ARM64
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.13.13

File hashes

Hashes for pdf_oxide-0.3.74-cp38-abi3-manylinux_2_28_aarch64.whl
Algorithm	Hash digest
SHA256	`d8fcc6f5c82e2053ac3d0287d1b83abe4b3ccf70b315b6a8669bce420adb8047`
MD5	`2706441ad72ac86e78b1be8bb91a5d73`
BLAKE2b-256	`131c6434ddcb34b3e0ae9205e367c64d8128d4402ebc7b8b80d93c935620819b`

See more details on using hashes here.

File details

Details for the file pdf_oxide-0.3.74-cp38-abi3-macosx_11_0_arm64.whl.

File metadata

Download URL: pdf_oxide-0.3.74-cp38-abi3-macosx_11_0_arm64.whl
Upload date: Jul 14, 2026
Size: 10.1 MB
Tags: CPython 3.8+, macOS 11.0+ ARM64
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.13.13

File hashes

Hashes for pdf_oxide-0.3.74-cp38-abi3-macosx_11_0_arm64.whl
Algorithm	Hash digest
SHA256	`2487c3a68840c3264c8c8a1477742ef979538a436f6dd8ca451b7db79c0df931`
MD5	`a9991ebf9ebfc7957892137130b113b2`
BLAKE2b-256	`d649ca6a3b1bef6f429aed7c369d54472dffb4afd0ea4a2bb8642ba049e51316`

See more details on using hashes here.

File details

Details for the file pdf_oxide-0.3.74-cp38-abi3-macosx_10_12_x86_64.whl.

File metadata

Download URL: pdf_oxide-0.3.74-cp38-abi3-macosx_10_12_x86_64.whl
Upload date: Jul 14, 2026
Size: 10.7 MB
Tags: CPython 3.8+, macOS 10.12+ x86-64
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.13.13

File hashes

Hashes for pdf_oxide-0.3.74-cp38-abi3-macosx_10_12_x86_64.whl
Algorithm	Hash digest
SHA256	`7ff7cfcf78c74be475fabee809c699c21f28dfe516c875024d9112ef4a906208`
MD5	`efb1a4ba786d5005e1f0ed3a3dbc8998`
BLAKE2b-256	`bbc9b5881db9242a353c38eced6b69a39ca9a09d02b3f8e3191b4840ba4ae9f4`

See more details on using hashes here.

pdf-oxide 0.3.74

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

PDFOxide - The Fastest PDF Toolkit for 20 Languages — Python, Rust, Go, JS/TS, C#, Java, Kotlin, Swift, C++ & more, plus CLI & AI

Quick Start

Python

Rust

CLI

MCP Server (for AI assistants)

Why PDFOxide?

Performance

Python Libraries

Rust Libraries

Text Quality

Corpus

Features

Python API

Page-oriented API

Scoped extraction

Extraction profiles

Form Fields

Rust API

Form Fields (Rust)

Installation

Python

Rust

JavaScript/WASM

CLI

MCP Server

Other languages

CLI

MCP Server

Building from Source

Documentation

Use Cases

Why I built this

License

Contributing

Citation

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distributions

File details

File metadata

File hashes

File details

File metadata

File hashes

File details

File metadata

File hashes

File details

File metadata

File hashes

File details

File metadata

File hashes

File details

File metadata

File hashes

File details

File metadata

File hashes

File details

File metadata

File hashes