Skip to main content

Universal document converter with 200+ format combinations. Features PowerPoint to Obsidian Markdown with image extraction, format preservation, and navigation links. Supports PDF, DOCX, PPTX, MD, TEX, CSV, XLSX, TXT, HEIC, JPG, PNG, HTML, RTF, ODT.

Project description

Docuvert

Docuvert is a command-line tool that supports converting documents from any format to any other format.

Installation

  1. Clone the repository:

    git clone https://github.com/your-repo/docuvert.git
    cd docuvert
    
  2. Run the setup script:

    ./setup.sh
    

    This script will install the necessary dependencies and create a docuvert executable wrapper in your project directory. It will also provide instructions on how to set up an alias for easy command-line access.

Usage

Docuvert converts files based on their extensions. The syntax is simple:

docuvert <input_file_path> <output_file_path>

Examples:

  • Convert PDF to DOCX:

    docuvert document.pdf document.docx
    
  • Convert Markdown to PDF:

    docuvert notes.md notes.pdf
    
  • Convert PowerPoint to Obsidian Markdown (NEW!):

    docuvert presentation.pptx notes.md
    
  • Convert Legacy PowerPoint with automatic conversion:

    docuvert lecture.ppt lecture.md
    
  • Convert DOCX to Markdown:

    docuvert report.docx report.md
    

Supported Conversions

Docuvert supports 200+ format combinations with intelligent conversion routing. Key features include:

🎯 PowerPoint Conversions (NEW!)

  • PPTX/PPT to Obsidian Markdown (pptx2md, ppt2md) - Featured Converter
    • ✅ Automatic image extraction and embedding
    • ✅ Format preservation (bold, italic, colors)
    • ✅ Obsidian-specific features (YAML frontmatter, internal links, callouts)
    • ✅ Slide navigation with Previous/Next links
    • ✅ Table of contents generation
    • ✅ Legacy .ppt support via LibreOffice conversion
  • PPTX to PDF (pptx2pdf)
  • PPTX to HTML (pptx2html)
  • PPTX to Plain Text (pptx2txt)
  • Markdown to PPTX (md2pptx)

📄 Document Conversions

  • PDF to DOCX (pdf2docx)
  • PDF to Markdown (pdf2md)
  • PDF to LaTeX (pdf2tex)
  • PDF to Plain Text (pdf2txt)
  • PDF to CSV (pdf2csv)
  • PDF to XLSX (pdf2xlsx)
  • DOCX to PDF (docx2pdf)
  • DOCX to Markdown (docx2md)
  • DOCX to LaTeX (docx2tex)
  • DOCX to Plain Text (docx2txt)
  • DOCX to CSV (docx2csv)
  • DOCX to XLSX (docx2xlsx)
  • Markdown to PDF (md2pdf)
  • Markdown to DOCX (md2docx)
  • Markdown to LaTeX (md2tex)
  • Markdown to Plain Text (md2txt)
  • Markdown to CSV (md2csv)
  • Markdown to XLSX (md2xlsx)
  • LaTeX to PDF (tex2pdf)
  • LaTeX to DOCX (tex2docx)
  • LaTeX to Markdown (tex2md)
  • LaTeX to Plain Text (tex2txt)
  • LaTeX to CSV (tex2csv)
  • LaTeX to XLSX (tex2xlsx)
  • Plain Text to PDF (txt2pdf)
  • Plain Text to DOCX (txt2docx)
  • Plain Text to Markdown (txt2md)
  • Plain Text to LaTeX (txt2tex)
  • Plain Text to CSV (txt2csv)
  • Plain Text to XLSX (txt2xlsx)
  • CSV to PDF (csv2pdf)
  • CSV to DOCX (csv2docx)
  • CSV to Markdown (csv2md)
  • CSV to LaTeX (csv2tex)
  • CSV to Plain Text (csv2txt)
  • CSV to XLSX (csv2xlsx)
  • XLSX to PDF (xlsx2pdf)
  • XLSX to DOCX (xlsx2docx)
  • XLSX to Markdown (xlsx2md)
  • XLSX to LaTeX (xlsx2tex)
  • XLSX to Plain Text (xlsx2txt)
  • XLSX to CSV (xlsx2csv)

🔄 Legacy Format Support

Docuvert automatically handles legacy Microsoft Office formats:

📝 Legacy Word (.doc) Support

  • Automatic conversion: .doc files are automatically converted to .docx format before processing
  • All format combinations supported: Use any .doc to format conversion just like .docx
  • Examples:
    docuvert old-document.doc new-document.pdf
    docuvert report.doc report.md
    docuvert legacy.doc modern.docx
    

📊 Legacy Excel (.xls) Support

  • Automatic conversion: .xls files are automatically converted to .xlsx format before processing
  • All format combinations supported: Use any .xls to format conversion just like .xlsx
  • Examples:
    docuvert old-spreadsheet.xls new-spreadsheet.pdf
    docuvert data.xls data.csv
    docuvert legacy.xls modern.xlsx
    

📋 Requirements for Legacy Formats

🔧 Conversion Process

  1. Docuvert detects legacy format (.doc or .xls)
  2. Creates temporary modern format file (.docx or .xlsx)
  3. Processes conversion using existing converters
  4. Cleans up temporary files automatically
  5. Returns final converted output

No additional configuration needed - just use legacy files like modern formats!

Contributing

See instructions.md for details on project organization and how to add new converters.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

docuvert-1.0.1.tar.gz (102.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

docuvert-1.0.1-py3-none-any.whl (135.0 kB view details)

Uploaded Python 3

File details

Details for the file docuvert-1.0.1.tar.gz.

File metadata

  • Download URL: docuvert-1.0.1.tar.gz
  • Upload date:
  • Size: 102.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.5

File hashes

Hashes for docuvert-1.0.1.tar.gz
Algorithm Hash digest
SHA256 f5a24e7488d5ade8645205cb86f60fc75ab486701196da1af3cb3234e6735163
MD5 1c7fdd0dbcd5da2f103ed503ff01c298
BLAKE2b-256 89f0de5bfdb7fb8254bc82f3e22f07b05808f5dc2552624d69dab8c457b94019

See more details on using hashes here.

File details

Details for the file docuvert-1.0.1-py3-none-any.whl.

File metadata

  • Download URL: docuvert-1.0.1-py3-none-any.whl
  • Upload date:
  • Size: 135.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.5

File hashes

Hashes for docuvert-1.0.1-py3-none-any.whl
Algorithm Hash digest
SHA256 703bb1c54c6cdde5be46723f339bc63011e85e31cee55079fad31b9845384f03
MD5 149feb8578ce7ee2796ac8f848de0d8c
BLAKE2b-256 774042ed6d4640a6c5ffec9bf25c376ebe34a2ca28e8b6b57f4fb4588ac04ae4

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page