LaTeX PDF Translator
PDFMathTranslate
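Translates LaTeX PDFs and produces bilingual side-by-side output using font rules and deep learning, preserving the layout of formulas and figures.
Installation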
pip install pdf2zh
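Usage
Run the translation command from the command line. It generates the Chinese document example-zh.pdf and the bilingual document example-dual.pdf in the current directory.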
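The naming convention described above can be sketched in Python. The helper `expected_outputs` is hypothetical and not part of pdf2zh; it only assumes that the `-zh` and `-dual` suffixes are appended to the input file's stem and that both files land in the current directory.

```python
from pathlib import Path


def expected_outputs(source: str) -> tuple[str, str]:
    """Return the (translated, bilingual) file names expected for `source`.

    Hypothetical helper: assumes the "-zh" / "-dual" suffix convention
    from the usage notes and ignores the input file's directory.
    """
    stem = Path(source).stem  # "example.pdf" -> "example"
    return f"{stem}-zh.pdf", f"{stem}-dual.pdf"


if __name__ == "__main__":
    translated, bilingual = expected_outputs("example.pdf")
    print(translated)
    print(bilingual)
```

A script that wraps the command could use a check like this to confirm that both files exist after a run.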
Translate the whole document
pdf2zh example.pdf
Translate part of the document
pdf2zh example.pdf -p 1-3,5
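For illustration, a page specification such as `1-3,5` could be expanded as follows. This is a minimal sketch, not pdf2zh's own parser: the function name `parse_page_spec` is hypothetical, and it assumes that ranges are inclusive and that page numbers are used exactly as written.

```python
def parse_page_spec(spec: str) -> list[int]:
    """Expand a spec like "1-3,5" into a sorted list of page numbers.

    Hypothetical helper: assumes comma-separated items, where each item
    is either a single page or an inclusive "start-end" range.
    """
    pages: set[int] = set()
    for part in spec.split(","):
        part = part.strip()
        if not part:
            continue  # tolerate stray commas
        if "-" in part:
            start_text, end_text = part.split("-", 1)
            start, end = int(start_text), int(end_text)
            if start > end:
                raise ValueError(f"invalid range: {part!r}")
            pages.update(range(start, end + 1))
        else:
            pages.add(int(part))
    return sorted(pages)


if __name__ == "__main__":
    print(parse_page_spec("1-3,5"))
```

Under these assumptions, the command above would select pages 1, 2, 3 and 5.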
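Use regular expressions to specify the formula fonts and characters whose styling should be preserved
Note: English-style ligatures begin at \ufb00, so the character range in the example stops at \ufaff.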
pdf2zh BDA3.pdf -f "(CM[^RT].*|MS.*|XY.*|MT.*|BL.*|.*0700|.*0500|.*Italic)" -c "(\(|\||\)|\+|=|\d|[\u0080-\ufaff])"
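To see what these two patterns accept, they can be tried out with Python's `re` module. This is a sketch under assumptions: the helper names and the sample font names are illustrative, and matching is anchored at the start of the string with `re.match`, which may differ from how pdf2zh applies the patterns internally.

```python
import re

# Patterns copied from the command above.
FONT_PATTERN = r"(CM[^RT].*|MS.*|XY.*|MT.*|BL.*|.*0700|.*0500|.*Italic)"
CHAR_PATTERN = r"(\(|\||\)|\+|=|\d|[\u0080-\ufaff])"


def is_formula_font(font_name: str) -> bool:
    """True if the font name matches the -f pattern (hypothetical helper)."""
    return re.match(FONT_PATTERN, font_name) is not None


def is_formula_char(char: str) -> bool:
    """True if the character matches the -c pattern (hypothetical helper)."""
    return re.match(CHAR_PATTERN, char) is not None


if __name__ == "__main__":
    # Illustrative font names; real PDFs may use different ones.
    for name in ["CMMI10", "CMSY10", "CMR10", "CMTT10", "MSBM10",
                 "Times-Italic", "Times-Roman"]:
        print(f"{name}: {is_formula_font(name)}")

    # "\u03b1" is Greek alpha; "\ufb01" is the "fi" ligature.
    for char in ["(", "=", "7", "\u03b1", "x", "\ufb01"]:
        print(f"U+{ord(char):04X}: {is_formula_char(char)}")
```

The `CM[^RT].*` alternative deliberately skips Computer Modern names whose third letter is R or T, such as CMR10 and CMTT10, so those fonts are not treated as formula fonts by the pattern. The upper bound of `\ufaff` in the character class keeps the ligatures from `\ufb00` onward out of the preserved set.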
Acknowledgements
Document merging: PyMuPDF
Document parsing: pdfminer.six
Multi-threaded translation: MathTranslate
Layout parsing: LayoutParser
Download files
Source distribution: pdf2zh-1.2.6.tar.gz (129.6 kB)
Built distribution: pdf2zh-1.2.6-py3-none-any.whl (138.0 kB)