文件翻译工具

Project description

Project Logo

DocuTranslate

A lightweight local file translation tool based on Large Language Models

✅ Multiple Format Support: Translates various files including pdf, docx, xlsx, md, txt, json, epub, srt, ass, and more.
✅ Automatic Glossary Generation: Supports automatic generation of glossaries for term alignment.
✅ PDF Table, Formula, and Code Recognition: Recognizes and translates tables, formulas, and code often found in academic papers, powered by docling and mineru PDF parsing engines.
✅ JSON Translation: Supports specifying values to be translated in JSON using JSON paths (following jsonpath-ng syntax).
✅ Word/Excel Format Preservation: Translates docx and xlsx files while preserving their original formatting ( does not yet support doc or xls files).
✅ Multi-AI Platform Support: Compatible with most AI platforms, enabling high-performance, concurrent AI translation with custom prompts.
✅ Asynchronous Support: Designed for high-performance scenarios with full asynchronous support, offering service interfaces for parallel tasks.
✅ LAN and Multi-user Support: Can be used by multiple people simultaneously on a local area network.
✅ Interactive Web Interface: Provides an out-of-the-box Web UI and RESTful API for easy integration and use.
✅ Small, Multi-platform Standalone Packages: Windows and Mac standalone packages under 40MB (for versions not using the docling local PDF parser).

When translating pdf files, they are first converted to Markdown, which will **cause the original layout to be lost **. Users with strict layout requirements should take note.

QQ Discussion Group: 1047781902

UI Interface: Translation Effect

Academic Paper Translation: Translation Effect

Novel Translation: Translation Effect

All-in-One Packages

For users who want to get started quickly, we provide all-in-one packages on GitHub Releases. Simply download, unzip, and enter your AI platform API Key to begin.

DocuTranslate: Standard version, uses the online minerU engine to parse PDF documents. Choose this version if you don't need local PDF parsing (recommended).
DocuTranslate_full: Full version, includes the built-in docling local PDF parsing engine. Choose this version if you need local PDF parsing.

Installation

Using pip

# Basic installation
pip install docutranslate

# To use docling for local PDF parsing
pip install docutranslate[docling]

Using uv

# Initialize environment
uv init

# Basic installation
uv add docutranslate

# Install docling extension
uv add docutranslate[docling]

Using git

# Initialize environment
git clone https://github.com/xunbu/docutranslate.git

cd docutranslate

uv sync

Core Concept: Workflow

The core of the new DocuTranslate is the Workflow. Each workflow is a complete, end-to-end translation pipeline designed for a specific file type. Instead of interacting with a single large class, you select and configure a workflow based on your file type.

The basic usage flow is as follows:

Select a Workflow: Choose a workflow based on your input file type (e.g., PDF/Word or TXT), such as MarkdownBasedWorkflow or TXTWorkflow.
Build Configuration: Create the corresponding configuration object for the selected workflow (e.g., MarkdownBasedWorkflowConfig). This object contains all necessary sub-configurations, such as:
- Converter Config: Defines how to convert the original file (like a PDF) to Markdown.
- Translator Config: Defines which LLM, API-Key, target language, etc., to use.
- Exporter Config: Defines specific options for the output format (like HTML).
Instantiate the Workflow: Create an instance of the workflow using the configuration object.
Execute Translation: Call the workflow's .read_*() and .translate() / .translate_async() methods.
Export/Save Results: Call the .export_to_*() or .save_as_*() methods to get or save the translation results.

Available Workflows

Workflow	Use Case	Input Formats	Output Formats	Core Config Class
`MarkdownBasedWorkflow`	Processes rich text documents like PDF, Word, images. Flow: `File -> Markdown -> Translate -> Export`.	`.pdf`, `.docx`, `.md`, `.png`, `.jpg`, etc.	`.md`, `.zip`, `.html`	`MarkdownBasedWorkflowConfig`
`TXTWorkflow`	Processes plain text documents. Flow: `txt -> Translate -> Export`.	`.txt` and other plain text formats	`.txt`, `.html`	`TXTWorkflowConfig`
`JsonWorkflow`	Processes JSON files. Flow: `json -> Translate -> Export`.	`.json`	`.json`, `.html`	`JsonWorkflowConfig`
`DocxWorkflow`	Processes docx files. Flow: `docx -> Translate -> Export`.	`.docx`	`.docx`, `.html`	`DocxWorkflowConfig`
`XlsxWorkflow`	Processes xlsx files. Flow: `xlsx -> Translate -> Export`.	`.xlsx`, `.csv`	`.xlsx`, `.html`	`XlsxWorkflowConfig`
`SrtWorkflow`	Processes srt files. Flow: `srt -> Translate -> Export`.	`.srt`	`.srt`, `.html`	`SrtWorkflowConfig`
`EpubWorkflow`	Processes epub files. Flow: `epub -> Translate -> Export`.	`.epub`	`.epub`, `.html`	`EpubWorkflowConfig`
`HtmlWorkflow`	Processes html files. Flow: `html -> Translate -> Export`.	`.html`, `.htm`	`.html`	`HtmlWorkflowConfig`

You can export to PDF format in the interactive interface.

Launch Web UI and API Service

For ease of use, DocuTranslate provides a full-featured Web interface and RESTful API.

Start the service:

# Start the service, listening on port 8010 by default
docutranslate -i

# Start on a specific port
docutranslate -i -p 8011

# You can also specify the port via an environment variable
export DOCUTRANSLATE_PORT=8011
docutranslate -i

Interactive Interface: After starting the service, visit http://127.0.0.1:8010 (or your specified port) in your browser.
API Documentation: The complete API documentation (Swagger UI) is available at http://127.0.0.1:8010/docs.

Usage

Example 1: Translate a PDF file (using `MarkdownBasedWorkflow`)

This is the most common use case. We will use the minerU engine to convert the PDF to Markdown and then use an LLM for translation. This example uses the asynchronous method.

import asyncio
from docutranslate.workflow.md_based_workflow import MarkdownBasedWorkflow, MarkdownBasedWorkflowConfig
from docutranslate.converter.x2md.converter_mineru import ConverterMineruConfig
from docutranslate.translator.ai_translator.md_translator import MDTranslatorConfig
from docutranslate.exporter.md.md2html_exporter import MD2HTMLExporterConfig


async def main():
    # 1. Build translator configuration
    translator_config = MDTranslatorConfig(
        base_url="https://open.bigmodel.cn/api/paas/v4",  # AI Platform Base URL
        api_key="YOUR_ZHIPU_API_KEY",  # AI Platform API Key
        model_id="glm-4-air",  # Model ID
        to_lang="English",  # Target language
        chunk_size=3000,  # Text chunk size
        concurrent=10,  # Concurrency level
        # glossary_generate_enable=True, # Enable automatic glossary generation
        # glossary_dict={"Jobs":"乔布斯"}, # Pass in a glossary
        # system_proxy_enable=True, # Enable system proxy
    )

    # 2. Build converter configuration (using minerU)
    converter_config = ConverterMineruConfig(
        mineru_token="YOUR_MINERU_TOKEN",  # Your minerU Token
        formula_ocr=True  # Enable formula recognition
    )

    # 3. Build main workflow configuration
    workflow_config = MarkdownBasedWorkflowConfig(
        convert_engine="mineru",  # Specify the parsing engine
        converter_config=converter_config,  # Pass the converter config
        translator_config=translator_config,  # Pass the translator config
        html_exporter_config=MD2HTMLExporterConfig(cdn=True)  # HTML export configuration
    )

    # 4. Instantiate the workflow
    workflow = MarkdownBasedWorkflow(config=workflow_config)

    # 5. Read the file and execute translation
    print("Reading and translating the file...")
    workflow.read_path("path/to/your/document.pdf")
    await workflow.translate_async()
    # Or use the synchronous method
    # workflow.translate()
    print("Translation complete!")

    # 6. Save the results
    workflow.save_as_html(name="translated_document.html")
    workflow.save_as_markdown_zip(name="translated_document.zip")
    workflow.save_as_markdown(name="translated_document.md")  # Markdown with embedded images
    print("Files saved to the ./output folder.")

    # Or get the content strings directly
    html_content = workflow.export_to_html()
    html_content = workflow.export_to_markdown()
    # print(html_content)


if __name__ == "__main__":
    asyncio.run(main())

Example 2: Translate a TXT file (using `TXTWorkflow`)

For plain text files, the process is simpler as it doesn't require a document parsing (conversion) step. This example uses the asynchronous method.

import asyncio
from docutranslate.workflow.txt_workflow import TXTWorkflow, TXTWorkflowConfig
from docutranslate.translator.ai_translator.txt_translator import TXTTranslatorConfig
from docutranslate.exporter.txt.txt2html_exporter import TXT2HTMLExporterConfig


async def main():
    # 1. Build translator configuration
    translator_config = TXTTranslatorConfig(
        base_url="https://api.openai.com/v1/",
        api_key="YOUR_OPENAI_API_KEY",
        model_id="gpt-4o",
        to_lang="Chinese",
    )

    # 2. Build main workflow configuration
    workflow_config = TXTWorkflowConfig(
        translator_config=translator_config,
        html_exporter_config=TXT2HTMLExporterConfig(cdn=True)
    )

    # 3. Instantiate the workflow
    workflow = TXTWorkflow(config=workflow_config)

    # 4. Read the file and execute translation
    workflow.read_path("path/to/your/notes.txt")
    await workflow.translate_async()
    # Or use the synchronous method
    # workflow.translate()

    # 5. Save the result
    workflow.save_as_txt(name="translated_notes.txt")
    print("TXT file saved.")

    # You can also export the translated plain text
    text = workflow.export_to_txt()


if __name__ == "__main__":
    asyncio.run(main())

Example 3: Translate a JSON file (using `JsonWorkflow`)

This example uses the asynchronous method. The json_paths item in JsonTranslatorConfig needs to specify the JSON paths to be translated (conforming to the jsonpath-ng syntax). Only values matching these paths will be translated.

import asyncio

from docutranslate.exporter.js.json2html_exporter import Json2HTMLExporterConfig
from docutranslate.translator.ai_translator.json_translator import JsonTranslatorConfig
from docutranslate.workflow.json_workflow import JsonWorkflowConfig, JsonWorkflow


async def main():
    # 1. Build translator configuration
    translator_config = JsonTranslatorConfig(
        base_url="https://api.openai.com/v1/",
        api_key="YOUR_OPENAI_API_KEY",
        model_id="gpt-4o",
        to_lang="Chinese",
        json_paths=["$.*", "$.name"]  # Conforms to jsonpath-ng syntax, values at matching paths will be translated
    )

    # 2. Build main workflow configuration
    workflow_config = JsonWorkflowConfig(
        translator_config=translator_config,
        html_exporter_config=Json2HTMLExporterConfig(cdn=True)
    )

    # 3. Instantiate the workflow
    workflow = JsonWorkflow(config=workflow_config)

    # 4. Read the file and execute translation
    workflow.read_path("path/to/your/notes.json")
    await workflow.translate_async()
    # Or use the synchronous method
    # workflow.translate()

    # 5. Save the result
    workflow.save_as_json(name="translated_notes.json")
    print("JSON file saved.")

    # You can also export the translated JSON text
    text = workflow.export_to_json()


if __name__ == "__main__":
    asyncio.run(main())

Example 4: Translate a DOCX file (using `DocxWorkflow`)

This example uses the asynchronous method.

import asyncio

from docutranslate.exporter.docx.docx2html_exporter import Docx2HTMLExporterConfig
from docutranslate.translator.ai_translator.docx_translator import DocxTranslatorConfig
from docutranslate.workflow.docx_workflow import DocxWorkflowConfig, DocxWorkflow


async def main():
    # 1. Build translator configuration
    translator_config = DocxTranslatorConfig(
        base_url="https://api.openai.com/v1/",
        api_key="YOUR_OPENAI_API_KEY",
        model_id="gpt-4o",
        to_lang="Chinese",
        insert_mode="replace",  # Options: "replace", "append", "prepend"
        separator="\n",  # Separator used in "append" and "prepend" modes
    )

    # 2. Build main workflow configuration
    workflow_config = DocxWorkflowConfig(
        translator_config=translator_config,
        html_exporter_config=Docx2HTMLExporterConfig(cdn=True)
    )

    # 3. Instantiate the workflow
    workflow = DocxWorkflow(config=workflow_config)

    # 4. Read the file and execute translation
    workflow.read_path("path/to/your/notes.docx")
    await workflow.translate_async()
    # Or use the synchronous method
    # workflow.translate()

    # 5. Save the result
    workflow.save_as_docx(name="translated_notes.docx")
    print("DOCX file saved.")

    # You can also export the translated DOCX as bytes
    text_bytes = workflow.export_to_docx()


if __name__ == "__main__":
    asyncio.run(main())

Example 5: Translate an XLSX file (using `XlsxWorkflow`)

This example uses the asynchronous method.

import asyncio

from docutranslate.exporter.xlsx.xlsx2html_exporter import Xlsx2HTMLExporterConfig
from docutranslate.translator.ai_translator.xlsx_translator import XlsxTranslatorConfig
from docutranslate.workflow.xlsx_workflow import XlsxWorkflowConfig, XlsxWorkflow


async def main():
    # 1. Build translator configuration
    translator_config = XlsxTranslatorConfig(
        base_url="https://api.openai.com/v1/",
        api_key="YOUR_OPENAI_API_KEY",
        model_id="gpt-4o",
        to_lang="Chinese",
        insert_mode="replace",  # Options: "replace", "append", "prepend"
        separator="\n",  # Separator used in "append" and "prepend" modes
    )

    # 2. Build main workflow configuration
    workflow_config = XlsxWorkflowConfig(
        translator_config=translator_config,
        html_exporter_config=Xlsx2HTMLExporterConfig(cdn=True)
    )

    # 3. Instantiate the workflow
    workflow = XlsxWorkflow(config=workflow_config)

    # 4. Read the file and execute translation
    workflow.read_path("path/to/your/notes.xlsx")
    await workflow.translate_async()
    # Or use the synchronous method
    # workflow.translate()

    # 5. Save the result
    workflow.save_as_xlsx(name="translated_notes.xlsx")
    print("XLSX file saved.")

    # You can also export the translated XLSX as bytes
    text_bytes = workflow.export_to_xlsx()


if __name__ == "__main__":
    asyncio.run(main())

Example 5: Configuration Items for Other Workflows (Using `HtmlWorkflow`, `EpubWorkflow`)

Here is an example using asynchronous mode.

# HtmlWorkflow
from docutranslate.translator.ai_translator.html_translator import HtmlTranslatorConfig
from docutranslate.workflow.html_workflow import HtmlWorkflowConfig, HtmlWorkflow


async def html():
    # 1. Create translator configuration
    translator_config = HtmlTranslatorConfig(
        base_url="https://api.openai.com/v1/",
        api_key="YOUR_OPENAI_API_KEY",
        model_id="gpt-4o",
        to_lang="Chinese",
        insert_mode="replace",  # Options: "replace", "append", "prepend"
        separator="\n",  # Separator used for "append" and "prepend" modes
    )

    # 2. Create main workflow configuration
    workflow_config = HtmlWorkflowConfig(
        translator_config=translator_config,
    )
    workflow_html = HtmlWorkflow(config=workflow_config)


# EpubWorkflow
from docutranslate.exporter.epub.epub2html_exporter import Epub2HTMLExporterConfig
from docutranslate.translator.ai_translator.epub_translator import EpubTranslatorConfig
from docutranslate.workflow.epub_workflow import EpubWorkflowConfig, EpubWorkflow


async def epub():
    # 1. Create translator configuration
    translator_config = EpubTranslatorConfig(
        base_url="https://api.openai.com/v1/",
        api_key="YOUR_OPENAI_API_KEY",
        model_id="gpt-4o",
        to_lang="Chinese",
        insert_mode="replace",  # Options: "replace", "append", "prepend"
        separator="\n",  # Separator used for "append" and "prepend" modes
    )

    # 2. Create main workflow configuration
    workflow_config = EpubWorkflowConfig(
        translator_config=translator_config,
        html_exporter_config=Epub2HTMLExporterConfig(cdn=True),
    )
    workflow_epub = EpubWorkflow(config=workflow_config)

Prerequisites and Configuration Details

1. Get a Large Model API Key

The translation feature relies on large language models. You need to obtain a base_url, api_key, and model_id from the respective AI platform.

Recommended models: Volcengine's doubao-seed-1-6-flash and doubao-seed-1-6 series, Zhipu's glm-4-flash, Alibaba Cloud's qwen-plus and qwen-flash, Deepseek's deepseek-chat, etc.

302.AI👈 Register through this link to enjoy a $1 free credit

Platform Name	Get API Key	Base URL
ollama		`http://127.0.0.1:11434/v1`
lm studio		`http://127.0.0.1:1234/v1`
302.AI	Click to get	`https://api.302.ai/v1`
openrouter	Click to get	`https://openrouter.ai/api/v1`
openai	Click to get	`https://api.openai.com/v1/`
gemini	Click to get	`https://generativelanguage.googleapis.com/v1beta/openai/`
deepseek	Click to get	`https://api.deepseek.com/v1`
Zhipu AI (智谱ai)	Click to get	`https://open.bigmodel.cn/api/paas/v4`
Tencent Hunyuan (腾讯混元)	Click to get	`https://api.hunyuan.cloud.tencent.com/v1`
Alibaba Cloud Bailian (阿里云百炼)	Click to get	`https://dashscope.aliyuncs.com/compatible-mode/v1`
Volcengine (火山引擎)	Click to get	`https://ark.cn-beijing.volces.com/api/v3`
SiliconFlow (硅基流动)	Click to get	`https://api.siliconflow.cn/v1`
DMXAPI	Click to get	`https://www.dmxapi.cn/v1`
Juguang AI (聚光AI)	Click to get	`https://ai.juguang.chat/v1`

2. PDF Parsing Engine (ignore if not translating PDFs)

2.1 Get a minerU Token (Online PDF parsing, free, recommended)

If you choose mineru as your document parsing engine (convert_engine="mineru"), you need to apply for a free token.

Visit the minerU official website to register and apply for an API.
Create a new API Token in the API Token Management interface.

Note: minerU Tokens are valid for 14 days. Please create a new one after expiration.

2.2. docling Engine Configuration (Local PDF parsing)

If you choose docling as your document parsing engine (convert_engine="docling"), it will download the required models from Hugging Face upon first use.

A better option is to download docling_artifact.zip from GitHub Releases and extract it to your working directory.

Solutions for network issues when downloading docling models:

Set a Hugging Face mirror (Recommended):
- Method A (Environment Variable): Set the system environment variable HF_ENDPOINT and restart your IDE or terminal.
```
HF_ENDPOINT=https://hf-mirror.com
```

Method B (Set in code): Add the following code at the beginning of your Python script.

import os

os.environ['HF_ENDPOINT'] = 'https://hf-mirror.com'

Offline Usage (Download the model package in advance):
- Download docling_artifact.zip from GitHub Releases.
- Extract it into your project directory.

Specify the model path in your configuration (if the model is not in the same directory as the script):

from docutranslate.converter.x2md.converter_docling import ConverterDoclingConfig

converter_config = ConverterDoclingConfig(
    artifact="./docling_artifact",  # Path to the extracted folder
    code_ocr=True,
    formula_ocr=True
)

FAQ

Q: Why is the translated text still in the original language?
A: Check the logs for errors. It's usually due to an overdue payment on the AI platform or network issues (check if you need to enable the system proxy).

Q: Port 8010 is already in use. What should I do?
A: Use the -p parameter to specify a new port, or set the DOCUTRANSLATE_PORT environment variable.

Q: Does it support translating scanned PDFs?
A: Yes. Please use the mineru parsing engine, which has powerful OCR capabilities.

Q: Why is the first PDF translation very slow?
A: If you are using the docling engine, it needs to download models from Hugging Face on its first run. Please refer to the "Network Issues Solutions" section above to speed up this process.

Q: How can I use it in an intranet (offline) environment?
A: Absolutely. You need to meet the following conditions:

Local LLM: Deploy a language model locally using tools like Ollama or LM Studio, and fill in the local model's base_url in TranslatorConfig.
Local PDF Parsing Engine (only for parsing PDFs): Use the docling engine and download the model package in advance as described in the "Offline Usage" section above.

Q: How does the PDF parsing cache mechanism work?
A: MarkdownBasedWorkflow automatically caches the results of document parsing (file-to-Markdown conversion) to avoid repetitive, time-consuming parsing. The cache is stored in memory by default and records the last 10 parses. You can change the cache size using the DOCUTRANSLATE_CACHE_NUM environment variable.

Q: How can I make the software use a proxy?
A: By default, the software does not use the system proxy. You can enable it by setting system_proxy_enable=True in TranslatorConfig.

Star History

Sponsorship

Your support is welcome! Please mention the reason for your donation in the memo.

Sponsorship QR Code

Project details

Release history Release notifications | RSS feed

1.7.5

Apr 24, 2026

1.7.4

Apr 24, 2026

1.7.3

Apr 19, 2026

1.7.2

Apr 8, 2026

1.7.1.post1

Mar 8, 2026

1.7.1

Mar 7, 2026

1.7.0

Mar 2, 2026

1.7.0a2 pre-release

Feb 25, 2026

1.7.0a1 pre-release

Feb 25, 2026

1.6.3.post1

Jan 19, 2026

1.6.3 yanked

Jan 18, 2026

1.6.2

Jan 11, 2026

1.6.1 yanked

Jan 10, 2026

1.6.0

Dec 31, 2025

1.5.6

Dec 17, 2025

1.5.5

Dec 14, 2025

1.5.4

Dec 12, 2025

1.5.3

Dec 4, 2025

1.5.3a1 pre-release

Dec 2, 2025

1.5.2.post1 yanked

Nov 25, 2025

1.5.2 yanked

Nov 25, 2025

This version

1.5.1

Nov 10, 2025

1.4.18

Nov 3, 2025

1.4.17

Oct 26, 2025

1.4.16.post1

Oct 20, 2025

1.4.16

Oct 20, 2025

1.4.15

Oct 19, 2025

1.4.14

Oct 19, 2025

1.4.13

Oct 18, 2025

1.4.12

Oct 15, 2025

1.4.11

Oct 14, 2025

1.4.10

Oct 13, 2025

1.4.9

Oct 10, 2025

1.4.8

Oct 4, 2025

1.4.7

Sep 29, 2025

1.4.6

Sep 24, 2025

1.4.5

Sep 24, 2025

1.4.5b2 pre-release

Sep 24, 2025

1.4.4

Sep 17, 2025

1.4.3

Sep 9, 2025

1.4.2.post2

Sep 7, 2025

1.4.2.post1

Sep 7, 2025

1.4.2

Sep 6, 2025

1.4.1.post1

Sep 5, 2025

1.4.1

Sep 5, 2025

1.4.0

Sep 4, 2025

1.3.3

Sep 3, 2025

1.3.2

Sep 2, 2025

1.3.2a1 pre-release

Aug 30, 2025

1.3.1

Aug 30, 2025

1.3.0b1 pre-release

Aug 29, 2025

1.2.5

Aug 24, 2025

1.2.4

Aug 23, 2025

1.2.3

Aug 22, 2025

1.2.2

Aug 20, 2025

1.2.1

Aug 20, 2025

1.2.0 yanked

Aug 20, 2025

1.1.6

Aug 18, 2025

1.1.5

Aug 18, 2025

1.1.3

Aug 14, 2025

1.1.1

Aug 9, 2025

1.0.0

Aug 5, 2025

0.3.3

Jul 16, 2025

0.3.2

Jul 16, 2025

0.2.41

Jul 7, 2025

0.2.40

Jul 7, 2025

0.2.39

Jul 3, 2025

0.2.38

Jun 19, 2025

0.2.37

Jun 10, 2025

0.2.36

Jun 10, 2025

0.2.35

Jun 4, 2025

0.2.34

Jun 2, 2025

0.2.31

May 29, 2025

0.2.28

May 26, 2025

0.2.27

May 26, 2025

0.2.25 yanked

May 26, 2025

Reason this release was yanked:

mathjax渲染错误

0.2.23

May 22, 2025

0.2.21

May 20, 2025

0.2.20

May 20, 2025

0.2.19

May 19, 2025

0.2.18

May 19, 2025

0.2.17

May 19, 2025

0.2.16

May 18, 2025

0.2.15

May 18, 2025

0.2.14

May 17, 2025

0.2.13

May 17, 2025

0.2.12

May 17, 2025

0.2.11

May 17, 2025

0.2.10

May 17, 2025

0.2.9

May 16, 2025

0.2.8

May 16, 2025

0.2.7

May 16, 2025

0.2.6

May 14, 2025

0.2.4

May 13, 2025

0.2.3

May 13, 2025

0.2.2.post1

May 12, 2025

0.2.2

May 12, 2025

0.2.1.post1

May 12, 2025

0.2.1

May 12, 2025

0.2.0

May 12, 2025

0.1.8

May 11, 2025

0.1.7

May 11, 2025

0.1.6

May 10, 2025

0.1.5

May 10, 2025

0.1.4

May 10, 2025

0.1.3.post1

May 10, 2025

0.1.3

May 10, 2025

0.1.2

May 10, 2025

0.1.1

May 9, 2025

0.1.0

May 9, 2025

0.0.8

May 8, 2025

0.0.7

May 8, 2025

0.0.6

May 8, 2025

0.0.5

May 8, 2025

0.0.4

May 8, 2025

0.0.3

May 8, 2025

0.0.2

May 8, 2025

0.0.1

May 8, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

docutranslate-1.5.1.tar.gz (3.7 MB view details)

Uploaded Nov 10, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

docutranslate-1.5.1-py3-none-any.whl (4.6 MB view details)

Uploaded Nov 10, 2025 Python 3

File details

Details for the file docutranslate-1.5.1.tar.gz.

File metadata

Download URL: docutranslate-1.5.1.tar.gz
Upload date: Nov 10, 2025
Size: 3.7 MB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.6.14

File hashes

Hashes for docutranslate-1.5.1.tar.gz
Algorithm	Hash digest
SHA256	`7b697101600de632b6cae4535c6ebe8fb71981a93966a16d3452c911c91f5333`
MD5	`47b0c79d24c8ce6c894ebb93afac799a`
BLAKE2b-256	`e78591d17c5104b47c51e509568a67987f7f26bf4ad8f7d2d036088d89e362a6`

See more details on using hashes here.

File details

Details for the file docutranslate-1.5.1-py3-none-any.whl.

File metadata

Download URL: docutranslate-1.5.1-py3-none-any.whl
Upload date: Nov 10, 2025
Size: 4.6 MB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.6.14

File hashes

Hashes for docutranslate-1.5.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`669529c00d777a7a1ae696321ad1701033c66c8238cd0be339ea1cf2b704ad31`
MD5	`1417b09b98aafd654bcfa4d7cb36a7b2`
BLAKE2b-256	`4f56a76b72caea4ba25ccf8e94d7c615ff30fbf331e8e9c5a05bc431f045a3f4`

See more details on using hashes here.

docutranslate 1.5.1

Navigation

Verified details

Maintainers

Unverified details

Meta

Project description

DocuTranslate

All-in-One Packages

Installation

Using pip

Using uv

Using git

Core Concept: Workflow

Available Workflows

Launch Web UI and API Service

Usage

Example 1: Translate a PDF file (using MarkdownBasedWorkflow)

Example 2: Translate a TXT file (using TXTWorkflow)

Example 3: Translate a JSON file (using JsonWorkflow)

Example 4: Translate a DOCX file (using DocxWorkflow)

Example 5: Translate an XLSX file (using XlsxWorkflow)

Example 5: Configuration Items for Other Workflows (Using HtmlWorkflow, EpubWorkflow)

Prerequisites and Configuration Details

1. Get a Large Model API Key

2. PDF Parsing Engine (ignore if not translating PDFs)

2.1 Get a minerU Token (Online PDF parsing, free, recommended)

2.2. docling Engine Configuration (Local PDF parsing)

FAQ

Star History

Sponsorship

Project details

Verified details

Maintainers

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes

Example 1: Translate a PDF file (using `MarkdownBasedWorkflow`)

Example 2: Translate a TXT file (using `TXTWorkflow`)

Example 3: Translate a JSON file (using `JsonWorkflow`)

Example 4: Translate a DOCX file (using `DocxWorkflow`)

Example 5: Translate an XLSX file (using `XlsxWorkflow`)

Example 5: Configuration Items for Other Workflows (Using `HtmlWorkflow`, `EpubWorkflow`)