Everything to Markdown.

E2M: Everything to Markdown

E2M converts a range of file types, including office documents, e-books, web pages, PDFs, and audio recordings, into Markdown.

Supported File Types

  • doc
  • docx
  • epub
  • html
  • htm
  • url
  • pdf
  • pptx
  • mp3
  • m4a
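
As a rough illustration of how these extensions might line up with the parser sections named in the Config Template section, here is a minimal sketch. The mapping is an assumption inferred from the section names (for example, that htm shares html_parser and that mp3 and m4a go to voice_parser); E2M's actual dispatch logic may differ, and url is left out because no matching section appears in the template. The name guess_parser_section is hypothetical.

```python
from pathlib import Path
from typing import Optional

# Assumed mapping from file extension to the parser section names used in the
# config template. This is an illustration, not E2M's real dispatch table.
EXTENSION_TO_PARSER = {
    ".doc": "doc_parser",
    ".docx": "docx_parser",
    ".epub": "epub_parser",
    ".html": "html_parser",
    ".htm": "html_parser",
    ".pdf": "pdf_parser",
    ".pptx": "pptx_parser",
    ".mp3": "voice_parser",
    ".m4a": "voice_parser",
}


def guess_parser_section(file_name: str) -> Optional[str]:
    """Return the assumed config section for a file, or None if unsupported."""
    # Lower-case the suffix so "Report.PDF" and "report.pdf" behave the same.
    return EXTENSION_TO_PARSER.get(Path(file_name).suffix.lower())
```

A helper like this can be used to skip unsupported files before handing anything to the parser.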

Installation

To install E2M, use pip:

pip install wisup_e2m
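
To confirm that the install succeeded, one option is to query the installed distribution's version with the standard library. This is a small sketch that uses only importlib.metadata; the helper name installed_version is hypothetical and not part of E2M.

```python
from importlib.metadata import PackageNotFoundError, version
from typing import Optional


def installed_version(package: str = "wisup_e2m") -> Optional[str]:
    """Return the installed version of a distribution, or None if it is absent."""
    try:
        return version(package)
    except PackageNotFoundError:
        # The distribution is not installed in the current environment.
        return None


if __name__ == "__main__":
    print(installed_version() or "wisup_e2m is not installed")
```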

Usage

Here's a simple example demonstrating how to use E2M:

from wisup_e2m import E2MParser

# Initialize the parser with your configuration file
ep = E2MParser.from_config("config.yaml")

# Parse the desired file
data = ep.parse(file_name="/path/to/file.pdf")

# Print the parsed data as a dictionary
print(data.to_dict())
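
When several files need converting, the same two calls can be wrapped in a loop. The sketch below assumes only what the example above shows: a parser object with a parse(file_name=...) method whose result has to_dict(). The helper convert_many is hypothetical and takes the parser as an argument, so it does not import E2M itself.

```python
from typing import Any, Dict, Iterable


def convert_many(parser: Any, paths: Iterable[str]) -> Dict[str, Any]:
    """Parse each path with the given parser and collect the results by path.

    `parser` is expected to behave like the E2MParser instance in the usage
    example: parse(file_name=...) returns an object that offers to_dict().
    """
    results: Dict[str, Any] = {}
    for path in paths:
        data = parser.parse(file_name=str(path))
        results[str(path)] = data.to_dict()
    return results


# Possible usage, assuming wisup_e2m is installed and config.yaml exists:
#   from wisup_e2m import E2MParser
#   ep = E2MParser.from_config("config.yaml")
#   results = convert_many(ep, ["/path/to/a.pdf", "/path/to/b.docx"])
```

Passing the parser in, rather than constructing it inside the helper, keeps the configuration in one place and makes the loop easy to exercise with a stand-in object.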

Config Template

parsers:
    doc_parser:
        engine: "unstructured"
        langs: ["en", "zh"]
    docx_parser:
        engine: "unstructured"
        langs: ["en", "zh"]
    epub_parser:
        engine: "unstructured"
        langs: ["en", "zh"]
    html_parser:
        engine: "unstructured"
        langs: ["en", "zh"]
    pdf_parser:
        engine: "marker"
        langs: ["en", "zh"]
    pptx_parser:
        engine: "unstructured"
        langs: ["en", "zh"]
    voice_parser:
        engine: "openai-whisper"
        langs: ["en", "zh"]
        openai_whisper_model: "large"
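
Because most sections of the template differ only in their engine, a config file of this shape can also be generated from a short table. The following is a minimal sketch using only the standard library; render_parser_config is a hypothetical helper, it covers only the engine and langs keys, and extra keys such as openai_whisper_model would need to be added by hand.

```python
from typing import Dict, List


def render_parser_config(engines: Dict[str, str], langs: List[str]) -> str:
    """Render a parsers section in the same layout as the template above.

    `engines` maps a section name (for example "pdf_parser") to its engine.
    Every section receives the same `langs` list.
    """
    lang_list = ", ".join(f'"{lang}"' for lang in langs)
    lines = ["parsers:"]
    for section, engine in engines.items():
        lines.append(f"    {section}:")
        lines.append(f'        engine: "{engine}"')
        lines.append(f"        langs: [{lang_list}]")
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    text = render_parser_config(
        {"pdf_parser": "marker", "docx_parser": "unstructured"},
        ["en", "zh"],
    )
    # Write the generated text to the file name used in the usage example.
    with open("config.yaml", "w", encoding="utf-8") as handle:
        handle.write(text)
```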

License

This project is licensed under the MIT License. See the LICENSE file for details.

Contact

For any questions or inquiries, please open an issue on GitHub or contact us at team@wisup.ai.

Download files

Source Distribution

wisup_e2m-0.1.22.tar.gz (16.3 kB)

Uploaded Source

Built Distribution

wisup_e2m-0.1.22-py3-none-any.whl (27.7 kB)

Uploaded Python 3

File details

Details for the file wisup_e2m-0.1.22.tar.gz.

File metadata

  • Download URL: wisup_e2m-0.1.22.tar.gz
  • Size: 16.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.8.3 CPython/3.10.14 Darwin/23.5.0

File hashes

Hashes for wisup_e2m-0.1.22.tar.gz
Algorithm    Hash digest
SHA256       02f142b490699940efe1e6ba65b186fd2901f22e1da9bbff0d8b9a0814adf83b
MD5          c52875b94168400cc5f862a10973f0c1
BLAKE2b-256  72dc3d32bdfa7cddfe408cfb1593b7073da175396fb1b252af86b0580f49f4db

File details

Details for the file wisup_e2m-0.1.22-py3-none-any.whl.

File metadata

  • Download URL: wisup_e2m-0.1.22-py3-none-any.whl
  • Size: 27.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.8.3 CPython/3.10.14 Darwin/23.5.0

File hashes

Hashes for wisup_e2m-0.1.22-py3-none-any.whl
Algorithm    Hash digest
SHA256       88b1a4fde686a844c37d8e155e2472684d16695f4dbb24fcf03513a2dd6ceca1
MD5          0cd91d8b634d44a0d37b47f5b60d66c3
BLAKE2b-256  7995c9dca41cadfd5596f56173fb5fc146462be6a33e7385c78a9b19ec136307
