Skip to main content

llama-index readers MarkItDown integration

Project description

LlamaIndex MarkItDown Reader Integration

MarkItDown is a powerful tool that converts various file formats to Markdown.

llama-index-readers-markitdown is an integration that uses MarkItDown to extract text from various file formats, supporting:

  • .txt files and text-based files without extension
  • .csv, .xml and .json files
  • HTML files (.html)
  • Presentations (.pptx)
  • Word documents (.docx)
  • PDF documents (.pdf)
  • ZIP files (.zip)

You can install it via:

pip install llama-index-readers-markitdown

And you can use it in your scripts as follows:

from llama_index.readers.markitdown import MarkItDownReader

reader = MarkItDownReader()
documents = reader.load_data("presentation.pptx")

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

llama_index_readers_markitdown-0.2.1.tar.gz (3.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

File details

Details for the file llama_index_readers_markitdown-0.2.1.tar.gz.

File metadata

File hashes

Hashes for llama_index_readers_markitdown-0.2.1.tar.gz
Algorithm Hash digest
SHA256 c4c46c3e8fbe8aad02dbbc9221244398900a2b187ce80fba0eed20599ae16a47
MD5 674acb1ebca7d21a8b9887f40992ef7d
BLAKE2b-256 5b57142e438146e16e35ef63893d5965bac206c0d0337c7ce2a456e358726709

See more details on using hashes here.

File details

Details for the file llama_index_readers_markitdown-0.2.1-py3-none-any.whl.

File metadata

File hashes

Hashes for llama_index_readers_markitdown-0.2.1-py3-none-any.whl
Algorithm Hash digest
SHA256 48cf4883928fa246e929f344d48697bcc19efa03f29d80777568277bdcb014f1
MD5 acc17698422038a0b573381c20db5124
BLAKE2b-256 a11db80d55a56db0dec156d0a5ef1f826006d6f3c83d64371c9d66364e04dece

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page