Skip to main content

llama-index readers MarkItDown integration

Project description

LlamaIndex MarkItDown Reader Integration

MarkItDown is a powerful tool that converts various file formats to Markdown.

llama-index-readers-markitdown is an integration that uses MarkItDown to extract text from various file formats, supporting:

  • .txt files and text-based files without extension
  • .csv, .xml and .json files
  • HTML files (.html)
  • Presentations (.pptx)
  • Word documents (.docx)
  • PDF documents (.pdf)
  • ZIP files (.zip)

You can install it via:

pip install llama-index-readers-markitdown

And you can use it in your scripts as follows:

from llama_index.readers.markitdown import MarkItDownReader

reader = MarkItDownReader()
documents = reader.load_data("presentation.pptx")

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

llama_index_readers_markitdown-0.2.0.tar.gz (3.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

File details

Details for the file llama_index_readers_markitdown-0.2.0.tar.gz.

File metadata

File hashes

Hashes for llama_index_readers_markitdown-0.2.0.tar.gz
Algorithm Hash digest
SHA256 59d6b0ae986a26307cae060abef02a46b5be8c9e44b779978a48a007b81c13bd
MD5 1d82509355f5b104d082b408369252b3
BLAKE2b-256 4962f0a04cb993ab01fd6277dd39f49e8542ff426aad93bfbcd75b5ea10efacf

See more details on using hashes here.

File details

Details for the file llama_index_readers_markitdown-0.2.0-py3-none-any.whl.

File metadata

File hashes

Hashes for llama_index_readers_markitdown-0.2.0-py3-none-any.whl
Algorithm Hash digest
SHA256 f7c60934051ee73386f54de173a087959bbd1fa31bd8afeb18e2398e4dcb26c7
MD5 82094b1dfe6ce888ca413917c6765f05
BLAKE2b-256 8827f458e2523f1d8c7e19921e1c35ab7dd405f0e44cdc9262ab07c38ec6b835

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page