Skip to main content

llama-index packs llama_dataset_metadata integration

Project description

LlamaDataset Metadata Pack

As part of the LlamaDataset submission package into llamahub, two metadata files are required, namely: card.json and README.md. This pack creates these two files and saves them to disk to help expedite the submission process.

CLI Usage

You can download llamapacks directly using llamaindex-cli, which comes installed with the llama-index python package:

llamaindex-cli download-llamapack LlamaDatasetMetadataPack --download-dir ./llama_dataset_metadata_pack

You can then inspect the files at ./llama_dataset_metadata_pack and use them as a template for your own project!

Code Usage

You can download the pack to the ./llama_dataset_metadata_pack directory through python code as well. The sample script below demonstrates how to construct LlamaDatasetMetadataPack using a LabelledRagDataset downloaded from llama-hub and a simple RAG pipeline built off of its source documents.

from llama_index.core.llama_pack import download_llama_pack

# Download and install dependencies
LlamaDatasetMetadataPack = download_llama_pack(
    "LlamaDatasetMetadataPack", "./llama_dataset_metadata_pack"
)

# construction requires a query_engine, a rag_dataset, and optionally a judge_llm
llama_dataset_metadata_pack = LlamaDatasetMetadataPack()

# create and save `card.json` and `README.md` to disk
dataset_description = (
    "A labelled RAG dataset based off an essay by Paul Graham, consisting of "
    "queries, reference answers, and reference contexts."
)

llama_dataset_metadata_pack.run(
    name="Paul Graham Essay Dataset",
    description=dataset_description,
    rag_dataset=rag_dataset,  # defined earlier not shown here
    index=index,  # defined earlier not shown here
    benchmark_df=benchmark_df,  # defined earlier not shown here
    baseline_name="llamaindex",
)

NOTE: this pack should be used only after performing a RAG evaluation (i.e., by using RagEvaluatorPack on a LabelledRagDataset). In the code snippet above, index, rag_dataset, and benchmark_df are all objects that you'd expect to have only after performing the RAG evaluation as mention in the previous sentence.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

File details

Details for the file llama_index_packs_llama_dataset_metadata-0.4.1.tar.gz.

File metadata

File hashes

Hashes for llama_index_packs_llama_dataset_metadata-0.4.1.tar.gz
Algorithm Hash digest
SHA256 e28c9f14482cad8e08ed6dc8d8be8b5b1d99a8ec3a8a9d65db40dc2c8bf32c53
MD5 32914ce83bad9a356e00e12c49e131b8
BLAKE2b-256 03c419356b81a2b1adfdaf28975a34f1a7e52e72038d08b62adfe1b79a51f641

See more details on using hashes here.

File details

Details for the file llama_index_packs_llama_dataset_metadata-0.4.1-py3-none-any.whl.

File metadata

File hashes

Hashes for llama_index_packs_llama_dataset_metadata-0.4.1-py3-none-any.whl
Algorithm Hash digest
SHA256 b2d788753b90d17675fd2cd5d4b7f783df14d0c3d7f349c875517749948af4fc
MD5 e3df39f0706a14f2a8adfad4f0cd7147
BLAKE2b-256 67b266e44d479133633cbffc1a1dd8655d714296f6bd1d3ead01d8ab8f005e52

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page