Skip to main content

llama-index readers airbyte_typeform integration

Project description

Airbyte Typeform Loader

pip install llama-index-readers-airbyte-typeform

The Airbyte Typeform Loader allows you to access different Typeform objects.

Usage

Here's an example usage of the AirbyteTypeformReader.

from llama_index.readers.airbyte_typeform import AirbyteTypeformReader

typeform_config = {
    # ...
}
reader = AirbyteTypeformReader(config=typeform_config)
documents = reader.load_data(stream_name="forms")

Configuration

Check out the Airbyte documentation page for details about how to configure the reader. The JSON schema the config object should adhere to can be found on Github: https://github.com/airbytehq/airbyte/blob/master/airbyte-integrations/connectors/source-typeform/source_typeform/spec.json.

The general shape looks like this:

{
    "credentials": {
        "auth_type": "Private Token",
        "access_token": "<your auth token>",
    },
    "start_date": "<date from which to start retrieving records from in ISO format, e.g. 2020-10-20T00:00:00Z>",
    "form_ids": [
        "<id of form to load records for>"
    ],  # if omitted, records from all forms will be loaded
}

By default all fields are stored as metadata in the documents and the text is set to the JSON representation of all the fields. Construct the text of the document by passing a record_handler to the reader:

def handle_record(record, id):
    return Document(
        doc_id=id, text=record.data["title"], extra_info=record.data
    )


reader = AirbyteTypeformReader(
    config=typeform_config, record_handler=handle_record
)

Lazy loads

The reader.load_data endpoint will collect all documents and return them as a list. If there are a large number of documents, this can cause issues. By using reader.lazy_load_data instead, an iterator is returned which can be consumed document by document without the need to keep all documents in memory.

Incremental loads

This loader supports loading data incrementally (only returning documents that weren't loaded last time or got updated in the meantime):

reader = AirbyteTypeformReader(config={...})
documents = reader.load_data(stream_name="forms")
current_state = reader.last_state  # can be pickled away or stored otherwise

updated_documents = reader.load_data(
    stream_name="forms", state=current_state
)  # only loads documents that were updated since last time

This loader is designed to be used as a way to load data into LlamaIndex.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

llama_index_readers_airbyte_typeform-0.4.0.tar.gz (4.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

File details

Details for the file llama_index_readers_airbyte_typeform-0.4.0.tar.gz.

File metadata

File hashes

Hashes for llama_index_readers_airbyte_typeform-0.4.0.tar.gz
Algorithm Hash digest
SHA256 fded67d1d33ae9806626623e420d0151edf9d285fbf3a7cf4e33b2e729d12d2d
MD5 f42ccf7373c91729a5abcdc24be46073
BLAKE2b-256 3fb0d75bf84627b571317fd614ad0940af52cbc69d1f92aac1649b16e6602367

See more details on using hashes here.

File details

Details for the file llama_index_readers_airbyte_typeform-0.4.0-py3-none-any.whl.

File metadata

File hashes

Hashes for llama_index_readers_airbyte_typeform-0.4.0-py3-none-any.whl
Algorithm Hash digest
SHA256 90ca1608b49ec9f5ae14282b09fc9b5d1afce17fbc2cffdba4535ff7bdc534ea
MD5 096732b9d7053f4b0a246539db7579bb
BLAKE2b-256 3fd2e7e375fb87d18abb63a4d58bf77ef1b806f702e2d85acf720d70586f1a29

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page