Skip to main content

llama-index readers google integration

Project description

LlamaIndex Integration: Google Readers

Effortlessly incorporate Google-based data loaders into your Python workflow using LlamaIndex. It now supports more advanced operations through the implementation of ResourcesReaderMixin and FileSystemReaderMixin. Unlock the potential of various readers to enhance your data loading capabilities, including:

  • Google Calendar
  • Google Chat
  • Google Docs
  • Google Drive
  • Gmail
  • Google Keep
  • Google Maps
  • Google Sheets

Installation

pip install llama-index-readers-google

Authentication

You will need a credentials.json file from Google Cloud to interact with Google Services. To get this file, follow these steps:

  • Create a new project in the Google Cloud Console
  • Go to APIs & Services -> Library and search for the API you want, e.g. Gmail
  • Go to APIs & Services -> Credentials and create a new OAuth client ID
    • Application type: Web application
    • Authorized redirect URIs: http://localhost:8080/ (the last slash seems important)
  • Go to APIs & Services -> OAuth consent screen and make the app external, which allows you to connect your personal Google data once you explicitly add yourself as an allowed test user
  • Download the credentials JSON file from this screen and save it as credentials.json in the root of your project

See this example for a sample of code that successfully authenticates with Gmail once you have the credentials.json file.

Examples

Google Drive Reader

from llama_index.readers.google import GoogleDriveReader

# Initialize the reader
reader = GoogleDriveReader(
    folder_id="folder_id",
    service_account_key="[SERVICE_ACCOUNT_KEY_JSON]",
)

# Load data
documents = reader.load_data()

# List resources in the drive
resources = reader.list_resources()

# Get information about a specific resource
resource_info = reader.get_resource_info("file.txt")

# Load a specific resource
specific_doc = reader.load_resource("file.txt")

# Read file content directly
file_content = reader.read_file_content("path/to/file.txt")

print(f"Loaded {len(documents)} documents")
print(f"Found {len(resources)} resources")
print(f"Resource info: {resource_info}")
print(f"Specific document: {specific_doc}")
print(f"File content length: {len(file_content)} bytes")

Google Docs Reader

from llama_index.readers.google import GoogleDocsReader

# Specify the document IDs you want to load
document_ids = ["<document_id>"]

# Load data from Google Docs
documents = GoogleDocsReader().load_data(document_ids=document_ids)

Google Sheets Reader (Documents and Dataframes)

from llama_index.readers.google import GoogleSheetsReader

# Specify the list of sheet IDs you want to load
list_of_sheets = ["spreadsheet_id"]

# Create a Google Sheets Reader instance
sheets_reader = GoogleSheetsReader()

# Load data into Pandas in Data Classes of choice (Documents or Dataframes)
documents = sheets.load_data(list_of_sheets)
dataframes = sheets_reader.load_data_in_pandas(list_of_sheets)

Integrate these readers seamlessly to efficiently manage and process your data within your Python environment, providing a robust foundation for your data-driven workflows with LlamaIndex.

Google Maps Text Search Reader

from llama_index.readers.google import GoogleMapsTextSearchReader
from llama_index.core import VectorStoreIndex

loader = GoogleMapsTextSearchReader()
documents = loader.load_data(
    text="I want to eat quality Turkish food in Istanbul",
    number_of_results=160,
)


index = VectorStoreIndex.from_documents(documents)
index.query("Which Turkish restaurant has the best reviews?")

Google Chat Reader

from llama_index.readers.google import GoogleChatReader
from llama_index.core import VectorStoreIndex

space_names = ["<CHAT_ID>"]
chatReader = GoogleChatReader()
docs = chatReader.load_data(space_names=space_names)
index = VectorStoreIndex.from_documents(docs)
query_eng = index.as_query_engine()
print(query_eng.query("What was this conversation about?"))

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

llama_index_readers_google-0.7.1.tar.gz (23.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

llama_index_readers_google-0.7.1-py3-none-any.whl (34.4 kB view details)

Uploaded Python 3

File details

Details for the file llama_index_readers_google-0.7.1.tar.gz.

File metadata

File hashes

Hashes for llama_index_readers_google-0.7.1.tar.gz
Algorithm Hash digest
SHA256 51c3b92f097ee73c6ffc3d5a015a16a6e7200c2f845b4121ef3efdec6ebaed95
MD5 beb6c50ea9524d6e86e70b12a08452ce
BLAKE2b-256 58c5d2923d0f1038cbe1d6441508c1343a53f307b5698d218330876c24cfd1b0

See more details on using hashes here.

File details

Details for the file llama_index_readers_google-0.7.1-py3-none-any.whl.

File metadata

File hashes

Hashes for llama_index_readers_google-0.7.1-py3-none-any.whl
Algorithm Hash digest
SHA256 54fdc50b49bacca0132b6006fcf9518ab3887d806ada600c4b8774286eb34e35
MD5 24ce4468af209aa09738aaa10536f0a1
BLAKE2b-256 3663cd1f3b060615a30e6cbc9676e9b586678c1ce0580e6ffdc96a78e8d4f80d

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page