A Lakehouse LLM Explorer: a wrapper for Spark, Databricks, and LangChain processes.

Development Status
- 3 - Alpha
Intended Audience
- Developers
License
- OSI Approved :: MIT License
Operating System
- OS Independent
Programming Language
- Python :: 3
- Python :: 3.10
Topic
- Software Development :: Libraries :: Python Modules

Project description

Occlusion LLM Explorer

Lakehouse Analytics & Advanced ML

Setup

Important: This package requires an OpenAI API key and a HuggingFace API key.

PyPI

python -m pip install llm-explorer

touch main.py

from llm_explorer import main

if __name__ == "__main__":
    main()

python -m streamlit run main.py

The initial load can take some time because it downloads the model and the tokenizer. Remember to include the secrets.toml file under the .streamlit/ folder.

Build from source

Create a virtual environment

conda create -n occlusion python=3.10
conda activate occlusion

Install the requirements

pip install -r requirements.txt

Run the main.py script using Streamlit:

python -m streamlit run main.py

Usage

Use the demo@occlusion.solutions user and the DEMO@occlusion password to log in.

The deployment requires a secrets.toml file created under .streamlit/:

mkdir -p .streamlit && touch .streamlit/secrets.toml

It should have a schema like this:

[connections.openai]
api_key="sk-..." # OpenAI API Key

[connections.huggingface]
api_key="hf_..." # HuggingFace API Key

[connections.databricks]
server_hostname="your databricks host"
http_path="http path under cluster JDBC/ODBC connectivity"
access_token="your databricks access token"
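
A missing or empty entry in this file tends to surface only when the app first tries to connect, so it can help to check the schema up front. The following is a minimal sketch, not part of llm-explorer: `REQUIRED_KEYS` and `missing_secret_keys` are hypothetical names, and the function assumes the file has already been parsed into a mapping (for example by a TOML parser, or by reading Streamlit's `st.secrets`).

```python
from typing import Any, Mapping

# Sections and keys copied from the secrets.toml schema shown above.
REQUIRED_KEYS = {
    "openai": ("api_key",),
    "huggingface": ("api_key",),
    "databricks": ("server_hostname", "http_path", "access_token"),
}


def missing_secret_keys(secrets: Mapping[str, Any]) -> list[str]:
    """Return dotted names of required entries that are absent or empty."""
    connections = secrets.get("connections", {})
    missing = []
    for section, keys in REQUIRED_KEYS.items():
        values = connections.get(section, {})
        for key in keys:
            # Treat an empty string the same as a missing key.
            if not values.get(key):
                missing.append(f"connections.{section}.{key}")
    return missing
```

An empty list means every entry from the schema is present; otherwise the returned names show which lines still need to be filled in.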

Lakehouse Agent Sample

The agent is queried for the top 10 producing wells. It identifies the tables it has access to and determines that the request can be satisfied by the padalloc table. It then creates a query that selects the top 10 producing assets and returns the results.

> Entering new AgentExecutor chain...

Observation: logs, wells
Thought: I should look at the schema of the microchip_logs and padalloc tables to see what columns I can use.

Action: schema_sql_db
Action Input: "wells"
Observation: DDL
Thought: I should query the padalloc table to get the top 10 producing wells.

Action: query_sql_db
Action Input: "SELECT WELL_CODE, SUM(PROD_GAS_VOLUME_MCF) AS total_gas_volume_mcf FROM padalloc GROUP BY WELL_CODE ORDER BY total_gas_volume_mcf DESC LIMIT 10"
Observation: results_dataframe
Thought: I now know the top 10 producing wells.

Final Answer: The top 10 producing wells are 1222344, 1212560, 1222345, 1212503, 1222335, 1222340, 1222338, 1222367, 1220189, and 1222352.

> Finished chain.
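
To see what the generated query computes, it can be run against a small in-memory table. This is only an illustrative sketch: it uses SQLite from the Python standard library instead of Databricks, the `top_producing_wells` helper and the rows in `SAMPLE_ROWS` are made up for the example, and only the table name, column names, and SQL text come from the log above.

```python
import sqlite3

# SQL text as generated by the agent in the log above.
QUERY = """
SELECT WELL_CODE, SUM(PROD_GAS_VOLUME_MCF) AS total_gas_volume_mcf
FROM padalloc
GROUP BY WELL_CODE
ORDER BY total_gas_volume_mcf DESC
LIMIT 10
"""

# Hypothetical rows: (well code, gas volume in MCF). Not real production data.
SAMPLE_ROWS = [("W-A", 100.0), ("W-B", 250.0), ("W-A", 200.0), ("W-C", 50.0)]


def top_producing_wells(rows):
    """Load rows into an in-memory padalloc table and run the agent's query."""
    conn = sqlite3.connect(":memory:")
    try:
        conn.execute(
            "CREATE TABLE padalloc (WELL_CODE TEXT, PROD_GAS_VOLUME_MCF REAL)"
        )
        conn.executemany("INSERT INTO padalloc VALUES (?, ?)", rows)
        return conn.execute(QUERY).fetchall()
    finally:
        conn.close()


if __name__ == "__main__":
    for well_code, total in top_producing_wells(SAMPLE_ROWS):
        print(well_code, total)
```

The query sums the volume per well before ranking, so a well with several smaller entries (W-A here) can outrank a well with one larger entry.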

Attribution

This is an adapted implementation of the following GitHub repository. See its contributors list for more details:

https://github.com/kaarthik108/snowChat

Release history

- 0.0.9: Jun 4, 2023
- 0.0.7: Jun 4, 2023
- 0.0.6: Jun 3, 2023
- 0.0.5: Jun 3, 2023 (this version)
- 0.0.4: Jun 3, 2023
- 0.0.3: Jun 3, 2023
- 0.0.2: Jun 3, 2023
- 0.0.1: Jun 3, 2023

Download files

Source Distribution

llm_explorer-0.0.5.tar.gz (36.5 kB), uploaded Jun 3, 2023

Built Distribution

llm_explorer-0.0.5-py2.py3-none-any.whl (29.9 kB), uploaded Jun 3, 2023, tagged Python 2 and Python 3

Hashes for llm_explorer-0.0.5.tar.gz

Algorithm	Hash digest
SHA256	`7846de64b7c6052a9b60ee2a5ae1ac1265579a39371c1d72156cc6d511d147f8`
MD5	`fbf15d2d0ed9024836efbb742a039c34`
BLAKE2b-256	`3b576d0fe1ff1b4cb079c5474ca74c9423472e0dc938de09c32ad20b3a20de60`

Hashes for llm_explorer-0.0.5-py2.py3-none-any.whl

Algorithm	Hash digest
SHA256	`5b52637f90a1e2a64115675bc83b49a37b06c59659649a5c17cb4164198c7c65`
MD5	`ffc4a0f623dd13cea9391277db898a97`
BLAKE2b-256	`8949093d1c6dffe746d92b7f6d3edf8f4330342c78f3507304f689f19f585c1c`