Engage with your data (SQL, CSV, pandas, Polars, MongoDB, NoSQL, etc.) using Ollama, an open-source tool that runs locally. Datadashr turns data analysis into a conversational experience powered by Ollama LLMs and RAG.
Description
Converse with Your Data Through Open Source AI.
Unleash the power of your data with natural language questions.
Our open-source platform, built on Ollama, delivers powerful insights without the cost of paid APIs.
Integrate effortlessly with your existing infrastructure, connecting to various data sources including SQL, NoSQL, CSV, and XLS files.
Obtain in-depth analytics by aggregating data from multiple sources into a unified platform, providing a holistic view of your business.
Convert raw data into valuable insights, facilitating data-driven strategies and enhancing decision-making processes.
Design intuitive and interactive charts and visual representations to simplify the understanding and interpretation of your business metrics.
Installation
To install the package, run the following command:
pip install datadashr
Starting the Interface
To start the user interface, run the following command:
datadashr
Usage Example
import pandas as pd
from pprint import pprint
from datadashr import DataDashr
from datadashr.core.llm import OllamaLLM
# Create DataFrame containing employee details
employees_df = pd.DataFrame({
    'employeeid': [1, 2, 3],
    'name': ['Alice', 'Bob', 'Charlie'],
    'department': ['HR', 'IT', 'Finance']
})
# Create DataFrame containing salary information for employees
salaries_df = pd.DataFrame({
    'employeeid': [1, 2, 3],
    'salary': [50000, 60000, 70000]
})
# Create DataFrame containing department information and their managers
departments_df = pd.DataFrame({
    'department': ['HR', 'IT', 'Finance'],
    'manager': ['Dave', 'Eva', 'Frank']
})
# Create DataFrame containing project details and employee assignments
projects_df = pd.DataFrame({
    'projectid': [101, 102, 103],
    'projectname': ['Project A', 'Project B', 'Project C'],
    'employeeid': [1, 2, 3]
})
# Structure to import and map the data sources
import_data = {
    'sources': [
        {"source_name": "employees_df", "data": employees_df, "source_type": "pandas",
         "description": "Contains employee details including their department."},
        {"source_name": "salaries_df", "data": salaries_df, "source_type": "pandas",
         "description": "Contains salary information for employees."},
        {"source_name": "departments_df", "data": departments_df, "source_type": "pandas",
         "description": "Contains information about departments and their managers."},
        {"source_name": "projects_df", "data": projects_df, "source_type": "pandas",
         "description": "Contains information about projects and the employees assigned to them."},
    ],
    'mapping': {
        "employeeid": ['employees_df', 'salaries_df', 'projects_df'],  # Mapping employeeid across three DataFrames
        "department": ['employees_df', 'departments_df']  # Mapping department across two DataFrames
    }
}
# Initialize the LLM (Large Language Model) instance with specific parameters
llm = OllamaLLM(model='codestral', params={"temperature": 0.0}, verbose=False)
# Initialize the DataDashr object with imported data and LLM instance
df = DataDashr(data=import_data, llm_instance=llm, verbose=False, enable_cache=True, format_type='data')
# Query the combined data for the employee with the highest salary and that salary
result = df.chat('Show the employee with the highest salary and the salary')
# Print the result
pprint(result)
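The same pattern extends to file-based data: anything pandas can read (CSV, XLS, ...) can be loaded into a DataFrame and registered as a "pandas" source alongside the others. Below is a minimal sketch, not part of the official example; the file sales.csv and its employeeid and amount columns are hypothetical.
import pandas as pd
from pprint import pprint
from datadashr import DataDashr
from datadashr.core.llm import OllamaLLM
# Hypothetical CSV file with employeeid and amount columns; any format pandas can read works the same way
sales_df = pd.read_csv('sales.csv')
# Small in-memory DataFrame with employee names
employees_df = pd.DataFrame({
    'employeeid': [1, 2, 3],
    'name': ['Alice', 'Bob', 'Charlie']
})
import_data = {
    'sources': [
        {"source_name": "employees_df", "data": employees_df, "source_type": "pandas",
         "description": "Contains employee names."},
        {"source_name": "sales_df", "data": sales_df, "source_type": "pandas",
         "description": "Contains sales amounts per employee, loaded from a CSV file."},
    ],
    'mapping': {
        "employeeid": ['employees_df', 'sales_df']  # employeeid links the CSV data to the employee names
    }
}
llm = OllamaLLM(model='codestral', params={"temperature": 0.0}, verbose=False)
df = DataDashr(data=import_data, llm_instance=llm, verbose=False, enable_cache=True, format_type='data')
pprint(df.chat('Show total sales per employee'))
Because the CSV is registered through pandas, no extra connector configuration is needed; the mapping joins it to the other sources by the shared employeeid column.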
Hashes for datadashr-0.2.0-py3-none-any.whl
Algorithm | Hash digest
---|---
SHA256 | d0bca38594313548c3d244fd4a376858a84d8a8a4b6891e82badfcd5b6a13fd4
MD5 | 04f390bcbf9b0cadc20c7fd3b383679e
BLAKE2b-256 | ba544edd9bf204faa5fb17a150acb9b277a495ca19ab2122a4ba628cc0be8cc9
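To verify a downloaded wheel against the SHA256 digest above, Python's standard hashlib is enough. A minimal sketch; the local file path is an assumption and should point to wherever the wheel was saved.
import hashlib
# Path to the downloaded wheel (adjust to the actual location on your machine)
wheel_path = 'datadashr-0.2.0-py3-none-any.whl'
# Expected SHA256 digest from the table above
expected = 'd0bca38594313548c3d244fd4a376858a84d8a8a4b6891e82badfcd5b6a13fd4'
# Hash the file contents and compare against the published digest
with open(wheel_path, 'rb') as f:
    digest = hashlib.sha256(f.read()).hexdigest()
print('OK' if digest == expected else 'MISMATCH')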