Query dataframes, find issue with your notebook snippets as if a professional data scientist was pair coding with you
Project description
date-a-scientist
Query dataframes, find issue with your notebook snippets as if a professional data scientist was pair coding with you.
Currently just a thin wrapper around an amazing library called pandas-ai
by sinaptik-ai!
How to use it?
from date_a_scientist import DateAScientist
import pandas as pd
df = pd.DataFrame(
[
{"name": "Alice", "age": 25, "city": "New York"},
{"name": "Bob", "age": 30, "city": "Los Angeles"},
{"name": "Charlie", "age": 35, "city": "Chicago"},
]
)
ds = DateAScientist(
df=df,
llm_openai_api_token=..., # your OpenAI API token goes here
llm_model_name="gpt-3.5-turbo", # by default, it uses "gpt-4o"
)
# should return "Alice"
ds.chat("What is the name of the first person?")
Additionally we can pass a description of fields, so that more meaningful questions can be asked:
ds = DateAScientist(
df=df,
llm_openai_api_token=..., # your OpenAI API token goes here
llm_model_name="gpt-3.5-turbo", # by default, it uses "gpt-4o"
column_descriptions={
"name": "The name of the person",
"age": "The age of the person",
"city": "The city where the person lives",
},
)
ds = DateAScientist(
df=df,
llm_openai_api_token=..., # your OpenAI API token goes here
llm_model_name="gpt-3.5-turbo", # by default, it uses "gpt-4o"
)
# should return DataFrame with Chicago rows
ds.chat("Who lives in Chicago?")
Finally if you want to get the code that was generated, you can use ds.code()
:
ds.code("Who lives in Chicago?")
which will return monokai styled code. If you want to return plain code, you can use:
ds.code("Who lives in Chicago?", return_as_string=True)
Inspirations
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file date_a_scientist-0.1.19.tar.gz
.
File metadata
- Download URL: date_a_scientist-0.1.19.tar.gz
- Upload date:
- Size: 6.7 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/1.8.3 CPython/3.11.3 Darwin/24.0.0
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 6bfbbc83a33c32b6b04c4399de41b109b0c6cecc651fa27c936ec45cde1c9006 |
|
MD5 | 1f9a81f964f9515605d306fde3680dde |
|
BLAKE2b-256 | 2f3ee39992e5386c9d409fe1dd2d9a517612a5efd118acec4420fe348d921d2d |
File details
Details for the file date_a_scientist-0.1.19-py3-none-any.whl
.
File metadata
- Download URL: date_a_scientist-0.1.19-py3-none-any.whl
- Upload date:
- Size: 7.2 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/1.8.3 CPython/3.11.3 Darwin/24.0.0
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | bc1f9815ca8911f624649e5283464f2d86a6a7332b515e59e8239f2abca53efb |
|
MD5 | 8d12f784cfe158c9aa3efee2a82301ff |
|
BLAKE2b-256 | 67089be71365a391f04c04f4038482a8cfca57b7dc0d1903a59d558f262995fe |