Skip to main content

Query dataframes, find issue with your notebook snippets as if a professional data scientist was pair coding with you

Project description

date-a-scientist Logo

date-a-scientist

Query dataframes, find issue with your notebook snippets as if a professional data scientist was pair coding with you.

Currently just a thin wrapper around an amazing library called pandas-ai by sinaptik-ai!

How to use it?

from date_a_scientist import DateAScientist
import pandas as pd

df = pd.DataFrame(
    [
        {"name": "Alice", "age": 25, "city": "New York"},
        {"name": "Bob", "age": 30, "city": "Los Angeles"},
        {"name": "Charlie", "age": 35, "city": "Chicago"},
    ]
)
ds = DateAScientist(
    df=df,
    llm_openai_api_token=...,  # your OpenAI API token goes here
    llm_model_name="gpt-3.5-turbo",  # by default, it uses "gpt-4o"
)

# should return "Alice"
ds.chat("What is the name of the first person?")

Additionally we can pass a description of fields, so that more meaningful questions can be asked:

ds = DateAScientist(
    df=df,
    llm_openai_api_token=...,  # your OpenAI API token goes here
    llm_model_name="gpt-3.5-turbo",  # by default, it uses "gpt-4o"
    column_descriptions={
        "name": "The name of the person",
        "age": "The age of the person",
        "city": "The city where the person lives",
    },
)

ds = DateAScientist(
    df=df,
    llm_openai_api_token=...,  # your OpenAI API token goes here
    llm_model_name="gpt-3.5-turbo",  # by default, it uses "gpt-4o"
)

# should return DataFrame with Chicago rows
ds.chat("Who lives in Chicago?")

Finally if you want to get the code that was generated, you can use ds.code():

ds.code("Who lives in Chicago?")

which will return monokai styled code. If you want to return plain code, you can use:

ds.code("Who lives in Chicago?", return_as_string=True)

Inspirations

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

date_a_scientist-0.1.19.tar.gz (6.7 kB view details)

Uploaded Source

Built Distribution

date_a_scientist-0.1.19-py3-none-any.whl (7.2 kB view details)

Uploaded Python 3

File details

Details for the file date_a_scientist-0.1.19.tar.gz.

File metadata

  • Download URL: date_a_scientist-0.1.19.tar.gz
  • Upload date:
  • Size: 6.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.8.3 CPython/3.11.3 Darwin/24.0.0

File hashes

Hashes for date_a_scientist-0.1.19.tar.gz
Algorithm Hash digest
SHA256 6bfbbc83a33c32b6b04c4399de41b109b0c6cecc651fa27c936ec45cde1c9006
MD5 1f9a81f964f9515605d306fde3680dde
BLAKE2b-256 2f3ee39992e5386c9d409fe1dd2d9a517612a5efd118acec4420fe348d921d2d

See more details on using hashes here.

File details

Details for the file date_a_scientist-0.1.19-py3-none-any.whl.

File metadata

File hashes

Hashes for date_a_scientist-0.1.19-py3-none-any.whl
Algorithm Hash digest
SHA256 bc1f9815ca8911f624649e5283464f2d86a6a7332b515e59e8239f2abca53efb
MD5 8d12f784cfe158c9aa3efee2a82301ff
BLAKE2b-256 67089be71365a391f04c04f4038482a8cfca57b7dc0d1903a59d558f262995fe

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page