date-a-scientist

Query dataframes and find issues in your notebook snippets, as if a professional data scientist were pair coding with you.

Currently just a thin wrapper around an amazing library called pandas-ai by sinaptik-ai!

How to use it?

from date_a_scientist import DateAScientist
import pandas as pd

df = pd.DataFrame(
    [
        {"name": "Alice", "age": 25, "city": "New York"},
        {"name": "Bob", "age": 30, "city": "Los Angeles"},
        {"name": "Charlie", "age": 35, "city": "Chicago"},
    ]
)
ds = DateAScientist(
    df=df,
    llm_openai_api_token=...,  # your OpenAI API token goes here
    llm_model_name="gpt-3.5-turbo",  # by default, it uses "gpt-4o"
)

# should return "Alice"
ds.chat("What is the name of the first person?")
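The example above leaves the token as a placeholder. One way to avoid hard-coding it is to read it from the environment. The sketch below is only an illustration: the helper name `build_scientist_kwargs` and the `OPENAI_API_KEY` variable name are assumptions, not part of date-a-scientist; only the three keyword arguments come from the example above.

```python
import os
from typing import Any, Dict


def build_scientist_kwargs(df: Any, env_var: str = "OPENAI_API_KEY") -> Dict[str, Any]:
    """Collect DateAScientist constructor arguments (hypothetical helper).

    The token is read from an environment variable so that it never
    appears in the notebook itself.
    """
    token = os.environ.get(env_var)
    if not token:
        raise RuntimeError(f"Set the {env_var} environment variable first")
    return {
        "df": df,
        "llm_openai_api_token": token,
        "llm_model_name": "gpt-3.5-turbo",  # same model name as in the example above
    }
```

With the variable set, the object could then be created as `ds = DateAScientist(**build_scientist_kwargs(df))`.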

Additionally, we can pass descriptions of the columns so that more meaningful questions can be asked:

ds = DateAScientist(
    df=df,
    llm_openai_api_token=...,  # your OpenAI API token goes here
    llm_model_name="gpt-3.5-turbo",  # by default, it uses "gpt-4o"
    column_descriptions={
        "name": "The name of the person",
        "age": "The age of the person",
        "city": "The city where the person lives",
    },
)

# should return a DataFrame with the Chicago rows
ds.chat("Who lives in Chicago?")
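Column descriptions only help if their keys match the actual column names. Whether the library validates this itself is not stated here, so a small check before constructing the object may be useful. This is a minimal sketch; `check_column_descriptions` is a hypothetical helper, not part of date-a-scientist.

```python
from typing import Dict, Iterable, List, Tuple


def check_column_descriptions(
    columns: Iterable[str], descriptions: Dict[str, str]
) -> Tuple[List[str], List[str]]:
    """Compare column names with description keys (hypothetical helper).

    Returns (undocumented, unknown): columns that have no description,
    and description keys that match no column (often typos).
    """
    cols = list(columns)
    undocumented = [c for c in cols if c not in descriptions]
    unknown = [k for k in descriptions if k not in cols]
    return undocumented, unknown
```

For the DataFrame above, `check_column_descriptions(df.columns, column_descriptions)` should return two empty lists when every column is described and no key is misspelled.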

Finally, if you want to see the code that was generated, you can use ds.code():

ds.code("Who lives in Chicago?")

This returns Monokai-styled code. To get the code as a plain string instead, use:

ds.code("Who lives in Chicago?", return_as_string=True)
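Because the plain form is an ordinary string, it can be inspected before anyone runs or saves it. The sketch below assumes that string is valid Python source. It uses only the standard-library `ast` module, and both `summarize_generated_code` and the `sample` string are hypothetical stand-ins rather than real output from the library.

```python
import ast
from typing import Dict, List


def summarize_generated_code(source: str) -> Dict[str, List[str]]:
    """List the modules imported and the functions called in a code string."""
    tree = ast.parse(source)  # raises SyntaxError if the string is not valid Python
    imports, calls = set(), set()
    for node in ast.walk(tree):
        if isinstance(node, ast.Import):
            imports.update(alias.name for alias in node.names)
        elif isinstance(node, ast.ImportFrom):
            imports.add(node.module or "")
        elif isinstance(node, ast.Call):
            func = node.func
            if isinstance(func, ast.Name):
                calls.add(func.id)  # plain call such as print(...)
            elif isinstance(func, ast.Attribute):
                calls.add(func.attr)  # method call such as result.head()
    return {"imports": sorted(imports), "calls": sorted(calls)}


# Hypothetical stand-in for ds.code("Who lives in Chicago?", return_as_string=True)
sample = 'import pandas as pd\nresult = df[df["city"] == "Chicago"]\nprint(result.head())\n'
summary = summarize_generated_code(sample)
```

A summary like this gives a quick view of what a generated snippet touches before it is pasted into a notebook.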

Inspirations
