Skip to main content

Query dataframes, find issue with your notebook snippets as if a professional data scientist was pair coding with you

Project description

date-a-scientist Logo

date-a-scientist

Query dataframes, find issue with your notebook snippets as if a professional data scientist was pair coding with you.

Currently just a thin wrapper around an amazing library called pandas-ai by sinaptik-ai!

How to use it?

from date_a_scientist import DateAScientist
import pandas as pd

df = pd.DataFrame(
    [
        {"name": "Alice", "age": 25, "city": "New York"},
        {"name": "Bob", "age": 30, "city": "Los Angeles"},
        {"name": "Charlie", "age": 35, "city": "Chicago"},
    ]
)
ds = DateAScientist(
    df=df,
    llm_openai_api_token=...,  # your OpenAI API token goes here
    llm_model_name="gpt-3.5-turbo",  # by default, it uses "gpt-4o"
)

# should return "Alice"
ds.chat("What is the name of the first person?")

Additionally we can pass a description of fields, so that more meaningful questions can be asked:

ds = DateAScientist(
    df=df,
    llm_openai_api_token=...,  # your OpenAI API token goes here
    llm_model_name="gpt-3.5-turbo",  # by default, it uses "gpt-4o"
    column_descriptions={
        "name": "The name of the person",
        "age": "The age of the person",
        "city": "The city where the person lives",
    },
)

ds = DateAScientist(
    df=df,
    llm_openai_api_token=...,  # your OpenAI API token goes here
    llm_model_name="gpt-3.5-turbo",  # by default, it uses "gpt-4o"
)

# should return DataFrame with Chicago rows
ds.chat("Who lives in Chicago?")

Finally if you want to get the code that was generated, you can use ds.code():

ds.code("Who lives in Chicago?")

which will return monokai styled code. If you want to return plain code, you can use:

ds.code("Who lives in Chicago?", return_as_string=True)

Inspirations

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

date_a_scientist-0.1.18.tar.gz (6.5 kB view details)

Uploaded Source

Built Distribution

date_a_scientist-0.1.18-py3-none-any.whl (7.0 kB view details)

Uploaded Python 3

File details

Details for the file date_a_scientist-0.1.18.tar.gz.

File metadata

  • Download URL: date_a_scientist-0.1.18.tar.gz
  • Upload date:
  • Size: 6.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.8.3 CPython/3.11.3 Darwin/24.0.0

File hashes

Hashes for date_a_scientist-0.1.18.tar.gz
Algorithm Hash digest
SHA256 a6396586451a84d82579c5c9713eeb923dfc0db8f0d477c6693966fb33a51148
MD5 83d698438824771f77760c0415a6aaa5
BLAKE2b-256 3f8b715ea5055a4375a5b31c8f6c5f7e3f9041c1efb26d7ed6f7e918967c7d9d

See more details on using hashes here.

File details

Details for the file date_a_scientist-0.1.18-py3-none-any.whl.

File metadata

  • Download URL: date_a_scientist-0.1.18-py3-none-any.whl
  • Upload date:
  • Size: 7.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.8.3 CPython/3.11.3 Darwin/24.0.0

File hashes

Hashes for date_a_scientist-0.1.18-py3-none-any.whl
Algorithm Hash digest
SHA256 8f9abd24177b4ba5478f763241393071bfb883d3dcb6e685d84e500b47467f49
MD5 4cedc1621cacad695420fbb096b413f6
BLAKE2b-256 5a7bf1cbf0d1c9ba1c736288de6398c4933470984920359bdbd7db27de5303de

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page