Skip to main content

Query dataframes, find issue with your notebook snippets as if a professional data scientist was pair coding with you

Project description

date-a-scientist Logo

date-a-scientist

Query dataframes, find issue with your notebook snippets as if a professional data scientist was pair coding with you.

Currently just a thin wrapper around an amazing library called pandas-ai by sinaptik-ai!

How to use it?

from date_a_scientist import DateAScientist
import pandas as pd

df = pd.DataFrame(
    [
        {"name": "Alice", "age": 25, "city": "New York"},
        {"name": "Bob", "age": 30, "city": "Los Angeles"},
        {"name": "Charlie", "age": 35, "city": "Chicago"},
    ]
)
ds = DateAScientist(
    df=df,
    llm_openai_api_token=...,  # your OpenAI API token goes here
    llm_model_name="gpt-3.5-turbo",  # by default, it uses "gpt-4o"
)

# should return "Alice"
ds.chat("What is the name of the first person?")

Additionally we can pass a description of fields, so that more meaningful questions can be asked:

ds = DateAScientist(
    df=df,
    llm_openai_api_token=...,  # your OpenAI API token goes here
    llm_model_name="gpt-3.5-turbo",  # by default, it uses "gpt-4o"
    column_descriptions={
        "name": "The name of the person",
        "age": "The age of the person",
        "city": "The city where the person lives",
    },
)

ds = DateAScientist(
    df=df,
    llm_openai_api_token=...,  # your OpenAI API token goes here
    llm_model_name="gpt-3.5-turbo",  # by default, it uses "gpt-4o"
)

# should return DataFrame with Chicago rows
ds.chat("Who lives in Chicago?")

Finally if you want to get the code that was generated, you can use ds.code():

ds.code("Who lives in Chicago?")

which will return monokai styled code. If you want to return plain code, you can use:

ds.code("Who lives in Chicago?", return_as_string=True)

Inspirations

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

date_a_scientist-0.1.16.tar.gz (6.4 kB view details)

Uploaded Source

Built Distribution

date_a_scientist-0.1.16-py3-none-any.whl (6.9 kB view details)

Uploaded Python 3

File details

Details for the file date_a_scientist-0.1.16.tar.gz.

File metadata

  • Download URL: date_a_scientist-0.1.16.tar.gz
  • Upload date:
  • Size: 6.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.8.3 CPython/3.11.3 Darwin/24.0.0

File hashes

Hashes for date_a_scientist-0.1.16.tar.gz
Algorithm Hash digest
SHA256 f35abb7fde2a06be2b534222a6b04d9465e897922b5516363cca9e83b84916cc
MD5 c2bf9c00ca1aadc18bd718cdd728f4fa
BLAKE2b-256 132e01d8f117e08adaa86526ed99eb647d5b7770f915ad5c1a145b440e762be2

See more details on using hashes here.

File details

Details for the file date_a_scientist-0.1.16-py3-none-any.whl.

File metadata

  • Download URL: date_a_scientist-0.1.16-py3-none-any.whl
  • Upload date:
  • Size: 6.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.8.3 CPython/3.11.3 Darwin/24.0.0

File hashes

Hashes for date_a_scientist-0.1.16-py3-none-any.whl
Algorithm Hash digest
SHA256 6c732f47f38823dbe32391c07de662f1350b53f034d16549a5eab2d3e8f39f08
MD5 152e67c3a6b3170574e05cb44b97836e
BLAKE2b-256 32ad85df5f631ea96cb9856f40895afbd964d681df93b88f1931b252a85a076b

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page