Use LLMs to automate data science tasks
Data Pilot
DISCLAIMER: This project is a work in progress and under active development.
Data Pilot is a Python package that automates the use of language models (e.g. GPT-3.5, GPT-4, StableLM, Vicuna40B) for data science tasks such as data cleaning, data preprocessing, data analysis, and ML model training.
We recommend OpenAI's models for these tasks, as we expect them to give the best output. However, you are free to choose from a range of other models.
To use data-pilot, you will need access to two APIs:
- An API for a data source, e.g. Supabase, Databricks, or any other database
- An API for language models, e.g. OpenAI or Replicate
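Since the package is still under development, its own configuration interface is not documented here. The following is only a minimal sketch of how the two sets of credentials listed above could be collected and validated before use. The class `PilotCredentials`, the function `load_credentials`, and the environment variable names are all hypothetical and are not part of data-pilot.

```python
import os
from dataclasses import dataclass
from typing import Mapping, Optional


@dataclass(frozen=True)
class PilotCredentials:
    """Hypothetical container for the two sets of API credentials."""
    data_source_url: str   # e.g. the URL of a Supabase or Databricks endpoint
    data_source_key: str   # API key or token for the data source
    llm_provider: str      # e.g. "openai" or "replicate"
    llm_api_key: str       # API key for the language model provider


# Hypothetical variable names, chosen for this example only.
REQUIRED_VARS = (
    "DATA_SOURCE_URL",
    "DATA_SOURCE_KEY",
    "LLM_PROVIDER",
    "LLM_API_KEY",
)


def load_credentials(env: Optional[Mapping[str, str]] = None) -> PilotCredentials:
    """Read both sets of credentials, failing early if any value is missing."""
    env = os.environ if env is None else env
    missing = [name for name in REQUIRED_VARS if not env.get(name)]
    if missing:
        raise ValueError("Missing required settings: " + ", ".join(missing))
    return PilotCredentials(
        data_source_url=env["DATA_SOURCE_URL"],
        data_source_key=env["DATA_SOURCE_KEY"],
        llm_provider=env["LLM_PROVIDER"].lower(),
        llm_api_key=env["LLM_API_KEY"],
    )
```

Calling `load_credentials()` with no argument reads from the process environment, which keeps keys out of source code. Passing a plain dictionary instead makes the function easy to exercise in tests.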
Download files
Source Distribution
data_pilot-0.0.1.tar.gz (1.7 kB)
Built Distribution
Hashes for data_pilot-0.0.1-py3-none-any.whl
Algorithm | Hash digest
---|---
SHA256 | 2f32be528acc536709dbe7407112a8388fa91612b49df5a3e8529a5515c2da57
MD5 | 822225da8924e5db7fe5cf62d4bc5878
BLAKE2b-256 | 9fe72e8b4469cb1d77ad8ca6f72c7b2306a759e374d4cc5aca6794c01ff7206b