Skip to main content

Use LLMs to automate data science tasks

Project description

Data Pilot

DISCLAIMER: This is a work in progress and currently under development.

Data Pilot is a python package that automates the process of using Language Models (e.g. GPT-3.5, GPT-4, StableLM, Vicuna40B) to perform Data Science tasks such as data-cleaning, data-preprocessing, data-analysis, and ML model training.

We recommend using OpenAI's models to perform these tasks as they will provide the best output. However, the user is free to choose between a range of different models.

To use data-pilot, you will need two APIs:

  • An API for data source : e.g. Supabase, Databricks, or any other Database
  • An API for language models : e.g. OpenAI , Replicate

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

data_pilot-0.0.1.tar.gz (1.7 kB view hashes)

Uploaded Source

Built Distribution

data_pilot-0.0.1-py3-none-any.whl (2.3 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page