Unleash the Potential of Datasets with Intelligent Tasks, Tutorials, and Algorithm Recommendations.

🚀 ProTaska-GPT

Your AI-powered data companion 🤖

Specify your dataset of choice, and ProTaska-GPT generates a tailored codebase, empowering you to visualize and understand the dataset with tasks, tutorials, and actionable insights. Accelerate your data science journey with ease and efficiency!

🖊️ Key Features:

  1. Dataset Ingestion: ProTaska-GPT integrates with dataset sources (currently Kaggle and Hugging Face), so users can import and work with diverse datasets.
  2. Task Recommendations: Using its GPT backbone, it generates a set of tasks tailored to each dataset, giving users project ideas and challenges.
  3. Algorithm Suggestions: Based on the dataset's characteristics, it suggests suitable machine learning algorithms, helping users make informed decisions during their project.
  4. Conversational Chatbot: Lets users discuss different techniques, and scrapes information from Wikipedia to give relevant responses.

🔎 Objectives:

  1. Beginner-Friendly Tutorials: ProTaska-GPT aims to automatically generate beginner-friendly tutorials that guide users step by step through common data science workflows, fostering practical learning and skill development.

Installation:

pip install ProTaska-GPT --upgrade

📈 Usage:

Importing base descriptors and ideation bot:

from protaska.describer import describe_dataset
from protaska.ideate import main as chatbot

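Providing metadata about the dataset to be used:

openai_key = '**open-ai-secret-key**'  # your OpenAI API key
importer_type = "HuggingFaceDatasetImporter"  # which dataset source to import from
dataset_key = 'mteb/tweet_sentiment_extraction'  # dataset identifier on that source
destination_path = './downloaded_data/'  # local directory for the downloaded data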
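
Hardcoding a secret key in a script makes it easy to leak through version control. As a minimal sketch, the key can be read from an environment variable instead. The variable name `OPENAI_API_KEY` and the helper `load_openai_key` are assumptions made for illustration; they are not part of the ProTaska-GPT API, which only needs the key passed in as a string.

```python
import os


def load_openai_key(env_var: str = "OPENAI_API_KEY") -> str:
    """Return the OpenAI key stored in an environment variable.

    Hypothetical helper: the variable name is an assumption, not something
    ProTaska-GPT requires.
    """
    key = os.environ.get(env_var, "").strip()
    if not key:
        # Fail early with a clear message instead of passing an empty key on.
        raise RuntimeError(f"Set the {env_var} environment variable before running.")
    return key
```

With this in place, `openai_key = load_openai_key()` can replace the hardcoded assignment, and the rest of the usage stays the same.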

Getting automated data descriptions:

description, dataloader_obj = describe_dataset(openai_key, importer_type, destination_path, dataset_key)
print(description)

Running an interactive ChatBot for ideation and base-code building:

chatbot(openai_key, description, dataloader_obj.superficial_meta_data, agent_verbose=False)

💁 Contributing

This is an open-source project, and we would be grateful for any contributions.

Download files

Source Distribution

ProTaska-GPT-0.0.11.tar.gz (9.9 kB)

Uploaded Source

Built Distribution

ProTaska_GPT-0.0.11-py3-none-any.whl (10.2 kB)

Uploaded Python 3

File details

Details for the file ProTaska-GPT-0.0.11.tar.gz.

File metadata

  • Download URL: ProTaska-GPT-0.0.11.tar.gz
  • Upload date:
  • Size: 9.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.9.13

File hashes

Hashes for ProTaska-GPT-0.0.11.tar.gz
Algorithm Hash digest
SHA256 755f81b2e81a7571f4b2739274dbfa7ebe1c6d4d1929fe359e109beabadbcebb
MD5 5cdddf61c21b472547be56314e699f5b
BLAKE2b-256 e7d959c4e46f049605d485abf88cba145ffa7dd5a531502bcac36a81b78f77a0
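
The digests above can be used to check that a downloaded file was not corrupted or altered. The following is a minimal sketch using only the standard library; the helper names `sha256_of` and `matches_expected` are illustrative, and the local file path is whatever location the archive was saved to.

```python
import hashlib
from pathlib import Path

# SHA256 digest listed above for ProTaska-GPT-0.0.11.tar.gz
EXPECTED_SHA256 = "755f81b2e81a7571f4b2739274dbfa7ebe1c6d4d1929fe359e109beabadbcebb"


def sha256_of(path, chunk_size=65536):
    """Compute the SHA256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with Path(path).open("rb") as fh:
        # iter() with a sentinel stops when read() returns an empty bytes object.
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path, expected=EXPECTED_SHA256):
    """Return True if the file's SHA256 digest equals the expected one."""
    return sha256_of(path) == expected.lower()
```

For example, `matches_expected("ProTaska-GPT-0.0.11.tar.gz")` should return `True` for an intact copy of the source distribution.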

File details

Details for the file ProTaska_GPT-0.0.11-py3-none-any.whl.

File metadata

  • Download URL: ProTaska_GPT-0.0.11-py3-none-any.whl
  • Upload date:
  • Size: 10.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.9.13

File hashes

Hashes for ProTaska_GPT-0.0.11-py3-none-any.whl
Algorithm Hash digest
SHA256 5cd25ee1d825925ceb739d4f6eb50b5e7615b03387fcca37bc50c0ca50484c2f
MD5 cee2b00373984a699320a58744420eca
BLAKE2b-256 814a49d0ff6a95008ad37d773cfc9b4094a2ae567bfd5bff732e6cf590502584
