Unleash the Potential of Datasets with Intelligent Tasks, Tutorials, and Algorithm Recommendations.
Project description
🧑🎓 ProTaska-GPT
Your AI-powered data companion 🤖
Specify your dataset of choice, and ProTaska-GPT generates a tailored codebase, empowering you to visualize and understand the dataset with tasks, tutorials, and actionable insights. Accelerate your data science journey with ease and efficiency!
🖊️ Key Features:
- Dataset Ingestion: ProTaska-GPT seamlessly integrates with dataset sources like Kaggle and Hugging Face (for now), allowing users to easily import and work with diverse datasets.
- Task Recommendations: Leveraging its GPT-backbone, it generates a customized set of tasks tailored to each dataset, providing users with valuable project ideas and challenges.
- Algorithm Suggestions: Based on the dataset characteristics, it suggests suitable machine learning algorithms, enabling users to make informed decisions during their project journey.
- Conversational Chatbot: Allow user to discuss about different techniques and scrape information from Wikipedia to give relevant responses.
🔎 Objectives:
- Beginner-Friendly Tutorials: ProTaska-GPT aims to offer automated generation of a collection of beginner-friendly tutorials that guide users through common data science workflows, step-by-step, fostering practical learning and skill development.
📈 Installation:
pip install ProTaska-GPT --upgrade
🚀 Usage:
Importing base descriptors and ideation bot:
from protaska.describer import describe_dataset
from protaska.ideate import main as chatbot
Providing meta-data about the dataset to be used:
openai_key = '**open-ai-secret-key**'
importer_type = "HuggingFaceDatasetImporter"
dataset_key = 'mteb/tweet_sentiment_extraction'
destination_path = './downloaded_data/'
Getting automated data descriptions:
description, dataloader_obj = describe_dataset(openai_key, importer_type, destination_path, dataset_key)
description
Running an interactive ChatBot for ideation and base-code building:
chatbot(openai_key, description, dataloader_obj.superficial_meta_data, agent_verbose=False)
💁 Contributing
This is an open-source project and we would be really grateful to any contributions.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file ProTaska-GPT-0.0.12.tar.gz
.
File metadata
- Download URL: ProTaska-GPT-0.0.12.tar.gz
- Upload date:
- Size: 10.0 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.9.13
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 9e84d91980ee0d84a4efd570ba1019d164c0a6250ade6f2d95160841dc0f58a2 |
|
MD5 | cde00041beaae58fcfa4d266d49797be |
|
BLAKE2b-256 | 7fb72f82d44e16dec1dddcedef8acf06b24637fe3cdf33b614e83388095960cb |
File details
Details for the file ProTaska_GPT-0.0.12-py3-none-any.whl
.
File metadata
- Download URL: ProTaska_GPT-0.0.12-py3-none-any.whl
- Upload date:
- Size: 10.3 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.9.13
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | a11d23b37f3680277402ddb95a197c748e0fd1c8d15f2d17939abf6c1c14be16 |
|
MD5 | a1060971f6fadaa689203a1b76a64a85 |
|
BLAKE2b-256 | 355f17f43d6a49b7995a51836938b01e898ec6521cf0b2db76fa9386bc1ba20a |