"It is a Machine Learning Copilot Agent that can execute code for end-to-end ML Cycle"

These details have not been verified by PyPI

Project description

ML-Copilot-Agent

ML-Copilot is an interactive machine learning assistant that streamlines the process of data preprocessing, model training, evaluation, plotting results, and generating documentation—all through a command-line interface powered by OpenAI's GPT models.

Features

List Files: View files in the current directory.
Data Preprocessing: Preprocess data for binary classification tasks with customizable instructions.
Model Training: Train binary classification models using algorithms like Logistic Regression, SVM, or Random Forest.
Model Evaluation: Evaluate trained models and obtain metrics such as accuracy, precision, recall, F1-score, and AUC.
Plotting: Generate various plots (e.g., bar plots, PCA plots, correlation matrices) from data or evaluation results.
Documentation: Automatically generate a documentation report summarizing the entire workflow.
Interactive Workflow: Seamlessly navigate through different steps with an intuitive command-line interface.

Installation

pip install ml-copilot-agent

Clone the repository:

git clone https://github.com/VatsalPatel18/ml-copilot-agent.git

Navigate to the project directory:

cd ml-copilot

Install the required dependencies:

pip install -r requirements.txt

Ensure you have Python 3.7 or higher installed on your system.

Usage

Set Up OpenAI API Key:

Export your OpenAI API key as an environment variable or include it when running the program.

export OPENAI_API_KEY='your-openai-api-key'

Run ML-Copilot:

python -m ml_copilot_agent your-openai-api-key

Replace your-openai-api-key with your actual OpenAI API key.

Interact with ML-Copilot:

Once started, ML-Copilot will prompt you for commands. You can enter any of the following commands:

list files: Show files in the current directory.
preprocess: Preprocess data for a binary classification task.
train: Train a binary classification model.
evaluate: Evaluate the trained model.
plot: Generate plots from data or evaluation results.
document: Generate a documentation report.
exit: Terminate the workflow.

Example Workflow

Step 1: List Files

list files

View all files in the current directory to ensure your dataset is available.

Step 2: Preprocess Data

preprocess

Dataset Path: Provide the path to your dataset (e.g., data/dataset.csv).
Target Column Name: Specify the name of the target column in your dataset.
Save Path: Choose where to save the preprocessed data (default is data/preprocessed_data.csv).
Additional Instructions: (Optional) Add any specific preprocessing instructions (e.g., "use standard scaler").

Step 3: Train Model

train

Model Save Path: Specify where to save the trained model (default is models/model.pkl).
Additional Instructions: (Optional) Specify model preferences (e.g., "use SVM classifier").

Step 4: Evaluate Model

evaluate

Evaluation Save Path: Specify where to save the evaluation results (default is results/evaluation.txt).

Step 5: Plot Results

plot

Data File Path: Provide the data file path or press Enter to use default evaluation results or preprocessed data.
Additional Plotting Instructions: (Optional) Specify the type of plot (e.g., "make a bar plot of accuracy and precision").

Step 6: Generate Documentation

document

ML-Copilot will generate a report summarizing the preprocessing steps, model training, and evaluation results.

Step 7: Exit

exit

Terminate the workflow when you are done.

Dependencies

Python 3.7 or higher
OpenAI GPT Models
LlamaIndex
Pandas
Scikit-learn
Matplotlib
Seaborn

Install all dependencies using:

pip install -r requirements.txt

Project Structure

ml_copilot/
__init__.py: Initialization and configuration.
__main__.py: Entry point of the application.
workflow.py: Defines the MLWorkflow class and all associated steps and events.
data/: Directory where preprocessed data is saved.
models/: Directory where trained models are saved.
results/: Directory where evaluation results and plots are saved.
reports/: Directory where documentation reports are saved.
requirements.txt: Contains all Python dependencies.

Contributing

Contributions are welcome! Please follow these steps:

Fork the repository.
Create a new branch for your feature or bug fix.
Commit your changes with clear and descriptive messages.
Push to your fork and submit a pull request.

License

This project is licensed under the MIT License. See the LICENSE file for details.

Acknowledgments

Thanks to the developers of OpenAI and LlamaIndex for providing the foundational tools that make this project possible.

Contact

For any questions or suggestions, feel free to open an issue or contact the repository owner.

Project details

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.1.8

Oct 14, 2024

0.1.7

Oct 14, 2024

0.1.6

Oct 13, 2024

0.1.5

Oct 5, 2024

0.1.4

Oct 5, 2024

0.1.3

Oct 5, 2024

This version

0.1.2

Oct 4, 2024

0.1.1

Oct 4, 2024

0.1.0

Oct 4, 2024

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ml_copilot_agent-0.1.2.tar.gz (6.2 kB view hashes)

Uploaded Oct 4, 2024 Source

Built Distribution

ml_copilot_agent-0.1.2-py3-none-any.whl (7.3 kB view hashes)

Uploaded Oct 4, 2024 Python 3

Hashes for ml_copilot_agent-0.1.2.tar.gz

Hashes for ml_copilot_agent-0.1.2.tar.gz
Algorithm	Hash digest
SHA256	`9e6370e5828c5e642ab45981fc9788149c1abc56c6719d123ddeb5b43a6fb79e`
MD5	`dd5b8025c0e7ade5c60e9d73e697e966`
BLAKE2b-256	`cf00c7dfb54eca23b736a61cd5caa3af742151cfc5ccb479b68ed836a5fef777`

Hashes for ml_copilot_agent-0.1.2-py3-none-any.whl

Hashes for ml_copilot_agent-0.1.2-py3-none-any.whl
Algorithm	Hash digest
SHA256	`4dae858b203a24d91c24c25154289d9ae14ec439fca7d20ef6a83afc440cbf32`
MD5	`636899b17b23faaff4bc25d705794489`
BLAKE2b-256	`10fe7627a0db06c86fe750d9d1593a6698dc4f49c2a4727eb0d898e5187913b6`