Skip to main content

Prepup is a free, open-source package for data preprocessing in terminal

Project description

💻 Prepup: Interactive Data Preprocessing Toolkit

Static Badge

Python Versions image Static Badge Static Badge Static Badge License

🚀 Quick Overview

Prepup is a powerful, user-friendly data preprocessing tool designed to simplify and streamline your data analysis workflow directly from the terminal. Whether you're a data scientist, analyst, or researcher, Prepup provides an intuitive interface for exploring, cleaning, and preparing your datasets.

✨ Features

Interactive Mode

  • 📊 Load datasets from various formats (CSV, Excel, Parquet)
  • 🔍 Comprehensive data inspection
  • 📈 Advanced data exploration
  • 🧹 Missing value handling
  • 📊 Feature visualization
  • 🤖 Automatic Machine Learning (AutoML) model selection

Key Functionalities

  • Data Loading
  • Feature Inspection
  • Correlation Analysis
  • Distribution Checking
  • Outlier Detection
  • Missing Value Imputation
  • Feature Standardization
  • Automatic Model Training

🛠 Installation

Using pip

pip install prepup-linux

From Source

git clone https://github.com/sudhanshumukherjeexx/prepup-linux.git
cd prepup-linux
pip install .

💻 Usage

Interactive Mode

prepup

Loading a Specific Dataset

prepup path/to/your/dataset.csv

Main Menu Options

  1. Load Dataset
  2. Inspect Data
  3. Explore Data
  4. Visualize Data
  5. Impute Missing Values
  6. Standardize Features
  7. Export Data
  8. AutoML (Train & Evaluate Models)

🎮 Interactive Workflow Example

  1. Launch Prepup prepup

  2. Load Your Dataset: Choose option 1 and enter your dataset path

  3. Inspect Data: Use option 2 to explore features, data types, and missing values

  4. Preprocess: Impute missing values | Standardize features

  5. Analyze: Visualize data distributions | Perform correlation analysis | Run AutoML for model selection

🤖 AutoML Capabilities

  • Supports both Classification and Regression tasks
  • Evaluates multiple machine learning algorithms
  • Provides performance metrics
  • Saves results to CSV

📦 Dependencies

  • NumPy
  • Pandas
  • Scikit-learn
  • Matplotlib
  • and more (see requirements.txt)

🤝 Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

  1. Fork the repository
  2. Create your feature branch (git checkout -b feature/AmazingFeature)
  3. Commit your changes (git commit -m 'Add some AmazingFeature')
  4. Push to the branch (git push origin feature/AmazingFeature)
  5. Open a Pull Request

📋 License

Distributed under the MIT License. See LICENSE for more information.

Package Link

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

prepup_linux-0.2.1.tar.gz (21.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

prepup_linux-0.2.1-py3-none-any.whl (18.2 kB view details)

Uploaded Python 3

File details

Details for the file prepup_linux-0.2.1.tar.gz.

File metadata

  • Download URL: prepup_linux-0.2.1.tar.gz
  • Upload date:
  • Size: 21.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.10.12

File hashes

Hashes for prepup_linux-0.2.1.tar.gz
Algorithm Hash digest
SHA256 534a3fc67d0c3035897d28803e301216a6c12d261c6318bdacd2f518d86ae5db
MD5 0481006f991c010d46c70c8684504c96
BLAKE2b-256 38076ead114b9551325f9b8d631d5dcbd86a6d3c0df1e0221f62656a2274b5b9

See more details on using hashes here.

File details

Details for the file prepup_linux-0.2.1-py3-none-any.whl.

File metadata

  • Download URL: prepup_linux-0.2.1-py3-none-any.whl
  • Upload date:
  • Size: 18.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.10.12

File hashes

Hashes for prepup_linux-0.2.1-py3-none-any.whl
Algorithm Hash digest
SHA256 2bb712c2603f8a5e2a187e0e137035168bbaa1f9d6ae0e3b65d5cfcc9ec9ce15
MD5 fa48055841bcd8c593b07ccc23ea9ff0
BLAKE2b-256 c7515fdb6d9da915ebc89a68cbfc22e03104eb27ef85bc3777b43e4355602bc4

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page