Skip to main content

Prepup is a free, open-source package for data preprocessing in terminal

Project description

💻 Prepup: Interactive Data Preprocessing Toolkit

Static Badge

Python Versions image Static Badge Static Badge Static Badge License

🚀 Quick Overview

Prepup is a powerful, user-friendly data preprocessing tool designed to simplify and streamline your data analysis workflow directly from the terminal. Whether you're a data scientist, analyst, or researcher, Prepup provides an intuitive interface for exploring, cleaning, and preparing your datasets.

✨ Features

Interactive Mode

  • 📊 Load datasets from various formats (CSV, Excel, Parquet)
  • 🔍 Comprehensive data inspection
  • 📈 Advanced data exploration
  • 🧹 Missing value handling
  • 📊 Feature visualization
  • 🤖 Automatic Machine Learning (AutoML) model selection

Key Functionalities

  • Data Loading
  • Feature Inspection
  • Correlation Analysis
  • Distribution Checking
  • Outlier Detection
  • Missing Value Imputation
  • Feature Standardization
  • Automatic Model Training

🛠 Installation

Using pip

pip install prepup-linux

From Source

git clone https://github.com/sudhanshumukherjeexx/prepup-linux.git
cd prepup-linux
pip install .

💻 Usage

Interactive Mode

prepup

Loading a Specific Dataset

prepup path/to/your/dataset.csv

Main Menu Options

  1. Load Dataset
  2. Inspect Data
  3. Explore Data
  4. Visualize Data
  5. Impute Missing Values
  6. Standardize Features
  7. Export Data
  8. AutoML (Train & Evaluate Models)

🎮 Interactive Workflow Example

  1. Launch Prepup prepup

  2. Load Your Dataset: Choose option 1 and enter your dataset path

  3. Inspect Data: Use option 2 to explore features, data types, and missing values

  4. Preprocess: Impute missing values | Standardize features

  5. Analyze: Visualize data distributions | Perform correlation analysis | Run AutoML for model selection

🤖 AutoML Capabilities

  • Supports both Classification and Regression tasks
  • Evaluates multiple machine learning algorithms
  • Provides performance metrics
  • Saves results to CSV

📦 Dependencies

  • NumPy
  • Pandas
  • Scikit-learn
  • Matplotlib
  • and more (see requirements.txt)

🤝 Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

  1. Fork the repository
  2. Create your feature branch (git checkout -b feature/AmazingFeature)
  3. Commit your changes (git commit -m 'Add some AmazingFeature')
  4. Push to the branch (git push origin feature/AmazingFeature)
  5. Open a Pull Request

📋 License

Distributed under the MIT License. See LICENSE for more information.

Package Link

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

prepup_linux-0.2.2.tar.gz (18.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

prepup_linux-0.2.2-py3-none-any.whl (18.0 kB view details)

Uploaded Python 3

File details

Details for the file prepup_linux-0.2.2.tar.gz.

File metadata

  • Download URL: prepup_linux-0.2.2.tar.gz
  • Upload date:
  • Size: 18.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.10.16

File hashes

Hashes for prepup_linux-0.2.2.tar.gz
Algorithm Hash digest
SHA256 84d04d67a8db58b28eb9efb7ce8e04066f112972004449d5a4d3cd886eb8c6b0
MD5 ccb404fbe424b0b4c0e872e8f1544b4d
BLAKE2b-256 48d8c8422df3f210ffc589123efa4bea3dcf57aa0b364d9aa0d1c0ce77c01e38

See more details on using hashes here.

File details

Details for the file prepup_linux-0.2.2-py3-none-any.whl.

File metadata

  • Download URL: prepup_linux-0.2.2-py3-none-any.whl
  • Upload date:
  • Size: 18.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.10.16

File hashes

Hashes for prepup_linux-0.2.2-py3-none-any.whl
Algorithm Hash digest
SHA256 bf99221642308b5d03dd3d72cd4478b5ee2a986cdad76d40873005f99448f0fd
MD5 e9ee6983277d2404ed09df412cc76ec0
BLAKE2b-256 ce98049398981ac8565ab26a518b087e1cf66d719c4f7e122cd9200ded625fbd

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page