
Comprehensive AI-powered file processing and organization tool


cleanupx - Comprehensive File Processing Tool

Version 0.8.2 - Near Production Ready

An AI-enhanced file organization and processing framework for code analysis, image processing, file deduplication, and privacy utilities.

🎯 What's New in v0.8.2 (Near Production Ready)

  • 🏗️ Complete Reorganization: Clean modular structure with core functionality separated from storage
  • 🚀 Enhanced Performance: Streamlined imports and optimized processing
  • 🔄 Backward Compatibility: All existing commands continue to work
  • 📦 Unified Architecture: Consolidated scattered functionality into organized modules
  • 🔒 SSL Ready: Full HTTPS support with proper certificate configuration
  • 🎯 Consistent Branding: Unified "cleanupx" naming throughout the project
  • 🔧 Legacy Integration: All storage functionality migrated and working
  • ⚡ Production Ready: Comprehensive testing and workflow optimization

📁 Project Structure

cleanupx/
├── cleanupx.py                 # Main CLI interface
├── cleanupx_core/              # Core functionality
│   ├── api/                    # XAI API integration
│   ├── processors/
│   │   ├── integrated/         # New comprehensive processing
│   │   └── legacy/             # Backward compatibility
│   └── utils/                  # Common utilities
├── storage/                    # Non-core functionality archive
│   ├── legacy_methods/         # Original processing methods
│   ├── dev_tools/              # Development utilities
│   └── documentation/          # Archive documentation
└── test/                       # Test files

🚀 Quick Start

Installation

# Install from PyPI (recommended)
pip install cleanupx

# Or install prerelease version
pip install --pre cleanupx

# Or clone from source
git clone https://github.com/lukeslp/cleanupx.git
cd cleanupx
pip install -r requirements.txt

# Set up environment (optional; only needed for AI features)
echo "XAI_API_KEY=your-xai-api-key" > .env

Basic Usage

# Show available commands and options
cleanupx --help

# Run file deduplication
cleanupx deduplicate --dir test

# Process images for accessibility
cleanupx images --dir test  

# Comprehensive processing with all features
cleanupx comprehensive --dir test

# Privacy: scramble filenames
cleanupx scramble --dir test

🛠️ Features

Core Processing

  • File Deduplication: Smart duplicate detection and organization
  • Code Analysis: Extract and analyze code snippets
  • File Organization: Categorize and organize files by type and content
  • Citation Processing: Extract and format citations from documents
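As a rough illustration of what content-based duplicate detection involves, the sketch below groups files by size and then by SHA-256 digest. It is a minimal example using only the Python standard library, not cleanupx's actual implementation; the function names `file_digest` and `find_duplicates` are hypothetical.

```python
import hashlib
from collections import defaultdict
from pathlib import Path


def file_digest(path: Path, chunk_size: int = 1 << 20) -> str:
    """Return the SHA-256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def find_duplicates(root: Path) -> list[list[Path]]:
    """Group files under root whose contents are byte-for-byte identical."""
    # Cheap first pass: files of different sizes cannot be duplicates.
    by_size: dict[int, list[Path]] = defaultdict(list)
    for path in root.rglob("*"):
        if path.is_file():
            by_size[path.stat().st_size].append(path)

    groups: list[list[Path]] = []
    for candidates in by_size.values():
        if len(candidates) < 2:
            continue
        # Second pass: hash only the files that share a size.
        by_hash: dict[str, list[Path]] = defaultdict(list)
        for path in candidates:
            by_hash[file_digest(path)].append(path)
        groups.extend(sorted(g) for g in by_hash.values() if len(g) > 1)
    return groups
```

Calling `find_duplicates(Path("test"))` would return one list per set of identical files. Grouping by size first avoids hashing files that cannot match.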

AI-Powered Features

  • X.AI Integration: Advanced AI processing with retry logic
  • Image Alt Text: Generate accessibility descriptions for images
  • Content Analysis: AI-powered content understanding and categorization
  • Smart Deduplication: Intelligent duplicate detection beyond simple hashing
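The "retry logic" mentioned above is not described further in this README. One common approach is exponential backoff with jitter, sketched below under that assumption. The helper `call_with_retry` and its parameters are hypothetical and are not part of the cleanupx API.

```python
import random
import time
from typing import Callable, TypeVar

T = TypeVar("T")


def call_with_retry(
    func: Callable[[], T],
    attempts: int = 3,
    base_delay: float = 1.0,
    retry_on: tuple[type[BaseException], ...] = (ConnectionError, TimeoutError),
    sleep: Callable[[float], None] = time.sleep,
) -> T:
    """Call func, retrying on transient errors with exponential backoff."""
    if attempts < 1:
        raise ValueError("attempts must be at least 1")
    for attempt in range(attempts):
        try:
            return func()
        except retry_on:
            if attempt == attempts - 1:
                raise  # out of attempts: surface the last error
            delay = base_delay * (2 ** attempt)
            # Jitter spreads out retries from concurrent callers.
            sleep(delay + random.uniform(0, delay / 2))
    raise AssertionError("unreachable")
```

A request function would be wrapped as `call_with_retry(lambda: send_request(payload))`, where `send_request` stands in for whatever performs the API call.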

Privacy & Utility

  • Filename Scrambling: Randomize filenames for privacy/testing
  • Rename Logging: Track and reverse filename changes
  • Rich CLI Interface: Beautiful terminal output with progress bars
  • Comprehensive Logging: Detailed operation logs

📋 Available Commands

Primary Commands

comprehensive   # Full processing with all features
images          # AI-powered image processing and alt text generation
scramble        # Privacy-focused filename scrambling

Legacy Commands (Backward Compatible)

deduplicate     # Find and process duplicate files
extract         # Extract important code snippets  
organize        # Organize and rename files
all             # Run all legacy processing steps

🔧 Configuration

Environment Variables

# Required for AI features
XAI_API_KEY=your-xai-api-key

# Optional configurations
CLEANUP_OUTPUT_DIR=custom_output_directory
CLEANUP_LOG_LEVEL=INFO
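For scripts that wrap cleanupx, the same variables can be read from Python. The sketch below is a minimal example, not the tool's own configuration loader. The `Settings` class, the `load_settings` function, and the fallback output directory `cleanupx_output` are assumptions made for illustration.

```python
import os
from dataclasses import dataclass
from pathlib import Path
from typing import Mapping, Optional


@dataclass(frozen=True)
class Settings:
    api_key: Optional[str]
    output_dir: Path
    log_level: str

    @property
    def ai_enabled(self) -> bool:
        """AI features need an API key; everything else works without one."""
        return self.api_key is not None


def load_settings(env: Mapping[str, str] = os.environ) -> Settings:
    """Read the documented variables, treating empty values as unset."""
    return Settings(
        api_key=env.get("XAI_API_KEY") or None,
        # The default directory name below is an assumption, not documented.
        output_dir=Path(env.get("CLEANUP_OUTPUT_DIR") or "cleanupx_output"),
        log_level=(env.get("CLEANUP_LOG_LEVEL") or "INFO").upper(),
    )
```

Passing a plain dictionary instead of `os.environ` makes the loader easy to exercise in tests.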

Dependencies

Core Requirements

requests>=2.31.0        # HTTP requests
rich>=13.7.0            # Beautiful console output
inquirer>=3.4.0         # Interactive prompts
pillow>=10.0.0          # Image processing
PyPDF2>=3.0.1           # PDF processing
python-docx>=1.1.2      # Word document processing

Optional Dependencies

openai                  # OpenAI API fallback
PyHEIF                  # HEIC/HEIF image support
rarfile                 # RAR archive processing

🏗️ Architecture

Modular Design

cleanupx is built with a clean, modular architecture:

  • cleanupx_core/: Core functionality with stable APIs
  • storage/: Non-essential functionality for experimentation
  • Processors: Specialized processing modules for different file types
  • API Layer: Unified interface for AI service integration

Key Benefits

  1. Clean Separation: Core vs. experimental functionality
  2. Backward Compatibility: Legacy commands continue to work
  3. Extensible: Easy to add new processors and features
  4. Production Ready: Robust error handling and logging
  5. Organized Output: Centralized output management

📊 Supported File Types

  • Images: .jpg, .jpeg, .png, .gif, .webp, .bmp, .tiff
  • Code: .py, .js, .html, .css, .md, .txt, .json, .yaml
  • Documents: .pdf, .doc, .docx, .rtf, .pptx
  • Archives: .zip, .tar, .gz (with optional RAR support)
  • All Others: Categorized and processed appropriately
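The categories above amount to a lookup from file extension to category. The sketch below shows one way to express that mapping. It is illustrative only: the `CATEGORIES` table and `categorize` function are hypothetical names, and cleanupx may also inspect file content rather than extensions alone.

```python
from pathlib import Path

# Extension lists copied from the table above; ".rar" is the optional extra.
CATEGORIES = {
    "images": {".jpg", ".jpeg", ".png", ".gif", ".webp", ".bmp", ".tiff"},
    "code": {".py", ".js", ".html", ".css", ".md", ".txt", ".json", ".yaml"},
    "documents": {".pdf", ".doc", ".docx", ".rtf", ".pptx"},
    "archives": {".zip", ".tar", ".gz", ".rar"},
}


def categorize(path) -> str:
    """Return the category for a path based on its final extension."""
    suffix = Path(path).suffix.lower()
    for category, extensions in CATEGORIES.items():
        if suffix in extensions:
            return category
    return "other"
```

Only the last suffix is considered, so `backup.tar.gz` is matched on `.gz`.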

🚦 Status Check

# Verify module status
python3 -c "import cleanupx_core; cleanupx_core.print_status()"

Expected output:

cleanupx Core v0.8.2
  Integrated Processors: ✓
  XAI API Support: ✓
  Legacy Processors: ✓
  Module Path: /path/to/cleanupx_core

🔍 Examples

Basic File Organization

# Organize a downloads directory
cleanupx organize --dir ~/Downloads

# Find duplicates in a project
cleanupx deduplicate --dir ~/Projects/MyProject

AI-Enhanced Processing

# Generate alt text for all images
cleanupx images --dir ./photos

# Comprehensive AI analysis
cleanupx comprehensive --dir ./documents

Privacy & Testing

# Scramble filenames for privacy
cleanupx scramble --dir ./sensitive_data

# Note: Scrambling creates a log file so the renames can be reversed
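To make the idea of reversible scrambling concrete, the sketch below renames files to random names and records the mapping in a JSON log that can undo the change. It is a simplified illustration, not the code behind `cleanupx scramble`. The log filename `rename_log.json` and the functions `scramble` and `unscramble` are assumptions.

```python
import json
import secrets
from pathlib import Path

LOG_NAME = "rename_log.json"  # hypothetical log filename


def scramble(directory: Path) -> dict[str, str]:
    """Rename every file in directory to a random name; return {new: old}."""
    log_path = directory / LOG_NAME
    if log_path.exists():
        raise FileExistsError(f"{log_path} exists; unscramble first")
    files = sorted(p for p in directory.iterdir() if p.is_file())
    taken = {p.name for p in directory.iterdir()}
    mapping: dict[str, str] = {}
    for path in files:
        new_name = secrets.token_hex(8) + path.suffix  # keep the extension
        while new_name in taken:
            new_name = secrets.token_hex(8) + path.suffix
        taken.add(new_name)
        mapping[new_name] = path.name
    # Write the log before renaming so an interrupted run can still be undone.
    log_path.write_text(json.dumps(mapping, indent=2), encoding="utf-8")
    for new_name, old_name in mapping.items():
        (directory / old_name).rename(directory / new_name)
    return mapping


def unscramble(directory: Path) -> None:
    """Restore original names from the log, then delete the log."""
    log_path = directory / LOG_NAME
    mapping = json.loads(log_path.read_text(encoding="utf-8"))
    for new_name, old_name in mapping.items():
        (directory / new_name).rename(directory / old_name)
    log_path.unlink()
```

The log itself reveals the original names, so it should be stored away from the scrambled files if privacy is the goal.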

🛠️ Development

Module Status

The reorganized architecture provides:

  • Stable Core: cleanupx_core/ for production functionality
  • Experimental Storage: storage/ for development and testing
  • Clear APIs: Well-defined interfaces between modules
  • Easy Testing: Modular design enables easy unit testing

Contributing

  1. Core functionality goes in cleanupx_core/
  2. Experimental features start in storage/dev_tools/
  3. All changes must maintain backward compatibility
  4. Add comprehensive tests for new features

📝 License & Credits

MIT License by Luke Steuber

🔗 Connect & Support

Platform     Link
Website      lukesteuber.com
Playground   assisted.site
Email        luke@lukesteuber.com
Bluesky      @lukesteuber.com
LinkedIn     lukesteuber
GitHub       lukeslp
LlamaLine    AI CLI Tool
Newsletter   Substack
Support      Tip Jar

🎯 What's Next

  1. Performance Optimization: Profile and optimize processing speeds
  2. Enhanced AI Features: More sophisticated content analysis
  3. Web Interface: Browser-based processing dashboard
  4. API Server: REST API for remote processing
  5. Plugin System: Custom processor plugins

Version: 0.8.2 - Near Production Ready
Last Updated: June 7, 2025
Status: 🚀 Near Production Ready - Legacy Integration Complete
