Skip to main content

Comprehensive AI-powered file processing and organization tool

Project description

cleanupx - Comprehensive File Processing Tool

Version 0.8.0 - Prerelease

A powerful, AI-enhanced file organization and processing framework with comprehensive capabilities for code analysis, image processing, file deduplication, and privacy utilities.

๐ŸŽฏ What's New in v0.8 (Prerelease)

  • ๐Ÿ—๏ธ Complete Reorganization: Clean modular structure with core functionality separated from storage
  • ๐Ÿš€ Enhanced Performance: Streamlined imports and optimized processing
  • ๐Ÿ”„ Backward Compatibility: All existing commands continue to work
  • ๐Ÿ“ฆ Unified Architecture: Consolidated scattered functionality into organized modules
  • ๐Ÿ”’ SSL Ready: Full HTTPS support with proper certificate configuration
  • ๐ŸŽฏ Consistent Branding: Unified "cleanupx" naming throughout the project

๐Ÿ“ Project Structure

cleanupx/
โ”œโ”€โ”€ cleanupx.py                 # Main CLI interface
โ”œโ”€โ”€ cleanupx_core/              # Core functionality
โ”‚   โ”œโ”€โ”€ api/                    # XAI API integration  
โ”‚   โ”œโ”€โ”€ processors/
โ”‚   โ”‚   โ”œโ”€โ”€ integrated/         # New comprehensive processing
โ”‚   โ”‚   โ””โ”€โ”€ legacy/             # Backward compatibility
โ”‚   โ””โ”€โ”€ utils/                  # Common utilities
โ”œโ”€โ”€ storage/                    # Non-core functionality archive
โ”‚   โ”œโ”€โ”€ legacy_methods/         # Original processing methods
โ”‚   โ”œโ”€โ”€ dev_tools/              # Development utilities
โ”‚   โ””โ”€โ”€ documentation/          # Archive documentation
โ””โ”€โ”€ test/                       # Test files

๐Ÿš€ Quick Start

Installation

# Install from PyPI (recommended)
pip install cleanupx

# Or install prerelease version
pip install --pre cleanupx

# Or clone from source
git clone https://github.com/lukeslp/cleanupx.git
cd cleanupx
pip install -r requirements.txt

# Set up environment (optional for AI features)
echo "XAI_API_KEY=your-xai-api-key" > .env

Basic Usage

# Check system status
cleanupx --help

# Run file deduplication
cleanupx deduplicate --dir test

# Process images for accessibility
cleanupx images --dir test  

# Comprehensive processing with all features
cleanupx comprehensive --dir test

# Privacy: scramble filenames
cleanupx scramble --dir test

๐Ÿ› ๏ธ Features

Core Processing

  • File Deduplication: Smart duplicate detection and organization
  • Code Analysis: Extract and analyze code snippets
  • File Organization: Categorize and organize files by type and content
  • Citation Processing: Extract and format citations from documents

AI-Powered Features

  • X.AI Integration: Advanced AI processing with retry logic
  • Image Alt Text: Generate accessibility descriptions for images
  • Content Analysis: AI-powered content understanding and categorization
  • Smart Deduplication: Intelligent duplicate detection beyond simple hashing

Privacy & Utility

  • Filename Scrambling: Randomize filenames for privacy/testing
  • Rename Logging: Track and reverse filename changes
  • Rich CLI Interface: Beautiful terminal output with progress bars
  • Comprehensive Logging: Detailed operation logs

๐Ÿ“‹ Available Commands

Primary Commands

comprehensive    # Full processing with all features
images          # AI-powered image processing and alt text generation  
scramble        # Privacy-focused filename scrambling

Legacy Commands (Backward Compatible)

deduplicate     # Find and process duplicate files
extract         # Extract important code snippets  
organize        # Organize and rename files
all             # Run all legacy processing steps

๐Ÿ”ง Configuration

Environment Variables

# Required for AI features
XAI_API_KEY=your-xai-api-key

# Optional configurations
CLEANUP_OUTPUT_DIR=custom_output_directory
CLEANUP_LOG_LEVEL=INFO

Dependencies

Core Requirements

requests>=2.31.0        # HTTP requests
rich>=13.7.0           # Beautiful console output  
inquirer>=3.4.0        # Interactive prompts
pillow>=10.0.0         # Image processing
PyPDF2>=3.0.1          # PDF processing
python-docx>=1.1.2     # Word document processing

Optional Dependencies

openai                 # OpenAI API fallback
PyHEIF                # HEIC/HEIF image support
rarfile               # RAR archive processing

๐Ÿ—๏ธ Architecture

Modular Design

cleanupx is built with a clean, modular architecture:

  • cleanupx_core/: Core functionality with stable APIs
  • storage/: Non-essential functionality for experimentation
  • Processors: Specialized processing modules for different file types
  • API Layer: Unified interface for AI service integration

Key Benefits

  1. Clean Separation: Core vs. experimental functionality
  2. Backward Compatibility: Legacy commands continue to work
  3. Extensible: Easy to add new processors and features
  4. Production Ready: Robust error handling and logging
  5. Organized Output: Centralized output management

๐Ÿ“Š Supported File Types

  • Images: .jpg, .jpeg, .png, .gif, .webp, .bmp, .tiff
  • Code: .py, .js, .html, .css, .md, .txt, .json, .yaml
  • Documents: .pdf, .doc, .docx, .rtf, .pptx
  • Archives: .zip, .tar, .gz (with optional RAR support)
  • All Others: Categorized and processed appropriately

๐Ÿšฆ Status Check

# Verify module status
python3 -c "import cleanupx_core; cleanupx_core.print_status()"

Expected output:

cleanupx Core v0.8.0
  Integrated Processors: โœ“
  XAI API Support: โœ“  
  Legacy Processors: โœ“
  Module Path: /path/to/cleanupx_core

๐Ÿ” Examples

Basic File Organization

# Organize a downloads directory
cleanupx organize --dir ~/Downloads

# Find duplicates in a project
cleanupx deduplicate --dir ~/Projects/MyProject

AI-Enhanced Processing

# Generate alt text for all images
cleanupx images --dir ./photos

# Comprehensive AI analysis
cleanupx comprehensive --dir ./documents

Privacy & Testing

# Scramble filenames for privacy
cleanupx scramble --dir ./sensitive_data

# Note: Scrambling creates a log file to reverse changes

๐Ÿ› ๏ธ Development

Module Status

The reorganized architecture provides:

  • Stable Core: cleanupx_core/ for production functionality
  • Experimental Storage: storage/ for development and testing
  • Clear APIs: Well-defined interfaces between modules
  • Easy Testing: Modular design enables easy unit testing

Contributing

  1. Core functionality goes in cleanupx_core/
  2. Experimental features start in storage/dev_tools/
  3. All changes must maintain backward compatibility
  4. Add comprehensive tests for new features

๐Ÿ“ License & Credits

MIT License by Luke Steuber

๐Ÿ”— Connect & Support

Platform Link
๐ŸŒ Website lukesteuber.com
๐Ÿ› ๏ธ Playground assisted.site
๐Ÿ“ง Email luke@lukesteuber.com
๐Ÿฆ Bluesky @lukesteuber.com
๐Ÿ’ผ LinkedIn lukesteuber
๐Ÿ’ป GitHub lukeslp
๐Ÿง  LlamaLine AI CLI Tool
โœ‰๏ธ Newsletter Substack
โ˜• Support Tip Jar

๐ŸŽฏ What's Next

  1. Performance Optimization: Profile and optimize processing speeds
  2. Enhanced AI Features: More sophisticated content analysis
  3. Web Interface: Browser-based processing dashboard
  4. API Server: REST API for remote processing
  5. Plugin System: Custom processor plugins

Version: 0.8.0 - Prerelease
Last Updated: June 6, 2025
Status: ๐Ÿšง Prerelease - Testing & Feedback Welcome

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

cleanupx-0.8.1.tar.gz (82.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

cleanupx-0.8.1-py3-none-any.whl (86.9 kB view details)

Uploaded Python 3

File details

Details for the file cleanupx-0.8.1.tar.gz.

File metadata

  • Download URL: cleanupx-0.8.1.tar.gz
  • Upload date:
  • Size: 82.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.2

File hashes

Hashes for cleanupx-0.8.1.tar.gz
Algorithm Hash digest
SHA256 25799e421bab311b31cce7e792e9ebffaf2653787631e85460d743c16ff1cd00
MD5 3e9c7127152268d93ea0da4cc3424196
BLAKE2b-256 ed71f34068e7ea8a4ba272484a8212cd3f1661d47012838426b7285ffc15e51b

See more details on using hashes here.

File details

Details for the file cleanupx-0.8.1-py3-none-any.whl.

File metadata

  • Download URL: cleanupx-0.8.1-py3-none-any.whl
  • Upload date:
  • Size: 86.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.2

File hashes

Hashes for cleanupx-0.8.1-py3-none-any.whl
Algorithm Hash digest
SHA256 2b65e760864b6ba056792d6621fe0ca1fbebbf07e8334cca1f595ea3da82f7e0
MD5 03d9bc9f0592b5f55f353850d29f415c
BLAKE2b-256 fb5191cb31cd70baa789f97daf2bdba615a94224de9fb94aa8e3602ff6018f76

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page