Comprehensive AI-powered file processing and organization tool
cleanupx - Comprehensive File Processing Tool
Version 0.8.0 - Prerelease
A powerful, AI-enhanced file organization and processing framework with comprehensive capabilities for code analysis, image processing, file deduplication, and privacy utilities.
What's New in v0.8 (Prerelease)
- Complete Reorganization: Clean modular structure with core functionality separated from storage
- Enhanced Performance: Streamlined imports and optimized processing
- Backward Compatibility: All existing commands continue to work
- Unified Architecture: Consolidated scattered functionality into organized modules
- SSL Ready: Full HTTPS support with proper certificate configuration
- Consistent Branding: Unified "cleanupx" naming throughout the project
Project Structure
cleanupx/
├── cleanupx.py              # Main CLI interface
├── cleanupx_core/           # Core functionality
│   ├── api/                 # XAI API integration
│   ├── processors/
│   │   ├── integrated/      # New comprehensive processing
│   │   └── legacy/          # Backward compatibility
│   └── utils/               # Common utilities
├── storage/                 # Non-core functionality archive
│   ├── legacy_methods/      # Original processing methods
│   ├── dev_tools/           # Development utilities
│   └── documentation/       # Archive documentation
└── test/                    # Test files
Quick Start
Installation
# Install from PyPI (recommended)
pip install cleanupx
# Or install prerelease version
pip install --pre cleanupx
# Or clone from source
git clone https://github.com/lukeslp/cleanupx.git
cd cleanupx
pip install -r requirements.txt
# Set up environment (optional for AI features)
echo "XAI_API_KEY=your-xai-api-key" > .env
Basic Usage
# Show available commands and options
cleanupx --help
# Run file deduplication
cleanupx deduplicate --dir test
# Process images for accessibility
cleanupx images --dir test
# Comprehensive processing with all features
cleanupx comprehensive --dir test
# Privacy: scramble filenames
cleanupx scramble --dir test
Features
Core Processing
- File Deduplication: Smart duplicate detection and organization
- Code Analysis: Extract and analyze code snippets
- File Organization: Categorize and organize files by type and content
- Citation Processing: Extract and format citations from documents
AI-Powered Features
- X.AI Integration: Advanced AI processing with retry logic
- Image Alt Text: Generate accessibility descriptions for images
- Content Analysis: AI-powered content understanding and categorization
- Smart Deduplication: Intelligent duplicate detection beyond simple hashing
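The retry logic mentioned for the X.AI integration is not described in this README. The following is a generic sketch of retrying with exponential backoff and jitter, not the project's actual implementation. The helper name `call_with_retry` and all of its parameters are assumptions made for illustration.

```python
import random
import time
from typing import Callable, TypeVar

T = TypeVar("T")


def call_with_retry(
    func: Callable[[], T],
    max_attempts: int = 3,
    base_delay: float = 1.0,
    retry_on: tuple[type[BaseException], ...] = (Exception,),
    sleep: Callable[[float], None] = time.sleep,
) -> T:
    """Call `func`, retrying with exponential backoff and jitter on failure."""
    for attempt in range(1, max_attempts + 1):
        try:
            return func()
        except retry_on:
            if attempt == max_attempts:
                raise  # out of attempts: surface the last error
            # The delay doubles on each attempt: base, 2*base, 4*base, ...
            delay = base_delay * 2 ** (attempt - 1)
            # Add up to 10% random jitter so parallel clients do not retry in lockstep.
            sleep(delay + random.uniform(0, delay / 10))
    raise ValueError("max_attempts must be at least 1")
```

Any zero-argument callable can be wrapped this way, for example a `lambda` around an HTTP request. Passing a narrower `retry_on` tuple avoids retrying on errors that will never succeed, such as an invalid API key.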
Privacy & Utility
- Filename Scrambling: Randomize filenames for privacy/testing
- Rename Logging: Track and reverse filename changes
- Rich CLI Interface: Beautiful terminal output with progress bars
- Comprehensive Logging: Detailed operation logs
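To make the scramble-and-reverse idea concrete, here is a minimal sketch of renaming files to random names while writing a log that allows the change to be undone. The log filename `rename_log.json`, its JSON layout, and both function names are hypothetical. The format cleanupx actually uses is not documented here.

```python
import json
import secrets
from pathlib import Path

LOG_NAME = "rename_log.json"  # hypothetical log filename


def scramble_filenames(directory: Path) -> dict[str, str]:
    """Rename each file to a random name, keeping its extension.

    Saves and returns a mapping of new name -> original name.
    """
    mapping: dict[str, str] = {}
    # Snapshot the listing first so renamed files are not visited again.
    for path in sorted(directory.iterdir()):
        if not path.is_file() or path.name == LOG_NAME:
            continue
        new_name = secrets.token_hex(8) + path.suffix
        while (directory / new_name).exists():  # avoid clobbering a file
            new_name = secrets.token_hex(8) + path.suffix
        path.rename(directory / new_name)
        mapping[new_name] = path.name
    (directory / LOG_NAME).write_text(json.dumps(mapping, indent=2))
    return mapping


def restore_filenames(directory: Path) -> None:
    """Undo scramble_filenames using the saved log, then remove the log."""
    log_path = directory / LOG_NAME
    mapping = json.loads(log_path.read_text())
    for new_name, original in mapping.items():
        (directory / new_name).rename(directory / original)
    log_path.unlink()
```

Note that the log itself reveals the original names, so for real privacy use it would need to be stored separately from the scrambled files.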
Available Commands
Primary Commands
comprehensive # Full processing with all features
images # AI-powered image processing and alt text generation
scramble # Privacy-focused filename scrambling
Legacy Commands (Backward Compatible)
deduplicate # Find and process duplicate files
extract # Extract important code snippets
organize # Organize and rename files
all # Run all legacy processing steps
Configuration
Environment Variables
# Required for AI features
XAI_API_KEY=your-xai-api-key
# Optional configurations
CLEANUP_OUTPUT_DIR=custom_output_directory
CLEANUP_LOG_LEVEL=INFO
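As an illustration of how a script might consume these variables, the sketch below reads them into a small settings object. The `Settings` class, the `load_settings` function, and the fallback values (`"output"` and `"INFO"`) are assumptions for this example. The defaults cleanupx applies internally are not stated in this README.

```python
import os
from dataclasses import dataclass
from typing import Mapping, Optional


@dataclass(frozen=True)
class Settings:
    api_key: Optional[str]  # None means AI features cannot be used
    output_dir: str
    log_level: str


def load_settings(env: Mapping[str, str] = os.environ) -> Settings:
    """Read the documented variables, falling back to assumed defaults."""
    return Settings(
        # Treat an empty string the same as an unset key.
        api_key=env.get("XAI_API_KEY") or None,
        # "output" is an assumed default, not a documented one.
        output_dir=env.get("CLEANUP_OUTPUT_DIR", "output"),
        log_level=env.get("CLEANUP_LOG_LEVEL", "INFO").upper(),
    )
```

Accepting the environment as a parameter keeps the function easy to test: a plain dictionary can stand in for `os.environ`.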
Dependencies
Core Requirements
requests>=2.31.0 # HTTP requests
rich>=13.7.0 # Beautiful console output
inquirer>=3.4.0 # Interactive prompts
pillow>=10.0.0 # Image processing
PyPDF2>=3.0.1 # PDF processing
python-docx>=1.1.2 # Word document processing
Optional Dependencies
openai # OpenAI API fallback
PyHEIF # HEIC/HEIF image support
rarfile # RAR archive processing
Architecture
Modular Design
cleanupx is built with a clean, modular architecture:
- cleanupx_core/: Core functionality with stable APIs
- storage/: Non-essential functionality for experimentation
- Processors: Specialized processing modules for different file types
- API Layer: Unified interface for AI service integration
Key Benefits
- Clean Separation: Core vs. experimental functionality
- Backward Compatibility: Legacy commands continue to work
- Extensible: Easy to add new processors and features
- Production Ready: Robust error handling and logging
- Organized Output: Centralized output management
Supported File Types
- Images: .jpg, .jpeg, .png, .gif, .webp, .bmp, .tiff
- Code: .py, .js, .html, .css, .md, .txt, .json, .yaml
- Documents: .pdf, .doc, .docx, .rtf, .pptx
- Archives: .zip, .tar, .gz (with optional RAR support)
- All Others: Categorized and processed appropriately
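The extension lists above map naturally onto a lookup table. Here is a minimal sketch of categorizing a file by its suffix. The category keys and the `categorize` function are hypothetical, and cleanupx may also inspect file content, which this example does not attempt.

```python
from pathlib import Path

# Extensions copied from the list above; the category keys are illustrative.
CATEGORIES = {
    "images": {".jpg", ".jpeg", ".png", ".gif", ".webp", ".bmp", ".tiff"},
    "code": {".py", ".js", ".html", ".css", ".md", ".txt", ".json", ".yaml"},
    "documents": {".pdf", ".doc", ".docx", ".rtf", ".pptx"},
    "archives": {".zip", ".tar", ".gz"},
}


def categorize(path: Path) -> str:
    """Return the category for a file based on its case-insensitive suffix."""
    suffix = path.suffix.lower()
    for category, extensions in CATEGORIES.items():
        if suffix in extensions:
            return category
    return "other"
```

Only the final suffix is considered, so `backup.tar.gz` is matched through `.gz`, and files without an extension fall into `"other"`.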
Status Check
# Verify module status
python3 -c "import cleanupx_core; cleanupx_core.print_status()"
Expected output:
cleanupx Core v0.8.0
Integrated Processors: ✓
XAI API Support: ✓
Legacy Processors: ✓
Module Path: /path/to/cleanupx_core
Examples
Basic File Organization
# Organize a downloads directory
cleanupx organize --dir ~/Downloads
# Find duplicates in a project
cleanupx deduplicate --dir ~/Projects/MyProject
AI-Enhanced Processing
# Generate alt text for all images
cleanupx images --dir ./photos
# Comprehensive AI analysis
cleanupx comprehensive --dir ./documents
Privacy & Testing
# Scramble filenames for privacy
cleanupx scramble --dir ./sensitive_data
# Note: Scrambling creates a log file to reverse changes
Development
Module Status
The reorganized architecture provides:
- Stable Core: cleanupx_core/ for production functionality
- Experimental Storage: storage/ for development and testing
- Clear APIs: Well-defined interfaces between modules
- Easy Testing: Modular design enables easy unit testing
Contributing
- Core functionality goes in cleanupx_core/
- Experimental features start in storage/dev_tools/
- All changes must maintain backward compatibility
- Add comprehensive tests for new features
License & Credits
MIT License by Luke Steuber
Connect & Support
| Platform | Link |
|---|---|
| Website | lukesteuber.com |
| Playground | assisted.site |
| Email | luke@lukesteuber.com |
| Bluesky | @lukesteuber.com |
| LinkedIn | lukesteuber |
| GitHub | lukeslp |
| LlamaLine | AI CLI Tool |
| Newsletter | Substack |
| Support | Tip Jar |
What's Next
- Performance Optimization: Profile and optimize processing speeds
- Enhanced AI Features: More sophisticated content analysis
- Web Interface: Browser-based processing dashboard
- API Server: REST API for remote processing
- Plugin System: Custom processor plugins
Version: 0.8.0 - Prerelease
Last Updated: June 6, 2025
Status: Prerelease - Testing & Feedback Welcome