Storage Analyzer & Cleanup Recommender
Project description
storageanalyser
Storage Analyzer & Cleanup Recommender for macOS and Windows.
Scans your home directory (and optionally other paths) to find large files, stale files, junk directories, build artifacts, duplicates, and old downloads. Outputs a prioritised list of cleanup recommendations with estimated space savings.
Platform-aware: automatically detects OS-specific cache locations, system directories, and generates the appropriate cleanup script (bash on macOS, PowerShell on Windows).
Installation
pip install storageanalyser
Usage
storageanalyser # Scan home directory
storageanalyser --top 30 # Show top 30 recommendations
storageanalyser --duplicates # Include duplicate detection
storageanalyser --json # JSON output
storageanalyser --threshold 50 # Flag files over 50 MB
storageanalyser --ignoredir node_modules # Skip directories
storageanalyser --includedir CloudStorage # Override a default-skipped directory
storageanalyser --list-skipped # Show directories skipped by default
storageanalyser --web # Launch the web interface
storageanalyser --web --port 9000 # Web interface on a custom port
Scan a specific drive or folder:
# macOS
storageanalyser /Volumes/ExternalDrive
# Windows
storageanalyser D:\Data
Platform Support
| Feature | macOS | Windows |
|---|---|---|
| CLI scanner | Yes | Yes |
| Web interface | Yes | Yes |
| OS-specific caches | Library/Caches, .Trash, Xcode, etc. | AppData/Local/Temp, CrashDumps, etc. |
| Cloud storage skip | iCloud, Google Drive, OneDrive, Dropbox | Google Drive, OneDrive, Dropbox |
| System dir skip | .Spotlight-V100, .fseventsd, Photos Library | $Recycle.Bin, System Volume Information |
| Cleanup script | Bash (.sh) | PowerShell (.ps1) |
| Duplicate detection | Yes | Yes |
| Google Drive integration | Yes | Yes |
Web Interface
Launch the web UI with storageanalyser --web. Features include:
- Live scan progress with cancel support
- Duplicate hashing progress bar
- Disk usage pie chart and summary cards
- Category breakdown bar chart
- Clickable treemap (click to jump to the recommendation)
- Recommendations organized by category tabs with per-tab sorting
- Short/full path toggle for readability
- Duplicate detection with full path listing and wasted space calculation
- Configurable skipped directories (override defaults to include cloud storage, etc.)
- Cleanup script download for selected items (bash or PowerShell)
- Google Drive integration and cross-environment deduplication
Screenshots
Scan form with configurable skipped directories:
Results summary with disk usage pie chart and summary cards:
Category breakdown and clickable treemap:
Recommendations organized by category tabs:
Duplicate detection with copy count and wasted space:
Skipped Directories
By default, certain directories are skipped during scans. The list is platform-specific
(cloud storage, OS system directories, managed libraries, etc.). Use --list-skipped
to see the defaults for your OS and --includedir to override:
storageanalyser --list-skipped
storageanalyser --includedir Music --includedir Movies
Documentation
Full documentation is available at storageanalyser.readthedocs.io.
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file storageanalyser-0.1.3.tar.gz.
File metadata
- Download URL: storageanalyser-0.1.3.tar.gz
- Upload date:
- Size: 744.6 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
5b5e6cf5b0d3e37893cf7e932fcb9aec53e8b4a804088679e977987c122e1063
|
|
| MD5 |
68952d97925f379445b7af43ad0763b9
|
|
| BLAKE2b-256 |
c3a9f8c1c2896706b5ea78d4d646ad61e6b4c9a42ea338db05cdf337b2ffd40c
|
Provenance
The following attestation bundles were made for storageanalyser-0.1.3.tar.gz:
Publisher:
publish.yml on jdrumgoole/storageanalyser
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
storageanalyser-0.1.3.tar.gz -
Subject digest:
5b5e6cf5b0d3e37893cf7e932fcb9aec53e8b4a804088679e977987c122e1063 - Sigstore transparency entry: 1203518449
- Sigstore integration time:
-
Permalink:
jdrumgoole/storageanalyser@b53a55d496db673939322b290e69298cac311f19 -
Branch / Tag:
refs/tags/v0.1.3 - Owner: https://github.com/jdrumgoole
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish.yml@b53a55d496db673939322b290e69298cac311f19 -
Trigger Event:
release
-
Statement type:
File details
Details for the file storageanalyser-0.1.3-py3-none-any.whl.
File metadata
- Download URL: storageanalyser-0.1.3-py3-none-any.whl
- Upload date:
- Size: 55.5 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
189493eead551f3c6ac8676bf3bee8e88f55d0037b00bb930990bff6e3ac63e6
|
|
| MD5 |
ccb67e9d8584ae2c0f24473fd402fb61
|
|
| BLAKE2b-256 |
6b5c183bcd9eb6384eb23918e0006ba97944ba747635d9ee697b46afa3525bd1
|
Provenance
The following attestation bundles were made for storageanalyser-0.1.3-py3-none-any.whl:
Publisher:
publish.yml on jdrumgoole/storageanalyser
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
storageanalyser-0.1.3-py3-none-any.whl -
Subject digest:
189493eead551f3c6ac8676bf3bee8e88f55d0037b00bb930990bff6e3ac63e6 - Sigstore transparency entry: 1203518452
- Sigstore integration time:
-
Permalink:
jdrumgoole/storageanalyser@b53a55d496db673939322b290e69298cac311f19 -
Branch / Tag:
refs/tags/v0.1.3 - Owner: https://github.com/jdrumgoole
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish.yml@b53a55d496db673939322b290e69298cac311f19 -
Trigger Event:
release
-
Statement type: