A utility to sort files by type and date.
Project description
Sortium
Sortium is a high-performance Python utility for rapidly organizing file systems. It emphasizes a safe, preview-first workflow that lets you plan and review categorized moves (by type, date, or regex) before anything changes on disk.
Designed for both speed and safety, it is memory-efficient for handling massive directories and automatically prevents file overwrites.
Table of Contents
- Key Features
- Installation
- Getting Started: Usage Examples
- Command Line Usage
- Running Tests
- Documentation
- Contributing
- Author
- License
Key Features
- Plan-first workflow – Every sort emits an editable JSON plan so you can audit, tweak, version, or share the intended moves before running them.
- Memory-efficient design – Uses generators and streaming I/O so it scales to very large trees without exhausting RAM.
- Flexible sorting strategies – Built-in helpers for sorting by file type, modification date, or arbitrary regex patterns.
- Collision-safe moves – Automatically generates unique destination names (e.g.,
image (1).jpg) to avoid overwriting files. - In-place or cross-volume moves – Choose to tidy a directory in situ or relocate everything into a dedicated archive folder.
- Utility toolkit –
FileUtilsexposes recursive scanners, directory flattening, tree export, and reversible plan execution.
Installation
From PyPI
To install the latest stable version from PyPI:
pip install sortium
From Source
To install the latest development version from the repository:
git clone https://github.com/Sarthak-G0yal/Sortium.git
cd Sortium
pip install -e .
Getting Started: Usage Examples
Here are a few examples to get you started quickly.
Example 1: Sort Files by Type
This is the most common use case. It now works in two phases: generate a plan, review/edit the JSON (optional), then apply it when you're ready.
from sortium.sorter import Sorter
# The folder you want to clean up
source_directory = "./my_messy_downloads_folder"
# Create a Sorter instance
sorter = Sorter()
# Phase 1: Generate an editable JSON plan
plan_path = sorter.sort_by_type(
source_directory,
recursive=True, # include nested folders
)
# (Optional) Inspect / edit the JSON plan here
# ...
# Phase 2: Apply the plan when you're satisfied
sorter.file_utils.apply_move_plan(str(plan_path))
print(f"Applied plan {plan_path}")
# Need to undo? Re-use the same plan with reverse=True
sorter.file_utils.apply_move_plan(str(plan_path), reverse=True)
# Prefer a shallow cleanup? Drop recursive=True (it defaults to False).
Example 2: Sort Files to a Different Destination
Organize files from a source folder and move the categorized results to a completely different location.
from sortium.sorter import Sorter
source_dir = "./my_source_files"
destination_dir = "./organized_archive"
sorter = Sorter()
# Generate plan targeted at `destination_dir`
plan_path = sorter.sort_by_type(
source_dir,
dest_folder_path=destination_dir,
plan_output="./sorting_plan.json",
recursive=True,
)
# Review/edit sorting_plan.json if needed, then execute
sorter.file_utils.apply_move_plan(str(plan_path))
Example 3: Advanced Sorting with Regex
Recursively scan a directory and sort files based on custom patterns. This is great for organizing project files, logs, or datasets.
from sortium.sorter import Sorter
project_folder = "./my_data_science_project"
sorted_output = "./sorted_project_files"
# Define categories and their corresponding regex patterns
regex_map = {
"Datasets": r".*\.csv$",
"Notebooks": r".*\.ipynb$",
"Python_Code": r".*\.py$",
"Final_Reports": r"final_report_.*\.pdf$"
}
sorter = Sorter()
plan_path = sorter.sort_by_regex(project_folder, regex_map, sorted_output)
sorter.file_utils.apply_move_plan(str(plan_path))
```
---
## Command Line Usage
Sortium now ships with a `sortium` CLI so you can work entirely from the terminal.
```bash
# Generate a recursive type-based plan and write it to Downloads
sortium plan type --source ./Downloads --dest ./Downloads/Sorted --recursive
# Apply or undo the plan later
sortium apply --plan ./Downloads/sortium_plan_type_20250101_101010.json
sortium undo --plan ./Downloads/sortium_plan_type_20250101_101010.json
# Produce a tree snapshot for auditing
sortium tree --source ./Downloads --output ./downloads_structure.json
```
Run `sortium --help` or `sortium plan --help` to explore every flag (strategies,
regex rules, folder-specific date sorting, dry-run previews, etc.).
---
## Running Tests
To run the full test suite and generate a coverage report, first install the development dependencies:
```bash
pip install pytest pytest-cov
Then, from the project's root directory, run:
pytest --cov=sortium
For more details on the test structure, see the Test Suite README.
Documentation
This project uses Sphinx for documentation.
-
Online Documentation: View Documentation
-
To build the documentation locally:
# Navigate to the docs directory cd docs # Install documentation requirements pip install -r requirements.txt # Build the HTML pages make html
View the generated files at
docs/_build/html/index.html.
Contributing
Contributions are welcome! Please follow these steps to contribute:
- Fork the repository.
- Create a new branch for your feature or fix (
feature/my-featureorfix/my-fix). - Write tests that cover your changes.
- Commit your changes using clear, conventional messages.
- Open a pull request with a detailed description of your work.
Please follow the Conventional Commits specification. Ensure all code is linted and tested before submitting.
- For bugs and feature requests, please open an issue.
Author
Sarthak Goyal
- Email: sarthakgoyal487@gmail.com
License
This project is licensed under the GNU General Public License v3.0.
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file sortium-2.2.3.tar.gz.
File metadata
- Download URL: sortium-2.2.3.tar.gz
- Upload date:
- Size: 31.7 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
5f8232c303cc60c398051172e2c2b9f327c2f71f1e75edb92fe770f09a854371
|
|
| MD5 |
9c56ec95190fa8d732816abc0eb004f7
|
|
| BLAKE2b-256 |
8319215ba0879160b98c8e85e65dae28697ad2b3f4248cf1751c2fc80747aba8
|
Provenance
The following attestation bundles were made for sortium-2.2.3.tar.gz:
Publisher:
release.yaml on Sarthak-G0yal/Sortium
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
sortium-2.2.3.tar.gz -
Subject digest:
5f8232c303cc60c398051172e2c2b9f327c2f71f1e75edb92fe770f09a854371 - Sigstore transparency entry: 741470995
- Sigstore integration time:
-
Permalink:
Sarthak-G0yal/Sortium@3ce82ab62839a194a038c110ea64653f0209b5c7 -
Branch / Tag:
refs/tags/2.2.3 - Owner: https://github.com/Sarthak-G0yal
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
release.yaml@3ce82ab62839a194a038c110ea64653f0209b5c7 -
Trigger Event:
push
-
Statement type:
File details
Details for the file sortium-2.2.3-py3-none-any.whl.
File metadata
- Download URL: sortium-2.2.3-py3-none-any.whl
- Upload date:
- Size: 35.7 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
e40b3c5428c95bd176c84f596e58c1855a4d31fef2530ee88c3a990edbd6b259
|
|
| MD5 |
9840d4bbc72eb0cca436ca3693db9773
|
|
| BLAKE2b-256 |
2b3272ffca4f7d9152be5fa63d3d013c400b1cf7cc9188201789b6d4e471d859
|
Provenance
The following attestation bundles were made for sortium-2.2.3-py3-none-any.whl:
Publisher:
release.yaml on Sarthak-G0yal/Sortium
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
sortium-2.2.3-py3-none-any.whl -
Subject digest:
e40b3c5428c95bd176c84f596e58c1855a4d31fef2530ee88c3a990edbd6b259 - Sigstore transparency entry: 741471000
- Sigstore integration time:
-
Permalink:
Sarthak-G0yal/Sortium@3ce82ab62839a194a038c110ea64653f0209b5c7 -
Branch / Tag:
refs/tags/2.2.3 - Owner: https://github.com/Sarthak-G0yal
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
release.yaml@3ce82ab62839a194a038c110ea64653f0209b5c7 -
Trigger Event:
push
-
Statement type: