Skip to main content

syncengine backend for file synchronization

Project description

syncengine

A flexible, bidirectional file synchronization engine for Python that supports multiple sync modes and conflict resolution strategies.

What is syncengine?

syncengine is a powerful file synchronization library that enables you to keep files synchronized between two locations (source and destination). Unlike simple copy operations, syncengine intelligently tracks file state, detects changes, handles conflicts, and provides multiple synchronization modes to fit different use cases.

Why is syncengine useful?

Real-world Use Cases

  1. Cloud Storage Synchronization

    • Sync local files with cloud storage (Dropbox, Google Drive, S3, etc.)
    • Implement custom cloud backup solutions
    • Build your own sync client with fine-grained control
  2. Backup Management

    • Create one-way backup systems (never delete backed-up files)
    • Implement versioned backup strategies
    • Maintain disaster recovery copies
  3. Development Workflows

    • Sync code between local development and remote servers
    • Mirror files to multiple deployment targets
    • Keep test environments synchronized with production data
  4. Content Distribution

    • Distribute files from a master source to multiple destinations
    • Keep documentation or assets synchronized across systems
    • Manage multi-site content updates

Key Features

  • Multiple Sync Modes: Choose the behavior that fits your needs

    • TWO_WAY: Bidirectional sync with conflict detection
    • SOURCE_TO_DESTINATION: Mirror source to destination (typical one-way sync)
    • SOURCE_BACKUP: Protect source from deletions (upload-only backup)
    • DESTINATION_TO_SOURCE: Mirror destination to source (cloud download)
    • DESTINATION_BACKUP: Protect local backup from remote changes
  • Intelligent Change Detection

    • Tracks file modifications via timestamps and sizes
    • Detects renames and moves
    • Identifies conflicts when both sides change
  • Flexible Conflict Resolution

    • Newest file wins
    • Source always wins
    • Destination always wins
    • Manual conflict handling
  • State Management

    • Persistent state tracking across sync sessions
    • Resume interrupted syncs
    • Detect changes since last sync
  • Pattern-based Filtering

    • Gitignore-style ignore patterns
    • Include/exclude specific files or directories
    • Control what gets synchronized
  • Protocol Agnostic

    • Works with any storage backend (local, S3, FTP, custom protocols)
    • Pluggable storage interface
    • Easy to extend for new storage types

Quick Example

from syncengine import SyncEngine, SyncMode, LocalStorageClient, SyncPair

# Create storage clients
source = LocalStorageClient("/path/to/source")
destination = LocalStorageClient("/path/to/destination")

# Create sync engine
engine = SyncEngine(mode=SyncMode.TWO_WAY)

# Create sync pair
pair = SyncPair(
    source_root="/path/to/source",
    destination_root="/path/to/destination",
    source_client=source,
    destination_client=destination
)

# Perform sync
stats = engine.sync_pair(pair)
print(f"Uploaded: {stats['uploads']}, Downloaded: {stats['downloads']}")

When to Use Each Sync Mode

Mode Use Case Source Changes Destination Changes Deletions
TWO_WAY Keep both sides in sync Upload Download Propagated both ways
SOURCE_TO_DESTINATION Mirror source to destination Upload Ignored (deleted) Propagated to destination
SOURCE_BACKUP Backup source, never delete Upload Download Never delete source
DESTINATION_TO_SOURCE Mirror destination to source Ignored (deleted) Download Propagated to source
DESTINATION_BACKUP Backup from destination Ignored Download Never delete local backup

Installation

pip install syncengine

Or for development:

git clone https://github.com/holgern/syncengine
cd syncengine
pip install -e .

Benchmarks

The project includes comprehensive benchmarks that test all sync modes with various scenarios. See benchmarks/README.md for details.

Run benchmarks:

python benchmarks/run_benchmarks.py

Documentation

For detailed API documentation, see the individual module docstrings:

  • syncengine/engine.py - Main sync engine
  • syncengine/modes.py - Sync mode definitions
  • syncengine/comparator.py - Change detection logic
  • syncengine/protocols.py - Storage protocol interfaces
  • syncengine/config.py - Configuration options

Contributing

Contributions are welcome! Please ensure:

  • Tests pass: pytest tests/
  • Benchmarks pass: python benchmarks/run_benchmarks.py
  • Code follows project style

License

See LICENSE file for details.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

syncengine-0.1.0.tar.gz (118.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

syncengine-0.1.0-py3-none-any.whl (68.5 kB view details)

Uploaded Python 3

File details

Details for the file syncengine-0.1.0.tar.gz.

File metadata

  • Download URL: syncengine-0.1.0.tar.gz
  • Upload date:
  • Size: 118.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.9.25

File hashes

Hashes for syncengine-0.1.0.tar.gz
Algorithm Hash digest
SHA256 c81141c54b205c0d4c3b56707d8ec8eef7b8e313b7522e451f7e0be03ef34df8
MD5 9e34dfd1e697ac4a538e20359a0d79e4
BLAKE2b-256 2335cae00802e6d1079041a609ab5f2154c9ce43be7fb77054462e5b1d48f634

See more details on using hashes here.

File details

Details for the file syncengine-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: syncengine-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 68.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.9.25

File hashes

Hashes for syncengine-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 ba732f585b56638bf959016af92dc928b7383dcc8120c65178d570d7bbc4919a
MD5 ad1685eaa1b1c188d8995abf9c212f43
BLAKE2b-256 961fcc6435d1438a5ceaccb85956585b7aeba10a156cc8de7686a0b4638c9bcd

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page