Skip to main content

Moooove your data with ease using our udderly simple pipeline framework.

Project description

Carabao

                                                           +:
   -                                                        *-
  -*                                                        +--
 -*                                                          *=:
-=*                                                         +=+-
-+*                                                        +==*-
++=*                                                      **=+-
 *==*+                                                 -**==*-
  -***=*++                                         -*=**=**-%
   %-****=*===----------#*%*##+%#@#**-*---------===**+=-%-#
      **=-***-=*==*==--****%#%%@*@@%#*-----+*=++%++++%%*
          *%%%#%####%##%*#%%=##@++%@%%######%%%%#%@
               + @%@@@@##%#=@#@@+++@@%
             --+++++##%@%#=@@+@@%%@#@#++++=%
           @+*@@@@@@##@##*=**+*@%%%@#@+@@#@++%
              #*       *#**##+#%%%*@ %%%%++++#+
                       *+****+#%%%*@
                       **=*==+#%##*
                        *=*=+*#%#*
                        #=*****#*
                       #@=***#@##
                       %%=%%%#=#%#
                       ####%%%#

GitHub

A Python library for building robust publisher-subscriber (pub/sub) frameworks with built-in lanes for common tasks.

Features

  • Core framework for managing pub/sub systems based on l2l (lane2lane)
  • Built-in lanes for:
    • Database logging (LogToDB) - Records exceptions to MongoDB
    • Network health monitoring (NetworkHealth) - Tracks network ping times
    • Environment variable display (PrettyEnv) - Formats environment variables for debugging
  • Automatic configuration management with settings system
  • Error handling with custom error handlers
  • Clean shutdown with exit handlers
  • Command-line interface for management, including interactive selection
  • Support for multiple database connections (MongoDB, Redis, Elasticsearch, PostgreSQL)
  • Development and production mode support
  • Test mode for safe testing in production environments

Installation

pip install carabao

Requirements

  • async-timeout
  • dnspython
  • fun-things
  • generic-lane
  • lazy-main
  • python-dotenv
  • simple-chalk
  • typing-extensions

Usage

Basic Usage

The framework is started using the CLI commands:

# For development mode
moo dev [queue_name]

# For production mode
moo run

No import statement is needed to start the framework.

Environment Variables

Carabao uses the following environment variables:

  • QUEUE_NAME: (Required) Name of the queue to consume
  • CARABAO_AUTO_INITIALIZE: Controls automatic initialization
  • CARABAO_AUTO_START: Controls automatic starting
  • CARABAO_START_WITH_ERROR: Whether to start even if errors occurred
  • SINGLE_RUN: Run once then exit if True
  • TESTING: Enable debug logging if True

Environment Files

Carabao supports environment variables loaded from .env files using python-dotenv:

  • .env.development: Used when running in development mode (moo dev)
  • .env.release: Used when running in production mode (moo run)
  • .env: Used as a fallback if neither of the above files exists

When initializing a new project with moo init, these files are automatically created.

The framework prioritizes environment variables in the following order:

  1. Variables defined in the system environment
  2. Variables defined in the appropriate .env file
  3. Default values defined in settings

This makes it easy to maintain different configurations for development, testing, and production environments without changing code.

Settings System

Carabao uses a centralized Settings system for configuration management. The Settings class provides a unified interface for accessing configuration values throughout the application.

Setting Up settings.py

A typical settings.py file inherits from the base Settings class:

from carabao import Settings as S


class Settings(S):
    # Directory where the lane modules are stored
    LANE_DIRECTORIES = [
        "lanes",
    ]

    # Whether to run the pipeline once and exit
    SINGLE_RUN = False

    # Minimum and maximum sleep times between runs (in seconds)
    SLEEP_MIN = 1.0
    SLEEP_MAX = 3.0

    # Whether to exit when processing is finished
    EXIT_ON_FINISH = False

    # Delay before exiting (in seconds)
    EXIT_DELAY = 0.0

    # Number of parallel processes to use
    PROCESSES = 1

    # Whether to deploy safely in production
    DEPLOY_SAFELY = True

    # Custom error handler function
    @classmethod
    def error_handler(cls, error: Exception) -> None:
        """
        Custom error handler for the application.

        Args:
            error: The exception that was raised.
        """
        print(f"An error occurred: {error}")

    @classmethod
    def before_start(cls) -> None:
        """
        Hook method called before framework startup.
        """
        # Perform any necessary initialization
        pass

When you run moo init, this file is automatically created for you in the appropriate location.

Settings Configuration

  1. carabao.cfg File: The framework uses a configuration file to locate your settings module:

    [directories]
    settings = src.settings  # or path.to.your.settings
    
  2. Accessing Settings in Code: To use these settings in your code:

    from carabao.settings import Settings
    
    settings = Settings.get()
    value = settings.value_of("LANE_DIRECTORIES")
    
  3. Available Settings: Common settings include:

    • LANE_DIRECTORIES: List of directories to search for lane definitions
    • SINGLE_RUN: Whether to run lanes once or continuously
    • SLEEP_MIN, SLEEP_MAX: Minimum and maximum sleep times between runs
    • EXIT_ON_FINISH: Whether to exit after finishing processing
    • EXIT_DELAY: Delay before exiting
    • PROCESSES: Number of parallel processes to use
    • DEPLOY_SAFELY: Whether to enforce production safety settings

    You can also define your own custom settings and access them the same way.

  4. Overriding Settings: Settings can be overridden by environment variables. For example, if your setting is named SINGLE_RUN, you can override it by setting the SINGLE_RUN environment variable.

CLI Usage

Carabao provides a command-line interface for managing lanes:

# Run in production mode
moo run [queue_name]

# Run in development mode
moo dev [queue_name]

# Initialize a new project
moo init [--skip]

# Create a new lane
moo new [lane_name]

The development mode (dev) command:

  • If no queue name is provided, displays an interactive curses-based menu to select from available lanes
  • Highlights the last run queue
  • Provides navigation with arrow keys
  • Allows selection with Enter key
  • Exit option at the bottom

Development

Creating a New Project

You can quickly initialize a new project with:

moo init

This will set up the necessary directory structure and configuration files.

Creating a New Lane

To create a new lane for processing:

moo new MyLaneName

This will generate a file with proper naming conventions (snake_case for the filename, PascalCase for the class name).

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

carabao-1.17.2.tar.gz (29.9 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

carabao-1.17.2-py3-none-any.whl (36.0 kB view details)

Uploaded Python 3

File details

Details for the file carabao-1.17.2.tar.gz.

File metadata

  • Download URL: carabao-1.17.2.tar.gz
  • Upload date:
  • Size: 29.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.10.17

File hashes

Hashes for carabao-1.17.2.tar.gz
Algorithm Hash digest
SHA256 28f758314dc4d0dc560c731d728ac9d539e406bf2dc52c33ed7cc7694294c066
MD5 4cfebe07211ca3957e8f793333b3188b
BLAKE2b-256 e54662ab8353d7207244eb4378bc0f3253cbf3a3b6daf943f2191c61da3e1d92

See more details on using hashes here.

File details

Details for the file carabao-1.17.2-py3-none-any.whl.

File metadata

  • Download URL: carabao-1.17.2-py3-none-any.whl
  • Upload date:
  • Size: 36.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.10.17

File hashes

Hashes for carabao-1.17.2-py3-none-any.whl
Algorithm Hash digest
SHA256 b8ba10dc76f3dc3b400b043d8c49f81ee009df8170e3eb563ce0c030cf76b7f4
MD5 fb8fb8ed28428b1bd3d707a83bd50026
BLAKE2b-256 99f6037ca042edf10ddbc943d014656475886fadc87523f538f254d368f9bd52

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page