Good Kiwi Common Library

Project description

good-common

A small set of common dependencies for Good Kiwi.

Dependency Provider

BaseProvider is a base class for creating fast_depends (so FastAPI and FastStream compatible) dependency providers.

class APIClient:
    def __init__(self, api_key: str):
        self.api_key = api_key

    def get(self, url: str):
        return f"GET {url} with {self.api_key}"

class APIClientProvider(BaseProvider[APIClient], APIClient):
    pass


from fast_depends import inject

@inject
def some_task(
    api_client: APIClient = APIClientProvider(api_key="1234"),
):
    return api_client.get("https://example.com")

Can also be used without fast_depends:

client = APIClientProvider(api_key="1234").get()

Override initializer to customize how the dependency class is initialized.

class APIClientProvider(BaseProvider[APIClient], APIClient):
    def initializer(
        self,
        cls_args: typing.Tuple[typing.Any, ...],  # args passed to the Provider
        cls_kwargs: typing.Dict[str, typing.Any],  # kwargs passed to the Provider
        fn_kwargs: typing.Dict[str, typing.Any],  # kwargs passed to the function at runtime
    ):
        return cls_args, {**cls_kwargs, **fn_kwargs}  # override the api_key with the one passed to the function


@inject
def some_task(
    api_key: str,
    api_client: APIClient = APIClientProvider(),
):
    return api_client.get("https://example.com")


some_task(api_key="5678")

Pipeline

Overview

The Pipeline library provides a flexible and efficient way to create and execute pipelines of components in Python. It supports both synchronous and asynchronous execution, type checking, parallel processing, and error handling.

Features

Create pipelines with multiple components that can accept multiple inputs and produce multiple outputs
Typed "channels" for passing data between components
Support for both synchronous and asynchronous components
Type checking for inputs and outputs using Python type annotations
Parallel execution of pipeline instances
Error handling with Result types
Function mapping for flexible component integration

Quick Start

from typing import Annotated
from good_common.pipeline import Pipeline, Attribute

def add(a: int, b: int) -> Annotated[int, Attribute("result")]:
    return a + b

def multiply(result: int, factor: int) -> Annotated[int, Attribute("result")]:
    return result * factor

# Create a pipeline
my_pipeline = Pipeline(add, multiply)

# Execute the pipeline
result = await my_pipeline(a=2, b=3, factor=4)
print(result.result)  # Output: 20

Usage

Creating a Pipeline

Use the Pipeline class to create a new pipeline:

from pipeline import Pipeline

my_pipeline = Pipeline(component1, component2, component3)

Defining Components

Components can be synchronous or asynchronous functions:

from typing import Annotated
from pipeline import Attribute

def sync_component(x: int) -> Annotated[int, Attribute("result")]:
    return x + 1

async def async_component(x: int) -> Annotated[int, Attribute("result")]:
    await asyncio.sleep(0.1)
    return x * 2

Executing a Pipeline

Execute a pipeline asynchronously:

result = await my_pipeline(x=5)
print(result.result)

Parallel Execution

Execute a pipeline with multiple inputs in parallel:

inputs = [{"a": 1, "b": 2, "factor": 2}, {"a": 2, "b": 3, "factor": 3}]
results = [result async for result in my_pipeline.execute(*inputs, max_workers=3)]

for result in results:
    if result.is_ok():
        print(result.unwrap().result)
    else:
        print(f"Error: {result.unwrap_err()}")

Error Handling

The pipeline handles errors gracefully in parallel execution:

def faulty_component(x: int) -> Annotated[int, Attribute("result")]:
    if x == 2:
        raise ValueError("Error on purpose!")
    return x + 1

pipeline = Pipeline(faulty_component)
inputs = [{"x": 1}, {"x": 2}, {"x": 3}]
results = [result async for result in pipeline.execute(*inputs)]

for result in results:
    if result.is_ok():
        print(result.unwrap().result)
    else:
        print(f"Error: {result.unwrap_err()}")

Function Mapping

Use function_mapper to adjust input parameter names:

from pipeline import function_mapper

def multiply_diff(difference: int, factor: int) -> Annotated[int, Attribute("result")]:
    return difference * factor

pipeline = Pipeline(subtract, function_mapper(multiply_diff, diff="difference"))

Advanced Features

Mixed synchronous and asynchronous components in a single pipeline
Custom output types with Attribute annotations
Flexible error handling in both single and parallel executions

URL Plugin System

The URL class in good-common now supports a plugin system for extending URL processing capabilities without modifying the core library.

Features

Extend URL canonicalization rules
Add custom tracking parameters to filter
Define domain-specific processing rules
Add URL classification patterns
Register short URL providers and bio link domains
Apply custom URL transformations

Built-in Plugins

Good-common includes several built-in plugins for common use cases:

ECommerceURLPlugin

Handles e-commerce website URLs (Amazon, eBay, Etsy, AliExpress, etc.)

Removes tracking parameters like ref, hash, _trkparms
Preserves product identifiers and search parameters
Transforms mobile URLs to desktop versions
Classifies product pages, search results, shopping carts

AnalyticsTrackingPlugin

Removes analytics and tracking parameters from all major platforms

Google Analytics (utm_*, gclid, etc.)
Facebook (fbclid, fb_*)
Microsoft/Bing (msclkid)
Email marketing (mc_cid, _hsenc, mkt_tok)
Social media tracking parameters
Preserves content identifiers and navigation parameters

VideoStreamingPlugin

Handles video platform URLs (YouTube, Vimeo, Twitch, etc.)

Removes tracking parameters like feature, ab_channel
Preserves video IDs, timestamps, and playlist information
Transforms mobile YouTube URLs to desktop
Classifies video pages, channels, playlists

SearchEnginePlugin

Processes search engine URLs (Google, Bing, DuckDuckGo)

Removes search tracking parameters (ved, ei, source)
Preserves search queries and result types
Overrides built-in disable rules for Google
Classifies different search types (images, videos, maps)

DocumentSharingPlugin

Handles document and cloud storage platforms (Google Drive/Docs, Dropbox, Box)

Removes sharing tracking parameters (usp, dl, raw)
Preserves document identifiers and view settings
Classifies different document types

Using Built-in Plugins

from good_common.types.builtin_plugins import load_builtin_plugins

# Load all built-in plugins
load_builtin_plugins()

# Load specific plugins only
load_builtin_plugins(["ecommerce", "analytics", "video"])

# Use enhanced URL processing
url = URL("https://www.amazon.com/dp/B123?ref=sr&utm_source=google")
canonical = url.canonicalize()  # Removes both ref and utm_source

Creating a Plugin

from good_common.types import URLPlugin
import re

class MyURLPlugin(URLPlugin):
    def get_tracking_params(self) -> Set[str]:
        """Additional tracking parameters to remove during canonicalization."""
        return {"my_tracking_id", "custom_ref"}
    
    def get_canonical_params(self) -> Set[str]:
        """Parameters that should be preserved."""
        return {"article_id", "product_id"}
    
    def get_domain_rules(self) -> Dict[str, Dict[str, Any]]:
        """Domain-specific canonicalization rules."""
        return {
            r".*\.mysite\.com": {
                "canonical": {"id", "page"},
                "non_canonical": {"session", "temp"},
                "force_www": True,
            }
        }
    
    def get_short_url_providers(self) -> Set[str]:
        """Additional short URL domains."""
        return {"mylink.co", "short.link"}
    
    def get_classification_patterns(self) -> Dict[str, Pattern]:
        """Custom URL classification patterns."""
        return {
            "product_page": re.compile(r"/products?/[\w-]+"),
            "category_page": re.compile(r"/categor(y|ies)/[\w-]+"),
        }
    
    def transform_url(self, url: 'URL', config: 'UrlParseConfig') -> Optional['URL']:
        """Apply custom URL transformations."""
        from good_common.types import URL
        
        # Example: Rewrite mobile URLs to desktop
        if url.host == "m.mysite.com":
            return URL.build(
                scheme="https",
                host="www.mysite.com",
                path=url.path,
                query=url.query_params(format="plain", flat_delimiter=","),
            )
        return None

Registering Plugins

Method 1: Entry Points (Recommended for Packages)

Add to your package's pyproject.toml:

[project.entry-points."good_common.url_plugins"]
my_plugin = "my_package.plugins:MyURLPlugin"
social_media = "my_package.plugins:SocialMediaPlugin"

Plugins registered via entry points are automatically loaded when the good-common module is imported.

Method 2: Direct Registration

from good_common.types import URL, URLPlugin

class MyPlugin(URLPlugin):
    # ... implementation ...

# Register at class level
URL.register_plugin(MyPlugin())

# Or use the global registry
from good_common.types import url_plugin_registry
url_plugin_registry.register(MyPlugin())

Method 3: Runtime Registration

from good_common.types import URL

# Create and register a plugin at runtime
plugin = MyURLPlugin()
URL.register_plugin(plugin)

# Use the enhanced URL functionality
url = URL("https://example.com/page?my_tracking_id=123&article_id=456")
canonical = url.canonicalize()  # my_tracking_id will be removed, article_id preserved

# Check custom classifications
classifications = url.classify()
if classifications.get("product_page"):
    print("This is a product page")

# Unregister when done
URL.unregister_plugin(plugin)

Example Plugins

The library includes example plugins in good_common.types.example_plugin:

SocialMediaURLPlugin: Handles social media specific parameters and transformations
NewsMediaURLPlugin: Manages news site tracking parameters and classifications

from good_common.types.example_plugin import SocialMediaURLPlugin

# Use the pre-built social media plugin
plugin = SocialMediaURLPlugin()
URL.register_plugin(plugin)

# Now URLs from social media sites will be processed with specialized rules
url = URL("https://instagram.com/p/ABC123?igshid=tracker")
canonical = url.canonicalize()  # igshid parameter will be removed

Performance Considerations

Plugins are designed with minimal overhead (<10% when registered)
Plugin data is cached for efficiency
Lazy loading ensures plugins only impact performance when used
Use entry points for automatic loading or register manually for fine control

Utilities

Various utility functions for common tasks.

Look at /tests/good_common/utilities for usage

Project details

Release history Release notifications | RSS feed

1.5.2.dev0 pre-release

Dec 3, 2025

1.5.1

Dec 3, 2025

1.5.1.dev0 pre-release

Nov 26, 2025

1.5.0

Nov 26, 2025

1.4.5.dev0 pre-release

Nov 19, 2025

1.4.4

Nov 19, 2025

1.4.4.dev0 pre-release

Nov 17, 2025

1.4.3

Nov 17, 2025

1.4.3.dev0 pre-release

Nov 17, 2025

1.4.2

Nov 17, 2025

1.4.1.dev0 pre-release

Nov 17, 2025

1.4.0

Nov 17, 2025

1.3.1.dev0 pre-release

Nov 11, 2025

1.3.0

Nov 11, 2025

1.2.2.dev0 pre-release

Oct 21, 2025

1.2.1

Oct 21, 2025

1.2.1.dev0 pre-release

Oct 20, 2025

1.2.0

Oct 20, 2025

1.1.2.dev0 pre-release

Oct 13, 2025

1.1.1

Oct 13, 2025

1.0.1.dev0 pre-release

Sep 11, 2025

This version

1.0.0

Sep 11, 2025

0.6.4.dev0 pre-release

Sep 6, 2025

0.6.3

Sep 6, 2025

0.6.3.dev0 pre-release

Sep 5, 2025

0.6.2

Sep 5, 2025

0.6.2.dev0 pre-release

Sep 5, 2025

0.6.1

Sep 5, 2025

0.6.1.dev0 pre-release

Sep 5, 2025

0.6.0

Sep 5, 2025

0.4.1.dev0 pre-release

Sep 4, 2025

0.4.0

Sep 4, 2025

0.3.5

Sep 19, 2025

0.3.4

Aug 14, 2025

0.3.3

Aug 8, 2025

0.3.2

Jul 23, 2025

0.3.1

Jul 22, 2025

0.3

Jul 21, 2025

0.2.31

Jul 16, 2025

0.2.29

Jul 14, 2025

0.2.28

Jul 7, 2025

0.2.27

Jul 7, 2025

0.2.26

Jul 3, 2025

0.2.25

Jul 3, 2025

0.2.24

Jun 30, 2025

0.2.23

Jun 28, 2025

0.2.22

Jun 17, 2025

0.2.21

Jun 5, 2025

0.2.20

Jun 3, 2025

0.2.19

May 30, 2025

0.2.18

May 30, 2025

0.2.17

May 27, 2025

0.2.17a0 pre-release

May 27, 2025

0.2.16

May 21, 2025

0.2.15

May 16, 2025

0.2.14

May 14, 2025

0.2.13

May 14, 2025

0.2.12

May 12, 2025

0.2.10

May 5, 2025

0.2.8

Mar 19, 2025

0.2.6

Mar 2, 2025

0.2.5 yanked

Mar 2, 2025

0.2.5a0 pre-release yanked

Mar 2, 2025

0.2.4

Mar 2, 2025

0.2.3

Feb 27, 2025

0.2.2

Feb 25, 2025

0.2.1

Feb 24, 2025

0.2.0

Dec 6, 2024

0.1.17

Oct 18, 2024

0.1.16

Sep 30, 2024

0.1.15

Sep 25, 2024

0.1.14

Sep 25, 2024

0.1.13

Sep 20, 2024

0.1.12.1

Sep 18, 2024

0.1.12

Sep 16, 2024

0.1.11

Sep 13, 2024

0.1.10

Sep 11, 2024

0.1.9.6

Sep 3, 2024

0.1.9.5

Aug 30, 2024

0.1.9.4

Aug 29, 2024

0.1.9.3

Aug 29, 2024

0.1.9.2

Aug 27, 2024

0.1.9.1

Aug 27, 2024

0.1.9

Aug 27, 2024

0.1.8

Aug 23, 2024

0.1.7.2

Aug 23, 2024

0.1.7.1

Aug 23, 2024

0.1.7

Aug 23, 2024

0.1.6

Aug 23, 2024

0.1.5

Aug 22, 2024

0.1.4

Aug 22, 2024

0.1.3

Aug 22, 2024

0.1.3a0 pre-release

Aug 22, 2024

0.1.2

Aug 22, 2024

0.1.1

Aug 14, 2024

0.1.0

Aug 13, 2024

0.0.0

Sep 4, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

good_common-1.0.0-cp313-cp313-win_amd64.whl (857.4 kB view details)

Uploaded Sep 11, 2025 CPython 3.13Windows x86-64

good_common-1.0.0-cp313-cp313-manylinux2014_x86_64.manylinux_2_17_x86_64.manylinux_2_28_x86_64.whl (2.3 MB view details)

Uploaded Sep 11, 2025 CPython 3.13manylinux: glibc 2.17+ x86-64manylinux: glibc 2.28+ x86-64

good_common-1.0.0-cp313-cp313-manylinux2014_aarch64.manylinux_2_17_aarch64.manylinux_2_28_aarch64.whl (2.3 MB view details)

Uploaded Sep 11, 2025 CPython 3.13manylinux: glibc 2.17+ ARM64manylinux: glibc 2.28+ ARM64

good_common-1.0.0-cp313-cp313-macosx_10_13_universal2.whl (1.2 MB view details)

Uploaded Sep 11, 2025 CPython 3.13macOS 10.13+ universal2 (ARM64, x86-64)

File details

Details for the file good_common-1.0.0-cp313-cp313-win_amd64.whl.

File metadata

Download URL: good_common-1.0.0-cp313-cp313-win_amd64.whl
Upload date: Sep 11, 2025
Size: 857.4 kB
Tags: CPython 3.13, Windows x86-64
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for good_common-1.0.0-cp313-cp313-win_amd64.whl
Algorithm	Hash digest
SHA256	`c5d7c76a65f0fb536330ccc1dda1272723afa48406dc2365af2cb9d8f93ab3cd`
MD5	`f58a8d674491efc2a703a0ca11489b21`
BLAKE2b-256	`917efd15c8195a585dede322d5827b1f3c223298437033019509676c6a8ceafe`

See more details on using hashes here.

File details

Details for the file good_common-1.0.0-cp313-cp313-manylinux2014_x86_64.manylinux_2_17_x86_64.manylinux_2_28_x86_64.whl.

File metadata

Download URL: good_common-1.0.0-cp313-cp313-manylinux2014_x86_64.manylinux_2_17_x86_64.manylinux_2_28_x86_64.whl
Upload date: Sep 11, 2025
Size: 2.3 MB
Tags: CPython 3.13, manylinux: glibc 2.17+ x86-64, manylinux: glibc 2.28+ x86-64
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for good_common-1.0.0-cp313-cp313-manylinux2014_x86_64.manylinux_2_17_x86_64.manylinux_2_28_x86_64.whl
Algorithm	Hash digest
SHA256	`475cc82708f3d518e5fec235ad56545e5e70f4b2ff876b7d65e71e1bdada18eb`
MD5	`f6c9520c945b70dba6878f318e02ccf4`
BLAKE2b-256	`da5c4036412dcd4199366d47fe4bc1b29423ac2f5899bc8b34e9ced801bb8a53`

See more details on using hashes here.

File details

Details for the file good_common-1.0.0-cp313-cp313-manylinux2014_aarch64.manylinux_2_17_aarch64.manylinux_2_28_aarch64.whl.

File metadata

Download URL: good_common-1.0.0-cp313-cp313-manylinux2014_aarch64.manylinux_2_17_aarch64.manylinux_2_28_aarch64.whl
Upload date: Sep 11, 2025
Size: 2.3 MB
Tags: CPython 3.13, manylinux: glibc 2.17+ ARM64, manylinux: glibc 2.28+ ARM64
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for good_common-1.0.0-cp313-cp313-manylinux2014_aarch64.manylinux_2_17_aarch64.manylinux_2_28_aarch64.whl
Algorithm	Hash digest
SHA256	`ad20a00a2845aeacc5848b5a60dbc42f49ece854b22c06820371a7aa11b9364f`
MD5	`89d84212c1cb5b535280e1fc684cf4d0`
BLAKE2b-256	`e1c605654df53dce7d586dfdbba23e5e5efafd3873deb56ef957379a3b2ce396`

See more details on using hashes here.

File details

Details for the file good_common-1.0.0-cp313-cp313-macosx_10_13_universal2.whl.

File metadata

Download URL: good_common-1.0.0-cp313-cp313-macosx_10_13_universal2.whl
Upload date: Sep 11, 2025
Size: 1.2 MB
Tags: CPython 3.13, macOS 10.13+ universal2 (ARM64, x86-64)
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for good_common-1.0.0-cp313-cp313-macosx_10_13_universal2.whl
Algorithm	Hash digest
SHA256	`4b3c0815fa7dce92bea6fbc6dc12e9cfc40881bf724e4326fa9005c88e408713`
MD5	`2e76050708201ee93dd9d97cffd96c4c`
BLAKE2b-256	`55fa85469231482f486c87f2b18e87dea6f1c914302be2a387ffcbd77e94b98a`

See more details on using hashes here.

good-common 1.0.0

Navigation

Verified details

Maintainers

Unverified details

Meta

Project description

good-common

Dependency Provider

Pipeline

Overview

Features

Quick Start

Usage

Creating a Pipeline

Defining Components

Executing a Pipeline

Parallel Execution

Error Handling

Function Mapping

Advanced Features

URL Plugin System

Features

Built-in Plugins

ECommerceURLPlugin

AnalyticsTrackingPlugin

VideoStreamingPlugin

SearchEnginePlugin

DocumentSharingPlugin

Using Built-in Plugins

Creating a Plugin

Registering Plugins

Method 1: Entry Points (Recommended for Packages)

Method 2: Direct Registration

Method 3: Runtime Registration

Example Plugins

Performance Considerations

Utilities

Project details

Verified details

Maintainers

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distributions

Built Distributions

File details

File metadata

File hashes

File details

File metadata

File hashes

File details

File metadata

File hashes

File details

File metadata

File hashes