Python wrapper for ThreadPoolExecutor to easily multithread resource bound tasks

Moethread

Overview
Library Installation
Library Usage

Overview

Moethread is a Python wrapper around ThreadPoolExecutor (from the standard-library concurrent.futures module) that makes it easy to multithread resource-bound tasks. The library offers a decorator-style way of parallelizing function calls. Note that this only helps with resource-bound, i.e. I/O-bound, operations (API calls, network requests, disk reads/writes, etc.). If your task is CPU-intensive, this library may not offer much benefit and you are better off exploring other options such as multiprocessing.
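
To illustrate why threads help with I/O-bound work, here is a minimal sketch that uses ThreadPoolExecutor directly rather than Moethread. The `fake_io_task` function and its 0.2 s sleep are hypothetical stand-ins for a real network or disk call.

```python
import time
from concurrent.futures import ThreadPoolExecutor

def fake_io_task(item):
    # Hypothetical stand-in for an I/O-bound call; sleeping releases the GIL
    time.sleep(0.2)
    return item * 2

items = list(range(8))

start = time.perf_counter()
with ThreadPoolExecutor(max_workers=8) as pool:
    # pool.map returns results in the same order as the inputs
    results = list(pool.map(fake_io_task, items))
elapsed = time.perf_counter() - start

print(results)
print(f"8 tasks of 0.2 s each finished in about {elapsed:.2f} s")
```

Run sequentially, the same eight calls would take roughly 1.6 s; with eight threads the waits overlap. A CPU-heavy function would not see this kind of gain from threads.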

Library Installation

To install the library, run the following command in a terminal (cmd, shell, etc.):

# Windows
pip install moethread

# Linux
pip3 install moethread

Library Usage

To start, you need to import the library

from moethread import parallel_call

If you need to read results back from the parallelized function, define the relevant variables/objects globally so that you can access them outside of that function. The function to parallelize accepts positional arguments and keyword arguments. Positional arguments are primitives/constants/variables that you'd like to pass through to your function. If you'd like to have counters inside the parallelized function, define them globally as shown in the following code snippet.

# Module-level variable; the `global counter` statement is only needed inside the function that modifies it
counter = 0
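
One caveat worth sketching: `counter += 1` is a read-modify-write operation and is not guaranteed to be atomic across threads, so a shared counter is safer behind a lock. The example below uses ThreadPoolExecutor directly; `counter_lock` and `count_valid` are hypothetical names, not part of Moethread.

```python
import threading
from concurrent.futures import ThreadPoolExecutor

counter = 0
counter_lock = threading.Lock()  # hypothetical helper guarding the shared counter

def count_valid(label):
    global counter
    if label == 1:
        # Only one thread at a time may update the counter
        with counter_lock:
            counter += 1

labels = [0, 1] * 500  # 500 items carry label 1
with ThreadPoolExecutor(max_workers=8) as pool:
    list(pool.map(count_valid, labels))  # consume the iterator so all calls finish

print(counter)
```

The same `with counter_lock:` pattern can be used inside a function decorated with @parallel_call, assuming the lock is defined globally next to the counter.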

As for the data to be parallelized, it must be passed through the keyword arguments. The keyword data is reserved for the input data, which is a dictionary of lists holding whatever needs to run in parallel.

For example, if you have a dataset of labeled images and you would like to read those images in parallel, create a dictionary of image paths and their corresponding labels. You have to make sure that the two lists are aligned.

image_paths  = ["image_0.jpg", "image_1.jpg", ...] 	# some dummy paths
image_labels = [0, 1, ...] 		                # some dummy labels
assert len(image_paths) == len(image_labels)

# It's your responsibility to ensure that elements align, e.g. image_labels[0] is the label for image_paths[0]
data = {"image_path": image_paths, "image_label": image_labels}
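
The alignment requirement can be checked up front. The following is a sketch of a hypothetical helper, `split_items`, that validates the list lengths and shows how a dictionary of aligned lists corresponds to one small dictionary per item. It is an illustration only and is not taken from Moethread's source.

```python
def split_items(data):
    """Turn a dict of aligned lists into a list of per-item dicts (hypothetical helper)."""
    keys = list(data)
    lengths = {len(data[key]) for key in keys}
    if len(lengths) > 1:
        raise ValueError(f"lists are not aligned, found lengths {sorted(lengths)}")
    # zip(*lists) walks all lists in lockstep, one element from each per step
    return [dict(zip(keys, values)) for values in zip(*data.values())]

example = {"image_path": ["image_0.jpg", "image_1.jpg"], "image_label": [0, 1]}
for item in split_items(example):
    print(item)
```

Each printed dictionary has the same keys as the input, with a single path and a single label as values.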

The next step is to write the body of your function. Add the decorator @parallel_call on top of the function and use *args and **kwargs as its parameters. Inside the function, read the data dictionary, which for each call contains the path to one image and its corresponding label.

@parallel_call # decorator
def function_to_parallelize(*args, **kwargs):
	# Define globals...
	global counter
	# Read data in...
	image_path  = kwargs.get('data').get('image_path')
	image_label = kwargs.get('data').get('image_label')
	# Read image
	image = cv2.imread(image_path)
	if image_label == 1:
		counter += 1 # assume images with label == 1 are valid images
	## Do whatever you like to do below...
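
To make the calling convention concrete, here is a rough sketch of how a decorator of this kind could be built on ThreadPoolExecutor. `parallel_call_sketch` is a hypothetical stand-in written for illustration. It is not Moethread's implementation, and whether the real decorator returns values is not stated in this description.

```python
import functools
from concurrent.futures import ThreadPoolExecutor

def parallel_call_sketch(func):
    """Hypothetical stand-in for a decorator like parallel_call."""
    @functools.wraps(func)
    def wrapper(*args, data, threads=4, **kwargs):
        keys = list(data)
        # One small dict per item, e.g. {"image_path": ..., "image_label": ...}
        items = [dict(zip(keys, values)) for values in zip(*data.values())]
        with ThreadPoolExecutor(max_workers=threads) as pool:
            futures = [pool.submit(func, *args, data=item, **kwargs) for item in items]
            # Collect results in submission order (an assumption of this sketch)
            return [future.result() for future in futures]
    return wrapper

@parallel_call_sketch
def describe(*args, **kwargs):
    item = kwargs.get('data')
    return f"{item.get('image_path')} -> {item.get('image_label')}"

out = describe(data={"image_path": ["image_0.jpg", "image_1.jpg"], "image_label": [0, 1]}, threads=2)
print(out)
```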

Lastly, call the function and specify the number of threads. If you set threads = -1, the library will figure out a suitable number of threads for the task.

function_to_parallelize(data=data, threads=-1) # automatically assigns the needed number of threads...
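
How Moethread picks the thread count for threads = -1 is not described here. As a hedged illustration only, the sketch below shows one plausible rule: `resolve_threads` is a hypothetical function that borrows the min(32, cpu_count + 4) default documented for ThreadPoolExecutor (Python 3.8 and later) and caps it at the number of items.

```python
import os

def resolve_threads(threads, n_items):
    """Hypothetical rule for choosing a worker count; not Moethread's actual logic."""
    if threads is None or threads < 1:
        # Fall back to the formula ThreadPoolExecutor documents for its default
        threads = min(32, (os.cpu_count() or 1) + 4)
    # Never use more threads than items, and always use at least one
    return max(1, min(threads, n_items))

print(resolve_threads(-1, 3))    # capped by the number of items
print(resolve_threads(4, 100))   # an explicit value is kept
```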

Putting it all together.

import cv2  # OpenCV (opencv-python), needed for cv2.imread below
from moethread import parallel_call

image_paths  = ["image_0.jpg", "image_1.jpg", ...] 	# some paths
image_labels = [0, 1, ...] 		                # some dummy labels
assert len(image_paths) == len(image_labels)

# It's your responsibility to ensure that elements align, e.g. image_labels[0] is the label for image_paths[0]
data = {"image_path": image_paths, "image_label": image_labels}
counter = 0 # module-level counter, updated inside the function via `global counter`

@parallel_call # decorator
def function_to_parallelize(*args, **kwargs):
	# Define globals...
	global counter
	# Read data in...
	image_path  = kwargs.get('data').get('image_path')
	image_label = kwargs.get('data').get('image_label')
	# Read image
	image = cv2.imread(image_path)
	if image_label == 1:
		counter += 1 # assume images with label == 1 are valid images
	## Do whatever you like to do below...

function_to_parallelize(data=data, threads=-1) # Automatically assigns the needed number of threads...

Another example: pull-request processing

This example shows how to read GitHub pull requests, parse the body of their reviews, and collect a list of the GitHub users who produced failed pull requests.

import json
from moethread import parallel_call

# NOTE: `self`, `url`, and `repo` are defined by the surrounding code (not shown)
global invalid_pulls
github_users  = []
invalid_pulls = 0
github_token  = "ghx_test124" # placeholder token
etag   = None
params = {'state': 'open'}
pulls  = list(self._iter(int(-1), url, repo.pulls.ShortPullRequest, params, etag))
@parallel_call
def process_pulls(*args, **kwargs):
    global invalid_pulls
    pull = kwargs.get('data').get('pulls')
    response = self._get(f'{url}/{pull.number}/reviews', auth=('', github_token))
    if response.ok:
        reviews = json.loads(response.text)
        for review in reviews:
            body = review.get('body', '').lower()
            err = "failure"
            if err in body:
                res = self._get(pull.user.url, auth=('', github_token))
                if res.ok:
                    github_user = json.loads(res.text)
                    github_users.append(github_user.get('login', ''))
                invalid_pulls += 1
                break
    elif response.status_code != 404:
        pass
process_pulls(data={"pulls": pulls}, threads=-1)

Author: Hamdan, Muhammad (@mhamdan91 - ©)

Release history

Version	Release date
1.4.2	Feb 11, 2024
1.4.1	Feb 8, 2024
1.4.0	Dec 11, 2023
1.3.4	Aug 29, 2023
1.3.3	Aug 29, 2023
1.2.3	Aug 2, 2023
1.1.3	Apr 30, 2023
1.1.1	Dec 16, 2022
1.1.0	Nov 27, 2022 (this version)
1.0.9	Nov 26, 2022
1.0.8	Nov 26, 2022
1.0.7	Nov 26, 2022
1.0.6	Nov 6, 2022
1.0.5	Nov 2, 2022
1.0.2	Nov 2, 2022

Download files

Source Distribution

moethread-1.1.0.tar.gz (4.5 kB view details)

Uploaded Nov 27, 2022 Source

Built Distribution

moethread-1.1.0-py3-none-any.whl (5.3 kB view details)

Uploaded Nov 27, 2022 Python 3

File details

Details for the file moethread-1.1.0.tar.gz.

File metadata

Download URL: moethread-1.1.0.tar.gz
Upload date: Nov 27, 2022
Size: 4.5 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/4.0.1 CPython/3.7.5

File hashes

Hashes for moethread-1.1.0.tar.gz
Algorithm	Hash digest
SHA256	`1d8de346f71e7766840961ad352e061e3a9655803cff7816e2b71eab753ad77a`
MD5	`b72a29cf727bb94013179639cbe58277`
BLAKE2b-256	`e4b78672dc60037814d6eb50df6d434f926b5fd04d7d3025ba671afb39d07b33`
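
The SHA256 digest listed above can be checked locally after downloading the archive. The following is a minimal sketch using the standard-library hashlib module; the helper names `sha256_of_file` and `matches_published_hash` are hypothetical.

```python
import hashlib

# SHA256 digest published above for moethread-1.1.0.tar.gz
EXPECTED_SHA256 = "1d8de346f71e7766840961ad352e061e3a9655803cff7816e2b71eab753ad77a"

def sha256_of_file(path, chunk_size=65536):
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        # iter() with a sentinel keeps reading until read() returns b""
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()

def matches_published_hash(path, expected=EXPECTED_SHA256):
    """Compare a file's digest with the published one (case-insensitive)."""
    return sha256_of_file(path) == expected.lower()
```

For example, `matches_published_hash("moethread-1.1.0.tar.gz")` should return True for an intact download, assuming the file is saved under that name in the current directory.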

File details

Details for the file moethread-1.1.0-py3-none-any.whl.

File metadata

Download URL: moethread-1.1.0-py3-none-any.whl
Upload date: Nov 27, 2022
Size: 5.3 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/4.0.1 CPython/3.7.5

File hashes

Hashes for moethread-1.1.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`ccdb073239e5779d2af0b7a2e984a6394616d26dcdbf95a9b42238637370a8d1`
MD5	`d7be2de0506ea7ca1e5ae79374e1052d`
BLAKE2b-256	`6e293c051bf408fc3858c22b5d2c542e65097c3d10bf8f710634d12a150b8c08`
