
A trigger-based framework for creating and executing ML pipelines.


Motion


Motion is a system for defining and incrementally maintaining reactive prompts in Python.

Why Reactive Prompts?

LLM accuracy often significantly improves with more context. Consider an e-commerce focused LLM pipeline that recommends products to users. The recommendations might improve if the prompt considers the user's past purchases and browsing history. Ideally, any new information about the user (e.g., a new purchase or browsing event) should be incorporated into the LLM pipeline's prompts as soon as possible; thus, we call them reactive prompts.

Why is it Hard to Use Reactive Prompts?

Consider the e-commerce example above. The prompt might grow to be very long---so long that it contains redundant or even useless information. So, we might want to condense the user's past purchases and browsing history into a short summary and include that in the prompt instead. However, regenerating the summary every time we log a new purchase or browsing event, or whenever the user requests a new recommendation, can take too long and thus prohibitively increase the end-to-end latency of getting a recommendation.

In general, we may want to use LLMs or run some other expensive operation when incrementally processing new information---for example, summarizing it, extracting structured data from it, or generating new data. When there is a lot of information to process, even the best LLMs can take upwards of 30 seconds, which can be unacceptable for production latency.

What is Motion?

As LLM pipeline developers, we want a few things when building and using reactive prompts:

  • Flexibility: We want to define our own prompt sub-parts (e.g., summaries), along with our own logic for assembling sub-parts into string prompts and for reactively updating sub-parts.
  • Availability: We want there to always be some version of prompt sub-parts available, even if they are a little stale. This way we can minimize end-to-end latency.
  • Freshness: Prompts should incorporate as much of the latest information as possible. In the case where information arrives faster than we can process it, it may be desirable to ignore older information.

Motion allows LLM pipeline developers to define and incrementally maintain reactive prompts in Python. With Motion, we define components that represent prompt sub-parts, and flows that represent how to assemble sub-parts into a prompt for an LLM in real-time and how to reactively update sub-parts in the background based on new information.

Motion's execution engine serves cached prompt sub-parts for minimal real-time latency and handles concurrency and sub-part consistency when running flows that update sub-parts. All prompt sub-parts are backed by a key-value store. You can run Motion components anywhere and in any number of Python processes (e.g., in a notebook, in a serverless function, in a web server) at the same time for maximal availability.
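
For example, since all sub-parts live in that key-value store (Redis, per the testing setup below), each process just needs to point Motion at the same store. This is only a sketch: the environment variable names below are assumptions for illustration, so consult the docs for the exact configuration.

import os

# Hypothetical configuration sketch: point Motion at the Redis instance that
# backs the key-value store. These variable names are assumptions, not
# confirmed API.
os.environ["MOTION_REDIS_HOST"] = "localhost"
os.environ["MOTION_REDIS_PORT"] = "6379"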

An Example Motion Component

It's hard to understand Motion without an example. In Motion, you define components: stateful objects that can be updated incrementally with new data. A component has an init_state method that initializes its state, plus any number of flows, where each flow consists of a serve operation (which can only read state), an update operation (which can read and write state), or both. These operations are arbitrary user-defined Python functions.

Here's an example of a component that recommends books to buy, personalized to each user:

from motion import Component, DiscardPolicy

# Assumes `llm` is an async helper (not shown) that calls an LLM and returns
# its text response.

BookRecommender = Component("BookRecommender")

@BookRecommender.init_state
def setup(user_demographics, liked_books):
    return {
        "user_demographics": user_demographics,
        "liked_books": liked_books,
        "recommended_books": [],
        "genres": [],
    }

@BookRecommender.serve("rec")
async def get_rec(state, props):
    genre_str = ", ".join(state["genres"]) if state["genres"] else ""
    rec = await llm(f"I liked {state['liked_books']} and {genre_str} genres. What book would you recommend me to read in the {props['specified_genre']} genre?")
    return rec

@BookRecommender.update("rec", discard_policy=DiscardPolicy.SECONDS, discard_after=86400) # If the update wasn't processed within 24 hours (due to backpressure), discard it
async def update_genres(state, props):
    recommended_books = state["recommended_books"] + [props.serve_result]
    all_books_positive_signal = state["liked_books"] + recommended_books
    new_genres = await llm(f"Update my list of preferred genres {state['genres']} based on my book collection: {all_books_positive_signal}")
    return {
        "recommended_books": recommended_books,
        "genres": new_genres"
    }

@BookRecommender.update("liked_book")
async def update_liked_books(state, props):
    all_books_positive_signal = state["liked_books"] + [props["liked_book"]]
    new_genres = await llm(f"Update my list of preferred genres {state['genres']} based on my book collection: {all_books_positive_signal}")
    return {"liked_books": props["liked_books"], "genres": new_genres}

In the above example, the serve operation recommends a book to the user based on a specified genre, and the update operation updates the context to be used in future recommendations (i.e., "rec" serve operations). serve operations execute first and cannot modify state, while update operations can modify state and execute after serve operations in the background.

You can run a flow by calling run or arun (the async version of run) on a component instance:

# Initialize component instance
book_recommender = BookRecommender(
    "some_user_id",
    init_state_params={
        "user_demographics": "some_user_demographics",
        "liked_books": ["book1", "book2"],
    },
)

# Run the "rec" flow. Will return the result of the "rec" serve
# operation, and queue the "rec" update operation to run in the background.
rec = await book_recommender.arun("rec", props={"specified_genre": "fantasy"})

# Log a new liked book. There is no serve operation for the "liked_book" flow,
# so nothing is returned. The "liked_book" update operation is queued to run in
# the background.
await book_recommender.arun("liked_book", props={"liked_book": "Harry Potter and the Deathly Hallows"})

After rec is returned, the update operation will run in the background and update the state of the component (for as long as the Python process is running). The state of the component instance is always committed to the key-value store after a flow is fully run, and is loaded from the key-value store when the component instance is initialized again.
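
For example, here is a sketch of that persistence behavior; the read_state accessor is an assumption for illustration, so check the docs for the exact API.

# Later, possibly in a different Python process: constructing an instance with
# the same id loads its committed state from the key-value store, so
# init_state_params are only needed the first time the instance is created.
book_recommender = BookRecommender("some_user_id")

# Hypothetical accessor for inspecting a single state key.
print(book_recommender.read_state("genres"))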

Multiple clients can run flows on the same component instance, and all of them see the component's evolving state. Serve operations run in parallel, while update operations run sequentially in the order they were queued. Motion maintains consistency by locking the component's state while an update operation is running; serve operations can still read the old state in the meantime, so they are never blocked.
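
As a concrete sketch, a single client can issue several serve flows concurrently with asyncio.gather; each serve call reads the latest committed state, and the queued update operations drain sequentially behind them:

import asyncio

async def recommend_two_genres():
    book_recommender = BookRecommender(
        "some_user_id",
        init_state_params={
            "user_demographics": "some_user_demographics",
            "liked_books": ["book1", "book2"],
        },
    )
    # The two serve operations run in parallel and are not blocked by updates;
    # the two queued "rec" update operations then run one at a time under the
    # state lock.
    return await asyncio.gather(
        book_recommender.arun("rec", props={"specified_genre": "fantasy"}),
        book_recommender.arun("rec", props={"specified_genre": "mystery"}),
    )

fantasy_rec, mystery_rec = asyncio.run(recommend_two_genres())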

Should I use Motion?

Motion is especially useful for LLM pipelines that:

  • Need to update prompts based on new data (e.g., maintain a dynamic summary in the prompt)
  • Want a Pythonic interface to build a distributed system of LLM application components

Motion is built for developers who know how to code in Python and want to be able to control operations in their ML applications. For low-code and domain-specific development patterns (e.g., enhancing videos), you may want to check out other tools.

Where did Motion come from?

Motion is developed and maintained by researchers at the UC Berkeley EPIC Lab who specialize in data management for ML pipelines.

Getting Started

Check out the docs for more information.

Motion is currently in alpha. We are actively working on improving the documentation and adding more features. If you are interested in using Motion and would like dedicated support from one of our team members, please reach out to us at shreyashankar@berkeley.edu.

Testing and Development

You can run make install to install Motion in editable mode. We use poetry to manage dependencies.

To run tests, we use pytest and a local Redis cache. You should run Redis on port 6381 before you run make tests. To run Redis with Docker, either run the docker-compose.yml file in this repo (i.e., docker-compose up) or run the following command in your terminal:

docker run -p 6381:6379 --name motion-backend-testing redis/redis-stack-server:latest

Then when you run make tests, your tests should pass.
