Skip to main content

Run your own A/B testing backend on AWS Lambda

Project description

Gimel

Build Status PyPI

a Scaleable A/B testing backend in ~100 lines of code (and for free*)

What is it?

an A/B testing backend using AWS Lambda/API Gateway + Redis

Key Features:

  • Highly scalable due to the nature of AWS Lambda
  • High performance and low memory footprint using Redis HyperLogLog
  • Cost Effective
  • Easy deployment using gimel deploy. No need to twiddle with AWS.

Looking for contributors

click here for more info

What does Gimel mean?

Gimel (גִּימֵל) is the 3rd letter of the Hebrew Alphabet. The letter (ג) also looks visually similar to the greek Lambda (λ).

Installation / Quick Start

You will need a live instance of redis accessible online from AWS. Then run:

$ pip install gimel
$ gimel configure
$ gimel deploy

It will automatically configure your AWS Lambda functions, API gateway and produce a JS snippet ready to use for tracking your experiments.

Architecture

Client

I suggest looking at Alephbet to get more details, but at a high level, the client runs on the end-user browser. It will randomly pick a variant and execute a javascript function to 'activate' it. When a goal is reached -- user performs a certain action, this also include the pseudo-goal of participating in the experiment -- then an event is sent to the backend. An event typically looks something like "experiment ABC, variant red, user participated", or "experiment XYZ, variant blue, check out goal reached".

Alephbet might send duplicate events, but each event should include a uuid to allow the backend to de-duplicate it. More below

Data Store - Redis HyperLogLog

The data store keeps a tally of each event that comes into the system. Being able to count unique events (de-duplication) was important to keep an accurate count. One approach would be to store each event in an entry / database row / document, and then run some kind of a unique count on it. Or we could use a nifty algorithm called HyperLogLog. HyperLogLog allows you to count unique counts without storing each and every item.

In terms of storage space, redis HyperLogLog offers a fixed size of 12k per counter. This gives us ample space for storing experiment data with low memory footprint.

Backend - AWS Lambda / API Gateway

The backend had to take care of a few simple types of requests:

  • track an event - receive a (HTTP) request with some json data -- experiment name, variant, goal and uuid, and then push it to redis.
  • extract the counters for a specific experiment, or all experiments into some json that can be presented on the dashboard.

Dashboard

New! access your dashboard with gimel dashboard

How does tracking work?

Check out Alephbet.

Command Reference

  • gimel --help - prints a help screen.
  • gimel configure - opens your editor so you can edit the config.json file. Use it to update your redis settings.
  • gimel preflight - runs preflight checks to make sure you have access to AWS, redis etc.
  • gimel deploy - deploys the code and configs to AWS automatically.

Advanced

custom API endpoints

If you want to use different API endpoints, you can add your own extra_wiring into the config.json file (e.g. using gimel configure).

for example, this will add a .../prod/my_tracking_endpoint URL pointing to the gimel-track lambda:

{
    "redis": {
       ...
    },
    "extra_wiring": [
        {
            "lambda": {
                "FunctionName": "gimel-track",
                "Handler": "gimel.track",
                "MemorySize": 128,
                "Timeout": 3
            },
            "api_gateway": {
                "pathPart": "my_tracking_endpoint",
                "method": {
                    "httpMethod": "GET",
                    "apiKeyRequired": false,
                    "requestParameters": {
                        "method.request.querystring.namespace": false,
                        "method.request.querystring.experiment": false,
                        "method.request.querystring.variant": false,
                        "method.request.querystring.event": false,
                        "method.request.querystring.uuid": false
                    }
                }
            }
        }
    ]
}

see WIRING

Privacy, Ad-blockers (GDPR etc)

Gimel provides a backend for A/B test experiment data. This data is aggregated and does not contain any personal information at all. It merely stores the total number of actions with a certain variation against another.

As such, Gimel should meet privacy requirements of GDPR and similar privacy regulations.

Nevertheless, important disclaimers:

  • I am not a lawyer, and it's entirely up to you if and how you decide to use Gimel. Please check with your local regulations and get legal advice to decide on your own.
  • Some ad-blockers are extra vigilent, and would block requests with the track keyword in the URL. Therefore, track requests to Gimel might be blocked by default. As the library author, I make no attempts to conceal the fact that a form of tracking is necessary to run A/B tests, even if I believe it to be respecting privacy.
  • Users who decide to use Gimel can, if they wish, assign a different endpoint that might get past ad-blockers, but that's entirely up to them. see custom API endpoints on how this can be achieved.
  • As with almost any tool, it can be use for good or evil. Some A/B tests can be seen as manipulative, unfair or otherwise illegitimate. Again, use your own moral compass to decide whether or not it's ok to use A/B testing, or specific A/B tests.

License

Gimel is distributed under the MIT license. All 3rd party libraries and components are distributed under their respective license terms.

Copyright (C) 2016 Yoav Aner

Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated
documentation files (the "Software"), to deal in the Software without restriction, including without limitation the
rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Software, and to permit
persons to whom the Software is furnished to do so, subject to the following conditions:

The above copyright notice and this permission notice shall be included in all copies or substantial portions of the
Software.

THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE
WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR
COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR
OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

gimel-1.5.0.tar.gz (160.5 kB view details)

Uploaded Source

Built Distribution

gimel-1.5.0-py3-none-any.whl (181.5 kB view details)

Uploaded Python 3

File details

Details for the file gimel-1.5.0.tar.gz.

File metadata

  • Download URL: gimel-1.5.0.tar.gz
  • Upload date:
  • Size: 160.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.0.1 requests-toolbelt/0.9.1 tqdm/4.32.2 CPython/3.7.0

File hashes

Hashes for gimel-1.5.0.tar.gz
Algorithm Hash digest
SHA256 995a0ac675c08f66d9aed56139ba518cc46abfad8da67c051005a9448ac8fabd
MD5 230c37b60201a43f5798e52a7dfd8420
BLAKE2b-256 f0c56538ea4243c4d74f35eb46354467a15b53434f326595c8155bc1db17a22d

See more details on using hashes here.

File details

Details for the file gimel-1.5.0-py3-none-any.whl.

File metadata

  • Download URL: gimel-1.5.0-py3-none-any.whl
  • Upload date:
  • Size: 181.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.0.1 requests-toolbelt/0.9.1 tqdm/4.32.2 CPython/3.7.0

File hashes

Hashes for gimel-1.5.0-py3-none-any.whl
Algorithm Hash digest
SHA256 2a6be8820ca70e24373e7218db05f3957a2e6316bf39c36709e15aa9d8a8d035
MD5 caba9f3bf7d0e98c28b1878771ef5ffc
BLAKE2b-256 7819a82fcc0c6e94db471f51e6059f7b0fb34b4fdcc672d5c780463b26534ec0

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page