A streaming library for reading raw event data from Mixpanel's export API

# mixpanel-export-stream
A small Python library for exporting and reading raw event data from Mixpanel while taking advantage of the streamable JSONLine data format.

This library is based on Mixpanel's own data export library, found [here](https://mixpanel.com/site_media/api/v2/mixpanel.py).

## Installation
To install the library, run:

`pip install mixpanel_export`

## What do you mean by a "stream"?

Mixpanel's raw event data API returns data in JSONLine format, meaning each event is a JSON object on its own line.
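To make the format concrete, here is a minimal sketch of parsing JSONLine text one event at a time. The helper name `iter_events` and the sample payload are made up for illustration and are not part of this library.

```python
import json


def iter_events(lines):
    """Yield one decoded event per non-empty line (hypothetical helper)."""
    for line in lines:
        line = line.strip()
        if line:
            yield json.loads(line)


# Made-up sample: two events, one JSON object per line.
sample = (
    '{"event": "A", "properties": {"C": "x"}}\n'
    '{"event": "A", "properties": {"C": "y"}}\n'
)

events = list(iter_events(sample.splitlines()))
print(len(events))
```

Because every line is a complete JSON document, a consumer never needs to hold more than one event in memory at a time.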

Mixpanel warns you **not** to read this data until it has downloaded in its entirety, because it is compressed with `gzip`. However, gzip can be decompressed incrementally as the bytes arrive, so the response can be processed while it is still downloading.
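The following is a rough sketch of how incremental gzip decompression can work using only the standard library. It is not this library's actual implementation; the function name `stream_gzip_json_lines` and the in-memory sample payload are assumptions made for illustration, with small byte chunks standing in for data arriving over the network.

```python
import gzip
import json
import zlib


def stream_gzip_json_lines(chunks):
    """Decode JSONLine events from an iterable of gzip-compressed byte chunks.

    Hypothetical helper: decompresses incrementally and yields each event
    as soon as its line is complete.
    """
    # 16 + MAX_WBITS tells zlib to expect a gzip header and trailer.
    decompressor = zlib.decompressobj(16 + zlib.MAX_WBITS)
    pending = b""
    for chunk in chunks:
        pending += decompressor.decompress(chunk)
        # Everything before the last newline is complete; keep the rest.
        *complete, pending = pending.split(b"\n")
        for line in complete:
            if line.strip():
                yield json.loads(line)
    # Handle any data left over once the input is exhausted.
    pending += decompressor.flush()
    for line in pending.split(b"\n"):
        if line.strip():
            yield json.loads(line)


# Made-up payload, compressed and then split into 8-byte chunks.
payload = gzip.compress(b'{"event": "A"}\n{"event": "B"}\n')
chunks = [payload[i:i + 8] for i in range(0, len(payload), 8)]

names = [e["event"] for e in stream_gzip_json_lines(chunks)]
print(names)
```

Only the current partial line is buffered, so memory use stays roughly constant regardless of how large the export is.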

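Streaming significantly reduces RAM usage compared to Mixpanel's own Python API implementation when you only need to run reductions (such as counts or sums) over events, because each event is processed as it arrives rather than after the whole response has been loaded.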

## Example

Let's assume we want to get all events 'A' with a property 'B' that is equal to "2". Events 'A' also have a property 'C', which is some arbitrary string value. We want the results grouped and tallied by 'C' to see how many events occurred for each value of 'C'.

This is simple and fast to do with this library.

```python
from collections import Counter
from mixpanel_export import EventStream

api_key = '...'
api_secret = '...'

es = EventStream(api_key, api_secret)

params = {
    'event': ["A"],
    'from_date': '...',
    'to_date': '...',
    'where': 'properties["B"] == "2"'
}

count = Counter()


def do_count(e):
    # tally each event by the value of its 'C' property
    count[e['properties']['C']] += 1

# allow the request a maximum of 5 minutes (300 seconds) before expiring
total = es.request(params, do_count, expire=300)
print("Total events processed %s, tally: %s" % (total, count))
```

For more details on what you can use for the `params` argument, refer to Mixpanel's documentation [here](https://mixpanel.com/docs/api-documentation/data-export-api).
