Hamilton, the micro-framework for creating dataframes.

These details have not been verified by PyPI

Project links

Development Status
- 5 - Production/Stable
Intended Audience
- Developers
License
- OSI Approved :: BSD License
Natural Language
- English
Programming Language

Project description

Hamilton

The general purpose micro-framework for creating dataflows from python functions!

Specifically, Hamilton defines a novel paradigm, that allows you to specify a flow of (delayed) execution, that forms a Directed Acyclic Graph (DAG). It was original built to solve creating wide (1000+) column dataframes. Core to the design of Hamilton is a clear mapping of function name to dataflow output. That is, Hamilton forces a certain paradigm with writing functions, and aims for DAG clarity, easy modifications, with always unit testable and naturally documentable code.

For the backstory on how Hamilton came about, see our blog post!.

Getting Started

Here's a quick getting started guide to get you up and running in less than 15 minutes. If you need help join our discord community to chat/ask Qs/etc. For the latest updates, follow us on twitter!

Installation

Requirements:

Python 3.6+

To get started, first you need to install hamilton. It is published to pypi under sf-hamilton:

pip install sf-hamilton

Note: to use the DAG visualization functionality, you should instead do:

pip install sf-hamilton[visualization]

While it is installing we encourage you to start on the next section.

Note: the content (i.e. names, function bodies) of our example code snippets are for illustrative purposes only, and don't reflect what we actually do internally.

Hamilton in <15 minutes

Hamilton is a new paradigm when it comes to creating, um, dataframes (let's use dataframes as an example, otherwise you can create ANY python object). Rather than thinking about manipulating a central dataframe, as is normal in some data engineering/data science work, you instead think about the column(s) you want to create, and what inputs are required. There is no need for you to think about maintaining this dataframe, meaning you do not need to think about any "glue" code; this is all taken care of by the Hamilton framework.

For example rather than writing the following to manipulate a central dataframe object df:

df['col_c'] = df['col_a'] + df['col_b']

you write

def col_c(col_a: pd.Series, col_b: pd.Series) -> pd.Series:
    """Creating column c from summing column a and column b."""
    return col_a + col_b

In diagram form: example The Hamilton framework will then be able to build a DAG from this function definition.

So let's create a "Hello World" and start using Hamilton!

Your first hello world.

By now, you should have installed Hamilton, so let's write some code.

Create a file my_functions.py and add the following functions:

import pandas as pd

def avg_3wk_spend(spend: pd.Series) -> pd.Series:
    """Rolling 3 week average spend."""
    return spend.rolling(3).mean()

def spend_per_signup(spend: pd.Series, signups: pd.Series) -> pd.Series:
    """The cost per signup in relation to spend."""
    return spend / signups

The astute observer will notice we have not defined spend or signups as functions. That is okay, this just means these need to be provided as input when we come to actually wanting to create a dataframe.

Note: functions can take or create scalar values, in addition to any python object type.

Create a my_script.py which is where code will live to tell Hamilton what to do:

import importlib
import logging
import sys

import pandas as pd
from hamilton import driver

logger = logging.getLogger(__name__)
logging.basicConfig(stream=sys.stdout)
initial_columns = {  # load from actuals or wherever -- this is our initial data we use as input.
    # Note: these do not have to be all series, they could be scalar inputs.
    'signups': pd.Series([1, 10, 50, 100, 200, 400]),
    'spend': pd.Series([10, 10, 20, 40, 40, 50]),
}
# we need to tell hamilton where to load function definitions from
module_name = 'my_functions'
module = importlib.import_module(module_name)
dr = driver.Driver(initial_columns, module)  # can pass in multiple modules
# we need to specify what we want in the final dataframe.
output_columns = [
    'spend',
    'signups',
    'avg_3wk_spend',
    'spend_per_signup',
]
# let's create the dataframe!
# if you only did `pip install sf-hamilton` earlier:
df = dr.execute(output_columns)
# else if you did `pip install sf-hamilton[visualization]` earlier:
# dr.visualize_execution(output_columns, './my-dag.dot', {})
print(df)

Run my_script.py

python my_script.py

You should see the following output:

   spend  signups  avg_3wk_spend  spend_per_signup
0     10        1            NaN            10.000
1     10       10            NaN             1.000
2     20       50      13.333333             0.400
3     40      100      23.333333             0.400
4     40      200      33.333333             0.200
5     50      400      43.333333             0.125

You should see the following image if you ran dr.visualize_execution(output_columns, './my-dag.dot', {}):

hello_world_image

Congratulations - you just created your Hamilton dataflow that created a dataframe!

Discord Community

We have a small but active community on discord. Come join us!

License

Hamilton is released under the BSD 3-Clause Clear License. If you need to get in touch about something, contact us at algorithms-opensource (at) stitchfix.com.

Contributing

We take contributions, large and small. We operate via a Code of Conduct and expect anyone contributing to do the same.

To see how you can contribute, please read our contributing guidelines and then our developer setup guide.

Prescribed Development Workflow

In general we prescribe the following:

Ensure you understand Hamilton Basics.
Familiarize yourself with some of the Hamilton decorators. They will help keep your code DRY.
Start creating Hamilton Functions that represent your work. We suggest grouping them in modules where it makes sense.
Write a simple script so that you can easily run things end to end.
Join our discord community to chat/ask Qs/etc.

For the backstory on Hamilton we invite you to watch ~9 minute lightning talk on it that we gave at the apply conference: video, slides.

PyCharm Tips

If you're using Hamilton, it's likely that you'll need to migrate some code. Here are some useful tricks we found to speed up that process.

Live templates

Live templates are a cool feature and allow you to type in a name which expands into some code.

E.g. For example, we wrote one to make it quick to stub out Hamilton functions: typing graphfunc would turn into ->

def _(_: pd.Series) -> pd.Series:
   """""""
   return _

Where the blanks are where you can tab with the cursor and fill things in. See your pycharm preferences for setting this up.

Multiple Cursors

If you are doing a lot of repetitive work, one might consider multiple cursors. Multiple cursors allow you to do things on multiple lines at once.

To use it hit option + mouse click to create multiple cursors. Esc to revert back to a normal mode.

Contributors

Code Contributors

Stefan Krawczyk (@skrawcz)
Elijah ben Izzy (@elijahbenizzy)
Danielle Quinn (@danfisher-sf)
Rachel Insoft (@rinsoft-sf)
Shelly Jang (@shellyjang)
Vincent Chu (@vslchusf)
Christopher Prohm (@chmp)
James Lamb (@jameslamb)

Bug Hunters/Special Mentions

Nils Olsson (@nilsso)
Michał Siedlaczek (@elshize)

Project details

These details have not been verified by PyPI

Project links

Development Status
- 5 - Production/Stable
Intended Audience
- Developers
License
- OSI Approved :: BSD License
Natural Language
- English
Programming Language

Release history Release notifications | RSS feed

1.89.0

Oct 11, 2025

1.88.0

Mar 29, 2025

1.87.0

Feb 12, 2025

1.86.1

Jan 4, 2025

1.86.1rc2 pre-release

Feb 8, 2025

1.86.1rc1 pre-release

Feb 4, 2025

1.86.0rc0 pre-release

Jan 2, 2025

1.85.1

Dec 17, 2024

1.85.0

Dec 12, 2024

1.84.0

Dec 4, 2024

1.83.3

Nov 26, 2024

1.83.2

Nov 22, 2024

1.83.1

Nov 14, 2024

1.83.0

Nov 6, 2024

1.82.0

Oct 30, 2024

1.81.2

Oct 24, 2024

1.81.1

Oct 24, 2024

1.81.0

Oct 16, 2024

1.80.0

Oct 10, 2024

1.79.0

Oct 3, 2024

1.79.0rc0 pre-release

Oct 3, 2024

1.78.0

Sep 26, 2024

1.77.1

Sep 23, 2024

1.77.0

Sep 18, 2024

1.77.0rc0 pre-release

Sep 13, 2024

1.76.0

Sep 10, 2024

1.75.1

Sep 4, 2024

1.75.0

Aug 28, 2024

1.74.0

Aug 20, 2024

1.73.2

Aug 14, 2024

1.73.1

Aug 7, 2024

1.73.0

Jul 29, 2024

1.72.1

Jul 23, 2024

1.72.1.dev2 pre-release

Jul 24, 2024

1.72.1.dev1 pre-release

Jul 24, 2024

1.72.1.dev0 pre-release

Jul 24, 2024

1.72.0

Jul 22, 2024

1.71.0

Jul 18, 2024

1.71.0rc1 pre-release

Jul 11, 2024

1.71.0rc0 pre-release

Jul 11, 2024

1.70.0

Jul 9, 2024

1.70.0rc1 pre-release

Jul 9, 2024

1.70.0rc0 pre-release

Jul 3, 2024

1.69.0

Jul 2, 2024

1.69.0rc1 pre-release

Jun 30, 2024

1.69.0rc0 pre-release

Jun 28, 2024

1.68.1rc1 pre-release

Jun 28, 2024

1.68.1rc0 pre-release

Jun 28, 2024

1.68.0

Jun 27, 2024

1.68.0rc0 pre-release

Jun 28, 2024

1.67.0

Jun 20, 2024

1.66.1

Jun 15, 2024

1.66.1rc0 pre-release

Jun 14, 2024

1.66.0

Jun 13, 2024

1.66.0rc1 pre-release

Jun 13, 2024

1.66.0rc0 pre-release

Jun 12, 2024

1.65.0

Jun 5, 2024

1.65.0rc1 pre-release

Jun 10, 2024

1.65.0rc0 pre-release

Jun 9, 2024

1.64.1

Jun 1, 2024

1.64.0

May 28, 2024

1.64.0rc0 pre-release

May 23, 2024

1.63.0

May 22, 2024

1.62.0

May 14, 2024

1.61.0

May 6, 2024

1.60.0

Apr 30, 2024

1.59.1

Apr 24, 2024

1.59.0

Apr 23, 2024

1.58.0

Apr 15, 2024

1.57.0

Apr 9, 2024

1.56.0

Apr 2, 2024

1.55.1

Mar 28, 2024

1.55.0

Mar 26, 2024

1.54.1

Mar 19, 2024

1.54.1rc0 pre-release

Mar 19, 2024

1.54.0

Mar 19, 2024

1.53.0

Mar 12, 2024

1.53.0rc0 pre-release

Mar 6, 2024

1.52.0

Mar 5, 2024

1.51.1

Feb 27, 2024

1.51.1rc0 pre-release

Feb 27, 2024

1.51.0

Feb 27, 2024

1.50.1

Feb 19, 2024

1.50.0 yanked

Feb 19, 2024

Reason this release was yanked:

Superseded by `1.50.1`.

1.49.2

Feb 14, 2024

1.49.1

Feb 13, 2024

1.49.0

Feb 13, 2024

1.48.1rc1 pre-release

Feb 8, 2024

1.48.1rc0 pre-release

Feb 8, 2024

1.48.0

Feb 6, 2024

1.47.0

Jan 30, 2024

1.47.0rc0 pre-release

Jan 30, 2024

1.46.0

Jan 22, 2024

1.46.0rc0 pre-release

Jan 18, 2024

1.45.0

Jan 15, 2024

1.45.0rc1 pre-release

Jan 16, 2024

1.45.0rc0 pre-release

Jan 16, 2024

1.44.0

Jan 12, 2024

1.43.1

Jan 2, 2024

1.43.1rc0 pre-release

Jan 2, 2024

1.43.0

Jan 2, 2024

1.43.0rc0 pre-release

Jan 2, 2024

1.42.0

Dec 27, 2023

1.42.0rc0 pre-release

Dec 30, 2023

1.41.0

Dec 21, 2023

1.40.1

Dec 18, 2023

1.40.0

Dec 14, 2023

1.39.1

Dec 5, 2023

1.39.1rc0 pre-release

Nov 28, 2023

1.39.0

Nov 27, 2023

1.38.1

Nov 22, 2023

1.38.1rc0 pre-release

Nov 18, 2023

1.38.0

Nov 14, 2023

1.38.0rc2 pre-release

Nov 14, 2023

1.38.0rc1 pre-release

Nov 14, 2023

1.38.0rc0 pre-release

Nov 11, 2023

1.37.2rc0 pre-release

Nov 9, 2023

1.37.1

Nov 8, 2023

1.37.1rc0 pre-release

Nov 9, 2023

1.37.0

Nov 7, 2023

1.36.0

Oct 30, 2023

1.35.1

Oct 25, 2023

1.35.0

Oct 23, 2023

1.34.1

Oct 18, 2023

1.34.0

Oct 16, 2023

1.33.1

Oct 9, 2023

1.33.0

Oct 9, 2023

1.32.0

Oct 5, 2023

1.31.1rc0 pre-release

Oct 4, 2023

1.31.0

Oct 2, 2023

1.30.2rc0 pre-release

Oct 4, 2023

1.30.1

Sep 26, 2023

1.30.0

Sep 25, 2023

1.29.1rc0 pre-release

Sep 22, 2023

1.29.0

Sep 18, 2023

1.28.0

Sep 11, 2023

1.27.4

Sep 4, 2023

1.27.3

Aug 31, 2023

1.27.2

Aug 25, 2023

1.27.1

Aug 24, 2023

1.27.0

Aug 24, 2023

1.27.0rc6 pre-release

Aug 12, 2023

1.27.0rc5 pre-release

Aug 12, 2023

1.27.0rc4 pre-release

Aug 10, 2023

1.27.0rc3 pre-release

Aug 9, 2023

1.27.0rc2 pre-release

Aug 8, 2023

1.27.0rc1 pre-release

Aug 8, 2023

1.27.0rc0 pre-release

Aug 8, 2023

1.26.5

Aug 20, 2023

1.26.4

Aug 17, 2023

1.26.3

Aug 17, 2023

1.26.2

Aug 16, 2023

1.26.1

Aug 15, 2023

1.26.0

Aug 8, 2023

1.26.0rc0 pre-release

Aug 7, 2023

1.25.0

Aug 5, 2023

1.24.0

Jul 25, 2023

1.23.2

Jul 20, 2023

1.23.1

Jul 3, 2023

1.23.0

May 25, 2023

1.22.6

May 18, 2023

1.22.5

May 1, 2023

1.22.4

Apr 28, 2023

1.22.3

Apr 23, 2023

1.22.2

Apr 20, 2023

1.22.1

Apr 15, 2023

1.22.0

Apr 13, 2023

1.22.0rc1 pre-release

Apr 10, 2023

1.22.0rc0 pre-release

Apr 8, 2023

1.21.2

Apr 8, 2023

1.21.1

Apr 3, 2023

1.21.0

Mar 27, 2023

1.20.1

Mar 22, 2023

1.20.0

Mar 20, 2023

1.20.0rc1 pre-release

Mar 16, 2023

1.20.0rc0 pre-release

Mar 16, 2023

1.19.0

Mar 6, 2023

1.19.0rc0 pre-release

Mar 5, 2023

1.18.0

Feb 27, 2023

1.18.0rc0 pre-release

Feb 27, 2023

1.17.0

Feb 20, 2023

1.17.0rc1 pre-release

Feb 20, 2023

1.17.0rc0 pre-release

Feb 20, 2023

1.16.0

Feb 9, 2023

1.16.0rc0 pre-release

Feb 9, 2023

1.15.0

Jan 30, 2023

1.15.0rc0 pre-release

Jan 28, 2023

1.14.1

Jan 16, 2023

1.14.1rc0 pre-release

Jan 16, 2023

1.13.0

Jan 2, 2023

1.13.0rc1 pre-release

Dec 29, 2022

1.13.0rc0 pre-release

Dec 29, 2022

1.12.0

Dec 27, 2022

1.12.0rc3 pre-release

Dec 23, 2022

1.12.0rc1 pre-release

Nov 28, 2022

1.12.0rc0 pre-release

Nov 28, 2022

1.11.1

Dec 20, 2022

1.11.0

Oct 21, 2022

1.11.0rc0 pre-release

Oct 14, 2022

1.10.0

Aug 20, 2022

1.10.0rc0 pre-release

Aug 16, 2022

1.9.0

Jul 14, 2022

1.9.0rc0 pre-release

Jul 13, 2022

1.8.0

Jul 3, 2022

1.7.1

Jun 27, 2022

1.7.1rc1 pre-release

Jun 26, 2022

1.7.1rc0 pre-release

Jun 26, 2022

1.7.0

May 2, 2022

1.7.0rc1 pre-release

May 2, 2022

1.7.0rc0 pre-release

Apr 30, 2022

1.6.0

Apr 12, 2022

1.5.1

Apr 11, 2022

This version

1.5.1rc2 pre-release

Apr 11, 2022

1.5.1rc0 pre-release

Mar 25, 2022

1.5.0

Mar 25, 2022

1.5.0rc0 pre-release

Mar 23, 2022

1.4.0

Feb 10, 2022

1.3.0

Feb 7, 2022

1.3.0rc2 pre-release

Feb 7, 2022

1.3.0rc1 pre-release

Feb 6, 2022

1.2.0

Dec 14, 2021

1.1.1

Oct 14, 2021

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

sf-hamilton-1.5.1rc2.tar.gz (48.0 kB view details)

Uploaded Apr 11, 2022 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

sf_hamilton-1.5.1rc2-py3-none-any.whl (36.7 kB view details)

Uploaded Apr 11, 2022 Python 3

File details

Details for the file sf-hamilton-1.5.1rc2.tar.gz.

File metadata

Download URL: sf-hamilton-1.5.1rc2.tar.gz
Upload date: Apr 11, 2022
Size: 48.0 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/4.0.0 CPython/3.9.9

File hashes

Hashes for sf-hamilton-1.5.1rc2.tar.gz
Algorithm	Hash digest
SHA256	`b7b633a03075aadaac779923ebeb40d1e030fd7f71b0641599965af6f7bf34bf`
MD5	`8e4613a82dfecce35e02d306bea1b7e2`
BLAKE2b-256	`f671ba7c69b5884254ebaf9a257be3d37d44d47c032ff9b27fad2565aba2c1ce`

See more details on using hashes here.

File details

Details for the file sf_hamilton-1.5.1rc2-py3-none-any.whl.

File metadata

Download URL: sf_hamilton-1.5.1rc2-py3-none-any.whl
Upload date: Apr 11, 2022
Size: 36.7 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/4.0.0 CPython/3.9.9

File hashes

Hashes for sf_hamilton-1.5.1rc2-py3-none-any.whl
Algorithm	Hash digest
SHA256	`9bd9ca4cb5ab0ef6f6644385d99bb5c8d73d9c9f60553ac5fd1548fa4c156fa1`
MD5	`1305757f6fb47f0463a49d8a92265aaa`
BLAKE2b-256	`c55cfdc1cd6b3a10e049162c27282b7a1fb5be65953c06c3f322a3f18df536f9`

See more details on using hashes here.

sf-hamilton 1.5.1rc2

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Hamilton

Getting Started

Installation

Hamilton in <15 minutes

Your first hello world.

Discord Community

License

Contributing

Prescribed Development Workflow

PyCharm Tips

Live templates

Multiple Cursors

Contributors

Code Contributors

Bug Hunters/Special Mentions

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes