A package that implements Marginal Distribution Models (MDMs)

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Project description

MDM Py

This package is a Python implementation of Marginal Distribution Models (MDMs), which can be used in Discrete Choice Modelling.

Documentation

Documentation is kindly hosted by Read The Docs.

Install

This package is uploaded to PyPI. Hence,

pip install mdmpy

should work.

How to use

Simplest Case

Gradient Descent

In the simplest case, we will use the Multinomial Logit (MNL) model, which is used as a default. Assuming numpy, scipy and pandas are installed, we generate choice data assuming a random utility model:

from string import ascii_uppercase as letters
import pandas as pd
import scipy.stats as stats
import numpy as np

NUM_INDIV   = 57
NUM_CHOICES = 3
NUM_ATTR    = 4

np.random.seed(2019)
X = np.random.random((NUM_ATTR, NUM_INDIV * NUM_CHOICES))
true_beta = np.random.random(NUM_ATTR)
V = np.dot(true_beta.T, X)
V = np.reshape(V, (NUM_INDIV,NUM_CHOICES))
eps = stats.gumbel_r.rvs(size=NUM_INDIV * NUM_CHOICES)
eps = np.reshape(eps, (NUM_INDIV, NUM_CHOICES))
U = V + eps
highest_util = np.argmax(U, 1)

df = pd.DataFrame(X.T)
df['choice'] = [1 if idx == x else 0 for idx in highest_util for x in range(NUM_CHOICES)]
df['individual'] = [indiv for indiv in range(NUM_INDIV) for _ in range(NUM_CHOICES)]
df['altvar'] = [altlvl for _ in range(NUM_INDIV) for altlvl in letters[:NUM_CHOICES]]

With this package, we will assume that df is the dataframe which is simply given to us. Instead of having the code itself find out how many individuals, choices and coefficients or attributes there are, we will simply feed them into the class. To perform a gradient descent with this class, we will use the grad_desc method, using the df from above as input,

import mdmpy

# In a typical case one would load df before this line
mdm = mdmpy.MDM(df, 4, 3, [0, 1, 2, 3])
np.random.seed(4)
init_beta = np.random.random(4)
grad_beta = mdm.grad_desc(init_beta)
print(grad_beta)
# expected output [0.30238122 0.07955214 0.86779824 0.50951981]

Solver

The MDM class acts as a wrapper and adds the necessary pyomo variables and sets to model the problem, but requires a solver. IPOPT, an interior point solver, is recommended. If you have such a solver, it can be called. Assuming IPOPT is being used:

import mdmpy

ipopt_exec_path = /path/to/ipopt # Replace with proper path
mdm = mdmpy.MDM(df, 4, 3, [0, 1, 2, 3])
mdm.model_init()
mdm.model_solve("ipopt",ipopt_exec_path)
print([mdm.m.beta[idx].value for idx in mdm.m.beta])
# expected output [0.30238834989235025, 0.07953888508425154, 0.8678050334295714, 0.5095096796373667]

Todo

Add documentation and more meaningful comments
- Add more type hints, especially those involving Python builtins
Add tests.
Put pandas into extras_require of setup.py, and remove the dependency.
- Input of MDM class will become a NumPy array rather than a dataframe.
- Dataframe conversion will be turned into a utility function, likely using try-except block for imports

Project details

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Release history Release notifications | RSS feed

0.0.15.18

Jul 5, 2023

0.0.15.17

Feb 21, 2019

0.0.15.16

Feb 15, 2019

0.0.15.15

Feb 15, 2019

0.0.15.14

Feb 15, 2019

0.0.15.13

Feb 15, 2019

0.0.15.12

Feb 14, 2019

0.0.15.11

Feb 14, 2019

0.0.15.10

Feb 14, 2019

This version

0.0.15.9

Feb 14, 2019

0.0.15.8

Feb 11, 2019

0.0.15.7

Feb 11, 2019

0.0.15.6

Feb 8, 2019

0.0.15.5

Feb 8, 2019

0.0.15.4

Feb 4, 2019

0.0.15.3

Feb 4, 2019

0.0.15.2

Feb 1, 2019

0.0.15.1

Jan 24, 2019

0.0.13

Jan 22, 2019

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

mdmpy-0.0.15.9.tar.gz (9.3 kB view hashes)

Uploaded Feb 14, 2019 Source

Built Distribution

mdmpy-0.0.15.9-py3-none-any.whl (14.5 kB view hashes)

Uploaded Feb 14, 2019 Python 3

Hashes for mdmpy-0.0.15.9.tar.gz

Hashes for mdmpy-0.0.15.9.tar.gz
Algorithm	Hash digest
SHA256	`8ea2f1748a70c996433d1a37f7981c89f43d0c5c0c57ea2b494c8018e47ae86e`
MD5	`4d8734ee0dc80856d24ff74afdfe187f`
BLAKE2b-256	`949c283fa490389fe8c59e4e85911b1924cfae66f973f60f5121aef493326c1b`

Hashes for mdmpy-0.0.15.9-py3-none-any.whl

Hashes for mdmpy-0.0.15.9-py3-none-any.whl
Algorithm	Hash digest
SHA256	`ef52cc518c102a5f1d19d955e6ddcc55ccfa715ce82fa098653218bf72d4655e`
MD5	`5c494abd100019afc2694bfe4b63a956`
BLAKE2b-256	`4d24cb5b0ca47a2f24fd38aff43256c949d25331ae8cc21522f390347f4ede76`