Model Configuration#

This notebook provides details about the model configuration in PyMC-Marketing. This API provides a very flexible way to express the model Priors and likelihoods so that we can have enough expressiveness to build custom models.

Setup#

import arviz as az
import matplotlib.pyplot as plt
import numpy as np
import pandas as pd
import pymc as pm
from pymc_extras.prior import Prior

from pymc_marketing.model_config import ModelConfigError, parse_model_config

az.style.use("arviz-darkgrid")
plt.rcParams["figure.figsize"] = [12, 7]
plt.rcParams["figure.dpi"] = 100

%load_ext autoreload
%autoreload 2
%config InlineBackend.figure_format = "retina"

seed = sum(map(ord, "Create priors to reflect your marketing beliefs"))
rng = np.random.default_rng(seed)

Prior Distributions#

The Prior class is our way to expression distributions and relationships between them.

Basic Usage#

Every prior will need a distribution name that comes from PyMC. View the full list in the documentation here.

scalar_distribution = Prior("Normal")
scalar_distribution

Prior("Normal")

If not, then an exception will be raised.

try:
    Prior("UnknownDistribution")
except Exception as e:
    print(e)

PyMC doesn't have a distribution of name 'UnknownDistribution'

Specific parameters can be passed as keyword arguments but are not required if the PyMC distribution has defaults.

scalar_distribution = Prior("Normal", mu=0, sigma=3.5)
scalar_distribution

Prior("Normal", mu=0, sigma=3.5)

There will be a check at initialization against the PyMC distribution.

try:
    Prior("Normal", mu=0, b=1)
except Exception as e:
    print(e)

Parameters {'b', 'mu'} are not a subset of the pymc distribution parameters {'mu', 'sigma', 'tau'}

However, there are some limitations to that. Take this invalid parameterization below:

invalid_distribution = Prior("Normal", mu=1, sigma=1, tau=2)

The create_variable method is used to make variables from this distribution and it isn’t until then that there is an error.

with pm.Model():
    try:
        invalid_distribution.create_variable("mu")
    except Exception as e:
        print(e)

Can't pass both tau and sigma

This goes for incorrect values as well!

invalid_distribution = Prior("Normal", mu=1, sigma=-1)
invalid_distribution

Prior("Normal", mu=1, sigma=-1)

The dims parameter are used to express the dimensions of the distribution. This variable is not part of a model so the values are not reflected with the distribution itself.

vector_distribution = Prior("Normal", dims="channel")
vector_distribution

Prior("Normal", dims="channel")

If there are dimensions, then the coords need to exist in the larger model. But they exist separately and can defined in advance.

Tip

Each Prior instance is just instructions to create a variable, so they can be used multiple times!

coords = {"channel": ["C1", "C2"]}
with pm.Model(coords=coords) as model:
    alpha = vector_distribution.create_variable("alpha")
    beta = vector_distribution.create_variable("beta")

pm.model_to_graphviz(model)

../../_images/a9f1ff3294e31737db96eb5b15ffe06e9746c3be6329948f79b59415e0fd0e02.svg

If the coords were not specified then this would cause a PyMC error.

with pm.Model() as model:
    try:
        vector_distribution.create_variable("var")
    except Exception as e:
        print(e)

"Dimensions {'channel'} are unknown to the model and cannot be used to specify a `shape`."

The variables can get arbitrarily large as well by providing additional dims in a tuple.

matrix_distribution = Prior("Normal", dims=("channel", "geo"))
matrix_distribution

Prior("Normal", dims=("channel", "geo"))

tensor_distribution = Prior("Normal", dims=("channel", "geo", "store"))
tensor_distribution

Prior("Normal", dims=("channel", "geo", "store"))

Hierarchical Variables#

Hierarchical variables can be defined by using distributions as the parameters of another distribution. The parent distributions will usually have a larger dimensionality than each of the parameters.

hierarchical_variable = Prior(
    "Normal",
    mu=Prior("Normal"),
    sigma=Prior("HalfNormal"),
    dims="channel",
)

We can use the to_graph method to create visualize the variable with dummy coordinates.

Note

No need to worry about variable naming! The child parameters will all be automatically named based on the parent.

hierarchical_variable.to_graph()

../../_images/355652fb869a4dea6546506d24db8c65d5ff2c32ca5367d6f712fca7064d6170.svg

Warning

The validity of the parameter values will not be checked.

There can be negative sigma parameter in a Normal distribution whether that comes from value or another distribution!

Prior("Normal", mu=1, sigma=-1)

Prior("Normal", mu=1, sigma=-1)

Prior("Normal", sigma=Prior("Normal"))

Prior("Normal", sigma=Prior("Normal"))

Tip

Model reparamterizations can help with model convergence depending on the model posterior!

For Normal distribution, the common non-centered parameterization is supported with the centered flag.

non_centered_hierarchical_variable = Prior(
    "Normal",
    mu=Prior("Normal"),
    sigma=Prior("HalfNormal"),
    dims="channel",
    # Flag for non-centered
    centered=False,
)
non_centered_hierarchical_variable.to_graph()

../../_images/2c5b8bd8239aeeae169815d27850310555378d9529a3b60fd695bbf12a591532.svg

Other distributions can be hierarchical as well. Just use distributions for parameters of a parent distribution. For instance, the Beta distribution has two positive parameters, alpha and beta which can be reflect as HalfNormal distributions.

zero_to_one_variable = Prior(
    "Beta",
    alpha=Prior("HalfNormal"),
    beta=Prior("HalfNormal"),
    dims="channel",
)
zero_to_one_variable.to_graph()

../../_images/7de2d1391aa0d588617fbf76f77997680ed661cde27eaf64c2d0492c5a0cd257.svg

Transformations#

The transform variable can be used for any of the distributions to change its domain.

These transformations will be taken from pytensor.tensor or pm.math in that order.

Below is a non-centered hierarchical Normal distribution that is put through a sigmoid function in order to change the domain to (0, 1).

hierarchical_zero_to_one_distribution = Prior(
    "Normal",
    mu=Prior("Normal"),
    sigma=Prior("HalfNormal"),
    dims="channel",
    centered=False,
    transform="sigmoid",
)
hierarchical_zero_to_one_distribution.to_graph()

../../_images/4e799890a83f10c20c9d10aa22b9318232af72ba6a8d3d4fadf8b88d1275ed8b.svg

Automatic Broadcasting#

The broadcasting of the dims will be handled automatically.

For instance, the mu of the variable needs to be transposed in order to work with ("channel", "geo") dims.

With all this functionality, we can see that the prior distributions can become quite expressive!

def create_2d_variable(mu_dims, sigma_dims) -> Prior:
    mu = Prior(
        "Normal",
        mu=Prior("Normal"),
        sigma=Prior("HalfNormal"),
        dims=mu_dims,
    )
    sigma = Prior(
        "Normal",
        mu=Prior("Normal"),
        sigma=Prior("HalfNormal"),
        centered=False,
        dims=sigma_dims,
        transform="exp",
    )
    return Prior(
        "Normal",
        mu=mu,
        sigma=sigma,
        dims=("channel", "geo"),
    )


variable_2d = create_2d_variable(mu_dims="channel", sigma_dims="geo")
variable_2d.to_graph()

../../_images/9c4e5a4c996312fd4c490be68e1b47895fb594e3ed50e47051588684716c7093.svg

And the user can spend more time thinking about just the assumptions of the variables rather than logic to ensure the dimensions work.

different_assumptions_2d = create_2d_variable(mu_dims="channel", sigma_dims="channel")

different_assumptions_2d.to_graph()

../../_images/53e952c9da3045536ebe5fab4d76fd56643b8ed122af4da08aa124270b8679ce.svg

Serialization#

The to_dict and from_dict methods can be helpful for storage of the distributions.

variable_2d_dict = variable_2d.to_dict()
variable_2d_dict

{'dist': 'Normal',
 'kwargs': {'mu': {'dist': 'Normal',
   'kwargs': {'mu': {'dist': 'Normal'}, 'sigma': {'dist': 'HalfNormal'}},
   'dims': ('channel',)},
  'sigma': {'dist': 'Normal',
   'kwargs': {'mu': {'dist': 'Normal'}, 'sigma': {'dist': 'HalfNormal'}},
   'centered': False,
   'dims': ('geo',),
   'transform': 'exp'}},
 'dims': ('channel', 'geo')}

Prior.from_dict(variable_2d_dict).to_graph()

Use in PyMC-Marketing#

Distributions will be expressed in this manner throughout the package including but not limited to:

MMM components
- media transformations

Backwards compatibility#

The from_dict method will create a Prior instance from the previous format.

For instance, take this previous configuration:

old_model_config = {
    "alpha": {
        "dist": "Normal",
        "kwargs": {
            "mu": 0,
            "sigma": 1,
        },
    },
    "beta": {
        "dist": "Laplace",
        "kwargs": {
            "mu": 1,
            "b": 0.5,
        },
    },
}

This can be parsed with the Prior.from_dict constructor for each key. Much more consise too!

new_model_config = {
    name: Prior.from_dict(key) for name, key in old_model_config.items()
}

new_model_config

{'alpha': Prior("Normal", mu=0, sigma=1),
 'beta': Prior("Laplace", mu=1, b=0.5)}

new_model_config["alpha"].to_dict()

{'dist': 'Normal', 'kwargs': {'mu': 0, 'sigma': 1}}

The parse_model_config function will do just this and is used internally. It also provides some deprecation warnings.

parse_model_config(old_model_config)

{'alpha': Prior("Normal", mu=0, sigma=1),
 'beta': Prior("Laplace", mu=1, b=0.5)}

As well as the ability to catch some errors while parsing.

invalid_model_config = {
    "alpha": {
        "dist": "InvalidDistribution",
    },
    "beta": {
        "dist": "Normal",
        "kwargs": {"mu": "one", "sigma": "two"},
    },
    "gamma": {
        "dist": "HalfNormal",
        "kwargs": {"mu": 1},
    },
}

try:
    parse_model_config(invalid_model_config)
except ModelConfigError as e:
    print(e)

3 errors occurred while parsing model configuration. Errors: Parameter alpha: PyMC doesn't have a distribution of name 'InvalidDistribution', Parameter beta: Parameters must be one of the following types: (int, float, np.array, Prior, pt.TensorVariable). Incorrect parameters: {'mu': <class 'str'>, 'sigma': <class 'str'>}, Parameter gamma: Parameters {'mu'} are not a subset of the pymc distribution parameters {'sigma', 'tau'}

Appendix: Validation#

Behind the scenes, the Prior class uses th validate_call from Pydantic to ensure that the parameters are valid.`

try:
    Prior()
except Exception as e:
    print(e)

1 validation error for __init__
distribution
  Missing required argument [type=missing_argument, input_value=ArgsKwargs(<unprintable tuple object>), input_type=ArgsKwargs]
    For further information visit https://errors.pydantic.dev/2.9/v/missing_argument

try:
    Prior(1)
except Exception as e:
    print(e)

1 validation error for __init__
1
  Input should be a valid string [type=string_type, input_value=1, input_type=int]
    For further information visit https://errors.pydantic.dev/2.9/v/string_type

%load_ext watermark
%watermark -n -u -v -iv -w -p pymc_marketing,pytensor

Last updated: Mon Sep 09 2024

Python implementation: CPython
Python version       : 3.11.5
IPython version      : 8.16.1

pymc_marketing: 0.8.0
pytensor      : 2.22.1

pandas    : 2.1.2
matplotlib: 3.8.4
arviz     : 0.20.0.dev0
numpy     : 1.24.4
pymc      : 5.15.1

Watermark: 2.4.3