Custom Models with MMM components#

The underlying components used in the MMM class provide flexibility to build other, custom models. With a little knowledge of PyMC and how to customize these PyMC-Marketing components, a lot of different use-cases can be covered.

This notebook is not an introduction but rather an advance example for those trying to understand the PyMC-Marketing internals for flexibility for custom use-cases.

Overview#

This notebook will cover the currently exposed model components from the PyMC-Marketing API. At the moment, this includes:

media transformations
- adstock: how today’s media has an effect in the future
- saturation: the diminishing returns for media
recurring seasonality

For each of these, the flexibility and customization will be showcased and combined together in a toy model with with PyMC directly.

Setup#

from functools import partial

import arviz as az
import matplotlib.pyplot as plt
import numpy as np
import pandas as pd
import pymc as pm
import pymc.dims as pmd
import xarray as xr
from pymc_extras.prior import Prior
from xarray import DataArray

from pymc_marketing import mmm
from pymc_marketing.mmm.events import (
    AsymmetricGaussianBasis,
    EventEffect,
    GaussianBasis,
    HalfGaussianBasis,
)
from pymc_marketing.plot import plot_curve

az.style.use("arviz-darkgrid")
plt.rcParams["figure.figsize"] = [12, 7]
plt.rcParams["figure.dpi"] = 100

%config InlineBackend.figure_format = "retina"

/home/ricardo/Documents/pymc/pymc/dims/__init__.py:66: UserWarning: The `pymc.dims` module is experimental and may contain critical bugs (p=0.087).
Please report any issues you encounter at https://github.com/pymc-devs/pymc/issues.
API changes are expected in future releases.

  __init__()

seed = sum(map(ord, "PyMC-Marketing provides flexible model components"))
rng = np.random.default_rng(seed)

draw = partial(pm.draw, random_seed=rng)

Media Transformations#

There are classes for each of the adstock and saturation transformations. They can be imported from the pymc_marketing.mmm module.

saturation = mmm.MichaelisMentenSaturation()

Saturation curves can take many different forms. In this example, we will use the Michaelis Menten curve which we provide in the MichaelisMentenSaturation class.

This curve has two parameters, alpha and lam.

A characteristic of these curves are diminishing returns in order to indicate saturation of a media variable. This can be seen in the plateauing as x increases.

../../_images/58a3a519e88884f034b941c94bf11bde915cf49fab69b1e67f2df32d6779699f.png

Adding Parameter Dimensions#

In most cases, a separate saturation function will be estimated for each media channel. A dimension needs to be added to the prior of the function parameters to account for this.

Let’s create some example data to work toward this transformation.

For this example, we will have 2 years of media spend for 4 channels

n_dates = 52 * 2
dates = pd.date_range("2023-01-01", periods=n_dates, freq="W-MON")

channels = ["C1", "C2", "C3", "C4"]

coords = {
    "date": dates,
    "channel": channels,
}

df_spends = random_spends(coords=coords).to_pandas()
df_spends.head()

channel	C1	C2	C3	C4
date
2023-01-02	0.000000	2.830228	0.0	0.357625
2023-01-09	0.000000	0.594478	0.0	0.977781
2023-01-16	0.285379	0.000000	0.0	0.197317
2023-01-23	0.000000	0.000000	0.0	0.000000
2023-01-30	0.000000	0.000000	0.0	0.000000

../../_images/7fbcd7d5555bf05beb716ffb6dfa95bace2e10520a962328adc457711790fa5b.png

As mentioned, the default priors do not have a channel dimension. In order to use with the in our model with “channel” dim, we have to add the dims to each of the function priors.

for dist in saturation.function_priors.values():
    dist.dims = "channel"

saturation.function_priors

{'alpha': Prior("Gamma", mu=2, sigma=1, dims="channel"),
 'lam': Prior("HalfNormal", sigma=1, dims="channel")}

The previous workflow can be used to understand our priors still. Just pass the coords to the sample_prior method in order to add dims to the appropriate variables.

Since each channel prior is the same, there will just be some noise between the HDI and curve samples.

curve = saturation.sample_curve(prior)
saturation.plot_curve(curve);

Sampling: []

../../_images/881a65f2c4c8a728ea102381dacb714b1392fe7989ab8522a9a4e071a06e4480.png

Using in PyMC Model#

When using the transformation in a larger PyMC model, the apply method will be used.

This method will:

create distributions based on prior specification of the instance
apply the transformation to the data

The dims parameter is the shape of parameters and not the data. The data has a different shape but will need to be broadcastable with the parameters!

with pm.Model(coords=coords) as model:
    saturated_spends = saturation.apply(
        DataArray(
            df_spends.values,
            dims=(
                "date",
                "channel",
            ),
        )
    )

Since independent alpha and lam were specified, we see that in the model graph below:

pm.model_to_graphviz(model)

../../_images/8479eccf949d657aa47cf459baa507fb297cbabbade44b0418bb8b3a8ea4e481.svg

Note

Neither the df_spends nor saturated_spends show in the model. If needed, use pmd.Data and pmd.Deterministic to save off.

Our variable will be (date, channel) dims.

saturated_spends.type.shape

../../_images/f2c0f87ae203ec76f747530f591f7b6d24fa721b7dafd362025ab40d6b56ce81.png

We can manipulate this in anyway we’d like to connect it in with the larger model.

Changing Assumptions#

As hinted above, the priors for the function parameters are customizable which can lead to many different models. Change the priors, change the model.

The prior distributions just need to follow the distribution API here.

Instead of the defaults, we can use:

hierarchical parameter for lam parameter
common alpha parameter

hierarchical_lam = Prior(
    "HalfNormal",
    sigma=Prior("HalfNormal", sigma=1),
    dims="channel",
)
common_alpha = Prior("Gamma", mu=2, sigma=1)
priors = {
    "lam": hierarchical_lam,
    "alpha": common_alpha,
}

saturation = mmm.MichaelisMentenSaturation(priors=priors)

saturation.function_priors

{'alpha': Prior("Gamma", mu=2, sigma=1),
 'lam': Prior("HalfNormal", sigma=Prior("HalfNormal", sigma=1), dims="channel")}

Then this can be used in a new PyMC model which leads to a much different model graph than before!

with pm.Model(coords=coords) as model:
    saturated_spends = saturation.apply(
        DataArray(
            df_spends.values,
            dims=(
                "date",
                "channel",
            ),
        )
    )

pm.model_to_graphviz(model)

../../_images/89d0c604788f362f10fb955a39824729d9b5347a7afd243c2cc68342a1dd50e9.svg

The shape of the output will still be (date, channel) even though some of the parameter’s dims has changed.

saturated_spends.type.shape

../../_images/60b690bd07710eeebacb708492adfa0a7d69a26cb1046bba1424277f5b96ad6a.png

The previous workflow still helps us understand the produced curves:

sample_prior
sample_curve
plot_curve

prior = saturation.sample_prior(coords=coords, random_seed=rng)

Sampling: [saturation_alpha, saturation_lam, saturation_lam_sigma]

Though they all look the same in the prior, the data generation process is indeed different as seen in the model graph.

curve = saturation.sample_curve(prior)
saturation.plot_curve(curve);

Sampling: []

../../_images/ee5c49ed8086a933a503e84dc626dd28eeb10896ccef38324c880d4782d73d62.png

Geo Hierarchical Model#

The dimensions of the parameters are not limited to 1D so additional hierarchies can be defined.

Below defines:

alpha which is hierarchical across channels
lam which is common across all geos but different channels

# For reference
mmm.MichaelisMentenSaturation.default_priors

{'alpha': Prior("Gamma", mu=2, sigma=1), 'lam': Prior("HalfNormal", sigma=1)}

hierarchical_alpha = Prior(
    "Gamma",
    mu=Prior("HalfNormal", sigma=1, dims="geo"),
    sigma=Prior("HalfNormal", sigma=1, dims="geo"),
    dims=("channel", "geo"),
)
common_lam = Prior("HalfNormal", sigma=1, dims="channel")
priors = {
    "alpha": hierarchical_alpha,
    "lam": common_lam,
}
saturation = mmm.MichaelisMentenSaturation(priors=priors)

Our new data set needs to have information for geo now. This is channel spends by date and geo. This is stored in an xarray.DataArray which can be converted to a 3D numpy.ndarray.

Displaying the data is easy with pandas.

geo_coords = {
    **coords,
    "geo": ["Region1", "Region2", "Region3"],
}

geo_spends = random_spends(coords=geo_coords)

geo_spends.to_series().unstack("channel").head(6)

	channel	C1	C2	C3	C4
date	geo
2023-01-02	Region1	0.0	0.469285	1.800792	0.851362
	Region2	0.0	0.506669	0.000000	0.795277
	Region3	0.0	1.504671	1.104576	0.000000
2023-01-09	Region1	0.0	1.996656	1.024629	0.000000
	Region2	0.0	0.000000	0.000000	0.574270
	Region3	0.0	0.013698	1.131477	0.365130

As long as the dims argument of apply can broadcast with the data going in, then the media transformations can be used!

Here, the data is in the shape (date, channel, geo) so it can broadcast with the parameters in shape (channel, geo) to create the saturated spends.

with pm.Model(coords=geo_coords) as geo_model:
    geo_data = pmd.Data(
        "geo_data",
        geo_spends.to_numpy(),
        dims=("date", "channel", "geo"),
    )
    saturated_geo_spends = pmd.Deterministic(
        "saturated_geo_spends",
        saturation.apply(geo_data),
    )

The saturation assumptions can be seen in the model graph:

pm.model_to_graphviz(geo_model)

../../_images/97e35c2f1f6d17bca55d3a07a0b1d3f98ef6921069fd37024348c3dc3b982832.svg

Tip

The PyMC model context will stay the same but changing model assumptions will happen with input data and prior configuration!

Seasonality#

Recurring seasonality can be modeled with either a MonthlyFourier or YearlyFourier instance.

yearly = mmm.YearlyFourier(n_order=2)

There is a similar workflow to understand these priors as before:

sample_prior: Sample all the priors
sample_curve: Sample the curve across the whole period
plot_curve: Plot the HDI and few samples

prior = yearly.sample_prior()
curve = yearly.sample_curve(prior)
yearly.plot_curve(curve);

Sampling: [fourier_beta]
Sampling: []

../../_images/56d0f549b7280ebe01c0d0af90e6a43100507fdca8669d60559e28ba1b65f5e0.png

This also supports arbitrary hierarchies that can be defined with the Prior class. Pass these in with the prior parameters.

Note

A dimension associated with the prefix will be required! By default it is fourier

prior = Prior(
    "Normal",
    mu=DataArray([0, 0, -1, 0], dims="fourier"),
    sigma=Prior("Gamma", mu=0.15, sigma=0.1, dims="fourier"),
    dims=("geo", "fourier"),
)
yearly = mmm.YearlyFourier(n_order=2, prior=prior)

The above workflow works here as well! The coords just need to be passed like in pm.Model.

coords = {
    "geo": ["A", "B"],
}
prior = yearly.sample_prior(coords=coords, xdist=True)
curve = yearly.sample_curve(prior)

Sampling: [fourier_beta, fourier_beta_sigma]
Sampling: []

Based on the hierarchical priors, we can see similar seasonality betweens geos. However, they are not exactly the same!

subplot_kwargs = {"ncols": 1}
sample_kwargs = {"n": 3}
fig, _ = yearly.plot_curve(
    curve, subplot_kwargs=subplot_kwargs, sample_kwargs=sample_kwargs
)
fig.suptitle("Prior seasonality");

../../_images/f90d954c9beb1a45731de34ee9ef09162c9a75f8887d0730eb6125b2b70e8b8b.png

Events#

You can add latent events using Gaussian Basis, this will model events as Gaussian bumps as described by Juan Orduz.

# Example event window
df_events = pd.DataFrame(
    {
        "event": ["Product Launch"],
        "start_date": pd.to_datetime(["2025-03-10"]),
        "end_date": pd.to_datetime(["2025-03-12"]),
    }
)

# Build basis matrix of day offsets relative to event window


def difference_in_days(model_dates, event_dates):
    if hasattr(model_dates, "to_numpy"):
        model_dates = model_dates.to_numpy()
    if hasattr(event_dates, "to_numpy"):
        event_dates = event_dates.to_numpy()
    return (model_dates[:, None] - event_dates) / np.timedelta64(1, "D")


def create_basis_matrix(df_events: pd.DataFrame, model_dates: np.ndarray):
    start_dates = df_events["start_date"]
    end_dates = df_events["end_date"]

    s_ref = difference_in_days(model_dates, start_dates)
    e_ref = difference_in_days(model_dates, end_dates)

    return np.where(
        (s_ref >= 0) & (e_ref <= 0),
        0,
        np.where(np.abs(s_ref) < np.abs(e_ref), s_ref, e_ref),
    )


# Model dates and basis matrix
n_days = 60
dates = pd.date_range("2025-02-15", periods=n_days, freq="D")
X = X = DataArray(create_basis_matrix(df_events, dates), dims=("date", "event"))

# Create three different event effects
half_after = HalfGaussianBasis(
    priors={
        "sigma": Prior("Gamma", mu=7, sigma=1, dims="event"),
    },
    mode="after",
    include_event=False,
)

half_before = HalfGaussianBasis(
    priors={
        "sigma": Prior("Gamma", mu=7, sigma=1, dims="event"),
    },
    mode="before",
    include_event=False,
)

gaussian = GaussianBasis(
    priors={
        "sigma": Prior("Gamma", mu=7, sigma=1, dims="event"),
    },
)

effect_size = Prior("Normal", mu=1, sigma=1, dims="event")

# Create effects for each basis
effect_after = EventEffect(basis=half_after, effect_size=effect_size, dims=("event",))
effect_before = EventEffect(basis=half_before, effect_size=effect_size, dims=("event",))
effect_gaussian = EventEffect(basis=gaussian, effect_size=effect_size, dims=("event",))

coords = {"date": dates, "event": df_events["event"].to_numpy()}


# Sample prior curves for all three effects
curves = {}
for name, effect in [
    ("HalfGaussian (After)", effect_after),
    ("HalfGaussian (Before)", effect_before),
    ("Gaussian", effect_gaussian),
]:
    with pm.Model(coords=coords):
        event_curve = pmd.Deterministic(
            "effect", effect.apply(X), dims=("date", "event")
        )
        idata = pm.sample_prior_predictive()
    curves[name] = idata.prior["effect"]

# Plot all three effects with HDIs and samples using plot_curve
fig, axes = plt.subplots(1, 3, figsize=(15, 4), sharey=True)

for i, (name, curve) in enumerate(curves.items()):
    _fig, _axes = plot_curve(
        curve,
        "date",
        n_samples=10,
        axes=np.array([axes[i]]),
    )
    axes[i].set_title(name)
    axes[i].set_xlabel("Date")
    axes[i].grid(True, alpha=0.3)

axes[0].set_ylabel("Effect")
fig.suptitle("Event Effect Basis Comparison (Prior with HDIs)")
fig.tight_layout()

/home/ricardo/Documents/pymc-marketing/pymc_marketing/mmm/events.py:330: UserWarning: RandomVariables {gamma_rv{"(),()->()"}.out} were found in the derived graph. These variables are a clone and do not match the original ones on identity.
If you are deriving a quantity that depends on model RVs, use `model.replace_rvs_by_values` first. For example: `logp(model.replace_rvs_by_values([rv])[0], value)`
  out = ptx.math.exp(logp(rv, x))
/home/ricardo/Documents/pymc/pymc/pytensorf.py:801: UserWarning: RNG Variable RNG(<Generator(PCG64) at 0x7FA4ECEF6EA0>) has multiple distinct clients [(gamma((core_dims=(((), ()), ()), extra_dims=('event',)))(RNG(<Generator(PCG64) at 0x7FA4ECEF6EA0>), TensorFromXTensor.0, XElemwise{scalar_op=Mul()}.0, XElemwise{scalar_op=Reciprocal()}.0), 0), (gamma_rv{"(),()->()"}(RNG(<Generator(PCG64) at 0x7FA4ECEF6EA0>), MakeVector{dtype='int64'}.0, [49.], [0.14285714]), 0)], likely due to an inconsistent random graph. No default update will be returned.
  warnings.warn(
Sampling: [basis_sigma, event_effect_size]
/home/ricardo/Documents/pymc-marketing/pymc_marketing/mmm/events.py:330: UserWarning: RandomVariables {gamma_rv{"(),()->()"}.out} were found in the derived graph. These variables are a clone and do not match the original ones on identity.
If you are deriving a quantity that depends on model RVs, use `model.replace_rvs_by_values` first. For example: `logp(model.replace_rvs_by_values([rv])[0], value)`
  out = ptx.math.exp(logp(rv, x))
/home/ricardo/Documents/pymc/pymc/pytensorf.py:801: UserWarning: RNG Variable RNG(<Generator(PCG64) at 0x7FA4EF0C8740>) has multiple distinct clients [(gamma((core_dims=(((), ()), ()), extra_dims=('event',)))(RNG(<Generator(PCG64) at 0x7FA4EF0C8740>), TensorFromXTensor.0, XElemwise{scalar_op=Mul()}.0, XElemwise{scalar_op=Reciprocal()}.0), 0), (gamma_rv{"(),()->()"}(RNG(<Generator(PCG64) at 0x7FA4EF0C8740>), MakeVector{dtype='int64'}.0, [49.], [0.14285714]), 0)], likely due to an inconsistent random graph. No default update will be returned.
  warnings.warn(
Sampling: [basis_sigma, event_effect_size]
/home/ricardo/Documents/pymc-marketing/pymc_marketing/mmm/events.py:268: UserWarning: RandomVariables {gamma_rv{"(),()->()"}.out} were found in the derived graph. These variables are a clone and do not match the original ones on identity.
If you are deriving a quantity that depends on model RVs, use `model.replace_rvs_by_values` first. For example: `logp(model.replace_rvs_by_values([rv])[0], value)`
  out = pmd.math.exp(logp(rv, x))
/home/ricardo/Documents/pymc/pymc/pytensorf.py:801: UserWarning: RNG Variable RNG(<Generator(PCG64) at 0x7FA4E372ECE0>) has multiple distinct clients [(gamma((core_dims=(((), ()), ()), extra_dims=('event',)))(RNG(<Generator(PCG64) at 0x7FA4E372ECE0>), TensorFromXTensor.0, XElemwise{scalar_op=Mul()}.0, XElemwise{scalar_op=Reciprocal()}.0), 0), (gamma_rv{"(),()->()"}(RNG(<Generator(PCG64) at 0x7FA4E372ECE0>), MakeVector{dtype='int64'}.0, [49.], [0.14285714]), 0)], likely due to an inconsistent random graph. No default update will be returned.
  warnings.warn(
Sampling: [basis_sigma, event_effect_size]
/tmp/ipykernel_2955/1096890975.py:103: UserWarning: The figure layout has changed to tight
  fig.tight_layout()

../../_images/4c5258e60117f413fd5acc46f56d046c7f6597269fe84bdda22c84163882f448.png

df_events = pd.DataFrame(
    {
        "event": ["e1"],
        "start_date": pd.to_datetime(["2023-01-10"]),
        "end_date": pd.to_datetime(["2023-01-11"]),
    }
)

dates = pd.date_range("2023-01-01", periods=25, freq="D")


def create_basis_matrix(df_events: pd.DataFrame, model_dates: np.ndarray):
    start_dates = df_events["start_date"]
    end_dates = df_events["end_date"]
    s_ref = difference_in_days(model_dates, start_dates)
    e_ref = difference_in_days(model_dates, end_dates)
    return np.where(
        (s_ref >= 0) & (e_ref <= 0),
        0,
        np.where(np.abs(s_ref) < np.abs(e_ref), s_ref, e_ref),
    )


X = DataArray(create_basis_matrix(df_events, dates), dims=("date", "event"))

asymmetric_after = AsymmetricGaussianBasis(
    priors={
        "sigma_before": Prior("Gamma", mu=3, sigma=1, dims="event"),
        "sigma_after": Prior("Gamma", mu=7, sigma=2, dims="event"),
        "a_after": Prior("Normal", mu=3, sigma=0.5, dims="event"),
    },
    event_in="after",
)

asymmetric_before = AsymmetricGaussianBasis(
    priors={
        "sigma_before": Prior("Gamma", mu=8, sigma=2, dims="event"),
        "sigma_after": Prior("Gamma", mu=1, sigma=5, dims="event"),
        "a_after": Prior("Normal", mu=1, sigma=0.5, dims="event"),
    },
    event_in="before",
)

asymmetric_exclude = AsymmetricGaussianBasis(
    priors={
        "sigma_before": Prior("Gamma", mu=2, sigma=2, dims="event"),
        "sigma_after": Prior("Gamma", mu=3, sigma=1, dims="event"),
        "a_after": Prior("Normal", mu=-1, sigma=0.5, dims="event"),
    },
    event_in="exclude",
)

effect_size = Prior("Normal", mu=1, sigma=1, dims="event")

effect_after = EventEffect(
    basis=asymmetric_after, effect_size=effect_size, dims=("event",)
)
effect_before = EventEffect(
    basis=asymmetric_before, effect_size=effect_size, dims=("event",)
)
effect_exclude = EventEffect(
    basis=asymmetric_exclude, effect_size=effect_size, dims=("event",)
)

coords = {"date": dates, "event": df_events["event"].to_numpy()}

# Sample prior curves for all three effects
curves = {}
for name, effect in [
    ("AsymmetricGaussian (After)", asymmetric_after),
    ("AsymmetricGaussian (Before)", asymmetric_before),
    ("AsymmetricGaussian (Exclude)", asymmetric_exclude),
]:
    with pm.Model(coords=coords):
        event_curve = pmd.Deterministic("effect", effect.apply(X))
        idata = pm.sample_prior_predictive()
    curves[name] = idata.prior["effect"]

# Plot all three effects with HDIs and samples using plot_curve
fig, axes = plt.subplots(1, 3, figsize=(20, 5), sharey=True)

for i, (name, curve) in enumerate(curves.items()):
    _fig, _axes = plot_curve(
        curve,
        "date",
        n_samples=10,
        axes=np.array([axes[i]]),
    )
    axes[i].set_title(name)
    axes[i].set_xlabel("Date")
    axes[i].grid(True, alpha=0.3)

# Rotate and align nicely
fig.autofmt_xdate(rotation=30)
fig.suptitle("Event Effect Basis Comparison (Prior with HDIs)")
fig.tight_layout()

/home/ricardo/Documents/pymc-marketing/pymc_marketing/mmm/events.py:432: UserWarning: RandomVariables {gamma_rv{"(),()->()"}.out} were found in the derived graph. These variables are a clone and do not match the original ones on identity.
If you are deriving a quantity that depends on model RVs, use `model.replace_rvs_by_values` first. For example: `logp(model.replace_rvs_by_values([rv])[0], value)`
  ptx.math.exp(logp(rv_before, x)),
/home/ricardo/Documents/pymc-marketing/pymc_marketing/mmm/events.py:437: UserWarning: RandomVariables {gamma_rv{"(),()->()"}.out} were found in the derived graph. These variables are a clone and do not match the original ones on identity.
If you are deriving a quantity that depends on model RVs, use `model.replace_rvs_by_values` first. For example: `logp(model.replace_rvs_by_values([rv])[0], value)`
  ptx.math.exp(logp(rv_after, x)) * a_after,
/home/ricardo/Documents/pymc/pymc/pytensorf.py:801: UserWarning: RNG Variable RNG(<Generator(PCG64) at 0x7FA4EC174F20>) has multiple distinct clients [(gamma((core_dims=(((), ()), ()), extra_dims=('event',)))(RNG(<Generator(PCG64) at 0x7FA4EC174F20>), TensorFromXTensor.0, XElemwise{scalar_op=Mul()}.0, XElemwise{scalar_op=Reciprocal()}.0), 0), (gamma_rv{"(),()->()"}(RNG(<Generator(PCG64) at 0x7FA4EC174F20>), MakeVector{dtype='int64'}.0, [12.25], [0.57142857]), 0)], likely due to an inconsistent random graph. No default update will be returned.
  warnings.warn(
/home/ricardo/Documents/pymc/pymc/pytensorf.py:801: UserWarning: RNG Variable RNG(<Generator(PCG64) at 0x7FA4EC1749E0>) has multiple distinct clients [(gamma((core_dims=(((), ()), ()), extra_dims=('event',)))(RNG(<Generator(PCG64) at 0x7FA4EC1749E0>), TensorFromXTensor.0, XElemwise{scalar_op=Mul()}.0, XElemwise{scalar_op=Reciprocal()}.0), 0), (gamma_rv{"(),()->()"}(RNG(<Generator(PCG64) at 0x7FA4EC1749E0>), MakeVector{dtype='int64'}.0, [9.], [0.33333333]), 0)], likely due to an inconsistent random graph. No default update will be returned.
  warnings.warn(
Sampling: [basis_a_after, basis_sigma_after, basis_sigma_before]
/home/ricardo/Documents/pymc-marketing/pymc_marketing/mmm/events.py:432: UserWarning: RandomVariables {gamma_rv{"(),()->()"}.out} were found in the derived graph. These variables are a clone and do not match the original ones on identity.
If you are deriving a quantity that depends on model RVs, use `model.replace_rvs_by_values` first. For example: `logp(model.replace_rvs_by_values([rv])[0], value)`
  ptx.math.exp(logp(rv_before, x)),
/home/ricardo/Documents/pymc-marketing/pymc_marketing/mmm/events.py:437: UserWarning: RandomVariables {gamma_rv{"(),()->()"}.out} were found in the derived graph. These variables are a clone and do not match the original ones on identity.
If you are deriving a quantity that depends on model RVs, use `model.replace_rvs_by_values` first. For example: `logp(model.replace_rvs_by_values([rv])[0], value)`
  ptx.math.exp(logp(rv_after, x)) * a_after,
/home/ricardo/Documents/pymc/pymc/pytensorf.py:801: UserWarning: RNG Variable RNG(<Generator(PCG64) at 0x7FA4EC44DD20>) has multiple distinct clients [(gamma((core_dims=(((), ()), ()), extra_dims=('event',)))(RNG(<Generator(PCG64) at 0x7FA4EC44DD20>), TensorFromXTensor.0, XElemwise{scalar_op=Mul()}.0, XElemwise{scalar_op=Reciprocal()}.0), 0), (gamma_rv{"(),()->()"}(RNG(<Generator(PCG64) at 0x7FA4EC44DD20>), MakeVector{dtype='int64'}.0, [0.04], [25.]), 0)], likely due to an inconsistent random graph. No default update will be returned.
  warnings.warn(
/home/ricardo/Documents/pymc/pymc/pytensorf.py:801: UserWarning: RNG Variable RNG(<Generator(PCG64) at 0x7FA5162019A0>) has multiple distinct clients [(gamma((core_dims=(((), ()), ()), extra_dims=('event',)))(RNG(<Generator(PCG64) at 0x7FA5162019A0>), TensorFromXTensor.0, XElemwise{scalar_op=Mul()}.0, XElemwise{scalar_op=Reciprocal()}.0), 0), (gamma_rv{"(),()->()"}(RNG(<Generator(PCG64) at 0x7FA5162019A0>), MakeVector{dtype='int64'}.0, [16.], [0.5]), 0)], likely due to an inconsistent random graph. No default update will be returned.
  warnings.warn(
Sampling: [basis_a_after, basis_sigma_after, basis_sigma_before]
/home/ricardo/Documents/pymc-marketing/pymc_marketing/mmm/events.py:432: UserWarning: RandomVariables {gamma_rv{"(),()->()"}.out} were found in the derived graph. These variables are a clone and do not match the original ones on identity.
If you are deriving a quantity that depends on model RVs, use `model.replace_rvs_by_values` first. For example: `logp(model.replace_rvs_by_values([rv])[0], value)`
  ptx.math.exp(logp(rv_before, x)),
/home/ricardo/Documents/pymc-marketing/pymc_marketing/mmm/events.py:437: UserWarning: RandomVariables {gamma_rv{"(),()->()"}.out} were found in the derived graph. These variables are a clone and do not match the original ones on identity.
If you are deriving a quantity that depends on model RVs, use `model.replace_rvs_by_values` first. For example: `logp(model.replace_rvs_by_values([rv])[0], value)`
  ptx.math.exp(logp(rv_after, x)) * a_after,
/home/ricardo/Documents/pymc/pymc/pytensorf.py:801: UserWarning: RNG Variable RNG(<Generator(PCG64) at 0x7FA4EC44E5E0>) has multiple distinct clients [(gamma((core_dims=(((), ()), ()), extra_dims=('event',)))(RNG(<Generator(PCG64) at 0x7FA4EC44E5E0>), TensorFromXTensor.0, XElemwise{scalar_op=Mul()}.0, XElemwise{scalar_op=Reciprocal()}.0), 0), (gamma_rv{"(),()->()"}(RNG(<Generator(PCG64) at 0x7FA4EC44E5E0>), MakeVector{dtype='int64'}.0, [9.], [0.33333333]), 0)], likely due to an inconsistent random graph. No default update will be returned.
  warnings.warn(
/home/ricardo/Documents/pymc/pymc/pytensorf.py:801: UserWarning: RNG Variable RNG(<Generator(PCG64) at 0x7FA4EC3720A0>) has multiple distinct clients [(gamma((core_dims=(((), ()), ()), extra_dims=('event',)))(RNG(<Generator(PCG64) at 0x7FA4EC3720A0>), TensorFromXTensor.0, XElemwise{scalar_op=Mul()}.0, XElemwise{scalar_op=Reciprocal()}.0), 0), (gamma_rv{"(),()->()"}(RNG(<Generator(PCG64) at 0x7FA4EC3720A0>), MakeVector{dtype='int64'}.0, [1.], [2.]), 0)], likely due to an inconsistent random graph. No default update will be returned.
  warnings.warn(
Sampling: [basis_a_after, basis_sigma_after, basis_sigma_before]
/tmp/ipykernel_2955/1519355296.py:98: UserWarning: The figure layout has changed to tight
  fig.tight_layout()

../../_images/2b9242aedf35a18c1e74188b54d831f18efcbbe8dcd640aa941a0f20cf51837a.png

Summary#

Custom models are possible using the components that build up the MMM class and PyMC distributions themselves. With some prior distribution configuration and the components that PyMC-Marketing provides, novel models can be built up to fit various use-cases and various model assumptions.

Much of the flexibility will come from the prior distribution configuration rather then the transformation themselves. This is meant to keep a standard interface while working with them regardless what their role is.

If there is any suggestions or feedback on how to make better custom models with the package, create a GitHub Issue or chime into the various discussions.

Though models can be built up like this, the prebuilt structures provide many benefits as well. For instance, the MMM class provides:

scaling of input and output data
plotting methods for parameters, predictive data, contributions, etc
customized adstock and saturation transformations
out of sample predictions
lift test integration
budget optimization

Our recommendation is to start with the prebuilt models and work up from there.

%load_ext watermark
%watermark -n -u -v -iv -w -p pymc_marketing,pytensor

Last updated: Thu, 26 Feb 2026

Python implementation: CPython
Python version       : 3.13.11
IPython version      : 9.9.0

pymc_marketing: 0.18.2
pytensor      : 2.38.0+3.g9b81d36bb

arviz         : 0.23.1
matplotlib    : 3.10.8
numpy         : 2.3.5
pandas        : 2.3.3
pymc          : 5.28.0
pymc_extras   : 0.9.2.dev0+gb4ee3c133.d20260226
pymc_marketing: 0.18.2
xarray        : 2025.12.0

Watermark: 2.6.0