
Conversation

Contributor

@lucianopaz lucianopaz commented Dec 14, 2022

Closes #22

  • Convert the old adstock transformations to use the vectorized convolution
  • Write tests for the vectorized convolution with multiple dimensions and broadcasting patterns
  • Maybe add an assert to prevent segfaults due to passing an incorrect axis?
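The vectorized convolution the first bullet refers to can be sketched in NumPy. This is a hypothetical illustration, assuming a geometric adstock with time as the last axis (the PR's actual implementation is a symbolic PyTensor graph, and the function name here is made up):

```python
import numpy as np

def geometric_adstock(x, alpha, l_max=12):
    """Vectorized geometric adstock: out[..., t] = sum_l alpha**l * x[..., t - l].

    x     : array with time as the last axis, e.g. shape (4, 150, T)
    alpha : decay rate(s) broadcastable against x's leading axes, e.g. (4, 150) or (4, 1)
    """
    x = np.asarray(x)
    alpha = np.asarray(alpha)[..., None]           # add a lag axis
    w = alpha ** np.arange(l_max)                  # decay weights, shape (..., l_max)
    # Zero-pad the time axis so every lagged slice has length T
    x_pad = np.pad(x, [(0, 0)] * (x.ndim - 1) + [(l_max - 1, 0)])
    # lags[..., t, l] == x[..., t - l] (zero for t < l)
    lags = np.stack(
        [x_pad[..., l_max - 1 - l : x_pad.shape[-1] - l] for l in range(l_max)],
        axis=-1,
    )
    # Weighted sum over the lag axis; all leading axes broadcast as in NumPy
    return (lags * w[..., None, :]).sum(axis=-1)
```

A single impulse decays geometrically: `geometric_adstock([[1., 0., 0., 0.]], [0.5], l_max=4)` gives `[[1., 0.5, 0.25, 0.125]]`, and a `(4, 1)` alpha is shared across all 150 regions of a `(4, 150, T)` input.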

@lucianopaz
Contributor Author

@ricardoV94, is there another way to implement this shape broadcasting?

  • If I’m not explicit about it, broadcasting of symbolic x and w fails and takes the shape of x as the correct one (maybe “symbolic” isn’t the right term; I mean tensors that have None in their shape).
  • When I am explicit but the arrays fail to broadcast at run time, I get segfaults (probably because the C backend tries to read or write memory locations that aren’t part of the array).
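The failure mode can be illustrated with plain NumPy, which, unlike a graph built from shapes containing None, checks broadcast compatibility eagerly at run time. A sketch, not code from this PR:

```python
import numpy as np

# Eager NumPy catches the incompatibility immediately; a symbolic graph
# with unknown (None) dims cannot, which is how a segfault can sneak in.
print(np.broadcast_shapes((4, 150), (4, 1)))   # compatible shapes
try:
    np.broadcast_shapes((4, 150), (4,))        # 150 vs 4: incompatible
except ValueError as e:
    print("broadcast failed:", e)
```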

@lucianopaz
Contributor Author

From a closer look at the source, I think there should be a couple of assert Ops there.

@drbenvincent
Contributor

drbenvincent commented Dec 14, 2022

Could I recommend:
a) placing an assert at the start of the function to ensure the sizes of the data and parameters are aligned. EDIT: I see that's already on the todo list.
b) doing some timing tests, even informal ones? I'm finding that inference slows down significantly when going from [4, T] sized inputs (i.e. 4 time series) to [4, 150, T] (i.e. 4 time series, each with 150 geographical regions).

@drbenvincent
Contributor

Feature request: At the moment there's an assumption that if our data has shape [4, 150, T], then we have parameters of shape [4, 150]. This makes sense when we want independent parameters for each predictor*region combination, whether or not we place a hierarchy on them. But this is likely to significantly slow down inference.

Would it be possible to have an option (or an alternative function) that allows you to have different priors over predictors which are applied across all regions? In this example, that would be data of size [4, 150, T] but parameters of size [4].

@lucianopaz
Contributor Author

> Feature request: At the moment there's the assumption that if our data is shape [4, 150, T], then we have parameters of shape [4, 150]. This makes sense in the case that we want independent parameters for each predictor*region combination, whether we place a hierarchy on that or not. But, this is likely to significantly slow down inference.
>
> Would it be possible to have the option (or an alternative function) which allows you to just have different priors over predictors which is applied over all regions. So in this example, that would be data of size [4, 150, T], but parameters of size [4]

This is already supported: just pass the parameters with shape (4, 1), and the same parameter value will be used for every entry in the second axis of x. The logic is that the T dimension of x is ignored, and the remaining axes are broadcast following the same rules as NumPy. That’s why you got a broadcasting failure when the parameter had shape (4,): 4 and 150 don’t broadcast together.
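The broadcasting rule described here can be checked with NumPy's own machinery. A sketch (variable names are illustrative, not from the PR):

```python
import numpy as np

T = 8
x = np.random.rand(4, 150, T)   # 4 predictors x 150 regions x T time points
alpha = np.random.rand(4, 1)    # one parameter per predictor, shared across regions

# The trailing T axis of x is left alone; the parameter axes broadcast
# against x's leading axes under ordinary NumPy rules:
print(np.broadcast_shapes(alpha.shape, x.shape[:-1]))  # (4, 150)

# A (4,)-shaped parameter fails: the 4 would align against the 150 axis.
try:
    np.broadcast_shapes((4,), x.shape[:-1])
except ValueError:
    print("shape (4,) does not broadcast against (4, 150)")
```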

@ricardoV94 ricardoV94 added enhancement New feature or request MMM labels Dec 15, 2022
@lucianopaz lucianopaz marked this pull request as ready for review December 22, 2022 13:14
@lucianopaz
Contributor Author

@ricardoV94, I'd like to ask for your input on the params_broadcast_shapes function that I added here. As I mentioned in some older comments, the pytensor version is not robust to broadcasting failures (shapes that cannot broadcast together are taken to broadcast just fine and can cause segfaults down the line). This version might be suitable for pytensor itself, if you think that it's OK. My main question is whether the approach that I followed for getting "concrete shapes" (shapes that are not TensorVariables themselves) is fine, or if there is something that I might have missed.
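The "concrete shapes" question can be illustrated with a pure-Python sketch of the checking logic. The helper name is hypothetical; the PR's actual params_broadcast_shapes operates on symbolic PyTensor shapes:

```python
def broadcast_shapes_checked(*shapes):
    """Broadcast shape tuples, raising on incompatibility instead of
    silently picking one shape. Unknown dimensions are written as None.
    """
    ndim = max(len(s) for s in shapes)
    # Left-pad every shape with 1s, as NumPy broadcasting does
    padded = [(1,) * (ndim - len(s)) + tuple(s) for s in shapes]
    out = []
    for dims in zip(*padded):
        # Dimensions that are known and not broadcastable (i.e. != 1)
        known = {d for d in dims if d is not None and d != 1}
        if len(known) > 1:
            raise ValueError(f"cannot broadcast dimensions {dims}")
        if known:
            out.append(known.pop())
        elif any(d is None for d in dims):
            out.append(None)   # still unknown at graph-construction time
        else:
            out.append(1)
    return tuple(out)
```

For example, `broadcast_shapes_checked((4, 1), (4, 150))` gives `(4, 150)`, while `(4,)` against `(4, 150)` raises instead of producing a shape that would segfault later.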

@lucianopaz
Contributor Author

I opened pymc-devs/pytensor#152, related to the problem I found with params_broadcast_shapes. If the fix from this PR looks ok, I can also apply it over on pytensor

@lucianopaz
Contributor Author

Pinging @ricardoV94 to have a look. I updated params_shape_broadcast to rely on the tensors' broadcastable attribute. If this isn't desirable, I can undo the last commit.

@ricardoV94
Contributor

@lucianopaz sorry for the delay. I think pymc-devs/pytensor#175 is a better solution

@lucianopaz
Contributor Author

> @lucianopaz sorry for the delay. I think pymc-devs/pytensor#175 is a better solution

I don’t agree. I think that PR addresses a separate, valid issue: broadcast_to should check that the tensor can in fact broadcast to the requested shape. This PR enforces a similar check when computing the output shape of each tensor in params_shape_broadcast. It also enforces the broadcastable flag, which is ignored in the pytensor version.

@ricardoV94
Contributor

ricardoV94 commented Jan 5, 2023

This seems like an odd place to reimplement a utility like this. Right now, the broadcastable flag is ignored by design all over PyTensor. Wouldn't that be the place to start fixing things?

@juanitorduz
Collaborator

Hey! Any blockers for this one? Anything I could do to support?

@ricardoV94
Contributor

It would be good to see if we can remove the custom params_broadcast_shapes and use the one in PyTensor, now that we fixed BroadcastTo.

@ricardoV94
Contributor

Closed in favor of #221

@ricardoV94 ricardoV94 closed this Apr 12, 2023
@twiecki twiecki deleted the vectorized_conv branch September 11, 2024 07:12

Successfully merging this pull request may close these issues.

Adstock transformation without for loop
