Longitudinal models #520

NathanielF · 2023-02-05T09:50:16Z

Example Notebook on Longitudinal Analysis and Growth Curve Trajectories.

Related to this issue: #508

Notebook follows style guide https://docs.pymc.io/en/latest/contributing/jupyter_style.html
PR description contains a link to the relevant issue:
- a tracker one for existing notebooks (tracker issues have the "tracker id" label)
- or a proposal one for new notebooks
Check the notebook is not excluded from any pre-commit check: https://github.com/pymc-devs/pymc-examples/blob/main/.pre-commit-config.yaml

Example of iterative model construction on longitudinal data following the Willett and Singer book. In the first model we follow the text quite closely, and in the second model we choose an alternative likelihood model and fully difrerent set of priors.

Signed-off-by: Nathaniel <NathanielF@users.noreply.github.com>

review-notebook-app · 2023-02-05T09:50:21Z

Check out this pull request on

See visual diffs & provide feedback on Jupyter Notebooks.

Powered by ReviewNB

NathanielF · 2023-02-13T10:21:43Z

I think this is close to done now. Tagging @aloctavodia and @drbenvincent since you both expressed an interest in the topic.
The broad structure of the notebook is as follows:

I focus on 2 examples of iterative model building and the extraction of the within-individual and across-individual trajectories of growth learned by the model. I've tried to explain the crucial aspects of the model as i understand them without getting too caught up in the confusing vocabulary of fixed and random effects, so i'd be grateful for feedback on this aspect especially. I don't want to add to the confusion around these terms!

In the first example i follow the textbook very closely using a normal likelihood and derive parameter estimates akin to those reported in the text. In the second example i go off-reservation and use a gumbel likelihood with censoring which looks more appropriate to me than the Normal likelihood they were constrained to use in the text. I also briefly call out that you can (and probably should) use Bambi for these kinds of models where possible.

drbenvincent · 2023-03-02T17:24:09Z

Sorry for slow response @NathanielF. Fingers crossed I'll have time to review this weekend, or early next week.

NathanielF · 2023-03-02T19:24:04Z

Sorry for slow response @NathanielF. Fingers crossed I'll have time to review this weekend, or early next week.

No problem! Thanks for letting me know.

NathanielF · 2023-03-09T20:30:42Z

Giving this one another nudge in case Friday is a good day for this review @drbenvincent?

review-notebook-app · 2023-03-11T17:27:55Z

View / edit / reply to this conversation on ReviewNB

drbenvincent commented on 2023-03-11T17:27:55Z
----------------------------------------------------------------

First line should have the name of the notebook in the parentheses, not the _title_ of the notebook

NathanielF commented on 2023-03-12T13:54:09Z
----------------------------------------------------------------

Amended

review-notebook-app · 2023-03-11T17:27:56Z

View / edit / reply to this conversation on ReviewNB

drbenvincent commented on 2023-03-11T17:27:56Z
----------------------------------------------------------------

Double check the pymc.sampling_jax import line, just include what's necessary. Optionally, if you use PyMC 5.1.0, you can avoid importing these I believe and provide a nuts_sampler argument to pm.sample.

NathanielF commented on 2023-03-12T13:57:42Z
----------------------------------------------------------------

Yeah, i don't need the blackjax sampler. Removed now.

review-notebook-app · 2023-03-11T17:27:57Z

View / edit / reply to this conversation on ReviewNB

drbenvincent commented on 2023-03-11T17:27:57Z
----------------------------------------------------------------

Capitalize Bayesian

Structure makes it sound like Bambi is used exclusively, but it's only used for one model?

NathanielF commented on 2023-03-12T13:59:12Z
----------------------------------------------------------------

Adjusted this now to be more explicit that the bambi section is a digression to explain an alternative method. Capitalised Bayesian too

review-notebook-app · 2023-03-11T17:27:58Z

View / edit / reply to this conversation on ReviewNB

drbenvincent commented on 2023-03-11T17:27:58Z
----------------------------------------------------------------

Ad space after try/except block

NathanielF commented on 2023-03-12T13:59:39Z
----------------------------------------------------------------

Adjusted.

review-notebook-app · 2023-03-11T17:27:59Z

View / edit / reply to this conversation on ReviewNB

drbenvincent commented on 2023-03-11T17:27:59Z
----------------------------------------------------------------

Use the hide-input cell tag. See style guide for more info.

NathanielF commented on 2023-03-12T13:59:52Z
----------------------------------------------------------------

Hidden

review-notebook-app · 2023-03-11T17:28:00Z

View / edit / reply to this conversation on ReviewNB

drbenvincent commented on 2023-03-11T17:28:00Z
----------------------------------------------------------------

Use the hide-input cell tag. See style guide for more info.

NathanielF commented on 2023-03-12T14:02:43Z
----------------------------------------------------------------

hidden

NathanielF · 2023-03-12T14:02:44Z

hidden

View entire conversation on ReviewNB

NathanielF · 2023-03-12T14:02:58Z

hidden

View entire conversation on ReviewNB

NathanielF · 2023-03-12T14:04:36Z

adjusted. Note one concern here is the color formatting renders differently to color the whole line rather than just the symbols on the site.

View entire conversation on ReviewNB

NathanielF · 2023-03-12T14:04:50Z

Hidden

View entire conversation on ReviewNB

NathanielF · 2023-03-12T14:05:06Z

Adjusted

View entire conversation on ReviewNB

NathanielF · 2023-03-12T14:05:21Z

hidden

View entire conversation on ReviewNB

NathanielF · 2023-03-12T14:05:47Z

hidden

View entire conversation on ReviewNB

NathanielF · 2023-03-12T14:06:01Z

Adjusted

View entire conversation on ReviewNB

NathanielF · 2023-03-12T14:06:19Z

adjusted

View entire conversation on ReviewNB

drbenvincent

Just a few minor style issues:

For the multi-line equation block where we have the align environment, if you place the $ symbol before the = or ~ then everything aligns to that. At the moment it's right aligned.
Add time series to the tags
Consider moving from the Case Studies section to the Time Series section of the repo
Missing space after full stop for "our model.The"
Remove commented out line... # grade = pm.MutableData('grade_data', df_external['GRADE'].values)
Ideally, it would be possible to suppress console output after plotting with ;. But I can't remember if the black formatting removes that. I believe there's a way to suppress black from removing that
Where you do use sample_numpyro_nuts, can you use the hide-output (cell tag) because we get a whole bunch of progress bar output that we don't need. Or maybe there's a specific kwarg for that sampler to suppress that?
In the rendered website output, there's something really broken about the rendering of the L2 Watermark header, after the References L2 header. I've not double checked, but can probably be fixed by having that in it's own cell. Either way, there's something not working there. More info here https://www.pymc.io/projects/docs/en/latest/contributing/jupyter_style.html#watermark

NathanielF · 2023-03-12T15:23:40Z

Done.

View entire conversation on ReviewNB

drbenvincent · 2023-03-12T16:42:56Z

Nice. I'll endeavour to do a final pass, reviewing content/clarity in the next few days and we can ship this :) Do feel free to ping me, but it is on my list.

…eries tag and aligning equations

NathanielF · 2023-03-12T17:20:42Z

Consider moving from the Case Studies section to the Time Series section of the repo

On this point, i think i prefer it in the case studies section. Time series analysis/modelling is (in my head anyway) a different kind of beast than longitudinal curve analysis. They're clearly, related... but just not the same bucket for me. Can move it if you feel strongly about it, but i prefer it as it is.

On the watermark thing i wasn't sure what you meant... it looks ok to me:

NathanielF · 2023-03-12T17:21:23Z

Nice. I'll endeavour to do a final pass, reviewing content/clarity in the next few days and we can ship this :) Do feel free to ping me, but it is on my list.

Perfect. Thanks for your help on this!

NathanielF · 2023-03-20T09:22:39Z

Just a small nudge @drbenvincent.

drbenvincent

Could you add a coord for the observations for all models. We have 246 in the early models, and 254 with later models.

drbenvincent · 2023-04-10T11:04:32Z

Typo in this sentence "Implementing the model is PyMC is as follows"

drbenvincent · 2023-04-10T11:06:39Z

I'm wondering if we could get the model math for the "The Uncontrolled Effects of Parental Alcoholism" model. Don't worry about the colour coding if that gets messy.

Same for the "Model controlling for Peer Effects" model

drbenvincent · 2023-04-10T11:13:34Z

After this sentence

The formula specification uses 1 to denote an intercept term and a conditional | operator to denote a subject level parameter combined with the global parameter of the same type in the manner specified above.

could you just spell out how to read the hierarchical term (1 + age_14 | id) in the Bambi model equation

And we don't need the 0 + part of the equation I believe?

drbenvincent · 2023-04-10T11:21:24Z

I think there's one instance of a lower case "bambi", but everything else is capitalised.

This is great. Happy to approve after these minor tweaks 👍
Sorry for the delay in finishing the review

NathanielF · 2023-04-10T12:39:07Z

Thanks @drbenvincent! Will get to these tonight.

NathanielF · 2023-04-10T19:47:32Z

Made those changes @drbenvincent. Seemed to work in the Review Notebook, but docs build check renders strangely:
Seems to effect all notebooks, so likely something to do with the broader site render than my specific PR here.

Probably related to this: #541

drbenvincent

So the formatting issue looks like a broader issue, not something specific to this notebook. So happy to approve. Let me know if there are any minor tweaks you wanted to make, otherwise I'll merge.

Thanks again for the contribution, and for bearing with the tardy reviews.

NathanielF · 2023-04-11T14:23:11Z

Happy with it. Feel free to merge! Thanks again

NathanielF added 3 commits February 1, 2023 15:25

[Longitudinal Analysis pymc-devs#508] first commit

44bd804

Signed-off-by: Nathaniel <NathanielF@users.noreply.github.com>

[Longitudinal Analysis pymc-devs#508] adding 3rd model

13b4a2c

Signed-off-by: Nathaniel <NathanielF@users.noreply.github.com>

[Longitudinal Analysis pymc-devs#508] adding 4th model

69e8341

NathanielF added 13 commits February 5, 2023 15:40

[Longitudinal Analysis pymc-devs#508] tidied plots

e0e0d3b

[Longitudinal Analysis pymc-devs#508] adding in polynomial time curves

924308c

[Longitudinal Analysis pymc-devs#508] adding polynomial plots

6e5015c

[Longitudinal Analysis pymc-devs#508] adding write-up

94c336e

[Longitudinal Analysis pymc-devs#508] adding myst notebook

2196f0d

[Longitudinal Analysis pymc-devs#508] adding myst notebook formatted

36c70cf

[Longitudinal Analysis pymc-devs#508] added gender to final model

1e36d50

[Longitudinal Analysis pymc-devs#508] added numpyro fit

0cea8d6

[Longitudinal Analysis pymc-devs#508] added bambi example

109b165

[Longitudinal Analysis pymc-devs#508] added latex formula specification

0ea3a88

[Longitudinal Analysis pymc-devs#508] added waic comparison

5709f3a

[Longitudinal Analysis pymc-devs#508] fixed minor typos

efa208f

[Longitudinal Analysis pymc-devs#508] tighter writing.

8365bde

NathanielF marked this pull request as ready for review February 13, 2023 10:13

NathanielF requested a review from aloctavodia February 15, 2023 12:34

NathanielF requested a review from drbenvincent February 24, 2023 10:28

drbenvincent requested changes Mar 12, 2023

View reviewed changes

[Longitudinal Analysis pymc-devs#508] hiding output and adding time s…

cf3ede7

…eries tag and aligning equations

drbenvincent self-requested a review April 10, 2023 10:58

drbenvincent reviewed Apr 10, 2023

View reviewed changes

NathanielF added 2 commits April 10, 2023 20:17

[Longitudinal Analysis pymc-devs#508] adjusting with Ben's feedback

6ba7c29

[Longitudinal Analysis pymc-devs#508] adding some more text

e0ef184

NathanielF requested a review from drbenvincent April 10, 2023 19:56

drbenvincent approved these changes Apr 11, 2023

View reviewed changes

drbenvincent merged commit ea709bb into pymc-devs:main Apr 11, 2023

Longitudinal models #520

Longitudinal models #520

Conversation

NathanielF commented Feb 5, 2023 • edited Loading

Example Notebook on Longitudinal Analysis and Growth Curve Trajectories.

review-notebook-app bot commented Feb 5, 2023

NathanielF commented Feb 13, 2023

drbenvincent commented Mar 2, 2023

NathanielF commented Mar 2, 2023

NathanielF commented Mar 9, 2023

review-notebook-app bot commented Mar 11, 2023 • edited Loading

review-notebook-app bot commented Mar 11, 2023 • edited Loading

review-notebook-app bot commented Mar 11, 2023 • edited Loading

review-notebook-app bot commented Mar 11, 2023 • edited Loading

review-notebook-app bot commented Mar 11, 2023 • edited Loading

review-notebook-app bot commented Mar 11, 2023 • edited Loading

NathanielF commented Mar 12, 2023

NathanielF commented Mar 12, 2023

NathanielF commented Mar 12, 2023

NathanielF commented Mar 12, 2023

NathanielF commented Mar 12, 2023

NathanielF commented Mar 12, 2023

NathanielF commented Mar 12, 2023

NathanielF commented Mar 12, 2023

NathanielF commented Mar 12, 2023

drbenvincent left a comment

Choose a reason for hiding this comment

NathanielF commented Mar 12, 2023

drbenvincent commented Mar 12, 2023

NathanielF commented Mar 12, 2023

NathanielF commented Mar 12, 2023

NathanielF commented Mar 20, 2023

drbenvincent left a comment

Choose a reason for hiding this comment

drbenvincent commented Apr 10, 2023

drbenvincent commented Apr 10, 2023 • edited Loading

drbenvincent commented Apr 10, 2023 • edited Loading

drbenvincent commented Apr 10, 2023

NathanielF commented Apr 10, 2023

NathanielF commented Apr 10, 2023 • edited Loading

drbenvincent left a comment

Choose a reason for hiding this comment

NathanielF commented Apr 11, 2023

NathanielF commented Feb 5, 2023 •

edited

Loading

review-notebook-app bot commented Mar 11, 2023 •

edited

Loading

review-notebook-app bot commented Mar 11, 2023 •

edited

Loading

review-notebook-app bot commented Mar 11, 2023 •

edited

Loading

review-notebook-app bot commented Mar 11, 2023 •

edited

Loading

review-notebook-app bot commented Mar 11, 2023 •

edited

Loading

review-notebook-app bot commented Mar 11, 2023 •

edited

Loading

drbenvincent commented Apr 10, 2023 •

edited

Loading

drbenvincent commented Apr 10, 2023 •

edited

Loading

NathanielF commented Apr 10, 2023 •

edited

Loading