update hierarchical partial pooling notebook #219

mjhajharia · 2021-08-31T23:50:19Z

adds dims and coords to use plot_forest without yticklabels, @OriolAbril since the arviz label thing is WIP from what i inferred this seems to be an ok way?
changed numpy random generator from legacy to default_rng() (the new standard one)
also, there wasn't a lot to update in this notebook, it was pretty short, if there's things i'm missing let me know!

review-notebook-app · 2021-08-31T23:50:23Z

Check out this pull request on

See visual diffs & provide feedback on Jupyter Notebooks.

Powered by ReviewNB

review-notebook-app · 2021-09-01T07:17:15Z

View / edit / reply to this conversation on ReviewNB

chiral-carbon commented on 2021-09-01T07:17:15Z
----------------------------------------------------------------

thanks, looks good! do add -p xarray with the watermark.

OriolAbril · 2021-09-01T10:09:15Z

does the notebook run without changes with both v3 and v4?

mjhajharia · 2021-09-01T10:10:06Z

yes

review-notebook-app · 2021-09-16T20:07:03Z

View / edit / reply to this conversation on ReviewNB

MarcoGorelli commented on 2021-09-16T20:07:03Z
----------------------------------------------------------------

apply is usually pretty slow, the second line can probably be done much faster with

player_names = data['FirstName'] + ' ' + data['LastName']

Just for an example:

In [2]: import pandas as pd
In [3]: df = pd.DataFrame({'FirstName': ['foo']*1000, 'LastName': ['bar']*1000})
In [4]: %%timeit

   ...: df.apply(lambda x: x.FirstName + " " + x.LastName, axis=1)

   ...: 

   ...: 
14.5 ms ± 1.45 ms per loop (mean ± std. dev. of 7 runs, 100 loops each)
In [5]: %%timeit
   ...: df['FirstName'] + ' ' + df['LastName']

   ...: 

   ...: 
213 µs ± 9.21 µs per loop (mean ± std. dev. of 7 runs, 1000 loops each)

michaelosthege commented on 2021-10-08T13:51:03Z
----------------------------------------------------------------

Don't know if it renders well, but a pm.model_to_graphviz might be nice here.

mjhajharia commented on 2021-11-21T00:13:51Z
----------------------------------------------------------------

thanks @michaelosthege it came out nice

MarcoGorelli

Looks good to me, just left a comment which is more of an FYI than anything else

OriolAbril · 2021-09-22T00:36:49Z

@mjhajharia can you rerun it with v3 for now still and add the post directive and use bibtex references? link to updated style guide: https://github.com/pymc-devs/pymc3/wiki/PyMC3-Jupyter-Notebook-Style-Guide

mjhajharia · 2021-10-06T21:11:05Z

re executed everything with v3. the only changes were replacing aesara.tensor with theano.tensor

michaelosthege · 2021-10-08T13:51:04Z

Don't know if it renders well, but a pm.model_to_graphviz might be nice here.

View entire conversation on ReviewNB

review-notebook-app · 2021-10-08T13:53:06Z

View / edit / reply to this conversation on ReviewNB

michaelosthege commented on 2021-10-08T13:53:05Z
----------------------------------------------------------------

Line #2.        trace = pm.sample(2000, tune=2000, chains=2, target_accept=0.95, return_inferencedata=True)

Didn't we decide to call it idata ?

OriolAbril commented on 2021-10-08T14:04:11Z
----------------------------------------------------------------

I think we haven't decided anything yet, the issue is open https://github.com/pymc-devs/pymc/issues/4821 and nothing has been aded to the style guide yet. And it's also unclear if we should enforce those names or offer them as suggestions in case author is not opinionated about naming

review-notebook-app · 2021-10-08T13:53:07Z

View / edit / reply to this conversation on ReviewNB

michaelosthege commented on 2021-10-08T13:53:06Z
----------------------------------------------------------------

The names are all in braces. Did they end up as length-1 lists? Maybe doublecheck if model.coords["player_names"] has the right type.

OriolAbril commented on 2021-10-08T14:08:41Z
----------------------------------------------------------------

I was probably the one to do it but I am not sure why. This is the default style for plot_forest, see https://arviz-devs.github.io/arviz/api/generated/arviz.plot_forest.html also for example. The variable name is only shown once, and the coord values which are what differs are written in all cases, but using the same "format". I guess this allows to distinguish [Max Alvis] is coordinate Max Alvis of variable theta whereas Max Alvis is independent variable Max Alvis. I am opinionated on the labelling creation and labeller class, but not really on the actual formats so any suggestions are welcome

mjhajharia commented on 2021-11-21T00:16:47Z
----------------------------------------------------------------

the actual type is correct, var_names="thetas" works as a temporary patch for this case

review-notebook-app · 2021-10-08T14:00:33Z

View / edit / reply to this conversation on ReviewNB

OriolAbril commented on 2021-10-08T14:00:33Z
----------------------------------------------------------------

post directive should go in the first cell but under the title. I don't really understand why, but adding this here doesn't work. i.e. there is no hierarchical model tag in https://pymc-examples--219.org.readthedocs.build/en/219/blog/tag.html nor the notebook appears listed in https://pymc-examples--219.org.readthedocs.build/en/219/blog/tag/pymc3model.html.

also cc @martinacantaro, tags page says "hierarchical model" and "linear model" but tags currently in use are "hierarchical" and "linear model", not sure what to do, change hierarchical and add model? leave as is? Once decided on something we do need to enforce it so tags are actually meaningful

review-notebook-app · 2021-10-08T14:00:34Z

View / edit / reply to this conversation on ReviewNB

OriolAbril commented on 2021-10-08T14:00:34Z
----------------------------------------------------------------

the text with (1) and (2) feels like it should be a numbered list instead

review-notebook-app · 2021-10-08T14:00:35Z

View / edit / reply to this conversation on ReviewNB

OriolAbril commented on 2021-10-08T14:00:35Z
----------------------------------------------------------------

can you also add dims to y. It won't have any effect on the shape but it will define the dims in inferencedata which would be useful if doing complicated postprocessing, they may appear in the graphviz diagram and even as annotation alone they are also helpful I think

OriolAbril · 2021-10-08T14:04:12Z

I think we haven't decided anything yet, the issue is open https://github.com/pymc-devs/pymc/issues/4821 and nothing has been aded to the style guide yet. And it's also unclear if we should enforce those names or offer them as suggestions in case author is not opinionated about naming

View entire conversation on ReviewNB

OriolAbril · 2021-10-08T14:08:42Z

I was probably the one to do it but I am not sure why. This is the default style for plot_forest, see https://arviz-devs.github.io/arviz/api/generated/arviz.plot_forest.html also for example. The variable name is only shown once, and the coord values which are what differs are written in all cases, but using the same "format". I guess this allows to distinguish [Max Alvis] is coordinate Max Alvis of variable theta whereas Max Alvis is independent variable Max Alvis. I am opinionated on the labelling creation and labeller class, but not really on the actual formats so any suggestions are welcome

View entire conversation on ReviewNB

OriolAbril

marking as request changes for the post directive thing, the rest of the comments are minor. Once the post directive is right and tags are working consider this approved even if I don't review again

mjhajharia · 2021-11-21T00:13:52Z

thanks @michaelosthege it came out nice

View entire conversation on ReviewNB

mjhajharia · 2021-11-21T00:16:47Z

the actual type is correct, var_names="thetas" works as a temporary patch for this case

View entire conversation on ReviewNB

mjhajharia · 2021-11-21T00:34:59Z

btw we could maybe add a test for references.bib syntax errors as well, so it gets fixed in pre-commit or something

mjhajharia · 2021-11-21T00:40:18Z

checked the render, everything worked out, I did the requested changes but because of rebasing git doesn't recognise them, can someone approve and merge?

mjhajharia force-pushed the main branch from f108946 to 5d1f536 Compare September 13, 2021 14:28

mjhajharia requested a review from OriolAbril September 13, 2021 14:29

MarcoGorelli approved these changes Sep 16, 2021

View reviewed changes

mjhajharia force-pushed the main branch 2 times, most recently from 56eaec5 to 00bd4e5 Compare October 6, 2021 21:03

mjhajharia requested a review from MarcoGorelli October 6, 2021 21:10

OriolAbril requested changes Oct 8, 2021

View reviewed changes

mjhajharia added 7 commits November 21, 2021 04:53

update notebook to use dims and coords

f1c6be5

precommit

603dcf5

watermark

da3a043

rerun with v3 and theano

0204391

Update references.bib

83cf7b8

Update references.bib

c332c03

precommit again✨

ccba41d

mjhajharia force-pushed the main branch from d6a3289 to ccba41d Compare November 20, 2021 23:26

mjhajharia added 3 commits November 21, 2021 05:52

changes requested and suggested

7cd45f7

formatting thing

ab98122

references.bib syntax

59fb9b4

mjhajharia requested a review from OriolAbril November 21, 2021 00:40

michaelosthege approved these changes Nov 21, 2021

View reviewed changes

michaelosthege merged commit 7d92add into pymc-devs:main Nov 21, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

update hierarchical partial pooling notebook #219

update hierarchical partial pooling notebook #219

mjhajharia commented Aug 31, 2021

review-notebook-app bot commented Aug 31, 2021

review-notebook-app bot commented Sep 1, 2021

OriolAbril commented Sep 1, 2021

mjhajharia commented Sep 1, 2021 via email

review-notebook-app bot commented Sep 16, 2021 •

edited

Loading

MarcoGorelli left a comment

OriolAbril commented Sep 22, 2021

mjhajharia commented Oct 6, 2021

michaelosthege commented Oct 8, 2021

review-notebook-app bot commented Oct 8, 2021 •

edited

Loading

review-notebook-app bot commented Oct 8, 2021 •

edited

Loading

review-notebook-app bot commented Oct 8, 2021 •

edited

Loading

review-notebook-app bot commented Oct 8, 2021 •

edited

Loading

review-notebook-app bot commented Oct 8, 2021 •

edited

Loading

OriolAbril commented Oct 8, 2021

OriolAbril commented Oct 8, 2021

OriolAbril left a comment

mjhajharia commented Nov 21, 2021

mjhajharia commented Nov 21, 2021

mjhajharia commented Nov 21, 2021

mjhajharia commented Nov 21, 2021

update hierarchical partial pooling notebook #219

update hierarchical partial pooling notebook #219

Conversation

mjhajharia commented Aug 31, 2021

review-notebook-app bot commented Aug 31, 2021

review-notebook-app bot commented Sep 1, 2021

OriolAbril commented Sep 1, 2021

mjhajharia commented Sep 1, 2021 via email

review-notebook-app bot commented Sep 16, 2021 • edited Loading

MarcoGorelli left a comment

Choose a reason for hiding this comment

OriolAbril commented Sep 22, 2021

mjhajharia commented Oct 6, 2021

michaelosthege commented Oct 8, 2021

review-notebook-app bot commented Oct 8, 2021 • edited Loading

review-notebook-app bot commented Oct 8, 2021 • edited Loading

review-notebook-app bot commented Oct 8, 2021 • edited Loading

review-notebook-app bot commented Oct 8, 2021 • edited Loading

review-notebook-app bot commented Oct 8, 2021 • edited Loading

OriolAbril commented Oct 8, 2021

OriolAbril commented Oct 8, 2021

OriolAbril left a comment

Choose a reason for hiding this comment

mjhajharia commented Nov 21, 2021

mjhajharia commented Nov 21, 2021

mjhajharia commented Nov 21, 2021

mjhajharia commented Nov 21, 2021

review-notebook-app bot commented Sep 16, 2021 •

edited

Loading

review-notebook-app bot commented Oct 8, 2021 •

edited

Loading

review-notebook-app bot commented Oct 8, 2021 •

edited

Loading

review-notebook-app bot commented Oct 8, 2021 •

edited

Loading

review-notebook-app bot commented Oct 8, 2021 •

edited

Loading

review-notebook-app bot commented Oct 8, 2021 •

edited

Loading