Update and test example notebooks #6
Conversation
#8 to be addressed in a future PR.
@phackstock and @jkikstra any objections to merging?
I just took a quick look at the files changed and I was wondering why we need `.github/workflows/get-infiller-database.py`. Is that not covered by the conftest fixture version of downloading the required files?
I was trying to work out if I could do a user-like workflow (assuming that
users don't know and don't want to know anything about tests and fixtures).
I couldn't work out how to just call the fixture without running the tests,
so I just duplicated the functionality. My thinking is that we would remove
the WG3 tests in favour of the notebooks in a follow-up PR because they
test the same thing. As a result, the duplication should only be temporary
and hence acceptable?
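For readers following along: pytest does not allow fixtures to be called as ordinary functions, which is why the download logic ended up duplicated in a standalone script. A minimal sketch of the situation, with purely illustrative names (none of this is the repository's actual code):

```python
# pytest raises an error if a fixture function is invoked directly, so a
# user-facing script cannot simply reuse the conftest fixture. The usual
# workaround is to factor the logic into a plain helper that both the
# fixture and the script call. Names here are illustrative only.
import pytest


def download_infiller_database(out_path):
    """Download the infiller database to ``out_path`` (plain helper)."""
    ...  # e.g. an authenticated request to the Scenario Explorer


@pytest.fixture(scope="session")
def infiller_database(tmp_path_factory):
    # Usable from tests, but not callable as a normal function.
    out_path = tmp_path_factory.mktemp("data") / "infiller-database.csv"
    download_infiller_database(out_path)
    return out_path
```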
@znicholls not sure I can follow your explanation. Would the purpose of this be that the users can run the tests locally? If so, that mechanism already exists and is explained in the skip reason. In order to not skip, users simply need to download this file from the Scenario Explorer and place it in the correct folder.
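The skip mechanism referred to here is presumably something like the following pytest pattern (a sketch only; the file name and path are hypothetical, not the repository's actual values):

```python
# Skip guard sketch: the test runs only if the infiller database has
# been downloaded manually and placed in the expected folder.
import os.path

import pytest

# Hypothetical location; the real repository may use a different path.
INFILLER_DATABASE = os.path.join("tests", "test-data", "infiller-database.csv")

requires_infiller_database = pytest.mark.skipif(
    not os.path.exists(INFILLER_DATABASE),
    reason=(
        "Infiller database not found. Download it from the Scenario "
        "Explorer and place it at %s" % INFILLER_DATABASE
    ),
)


@requires_infiller_database
def test_wg3_reproduction():
    ...  # the actual reproduction test
```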
Oops, let me try again. So, the overall idea is that we can run the notebooks as part of the CI routine. They test reproduction of what is in the WG3 database, but in a way which is easier for new users to understand and explore. In order to reproduce the WG3 database, we need to download the infiller database. So the question is, where to put that script? My thinking was:
Does that help?
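One common way to wire notebooks into CI (a sketch only; not necessarily the mechanism this PR uses) is to execute each notebook programmatically and fail the test if any cell raises:

```python
# Execute every example notebook and fail the test on any cell error.
# The notebooks/ glob is illustrative; the repository layout may differ.
import glob

import nbformat
import pytest
from nbclient import NotebookClient


@pytest.mark.parametrize("path", sorted(glob.glob("notebooks/*.ipynb")))
def test_notebook_executes(path):
    nb = nbformat.read(path, as_version=4)
    NotebookClient(nb, timeout=600).execute()
```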
@znicholls thanks for the explanation, now I get it. Not sure I am the biggest fan though of using Jupyter notebooks for this dual purpose. I really like them for introducing users to the concepts of each model and how to run them; for this they're doing a great job here. However, any changes to a notebook like this, whose primary purpose is playing around with and getting a feeling for the software package, could break the tests. Additionally, in the notebooks there are many things that are done for illustrative purposes, like printing out intermediate results or plotting. To me a test should be as single-focus as possible; in my opinion, any changes introduced to a test should only be done to fix it in case code changes deliberately broke it. In short, I would vote for keeping the tests as is, keeping the notebooks as is, and reverting the GitHub Action workflows to the way they were before. What do you think? Also @jkikstra, what's your take on this?
Thanks for the thoughts, very helpful and good to understand. Let me start
by saying that I'm happy to keep the WG3 reproduction tests as they are
(even if they duplicate the same test in principle, it makes things clearer
conceptually?).
tl;dr I think it’s great to demonstrate how to reproduce the database
results in a notebook (rather than only in testing code which users might
not find) and if we’re going to claim that the notebook reproduces the
results, we should test it.
---
Re users changing notebooks and breaking the tests, that's OK, no? We would
never merge such changes to the notebooks back in, so if the CI is broken,
that doesn't matter? Or am I missing something here?
I think about notebooks as documentation, and I’m quite a big fan of
doctests. So I think it’s important to test the notebooks, irrespective of
what they do (nothing frustrates me more than trying out an example only to
find it’s broken).
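For comparison, a doctest ties an example in the documentation directly to the test runner, so the documentation fails loudly if it stops being true. A minimal illustration (purely an example, not code from this repository):

```python
def add_one(x):
    """Return ``x`` plus one.

    The example below is executed by the doctest runner, so it cannot
    silently go stale.

    >>> add_one(2)
    3
    """
    return x + 1
```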
Then the question is what should the notebooks demonstrate. In this PR I
try to demonstrate two things in the notebooks: 1) how to reproduce the WG3
database and 2) how to run your own custom scenario.
For the first application, I think it’s a great thing to have. If someone
can pull the notebook off the shelf and reproduce the database results,
that’s a great starting point. If we’re going to make a claim that our
notebook can do such a reproduction, it’s important to make sure that the
claim is true. So that’s why I’d vote to a) keep it and b) test it.
For the second application, I also think it's a great thing to have. Users
can see how they'd run their own scenario (and what format their data needs
to have). I haven't tested it because I didn't want to favour one climate
model over any other (but that might be an overly careful approach).
We could test it by just making sure it runs (or even making sure that the
output it produces is stable if we wanted, but that may add confusing stuff
for users so might be best avoided).
I would be very uncomfortable keeping things as they are for two reasons.
The first is that nothing is tested, so we could easily break our notebooks
without realising (never a good look). The second is that only FaIR is
demonstrated. We should have examples of how to run all three climate
models.
Hope that helps
@znicholls thanks for the detailed response. I see your points and I think we agree on all the important ones. The only thing I'd request as a change is to move the […]
Ok, will do (maybe tonight, maybe Thursday after the scenario forum), thanks for the discussion!
This will fail until the pyam database is back up.
Looks good to me, thanks for all the updates @znicholls.
Will merge as soon as the tests run through if that's ok with you.
Great, thanks!
Adds new example notebooks for all climate models we currently support and adds infrastructure for testing the notebooks. Note that this means we're effectively running the WG3 reproduction test twice; I would suggest removing that duplication in a follow-up PR.
CHANGELOG.rst entry added (single line such as: (`#XX <https://github.com/iiasa/climate-assessment/pull/XX>`_) Added feature which does something)

Depends on:

Supersedes #2