Refactor steps in blending code #453

Merged: 40 commits into master on Feb 3, 2025

Conversation

@sidekock (Contributor)

See #443

sidekock and others added 30 commits November 18, 2024 11:28
Co-authored-by: mats-knmi <[email protected]>
….velocity_perturbations = [] in __initialize_random_generators

codecov bot commented Jan 20, 2025

Codecov Report

Attention: Patch coverage is 72.22222% with 10 lines in your changes missing coverage. Please review.

Project coverage is 84.31%. Comparing base (a7dae54) to head (5fad664).
Report is 1 commit behind head on master.

Files with missing lines     Patch %   Lines
pysteps/nowcasts/steps.py    72.22%    10 Missing ⚠️
Additional details and impacted files
@@            Coverage Diff             @@
##           master     #453      +/-   ##
==========================================
+ Coverage   84.26%   84.31%   +0.05%     
==========================================
  Files         160      160              
  Lines       13067    13251     +184     
==========================================
+ Hits        11011    11173     +162     
- Misses       2056     2078      +22     
Flag         Coverage Δ
unit_tests   84.31% <72.22%> (+0.05%) ⬆️


@sidekock (Contributor Author)

Both visual tests and assert statements give the same conclusion: the output is exactly the same!

@sidekock (Contributor Author)

@dnerini The codecov check is failing, but the only things it is actually failing on are the deepcopys I have done in the code and the timing, which does not seem worth writing tests for as it is a very basic subtraction. Would it be possible to disable the check for these things, or how does this work? I know you were able to do it for the refactoring of the steps nowcasting.
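
For what it's worth, one generic way to exclude such lines, not necessarily how it must be done here, is coverage.py's pragma comment, which its default configuration recognizes. A minimal sketch:

import time

def timed_call(func):
    """Run func and print the elapsed wall-clock time."""
    start = time.time()
    result = func()
    # A trivial subtraction like the one mentioned above can be excluded
    # from the coverage measurement with the standard pragma (hypothetical):
    elapsed = time.time() - start  # pragma: no cover
    print(f"elapsed: {elapsed:.3f} s")
    return result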

@RubenImhoff (Contributor) left a comment


Hi @sidekock and @mats-knmi,

Thanks a lot for your work here! I'm sorry I was not around the past few days to answer some questions and think along. However, this looks great! With the large number of changes, it is becoming difficult to check everything thoroughly, so at least I'm happy to see that it gives exactly the same results. I think a to-do for a new PR could be updating the documentation, but that is probably easier and cleaner once this PR is merged.

To further reduce memory usage, both this array and the ``velocity_models`` array
can be given as float32. They will then be converted to float64 before computations
to minimize loss in precision.
# TODO: compare old and new version of the code, run a benchmark to compare the two
Is this TODO still necessary? I can imagine we'd like to have this done prior to merging this PR.

@sidekock (Contributor Author) commented Jan 24, 2025

The two versions now produce identical output. I just want to make sure this is the case for all configurations, meaning:

  • Single core, single member
  • Single core, multiple members
  • Multiple cores, single member
  • Multiple cores, multiple members (fewer cores than members)
  • Multiple cores, multiple members (more cores than members)

The reason I want to do this is that I discovered that the pysteps tests do not properly handle this (all tests passed in a previous version of the code, but it crashed on issues related to the number of cores and the number of members).
If you can think of other tests, I would love to hear suggestions!
The tests would consist of running both the master and refactored branches and then doing the following checks (see the sketch after this list):

  • DataArray.identical(other): @mats-knmi, do you know if this actually checks the values of the nowcast or only the structure and metadata? It's a bit unclear to me, but since it is very fast, I would assume it does not test the data itself.
  • Depending on the previous answer, I could also check some random fields at random times and members.
  • Lastly, I can do a visual inspection, but if the previous two are fine I do not think this matters; it's a matter of having enough confidence before I actually make this the default code :)
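
Roughly, the sketch I have in mind (the file names, the variable "precip_intensity", and the dimension names "ens_number"/"time" are placeholder assumptions, not the actual test code):

import numpy as np
import xarray as xr

# Hypothetical nowcast outputs written by the master and refactored branches.
ds_master = xr.open_dataset("nowcast_master.nc")
ds_refactor = xr.open_dataset("nowcast_refactor.nc")

# Checks structure, metadata, and (per the xarray docs) element-wise values.
assert ds_master.identical(ds_refactor)

# Spot-check random members and lead times on the precipitation variable.
rng = np.random.default_rng(42)
for _ in range(5):
    member = int(rng.integers(ds_master.sizes["ens_number"]))
    t = int(rng.integers(ds_master.sizes["time"]))
    a = ds_master["precip_intensity"].isel(ens_number=member, time=t)
    b = ds_refactor["precip_intensity"].isel(ens_number=member, time=t)
    np.testing.assert_allclose(a.values, b.values)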


identical should do an element-wise comparison (at least, that is what I gather from the docs). The docs say that identical is the same as equals but also checks metadata, and the docs for equals say that it compares not only the structure but also the contents of the datasets. A very simple test seems to confirm this:

>>> import numpy as np
>>> import xarray as xr
>>> a = xr.Dataset({"x": ("x", np.array([1, 2, 3, 4], dtype=np.float64)), "y": ("x", np.array([4, 3, 2, 1], dtype=np.float64))})
>>> b = xr.Dataset({"x": ("x", np.array([1, 2, 3, 4], dtype=np.float64)), "y": ("x", np.array([4, 3, 2, 1], dtype=np.float64))})
>>> a.identical(b)
True
>>> a.identical(b.assign(y=("x", np.array([4, 3, 2, 2], dtype=np.float64))))
False
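
The metadata part of the distinction can be shown the same way: equals ignores attributes, while identical compares them too (again a toy example of my own, not from the docs):

>>> a = xr.Dataset({"x": ("t", np.arange(3.0))}, attrs={"units": "mm/h"})
>>> b = xr.Dataset({"x": ("t", np.arange(3.0))})
>>> a.equals(b)
True
>>> a.identical(b)
False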

@sidekock (Contributor Author)

Thanks, I should have done such a test myself but totally forgot. I'll do the last checks in the next few days, and then we can close this six-month work-in-progress :)
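
As a side note on the float32 remark quoted at the top of this thread, the usage pattern it describes would look roughly like this (array names and shapes are made up for illustration, not pysteps API calls):

import numpy as np

# Hypothetical inputs: passing the large precipitation and NWP velocity
# arrays as float32 roughly halves their memory footprint.
precip_models = np.random.rand(2, 12, 700, 700).astype(np.float32)
velocity_models = np.random.rand(2, 2, 700, 700).astype(np.float32)

# Per the quoted docstring, the blending code converts back to float64
# before computations to minimize precision loss (illustrative only):
precip_models_f64 = precip_models.astype(np.float64)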

@dnerini (Member) commented Jan 21, 2025

> @dnerini The codecov check is failing, but the only things it is actually failing on are the deepcopys I have done in the code and the timing, which does not seem worth writing tests for as it is a very basic subtraction. Would it be possible to disable the check for these things, or how does this work? I know you were able to do it for the refactoring of the steps nowcasting.

@sidekock we can simply ignore the failing check and merge whenever you feel it's ready

@sidekock (Contributor Author) commented Feb 1, 2025

Results of my tests on Belgian NWP blending:

  • Single core, single member ✔️
  • Single core, multiple members ✔️
  • Multiple cores, single member ✔️
  • Multiple cores, multiple members (fewer cores than members) ✔️
  • Multiple cores, multiple members (more cores than members) ✔️
  • Test with a longer lead time (6 hours at 5 min resolution; the previous ones were all 1 hour) ✔️

All tests seem to pass. I will run all the pysteps tests one more time, and if they all pass, I think it is time to release this into the wild :)

@sidekock (Contributor Author) commented Feb 1, 2025

@RubenImhoff @mats-knmi all tests are still succeeding, and all the extra tests that I have run say the two datasets are identical. It seems we can Squash and merge, or do you think more tests need to be done?

@mats-knmi (Contributor)

Sounds good, I don't have anything else, so squash and merge away!

@RubenImhoff (Contributor)

Same for me, fantastic work! :)

@sidekock (Contributor Author) commented Feb 3, 2025

@dnerini Do I also have to bump the version, or is this something I should leave up to you? Or is this not something that is done for code refactoring?

@mats-knmi (Contributor)

I would like a version bump, even if it is just for the RNG reproducibility change that I merged earlier, since that would help me out with the tests that we have running.

@sidekock (Contributor Author) commented Feb 3, 2025

> I would like a version bump, even if it is just for the RNG reproducibility change that I merged earlier, since that would help me out with the tests that we have running.

I'm making some last changes to the documentation, and then I'll make one final commit before I merge.

@sidekock merged commit 92deda1 into master on Feb 3, 2025 (8 of 9 checks passed)
@dnerini (Member) commented Feb 3, 2025

Fantastic to see this merged! Congratulations on all the hard work you put into this, well done!

@sidekock (Contributor Author) commented Feb 3, 2025

> Fantastic to see this merged! Congratulations on all the hard work you put into this, well done!

@dnerini Thanks! The tests for the docs seem to be failing due to the codecov check:
[screenshot of the failing check]

Also, the version does not seem to have been bumped. I don't know what I did wrong, because I changed the version in the PKG-INFO.

@dnerini (Member) commented Feb 3, 2025

@sidekock the error while compiling the docs was caused by a deprecation in scipy (see #455), while the version also needed to be bumped in setup.py (5ede008).

Now it should be all good. Shall we release v1.14?
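
For reference, the kind of setup.py version field meant here looks roughly like this (illustrative only; the real file has much more metadata):

# setup.py (minimal sketch, not pysteps' actual file)
from setuptools import setup

setup(
    name="pysteps",
    version="1.14.0",  # keep in sync with PKG-INFO when bumping
)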

@sidekock (Contributor Author) commented Feb 3, 2025

Yes, please! Then I can finally cross this one off my to-do list :)
