
New computational gridding #71

dhardestylewis opened this issue Jul 27, 2022 · 5 comments
dhardestylewis commented Jul 27, 2022

tl;dr: new gridding scheme

  • 44 hours to retile each new subgrid (40 hrs wait + 4 hrs compute)
  • new gridding will be 460x460 subgrids
  • each new subgrid will have ~175x175 tiles
  • each new tile will have 256x256 pixels
  • each new tile will have 0 pixel buffer

ref: old gridding scheme (for comparison)

  • 120 hours to retile each old subgrid (0 hrs wait + 120 hrs compute)
  • old gridding had 16x17 subgrids
  • each old subgrid had ~166x166 tiles
  • each old tile had 1600x1600 pixels
  • each old tile had 100 pixel buffer

scratch sheet for reference when setting up new griddings on different systems

general approach to estimate the maximum number of compute threads (i.e. CPU cores) available for mass parallelization

50 jobs available max in Stampede2's normal queue (wait time ~40 hours)
minus 2 jobs left open for the long queue (wait time ~7 seconds)
equals 48 jobs on the Stampede2 normal queue

64 maximum recommended cores per node on normal queue

69 maximum available nodes per job (topping out at 69 * 48 = 3312 nodes requested out of 3360 total normal-queue nodes -- we want fewer than the full count because a small number of nodes, ~10, will likely be down at any given time)

48 jobs * 64 cores per node * 69 nodes per job =
211,968 cores available, one for each separate computational subgrid
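
For reference, a minimal Python sketch of the arithmetic above; the constants are just the Stampede2 queue limits quoted in this issue, not values queried from the scheduler.

```python
# Back-of-the-envelope core count for mass parallelization on Stampede2.
normal_queue_jobs = 50      # max simultaneous jobs in the normal queue
reserved_long_jobs = 2      # jobs held back for the long queue
jobs = normal_queue_jobs - reserved_long_jobs        # 48

cores_per_node = 64         # max recommended cores per normal-queue node
nodes_per_job = 69          # 69 * 48 = 3312 of 3360 normal-queue nodes

total_cores = jobs * cores_per_node * nodes_per_job
print(total_cores)          # 211968, one core per computational subgrid
```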

Deriving computational grid from parallel threads available

sqrt(211968) ~= 460,
so the gridding will be 460x460,
because this is simple, close enough, keeps 460 * 460 = 211,600 subgrids under the 211,968 cores available, and won't add any significant amount of time to the overall compute
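
The same derivation in code (a sketch; `total_cores` is the 211,968 figure derived above):

```python
import math

total_cores = 211_968
grid_dim = math.isqrt(total_cores)        # 460
assert grid_dim ** 2 <= total_cores       # 460 * 460 = 211,600 subgrids fit
print(f"{grid_dim}x{grid_dim} subgrids")  # 460x460
```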

Deriving tiles per compute grid

note: the rest of these estimates are derived from previous computational results and assume minimal overhead and linear scaling between computational runs

If the new total computational envelope of Texas is similar to the previous envelope, then adjusting for the smaller width x height of each new tile gives an estimated ~175x175 tiles within each subgrid.
(The new envelope will be slightly larger because it will rely on HUC8s instead of HUC12s.)

Estimating compute time under new computational grid

The previous gridding took 120 hours to retile ~71 billion pixels within each subgrid; the new gridding is estimated to take ~4 hours to retile ~2 billion pixels within each new subgrid.
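
A quick sketch of that linear-scaling estimate, assuming (as noted above) minimal overhead and runtime proportional to the pixel count per subgrid:

```python
# Pixels retiled per subgrid under each scheme.
old_pixels = 166**2 * 1600**2   # ~71 billion (old: ~166x166 tiles of 1600x1600 px)
new_pixels = 175**2 * 256**2    # ~2 billion  (new: ~175x175 tiles of 256x256 px)

old_hours = 120
new_hours = old_hours * new_pixels / old_pixels
print(round(new_hours, 1))      # ~3.4 hours per subgrid, rounded up to ~4 hours
```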


dhardestylewis commented Jul 27, 2022

Memory should be less of an issue this round because we are running roughly half as many subgrids per node in parallel as in the previous run. The previous run lost 7 of 272 subgrids to memory errors.

dhardestylewis commented:

estimated to cost 14,000 SUs
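
One way to reproduce roughly that figure (a sketch, assuming SUs are charged per node-hour on the normal queue and reusing the job, node, and time estimates above):

```python
jobs = 48              # normal-queue jobs, from the estimate above
nodes_per_job = 69
compute_hours = 4      # estimated retiling time per subgrid

node_hours = jobs * nodes_per_job * compute_hours   # 13,248
print(node_hours)      # ~13,000 node-hours, i.e. roughly 14,000 SUs with some margin
```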

dhardestylewis commented:

these are just the data tiles, not viz tiles

dhardestylewis commented:

align tiles with quarter quads

dhardestylewis commented:

at 1m gridding, by request from @TNRIS

following the previous estimate above and adjusting from the .25m basemap to a 1m basemap (see the sketch after the list below)

1m basemap gridding scheme

  • new gridding will be 460x460 subgrids
  • each new subgrid will have ~44x44 tiles
  • each new tile will have 256x256 pixels
  • each new tile will have 0 pixel buffer
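
A sketch of that adjustment: moving from a .25m to a 1m basemap coarsens pixels by 4x per axis, so the tile count per subgrid axis shrinks by the same factor while the tile size and subgrid count stay fixed.

```python
tiles_per_axis_quarter_m = 175    # ~175x175 tiles per subgrid at .25m
resolution_ratio = 1.0 / 0.25     # 4x coarser pixels at 1m

tiles_per_axis_1m = tiles_per_axis_quarter_m / resolution_ratio
print(round(tiles_per_axis_1m))   # ~44 -> ~44x44 tiles per subgrid at 1m
```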
