
Local scheduler #36

Open
bschroeter opened this issue Nov 19, 2024 · 2 comments

Comments

@bschroeter
Collaborator

Discussions around portability of this software suggest that a local execution strategy would be useful.

This issue will serve as a place to collect thoughts and requirements to formalise the approach.

@bschroeter
Collaborator Author

There are a few ways to approach this, some more complicated than others.

The simplest approach would be a client that performs a `subprocess.run` of a bash (?) command against a script, with additional variables passed in via the subprocess environment. Execution would be effectively immediate, as there is no queue to wait in.

The client could then return a unique identifier for the completed process, which could be used for downstream dependency handling via an internal lookup dict of completed processes (in order to check return codes).

This would be a largely serial operation, but would have some dependency handling built in.
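A minimal sketch of what that blocking client might look like (the class and method names here are illustrative, not an existing API):

```python
import os
import subprocess
import uuid


class LocalClient:
    """Illustrative blocking local-execution client.

    Each submitted script runs immediately via subprocess.run; its return
    code is recorded under a generated job id so that downstream tasks can
    check whether their dependencies succeeded.
    """

    def __init__(self):
        self.completed = {}  # job id -> return code

    def submit(self, script, env=None):
        """Run `script` with bash, passing extra variables via the environment."""
        merged_env = {**os.environ, **(env or {})}
        result = subprocess.run(["bash", script], env=merged_env)
        job_id = str(uuid.uuid4())
        self.completed[job_id] = result.returncode
        return job_id

    def succeeded(self, job_id):
        """Dependency check: did the given job finish with return code 0?"""
        return self.completed.get(job_id) == 0
```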

Another approach is to look into non-blocking subprocess methods (e.g. https://stackoverflow.com/questions/16071866/non-blocking-subprocess-call), which would allow tracking of PIDs to maintain dependencies.
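Sketching the non-blocking variant, assuming `subprocess.Popen` as the mechanism (again, the class here is hypothetical):

```python
import subprocess


class NonBlockingClient:
    """Illustrative non-blocking client built on subprocess.Popen.

    submit() returns immediately; the OS PID serves as the job handle,
    and dependents can poll for completion before starting.
    """

    def __init__(self):
        self.running = {}  # pid -> Popen handle

    def submit(self, args):
        """Launch a process without waiting for it to finish."""
        proc = subprocess.Popen(args)
        self.running[proc.pid] = proc
        return proc.pid

    def is_done(self, pid):
        """Poll without blocking; True once the process has exited."""
        return self.running[pid].poll() is not None

    def wait_all(self):
        """Block until every tracked process finishes; map pid -> return code."""
        return {pid: proc.wait() for pid, proc in self.running.items()}
```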

A third approach could be to use Dask to assemble a delayed architecture as a local scheduler:
https://docs.dask.org/en/stable/delayed.html

This would require some kind of final call to the scheduler to trigger computation inside the client object, and would add Dask as a project dependency.
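For reference, a sketch of the delayed pattern (assuming Dask is installed; the task names are made up). Wrapping functions with `dask.delayed` builds a task graph instead of executing, and a single final `.compute()` triggers execution with dependencies respected:

```python
# Sketch only: requires `pip install dask`.
import dask


@dask.delayed
def run_task(name, upstream=None):
    # Passing an upstream result creates a graph edge, so this task
    # will only run after its dependency has completed.
    return f"{name} done"


a = run_task("preprocess")
b = run_task("model", upstream=a)
c = run_task("postprocess", upstream=b)

# Nothing has executed yet; this one call runs the whole graph.
result = c.compute()
```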

Lastly, the graph approach that I demonstrated might be useful here as well.

I am partial to the first option, as it is the easiest to implement in a short timeframe.

@ccarouge, do you have any opinions here?

@ccarouge
Member

A serial operation would make this really slow when applied to benchcab, for example. In the current benchcab tests, we can get the results of the fluxsite tests in 10 min using 48 cores. Going serial is obviously going to make this a lot more painful.

That said, speed is probably not the priority now, as a full benchcab suite should only need to run a few times at the end of development. People can reduce the number of runs to get a quick turnaround during development.
