-
Notifications
You must be signed in to change notification settings - Fork 17
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Include simple .progress switch in future_lapply #111
Comments
To start out, you're not the first to ask for this, and you won't be the last. The problem is not the implementation - the problem is the API and the contract to the developer and end-user.
This is key. I 100% understand that a Another argument is the design contract of future.apply, which says it should work just like base-R apply functions - no more, no less. As soon we start adding additional features (other wishes are out there), it is no longer a one-to-one mapping, and all of a sudden we made it slightly more complicated to go from One could also argue that if progress reporting should be built in in future.apply, then it should also be built in into base-R apply functions. Along these lines, I think furrr implemented its own progress reporting before purrr had one. I could imagine that furrr will try to re-align its API with that of purrr at some point. OTH, this is not an easy move to make because less than a year ago, the roadmap was "furrr currently has its own progress bar through the usage of .progress = TRUE, but in the future this will be deprecated in favor of generic and robust progress updates through the progressr package." [1]. Davis touches on this problem at the end of [1] saying "progressr represents an exciting move towards a unified framework for progress notifications in R, but it is still early in its development cycle and needs more usage and feedback to settle on the best API. In the future, the plan is for furrr to become more tightly integrated with progressr so that this is much easier." [1] https://furrr.futureverse.org/articles/progress.html A better alternative is to provide a mechanism for "injecting" a progress reporter to any map-reduce function, where future.apply is completely unaware of the solution. With some syntactic sugar, it might end up looking like: y <- future_lapply(X, f) %with_progress% TRUE
y <- lapply(X, f) %with_progress% TRUE
y <- future_map(X, f) %with_progress% TRUE
y <- map(X, f) %with_progress% TRUE or y <- future_lapply(X, P(f))
y <- lapply(X, P(f))
y <- future_map(X, P(f))
y <- map(X, P(f)) or y <- future_lapply(P(X), f)
y <- lapply(P(X), f)
y <- future_map(P(X), f)
y <- map(P(X), f) I've played around with variants of this over the years, but yet have to find a satisfying solution that is not "hackish". Also, this is something that can be solved by the R community and not just me. Gabor has done some work along these lines, cf. https://cli.r-lib.org/articles/progress.html#progress-bars-for-mapping-functions-cli_progress_along. All that said, there is nothing preventing someone else from building an API on top of the existing future-based map-reduce solutions, just like you did. We have seen this happening before outside Futureverse, e.g. pbmcapply and pbapply. So, sorry for being that conservative.
As you know, I say the user should be in full control of the progress reporting. So, they should use Progress reporting in R is still in its infancy. There are so many more things that need to be figured out. Nested progress reporting is one, especially since more and more functions start reporting on progress. What should happen when such functions call each other? Maybe there will grow out standard for standardized, generic hook functions that progress frameworks can hook into? I don't know the answer to this, but I'm trying to keep as many doors as possible open for when that day comes. FWIW, I'm also learning about best design pattern the more I work with progress reporting myself. For example, instead of: FUN_progress <- function(X, ...) {
res <- FUN(X, ...)
p()
res
} I tend to like: FUN_progress <- function(X, ...) {
on.exit(p())
FUN(X, ...)
} a bit more. I also haven't decided if progress should be reported when an iteration is starting or finishing (as above), or both. Maybe different scenarios require different progress reporting. |
Thank you for explaining you thought process on this topic. It seems to be a more complicated issue than I initially thought. Maybe even setting an environment variable to subscribe to signalling progress would be possible, e.g. |
Both purrr and furrr's
map*
functions have a.progress
switch to enable a progress bar. It would be very convenient to also have it available, e.g. in future_lapply.Here is a quick implementation of a
.progress
switch which uses a CLI-based progress bar, that is also in use in purrr and that also seems to be favoured by furrr:I know about futureverses "back-end/ront-end" design philisophy to separate progress signaling from displaying progress. However, if one quickly wants to parallelize some lapply-type code in RStudio, having a simple .progress switch is more convenient than having to set up the progressor, including it in the function to be parallelized, wrapping it in a
with_progress()
and remembering/researching how these three pieces fit together. Also, this implementation would possibly serve as a template to update the furrr's.progress
implementation to using the CLI package.The text was updated successfully, but these errors were encountered: