New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

[RLlib] Docs do-over (new API stack): New `MetricsLogger` API rst page. #49538

Open

sven1977 wants to merge 23 commits into ray-project:master from sven1977:docs_redo_metrics_logger

+471 −7

Contributor

sven1977 commented Jan 2, 2025 •

edited

Loading

Docs do-over (new API stack): New MetricsLogger API rst page.

Why are these changes needed?

Related issue number

Checks

I've signed off every commit(by using the -s flag, i.e., git commit -s) in this PR.
I've run scripts/format.sh to lint the changes in this PR.
I've included any doc changes needed for https://docs.ray.io/en/master/.
- I've added any new APIs to the API Reference. For example, if I added a
  method in Tune, I've added it in doc/source/tune/api/ under the
  corresponding .rst file.
I've made sure the tests are passing. Note that there might be a few flaky tests, see the recent failures at https://flakey-tests.ray.io/
Testing Strategy
- Unit tests
- Release tests
- This PR is not tested :(

sven1977 added 5 commits

December 30, 2024 23:30

wip

a41d642

Signed-off-by: sven1977 <[email protected]>

wip

f89b5eb

Signed-off-by: sven1977 <[email protected]>

wip

5598bba

Signed-off-by: sven1977 <[email protected]>


          Merge branch 'cleanup_examples_folder_42_custom_vpg_learner' into doc…

03936b6

…s_redo_metrics_logger


          Merge branch 'master' of https://github.com/ray-project/ray into docs…

97469b8

…_redo_metrics_logger

sven1977 requested review from maxpumperla, simonsays1980 and a team as code owners

January 2, 2025 15:54

sven1977 changed the title ~~[RLlib] Docs do-over (new API stack): New MetricsLogger API rst page.~~ [WIP; RLlib] Docs do-over (new API stack): New MetricsLogger API rst page.

sven1977 added 7 commits

January 2, 2025 21:54

wip

3ae86b7

Signed-off-by: sven1977 <[email protected]>

wip

a7551c1

Signed-off-by: sven1977 <[email protected]>


          Merge branch 'master' of https://github.com/ray-project/ray into docs…

f4cd6c1

…_redo_metrics_logger

Signed-off-by: sven1977 <[email protected]>

# Conflicts:
#	rllib/BUILD

wip

d332a30

Signed-off-by: sven1977 <[email protected]>

wip

d5bc917

Signed-off-by: sven1977 <[email protected]>


          Merge branch 'master' of https://github.com/ray-project/ray into docs…

4ec9c7f

…_redo_metrics_logger

fix

0c15dcb

Signed-off-by: sven1977 <[email protected]>

sven1977 changed the title ~~[WIP; RLlib] Docs do-over (new API stack): New MetricsLogger API rst page.~~ [RLlib] Docs do-over (new API stack): New MetricsLogger API rst page.

sven1977 added rllib rllib-docs-or-examples rllib-newstack labels

sven1977 assigned simonsays1980 and angelinalg

sven1977 added 2 commits

January 21, 2025 10:14

fix

8bf0829

Signed-off-by: sven1977 <[email protected]>

wip

6a4f0c5

Signed-off-by: sven1977 <[email protected]>

simonsays1980 approved these changes

View reviewed changes

Collaborator

simonsays1980 left a comment

LGTM. Just some nits.

doc/source/rllib/metrics-logger.rst Outdated

+              .. note::
+                  So far, RLlib components owning a :py:class:`~ray.rllib.utils.metrics_logger.MetricsLogger`
+                  instance are :py:class:`~ray.rllib.algorithms.algorithm.Algorithm`, :py:class:`~ray.rllib.env.env_runner.EnvRunner`,
+                  :py:class:`~ray.rllib.core.learner.learner.Learner`, and all :py:class:`~ray.rllib.connectors.connector_v2.ConnectorV2` classes.

Collaborator

simonsays1980 Jan 21, 2025

EpisodeReplayBuffers

Contributor Author

sven1977 Jan 21, 2025

fixed

doc/source/rllib/metrics-logger.rst Outdated

+              - Users can log scalar values over time, such as losses or rewards.
+              - Thereby, users can configure different reduction types, in particular ``mean``, ``min``, ``max``, ``sum``, or ``None``.
+              - Users can also specify sliding windows, over which these reductions take place, for example ``window=100`` to average over the

Collaborator

simonsays1980 Jan 21, 2025

Maybe say something about clear_on_reduce and how aggregates are held in the history instead of accumulating large lists.

Contributor Author

sven1977 Jan 21, 2025

done

doc/source/rllib/metrics-logger.rst

		which doesn't alter any underlying values.


		Instead of providing a flat key, you can also log a value under some nested key through passing in a tuple:

Collaborator

simonsays1980 Jan 21, 2025

Very nice!

doc/source/rllib/metrics-logger.rst

+              ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
+              In case you are logging a nested structure of values, for example
+              ``{"time_s": 0.1, "lives": 5, "rounds_played": {"player1": 10, "player2": 4}}`` and all values have the exact same log settings

Collaborator

simonsays1980 Jan 21, 2025

A dict that is for any reason empty, will be reduced away, which is for collecting result dicts for a table is challenging.

Contributor Author

sven1977 Jan 21, 2025

Ok, let me try this right now and fix, if possible ...

Contributor Author

sven1977 Jan 21, 2025

Yeah, an empty dict is not logged at all. Meaning ...

metrics.log_dict({}, key="abc")

... results in metrics not undergoing any changes, which I think makes a lot of sense. There are simply no keys to be logged and the underlying tree.map_structure doesn't have any nested structure to loop through.

doc/source/rllib/metrics-logger.rst Outdated




		Timing things

Collaborator

simonsays1980 Jan 21, 2025

"Timing processes"?

Contributor Author

sven1977 Jan 21, 2025

Fine, renamed into Timing code blocks. :D

doc/source/rllib/metrics-logger.rst Outdated

+                  # EMA should be ~1.1sec.
+                  assert 1.15 > logger.peek("my_block_to_be_timed") > 1.05
+              Also note that the default logging behavior is through exponential mean averaging (EMA), with a default coefficient of 0.01.

Collaborator

simonsays1980 Jan 21, 2025

Maybe describing how to use a simple mean instead?

Contributor Author

sven1977 Jan 21, 2025

Actually, we don't support this yet :D
You can do a simple mean over a window, yes, but not over all lifetime, b/c we don't have a proper mechanism yet to keep the list of values small. You could do logger.log_value([key], [value], reduce="mean", window=float("inf")), but that would cause a mem leak at some point. We need to fix this by keeping track of the number of items logged thus far and then keeping the internal list smaller, but still providing the proper lifetime average. To be completely honest, not sure how useful a lifetime average would be (as opposed to an EMA, which makes more sense to report). But we should fix this either way ...

sven1977 added 5 commits

January 21, 2025 13:23


          fixes

Signed-off-by: sven1977 <[email protected]>


          fixes

da202b8

Signed-off-by: sven1977 <[email protected]>


          fixes

b9b49fa

Signed-off-by: sven1977 <[email protected]>


          fixes

818164b

Signed-off-by: sven1977 <[email protected]>


          Merge branch 'master' of https://github.com/ray-project/ray into docs…

71e979b

…_redo_metrics_logger

sven1977 added 4 commits

January 21, 2025 14:54

wip

abcef21

Signed-off-by: sven1977 <[email protected]>


          Merge branch 'master' of https://github.com/ray-project/ray into docs…

4b5e38e

…_redo_metrics_logger

Signed-off-by: sven1977 <[email protected]>

# Conflicts:
#	doc/source/rllib/package_ref/algorithm.rst


          merge

58dde5d

Signed-off-by: sven1977 <[email protected]>

wip

a376e99

Signed-off-by: sven1977 <[email protected]>

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

rllib rllib-docs-or-examples rllib-newstack