
[RLlib; docs] Docs do-over (new API stack): Rewrite/enhance "getting started" rst page. #49950

Open

wants to merge 28 commits into base: master

Conversation

@sven1977 (Contributor) commented on Jan 18, 2025

Docs do-over (new API stack): Rewrite/enhance "getting started" rst page.

  • Rename file from rllib-training.html to getting-started.html.
  • Translate everything to the new API stack and simplify a little.
  • Vale cleanup.
  • Move example code into ``.. testcode::`` blocks.

Why are these changes needed?

Related issue number

Checks

  • I've signed off every commit (by using the -s flag, i.e., git commit -s) in this PR.
  • I've run scripts/format.sh to lint the changes in this PR.
  • I've included any doc changes needed for https://docs.ray.io/en/master/.
    • I've added any new APIs to the API Reference. For example, if I added a
      method in Tune, I've added it in doc/source/tune/api/ under the
      corresponding .rst file.
  • I've made sure the tests are passing. Note that there might be a few flaky tests; see the recent failures at https://flakey-tests.ray.io/
  • Testing Strategy
    • Unit tests
    • Release tests
    • This PR is not tested :(

@sven1977 added labels on Jan 18, 2025: rllib (RLlib related issues), rllib-docs-or-examples (issues related to RLlib documentation or rllib/examples), rllib-newstack, rllib-oldstack-cleanup (issues related to cleaning up classes and utilities on the old API stack)
@simonsays1980 (Collaborator) left a comment

LGTM. Some nits here and there. Great introduction to RLlib for users.

In this tutorial, you learn how to design, customize, and run an end-to-end RLlib learning experiment
from scratch. This includes picking and configuring an :py:class:`~ray.rllib.algorithms.algorithm.Algorithm`,
running a couple of training iterations, saving the state of your
:py:class:`~ray.rllib.algorithms.algorithm.Algorithm` from time to time, running a separate
Collaborator:

Awesome! This is what most people are looking for.

Python API
~~~~~~~~~~

RLlib's Python API provides all the flexibility required for applying the library to any
Collaborator:

Do we have any other API than the Python one?

Contributor (Author):

Nope :D We got rid of the CLI, b/c of the maintenance burden, its stark limitations, and it being more or less a duplicate of a subset of what the Python API could do.

Contributor (Author):

Well, we are working on the external access protocol for clients to connect to and communicate with RLlib, but that's heavily WIP.
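For context on the Python-API discussion above, a minimal sketch of driving RLlib entirely from Python (the environment name and hyperparameters here are placeholder assumptions, not text from the page under review):

from ray.rllib.algorithms.ppo import PPOConfig

# Build a config, turn it into an Algorithm, train once, then shut down.
config = (
    PPOConfig()
    .environment("CartPole-v1")
    .training(lr=0.0003)
)
algo = config.build_algo()
print(algo.train())
algo.stop()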

)


To scale your setup and define how many EnvRunner actors you want to leverage,
Collaborator:

Shall we put all class names into ``?

Collaborator:

Also, we might want to add that these EnvRunners are used to roll out the policy and collect samples?

Contributor (Author):

done
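For reference, scaling the number of EnvRunner actors that roll out the policy and collect samples is a single config call (a sketch; the environment and the count of 4 are arbitrary assumptions):

from ray.rllib.algorithms.ppo import PPOConfig

config = (
    PPOConfig()
    .environment("CartPole-v1")
    # Use 4 remote EnvRunner actors to roll out the policy and collect samples in parallel.
    .env_runners(num_env_runners=4)
)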

.. testcode::

    # Build the Algorithm (PPO).
    ppo = config.build_algo()
Collaborator:

Does build still work?

Contributor (Author):

Yup, but you get a warning.
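In code terms (a small sketch):

algo = config.build_algo()  # preferred on the new API stack
# algo = config.build()     # still works, but logs a deprecation warning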

from pprint import pprint

for _ in range(5):
    pprint(ppo.train())
Collaborator:

Nice!

# Define your custom env class by subclassing gymnasium.Env:

class ParrotEnv(gym.Env):
    """Environment in which the agent learns to repeat the seen observations.
Collaborator:

Haha! Awesome!
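For readers following along, a self-contained sketch of what such a "parrot" environment can look like (the spaces, episode length, and reward shown here are illustrative assumptions, not necessarily what the final doc page uses):

import gymnasium as gym
import numpy as np


class ParrotEnv(gym.Env):
    """The agent gets rewarded for repeating the observation it just saw."""

    def __init__(self, config=None):
        self.observation_space = gym.spaces.Box(-1.0, 1.0, (1,), np.float32)
        self.action_space = self.observation_space
        self._cur_obs = None
        self._num_steps = 0

    def reset(self, *, seed=None, options=None):
        super().reset(seed=seed)
        self._num_steps = 0
        self._cur_obs = self.observation_space.sample()
        return self._cur_obs, {}

    def step(self, action):
        # Reward: negative distance between the action and the observation to repeat.
        reward = -float(np.abs(action - self._cur_obs).sum())
        self._num_steps += 1
        terminated = False
        truncated = self._num_steps >= 10
        self._cur_obs = self.observation_space.sample()
        return self._cur_obs, reward, terminated, truncated, {}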

# Point your config to your custom env class:
config = (
    PPOConfig()
    .environment(ParrotEnv)  # add `env_config=[some Box space] to customize the env
Collaborator:

Maybe a missing " ` "?

Contributor (Author):

done and clarified more. Also fixed the env to accept this suggested setting.
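A hedged sketch of the clarified usage (the `env_config` dict key "obs_space" is a made-up example; the actual doc page may pass the custom space differently):

import gymnasium as gym
import numpy as np
from ray.rllib.algorithms.ppo import PPOConfig

# ParrotEnv as sketched above.
config = (
    PPOConfig()
    .environment(
        ParrotEnv,
        # Hypothetical config key: pass a custom Box space through `env_config`.
        env_config={"obs_space": gym.spaces.Box(-2.0, 2.0, (1,), np.float32)},
    )
)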

class CustomTorchRLModule(TorchRLModule):
    def setup(self):
        # You have access here to the following already set attributes:
        # self.observation_space
Collaborator:

Great description!!
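For orientation, a minimal sketch of such a custom module on the new API stack (the network layout, the Discrete-action assumption, and overriding only `_forward` are illustrative; consult the RLModule docs for the exact hooks a given algorithm needs):

import torch
from ray.rllib.core.columns import Columns
from ray.rllib.core.rl_module.torch import TorchRLModule


class CustomTorchRLModule(TorchRLModule):
    def setup(self):
        # `self.observation_space`, `self.action_space`, and `self.model_config`
        # are already set when `setup()` runs.
        in_size = self.observation_space.shape[0]
        out_size = self.action_space.n  # assumes a Discrete action space
        self._net = torch.nn.Sequential(
            torch.nn.Linear(in_size, 64),
            torch.nn.ReLU(),
            torch.nn.Linear(64, out_size),
        )

    def _forward(self, batch, **kwargs):
        # Return action-distribution inputs (logits) for the observations in the batch.
        return {Columns.ACTION_DIST_INPUTS: self._net(batch[Columns.OBS])}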


At the end of your script, RLlib evaluates the trained Algorithm:
algo.stop()
Collaborator:

Haha. Yes that is needed.

Collaborator:

We might, however, show it explicitly, as otherwise users might run into problems.

Collaborator:

... in their own code

Contributor (Author):

Great idea. Will add a one-liner for this API.

Contributor (Author):

done
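The one-liner mentioned above presumably looks something like this (a sketch; `evaluate()` assumes evaluation EnvRunners were configured via `config.evaluation(...)`):

# Run an explicit evaluation round, then release the Algorithm's resources.
eval_results = algo.evaluate()
print(eval_results)
algo.stop()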


The `state` of an instantiated Algorithm can be retrieved by calling its
`get_state` method. It contains all information necessary
to create the Algorithm from scratch. No access to the original code (e.g.
Collaborator:

Does this now also work with algorithms that define new attributes/methods? If the class is available, it should, IMO.

Contributor (Author):

Yeah, I think so. Users can decide to override the get_state/set_state APIs to add more stateful stuff to their state dicts, but the basic functionality (restoring EnvRunners, RLModule, Learner optimizer states, connector pipelines, etc.) works across all algos.
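Roughly, and hedged as a sketch of that state API rather than a verbatim doc excerpt:

# Capture the full state of a running Algorithm ...
state = algo.get_state()

# ... and restore it into a freshly built instance (the original class must be importable).
new_algo = config.build_algo()
new_algo.set_state(state)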

Labels: rllib, rllib-docs-or-examples, rllib-newstack, rllib-oldstack-cleanup

3 participants