Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Outdated code #21092

Open
lmntrx-sys opened this issue Mar 26, 2025 · 1 comment
Open

Outdated code #21092

lmntrx-sys opened this issue Mar 26, 2025 · 1 comment
Assignees

Comments

@lmntrx-sys
Copy link

lmntrx-sys commented Mar 26, 2025

Some of the lines of code in the actor-critic implementation in the reinforcement learning code examples has some bugs related to evolution if the libraries and some few tweaks needed.

Especially related to new versions of the environment being used. Issues arise during the fundamental actions being taken on the environment
On the first run one runs into this error

      `ValueError: Exception encountered when calling Functional.call().
      
      Invalid input shape for input [-0.04058227]. Expected shape (None, 4), but input has incompatible shape (1,)
      
      Arguments received by Functional.call():
        • inputs=tf.Tensor(shape=(1,), dtype=float32)
        • training=None
        • mask=None`

well there many other small tweaks needed to make it run on the first shot

here are some of the snippets of code

`state = env.reset()[0]`   

this specifically brings an issue with how python types are processed

`while True:  # Run until solved
    state = env.reset()[0]
    episode_reward = 0
    with tf.GradientTape() as tape:
        for timestep in range(1, max_steps_per_episode):

            state = ops.convert_to_tensor(state)
            state = ops.expand_dims(state, 0)` 

This piece as a whole brings an error where the state cannot be converted to a tensor.
These are just some of the issues and some few more

@lmntrx-sys
Copy link
Author

lmntrx-sys commented Mar 26, 2025

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

3 participants