rewards always zero

Hi @kaymen99, thanks for your code. Is there a bug in the `her_augmentation` function in `HER.py`? The re-computed reward is always zero, since `future_achgoal` is passed as both the achieved goal and the desired goal:

```
reward = agent.env.compute_reward(future_achgoal, future_achgoal, 1.0)
```
Also, why is the future observation used as the augmented observation? Shouldn't the augmented transition keep the observation from the current timestep, i.e. `obs, _, _ = obs_array[index].values()`?
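
For comparison, here is a minimal sketch of the "future" relabeling strategy as HER is usually described: the transition from the current timestep is kept, only the goal is replaced by an achieved goal from a later step of the same episode, and the reward is re-computed from the current transition's achieved goal against that new goal. Everything here is an assumption for illustration and not the code in `HER.py`: the `compute_reward` stand-in (a sparse 0 / -1 reward with a distance threshold), the `her_relabel` helper, and the episode field names are all hypothetical.

```python
import numpy as np


def compute_reward(achieved_goal, desired_goal, info, threshold=0.05):
    # Hypothetical stand-in for a sparse goal-conditioned reward:
    # 0.0 when the achieved goal is within `threshold` of the desired goal, else -1.0.
    diff = np.asarray(achieved_goal) - np.asarray(desired_goal)
    distance = np.linalg.norm(diff, axis=-1)
    return np.where(distance > threshold, -1.0, 0.0)


def her_relabel(episode, k=4, rng=None):
    # Relabel each transition k times with goals achieved later in the same episode.
    rng = np.random.default_rng() if rng is None else rng
    relabeled = []
    for t, step in enumerate(episode):
        for _ in range(k):
            # Sample a step index from the current step to the end of the episode.
            future = int(rng.integers(t, len(episode)))
            new_goal = episode[future]["next_achieved_goal"]
            # Compare what THIS transition achieved with the substituted goal.
            # Passing new_goal as both arguments would always give 0.0.
            reward = float(compute_reward(step["next_achieved_goal"], new_goal, None))
            relabeled.append({
                "obs": step["obs"],            # observation of the current timestep
                "action": step["action"],
                "reward": reward,
                "next_obs": step["next_obs"],
                "goal": new_goal,              # only the goal is replaced
            })
    return relabeled


# Toy episode: a point moving 0.1 per step along a line; the achieved goal is its position.
positions = np.array([[0.0], [0.1], [0.2], [0.3]])
episode = [
    {
        "obs": positions[t],
        "action": np.array([0.1]),
        "next_obs": positions[t + 1],
        "next_achieved_goal": positions[t + 1],
    }
    for t in range(len(positions) - 1)
]
transitions = her_relabel(episode, k=4, rng=np.random.default_rng(0))
```

With this layout the relabeled reward is 0 only when the sampled future goal coincides with what the current transition actually achieved, and -1 otherwise, so the augmented data contains both outcomes.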
