# Copyright (c) Meta Platforms, Inc. and affiliates.
#
# This source code is licensed under the MIT license found in the
# LICENSE file in the root directory of this source tree.

# implements a function that takes a sequence of returns and multiply its by the policy log_prob to get a differentiable objective