Activity
Fixed critic pointer output device inconsistency
Fixed critic pointer output device inconsistency
Legacy organized (forward and local_forward)
Legacy organized (forward and local_forward)
Local forward vectorized (the legacy remains as og-s)
Local forward vectorized (the legacy remains as og-s)
context vector mask fixed; working on padding masks over samples in a…
context vector mask fixed; working on padding masks over samples in a…
env settings and hyperparam settings
env settings and hyperparam settings
supports state-action history and dynamic seed display
supports state-action history and dynamic seed display
Data of the new model collected
Data of the new model collected
train code for a new model; data-collect/view updated accordingly
train code for a new model; data-collect/view updated accordingly
Checkpoint supported (date automation not supported yet)
Checkpoint supported (date automation not supported yet)
Data uploaded, collect/view updated
Data uploaded, collect/view updated
bug fix: mask reverted; self-loop kept
bug fix: mask reverted; self-loop kept
critical bug fixed: att_score dtype and context_token value
critical bug fixed: att_score dtype and context_token value
Avoids all-zero errors from env_check in RLlib
Avoids all-zero errors from env_check in RLlib
Add clarity in the RLlib level model (model_rllib.py)
Add clarity in the RLlib level model (model_rllib.py)