- Singapore
- UTC +08:00
- https://scitator.com
- @Scitator
Stars
Official implementation of the paper "Linear Transformers with Learnable Kernel Functions are Better In-Context Models"
Official Implementation for "In-Context Reinforcement Learning from Noise Distillation"
JAX-accelerated Meta-Reinforcement Learning Environments Inspired by XLand and MiniGrid 🏎️
etna-team / etna
Forked from tinkoff-ai/etna
ETNA – Time-Series Library
corl-team / katakomba
Forked from tinkoff-ai/katakomba
Data-Driven NetHack Tools: Datasets (30+) and recurrent-baselines (AWAC, BC, CQL, IQL, REM)
corl-team / CORL
Forked from tinkoff-ai/CORL
High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC
SCOPE-RL: A Python library for offline reinforcement learning, off-policy evaluation, and selection
A Library for Advanced Deep Time Series Models.
Time-Series Work Summary in CS Top Conferences (NIPS, ICML, ICLR, KDD, AAAI, WWW, IJCAI, CIKM, ICDM, ICDE, etc.)
A professional list of Papers, Tutorials, and Surveys on AI for Time Series in top AI conferences and journals.
Lightweight, useful implementation of conformal prediction on real data.
A curated list of trustworthy deep learning papers. Daily updating...
Transfer Learning Library for Domain Adaptation, Task Adaptation, and Domain Generalization
NeurIPS 2023: Safety-Gymnasium: A Unified Safe Reinforcement Learning Benchmark
Data-Driven NetHack Tools: Datasets (30+) and recurrent-baselines (AWAC, BC, CQL, IQL, REM)
Author's implementation of ReBRAC, a minimalist improvement upon TD3+BC
Official implementation for "Let Offline RL Flow: Training Conservative Agents in the Latent Space of Normalizing Flows", NeurIPS 2022, Offline RL Workshop
Official implementation for "Q-Ensemble for Offline RL: Don't Scale the Ensemble, Scale the Batch Size", NeurIPS 2022, Offline RL Workshop
Transfer learning / domain adaptation / domain generalization / multi-task learning, etc.: papers, code, datasets, applications, tutorials.
A platform for Reasoning systems (Reinforcement Learning, Contextual Bandits, etc.)
Benchmarks for Out-of-Distribution Generalization in Time Series Tasks
High throughput synchronous and asynchronous reinforcement learning
DIAMBRA Arena: a New Reinforcement Learning Platform for Research and Experimentation
The Official Repository for "Generalized OOD Detection: A Survey"
Benchmarking Generalized Out-of-Distribution Detection
Code for the paper "PALBERT: Teaching ALBERT to Ponder", NeurIPS 2022 Spotlight
"Probabilistic Embeddings Revisited" paper official repository