self-improving-RL/experiments at main · resuldagdanov/self-improving-RL · GitHub

Name		Name	Last commit message	Last commit date
parent directory ..
agents		agents
algorithms		algorithms
analyses		analyses
configs		configs
evaluation		evaluation
models		models
results		results
training		training
utils		utils
README.md		README.md

README.md

Experiments

Please follow the readme instructions to construct experiments.

An illustration of continuous RL training pipeline for safety-critical black-box systems.

Conduct Experiments

Detailed instructions on how to conduct experiments can be found in the readme. The following is a brief summary of the steps.

Run Black-Box Verification Algorithms

Grid-Search Validation

cd ./algorithms
python grid_search.py

Monte-Carlo-Search Validation

cd ./algorithms
python monte_carlo_search.py

Cross-Entropy-Search Validation

cd ./algorithms
python ce_search.py

Bayesian-Optimization-Search Validation

cd ./algorithms
python bayesian_search.py

Adaptive-Multilevel-Splitting-Search Validation

cd ./algorithms
python ams_search.py

Self-Improvement of Black-Box System

Train RL on Custom Verification Scenarios

cd ./training
python self_improvement.py

Analysis of Results

Open Jupyter Notebook

cd ./analyses
python3 -m notebook