Codebase for the ACL 2023 paper "Efficient Shapley Values Estimation by Amortization for Text Classification"
Author Team:
-
Chenghao Yang (yangalan1996@gmail.com) (University of Chicago)
-
Fan Yin (fanyin20@cs.ucla.edu) (University of California, Los Angeles)
Supervisor Team:
- He He (New York University)
- Kai-Wei Chang (University of California, Los Angeles)
Industrial Support:
Xiaofei Ma, Bing Xiang (AWS AI Labs)
If you use this code as part of any published research, please acknowledge the following paper (it encourages researchers who publish their code!):
@inproceedings{yang-2023-amortized,
title = "Efficient Shapley Values Estimation by Amortization for Text Classification",
author = "Yang, Chenghao and Yin, Fan and He, He and Chang, Kai-Wei and Ma, Xiaofei and Xiang, Bing",
booktitle = "Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics",
year = "2023",
publisher = "Association for Computational Linguistics",
}
thermostat
: Configuration files we use to run Ranking Stability experiments. You should update it with original thermotet Repo to get most up-to-date implementation. Here we just upload our updated thermostat repo.InterpCalib
: Calibration via Interpretation (ACL'22) We update the implementation contributed by Xi Ye to 1) allow using different random seeds when computing Shapley Values; 2) updated the I/O interface to use our own outputs to do calibration. We here upload our updated InterpCalib repo.- Other parts are just our own codes. Chenghao will do better packaging later to improve readability. But importantly,
amortized_model.py
is the main Amortized Model code and you can run viarun.py
.
For the main repo:
git clone https://github.com/yangalan123/Amortized-Interpretability.git
cd Amortized-Interpretability
git submodule update --init --recursive
conda create -p ./env python=3.9 # Python 3.8 should also work, but we use 3.9 here for better compatibility. This is subject to future updates.
conda activate ./env # the environment position is optional, you can choose whatever places you like to save dependencies. Here I choose ./env for example.
cd thermostat # we now have the local thermostat copy, so let's build it first
pip install -e . # please ignore the requirements.txt in thermostat/, as it is older and may result in unexpected errors
cd ..
pip install -r requirements.txt # build the main repo dependencies
For the dependency of thermostat
and InterpCalib
, please check their individual README.
- First we need to compute a bunch of Shapley Values with different random seeds. We use
thermostat
to do this. Please checkthermostat/README.md
for more details. We prepare a running scriptthermostat/run.sh
to assist you. For example, you can run
bash run.sh task=yelp_polarity model=bert explainer=svs-3600 seed=1 batch_size=1 device=0
We understand it might be computationally expensive to run all the seeds. So we provide pre-computed Shapley Values here.
You can download and unzip it under the thermostat
directory. The resulted directory structure should be thermostat/experiments/...
.
- Then you can compute Spearman's Ranking Correlation Coefficient (SRCC) between different Shapley Values.
Check out
internal_correlation.py
for more details. You need to update the directory of Shapley Values in the code. This file will automatically create Table 1-2 and Figure 2 in the paper.
- We mainly use the pre-computed
thermostat
Shapley Values to train our Amortized Model. You should follow the instructions inthermostat/README.md
to compute Shapley Values first. - We need to create the dataset for training and evaluation purposes. Please check out
create_dataset.py
for more details. You need to update the filepath there. - Then, you can run
run.py
to train the Amortized Model. You can check outrun.sh
for more details. The example running command would be:
CUDA_VISIBLE_DEVICES=${device} python run.py --seed ${seed} --lr ${lr} -e ${epoch} --train_bsz ${train_bsz} --explainer ${explainer} --topk 10 --task ${task} -tm ${target_model} --storage_root ${output_dir}
Note that running run.py
will automatically compute the performance numbers in Table 3 and you can find them in the output directory. To save computational resources, this code will first check the model is already trained and saved. If not, it will train the model.
Feel free to change the setting in config.py
for better training performance.
- You can use
compute_amortized_model_consistency.py
to compute the consistency between Amortized Model and Shapley Values. You need to update the filepath there. This file will automatically create Table 5 in the paper.