LLM Robustness Against Misinformation in Biomedical Question Answering

Code and resources that implement experiments for the paper LLM Robustness Against Misinformation in Biomedical Question Answering.

Installation

Install Python 3.12

Create and activate a virtual environment:

python -m venv .venv
source .venv/bin/activate

Install dependencies:

pip install --upgrade pip
pip install -e .

Data

The directory data contains three subdirectories:

input that contains two files (binary and rest, i.e. free-form, questions).
adversarial that contains the files with generated adversarial (wrong) answers only for free-form questions.
results that contains final files

Each JSON file in the subdirectories is zipped and needs to be unzipped.

Code

To replicate the results, you can directly use the notebooks that process the files in the results directory.

To repeat all the steps, use the scripts:

For binary questions: TODO
For free-form questions, there are generally three types of scripts:
- main_<model_name>_<model_name>_rest.py (model_name: mixtral, llama, gemma, or gpt-4o): takes the input data, and generates adversarial contexts and answers by the same model (adversarial model == target model).
- main_<model_name>_model_rest.py will use the resulting file produced in the previous step and will generate the answers by the target models (target model != adversarial model and != gpt-4o).
- main_<model_name>_gpt4o_rest.py will use gpt4o as a target model (additionally saves the logprobs from gpt-4o).

Citation

@misc{bondarenko:2024,
      title={LLM Robustness Against Misinformation in Biomedical Question Answering}, 
      author={Alexander Bondarenko and Adrian Viehweger},
      year={2024},
      eprint={2410.21330},
      archivePrefix={arXiv},
      primaryClass={cs.CL},
      url={https://arxiv.org/abs/2410.21330}, 
}

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
data		data
src/llm_robustness		src/llm_robustness
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

LLM Robustness Against Misinformation in Biomedical Question Answering

Installation

Data

Code

Citation

About

Releases

Packages

Languages

License

alebondarenko/llm-robustness

Folders and files

Latest commit

History

Repository files navigation

LLM Robustness Against Misinformation in Biomedical Question Answering

Installation

Data

Code

Citation

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages