deep-thinking

Iterative Forward Tuning Boosts In-context Learning in Language Models

Links 🤔

Paper
- ArXiv
Code
- GitHub
Demos
- HuggingFace Gradio Space
- ModelScope / 魔搭

Overview 🤔

The single-turn ICL of LLM is incoordinate with the decision making process of humans by learning from analogy. In this work, we propose an effective and efficient two-stage framework to boost ICL. We divide the ICL process into Deep-Thinking and inference stages.

The Deep-Thinking stage performs iterative forward optimization of demonstrations, which is expected to boost the reasoning abilities of LLMs at test time. It produces accumulated meta-gradients by manipulating the Key-Value matrices in the self-attention modules of the Transformer.

Then, the inference stage only takes the test query as input without concatenating demonstrations and applies the learned meta-gradients through attention for output prediction. In this way, demonstrations are not required during the inference stage since they are already learned and stored in the definitive meta-gradients.

LLMs can be effectively and efficiently adapted to downstream tasks. Extensive experiments on ten classification and multiple-choice datasets show that our method achieves substantially better performance than standard ICL in terms of both accuracy and efficiency.

Requirements 🤔

All experiments are conducted on a single NVIDIA A100 (80G) GPU.

If you want 8-bit inference disabled

torch                  1.10.1+cu111
transformers           4.26.1
datasets

If you want 8-bit inference enabled

accelerate             0.18.0
bitsandbytes           0.38.1
torch                  1.10.1+cu111
transformers           4.26.1
datasets

Note: The versions of the Python packages listed above are not mandatory. We provide specific versions because our experiments are conducted on those versions. Additionally, you may encounter version conflicts or others issues when running experiments (especially for packages transformers and bitsandbytes), which may require manual debugging to find the most suitable version.

Get started 🤔

If you already have HuggingFace cached models, head to anchor.py and modify checkpoints_root.

The main file: task_logprob_main.py.

We provide two ways to run experiments:

Run in debug model

Edit task_logprob_main.py, set DEBUG = True

# ...
if __name__ == "__main__":
    DEBUG = True # <---
    # ...

Modify whatever you want, and type python task_logprob_main.py to run.

Run all experiments in one command-line.

# There are 4 options for each sh file: SEED, NUM_K, IN_8BIT, GPU_IDX
# And we set SEED=0, NUM_K=1, IN_8BIT=true

# To disable 8bit inference, set `IN_8BIT` to `false`.

# run_main_{name}.sh  SEED  NUM_K  IN_8BIT  GPU_IDX
bash scripts/run_main_agnews.sh 0 1 true 0
bash scripts/run_main_sst2.sh 0 1 true 0
bash scripts/run_main_trec.sh 0 1 true 0
bash scripts/run_main_qasc.sh 0 1 true 0
# ...
# See ./scripts for more datasets.

We need your help! 🤔

There are still many questions and new discoveries worth exploring. We invite interested readers to join us in uncovering the potential of this method.

Can the momentum-based meta-optimizer be replaced by more advanced methods?
Are there better estimation methods for pseudo-gradients?
How can we determine forward tuning steps earlier?
In which other tasks can this iterative method be applied?
Can this iterative method be used to overcome persistent difficulties in other areas?

Citation 🤔

If you finding our work interesting or helpful to you, please cite this repo 😘.

@article{yang2023iterative,
  title={Iterative Forward Tuning Boosts In-context Learning in Language Models},
  author={Yang, Jiaxi and Hui, Binyuan and Yang, Min and Li, Binhua and Huang, Fei and Li, Yongbin},
  journal={arXiv preprint arXiv:2305.13016},
  year={2023}
}

Name		Name	Last commit message	Last commit date
parent directory ..
images		images
models		models
scripts		scripts
tasks		tasks
utils		utils
README.md		README.md
anchor.py		anchor.py
common.py		common.py
task_logprob_main.py		task_logprob_main.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

deep-thinking

deep-thinking

README.md

Iterative Forward Tuning Boosts In-context Learning in Language Models

Links 🤔

Overview 🤔

Requirements 🤔

Get started 🤔

We need your help! 🤔

Citation 🤔

Files

deep-thinking

Directory actions

More options

Directory actions

More options

Latest commit

History

deep-thinking

Folders and files

parent directory

README.md

Iterative Forward Tuning Boosts In-context Learning in Language Models

Links 🤔

Overview 🤔

Requirements 🤔

Get started 🤔

We need your help! 🤔

Citation 🤔