CAME

CAME Optimizer - Pytorch

This repository provides a script and recipe to train the BERT model with our proposed CAME optimizer in:

CAME: Confidence-guided Adaptive Memory Efficient Optimization

This work has been accepted by ACL2023 main conference.

In this work, we studied a confidence-guided strategy to reduce the instability of existing memory efficient optimizers. Based on this strategy, we proposed CAME to simultaneously achieve two goals: fast convergence as in traditional adaptive methods, and low memory usage as in memory-efficient methods.

Training

The script including the setting of hyperparameters to pretrain BERT:

bash run_came_pretraining.sh

The startup file corresponding to the script:

startup_came.py

Pytorch implementation:

came.py: the Pytorch implementation of our proposed CAME optimizer.

Pretraining Results

Memory Usage Comparison

Usage

from came import CAME
optimizer = CAME(model.parameters(), lr=2e-4, weight_decay=1e-2, betas=(0.9, 0.999, 0.9999), eps=(1e-30, 1e-16))

Name		Name	Last commit message	Last commit date
parent directory ..
data		data
processors		processors
scripts		scripts
triton		triton
v1.1		v1.1
vocab		vocab
.DS_Store		.DS_Store
Dockerfile		Dockerfile
LICENSE		LICENSE
NOTICE		NOTICE
README.md		README.md
adafactor.py		adafactor.py
bert-large-uncased-vocab.txt		bert-large-uncased-vocab.txt
bert_config.json		bert_config.json
bert_large_config.json		bert_large_config.json
bert_pretrain.png		bert_pretrain.png
bind.sh		bind.sh
bind_pyt.py		bind_pyt.py
came.py		came.py
came_pcode.png		came_pcode.png
configurations.yml		configurations.yml
create_data.sh		create_data.sh
create_pretraining_data.py		create_pretraining_data.py
extract_features.py		extract_features.py
file_utils.py		file_utils.py
inference.py		inference.py
memory.png		memory.png
modeling.py		modeling.py
optimization.py		optimization.py
requirements.txt		requirements.txt
run.sub		run.sub
run_came_pretraining.sh		run_came_pretraining.sh
run_glue.py		run_glue.py
run_pretraining.py		run_pretraining.py
run_squad.py		run_squad.py
run_swag.py		run_swag.py
run_validation.sh		run_validation.sh
schedulers.py		schedulers.py
start_data.py		start_data.py
startup_came.py		startup_came.py
tokenization.py		tokenization.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

CAME

CAME

README.md

CAME Optimizer - Pytorch

Training

The script including the setting of hyperparameters to pretrain BERT:

The startup file corresponding to the script:

Pytorch implementation:

Pretraining Results

Memory Usage Comparison

Usage

Citation

Files

CAME

Directory actions

More options

Directory actions

More options

Latest commit

History

CAME

Folders and files

parent directory

README.md

CAME Optimizer - Pytorch

Training

The script including the setting of hyperparameters to pretrain BERT:

The startup file corresponding to the script:

Pytorch implementation:

Pretraining Results

Memory Usage Comparison

Usage

Citation