This repo is a codebase snapshot of lxucs/coref-hoi; active issues or updates are maintained in lxucs/coref-hoi repository.
This repository contains the implementation of the paper: Revealing the Myth of Higher-Order Inference in Coreference Resolution.
The basic end-to-end coreference model is a PyTorch re-implementation based on the TensorFlow model following similar preprocessing (see this repository).
There are four higher-order inference (HOI) methods experimented: Attended Antecedent, Entity Equalization, Span Clustering, and Cluster Merging. All are included here except for Entity Equalization which is experimented in the equivalent TensorFlow environment (see this separate repository).
Files:
- run.py: training and evaluation
- model.py: the coreference model
- higher_order.py: higher-order inference modules
- analyze.py: result analysis
- preprocess.py: converting CoNLL files to examples
- tensorize.py: tensorizing example
- conll.py, metrics.py: same CoNLL-related files from the repository
- experiments.conf: different model configurations
Set up environment and data for training and evaluation:
- Install Python3 dependencies:
pip install -r requirements.txt
- Create a directory for data that will contain all data files, models and log files; set
data_dir = /path/to/data/dir
in experiments.conf - Prepare dataset (requiring OntoNotes 5.0 corpus):
./setup_data.sh /path/to/ontonotes /path/to/data/dir
For SpanBERT, download the pretrained weights from this repository, and rename it /path/to/data/dir/spanbert_base
or /path/to/data/dir/spanbert_large
accordingly.
Provided trained models:
- SpanBERT + no HOI: download
- SpanBERT + Attended Antecedent: download
- SpanBERT + Span Clustering: download
- SpanBERT + Cluster Merging: download
- SpanBERT + Entity Equalization: see repository
The name of each directory corresponds with a configuration in experiments.conf. Each directory has two trained models inside.
If you want to use the official evaluator, download and unzip conll 2012 scorer under this directory.
Evaluate a model on the dev/test set:
- Download the corresponding model directory and unzip it under
data_dir
python evaluate.py [config] [model_id] [gpu_id]
- e.g. Attended Antecedent:
python evaluate.py train_spanbert_large_ml0_d2 May08_12-38-29_58000 0
- e.g. Attended Antecedent:
python run.py [config] [gpu_id]
- [config] can be any configuration in experiments.conf
- Log file will be saved at
your_data_dir/[config]/log_XXX.txt
- Models will be saved at
your_data_dir/[config]/model_XXX.bin
- Tensorboard is available at
your_data_dir/tensorboard
Some important configurations in experiments.conf:
data_dir
: the full path to the directory containing dataset, models, log filescoref_depth
andhigher_order
: controlling the higher-order inference modulebert_pretrained_name_or_path
: the name/path of the pretrained BERT model (HuggingFace BERT models)max_training_sentences
: the maximum segments to use when document is too long; for BERT-Large and SpanBERT-Large, set to3
for 32GB GPU or2
for 24GB GPU
@inproceedings{xu-choi-2020-revealing,
title = "Revealing the Myth of Higher-Order Inference in Coreference Resolution",
author = "Xu, Liyan and Choi, Jinho D.",
booktitle = "Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)",
month = nov,
year = "2020",
publisher = "Association for Computational Linguistics",
url = "https://www.aclweb.org/anthology/2020.emnlp-main.686",
pages = "8527--8533"
}