GitHub - facebookresearch/TreeNLG at a01bce34a4c9e5cb1fb27d9709da5659c5cf08b4

Constrained Decoding for Neural NLG from Compositional Representations in Task-Oriented Dialogue

Code and dataset supporting the paper:

Anusha Balakrishnan, Jinfeng Rao, Kartikeya Upasani, Michael White and Rajen Subba. Constrained Decoding for Neural NLG from Compositional Representations in Task-Oriented Dialogue. To appear in Proc. ACL-19.

If you find this code or dataset useful in your research, please consider citing our paper.

Reference

@InProceedings{Balakrishnan2019constrainednlg,
author = {Anusha Balakrishnan and Jinfeng Rao and Kartikeya Upasani and Michael White and Rajen Subba},
title = {Constrained Decoding for Neural {NLG} from Compositional Representations in Task-Oriented Dialogue},
booktitle = {Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics},
month = {July},
year = {2019},
note = {To appear}
}

Data

In addition to the weather and enriched E2E challenge dataset from our paper, we released another weather_challenge dataset, which contains harder weather scenarios in train/val/test files. Each response was collected by providing annotators, who are native English speakers, with a user query, a compositional meaning representation (with discourse relations and dialog acts), and a context. Currently context is not provided yet but we will release soon. All of these are made available in our dataset. See our linked paper for more details.

Data Statistics

Dataset	Train	Val	Test	Disc_Test
Weather	25390	3078	3121	454
Weather_Challenge	32684	3397	3382	-
E2E	42061	4672	4693	230

Disc_Test is a more challenging subset of our test set that contains discourse relations, which is also the subset we report results in Disc column in Table 7 in our paper. Note that there are some minor differences of data statistics to our paper, please use the statistics above.

Results

We noticed that slightly higher numbers can be obtained by tuning hyper-parameters compared to the numbers we reported in our paper. Therefore, we update all the automatic numbers (BLEU and tree accuracy) here and please use numbers below when citing our results. For tree accuracy, we report the number on the whole test set, as well as on two disjoint subsets: no-discourse subset that contains examples without any discourse act; discourse subset contains example with 1+ discourse acts.

Weather Dataset

Dataset	BLEU	TreeAcc(whole)	TreeAcc(no-discourse)	TreeAcc(discourse)
S2S-Tree	76.12	94.00	96.66	86.59
S2S-Constr	76.60	97.15	98.76	94.45

Weather Challenge Dataset

Dataset	BLEU	TreeAcc(whole)	TreeAcc(no-discourse)	TreeAcc(discourse)
S2S-Tree	76.75	91.10	96.62	83.3
S2S-Constr	77.45	95.74	98.52	91.61

E2E Dataset

Dataset	BLEU	TreeAcc(whole)	TreeAcc(no-discourse)	TreeAcc(discourse)
S2S-Tree	74.58	97.06	99.68	95.28
S2S-Constr	74.69	99.25	99.89	97.78

Code

Computing tree accuracy:

python compute_tree_acc.py -tsv ~/seq2seq_out.tsv

Output file should be tab-separated with columns id, input, pred, target.

License

TreeNLG is released under CC-BY-NC-4.0, see LICENSE for details.

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
constrained_decoding		constrained_decoding
data		data
tree_accuracy		tree_accuracy
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
compute_tree_acc.py		compute_tree_acc.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Constrained Decoding for Neural NLG from Compositional Representations in Task-Oriented Dialogue

Reference

Data

Data Statistics

Results

Weather Dataset

Weather Challenge Dataset

E2E Dataset

Code

License

About

Releases

Packages

Contributors 4

Languages

License

facebookresearch/TreeNLG

Folders and files

Latest commit

History

Repository files navigation

Constrained Decoding for Neural NLG from Compositional Representations in Task-Oriented Dialogue

Reference

Data

Data Statistics

Results

Weather Dataset

Weather Challenge Dataset

E2E Dataset

Code

License

About

Resources

License

Code of conduct

Security policy

Stars

Watchers

Forks

Releases

Packages 0

Contributors 4

Languages

Packages