Skip to content

Commit

Permalink
Add conversation source information
Browse files Browse the repository at this point in the history
  • Loading branch information
RavitejaAnantha committed Apr 18, 2021
1 parent b65c7d1 commit 3ce0cb2
Show file tree
Hide file tree
Showing 3 changed files with 8 additions and 10 deletions.
1 change: 1 addition & 0 deletions .gitignore
Original file line number Diff line number Diff line change
@@ -1,3 +1,4 @@
dataset/__MACOSX/
dataset/qrecc_test.json
dataset/qrecc_train.json
.DS_Store
17 changes: 7 additions & 10 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -23,7 +23,7 @@ We used questions in NQ dataset as prompts to create conversations explicitly ba
For each query we collect query rewrites by resolving references, the resulting query rewrite is a context-independent version of the original (context-dependent) question.
The rewritten query is then used to with a search engine to answer the question. Each query is also annotated with answer, link to the web page that used to produce the answer.

Each conversation in the dataset contains a unique `Conversation_no`, `Turn_no` unique within a conversation, the original `Question`, `Context`, `Rewrite` and `Answer` with `Answer_URL`.
Each conversation in the dataset contains a unique `Conversation_no`, `Turn_no` unique within a conversation, the original `Question`, `Context`, `Rewrite`, `Answer` with `Answer_URL` and the `Conversation_source`.

```json
{
Expand All @@ -36,23 +36,20 @@ Each conversation in the dataset contains a unique `Conversation_no`, `Turn_no`
"Answer": "Tesla Inc. is an American automotive and energy company based in Palo Alto, California. The company specializes in electric car manufacturing and, through its SolarCity subsidiary, solar panel manufacturing.",
"Answer_URL": "https://en.wikipedia.org/wiki/Tesla,_Inc.",
"Conversation_no": 74,
"Turn_no": 2
"Turn_no": 2,
"Conversation_source": "trec"
}
```

## Evaluation

### Evaluate performance on Query Rewriting task

Coming Soon!

### Evaluate performance on Retrieval Question Answering task

Coming Soon!
To evaluate retrieval QA, use [evaluate_retrieval.py](https://github.com/apple/ml-qrecc/blob/main/utils/evaluate_retrieval.py)

### Evaluate performance on Extractive Question Answering task

Coming Soon!
To evaluate extractive QA, use [evaluate_qa.py](https://github.com/apple/ml-qrecc/blob/main/utils/evaluate_qa.py)

## Citation

Expand All @@ -62,8 +59,8 @@ Please cite the following if you found QReCC dataset, our [paper](https://arxiv.
@article{qrecc,
title={Open-Domain Question Answering Goes Conversational via Question Rewriting},
author={Anantha, Raviteja and Vakulenko, Svitlana and Tu, Zhucheng and Longpre, Shayne and Pulman, Stephen and Chappidi, Srinivas},
journal={arXiv preprint arXiv:2010.04898},
year={2020}
journal={Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies},
year={2021}
}
```

Expand Down
Binary file modified dataset/qrecc_data.zip
Binary file not shown.

0 comments on commit 3ce0cb2

Please sign in to comment.