PriMock57

This repository contains the data and annotations described in the papers:

The dataset consists of 57 mock medical primary care consultations held over 5 days by 7 Babylon clinicians and 57 Babylon employees acting as patients, using case cards with presenting complaints, symptoms, medical & general history etc. The data in this repository includes:

Audio recordings of the consultations (audio folder);
Manual utterance-level transcriptions of the recordings (transcripts folder);
Consultation notes written by the consulting clinicians (notes folder);
Human evaluation annotations & data (human_eval_data folder).

The scripts folder includes some data transformation scripts (utterance extraction, transcript collation etc.)

More detailed descriptions are found in each folder's README.md files.

How to clone

Due to their size, the audio files are stored using Git Large File Storage (https://git-lfs.github.com/). To clone the repository:

Install Git LFS using the link above. For Mac, you can use Homebrew: brew install git-lfs
Set up Git LFS for your user account: git lfs install
You can now clone this repository: git clone https://github.com/babylonhealth/primock57.git

Contacts

Alex Papadopoulos Korfiatis (alex.papadopoulos@babylonhealth.com)
Francesco Moramarco (francesco.moramarco@babylonhealth.com)

Citing

@inproceedings{korfiatis2022primock57,
  title={(in press): PriMock57: A Dataset Of Primary Care Mock Consultations},
  author={Papadopoulos Korfiatis, Alex and Moramarco, Francesco and Sarac, Radmila and Savkov, Aleksandar},
  booktitle={Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics},
  year={2022}
}

@inproceedings{moramarco2022human,
  title={(In press): Human Evaluation and Correlation with Automatic Metrics in Consultation Note Generation},
  author={Moramarco, Francesco and Papadopoulos Korfiatis, Alex and Perera, Mark and Juric, Damir and Flann, Jack and Reiter, Ehud and Belz, Anya and Savkov, Aleksandar},
  booktitle={Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics},
  year={2022}
}

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
audio		audio
consultation_checklists		consultation_checklists
human_eval_data		human_eval_data
notes		notes
scripts		scripts
transcripts		transcripts
.gitattributes		.gitattributes
.gitignore		.gitignore
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE.md		LICENSE.md
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

PriMock57

How to clone

Contacts

Citing

About

Releases

Packages

Languages

License

rr-h/primock57

Folders and files

Latest commit

History

Repository files navigation

PriMock57

How to clone

Contacts

Citing

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages