Prototype of an automated medical scribe, built in collaboration with doctors from the University of British Columbia and St. Paul's Hospital.
The goal of Intuition Scribe is to automatically produce an admission note from an audio recording of a patient-doctor interview.
See this Figma prototype for a demo.
The pipeline follows these stages (a short, illustrative code sketch of each stage appears after this list):
- **Speech recognition**: The audio is converted to text. Currently the best method is the Rev.ai speech-to-text API, which also inserts punctuation and capitalization. See `rev_transcription.py` for more details.
- **Speaker diarization**: The text is assigned to speakers (either the doctor or the patient) to form a dialogue transcript. Currently the Resemblyzer library is used, which is based on Generalized End-To-End Loss for Speaker Verification. This method requires that, for each conversation, a snippet of audio from each speaker is fed to the model for calibration. See `diarization.py` for more details.
- **Question-and-answer summarization**: After a transcript is created, the question-answer turns in the dialogue are extracted and summarized. The turns are identified from punctuation (the presence of question marks in the speech-recognition output), then summarized using T5, which is trained on a dataset collected from YouTube and labelled by our doctors. See `t5/train_qa_summarizer.py` for more details.
- **Categorization**: The summarized statements are then assigned to sections of the admission note, such as Chief Complaint, History of Present Illness, and Social History. Currently a keyword-based assignment is used.
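
As an illustration of the speech-recognition stage, here is a minimal sketch built on the official `rev_ai` Python SDK. The access token, file name, and polling interval are placeholders; the project's actual logic lives in `rev_transcription.py`.

```python
import time

from rev_ai import apiclient

# Access token from the Rev.ai dashboard (placeholder value).
client = apiclient.RevAiAPIClient("REV_AI_ACCESS_TOKEN")

def transcribe(audio_path: str) -> str:
    """Submit a local audio file and poll until the transcript is ready."""
    job = client.submit_job_local_file(audio_path)
    while True:
        details = client.get_job_details(job.id)
        if details.status.name == "TRANSCRIBED":
            break
        if details.status.name == "FAILED":
            raise RuntimeError("Rev.ai transcription job failed")
        time.sleep(5)
    # Rev.ai returns text with punctuation and capitalization inserted.
    return client.get_transcript_text(job.id)
```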
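
The calibration-based diarization can be sketched with Resemblyzer's public API: embed one reference snippet per speaker, then label each utterance with the speaker whose embedding is closest. The snippet file names and segmentation are assumptions; see `diarization.py` for the real implementation.

```python
import numpy as np
from resemblyzer import VoiceEncoder, preprocess_wav

encoder = VoiceEncoder()

# Calibration: a short reference snippet per speaker (placeholder file names).
references = {
    "doctor": encoder.embed_utterance(preprocess_wav("doctor_snippet.wav")),
    "patient": encoder.embed_utterance(preprocess_wav("patient_snippet.wav")),
}

def assign_speaker(segment_wav: np.ndarray) -> str:
    """Label one audio segment with the closest calibrated speaker."""
    embedding = encoder.embed_utterance(segment_wav)
    # Resemblyzer embeddings are L2-normalized, so a dot product
    # is equivalent to cosine similarity.
    return max(references, key=lambda name: float(references[name] @ embedding))
```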
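
For the summarization stage, the sketch below assumes the transcript is a list of `(speaker, text)` turns and that a fine-tuned checkpoint has been saved by `t5/train_qa_summarizer.py`; the checkpoint path and the `summarize:` prompt format are assumptions, not the project's exact setup.

```python
from transformers import T5ForConditionalGeneration, T5Tokenizer

# Placeholder path to a checkpoint produced by t5/train_qa_summarizer.py.
tokenizer = T5Tokenizer.from_pretrained("t5/qa_summarizer_checkpoint")
model = T5ForConditionalGeneration.from_pretrained("t5/qa_summarizer_checkpoint")

def extract_qa_turns(dialogue):
    """Pair each doctor question (ends with '?') with the patient's reply."""
    pairs = []
    for (speaker, text), (next_speaker, next_text) in zip(dialogue, dialogue[1:]):
        if speaker == "doctor" and text.rstrip().endswith("?") and next_speaker == "patient":
            pairs.append((text, next_text))
    return pairs

def summarize(question: str, answer: str) -> str:
    """Condense one question-answer turn into a single statement."""
    inputs = tokenizer(f"summarize: {question} {answer}",
                       return_tensors="pt", truncation=True)
    output_ids = model.generate(**inputs, max_length=64)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```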
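
Finally, the keyword-based categorization might look like the sketch below. The keyword lists here are invented examples; in the pipeline the medical vocabulary comes from the SNOMED CT terms file listed under the dependencies.

```python
# Illustrative keyword lists only; the real pipeline draws its medical
# vocabulary from the SNOMED CT terms file.
SECTION_KEYWORDS = {
    "Chief Complaint": ["pain", "complaint", "brings you in"],
    "History of Present Illness": ["started", "weeks ago", "worse"],
    "Social History": ["smoke", "alcohol", "work", "live"],
}

def categorize(statement: str, default: str = "History of Present Illness") -> str:
    """Assign a summarized statement to the section with the most keyword hits."""
    lowered = statement.lower()
    scores = {section: sum(keyword in lowered for keyword in keywords)
              for section, keywords in SECTION_KEYWORDS.items()}
    best = max(scores, key=scores.get)
    return best if scores[best] > 0 else default
```

For example, `categorize("Patient smokes a pack of cigarettes a day")` would land in Social History.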
The pipeline also relies on:
- SNOMED CT terms file: a library of medical keywords used to determine which text is medically relevant.
- Huggingface transformers for the T5 model.
Experimental directories:
- `coqa/`: uses the CoQA dataset as extra training data for the question-answer summarization model.
- `gpt/`: uses a few-shot tuned GPT-2 model for the question-answer summarization model.