[CVPR20] Video Object Grounding using Semantic Roles in Language Description (https://arxiv.org/abs/2003.10606)
nlp
video
vision
captioning-videos
vision-and-language
grounding
pytorch-implementation
visual-grounding
video-grounding
video-object-grounding
object-grounding
Updated Jun 10, 2020 - Python