"Audio-Visual Interpretable and Controllable Video Captioning."

Yapeng Tian et al. (2019)
a service of Schloss Dagstuhl - Leibniz Center for Informatics