"Multimodal embedding fusion for robust speaker role recognition in video ..."

Mickael Rouvier et al. (2015)
a service of Schloss Dagstuhl - Leibniz Center for Informatics