"XGPT: Cross-modal Generative Pre-Training for Image Captioning."

Qiaolin Xia et al. (2020)