Research Themes Multimodal Coreference Resolution Scene Graph Generation Image Captioning Semi-Supervised/Noisy Label Learning