Multimodal Abstractive Summarization for Open-Domain Videos. In: Visually Grounded Interaction and Language (ViGIL), pp. 1-8, Neural Information Processing Systems (NIPS) Foundation, La Jolla, CA, USA (pdf, local PDF, local PDF, local PDF, obd, bibtex)