Frame-and segment-level features and candidate pool evaluation for video caption generation

Rakshith Shetty, Jorma Laaksonen

Tutkimustuotos: Artikkeli kirjassa/konferenssijulkaisussaConference article in proceedingsScientificvertaisarvioitu

82 Sitaatiot (Scopus)

Abstrakti

We present our submission to the Microsoft Video to Language Challenge of generating short captions describing videos in the challenge dataset. Our model is based on the encoder-decoder pipeline, popular in image and video captioning systems. We propose to utilize two different kinds of video features, one to capture the video content in terms of objects and attributes, and the other to capture the motion and action information. Using these diverse features we train models specializing in two separate input sub-domains. We then train an evaluator model which is used to pick the best caption from the pool of candidates generated by these domain expert models. We argue that this approach is better suited for the current video captioning task, compared to using a single model, due to the diversity in the dataset. Efficacy of our method is proven by the fact that it was rated best in MSR Video to Language Challenge, as per human evaluation. Additionally, we were ranked second in the automatic evaluation metrics based table.

AlkuperäiskieliEnglanti
OtsikkoMM 2016 - Proceedings of the 2016 ACM Multimedia Conference
KustantajaACM
Sivut1073-1076
Sivumäärä4
ISBN (elektroninen)9781450336031
DOI - pysyväislinkit
TilaJulkaistu - 1 lokak. 2016
OKM-julkaisutyyppiA4 Artikkeli konferenssijulkaisussa
TapahtumaACM Multimedia - Amsterdam, Alankomaat
Kesto: 15 lokak. 201619 lokak. 2016
Konferenssinumero: 24

Conference

ConferenceACM Multimedia
LyhennettäACMMM
Maa/AlueAlankomaat
KaupunkiAmsterdam
Ajanjakso15/10/201619/10/2016

Sormenjälki

Sukella tutkimusaiheisiin 'Frame-and segment-level features and candidate pool evaluation for video caption generation'. Ne muodostavat yhdessä ainutlaatuisen sormenjäljen.
  • Suomalainen laskennallisen päättelyn huippuyksikkö

    Xu, Y., Rintanen, J., Kaski, S., Anwer, R., Parviainen, P., Soare, M., Vuollekoski, H., Rezazadegan Tavakoli, H., Peltola, T., Blomstedt, P., Puranen, S., Dutta, R., Gebser, M., Mononen, T., Bogaerts, B., Tasharrofi, S., Pesonen, H., Weinzierl, A. & Yang, Z.

    01/01/201531/12/2017

    Projekti: Academy of Finland: Other research funding

Siteeraa tätä