BIP! Finder - Improved Image Captioning via Policy Gradient optimization of SPIDEr

2017 • Improved Image Captioning via Policy Gradient optimization of SPIDEr

Authors: Liu, Siqi, Zhu, Zhenhai, Ye, Ning, Guadarrama, Sergio, Murphy, Kevin

Venue: 2017 IEEE International Conference on Computer Vision (ICCV)

Type: Publication

Abstract: Current image captioning methods are usually trained via (penalized) maximum likelihood estimation. However, the log-likelihood score of a caption does not correlate well with human assessments of quality. Standard syntactic evaluation metrics, such as BLEU, METEOR and ROUGE, are also not well correlated. The newer SPICE and CIDEr metrics are better correlated, but have traditionally been hard to optimize for. In this paper, we show how to use a policy gradient (PG) method to directly optimize a linear combination of SPICE and CIDEr (a combinat... (read more)

Topics: Artificial intelligence

DOI: 10.1109/iccv.2017.100 (Found 2 versions)

BIP! social metrics: 0 1
External links: Crossref OpenAIRE

BibTex PDF

Topic-specific impact indicators

Popularity: This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
Influence: This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
Citation Count: This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
Impulse: This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.