A picture is worth a thousand (coherent) words: building a natural description of images. Google Research Blob. November 17, 2014.
Google has developed a machine-learning system that can automatically produce captions to accurately describe images the first time it sees them. It can describe a complex scene which requires a deeper representation of what’s going on in the scene, capturing how the various objects relate to one another and translating it all into natural-sounding language. The full paper "Show and Tell: A Neural Image Caption Generator" is here.
No comments:
Post a Comment