Generating text that describes an Image
CS231n Lecture 10 - Recurrent Neural Networks, Image Captioning, LSTM
DenseCap: Fully Convolutional Localization Networks for Dense Captioning