Modulating early visual processing by language
Video understanding – DeepStory: Video Story QA by Deep Embedded Memory Networks