Spotlight

Discriminative Bimodal Networks for Visual Localization and Detection With Natural Language Queries