Describe: A computational perspective on visual attention