An attention function can be described as mapping a query and a set of key-value pairs to an output, where the query, keys, values, and output are all vectors. The output is computed as a weighted sum of the values, where the weight assigned to each value is computed by a compatibility function of the query with the corresponding key.
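A minimal sketch of the weighted-sum description above. The text only says "a compatibility function", so the scaled dot product followed by a softmax is an assumption here, and the names `attention` and `softmax` are illustrative.

```python
import math


def softmax(scores):
    """Turn raw scores into positive weights that sum to 1."""
    m = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention(query, keys, values):
    """Return (output, weights) for one query over a set of key-value pairs.

    Assumption: compatibility = dot(query, key) / sqrt(d), then softmax.
    """
    d = len(query)
    scores = [
        sum(q * k for q, k in zip(query, key)) / math.sqrt(d)
        for key in keys
    ]
    weights = softmax(scores)
    dim_v = len(values[0])
    # Output is the weighted sum of the value vectors.
    output = [
        sum(w * v[j] for w, v in zip(weights, values))
        for j in range(dim_v)
    ]
    return output, weights


if __name__ == "__main__":
    out, w = attention(
        query=[1.0, 0.0],
        keys=[[1.0, 0.0], [0.0, 1.0]],
        values=[[10.0, 0.0], [0.0, 10.0]],
    )
    print(w, out)
```

The first key matches the query better than the second, so the first value receives the larger weight and dominates the output.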
The frontal and parietal cortex: Eye movements and attention
Predictive coding is related to attention
The normalization model of attention. The model proposes that attention is mostly accomplished by multiplying the input by an attention field. Furthermore, the authors propose a model of attention that incorporates divisive normalization (code on paper)
We propose that this computational principle endows the brain with the capacity to increase sensitivity to faint stimuli presented alone and to reduce the impact of task irrelevant distracters when multiple stimuli are presented.
The three basic components of the model are: the stimulation field, the suppressive field, and the attention field
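A rough sketch of how these components might fit together in one dimension. It assumes the divisive form response = (attention × stimulus drive) / (suppressive drive + sigma), with the suppressive drive taken as a plain box average of the attention-weighted drive. The function name, `pool_radius`, and `sigma` are hypothetical choices for illustration, not the code that accompanies the paper.

```python
def normalization_model(stimulus_drive, attention_field, pool_radius=1, sigma=0.1):
    """Toy 1-D normalization model of attention (illustrative assumptions only)."""
    n = len(stimulus_drive)
    # Attention acts multiplicatively on the stimulus drive.
    excitatory = [a * e for a, e in zip(attention_field, stimulus_drive)]
    # Suppressive drive: the attention-weighted drive pooled over neighbours.
    # A box average is an assumption made for simplicity.
    suppressive = []
    for i in range(n):
        lo = max(0, i - pool_radius)
        hi = min(n, i + pool_radius + 1)
        window = excitatory[lo:hi]
        suppressive.append(sum(window) / len(window))
    # Divisive normalization; sigma keeps the denominator positive.
    return [x / (s + sigma) for x, s in zip(excitatory, suppressive)]


if __name__ == "__main__":
    stimulus = [0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0]  # two equal stimuli
    neutral = [1.0] * 7
    attend_left = [1.0, 2.0, 1.0, 1.0, 1.0, 1.0, 1.0]
    print(normalization_model(stimulus, neutral, pool_radius=6))
    print(normalization_model(stimulus, attend_left, pool_radius=6))
```

With a suppressive pool that spans both stimuli, attending to the left stimulus raises its response and lowers the response to the unattended one, which matches the qualitative behaviour described above.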