Convolutional neural network, Image processing, Computer graphics
Multi-scale networks and an application.
Scene Parsing with Multiscale Feature Learning, Purity Trees, and Optimal Covers
http://www.robots.ox.ac.uk/~vgg/hzbook/
http://www.clement.farabet.net/research.html#parsing
Hand-eye coordination. See work on grabbing objects in Robotics
Nice notes on mathematical optimization for computer graphics and computer vision
algorithm
The algorithm we used here is a scaled-up version of the one presented in [24] (Unsupervised Learning of Visual Features through Spike Timing Dependent Plasticity). We used a five-layer hierarchical network, with a classifier at the end, similar to HMAX model. Specifically, we alternated simple cells that gain selectivity through a sum operation, and complex cells that gain shift and scale invariance through a max operation. Spike timing -coding (stronger signal fires first). Weight sharing, as in CNNs.
The image is copied and scaled, and presented to copies of the network (like Cortical columns). To increase the sparsity at a given scale and location (corresponding to one cortical column), only the spike corresponding to the best matching orientation is propagated (i.e. a winner-take-all inhibition is employed).
See also Human vision
A biologically inspired spiking model of visual processing for image feature detection
Dynamic vision cameras!!
Unsupervised Learning of Visual Features through Spike Timing Dependent Plasticity
http://ai.stanford.edu/~haosu/
https://www.wikiwand.com/en/Image-based_modeling_and_rendering