For our discussion here, let’s say k=3.

“position-sensitive score maps” are not filters. We start with the feature map and use 9 different groups of convolution filters to create 9 position-sensitive score maps for each class. These are kind of feature maps but each one responsible for different part of the object. Say we are working on the top left corner, then a position-sensitive score maps will store the feature suppose to be the top left corner of the object that wants to be detected. The purpose here is to detect a face which the top left corner should be an eye.

Written by

Deep Learning

Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store