The Equivalence of Information-Theoretic and Likelihood-Based Methods for Neural Dimensionality Reduction
Fig 5
A second example Bernoulli neuron for which fails to identify the most-informative one-dimensional subspace.
The stimulus space has two dimensions, denoted s1 and s2, and stimuli were drawn iid from a standard Gaussian (0,1). (A) The nonlinearity f(s1,s2) = p(spike|s1,s2) is excitatory in s1 and suppressive in s2; brighter intensity indicates higher spike probability. (B) Contour plot of the stimulus-conditional densities given the two possible responses: “spike” (red) or “no-spike” (blue), along with the raw stimulus distribution (black). (C) Information carried by silences (I0), single spikes (Iss), and total Bernoulli information (IBer = I0+Iss) as a function of subspace orientation. The MID estimate is the maximum of Iss, but the total Bernoulli information is in fact 13% higher at due to the incorporation of no-spike information. Although both stimulus axes are clearly relevant to the neuron, MID identifies the less informative one. As with the previous figure, equations (19) and (20) detail the equivalence between IBer and ℒlnb, so that this figure can be viewed from either an information-theoretic or likelihood-based perspective.