Efficient Coding of Time-Relative Structure Using Spikes

1 January 2005

journal article
Published by MIT Press in Neural Computation

Vol. 17 (1), 19-45
https://doi.org/10.1162/0899766052530839

Abstract

Nonstationary acoustic features provide essential cues for many auditory tasks, including sound localization, auditory stream analysis, and speech recognition. These features are best characterized relative to a precise point in time, such as the onset of a sound or the beginning of a harmonic periodicity. Extracting them is difficult, in part because standard block-based signal analysis methods yield a representation that is sensitive to the arbitrary alignment of the blocks with respect to the signal. Convolutional techniques such as shift-invariant transformations can reduce this sensitivity, but they do not yield an efficient code, that is, one that forms a nonredundant representation of the underlying structure. Here, we develop a non-block-based method for signal representation that is both time relative and efficient. Signals are represented as a linear superposition of time-shiftable kernel functions, each with an associated magnitude and temporal position. Signal decomposition in this method is a nonlinear process that consists of optimizing the kernel function scaling coefficients and temporal positions to form an efficient, shift-invariant representation. We demonstrate the properties of this representation by using it to characterize structure in various types of nonstationary acoustic signals. The computational problem investigated here is directly relevant to neural coding at the auditory nerve and to the more general issue of how to encode complex, time-varying signals with a population of spiking neurons.
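
The representation described in the abstract can be illustrated with a short sketch. In notation chosen here for illustration, a signal is modeled as x(t) ≈ Σ s_i · φ_{m_i}(t − τ_i), where each "spike" i carries a kernel index m_i, a temporal position τ_i, and a magnitude s_i. The abstract does not say which optimization procedure performs the decomposition, so the code below swaps in greedy matching pursuit as one common choice. The gammatone-like kernel shape, the sampling rate, and all function names are assumptions made for this example only.

```python
import numpy as np


def gammatone(fc, sr=16000, dur=0.02, order=4, bw=100.0):
    """Hypothetical unit-norm kernel (assumed shape; not taken from the abstract)."""
    t = np.arange(int(sr * dur)) / sr
    k = t ** (order - 1) * np.exp(-2 * np.pi * bw * t) * np.cos(2 * np.pi * fc * t)
    return k / np.linalg.norm(k)


def matching_pursuit(x, kernels, n_spikes=10):
    """Greedy decomposition into (kernel index, sample position, magnitude) triples.

    Assumes every kernel has unit norm and is no longer than the signal.
    """
    residual = np.asarray(x, dtype=float).copy()
    spikes = []
    for _ in range(n_spikes):
        best = None
        for m, k in enumerate(kernels):
            # Inner product of the residual with kernel m at every valid shift.
            corr = np.correlate(residual, k, mode="valid")
            tau = int(np.argmax(np.abs(corr)))
            if best is None or abs(corr[tau]) > abs(best[2]):
                best = (m, tau, float(corr[tau]))
        m, tau, s = best
        # Remove the selected kernel's contribution from the residual.
        residual[tau:tau + len(kernels[m])] -= s * kernels[m]
        spikes.append((m, tau, s))
    return spikes, residual


def reconstruct(spikes, kernels, n):
    """Rebuild a length-n signal as a superposition of shifted, scaled kernels."""
    x = np.zeros(n)
    for m, tau, s in spikes:
        x[tau:tau + len(kernels[m])] += s * kernels[m]
    return x


# Toy signal: two non-overlapping kernels at known positions and magnitudes.
kernels = [gammatone(500.0), gammatone(2000.0)]
x = np.zeros(1000)
x[100:100 + len(kernels[0])] += 2.0 * kernels[0]
x[600:600 + len(kernels[1])] += 1.5 * kernels[1]

spikes, residual = matching_pursuit(x, kernels, n_spikes=2)
for m, tau, s in spikes:
    print(f"kernel={m} position={tau} magnitude={s:.3f}")
```

Because each event is stored with its own temporal position rather than a block index, shifting the input by a few samples shifts the recovered positions by the same amount and leaves the magnitudes unchanged. A greedy search of this kind is only an approximation; it does not jointly optimize all coefficients and positions as the abstract describes.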

