Efficient image sequence coding by vector quantization of spatiotemporal bandpass outputs

by Bernhard Wegmann, Christoph Zetzsche

Abstract:

A coding scheme for image sequences is designed in analogy to human visual information processing. We propose a feature-specific vector quantization method applied to multi-channel representation of image sequences. The vector quantization combines the corresponding local/momentary amplitude coefficients of a set of three-dimensional analytic band-pass filters being selective for spatiotemporal frequency, orientation, direction and velocity. Motion compensation and decorrelation between successive frames is achieved implicitly by application of a non-rectangular subsampling to the 3D-bandpass outputs. The nonlinear combination of the outputs of filters which are selective for constantly moving one-dimensional (i.e. spatial elongated) image structures allows a classification of the local/momentary signal features with respect to their intrinsic dimensionality. Based on statistical investigations a natural hierarchy of signal features is provided. This is then used to construct an efficient encoding procedure. Thereby, the different sensitivity of the human vision to the various signal features can be easily incorporated. For a first example, all multi-dimensional vectors are mapped to constantly moving 1D-structures.

Reference:

Efficient image sequence coding by vector quantization of spatiotemporal bandpass outputs (Bernhard Wegmann, Christoph Zetzsche), In Visual Communications and Image Processing '92 (Petros Maragos, ed.), SPIE-Intl Soc Optical Eng, volume 1818, 1992.

Bibtex Entry:

@InProceedings{Wegmann1992,
  author    = {Bernhard Wegmann and Christoph Zetzsche},
  title     = {Efficient image sequence coding by vector quantization of spatiotemporal bandpass outputs},
  booktitle = {Visual Communications and Image Processing {\textquotesingle}92},
  year      = {1992},
  editor    = {Petros Maragos},
  volume    = {1818},
  pages     = {1146--1154},
  month     = {nov},
  publisher = {{SPIE}-Intl Soc Optical Eng},
  abstract  = {A coding scheme for image sequences is designed in analogy to human visual information processing. We propose a feature-specific vector quantization method applied to multi-channel representation of image sequences. The vector quantization combines the corresponding local/momentary amplitude coefficients of a set of three-dimensional analytic band-pass filters being selective for spatiotemporal frequency, orientation, direction and velocity. Motion compensation and decorrelation between successive frames is achieved implicitly by application of a non-rectangular subsampling to the 3D-bandpass outputs. The nonlinear combination of the outputs of filters which are selective for constantly moving one-dimensional (i.e. spatial elongated) image structures allows a classification of the local/momentary signal features with respect to their intrinsic dimensionality. Based on statistical investigations a natural hierarchy of signal features is provided. This is then used to construct an efficient encoding procedure. Thereby, the different sensitivity of the human vision to the various signal features can be easily incorporated. For a first example, all multi-dimensional vectors are mapped to constantly moving 1D-structures.},
  doi       = {10.1117/12.131386},
  url       = {http://dx.doi.org/10.1117/12.131386},
}