High quality low complexity scalable wavelet audio coding
- Resource Type
- Conference
- Authors
- Dobson, W.K.; Yang, J.J.; Smart, K.J.; Guo, F.K.
- Source
- 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-97), vol. 1, pp. 327-330, 1997
- Subject
- Signal Processing and Analysis; Components, Circuits, Devices and Systems; Audio coding; Bit rate; Wavelet packets; Psychoacoustic models; Wavelet domain; Wavelet transforms; Quantization; Tree data structures; Masking threshold; Wavelet coefficients
- Language
- ISSN
- 1520-6149
This paper presents an audio coder for real-time multimedia applications. To achieve high quality at low bit rates, the coder uses a wavelet packet decomposition to transform the audio data into the wavelet domain and a psychoacoustic model to minimize quantization noise. The wavelet packet decomposition tree structures were chosen to closely mimic the critical bands of the psychoacoustic model. Instead of determining the masking thresholds in the Fourier domain, the coder drives the psychoacoustic model directly with the wavelet coefficients. The coder supports most of the standard industrial sampling frequencies. An efficient bit rate control scheme lets the coder operate at virtually any desired bit rate. The coder achieves near perceptually lossless quality at or below 80 kb/s for most audio sources. Real-time encoding and decoding require only a fraction of a Pentium or faster CPU.
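The idea of a wavelet packet tree whose subbands are narrow at low frequencies and wide at high frequencies, roughly as critical bands are, can be illustrated with a minimal sketch. The abstract does not give the paper's filters, tree shapes, or psychoacoustic model, so everything here is an assumption made for illustration: the Haar filter, the tree `TREE`, and the helper names `haar_split`, `packet_decompose`, and `band_energies` are hypothetical. The per-band energies computed at the end are the kind of wavelet-domain quantity a masking model could take as input in place of a Fourier spectrum.

```python
import math

def haar_split(x):
    """One orthonormal Haar analysis step: returns (low, high) half-length bands."""
    s = 1.0 / math.sqrt(2.0)
    low = [(x[i] + x[i + 1]) * s for i in range(0, len(x) - 1, 2)]
    high = [(x[i] - x[i + 1]) * s for i in range(0, len(x) - 1, 2)]
    return low, high

def packet_decompose(x, tree):
    """Decompose x by following `tree`.

    `tree` is None for a leaf, or a (low_subtree, high_subtree) pair for a split.
    Returns the leaf bands in tree order (low branch first). Tree order is not
    strictly increasing frequency, because splitting a high-pass band swaps the
    frequency order of its two children.
    """
    if tree is None:
        return [x]
    low, high = haar_split(x)
    return packet_decompose(low, tree[0]) + packet_decompose(high, tree[1])

def band_energies(bands):
    """Sum of squared coefficients in each leaf band."""
    return [sum(c * c for c in band) for band in bands]

# Hypothetical tree (an assumption, not the paper's): the lower half of the
# spectrum is split into four narrow bands, the upper half into two wide ones.
LEAF = None
TREE = (((LEAF, LEAF), (LEAF, LEAF)), (LEAF, LEAF))

# A 16-sample frame containing a single low-frequency sinusoid.
frame = [math.sin(2.0 * math.pi * n / 16.0) for n in range(16)]
bands = packet_decompose(frame, TREE)
print([len(b) for b in bands])   # coefficients per band
print(band_energies(bands))      # wavelet-domain energy per band
```

The frame length must be divisible by 2 raised to the tree depth (here 8). Because the Haar step is orthonormal, the band energies sum to the energy of the input frame, so a bit allocation driven by these energies accounts for the whole signal.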