Chapter 14
MPEG Audio Compression
14.1 Psychoacoustics
14.2 MPEG Audio
14.3 Other Commercial Audio Codecs
14.4 The Future: MPEG-7 and MPEG-21
14.5 Further Exploration
14.1 Psychoacoustics
Frequency Masking
Threshold of Hearing
[Plot: threshold of hearing, level in dB against frequency in Hz, from 10^2 to 10^4 Hz]
\[
\mathrm{Threshold}(f) = 3.64\,(f/1000)^{-0.8} - 6.5\,e^{-0.6\,(f/1000 - 3.3)^{2}} + 10^{-3}\,(f/1000)^{4}
\tag{14.1}
\]
The threshold is measured in dB, with f in Hz. The origin (0,0) of
formula (14.1) lies at 2,000 Hz: Threshold(f) is approximately 0 dB at
f = 2 kHz
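As a quick check of formula (14.1), the short sketch below evaluates the threshold at a few frequencies. The function name `threshold_db` and the chosen test frequencies are illustrative assumptions, not part of the original slides.

```python
import math


def threshold_db(f_hz: float) -> float:
    """Threshold of hearing in dB per formula (14.1); f_hz is in Hz, f_hz > 0."""
    k = f_hz / 1000.0  # frequency expressed in kHz
    return (3.64 * k ** -0.8
            - 6.5 * math.exp(-0.6 * (k - 3.3) ** 2)
            + 1e-3 * k ** 4)


# Illustrative frequencies (an assumption, chosen to span the audible range).
for f in (100, 1000, 2000, 3300, 10000):
    print(f"{f:>6} Hz: {threshold_db(f):7.2f} dB")
```

At 2 kHz the formula evaluates to a value close to zero (slightly negative), which is why 2 kHz serves as the 0 dB reference point. The threshold rises steeply toward low frequencies and dips below 0 dB around 3 to 4 kHz, where hearing is most sensitive.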
The threshold in Fig. 14.3 plots the audible level for a single
masking tone (1 kHz)
Fig. 14.4 shows how the plot changes if other masking tones
are used
[Fig. 14.3 plot: level (dB) against frequency (kHz), 0 to 15 kHz; regions marked "Audible tone" and "Inaudible tone"]
[Fig. 14.4 plot: level (dB) against frequency (kHz), 0 to 15 kHz; curves for masking tones at 1, 4, and 8 kHz]
Temporal Masking
[Plot: level (dB) against delay time (ms), 0 to 1000 ms; curves labeled "Mask tone" and "Test tone"]
Fig. 14.6: The louder the test tone, the less time our hearing
needs to recover from the masking.
MPEG Layers
The MPEG audio encoder may take into account the actual width of the critical bands:
For practical purposes, audible frequencies are divided into
25 main critical bands (Table 14.1)
For simplicity, it adopts a uniform width for all frequency
analysis filters, using 32 overlapping subbands
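To make the uniform-width choice concrete, the sketch below divides the band from 0 Hz to the Nyquist frequency into 32 equal subbands. The sampling rates used, the helper names, and the idealized non-overlapping mapping are assumptions for illustration; the actual filters overlap, so a tone near a band edge appears in two adjacent subbands.

```python
NUM_SUBBANDS = 32  # uniform-width analysis filters, as stated above


def subband_width_hz(sample_rate_hz: float) -> float:
    """Nominal width of each subband: the Nyquist band split into 32 equal parts."""
    return (sample_rate_hz / 2.0) / NUM_SUBBANDS


def subband_index(f_hz: float, sample_rate_hz: float) -> int:
    """Idealized subband (0..31) containing f_hz, ignoring filter overlap."""
    if not 0.0 <= f_hz <= sample_rate_hz / 2.0:
        raise ValueError("frequency must lie between 0 and the Nyquist frequency")
    width = subband_width_hz(sample_rate_hz)
    # Clamp so that the Nyquist frequency itself maps to the last subband.
    return min(int(f_hz // width), NUM_SUBBANDS - 1)


# Assumed sampling rates, for illustration only.
for rate in (32000, 44100, 48000):
    print(rate, "Hz sampling:", subband_width_hz(rate), "Hz per subband")
```

At a 48 kHz sampling rate each subband is 750 Hz wide, far wider than the critical bands at low frequencies. That mismatch is the price of the simpler uniform filter bank.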
[Figure: basic MPEG audio encoder and decoder.
Encoder: audio (PCM) input -> time-to-frequency transformation -> bit allocation, quantizing and coding -> bitstream formatting -> encoded bitstream; psychoacoustic modeling determines what to drop.
Decoder: encoded bitstream -> bitstream unpacking -> frequency sample reconstruction -> frequency-to-time transformation -> decoded PCM audio.]
[Figure: audio (PCM) samples enter 32 subband filters (subband filter 0 to subband filter 31).
Each subband filter produces 1 sample out for every 32 samples in.
Output samples are grouped in sets of 12 per subband.
Layer 1 frame: one group of 12 samples from each subband.
Layer 2 and Layer 3 frame: three groups of 12 samples from each subband.]
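The frame grouping above implies a fixed number of PCM samples per frame for each layer. The sketch below works out that arithmetic. The function names and the 48 kHz sampling rate in the usage loop are assumptions for illustration.

```python
NUM_SUBBANDS = 32        # subband filters 0..31
SAMPLES_PER_GROUP = 12   # samples per subband in one group
GROUPS_PER_FRAME = {1: 1, 2: 3, 3: 3}  # Layer 1: one group; Layers 2 and 3: three


def samples_per_frame(layer: int) -> int:
    """PCM samples represented by one frame of the given layer (1, 2, or 3)."""
    return NUM_SUBBANDS * SAMPLES_PER_GROUP * GROUPS_PER_FRAME[layer]


def frame_duration_ms(layer: int, sample_rate_hz: float) -> float:
    """Duration of audio covered by one frame, in milliseconds."""
    return 1000.0 * samples_per_frame(layer) / sample_rate_hz


# Assumed sampling rate of 48 kHz, for illustration only.
for layer in (1, 2, 3):
    print(f"Layer {layer}: {samples_per_frame(layer)} samples, "
          f"{frame_duration_ms(layer, 48000)} ms at 48 kHz")
```

Because each filter emits 1 sample for every 32 input samples, 12 output samples per subband correspond to 32 x 12 = 384 input samples for Layer 1, and three such groups give 1,152 samples for Layers 2 and 3.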
Main difference:
Advantage:
Main difference:
[Figure labels: PCM audio signal; Huffman coding; bitstream formatting; coded audio signal]