Download A/85:2013
Transcript
ATSC A/85:2013 Establishing and Maintaining Audio Loudness, Annex F 12 March 2013 The loudness of each channel is calculated individually, then summed to get the overall loudness of the program. (The loudness weighting curve is shown in Figure F.8). The peak level calculation has to include the center and surround channel downmixing coefficients (cmixlev and surmixlev) selected (ideally) during the production process. The output of the calculation is the maximum peak level of the worst case downmix condition. Loudness and peak values are normalized by the overall program dialnorm value before being passed to the subsequent calculation stages. The loudness and peak level data used for the “compr” gain word calculations represent these values over an entire AC-3 sync frame (32 ms). The audio decoder includes an overlap and add, thus even the coarsely timed “compr” gain words are applied smoothly. The dynamic range control portion of the AC-3 system has to deal with all types of program material, so it includes a method of adjusting the attack and decay time constants that are determined automatically depending upon the program material.. The audio data after loudness normalization is mapped onto the selected DRC profile to generate a static gain value. It is also used to select one of four time constants, depending on whether the input loudness is larger (attack) or smaller (decay) than the smoothed loudness, and by how much. If the input loudness is larger than the smoothed loudness, and the amount is greater than the attack threshold, then the "fast attack" time constant is selected. This is generally a very fast time constant, designed to provide very quick convergence of the compressor for very loud events. If the input loudness is larger than the smoothed loudness, but not by more than the attack threshold, then the "slow attack" time constant is selected. This is generally a moderately fast time constant, designed to provide reasonably quick convergence of the dynamic range controller for moderately loud events If the input loudness is smaller than the smoothed loudness, but not by more than the decay threshold, then the "slow decay" time constant is selected. This is generally a very slow time constant, designed to provide slow and therefore inaudible release of the compressor during soft passages. If the input loudness is smaller than the smoothed loudness, and the amount is greater than the decay threshold, then the "fast decay" time constant is selected. This is generally a moderate time constant, designed to provide a faster release at the end of very loud events. The other principal part of the DRC data generation process is the peak limiting function. The clip level is known (0 dB FS) thus the difference between that and the normalized peak level of the program is the maximum allowable gain before clipping. If the static gain words call for more gain, the smooth and limit gain block limits the static gain words to the maximum allowable gain. The time constant just computed ensures that the gain changes will not be abrupt enough to cause objectionable artifacts, and are appropriate to the program material. Note that there are two parallel processes going on; one running at the audio block rate (~ 5.3 ms) generating dynrng gain words, and the other running at the sync frame rate (32 ms) generating the compr gain words. 55