MPEG1 Audio Coding Algorithm

MPEG1 Audio Coding Algorithm

ISO/IEC JTC1/WG11ISO/IEC JTC1/WG11

Presented byPresented by

Chai-Yeol RimChai-Yeol Rim

Contents

Physiology of Human Ear Physiology of Human Ear Data ReductionData Reduction Encoding structure of MPEG1 AudioEncoding structure of MPEG1 Audio

Physiology of Human Ear (1) PsychoacousticsPsychoacoustics

귀의 구조귀의 구조

Physiology of Human Ear (2) PsychoacousticsPsychoacoustics

주파수 분별력 주파수 분별력 1000Hz1000Hz 와 와 1002Hz1002Hz 의 구분의 구분 ??

Loosely coupled brain’s functionLoosely coupled brain’s function좌우 귀가 좌우 뇌와 꼬여 존재좌우 귀가 좌우 뇌와 꼬여 존재좌측 뇌는 좌측 뇌는 Speech, Speech, 우측 뇌는 우측 뇌는 melodicmelodic 정보에 익숙 정보에 익숙

주파수에 주파수에 logarithmiclogarithmic 한 특징을 갖는다한 특징을 갖는다100~200Hz100~200Hz 구간과 구간과 1K~2KHz1K~2KHz 의 구간이 같게 느낌의 구간이 같게 느낌

WideWide 한 한 dynamic rangedynamic range0dB~120dB0dB~120dB 의 인지도의 인지도

Physiology of Human Ear (3) Robinson-Dadson equal loudness contourRobinson-Dadson equal loudness contour

Physiology of Human Ear (4) Critical BandsCritical Bands

주파수 분별력 주파수 분별력 주파수 고저에 따라 차이주파수 고저에 따라 차이Critical Bandwidth = 24.7(4.37F +1)Critical Bandwidth = 24.7(4.37F +1)

Eberhard Zwicker Eberhard Zwicker 15KHz15KHz 이하에서의 이하에서의 24 critical band24 critical band 를 작성를 작성

Physiology of Human Ear (5)

Physiology of Human Ear (6) 청각의 청각의 ThresholdThreshold 와 와 MaskingMasking

Physiology of Human Ear (7) Masking curveMasking curveA :A : 주파수주파수B :Critical-BandB :Critical-Band

Physiology of Human Ear (8) Masking thresholdMasking threshold

sound pressure levelsound pressure level 에 따라 가변에 따라 가변

Data Reduction (1) Perceptual codingPerceptual coding

Sampling Sampling 주파수의 변경 주파수의 변경 : Nyquist theorem: Nyquist theorem 에 제한에 제한Dynamic range Dynamic range 변경 변경 : Quantization Noisy: Quantization Noisy 가중가중

Data Reduction (2) Two type of frequency domain coderTwo type of frequency domain coder

Transform Coder :Transform Coder :

Good frequency resolutionGood frequency resolution

Poor time resolutionPoor time resolution Subband Coder :Subband Coder :

Poor frequency resolutionPoor frequency resolution

Good time resolutionGood time resolution

Data Reduction (3) Basic structure of Encoder / DecoderBasic structure of Encoder / Decoder

Data Reduction (4) Sub-Band CoderSub-Band Coder

Data Reduction (5) Sub-Band CodingSub-Band Coding 의 과정 의 과정 (1)(1)

Data Reduction (6) Sub-Band CodingSub-Band Coding 의 과정 의 과정 (2)(2)

Data Reduction (7) Bit allocation Bit allocation

Audible Audible 신호의 크기에 따라서 신호의 크기에 따라서 bitbit 할당할당

MPEG1 Audio Encoding 구조 (1) MPEG-1 Audio StandardMPEG-1 Audio Standard

Sampling rate : Sampling rate : 32, 44.1, 48KHz32, 44.1, 48KHz Bit rate : Bit rate : 32~224kbps/channel32~224kbps/channel

64~448kbps for stereo64~448kbps for stereo Background Two AlgorithmBackground Two Algorithm

MUSCAM (Masking-pattern Adaptive SubbaMUSCAM (Masking-pattern Adaptive Subband Coding And Multiplexing)nd Coding And Multiplexing)

ASPEC (Adaptive Spectral Perceptual EntroASPEC (Adaptive Spectral Perceptual Entropy Coding)py Coding)

MPEG1 Audio Encoding 구조 (2) MPEG-1 Audio StandardMPEG-1 Audio Standard

3 Layer coding : 3 Layer coding : Layer I, II, IIILayer I, II, III Layer I : Layer I : 상대적으로 상대적으로 High rateHigh rate 인 경우 사용인 경우 사용 Layer II : Layer II :

채널당 채널당 96~128kbps96~128kbps 전송율전송율Layer IIA Layer IIA 는 는 joint stereo versionjoint stereo version

Layer III : Layer III : Low data rate Low data rate 을 대상을 대상 , , 가장 복잡가장 복잡

Layer I,IILayer I,II 는 는 MUSCAMMUSCAM 을 을 Layer IIILayer III 는 는 ASPECASPEC

MPEG1 Audio Encoding 구조 (3) MPEG-1 Layer I CodingMPEG-1 Layer I Coding

MUSCAMMUSCAM 방식의 응용방식의 응용 3232 개의 개의 subband codingsubband coding 512 point512 point 의 의 FFTFFT 로 로 psychoacoustic modelipsychoacoustic modeli

ngng Subband dataSubband data 에 대한 에 대한 bit allocationbit allocation 수행수행 12 subband sample 12 subband sample 에 대한 에 대한 codingcoding

MPEG1 Audio Encoding 구조 (4)

MPEG1 Audio Encoding 구조 (5) MPEG-1 Layer II CodingMPEG-1 Layer II Coding

MUSCAMMUSCAM 방식의 응용방식의 응용 3232 개의 개의 subband codingsubband coding 1024 point1024 point 의 의 FFTFFT 로 로 psychoacoustic mopsychoacoustic mo

delingdeling Subband dataSubband data 에 대한 에 대한 bit allocationbit allocation 수행수행 33 개의 개의 12 subband sample 12 subband sample 에 대한 에 대한 codincodin

gg Layer ILayer I 에 비한 효과적인 에 비한 효과적인 CodingCoding 수행수행

MPEG1 Audio Encoding 구조 (6) MPEG-1 MPEG-1

Layer I/II Layer I/II 의 비교의 비교

MPEG1 Audio Encoding 구조 (7) MPEG-1 Layer III CodingMPEG-1 Layer III Coding

MUSCAMMUSCAM 과 과 ASPECASPEC 방식의 합성방식의 합성 Layer I,II Layer I,II 비교 좋은 품질의 비교 좋은 품질의 audio coding audio coding 3232 개의 개의 subband codingsubband coding Subband Subband 계수를 계수를 MDCTMDCT 수행수행 1024 point1024 point 의 의 FFTFFT 로 로 psychoacoustic modepsychoacoustic mode

lingling Subband dataSubband data 에 대한 에 대한 bit allocationbit allocation 수행수행 Huffman & Run-length codingHuffman & Run-length coding

MPEG1 Audio Encoding 구조 (8)

MPEG1 Audio Encoding 구조 (9) MDCT block for Layer III codingMDCT block for Layer III coding

MPEG1 Audio Encoding 구조 (10) Psychoacoustic ModelingPsychoacoustic Modeling

512, 1024 FFT 512, 1024 FFT 수행수행 Subband data codingSubband data coding 을 위한 최대 을 위한 최대 SPLSPL 결정결정 QuietQuiet 상태의 상태의 absolute thresholdabsolute threshold 구함구함 TonalTonal 과 과 nontonalnontonal 성분을 구함성분을 구함 Masking thresholdMasking threshold 를 구함를 구함 Global masking thresholdGlobal masking threshold 를 구함를 구함 각 각 subbandsubband 에 대한 에 대한 Minimum masking threshMinimum masking thresh

oldold 를 구함를 구함 NMR = SNR - SMR NMR = SNR - SMR 계산계산

MPEG1 Audio Encoding 구조 (11) MPEG1MPEG1

LayerI,II LayerI,II

EncodingEncoding

Flow-chartFlow-chart

정리 MPEG1 Layer I,II,III CodingMPEG1 Layer I,II,III Coding

Psychoacoustic modelingPsychoacoustic modelingSubband CodingSubband CodingTransform coding (MDCT)Transform coding (MDCT)Bit allocationBit allocationHuffman entropy codingHuffman entropy coding

MPEG1MPEG1 의 음질의 결정 요인의 음질의 결정 요인Psychoacoustic modelingPsychoacoustic modelingBit allocationBit allocation

Documents

MPEG1 Audio Coding Algorithm