KR20060008078A

KR20060008078A - A method and a apparatus of advanced low bit rate linear prediction coding with plp coefficient for mobile phone

Info

Publication number: KR20060008078A
Application number: KR1020040057739A
Authority: KR
Inventors: 김찬우
Original assignee: 엘지전자 주식회사
Priority date: 2004-07-23
Filing date: 2004-07-23
Publication date: 2006-01-26
Also published as: KR100619893B1; ATE480852T1; CN1737904A; EP1619665A1; EP1619665B1; JP2006039559A; DE602005023385D1

Abstract

A voice coding apparatus and method of a mobile communications terminal can embody higher compressibility and ensure high sound quality, compared with the case of using a Linear Prediction (LP) coefficient, by performing a Linear Predictive Coding (LPC) using a Perceptual Linear Prediction (PLP) coefficient.

Description

Improved low-rate linear predictive coding apparatus and method for mobile terminals {A METHOD AND A APPARATUS OF ADVANCED LOW BIT RATE LINEAR PREDICTION CODING WITH PLP COEFFICIENT FOR MOBILE PHONE}

도1 은 일반적인 저전송률의 선형예측코딩 음성신호 출력 모델 기능 구성도, 1 is a block diagram of a typical low-prediction linear predictive coded speech signal output model;

도2 는 종래 기술 휴대단말기의 저전송률 선형예측코딩 장치 기능 구성도, 2 is a functional block diagram of a low-rate linear predictive coding apparatus of a conventional mobile terminal;

도3 은 종래 기술에 의한 휴대단말기의 저전송률 선형예측코딩 방법 순서도, 3 is a flow chart of a low-rate linear prediction coding method of a portable terminal according to the prior art;

도4 는 본 발명 휴대단말기의 개선된 저전송률 선형예측코딩 장치 기능구성도, 4 is a functional block diagram of an improved low-rate linear predictive coding apparatus of the present invention portable terminal.

도5 는 본 발명 휴대단말기의 피엘피 계수부 상세 기능구성도, 5 is a detailed functional diagram of the PLP counter of the mobile terminal of the present invention;

도6 은 본 발명 휴대단말기의 개선된 저전송률 선형예측코딩 방법 순서도. 6 is a flowchart of an improved low-rate linear predictive coding method of a mobile terminal of the present invention.

** 도면의 주요 부분에 대한 부호 설명 **** Explanation of symbols on the main parts of the drawing **

500 : 암호부 510 : 피엘피 계수부 511 : 에프에프티부500: encryption unit 510: PLP counter 511: FFT unit

512 : 필터뱅크부 513 : 라우드니스부 514 : 매칭부512: filter bank section 513: loudness section 514: matching section

515 : 역에프에프티부 516 : 위상처리부 517 : 주파수특성부515: inverse FFT part 516: phase processing part 517: frequency characteristic part

520 : 유성무성식별부 530 : 무성차단부 540 : 피치검출부520: voiceless voice identification unit 530: voice breaking unit 540: pitch detection unit

550 : 코딩부550: coding unit

본 발명은 휴대단말기의 음성신호 저전송률 코딩에 관한 것으로, 특히, 청각적 효과가 반영된 PLP 계수를 사용하여 음성신호를 높은 압축률로 코딩하면서도 낮은 전송률로 전송하는 휴대단말기의 개선된 저전송률 선형예측코딩 장치 및 방법에 관한 것이다. The present invention relates to low bit rate coding of a speech signal of a mobile terminal, and in particular, improved low bit rate linear prediction coding of a portable terminal for transmitting a low bit rate while coding a speech signal at a high compression rate using a PLP coefficient reflecting an auditory effect. An apparatus and method are provided.

이동통신 시스템은 휴대단말기(MS: MOBILE STATION)를 이용하여 해당 기지국(RAN: RADIO ACCESS NETWORK)이 형성하는 서비스 영역(SERVICE AREA) 안을 자유롭게 이동하면서 이동교환국(MSC: MOBILE SWITCHING CENTER)의 감시와 제어와 스위칭(SWITCHING)에 의하여 설정된 통신경로를 경유하고, 언제 어디서나 원하는 상대방과 즉시 무선접속하여 통신하는 것으로, 개인이 항상 직접 휴대하면서 어디든지 이동하는 첨단 무선통신장비 이다. The mobile communication system freely moves within the service area formed by the base station (RAN: RADIO ACCESS NETWORK) using a mobile terminal (MS: MOBILE STATION) and monitors and controls the mobile switching station (MSC: MOBILE SWITCHING CENTER). Through the communication path set by the switching (SWITCHING), it is an advanced wireless communication equipment to move anywhere and always carry the person directly by wireless connection to communicate with the desired party anytime and anywhere.

상기 휴대단말기(UE)를 포함하는 이동통신 시스템은, 초기에 음성급 신호를 이용하는 통신방식으로 운용되고, 점차 통신요구 및 전송하고자 하는 데이터의 량이 많아지므로 숫자, 문자, 기호 등을 이용하는 메시지 데이터 통신 기능이 부가되었고, 현재의 3세대(3GPP) 이동통신 시스템은, 상기의 음성급 신호와 문자급 신호에 영상신호가 포함되는 멀티미디어급 통신을 제공하는 방식으로 발전하고 있다. The mobile communication system including the mobile terminal (UE) is initially operated in a communication method using a voice level signal, and gradually increases the communication request and the amount of data to be transmitted, message data communication using numbers, letters, symbols, etc. Functions have been added, and current generation 3GPP mobile communication systems are being developed in such a manner as to provide multimedia-class communication in which video signals are included in the above-mentioned voice and text signals.

상기 이동통신 시스템은 한정된 무선자원을 이용하여 다수 가입자가 동시에 채널을 할당받아 점유하고, 통신신호를 전송하며, 상기 채널은 다수가 동시에 사용하도록 하기 위하여 대역폭을 제한하므로, 전송할 수 있는 데이터 전송속도(BIT RATE)가 제한된다. The mobile communication system uses a limited radio resource to occupy and occupy a plurality of subscribers at the same time, transmit a communication signal, and the channel is limited in bandwidth so that the channel can be used simultaneously. BIT RATE) is limited.

상기와 같이 할당된 채널을 통하여 전송할 수 있는 데이터 속도가 제한되므로, 다수가 동시에 채널을 할당받아 통신할 수 있는 장점이 있으나, 상기 통신 이용자는 서로 상대방에게 많은 데이터를 전송하고자 한다. Since the data rate that can be transmitted through the allocated channel is limited as described above, a plurality of channels can be simultaneously assigned and communicated with each other. However, the communication users want to transmit a lot of data to each other.

상기와 같이 할당된 채널의 제한된 데이터 전송속도를 이용하여 많은 데이터를 전송하는 기술이 압축 코딩기술이며, 상기 압축 코딩기술은 압축률이 높을 수록, 적은 크기의 데이터로 많은 정보를 전송한다. As described above, a technique of transmitting a large amount of data using a limited data transmission rate of an allocated channel is a compression coding technique. The compression coding technique transmits a large amount of information with a smaller size as the compression ratio is higher.

상기 휴대단말기에서의 음성신호 압축에는 다수 코딩방식이 있으며, 데이터 전송속도로 구분되는 압축 코딩방식에는 넓은 대역폭 또는 높은 전송속도의 채널을 필요로 하는 것으로, 약 16 KBPS 보다 높은 전송률의 고전송률(HIGH BIT RATE) 코딩(CODING) 방식과, 보통의 대역폭 또는 전송속도의 채널을 필요로 하는 것으로, 2.4 KBPS 내지 16 KBPS 범위의 전송률에 의한 중전송률(MEDIUM BIT RATE) 코딩 방식과 낮은 대역폭 또는 전송률의 채널을 필요로 하는 것으로, 75 BPS 내지 2.4 KBPS 범위의 전송률에 의한 저전송률(LOW BIT RATE) 코딩 방식이 있다. There are many coding schemes for voice signal compression in the mobile terminal, and a compression coding scheme divided into data transmission rates requires a wide bandwidth or a channel having a high transmission rate, and has a high data rate higher than about 16 KBPS. BIT RATE coding method and a channel having a normal bandwidth or transmission rate, the medium bit rate coding method and a low bandwidth or transmission channel with a transmission rate ranging from 2.4 KBPS to 16 KBPS As a need for a low bit rate (LOW BIT RATE) coding scheme using a data rate ranging from 75 BPS to 2.4 KBPS.

상기 음성신호의 코딩에서 샘플링 레이트는 일반적으로 8 KBPS를 사용하고, 고전송율(HIGH BIT RATE) 코딩 방식은 일반적인 오디오 코딩을 이용하는 스피치 코딩(SPEECH CODING) 방식, PCM(PULSE CODE MODULATION) 코딩 방식, ADPCM(ADAPTIVE DELTA PULSE CODE MODULATION) 코딩 방식 등이 있고, 중전송률(MEDIUM BIT RATE) 코딩 방식은 CELP와 해당 VARIATION에 의한 것으로, LD-CELP 코딩 방식, CS-ACELP 코딩 방식, VSELP 코딩 방식, MELP 코딩 방식과, 필요에 따라, 높은 주파수 성분(8 KHz 내지 16 KHz)을 다른 방식으로 코딩한 광대역 스피치 코딩(WIDE BAND SPEECH CODING) 방식 등이 있다. In the coding of the speech signal, a sampling rate is generally 8 KBPS, and the high bit rate coding scheme is a speech coding scheme using general audio coding, a pulse code modulation (PCM) coding scheme, or an ADPCM scheme. (ADAPTIVE DELTA PULSE CODE MODULATION) coding scheme, and the medium bit rate coding scheme is based on CELP and corresponding VARIATION, LD-CELP coding scheme, CS-ACELP coding scheme, VSELP coding scheme, MELP coding scheme And a wideband speech coding method in which a high frequency component (8 KHz to 16 KHz) is coded in another manner as necessary.

또한, 상기 저전송률(LOW BIT RATE) 코딩 방식은 LPC(LINEAR PREDICTIVE CODING) 코딩 방식, RELP(RANDOM EXCITED LINEAR PREDICTIVE CODING) 코딩 방식, FORMANTS VOCODER 코딩 방식, CEPSTRAL VOCODER 코딩 방식 등이 있다. The low bit rate coding scheme may include a LINEAR PREDICTIVE CODING (LPC) coding scheme, a RANDOM EXCITED LINEAR PREDICTIVE CODING (RELP) coding scheme, a FORMANTS VOCODER coding scheme, and a CEPSTRAL VOCODER coding scheme.

상기 고전송률 코딩은 많은 데이터를 전송하므로, 송수신되는 음질이 가장 좋으며, 음질을 중요시하는데 적용되고, 상기 중전송률 코딩은 음질이 보통이며, 상기 저전송률 코딩은 음질이 중요하지 않고, 내용을 확인하는데 충분한 응용분야에 적용된다. Since the high rate coding transmits a lot of data, the sound quality transmitted and received is best, and the sound quality is important. The medium rate coding is sound quality in general, and the low rate coding is not important for sound quality. Applicable to sufficient applications.

상기 휴대단말기에 할당된 채널은 데이터 전송속도(BIT RATE)가 고정 제한되어 있고, 전송하고자 하는 정보의 량은 음성신호를 포함하여, 문자 등의 데이터 신호와, 영상 또는 이미지 신호가 포함되어 많으므로, 압축 코딩 기술을 사용하는 동시에, 음성급 신호는 적은 대역폭을 점유하는 저전송률 코딩 방식을 채택한다. Since the data rate is fixedly limited in the channel allocated to the mobile terminal, the amount of information to be transmitted includes a voice signal, a data signal such as a text, and a video or image signal. In addition to using compression coding techniques, speech-grade signals adopt a low rate coding scheme that occupies less bandwidth.

따라서, 상기와 같이 저전송률(LOW BIT RATE) 코딩 방식으로 음성을 압축하여 전송하는 경우, 음질이 나쁜 문제가 있으므로, 저전송률 코딩 방식에서 음질을 제고하는 동시에 압축 효율을 높이는 기술 개발의 필요가 있다. Therefore, when voice is compressed and transmitted using a low bit rate coding scheme as described above, there is a problem in that the sound quality is bad. Therefore, there is a need to develop a technology for improving sound quality and improving compression efficiency in the low bit rate coding scheme. .

이하, 종래 기술에 의한 휴대단말기의 저전송률 음성코딩 장치 및 방법을 첨부된 도면을 참조하여 설명한다. Hereinafter, a low bit rate voice coding apparatus and method of a portable terminal according to the prior art will be described with reference to the accompanying drawings.

종래 기술을 설명하기 위하여 첨부된 것으로, 도1 은 일반적인 저전송률의 선형예측코딩 음성신호 출력 모델 기능 구성도 이고, 도2 는 종래 기술에 의한 휴 대단말기의 저전송률 선형예측코딩 장치 기능 구성도 이며, 도3 은 종래 기술에 의한 휴대단말기의 저전송률 선형예측코딩 방법 순서도 이다. Attached to explain the prior art, FIG. 1 is a functional block diagram of a typical low-prediction linear predictive coding voice signal output model, and FIG. 2 is a functional block diagram of a low-rate linear predictive coding apparatus of a mobile terminal according to the prior art. 3 is a flowchart of a low-rate linear predictive coding method of a mobile terminal according to the prior art.

상기 도1을 참조하여, 일반적인 저전송률의 선형예측코딩 음성신호 출력모델을 설명하면, 유성음부(10)는 피치주기(PITCH PERIOD) 신호를 인가받고, 상기 입력된 피치주기 신호의 주기에 의하여 임펄스 열(IMPULSE TRAIN)에 의한 유성음(VOICE SOUND)을 생성(GENERATION)하여 출력하며, 상기 유성음부(10)로부터 출력되는 유성음은 글로탈(GLOTTAL) 필터부(20)에 인가되어 음성신호 성분으로 형성(SHAPING)되어 출력된다. Referring to FIG. 1, a general low-prediction linear predictive coded voice signal output model will be described. The voiced sound unit 10 receives a pitch period signal and impulses the period of the input pitch period signal. Generate and output VOICE SOUND by IMPULSE TRAIN, and the voiced sound output from the voiced sound unit 10 is applied to the GLOTTAL filter unit 20 to form a voice signal component. It is outputted with SHAPING.

상기 글로탈 필터부(20)에서 출력되는 신호는, 멀티플라이어부(30)에 인가되어 음성게인(VOICE GAIN)(Av) 값이 곱하여지므로, 적정한 레벨로 조절되어 스위치부(60)에 인가된다. Since the signal output from the global filter unit 20 is applied to the multiplier unit 30 and multiplied by a VOICE GAIN value, the signal is adjusted to an appropriate level and applied to the switch unit 60.

무성음부(40)는, 랜덤노이즈(RANDOM NOISE)에 의한 무성음(UNVOICE SOUND) 신호를 생성하여 해당 멀티플라이어부(50)에 출력하고, 상기 멀티플라이어부(50)는 잡음게인(NOISE GAIN)에 의한 An값을 곱하므로, 적정한 레벨로 조절되어 스위치부(60)에 인가된다. The unvoiced sound unit 40 generates a unvoiced sound signal based on random noise and outputs it to the corresponding multiplier unit 50, and the multiplier unit 50 generates an noise due to noise gain. Since the value is multiplied, it is adjusted to an appropriate level and applied to the switch unit 60.

상기 스위치부(60)는, 유성음 신호와 무성음 신호를 스위칭하여 선택 출력하므로, 여기된 신호(EXCITATION SIGNAL)Ug[n]를 출력하며, 상기 여기된 신호(Ug[N])는 보칼필터부(70)에 인가되어 보칼트랙(VOCAL TRACT) 처리한 신호(Hvocal(z))로 출력한다. Since the switch unit 60 selects and outputs a voiced signal and an unvoiced signal, the switch unit 60 outputs an excited signal (EXCITATION SIGNAL) Ug [n], and the excited signal Ug [N] is a vocal filter unit ( 70 is output as a signal Hvocal (z) processed by VOCAL TRACT.

상기 Hvocal(z)은 보칼필터부(70)를 DISCRETE-TIME MODELING 한 것이며, 이 것을 기반으로 선형예측코딩(LPC: LINEAR PREDICTION CODING)을 한다. The Hvocal (z) is a DISCRETE-TIME MODELING of the vocal filter unit 70, and performs linear prediction coding (LPC: LINEAR PREDICTION CODING) based on this.

즉, 무성음 신호(UNVOICED SIGNAL)는 RANDOM NOISE로 여기(EXCITATION)되어 발생되지 않으나, LPC 보코더(VOCODER)는 RANDOM NOISE로 여기 해주고, 유성음부(10)의 주기는 피치(PITCH)가 되며, PITCH EXTRACTION ALGORITHM으로 추출한다. That is, the unvoiced signal (UNVOICED SIGNAL) is not generated by being excited as RANDOM NOISE, but the LPC vocoder (VOCODER) is excited by RANDOM NOISE, and the period of the voiced sound part 10 becomes pitch (PITCH), and PITCH EXTRACTION Extract with ALGORITHM.

상기 보칼필터부(70)의 신호(Hvocal(z))는 래디에이션 필터부(80)에 인가되어 래디에이션(RADIATION) 처리(Hrad(z))하므로 최종적인 스피치 신호(S[n])로 출력한다. The signal Hvocal (z) of the vocal filter unit 70 is applied to the radiation filter unit 80 and radiated (Hrad (z)) so as to give the final speech signal S [n]. Output

상기와 같은 구성에 의한 것으로, 일반적인 저전송률의 선형예측코딩 음성신호 출력 모델은, L. R. RABINER 와 R. W. SCHAFER에 의하여 1978년 발표된 문헌 "DIGITAL PROCESSING OF SPEECH SIGNAL" ENGLEWOOD CLIFFS, NJ: PRENTICE HALL 에 자세히 설명되어 있다. With the above configuration, a general low-prediction linear predictive coded speech signal output model is described in detail in the document "DIGITAL PROCESSING OF SPEECH SIGNAL" ENGLEWOOD CLIFFS, NJ: PRENTICE HALL, published in 1978 by LR RABINER and RW SCHAFER. It is.

상기와 같은 모델에서 피치주기(PITCH PERIOD)는 약간씩 차이가 있으나, 음성급 스피치 신호가 출력되는 기능을 회로적인 기능으로 설명할 수 있다. In the above model, the pitch period (PITCH PERIOD) is slightly different, but the function of outputting the speech level speech signal can be described as a circuit function.

이하, 상기 첨부된 도2를 참조하여, 종래 기술에 의한 휴대단말기의 저전송률 선형예측코딩 장치를 설명한다. Hereinafter, a low rate linear predictive coding apparatus of a portable terminal according to the prior art will be described with reference to FIG. 2.

휴대단말기의 암호부(100)에 스피치 신호(S[n])가 입력되면, 오토코리레이션부(110)에서 입력하여 오토코리레이션(AUTOCORRELATION) 함수(FUNCTION)를 연산(COMPUTATION)한 신호(rx[n])를 출력하며, 상기 신호(rx[n])는 LP 계수부(130)에 인가되어 계수(an)와 게인(G)이 연산되어 코딩부(160)에 출력된다. When the speech signal S [n] is input to the encryption unit 100 of the portable terminal, a signal rx obtained by inputting from the autocorrelation unit 110 to compute an AUTOCORRELATION function FUNCTION. [n]) is output, and the signal rx [n] is applied to the LP coefficient unit 130 to calculate the coefficients an and the gain G, and output them to the coding unit 160.

또한, 상기 암호부(100)에 입력된 스피치 신호(S[n])는, 유성/무성식별부 (120)에 인가되고 분석되므로, 유성음 신호인지 무성음 신호인지를 구별하는 해당 식별신호를 생성하여 상기 코딩부(160)에 인가하는 동시에, 상기 입력되는 스피치 신호(S[n])를 무성차단부(140)에 출력하여 무성음 신호를 차단하고, 유성음 신호만을 통과시켜 피치검출부(150)에 인가한다. In addition, since the speech signal S [n] input to the encryption unit 100 is applied to and analyzed by the voiced / voiceless identification unit 120, it generates a corresponding identification signal for distinguishing whether it is a voiced sound signal or an unvoiced sound signal. At the same time, the speech signal S [n] is outputted to the unblocking unit 140 to block the unvoiced sound signal, and passes only the voiced sound signal to the pitch detection unit 150. do.

상기 피치검출부(150)는 입력되는 유성음 신호로부터 피치주기(PITCH PERIOD) 신호(P)를 검출하여 상기 코딩부(160)에 출력한다. The pitch detector 150 detects a pitch period signal P from the voiced sound signal input and outputs the pitch period signal P to the coding unit 160.

상기 코딩부(160)는 입력되는 계수(an)와, 게인(G)과, 피치주기(P)와, 유성/무성 식별신호를 파라메터 코딩(CODING) 또는 저전송률로 인코딩(ENCODING)하여 제어부(300)에 출력하고, 상기 제어부(300)의 해당 처리에 의하여 무선부(400)에 인가되므로, 지정된 상대방에 송신한다. The coding unit 160 encodes the input coefficient (an), the gain (G), the pitch period (P), and the voiced and unvoiced identification signal by parameter coding (CODING) or a low data rate to control the control unit ( 300 is output to the wireless unit 400 by the corresponding process of the control unit 300, and transmits to the designated counterpart.

상기 무선부(400)를 통하여 상대방으로부터 수신된 저전송률 음성급 신호는 제어부(300)의 해당 분석 및 제어에 의하여 복호부(200)에 인가되고, 상기 복호부(200)는 디코딩부(210)에서 입력하여, 해당 파라메터(PARAMETER)를 분리검출하고, 상기 검출된 파라메터를 각각 출력한다. The low rate voice level signal received from the other party through the wireless unit 400 is applied to the decoding unit 200 by the corresponding analysis and control of the control unit 300, the decoding unit 200 is the decoding unit 210 Inputting from, separates and detects the corresponding parameter and outputs each detected parameter.

상기 디코딩부(210)로부터 출력되는 파라메터 중에서, 피치주기(P) 파라메터는 유성음발생부(220)에 인가되어 유성음 신호를 해당 피치주기에 의하여 생성하도록 하며, 상기 생성된 유성음은 스위치부(240)에 출력한다. Among the parameters output from the decoding unit 210, a pitch period (P) parameter is applied to the voiced sound generator 220 to generate a voiced sound signal by the pitch period, the generated voiced sound is the switch unit 240 Output to

상기 스위치부(240)는 무성음발생부(230)로부터 인가되는 무성음신호도 함께 입력하고, 상기 디코딩부(210)로부터 출력되는 파라메터 중에서 유성/무성 식별신호 파라메터를 입력하며, 상기 유성음발생부(220)로부터 인가되는 유성음신호와 상 기 무성음발생부(230)로부터 인가되는 무성음신호를 선택적으로 멀티플라이어부(250)에 출력한다. The switch unit 240 also inputs an unvoiced sound signal applied from the unvoiced sound generator 230, inputs a voiced / unvoiced identification signal parameter among the parameters output from the decoder 210, and the voiced sound generator 220. The voiced sound signal applied from) and the unvoiced sound signal applied from the unvoiced sound generator 230 are selectively output to the multiplier unit 250.

상기 멀티플라이어부(250)는, 상기 디코딩부(210)로부터 인가되는 게인(G) 파라메터에 의하여, 상기 스위치부(240)로부터 입력되는 유성음신호와 무성음신호를 소정 레벨로 조정하여 신서사이저부(260)에 출력하며, 상기 신서사이저부(260)는 상기 디코딩부(210)로부터 인가되는 계수(an)에 의하여 상기 멀티플라이어부(250)로부터 입력되는 신호를 처리하므로 스피치 신호(S[n])로 복조하여 출력한다. The multiplier unit 250 adjusts the voiced sound signal and the unvoiced sound signal input from the switch unit 240 to a predetermined level by a gain (G) parameter applied from the decoding unit 210 to the synthesizer unit 260. The synthesizer 260 demodulates the speech signal S [n] because the synthesizer 260 processes a signal input from the multiplier 250 by a coefficient applied from the decoder 210. Output

상기와 같은 구성의 종래 기술에 의한 저전송률 선형예측코딩 장치는 구성이 복잡하고 부피가 커지며 제조시간과 비용이 많이 소요되는 등의 문제가 있다. The low-rate linear predictive coding device according to the prior art of the above-described configuration has a problem that the configuration is complicated, bulky, manufacturing time and cost are high.

이하, 상기 첨부된 도3을 참조하여, 종래 기술에 의한 것으로, 휴대단말기의 저전송률 선형예측코딩 방법을 설명한다. Hereinafter, with reference to the accompanying FIG. 3, a low-rate linear predictive coding method of a portable terminal will be described.

휴대단말기(MS)로 음성신호의 저전송률(LOW BIT RATE) 코딩(CODING)하고 전송하는 경우(S10), 입력되는 음성신호 또는 스피치 신호를 오토코리레이션(AUTOCORRELATION) 연산(COMPUTATION)하고(S20), LP 계수(COEFFICIENT) 연산한다(S30). In the case of low bit rate coding (CODING) and transmitting the voice signal to the mobile terminal (MS) (S10), AUTOCORRELATION operation (COMPUTATION) of the input voice signal or speech signal (S20) , LP coefficient (COEFFICIENT) is calculated (S30).

상기 연산에서 구하여진 계수값(an)과 게인값(G)과 피치주기값(P)과 유성/무성 식별신호를 이용하여 파라메터 코딩하고 저전송률(LOW BIT RATE) 방식으로 지정된 상대방에게 전송한다(S40). Using the coefficient value (an), the gain value (G), the pitch period value (P), and the voiced / unvoiced identification signal obtained in the operation, the parameter is coded and transmitted to the counterpart designated by the low bit rate method (Low Bit Rate). S40).

상기 휴대단말기(MS)는, 상대방으로부터 음성신호를 수신하고, 디코딩(DECODING)하는 경우(S50), 저전송률 수신한 신호를 해당 디코딩 처리하여 복조된 음성급 신호 또는 스피치 신호로 출력한다(S60). When the mobile terminal MS receives a voice signal from the other party and decodes the signal (S50), the mobile terminal MS decodes the received low-rate signal as a demodulated voice-level signal or a speech signal (S60). .

상기와 같은 구성의 종래 기술은, 음성신호를 코딩 또는 인코딩하는 과정에서 오토코리레이션 연산과 엘피 계수 연산하므로 과정이 복잡한 동시에 사람의 청각적 효과를 반영하지 못하여 엠에스이(MSE: MEAN SQUARED ERROR) 측면에서 오차가 크게 발생하는 문제가 있다. In the prior art having the above-described configuration, the autocorrelation operation and the Elp coefficient operation are performed in the process of coding or encoding the voice signal, so that the process is complicated and does not reflect the human auditory effect, so in terms of MSE (MEAN SQUARED ERROR) There is a problem that a large error occurs.

본 발명은 휴대단말기의 음성급 신호를 코딩하고 저전송률로 송신하는데 있어서, 피엘피(PLP) 계수 처리하므로, 사람의 청각적 효과를 반영하고 엠에스이(MSE) 개념으로 오차를 적게하여 음질을 높이며, 비트레이트를 줄이어 압축효율을 제고하는 휴대단말기의 개선된 저전송률 선형예측코딩 장치 및 방법을 제공하는 것이 그 목적이다. The present invention encodes a voice signal of a mobile terminal and transmits it at a low data rate, thereby processing PLP coefficients, thereby reflecting the human auditory effect and increasing the sound quality by reducing errors in the MSE concept. It is an object of the present invention to provide an improved low-rate linear predictive coding apparatus and method of a portable terminal which reduces bitrate to improve compression efficiency.

상기와 같은 목적을 달성하기 위하여 안출한 본 발명은, 휴대단말기의 암호부에 입력되는 음성급 스피치 신호를 청각적 효과가 반영되는 피엘피 계수 처리하여 계수값과 게인값을 출력하는 피엘피 계수부와; 상기 피엘피 계수부에 입력되는 음성급 스피치 신호가 유성음 신호인지 무성음 신호인지를 판단하여 식별신호를 출력하는 유성무성식별부와; 상기 유성무성식별부의 출력신호를 입력하고 무성음 신호를 차단하여 유성음 신호만을 통과시키는 무성차단부와; 상기 무성차단부로부터 인가되는 유성음신호를 분석하여 피치주기를 검출하고 출력하는 피치검출부와; 상기 피엘피 계수부와 피치검출부와 유성무성식별부로부터 각각 인가되는 값들을 이용하여 낮은 차수의 저전송률 파라메터 코딩하는 코딩부가 포함되어 이루어지는 구 성을 특징으로 한다. The present invention devised in order to achieve the above object, the PLP counter for outputting the coefficient value and the gain value by processing the PELP coefficient of the speech-level speech signal input to the encryption unit of the mobile terminal reflecting the auditory effect Wow; A voiceless voice discrimination unit configured to determine whether the voice level speech signal input to the PLP counter is a voiced sound signal or an unvoiced voice signal and output an identification signal; An unvoiced blocker configured to input an output signal of the voiceless voice discriminator and to block only the voiced sound signal to pass only the voiced sound signal; A pitch detector for detecting and outputting a pitch period by analyzing voiced sound signals applied from the unvoiced cutoff unit; Characterized in that the configuration comprises a coding unit for low-order low-rate parameter coding using the values applied from the PLP coefficient unit, the pitch detection unit and the voiceless voice discriminator, respectively.

또한, 상기와 같은 목적을 달성하기 위하여 안출한 본 발명은, 휴대단말기의 암호부에 입력되는 음성급 스피치 신호를 저전송률로 처리하여 전송할 것인지 판단하는 과정과; 상기 과정에서 저전송률 전송하는 것으로 판단하는 경우, 패스트 퓨리어 트랜스폼 처리하고, 필터뱅크 처리하는 에프에프티 과정과; 상기 과정의 신호를 사람의 청각에 적합하게 라우드니스 처리하고 파워매칭 처리하여 출력하는 청각과정과; 상기 과정의 신호를 역패스트 퓨리어 트랜스폼 처리하고 청각에 적합하게 위상보정 처리하는 역에프에프티 과정과; 상기 과정의 신호를 청각에 적합하게 주파수 특성 처리하고 파라메터 코딩하여 저전송률로 송신하는 전송과정으로 이루어지는 것을 특징으로 한다. In addition, the present invention devised to achieve the above object, the process of determining whether to transmit the voice level speech signal input to the encryption unit of the portable terminal at a low transmission rate; An FFT process of performing fast Fourier transform processing and filter bank processing when it is determined that the low data rate is transmitted in the above process; An auditory process of outputting the signal of the process by loudness processing and power matching processing suitable for human hearing; An inverse FFT process of performing a reverse fast Fourier transform process on the signal of the process and performing phase correction processing suitable for hearing; Characterized in that the signal of the above process is characterized in that the process consisting of a transmission process for transmitting at a low transmission rate by processing the frequency characteristics and parameter coding.

이하, 본 발명에 의한 것으로, 휴대단말기의 개선된 저전송률 선형예측코딩 장치 및 방법을 첨부된 도면을 참조하여 설명한다. DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS Hereinafter, the present invention will be described with reference to the accompanying drawings.

본 발명을 설명하기 위하여 첨부된 것으로, 도4 는 본 발명에 의한 휴대단말기의 개선된 저전송률 선형예측코딩 장치 기능구성도 이며, 도5 는 본 발명에 의한 휴대단말기의 피엘피 계수부 상세 기능구성도 이고, 도6 는 본 발명에 의한 휴대단말기의 개선된 저전송률 선형예측코딩 방법 순서도 이다. 4 is a functional block diagram of an improved low-rate linear predictive coding device of a mobile terminal according to the present invention, and FIG. 5 is a detailed functional configuration of a PLP counter of a mobile terminal according to the present invention. 6 is a flowchart illustrating an improved low bit rate linear prediction coding method of a mobile terminal according to the present invention.

상기 도4를 참조하여, 본 발명에 의한 것으로, 휴대단말기의 개선된 저전송률 선형예측코딩 장치를 설명하면, 이동통신용 휴대단말기(MS)의 암호부(500)에 입력되는 음성급 스피치 신호(S[n])를 청각적 효과가 반영되고 낮은 차수가 적용되는 피엘피(PLP: PERCEPTUAL LINEAR PREDICTION) 계수 처리하여 계수값(an)과 게인값(G)을 출력하는 것으로, 휴대단말기(MS)의 암호부(500)에 입력되는 음성급 스피치 신호를 패스트 퓨리어 트랜스폼(FFT: FAST FOURIER TRANSFORM) 처리하여 출력하는 에프에프티부(511); 상기 에프에프티부(511)의 신호를 입력하여 소정 주파수 밴드별로 구분 여파하고 샘플링하여 출력하는 필터뱅크(CRITICAL BAND INTEGRATION AND RESAMPLING)부(512); 상기 필터뱅크부(512)의 신호를 입력하여 사람의 청각에 적합하도록 이퀄라이져 처리와 라우드니스 처리하여 출력하는 라우드니스(EQUAL-LOUDNESS CURVE)부(513); 상기 라우드니스부(513)의 신호를 입력하고 사람의 청각에 적합하게 전력레벨을 조정하는 매칭(POWER LAW OF HEARING)부(514); 상기 매칭부(514)의 신호를 입력하고 역(INVERSE) 패스트 퓨리어 트랜스폼(FFT) 처리 출력하는 역에프에프티부(515); 상기 역에프에프티부(515)의 신호를 입력하고 사람의 청각에 적합하게 위상보상하여 출력하는 위상처리(SOLVING SET OF LINEAR EQUATIONS)부(516); 상기 위상처리부(516)의 신호를 입력하고 사람의 청각에 적합하게 주파수 특성을 보상하여 출력하는 주파수특성(CEPSTRAL RECURSION)부(517)로 이루어지고, 암호부(500)에 입력되는 음성급 스피치 신호를 분석하여 사람의 청각적 효과가 낮은 차수로 반영되는 계수값과 이득값의 파라메터를 각각 검출하여 코딩부(550)에 출력하는 피엘피 계수부(510)와, Referring to FIG. 4, according to the present invention, an improved low-rate linear predictive coding apparatus of a mobile terminal will be described. A voice level speech signal S input to an encryption unit 500 of a mobile terminal MS will be described. [n]) processes the PEL (PLP: PERCEPTUAL LINEAR PREDICTION) coefficients to which the auditory effect is reflected and outputs the coefficient value (an) and gain value (G). An FFT unit 511 for processing a FOUR FOURIER TRANSFORM (FFT) to output a speech-grade speech signal input to the encryption unit 500; A filter bank (CRITICAL BAND INTEGRATION AND RESAMPLING) unit 512 for inputting the signal of the FFT unit 511 to filter, sample, and output the filter according to a predetermined frequency band; An equalizer process and a loudness process for inputting the signal of the filter bank unit 512 to be suitable for human hearing; A matching part (514) for inputting the signal of the loudness unit (513) and adjusting the power level to suit the hearing of a person; An inverse FFT unit 515 for inputting a signal of the matching unit 514 and outputting an inverse fast Fourier transform (FFT) process; SOLVING SET OF LINEAR EQUATIONS unit 516 for inputting the signal of the inverse FFT unit 515, and phase-compensating and outputting the image according to human hearing; A voice level speech signal, which is input to the encryption unit 500, is composed of a frequency characteristic (CEPSTRAL RECURSION) unit 517 which inputs the signal of the phase processing unit 516 and compensates and outputs the frequency characteristic suitable for human hearing. The PLP coefficient unit 510 which detects parameters of the coefficient value and the gain value reflected by the low order of the human auditory effect and outputs them to the coding unit 550,

상기 피엘피 계수부(510)에 입력되는 음성급 스피치 신호가 유성음 신호인지 무성음 신호인지를 판단하여 해당 식별신호를 출력하는 것으로, 암호부(500)에 입력되는 음성급 스피치 신호를 분석하여, 유성음(VOICE SOUND) 신호이면 유성음 식 별신호를 코딩부(550)에 파라메터로 출력하고, 무성음(UNVOICE SOUND) 신호이면 무성음 식별신호를 코딩부(550)에 파라메터로 출력하는 유성무성식별부(520)와, Determining whether the voice level speech signal input to the PLP counter 510 is a voiced sound signal or an unvoiced sound signal and outputting the corresponding identification signal. The voice level speech signal input to the encryption unit 500 is analyzed and voiced sound. In the case of the VOICE SOUND signal, the voiced voice identification signal is output to the coding unit 550 as a parameter. In the case of the unvoiced sound signal, the voiced voice identification unit 520 is output to the coding unit 550 as a parameter. Wow,

상기 유성무성식별부(520)의 출력신호를 입력하고 무성음 신호를 차단하여 유성음 신호만을 통과시키는 무성차단부(530)와, An unvoiced blocker 530 which inputs the output signal of the voiced voice discrimination unit 520 and blocks the unvoiced sound signal and passes only the voiced sound signal;

상기 무성차단부(530)로부터 인가되는 유성음신호를 분석하여 피치주기를 검출하고 출력하는 것으로, 유성음(VOICE SOUND) 신호의 피치주기 값(P)을 검출하고 파라메터 값으로 코딩부에 출력하는 피치검출부(540)와, The pitch detection unit detects and outputs a pitch period by analyzing voiced sound signals applied from the unvoiced cutoff unit 530. The pitch detector detects the pitch period value P of the voiced sound signal and outputs the parameter value to the coding unit. With 540,

상기 피엘피 계수부(510)와 피치검출부(540)와 유성무성식별부(520)로부터 각각 인가되는 값들을 이용하여 낮은 차수의 저전송률(LOW BIT RATE) 파라메터 코딩하는 것으로, 상기 피엘피 계수부(510)로부터 파라메터로 인가되는 계수값(an)과 이득값(G)을 입력하고, 상기 피치검출부(540)로부터 인가되는 피치주기값(P)을 파라메터로 입력하며, 상기 유성무성식별부(520)로부터 인가되는 유성무성식별신호를 파라메터로 입력하여, 파라메터 코딩(PARAMETER CODING) 처리하고 저전송률(LOW BIT RATE)로 출력하는 코딩부(550)가 포함되어 구성된다. By using the values applied from the PLP counter 510, the pitch detector 540, and the voiceless voice discriminator 520, respectively, the low order low bit rate parameter coding is performed. Input a coefficient value (an) and a gain value (G) applied as a parameter from (510), and inputs the pitch period value (P) applied from the pitch detection unit 540 as a parameter, the planetary voice identification unit ( A coding unit 550 for inputting a voiceless voice identification signal applied from 520 as a parameter, processing a parameter coding, and outputting at a low bit rate is included.

이하, 상기와 같은 구성의 본 발명에 의한 것으로, 휴대단말기의 개선된 저전송률 선형예측코딩 장치를 첨부된 도면을 참조하여 상세히 설명한다. Hereinafter, the present invention having the above-described configuration will be described in detail with reference to the accompanying drawings.

본 발명은 휴대단말기에서 음성급 스피치 신호를 높은 압축률과 낮은 저전송률로 상대방에게 전송하는데 있어서, 선형예측코딩(LPC: LINEAR PREDICTIVE CODING) 방식을 이용하는 것보다 더욱 낮은 계수의 차수를 이용하는 피엘피(PLP: PERCEPTUAL LINEAR PREDICTION) 계수부(510)를 이용하여, 종래의 LPC(LINEAR PREDICTION COEFFICIENT)를 사용하는 경우보다, 압축률을 높이는 동시에, 더욱 낮은 저전송률로 전송하는 것이다. The present invention provides a PLP using a lower order than a linear predictive coding (LPC) method in transmitting a speech level speech signal to a counterpart at a high compression rate and a low transmission rate in a mobile terminal. By using the PERCEPTUAL LINEAR PREDICTION coefficient unit 510, the compression rate is increased and the transmission rate is lowered at a lower transmission rate than when using the conventional LAR (LINEAR PREDICTION COEFFICIENT).

상기 종래 기술에서 적용되는 LPC와 본 발명에서 적용되는 PLP의 큰 차이는 다음과 같다. The large difference between the LPC applied in the prior art and the PLP applied in the present invention are as follows.

상기 LPC는 기본적으로 다음의 수식에 의하여 엠에스이(MSE: MEAN SQUARED ERROR)가 최소의 값이 되도록 ak 값을 구하는 것이다. The LPC basically calculates an ak value such that MSE (MEAN SQUARED ERROR) is the minimum value according to the following equation.

(수식 1)(Formula 1)

상기 수식에 의하여 구해진 계수의 차수는 8 KHz 샘플링 레이트(SAMPLING RATE)를 가지는 경우, 약 10차(8차 내지 12차)를 이용하며, 상기와 같이 구하여진 LP 계수(LPC)가 선형예측을 이용하는 LPC, CELP, MELP, RELP 등의 각종 코딩에 적용되며, 이러한 내용은, W. B. KLEIJIN 과 K. K. PALIWAL에 의한 것으로, SPEECH CODING AND SYNTHESIS, AMSTERDAM, THE NETHERLANDS, ELSEVIER, 1955에 보다 상세히 설명되어 있다. The order of the coefficients obtained by the above formula is about 10th order (8th to 12th order) when the 8 KHz sampling rate has a sampling rate, and the LP coefficients obtained as described above use the linear prediction. Applied to various codings such as LPC, CELP, MELP, RELP, and the like, which is made by WB KLEIJIN and KK PALIWAL, which are described in more detail in SPEECH CODING AND SYNTHESIS, AMSTERDAM, THE NETHERLANDS, ELSEVIER, 1955.

본 발명에 적용되는 PLP 방식은, 1990년 HERMANSKY의 논문에서 소개된 것으로, 기존의 MFCC(MEL-FREQUENCY CEPSTRAL COEFFICIENT)와 마찬가지로 HUMAN AUDITORY 특성을 이용하지만, DCT(DISCREET-TIME COSINE TRANSFORM)이 수행되는 것 인 반면, 선형예측(LINEAR PREDICTION)을 하는 것처럼, LEVINSON-DURBIN RECURSION을 하는 것이 다른 점이며, B. GOLD AND MORGAN의, SPEECH AND AUDIO PROCESSING, NEWTO가, JOHN WILEY & SONS, INC, 2000 문헌 및 H. HERMANSKY의, PERCEPTUAL LINEAR PREDICTIVE(PLP) ANALYSIS OF SPEECH, J. ACOUST. SOC. AM. VOL. 87, PP.1738-1752, 1990 문헌에 보다 상세히 설명되어 있다. The PLP method applied to the present invention, which was introduced in a 1990 paper by HERMANSKY, uses the HUMAN AUDITORY property like the existing MFCC (MEL-FREQUENCY CEPSTRAL COEFFICIENT), but DCT (DISCREET-TIME COSINE TRANSFORM) is performed. On the other hand, LEVINSON-DURBIN RECURSION is different, as does LINEAR PREDICTION, and B. GOLD AND MORGAN, SPEECH AND AUDIO PROCESSING, NEWTO, JOHN WILEY & SONS, INC, 2000, and H. HECEANSKY, PERCEPTUAL LINEAR PREDICTIVE (PLP) ANALYSIS OF SPEECH, J. ACOUST. SOC. AM. VOL. 87, PP. 1738-1752, 1990, described in more detail.

즉, 본 발명은, 저전송률을 위하여 선형예측코딩(LPC)에서 LPC(LINEAR PREDICTION COEFFICIENT) 계수를 적용하는 대신에 PLP(PERCEPTUAL LINEAR PREDICTION) 계수를 적용하는 것이 중요한 차이점이다. That is, in the present invention, it is important to apply a PLP (PERCEPTUAL LINEAR PREDICTION) coefficient instead of applying a LINEAR PREDICTION COEFFICIENT (LPC) coefficient in LPC.

일반적인 LPC 계수를 적용하여 신서사이즈 필터(SYNTHESIS FILTER)를 구성하며, 상기 필터의 LPC 계수가 ai 로 주어지면 신서사이즈 필터의 오류 예측 필터(ERROR PREDICTION FILTER)는 수식

로 표현된다. A SYNTHESSIS FILTER is constructed by applying general LPC coefficients, and when the LPC coefficient of the filter is given as ai, the ERROR PREDICTION FILTER of the synthesized filter is

It is expressed as

상기 수식을 이용하여 추출된 피치(PITCH) 정보와 유성음/무성음 정보를 적용하고, LPC 계수와 피치 정보, 그리고 유성무성 식별신호를 전송한다. PITCH information and voiced / unvoiced information extracted using the above equation are applied, and the LPC coefficient, the pitch information, and the voiced voice identification signal are transmitted.

그러나, 본 발명에서는 PLP 계수를 적용하여 스펙트럼(SPECTRUM)을 구하는 것이다. However, in the present invention, the spectrum (SPECTRUM) is obtained by applying the PLP coefficients.

상기 PLP 계수에는 사람의 청각적 효과가 반영되므로, 상기 PLP 계수를 적용하여 연산된 스펙트럼의 엠에스이(MSE) 값은 LPC 계수를 적용한 값보다 오차(ERROR)가 크지만, 청각적 효과를 고려하였을 경우는 오차가 더 작게된다. Since the human auditory effect is reflected in the PLP coefficients, the MSE value of the spectrum calculated by applying the PLP coefficients is larger than the value using the LPC coefficients, but the acoustic effect is considered. The error becomes smaller.

상기 PLP 계수로 SHORT-DELAY PREDICTION을 하면, SAMPLING RATE를 8 KHz로 하는 경우, 7차 계수를 적용하여 전송률(BIT RATE)을 낮추며, 유성무성식별신호와 피치주기 신호에 의한 파라메터 정보가 전송되어야 하므로, 더 이상의 낮은 차수를 적용하지 못한다. When the SHORT-DELAY PREDICTION is performed with the PLP coefficient, when the sampling rate is set to 8 KHz, the 7th coefficient is applied to reduce the bit rate, and thus the parameter information by the voiceless voice identification signal and the pitch period signal should be transmitted. However, no lower order can be applied.

그러므로, 종래의 10 차 계수를 적용하는 경우보다 7 차 계수를 적용하므로, 약 30 %의 전송률을 낮추는 장점이 있다. Therefore, since the seventh order is applied than the conventional tenth order, the transmission rate of about 30% is lowered.

복호부에 구성되는 것으로, 음성급 스피치 신호를 생성하여 출력하는 신서사이저는 LPC 계수를 사용하는 경우와 동일하거나 유사한 방법으로 신서사이즈 필터를 하며, 다음 수식으로 표현된다. The synthesizer, which is composed of a decoder, generates and outputs a speech speech signal and performs a synthesizer filter in the same or similar manner as in the case of using the LPC coefficient, and is expressed by the following equation.

(수식 2)(Formula 2)

본 발명의 구성을 좀더 상세히 설명하면, 암호부(500)에 음성급 스피치 신호(S[n])가 입력되면, 피엘피 계수부(510)에서 입력하는 동시에 동일한 신호를 유성무성식별부(520)로 입력한다. Referring to the configuration of the present invention in more detail, when the speech level speech signal S [n] is input to the encryption unit 500, the PLP input unit 510 simultaneously inputs the same signal to the voiceless voice identification unit 520 ).

상기 피엘피 계수부(510)는 입력되는 스피치 신호(S[n])를 에프에프티부(511)에서 패스트 퓨리어 트랜스폼(FAST FOURIER TRANSFORM) 처리하고, 필터뱅크부(512)에서, 일 실시 예로, 약 26개 정도로 이루어지는 필터뱅크부(512) 구성에 의하여 각각의 해당 필터에 의한 주파수 단위로 잡음(NOISE) 성분을 제거하며, 라우드니스부(513)에 인가되어 사람에게 적합한 신호가 되도록 이퀄라이저(EQUALIZER) 처리와 라우드니스(LOUDNESS) 처리된다. The PLP counter 510 processes the input speech signal S [n] in the FFT unit 511 by FAST FOURIER TRANSFORM, and performs the filter bank unit 512 in one step. For example, by the configuration of about 26 filter banks 512, a noise component is removed in units of frequencies by respective filters, and applied to the loudness unit 513 so that an equalizer is applied to a signal suitable for a person. EQUALIZER) processing and LOUDNESS processing.

상기와 같이 처리된 음성급 스피치 신호는, 상기 매칭부(514)에 인가되어 사람이 듣기에 적당한 전력(POWER)으로 매칭(MATCHING) 처리되고, 역에프에프티부(515)에 인가되어 역(INVERSE) 패스트 퓨리어 트랜스폼(FFT) 처리되며, 상기 위상처리부(516)에 인가되어 사람이 청각적으로 듣기 적당한 위상(PHASE)이 되도록 위상보정 처리되고, 상기 주파수특성부(517)에 인가되어 사람이 청각적 특성이 반영되도록 주파수 특성 보상처리되어 출력된다. The speech-level speech signal processed as described above is applied to the matching unit 514 to be matched with power suitable for human listening, and is applied to the inverse FFT unit 515 to be inversed. Fast Fourier Transform (FFT) processing, and applied to the phase processing unit 516 to perform phase correction processing so that a human is audibly audible phase (PHASE), and applied to the frequency characteristic unit 517 The frequency characteristic compensation process is output so that this acoustic characteristic is reflected.

상기와 같이 피엘피 계수부(510)는 입력되는 음성급 스피치 신호를 사람의 청각적 특성을 반영한 낮은 차수의 계수값(an)과 이득값(G)을 파라메터값으로 코딩부(550)에 출력한다. As described above, the PLP counting unit 510 outputs the input speech level signal to the coding unit 550 as low-order coefficient values (an) and gain values (G) reflecting human auditory characteristics. do.

상기 유성무성식별부(520)는 입력되는 음성급 스피치 신호가 유성음(VOICE SOUND) 신호 인지 무성음(UNVOICED SOUND) 신호 인지의 판단 및 식별 신호를 상기 코딩부(550)에 출력하는 동시에, 스피치 신호를 무성차단부(530)에 인가하므로 무성음 신호가 통과되지 못하도록 차단하며, 유성음 신호만이 통과되도록 한다. The voiceless voice discriminator 520 outputs a speech signal to the coding unit 550 and determines whether the voice speech signal input is a voiced sound signal or a unvoiced sound signal. Since it is applied to the unvoiced blocking unit 530, the unvoiced sound signal is blocked from passing, and only the voiced sound signal is passed.

상기 무성차단부(530)에 의하여 유성음 신호만을 입력하는 피치검출부(540)는, 상기 입력되는 유성음 신호를 분석하여 피치 주기(P)를 검출하고, 상기 검출된 피치 주기(P)를 코딩부(550)에 파라메터값으로 출력한다. The pitch detecting unit 540 which inputs only the voiced sound signal by the unvoiced cutoff unit 530 detects the pitched voice signal by analyzing the input voiced sound signal, and encodes the detected pitch period P by the coding unit ( 550) as a parameter value.

상기 코딩부(550)는 피엘피 계수부(510)로부터 인가되는 것으로, 청각적 특성이 반영된 계수값(an), 게인값(G) 파라메터를 입력하고, 상기 유성무성식별부(520)로부터 유성무성식별신호를 파라메터로 입력하며, 상기 피치검출부(540)로부 터 피치주기(P)를 파라메터로 입력한다. The coding unit 550 is applied from the PLP coefficient unit 510, and inputs a coefficient value (an) and a gain value (G) parameter reflecting the auditory characteristics, and receives the voiced signal from the voiceless voice discriminator 520. Input the unvoiced identification signal as a parameter, and input the pitch period (P) from the pitch detection unit 540 as a parameter.

상기 코딩부(550)는, 상기와 같이 각각 입력되는 파라메터값 만을 이용하여 음성급 스피치 신호를 코딩 처리하므로, 계수의 차수가 낮은 저전송률로 압축하여 출력 송신한다. Since the coding unit 550 processes the speech-grade speech signal using only the parameter values respectively input as described above, the coding unit 550 compresses and outputs the low-rate data with low order of coefficient.

따라서, 상기와 같은 구성의 본 발명은, PLP 계수를 이용하여 차수를 낮추므로, 압축률을 높이는 장점과 더욱 낮은 저전송률로 음성급 신호를 전송하는 장점이 있다. Therefore, the present invention having the above configuration has the advantage of lowering the order by using the PLP coefficient, thereby increasing the compression rate and transmitting the voice level signal at a lower low transmission rate.

이하, 상기 첨부된 도6을 참조하여 본 발명에 의한 것으로, 휴대단말기의 개선된 저전송률 선형예측코딩 방법을 설명한다. Hereinafter, the present invention will be described with reference to the attached FIG. 6, which describes an improved low-rate linear predictive coding method of a mobile terminal.

이동통신용 휴대단말기(MS)의 암호부(500)에 입력되는 음성급 스피치 신호를 저전송률(LOW BIT RATE)로 처리하여 전송할 것인지 판단하는 과정(S100)과, A process of determining whether to transmit a voice level speech signal input to the encryption unit 500 of the mobile communication mobile terminal (MS) at a low bit rate (S100);

상기 과정(S100)에서 저전송률 전송하는 것으로 판단하는 경우, 패스트 퓨리어 트랜스폼 처리하고, 필터뱅크 처리하는 것으로, 휴대단말기(MS)의 암호부(500)에 입력되는 음성급 스피치 신호를 패스트 퓨리어 트랜스폼(FFT) 처리하는 과정(S110); 상기 과정(S110)의 음성급 스피치 신호를 필터뱅크 처리하여 사람의 청각적 특성에 적합하게 잡음 성분을 제거하는 과정(S120)으로 이루어지는 에프에프티 과정과, If it is determined in step S100 that the low transmission rate is transmitted, the fast Fourier transform processing and the filter bank processing, the voice-quality speech signal input to the encryption unit 500 of the mobile terminal (MS) Fast Fury A transform (FFT) process (S110); An FFT process comprising a process of removing a noise component to be suitable for an auditory characteristic of a person by performing a filterbank process on the voice level speech signal of the process (S110),

상기 에프에프티 과정의 신호를 사람의 청각에 적합하게 라우드니스 처리하고 파워매칭 처리하여 출력하는 것으로, 상기 에프에프티 과정의 음성급 스피치 신호를 사람의 청각에 적합한 크기의 소리 성분으로 라우드니스 처리하여 통과시키는 과정(S130); 상기 과정(S130)의 신호를 사람이 청각적으로 듣기 적합하도록 출력전력을 매칭하는 과정(S140)으로 이루어지는 청각과정과, Loudness processing and power matching to output the signal of the FM process, the process of passing the voice level speech signal of the FM process with sound components of a size suitable for human hearing (S130); An auditory process comprising a step (S140) of matching an output power so that a person is audibly audible to the signal of the process (S130),

상기 청각과정의 신호를 역패스트 퓨리어 트랜스폼 처리하고 청각에 적합하게 위상보정 처리하는 것으로, 상기 청각과정의 음성급 스피치 신호를 역패스트 퓨리어 트랜스폼(IFFT) 처리하는 과정(S150); 상기 과정(S150)의 음성급 스피치 신호를 사람의 청각에 적합하게 하는 SOLVING SET OF LINEAR EQUATION의 더빈(DURBIN) 처리에 의하여 위상보상 하는 과정(S160)으로 이루어지는 역에프에프티 과정과, Performing a reverse fast Fourier transform on the signal of the auditory process and a phase correction process suitable for hearing, and performing a reverse fast Fourier transform (IFFT) on the speech level speech signal of the auditory process; An inverse FFT process comprising a step S160 of performing a phase compensation by a DURBIN process of SOLVING SET OF LINEAR EQUATION to make the speech level speech signal of step S150 suitable for human hearing;

상기 역에프에프티과정의 신호를 청각에 적합하게 주파수 특성 처리하고 파라메터 코딩하여 저전송률로 송신하는 것으로, 상기 역패스트 퓨리어 트랜스폼 과정의 음성급 스피치 신호를 사람의 청각에 적합하게 캡스트럴 리커젼(CEPSTRAL RECURSION) 처리에 의하여 주파수 특성을 보상하는 과정(S170); 상기 과정(S170)의 음성급 스피치 신호를 각각의 해당 파라메터로 코딩(CODING) 처리하여 저전송률(LOW BIT RATE)로 송신하는 과정(S180)으로 이루어지는 전송과정으로 구성된다. Capturing the signal of the inverse FFT process to a low frequency rate by processing the frequency characteristics and parameter coding to suit the hearing, the capsular riker to the speech level speech signal of the reverse fast Fourier transform process suitable for human hearing Compensating for the frequency characteristic by a CEPSTRAL RECURSION process (S170); The transmission process consists of a step (S180) of transmitting the voice level speech signal of the step (S170) by coding (CODING) with each corresponding parameter at a low bit rate (LOW BIT RATE).

이하, 상기와 같은 구성의 본 발명에 의한 것으로, 휴대단말기의 개선된 저전송률 선형예측코딩 방법을 첨부된 도면을 참조하여 상세히 설명한다. Hereinafter, the present invention having the above configuration will be described in detail with reference to the accompanying drawings.

이동통신용 휴대단말기의 암호부(500)는 입력되는 음성급 신호를 저전송률(LOW BIT RATE)로 상대방에 전송하고자 하는지 판단하고(S100), 상기 판단(S100)에서 음성급 신호를 저전송률로 전송하는 경우, 패스트 퓨리어 트랜스폼 처리하며(S110), 필터뱅크 처리하여 잡음성 신호를 제거한다(S120). The encryption unit 500 of the mobile terminal for mobile communication determines whether to transmit the input voice level signal to the other party at a low bit rate (SLOW BIT RATE) (S100), and transmits the voice level signal at a low bit rate in the determination (S100). In this case, the fast Fourier transform process (S110) and the filter bank process to remove the noise signal (S120).

상기와 같이 처리된 음성급 신호를 라우드니스 처리하여, 사람의 청각적 특 성에 적합한 성분을 출력하고(S130), 상기 신호를 청각적 특성에 의하여 듣기 적당하도록 출력레벨을 조정하는 파워 매칭 처리하며(S140), 역(INVERSE) 패스트 퓨리어 트랜스폼(FFT) 처리한다(S150). Loudness processing of the voice level signal processed as described above, and outputs a component suitable for the human auditory characteristics (S130), and a power matching process for adjusting the output level to be appropriate to hear the signal by the auditory characteristics (S140) Inverse fast Fourier transform (FFT) processing (S150).

상기와 같이 처리된 신호를 사람의 청각에 의하여 듣기 적당하도록 위상보정 처리하고(S160), 또한, 청각적 특성에 적합하도록 주파수 특성 보상처리한다(S170). The signal processed as described above is subjected to phase correction processing so as to be suitable for hearing by human hearing (S160), and frequency characteristic compensation processing so as to be suitable for auditory characteristics (S170).

상기와 같이 피엘피 계수부에 의하여, 입력된 음성급 스피치 신호를 사람의 청각적 특성이 반영되도록 처리하여, 차수가 낮아진 계수값(an)과 이득값(G)을 파라메터 값으로 코딩부(550)에 출력하며, 상기 코딩부(550)는 피치주기(P) 파라메터와 유성무성식별신호를 각각 입력하여 파라메터 코딩하므로, 낮은 차수에 의하여 높은 압축률로 압축하는 동시에 저전송률로 코딩하여 지정된 상대방에 송신한다(S180). As described above, the PLP coefficient processing unit processes the input speech level speech signal to reflect the human auditory characteristics, thereby encoding the coefficient value (an) and the gain value (G) whose parameters are lower than the coding unit 550. Since the coding unit 550 inputs the pitch period (P) parameter and the voiceless voice identification signal to each other, the coding unit 550 compresses at a high compression rate according to a low order and simultaneously codes at a low data rate to transmit to a designated counterpart. (S180).

따라서, 상기 본 발명은, LPC 계수를 사용하는 종래 기술대비 PLP 계수를 사용하므로, 높은 압축률로 개선하고, 또한, 음성급 신호를 보다 낮은 저전송률로 개선 송신하는 장점이 있다. Therefore, since the present invention uses the PLP coefficient compared to the conventional technique using the LPC coefficient, the present invention has the advantage of improving the high compression rate and further improving and transmitting the voice level signal at a lower low transmission rate.

상기와 같은 구성의 본 발명은, 음성급 신호를 전송하는데 있어서, PLP 계수를 사용하므로, 청각적 특성을 반영하고, 계수의 차수를 낮추어 압축률을 높이는 산업적 이용효과가 있다. According to the present invention having the above-described configuration, since the PLP coefficient is used to transmit a voice level signal, there is an industrial use effect of reflecting an auditory characteristic and increasing the compression ratio by lowering the order of the coefficient.

또한, 음성급 신호를 개선된 저전송률로 전송하므로, 채널을 효율적으로 운 용하는 사용상 편리한 효과가 있다. In addition, since the voice signal is transmitted at an improved low transmission rate, there is a convenient effect of using the channel efficiently.

Claims

A PLP counter for outputting a coefficient value and a gain value by processing PLP coefficients in which a voice level speech signal input to an encryption unit of the mobile terminal reflects an auditory effect;

A voiceless voice discrimination unit for determining whether the voice level speech signal input to the PLP counter is a voiced sound signal or an unvoiced voice signal and outputting an identification signal;

An unvoiced blocker configured to input an output signal of the voiced voice discriminator and to block only the voiced sound signal to pass only the voiced sound signal;

A pitch detector for detecting and outputting a pitch period by analyzing voiced sound signals applied from the unvoiced cutoff unit;

An improved low-rate linear predictive coding apparatus of a mobile terminal, characterized by comprising a coding unit for coding a low-order low-rate parameter using values applied from the PLP coefficient unit, the pitch detector unit, and the voiceless voice discriminator unit, respectively. .

According to claim 1, wherein the PLP counting unit,

A FFT unit which outputs the voice level speech signal input to the encryption unit of the mobile terminal by performing fast Fourier transform processing;

A filter bank unit for inputting the signal of the FFT unit, filtering, sampling, and outputting the filter according to a predetermined frequency band;

A loudness unit for inputting a signal of the filter bank unit to output an equalizer process and a loudness process to be suitable for human hearing;

A matching unit for inputting a signal of the loudness unit and adjusting a power level according to hearing of a person;

An inverse FFT unit for inputting a signal of the matching unit and outputting an inverse fast Fourier transform process;

A phase processing unit which inputs a signal of the inverse FFT unit and phase-compensates and outputs a phase suitable for human hearing;

And a frequency characteristic unit configured to input a signal of the phase processing unit and compensate and output a frequency characteristic to be suitable for hearing of a person.

According to claim 1, wherein the PLP counting unit,

Improvement of the mobile terminal, characterized in that it consists of analyzing the speech level speech signal input to the encryption unit of the mobile terminal and detecting the coefficient value and gain value parameters reflecting the human auditory effect and outputting them to the coding unit. Low-rate linear predictive coding device.

According to claim 1,

The voiceless voice identification unit analyzes the voice level speech signal input to the encryption unit of the mobile terminal, and outputs the voiced voice identification signal as a parameter to the coding unit if the voiced voice signal, and converts the voiced voice identification signal to the coding unit if the voiced signal is a parameter. Output,

And the pitch detecting unit is configured to detect a pitch period value of the voiced sound signal input from the unvoiced blocking unit, and output the coding unit as a parameter value to the coding unit.

The method of claim 1, wherein the coding unit,

Input a coefficient value and a gain value applied as a parameter from the PLP coefficient unit, input a pitch period value applied from the pitch detection unit as a parameter, and input a voiced voice identification signal applied from the voiceless voice identification unit as a parameter Improved linearity predictive coding apparatus of a portable terminal, characterized in that the parameter coding process and output at a low transmission rate.

A process of determining whether to transmit a speech level speech signal input to an encryption unit of the portable terminal at a low transmission rate, and

In the process, if it is determined that the low-rate transmission, the fast Fourier transform processing, the filter bank processing FFT process,

An auditory process of outputting the signal of the process by loudness processing and power matching process suitable for human hearing;

An inverse FFT process of performing a reverse fast Fourier transform process on the signal of the above process and performing phase correction processing suitable for hearing;

An improved low-rate linear predictive coding method of a portable terminal, characterized by comprising: a transmission process for processing the signal of the process in a frequency characteristic suitable for hearing and transmitting at a low data rate by parameter coding.

The method of claim 6, wherein the FFT process,

Performing a fast Fourier transform process on the voice level speech signal input to the encryption unit of the mobile terminal;

An improved low rate linear predictive coding method of a portable terminal, characterized in that it comprises a process of removing the noise component by filtering the speech level speech signal of the process.

The method of claim 6, wherein the hearing process,

A process of loudness passing the voice level speech signal of the fft process with a sound component having a size suitable for human hearing;

And a step of matching the output power so that a person is audibly audible to the signal of the above process.

The method of claim 6, wherein the reverse FFT process,

Performing a reverse fast Fourier transform process on the speech level speech signal of the auditory process;

An improved low-rate linear predictive coding method of a portable terminal, characterized by comprising the step of performing phase compensation of the speech level speech signal of the above process by a dubin process suitable for human hearing.

The method of claim 6, wherein the transmission process,

Compensating for the frequency characteristics by capsular recursion processing for the speech-grade speech signal of the inverse fast Fourier transform process, for human hearing;

An improved low-rate linear predictive coding method of a portable terminal, characterized in that it comprises the step of coding the speech speech signal of the process with each corresponding parameter to transmit at a low transmission rate.