KR100911771B1

KR100911771B1 - A apparatus of packet loss concealment with realtime voice communication on internet and method thereof

Info

Publication number: KR100911771B1
Application number: KR1020070120394A
Authority: KR
Inventors: 김홍국; 강진아; 문정훈
Original assignee: 한국과학기술정보연구원
Priority date: 2007-11-23
Filing date: 2007-11-23
Publication date: 2009-08-10
Also published as: KR20090053520A

Abstract

The present invention relates to a terminal apparatus and a method for concealing packet loss of voice communication, and particularly, to a terminal for real-time communication with an audio signal through the Internet, a variable bit rate in which the total bit rate of an audio signal and surplus information input in real time does not increase. A transmitter that encodes the data, records a sound quality evaluation result, converts the result into a real-time transmission format, and transmits the packet to the Internet; And a receiving end receiving a real time audio packet through the Internet, checking a sound quality evaluation result and feeding it back to the transmitting end, sequentially recording a packet decoded at a variable bit rate, removing jitter, feeding back a sound quality evaluation result, and outputting a reproduced voice signal. Characterized in that the configuration, the excess information is transmitted without increasing the transmission bit rate to recover the loss caused without affecting the bandwidth or data transmission rate of the occupied channel, and efficiently concealed to improve the quality of the voice signal It works.

Audio, Encoding, Decoding, Format, Packet, RTP, Real Time, Buffer, Quality, Internet

Description

A APPARATUS OF PACKET LOSS CONCEALMENT WITH REALTIME VOICE COMMUNICATION ON INTERNET AND METHOD THEREOF}

본 발명은 인터넷을 이용하는 음성급 신호 실시간 통신 장치의 데이터 손실을 보완하는 것으로, 특히, 음성급 신호에 의한 패킷 데이터의 손실을 은닉하기 위한 잉여 정보를 전체 전송비트율 증가 없이 전송하고 수신측에서 측정한 음질평가 결과에 의하여 송신측에서 가변비트율과 데이터의 유형을 결정하여 전송하는 음성통신의 패킷 손실을 은닉하는 단말장치 및 방법에 관한 것이다. The present invention compensates for the data loss of the voice signal real-time communication device using the Internet. In particular, the redundant information for concealing the loss of packet data caused by the voice signal is transmitted without increasing the overall transmission bit rate and measured at the receiving end. The present invention relates to a terminal apparatus and a method for concealing packet loss of voice communication in which a transmission side determines a variable bit rate and a data type according to a sound quality evaluation result.

인간의 사회활동과 지적 활동이 활발해 지면서 지정된 상대방에게 전달하거나 의사 전달을 위하여 통신할 경우가 자주 발생하는 동시에 전달하고자 하는 정보의 양이 많아지게 되었다. As social and intellectual activities of human beings become more active, cases of communicating to designated parties or communicating for communication often occur, and the amount of information to be delivered increases.

또한, 통신시간은 일정하게 정해지지 않고, 필요한 때 원하는 상대방과 즉시 접속하여 통신하는 것이 일반적이다. In addition, the communication time is not fixed regularly, and it is common to immediately connect and communicate with a desired counterpart when necessary.

통신이라 함은, 소리, 기호 등을 이용하여 상대방과 직접 정보 교환할 수 없 는 먼 거리 또는 원거리의 상대방과 정보를 교환하는 것을 의미하고, 운송수단 등의 발달에 의하여 통신하고자 하는 상대방이 지구상의 어느 곳에 위치하는지 모르는 경우가 있을 수 있으며, 통신은 비용을 비교적 많이 발생하는 것이 일반적이다. Communication means the exchange of information with a far-distance or long-distance counterpart that cannot exchange information directly with the other party by using sounds, symbols, etc. There may be cases where you do not know where it is located, and communication is generally expensive.

상대방과 직접 접속하여 의사 및 정보를 전달하는 방식이 실시간 통신방식이고, 전달하고자 하는 정보를 송신하고 상대방이 여유 있는 시간에 확인하여 회신하도록 하는 방식이 비실시간 통신방식이다. The real-time communication method is a method of directly connecting with the other party and delivering the doctor and information, and the non-real-time communication method is to transmit the information to be delivered and to confirm and reply to the other party at a leisurely time.

상기와 같이 통신하고자 하는 상대방이 지구상의 어느 곳에 위치하는지에 관계없고 실시간(REALTIME)으로 통신하지 않고 비실시간(NON-REALTIME)으로 통신하며, 전송되는 정보의 량이 매우 많아도 되고 통신비용이 비교적 저렴한 통신방식 중에 하나가 인터넷이다. As described above, regardless of where the other party is located on the earth and communicates in real time (NON-REALTIME) rather than in real time (REALTIME), the amount of information to be transmitted may be very large and the communication cost is relatively low One way is the Internet.

상기 인터넷은 다수 경로를 통하여 패킷을 송수신하고, 상기 송수신하는 패킷 데이터 처리에 소정 시간이 소요되며, 망의 특성에 의하여 지연(DELAY)과 지터(JITTER)가 발행하므로 이러한 특성에 민감하지 않은 비실시간 데이터 통신에 사용되어 왔다. The Internet transmits and receives a packet through multiple paths, and takes a predetermined time to process the transmitted and received packet data, and the delay and jitter are issued by the characteristics of the network. It has been used for data communication.

인터넷을 이용하여 음성급 신호를 실시간 송수신하는 인터넷 전화(INTERNET TELEPHONY) 통신 서비스를 제공하기 위해서는 신뢰성(RELIABILITY), 음성품질(VOICE QUALITY), QoS(QUALITY OF SERVICE) 및 표준화(STANDARDIZATION) 등이 보장되어야 하며, 명확성(CLARITY), 지연(DELAY) 및 에코(ECHO) 등의 요소도 포함되고, 어느 하나의 요소에 문제가 있는 경우 전체 음성 품질에 영향을 준다. RELIABILITY, VOICE QUALITY, QUALITY OF SERVICE, and STANDARDIZATION must be guaranteed in order to provide INTERNET TELEPHONY communication service that sends and receives voice level signals in real time using the Internet. It also includes factors such as CLARITY, DELAY, and ECHO, and if there is a problem with any one of them, it affects the overall voice quality.

그러나 인터넷을 이용하는 경우의 저렴한 통신비용과 전 세계 어디에서나 접속하여 통신할 수 있고, 광대역 통신 시스템 장비 개발 등에 힘입어 오디오 신호가 포함되는 음성신호를 이용한 실시간 통신 기술이 계속 개발 및 발전되고 있다. However, the real-time communication technology using voice signals including audio signals continues to be developed and developed due to low communication costs in the case of using the Internet and to connect and communicate anywhere in the world, and to develop broadband communication system equipment.

이러한 인터넷 통신기술의 발전에 의하여 실시간 오디오 미디어 통신 서비스가 더욱 선호되고 있다. Due to the development of the Internet communication technology, the real-time audio media communication service is more preferred.

상기 인터넷은 망 접속에 혼잡 또는 트래픽(TRAFFIC)의 병목현상이 발생하는 경우, 즉 트래픽이 많이 발생하는 경우는 패킷 데이터의 전송 지연이 발생하거나 전송 오류 등에 의한 손실이 발생하므로, 음성급 또는 오디오 신호를 이용하는 실시간 통신 서비스에서는 품질 또는 QoS(QUALITY OF SERVICE)를 적정하게 확보하기 어렵다. When the Internet has congestion or a bottleneck of traffic (TRAFFIC) in the network connection, that is, when a lot of traffic occurs, a packet-level transmission delay or a loss due to a transmission error occurs. In the real-time communication service using the PSA, it is difficult to adequately secure quality or quality of service (QoS).

그러므로 실시간 응용(APPLICATION)의 경우, 패킷 손실을 줄이기 위하여 최소한의 성능을 보장하는 기술, 일례로, 대역폭 제한(BANDWIDTH LIMITATION), 패킷 손실 복구(LOSS RECOVERY SCHEME), 대기열(QUEUING), 패킷 분류(PACKET CLASSIFICATION), 전진 에러정정(FORWARD ERROR CORRECTION: FEC) 등과 같은 기술이 필요하다.Therefore, for real-time applications, techniques that guarantee minimal performance to reduce packet loss, such as bandwidth limitation (BANDWIDTH LIMITATION), packet loss recovery (LOSS RECOVERY SCHEME), QUEUING, and packet classification (PACKET) CLASSIFICATION, FORWARD ERROR CORRECTION (FEC), etc.

일반적으로 인터넷을 통하여 전송되는 패킷 데이터의 스트리밍(STREAMING)에서 약 5% 이상 손실이 발생하고 손실에 의하여 무음 구간이 생성되는 경우 정상적인 실시간 통신이 어려울 정도로 음질 저하를 발생하므로, 인터넷으로 스트리밍 전송되는 패킷 데이터의 손실을 보상하기 위하여 많은 기술이 개발되고 있다. Generally, about 5% or more loss occurs in streaming of packet data transmitted through the Internet, and when a silent section is generated due to the loss, the sound quality is degraded so that normal real-time communication is difficult. Many techniques are being developed to compensate for data loss.

종래 기술에서는, 손실된 패킷 데이터에 의하여 발생하는 무음 구간을 원래 의 오디오 신호 파형(WAVEFORM)으로 근사화시켜 보간(INTERPOLATION)하는 방식을 이용한다. In the prior art, a method of interpolating by approximating a silent section generated by lost packet data to the original audio signal waveform WAVEFORM is used.

또한, 다른 종래 기술에서는 손실이 발생한 구간에서 잡음을 사용하거나 반복 재생 등과 같은 방식을 이용하여 보간 한다. In addition, in the prior art, the interpolation is performed using a noise or a method such as repeated playback in a section where a loss occurs.

일반적으로 오디오 통신신호에 의한 하나의 패킷은 약 20 내지 30 밀리세컨드(MILLISECOND) 시간에 해당하므로 단 하나의 패킷에 손실이 발생하는 경우에도 끊김 간격 또는 무음 구간이 상대적으로 크게 되어 보간(INTERPOLATION)하기 어렵다. In general, one packet due to the audio communication signal corresponds to about 20 to 30 milliseconds (MILLISECOND) time, so even in the event of loss of only one packet, the intermittent interval or silence interval becomes relatively large to interpolate. it's difficult.

이러한 종래 기술을 보다 개발시킨 것으로, MPEG(MOVING PICTURE EXPERTS GROUP) 오디오 코덱을 위한 압축 영역에서의 보간법 기술을 제시한 것으로, "ERROR CONCEALMENT FOR COMPRESSED DIGITAL AUDIO" P. LAUBER AND R. SPERSCHNEIDER, IN AES 111TH CONVENTION, NY, PP. 1-11, SEPT. 2001 및 “ERROR MITIGATION IN MPEG-AUDIO PACKET COMMUNICATION SYSTEMS" S. QUACKENBUSH AND P. DRIESSEN, IN AES 115TH CONVENTION, NY, PP. 1-11, OCT. 2003 등이 있다. In order to further develop such a prior art, an interpolation technique in a compression domain for an MPEG (MOVING PICTURE EXPERTS GROUP) audio codec is presented, "ERROR CONCEALMENT FOR COMPRESSED DIGITAL AUDIO" P. LAUBER AND R. SPERSCHNEIDER, IN AES 111TH CONVENTION, NY, PP. 1-11, SEPT. 2001 and "ERROR MITIGATION IN MPEG-AUDIO PACKET COMMUNICATION SYSTEMS" S. QUACKENBUSH AND P. DRIESSEN, IN AES 115TH CONVENTION, NY, PP. 1-11, OCT. 2003.

또한, PARAMETRIC AUDIO MODELING을 사용하는 방법을 제시한 것으로 “PACKET LOSS CONCEALMENT BASED ON SINUSOIDAL EXTRAPOLATION" J. LINDBLOM AND P. HEDELIN, IN PROC. ICASSP, PP. 173-176, MAY 2002 등이 있다. In addition, a method of using PARAMETRIC AUDIO MODELING is presented, such as "PACKET LOSS CONCEALMENT BASED ON SINUSOIDAL EXTRAPOLATION" J. LINDBLOM AND P. HEDELIN, IN PROC. ICASSP, PP.173-176, MAY 2002.

상기와 같이 패킷 데이터의 손실을 복원하기 위한 알고리즘을 패킷 손실을 은닉하는(PACKET LOSS CONCEALMENT: PLC) 알고리즘이라고 한다. As described above, the algorithm for recovering the loss of packet data is called a packet loss concealment (PLC) algorithm.

상기 PLC 알고리즘은 크게 송신단 기반과 수신단 기반의 두 방법으로 나누어 짐을 “ADAPTIVE RECOVERY TECHNIQUES FOR REAL-TIME AUDIO STREAMS" W. T. LIAO, J. C. CHEN, M. S. CHEN, IN PROC. IEEE INFOCOM, ANCHORAGE, AK, VOL. 2, PP. 815-823, APR. 2001 를 통해서 알 수 있다. The PLC algorithm is divided into two methods, a transmitter-based and a receiver-based method. “ADAPTIVE RECOVERY TECHNIQUES FOR REAL-TIME AUDIO STREAMS” WT LIAO, JC CHEN, MS CHEN, IN PROC. IEEE INFOCOM, ANCHORAGE, AK, VOL. PP. 815-823, APR. 2001.

상기 송신단 기반의 PLC 알고리즘은 수신단에서 수신한 패킷에 손실이 발생한 경우, 이전 패킷에서 함께 전송된 잉여 정보(REDUNDANCY) 또는 부가정보(SIDE INFORMATION)를 손실된 패킷 대신에 이용하도록 하는 것으로, 전진 에러 정정(FORWARD ERROR CORRECTION: FEC), 인터리빙(INTERLEAVING), 재전송(RE-TRANSMISSION) 방법 등이 있다. The transmitter-based PLC algorithm uses redundancy information or side information transmitted together with the previous packet in place of the lost packet when the packet received at the receiver is lost. (FORWARD ERROR CORRECTION: FEC), INTERLEAVING, RE-TRANSMISSION, etc.

또한, 수신단 기반의 PLC 알고리즘은 수신단에서 유효한 패킷 데이터, 즉 손실이 발생된 패킷 이전에 수신된 패킷 데이터와 이후에 수신된 패킷 데이터 등을 이용하여 상기 손실된 패킷 데이터를 복원하는 방법으로, 삽입(INSERTION), 보간(INTERPOLATION), 모델 기반 복원(MODEL-BASED RECOVERY) 방법 등이 있다. In addition, the receiver-based PLC algorithm restores the lost packet data by using valid packet data at the receiving end, that is, packet data received before the packet in which the loss occurred and packet data received thereafter. INSERTION, INTERPOLATION, and MODEL-BASED RECOVERY.

그러나 상기와 같은 종래 기술의 송신단 기반 PLC 알고리즘은 손실이 발생된 패킷 데이터의 복원을 위하여 좋은 결과를 제시하고 있으나, 해당 정보의 추가적인 전송에 의하여 전송 비트율이 증가하고, 이전 프레임 또는 패킷에 대한 인코딩을 다시 처리하여야 되는 등의 처리 지연(PROCESSING DELAY)이 발생하는 문제가 있다. However, although the transmitter-based PLC algorithm of the related art has a good result for restoring lost packet data, the transmission bit rate is increased by additional transmission of the corresponding information, and the encoding of the previous frame or packet is performed. There is a problem that a processing delay occurs, such as a need to be processed again.

또한, 상기의 종래 기술에 의한 것으로, 수신단 기반 PLC 알고리즘은 송신단이 제공하는 추가적인 정보가 없어도 수신단에서 자체적이며 독립적으로 손실된 패킷을 복원하는 장점이 있으나, 패킷 손실률이 높아지면 자체적으로 복원하기 어려 우므로 음성급 통신신호의 품질(QoS)이 급격히 떨어지는 문제가 있다. In addition, according to the prior art, the receiver-based PLC algorithm has an advantage of recovering the packet lost independently and independently at the receiver even without additional information provided by the transmitter, but is difficult to recover itself when the packet loss rate is high. Therefore, there is a problem in that the quality (QoS) of the voice-level communication signal drops sharply.

상기와 같은 문제점은 “PACKET LOSS CONCEALMENT FOR AUDIO STREAMING BASED ON THE GAPES AND MAPES ALGORITHMS" H. OFIR AND D. MALAH, IN IEEE 24TH CONVENTION OF ELECTRICAL AND ELECTRONICS ENGINEERS, ISRAEL, PP. 280-284, NOV. 2006 과 ”연속적인 프레임 손실에 강인한 G.729 프레임 손실 은닉 알고리즘“ 조충상, 이영한, 김홍국, 대한음성학회 한국음성과학회 공동학술대회 발표논문집 제 1 호, PP. 307-310, 2007.5 에 의하여 확인된다. Problems such as "PACKET LOSS CONCEALMENT FOR AUDIO STREAMING BASED ON THE GAPES AND MAPES ALGORITHMS" H. OFIR AND D. MALAH, IN IEEE 24TH CONVENTION OF ELECTRICAL AND ELECTRONICS ENGINEERS, ISRAEL, PP. 280-284, NOV. 2006 and “G.729 Frame Loss Concealment Algorithm Robust to Continuous Frame Loss” Cho, Choong-Sang, Young-Han Lee, Hong-Kook Kim, and Korean Society of Speech Sciences Conference 1, pp. 307-310, 2007.5.

또한, 상기와 같은 종래 기술은 상관관계가 높은 음성신호에 손실이 발생하는 경우, 손실이 발생한 구간의 데이터를 예측하여 어느 정도의 복구를 할 수 있지만, 음악 등과 같은 광대역 오디오 스트림의 경우에는 패킷 단위 신호 사이의 상관관계를 예측하기 어려워 부가정보 없이 자체적으로 복원하기 어려운 등의 문제가 있다. In addition, in the prior art as described above, when a loss occurs in a highly correlated voice signal, a certain amount of recovery can be performed by predicting data in a section in which the loss occurs, but in the case of a wideband audio stream such as music, a packet unit There is a problem that it is difficult to predict the correlation between signals, and thus it is difficult to restore itself without additional information.

본 발명은 상기와 같은 종래 기술의 문제점을 감안하여 안출한 것으로서, 특히, 인터넷으로 실시간 전송되는 음성급 신호에 손실이 발생하는 경우 전체적인 전송 비트율의 증가 없이 잉여 정보를 전송하여 손실 구간의 음성신호 패킷을 복구 또는 은닉하는 것으로, 음성통신의 패킷 손실을 은닉하는 단말장치 및 방법을 제공하는 것이 그 목적이다. The present invention has been made in view of the above-mentioned problems of the prior art, and in particular, when a loss occurs in a real-time voice level signal transmitted over the Internet, a voice signal packet of a loss interval is transmitted by transmitting surplus information without increasing the overall transmission bit rate. It is an object of the present invention to provide a terminal apparatus and a method for concealing packet loss of voice communication by recovering or concealing the data.

본 발명의 다른 목적은, 수신측에서 음성신호 패킷을 수신하고 음질평가한 결과를 이용하여 송신측에서 가변비트율을 결정하고 부가 음성 데이터의 유형에 의한 잉여 정보를 결정하여 송신하도록 하는 음성통신의 패킷 손실을 은닉하는 단말장치 및 방법을 제공하는 것이 그 목적이다. Another object of the present invention is to use a result of receiving a voice signal packet at a receiving side and evaluating sound quality, to determine a variable bit rate at the transmitting side, and to determine and transmit surplus information according to the type of additional voice data. It is an object of the present invention to provide a terminal apparatus and method for concealing a loss.

상기와 같은 목적을 달성하기 위하여 안출한 본 발명은, 인터넷을 통하여 오디오 신호로 실시간 통신하는 단말기에 있어서, 실시간 입력되는 오디오 신호와 잉여정보를 전체 비트율이 증가하지 않는 가변비트율로 인코드하고 음질평가 결과를 기록하여 실시간 전송 포맷으로 변환하며 패킷으로 인터넷에 송신하는 송신단; 및 인터넷을 통하여 실시간 오디오 패킷을 수신하고 음질평가 결과를 확인하여 송신단에 피드백하며 가변비트율로 디코딩한 패킷을 순차 기록하고 지터를 제거하며 음질평가 결과를 피드백하고 재생된 음성신호를 출력하는 수신단; 이 포함되는 구성을 제시한다. The present invention devised to achieve the above object, in the terminal for real-time communication to the audio signal via the Internet, the real-time input of the audio signal and the surplus information is encoded at a variable bit rate does not increase the overall bit rate and sound quality evaluation A transmitter that records the result, converts it into a real-time transmission format, and transmits the packet to the Internet; And a receiving end receiving a real time audio packet through the Internet, checking a sound quality evaluation result and feeding it back to the transmitting end, sequentially recording a packet decoded at a variable bit rate, removing jitter, feeding back a sound quality evaluation result, and outputting a reproduced voice signal. This presents the included configuration.

바람직하게, 상기 송신단은 실시간 인가되는 오디오 신호를 샘플링하여 입력하는 사운드 캡쳐부; 입력하는 오디오 신호를 디지털 신호로 인코딩하고 전체 비트율이 증가되지 않는 상태에서 이전 패킷에 대한 잉여 정보를 포함하여 인코딩하는 오디오 인코딩부; 오디오 인코딩부가 인코딩한 오디오 신호에 수신단이 피드백한 음질측정결과를 포함 기록하여 실시간 전송포맷으로 변환하는 실시간 포맷부; 실시간 포맷부가 포맷한 오디오 신호를 인터넷으로 실시간 전송할 패킷으로 변환하여 송신하는 실시간 패킷부; 를 포함하여 구성되는 것을 특징으로 한다. Preferably, the transmitting end is a sound capture unit for sampling and input the audio signal is applied in real time; An audio encoding unit for encoding an input audio signal into a digital signal and encoding the surplus information of the previous packet in a state where the total bit rate is not increased; A real time format unit which records the sound quality measurement result fed back to the audio signal encoded by the audio encoding unit and converts the result into a real time transmission format; A real-time packet unit for converting and transmitting the audio signal formatted by the real-time format unit into a packet for real-time transmission to the Internet; Characterized in that comprises a.

또한, 상기 수신단은 인터넷을 통하여 상대편 단말기가 송신한 실시간 패킷을 수신하는 실시간 수신부; 실시간 수신부가 수신한 패킷을 분석하여 음질평가 결과를 확인하고 상기 송신단에 피드백하는 실시간 분석부; 실시간 분석부에서 인가되는 신호로부터 가변비트율을 분석하여 수신된 패킷을 디코딩하는 오디오 디코더부; 디코딩된 패킷을 순서번호대로 기록하며 지터를 제거하는 버퍼부; 버퍼부에 저장된 패킷을 순차적으로 인가받고 설정된 소정 숫자 주기 마다 음질평가 결과를 측정하여 상기 송신단에 피드백하는 품질측정부; 품질측정부로부터 인가되는 오디오 신호를 출력하는 오디오 렌더부; 를 포함하여 구성되는 것을 특징으로 한다. The receiver may include a real time receiver configured to receive a real time packet transmitted from an opposite terminal through the Internet; A real time analyzer for analyzing a packet received by a real time receiver to check a sound quality evaluation result and feeding back to the transmitter; An audio decoder for analyzing a variable bit rate from a signal applied by a real time analyzer and decoding a received packet; A buffer unit for recording the decoded packets in order and removing jitter; A quality measuring unit which sequentially receives the packets stored in the buffer unit and measures a sound quality evaluation result at each predetermined number period and feeds back to the transmitting end; An audio renderer for outputting an audio signal applied from the quality measurer; Characterized in that comprises a.

또한, 상기 오디오 인코딩부는 입력되는 오디오 신호와 잉여정보를 전체 비트율이 증가하지 않는 가변비트율로 인코딩하는 구성으로 이루어지는 것을 특징으로 한다. The audio encoding unit may be configured to encode an input audio signal and surplus information at a variable bit rate without increasing the overall bit rate.

또한, 상기 가변비트율은 수신단으로부터 확인되어 피드백된 음질평가 결과 를 분석하여 확인되는 구성으로 이루어지는 것을 특징으로 한다. In addition, the variable bit rate is characterized in that consisting of a configuration that is confirmed by analyzing the sound quality evaluation results confirmed and feedback from the receiving end.

또한, 상기 가변비트율은 상대방 단말기로부터 패킷에 포함되어 수신된 음질평가 결과를 분석하여 확인하는 구성으로 이루어지는 것을 특징으로 한다. In addition, the variable bit rate is characterized by consisting of a configuration for analyzing and confirming the sound quality evaluation result received in the packet from the other terminal.

또한, 상기 실시간 포맷부는 수신단이 피드백한 측정 음질평가 결과를 페이로드 포맷의 예측 영역(MOS)에 기록하는 구성으로 이루어지는 것을 특징으로 한다. The real-time format unit may be configured to record a measurement sound quality evaluation result fed back by a receiver in a prediction region MOS of a payload format.

또한, 상기 오디오 디코더부는 실시간 분석부의 신호를 분석하여 가변비트율을 확인하고 확인된 가변비트율로 디코딩하는 구성으로 이루어지는 것을 특징으로 한다. The audio decoder may be configured to analyze a signal of the real-time analyzer to identify a variable bit rate and to decode the identified variable bit rate.

또한, 상기 수신부가 수신한 신호는 현재 패킷과 이전 패킷에 의한 잉여 정보 중에서 적어도 하나 이상이 포함되는 구성으로 이루어지는 것을 특징으로 한다. In addition, the signal received by the receiving unit is characterized in that the configuration comprises at least one or more of the excess information of the current packet and the previous packet.

또한, 상기 잉여 정보는 이전 패킷의 그 이전 패킷에 대한 잉여 정보가 더 포함되어 이루어지는 구성을 특징으로 한다. In addition, the surplus information is characterized in that the configuration further comprises a surplus information for the previous packet of the previous packet.

또한, 상기 가변비트율은 엠펙2(MPEG-2) 에이에이시(AAC)에 의하여 이루어지는 구성을 특징으로 한다. In addition, the variable bit rate is characterized by a configuration made by the MPEG-2 (AAC).

또한, 상기 지터는 인터넷에서 발생하는 변이로 이루어지는 구성을 특징으로 한다. In addition, the jitter is characterized by a configuration consisting of variations occurring in the Internet.

또한, 상기 버퍼부는 디코딩된 패킷 신호를 버퍼에 기록하므로 상기 지터를 제거하는 구성으로 이루어지는 것을 특징으로 한다. In addition, the buffer unit is configured to remove the jitter since the decoded packet signal is written to the buffer.

또한, 상기 품질측정부는 버퍼부로부터 설정된 소정 숫자 주기의 오디오 패킷 신호를 인가받은 후에 음질평가 결과를 측정하는 구성으로 이루어지는 것을 특 징으로 한다. The quality measuring unit may be configured to measure a sound quality evaluation result after receiving an audio packet signal of a predetermined number period set from a buffer unit.

또한, 상기 음질평가는 아이티유티(ITU-T) 표준 피563(P.563)의 논-인트루시브(NON-INTRUSIVE) 방식을 이용하여 평가되는 객관적 결과로 이루어지는 것을 특징으로 한다. In addition, the sound quality evaluation is characterized in that the objective result is evaluated by using the NON-INTRUSIVE method of the ITU-T standard P563 (P.563).

또한, 상기 품질측정부는 음질평가의 결과를 1 내지 5 의 범위 값으로 표현하는 것을 특징으로 한다. In addition, the quality measuring unit is characterized in that the result of the sound quality evaluation expressed in the range of 1 to 5.

또한, 상기 음질평가의 결과 값은 송신단의 실시간 포맷부에 피드백되어 기록되는 구성으로 이루어지는 것을 특징으로 한다. In addition, the result value of the sound quality evaluation is characterized in that consisting of a configuration that is fed back to the real-time format portion of the transmitter is recorded.

또한, 상기 실시간 분석부는 확인된 음질평가 결과를 송신단에 피드백하는 구성으로 이루어지는 것을 특징으로 한다. In addition, the real-time analysis unit is characterized in that consisting of a configuration for feeding back the confirmed sound quality evaluation results to the transmitting end.

또한, 상기 오디오 렌더부는 버퍼부로부터 인가되는 패킷 단위의 오디오 신호를 사운드 카드를 통하여 실시간으로 재생하고 출력하는 구성으로 이루어지는 것을 특징으로 한다. The audio renderer may be configured to reproduce and output an audio signal of a packet unit applied from a buffer unit in real time through a sound card.

상기와 같은 목적을 달성하기 위하여 안출한 본 발명은, 음성통신의 패킷 손실을 은닉하는 방법에 있어서, 음성통신 단말기의 송신단에 의하여 오디오 신호를 입력하고 실시간 캡처하는 과정; 캡처 된 오디오 신호를 확인된 음질평가 결과로 분석된 가변비트율에 의하여 잉여정보와 함께 인코딩하는 과정; 인코딩된 오디오 신호와 측정된 음질평가 결과를 실시간 전송 포맷으로 변환하는 과정; 및 실시간 전송 포맷의 오디오 신호를 패킷으로 변환하여 인터넷에 송신하는 과정; 을 포함하 는 구성을 제시한다. In order to achieve the above object, the present invention provides a method for concealing packet loss of a voice communication, the method comprising: inputting an audio signal by a transmitting end of a voice communication terminal and capturing it in real time; Encoding the captured audio signal together with the surplus information by the variable bit rate analyzed by the confirmed sound quality evaluation result; Converting the encoded audio signal and the measured sound quality evaluation result into a real-time transmission format; And converting an audio signal of a real time transmission format into a packet and transmitting it to the Internet. Present a configuration that includes a.

바람직하게, 상기 인코딩하는 과정은 확인된 음질평가 결과에 의하여 잉여정보를 인코딩하지 않는 것을 특징으로 한다. Preferably, the encoding process does not encode the surplus information based on the confirmed sound quality evaluation result.

또한, 상기 인코딩하는 과정은 확인된 음질평가 결과에 의하여 하나의 잉여정보 또는 두 개의 잉여정보를 함께 전체 비트율이 증가하지 않는 상태에서 인코딩하는 것을 특징으로 한다. The encoding process may be performed by encoding one surplus information or two surplus information together in a state in which the total bit rate does not increase according to the confirmed sound quality evaluation result.

또한, 상기 하나의 잉여정보는 현재 인코딩할 패킷의 이전 패킷에 대한 정보 인 것을 특징으로 한다. In addition, the one surplus information is characterized in that the information on the previous packet of the packet to be currently encoded.

또한, 상기 두 개의 잉여정보는 현재 인코딩할 패킷의 이전 패킷과 그 이전 패킷에 대한 정보 인 것을 특징으로 한다. In addition, the two surplus information is characterized in that the information on the previous packet and the previous packet of the current packet to be encoded.

또한, 상기 측정된 음질평가 결과는 상대방으로부터 수신된 오디오 패킷을 설정된 소정 숫자 단위로 누적하여 음질평가한 결과인 것을 특징으로 한다. The measured sound quality evaluation result may be a result of accumulating the audio packets received from the counterpart by a predetermined predetermined number unit.

상기와 같은 목적을 달성하기 위하여 안출한 본 발명은, 음성통신의 패킷 손실을 은닉하는 방법에 있어서, 음성통신 단말기의 수신단에 의하여 오디오 패킷을 실시간 수신하고 분석하여 확인된 음질평가 결과를 송신단에 피드백하는 과정; 음질평과 결과로부터 분석된 가변비트율로 수신된 패킷을 디코딩하고 순서번호에 의하여 버퍼에 저장하는 과정; 및 버퍼에 저장되고 설정된 소정 숫자 단위의 패킷으로부터 측정된 음질평가를 송신단에 피드백하는 동시에 실시간 오디오 신호로 변환하여 출력하는 과정; 이 포함되는 구성을 제시한다. In order to achieve the above object, the present invention provides a method for concealing packet loss of voice communication, wherein the receiving end of the voice communication terminal receives and analyzes an audio packet in real time and feeds back the confirmed sound quality evaluation result to the transmitting end. Process of doing; Decoding the received packet at a variable bit rate analyzed from the sound quality and the result and storing the received packet in a buffer by a sequence number; And converting the sound quality evaluation measured from the packet of the predetermined number unit stored in the buffer into a real time audio signal while feeding back to the transmitter. This presents the included configuration.

바람직하게, 상기 확인된 음질평가 결과는 수신된 패킷을 분석하여 기록된 음질평가 결과를 확인하는 것을 특징으로 한다. Preferably, the confirmed sound quality evaluation result is characterized in that to confirm the recorded sound quality evaluation results by analyzing the received packet.

또한, 상기 가변비트율은 확인된 음성평가 결과로부터 분석하여 확인하는 것을 특징으로 한다. In addition, the variable bit rate is characterized by confirming by analyzing from the confirmed speech evaluation results.

또한, 상기 음질평가는 버퍼에 설정된 소정 숫자 단위로 패킷이 누적 기록되는 주기마다 측정하는 것을 특징으로 한다. In addition, the sound quality evaluation is characterized in that for every cycle in which packets are accumulated and recorded in a predetermined number unit set in the buffer.

또한, 상기 오디오 신호의 출력은 음질평가의 측정과 관계없이 버퍼에 순서대로 기록되는 오디오 패킷을 인가받아 변환하고 출력하는 것을 특징으로 한다. In addition, the output of the audio signal is characterized by receiving and converting and outputting audio packets which are sequentially recorded in the buffer regardless of the measurement of the sound quality evaluation.

상기와 같은 목적을 달성하기 위하여 안출한 본 발명은, 음성통신의 패킷 손실을 은닉하는 방법에 있어서, 음성통신 단말기의 수신단에 의하여 상대방으로부터 수신된 패킷을 분석하고 음질평가 결과를 확인하는 단계; 분석된 음질평가 결과에 의하여 전송비트율을 가변하고 캡처된 오디오 신호와 잉여 정보를 코딩하는 단계; 수신된 패킷을 설정된 소정 주기 단위로 누적하여 객관적인 음질평가를 측정하는 단계; 및 측정된 음질평가 결과를 송신단에 피드백하여 포맷의 지정 영역에 기록하고 송신하는 단계; 가 포함되는 구성을 제시한다. In order to achieve the above object, the present invention provides a method for concealing packet loss of voice communication, comprising: analyzing a packet received from a counterpart by a receiving end of a voice communication terminal and confirming a sound quality evaluation result; Varying the transmission bit rate according to the analyzed sound quality evaluation result and coding the captured audio signal and the redundant information; Accumulating the received packets in a predetermined predetermined unit of time and measuring an objective sound quality evaluation; And feeding back the measured sound quality evaluation result to the transmitting end to record and transmit in the designated area of the format; Present a configuration that includes.

바람직하게, 상기 잉여정보를 코딩하는 단계는 분석된 음질평가 결과에 의하여 잉여 정보를 코딩하지 않거나 이전 패킷의 잉여정보를 코딩하거나 이전 패킷과 그 이전 패킷의 잉여정보를 코딩하는 것 중에서 선택된 어느 하나인 것을 특징으로 한다. Preferably, the step of coding the surplus information is any one selected from coding the surplus information or coding the surplus information of the previous packet and the previous packet according to the analysis result of the sound quality evaluation It is characterized by.

상기와 같은 구성의 본 발명은 인터넷을 통하여 전송되는 패킷 데이터의 손실을 복구하는 잉여정보를 전체 전송 비트율의 증가 없이 송신하므로 점유한 채널의 대역폭 또는 데이터 전송속도에 영향을 주지 않으면서 발생된 손실을 신속하게 복구하는 산업적 이용효과가 있다. The present invention having the above-described configuration transmits the surplus information for recovering the loss of packet data transmitted through the Internet without increasing the overall transmission bit rate, thereby reducing the loss generated without affecting the bandwidth or data transmission rate of the occupied channel. There is an industrial benefit of rapid recovery.

또한, 상기와 같은 구성의 본 발명은 인터넷에 접속된 수신측에서 음질평가한 결과를 이용하여 송신측에서 가변비트율과 데이터 유형을 결정하여 송신하므로 발생된 손실을 정확하고 효율적으로 은닉하여 음성급 신호의 품질을 높이는 사용상 편리한 효과가 있다. In addition, according to the present invention, the variable bit rate and data type are determined and transmitted at the transmitting side by using the result of sound quality evaluation at the receiving side connected to the Internet. There is a convenient effect to increase the quality of.

이하, 상기와 같은 구성의 본 발명에 의한 것으로, 음성통신의 패킷 손실을 은닉하는 단말장치 및 방법에 대한 바람직한 실시 예를 첨부된 도면을 참조하여 상세히 설명한다. DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS Hereinafter, preferred embodiments of a terminal apparatus and method for concealing packet loss of voice communication according to the present invention having the above configuration will be described in detail with reference to the accompanying drawings.

실시 예Example

본 발명을 설명하기 위하여 첨부된 것으로, 도 1 은 본 발명이 적용되는 인터넷 시스템의 일례에 의한 전체 기능 구성도 이고, 도 2 는 본 발명의 일례에 의 한 것으로 음성통신의 패킷 손실을 은닉하는 단말장치 기능 구성도 이며, 도 3 은 본 발명의 설명을 위한 것으로 일반적인 오디오 패킷의 상세 필드 구성도 이고, 도 4 는 본 발명의 일례에 의한 것으로 전송되는 오디오 패킷의 상세 필드 구성도 이며, 도 5 는 본 발명의 일례에 의한 것으로 수신단에서의 실시간 음질평가 방식 도시도 이고, 도 6 은 본 발명의 일례에 의한 것으로 송신단이 수신되어 피드백된 음질평가 결과에 따라 잉여 정보 인코딩을 차별적으로 수행하는 방식 설명도 이며, 도 7 은 본 발명의 일례에 의한 것으로 장치 사이에서 송수신 되는 패킷의 버퍼링과 음질평가 결과를 측정하여 피드백하는 상태 설명도 이고, 도 8 은 본 발명의 일 실험에 의하여 검출된 결과 도시도 이며, 도 9 는 본 발명의 일례에 의하여 음성통신의 패킷 손실을 은닉하는 단말장치에 의한 송신단 운용방법 순서도 이고, 도 10 은 본 발명의 일례에 의하여 음성통신의 패킷 손실을 은닉하는 단말장치에 의한 수신단 운용방법 순서도 이다. 1 is an overall functional configuration diagram according to an example of the Internet system to which the present invention is applied, and FIG. 2 is a terminal for concealing packet loss of voice communication according to an example of the present invention. Fig. 3 is a detailed field configuration diagram of a general audio packet for explaining the present invention, and Fig. 4 is a detailed field configuration diagram of an audio packet transmitted according to an example of the present invention. FIG. 6 is a diagram illustrating a real-time sound quality evaluation method according to an exemplary embodiment of the present invention, and FIG. 6 is a diagram illustrating a method of differentially performing surplus information encoding according to a sound quality evaluation result that a transmitter receives and is fed back according to an example of the present invention. FIG. 7 illustrates an example of the present invention, which measures buffering and sound quality evaluation results of packets transmitted and received between devices. FIG. 8 is a diagram illustrating a result detected by an experiment of the present invention, and FIG. 9 is a flowchart of a method of operating a transmitting end by a terminal device concealing packet loss of voice communication according to an example of the present invention. FIG. 10 is a flowchart illustrating a method of operating a receiving end by a terminal device for concealing packet loss of voice communication according to an example of the present invention.

본 발명의 일례를 설명함에 있어서, 본 발명과 직접적으로 관련 없고, 잘 알려져 있는 기술 내용에 대하여서는 도면 도시 및 설명을 생략하므로, 본 발명의 요지를 흐리지 않고 명확하게 전달한다. In describing an example of the present invention, drawings and descriptions of well-known technical contents that are not directly related to the present invention are omitted, and thus the present invention is clearly communicated without obscuring the gist of the present invention.

도 1 을 참조하여 본 발명이 적용되는 인터넷 시스템을 설명하면, 음성신호를 입력하여 디지털 데이터로 샘플링하고 패킷 단위로 변환하고 UDP/IP 및 RTP 프로토콜 등을 포함하여 이용하며 송수신하는 단말장치(100, 100‘)와 인터넷(200)이 포함되는 구성이다. Referring to FIG. 1, the Internet system to which the present invention is applied includes a terminal device 100 that inputs and receives voice signals, converts them into digital data, converts them into packet units, uses UDP / IP and RTP protocols, etc. 100 ') and the Internet 200 are included.

상기 단말장치(100, 100‘)는 송신단(110)과 수신단(120)이 포함되는 구성이다. The terminal device 100, 100 ′ is configured to include a transmitting end 110 and a receiving end 120.

상기 송신단(110)은, 입력되는 음성신호를 샘플링하고, 확인된 음질평가 결과로부터 분석된 가변비트율로 잉여 정보를 포함하여 인코딩하며, 측정된 품질평가 결과를 포함하는 실시간 포맷으로 변환하고 패킷 처리된 신호로 인터넷(200)에 송신 한다. The transmitter 110 samples the input audio signal, encodes the received audio signal with the variable bit rate analyzed from the confirmed sound quality evaluation result, converts it into a real time format including the measured quality evaluation result, and processes the packet. The signal is transmitted to the Internet 200.

상기 수신단(120)은 인터넷(200)에 접속되어 패킷을 수신하고, 상대방의 송신단에서 전송한 음질평가 결과를 추출하여 확인하며, 상기 확인된 음질평가 결과를 상기 송신단(110)에 피드백하고, 상기 음질평가 결과를 분석하여 수신된 패킷의 인코딩된 가변비트율을 확인하여 디코딩하며, 순서번호대로 버퍼에 기록하여 지터(JITTER)를 제거하고, 설정된 소정 숫자 주기로 누적된 패킷으로부터 음질평가 결과를 측정하는 동시에 디코딩된 패킷을 순서대로 오디오 신호로 변환하여 출력한다. The receiver 120 is connected to the Internet 200 to receive a packet, extracts and confirms a sound quality evaluation result transmitted from the other party's transmitter, and feeds back the confirmed sound quality evaluation result to the transmitter 110, Analyze the results of the sound quality evaluation to identify and decode the encoded variable bit rate of the received packets, remove the jitter by recording them in the buffer in sequence number, and measure the sound quality evaluation results from the accumulated packets at a predetermined number of periods. The decoded packets are converted into audio signals in order and output.

상기 인터넷(INTERNET)(200)은 전 세계적으로 분산된 다수의 컴퓨터 또는 허브(HUB)가 서로 그물 또는 망(NETWORK)과 같이 상호 연결되고 패킷 데이터를 교환하여 통신하는 것으로 지정된 상대방을 선택하여 대량의 정보를 송수신한다. The INTERNET 200 selects a counterpart designated by a plurality of computers or hubs distributed around the world, designated as communicating with each other by exchanging packet data with each other, such as a net or a network. Send and receive information.

상기 인터넷(200)을 통하여 패킷(PACKET)을 전송하는 프로토콜에는 TCP(TRANSMISSION CONTROL PROTOCOL)와 UDP(USER DATAGRAM PROTOCOL)가 있으며, 상 기 인터넷(200)에 접속된 상대방을 구분하고 인식하기 위한 기술에는 IP(INTERNET PROTOCOL) 프로토콜이 있다. Protocols for transmitting a packet through the Internet 200 include TCP (TRANSMISSION CONTROL PROTOCOL) and UDP (USER DATAGRAM PROTOCOL), and a technique for distinguishing and recognizing the other party connected to the Internet 200 includes: There is an IP (INTERNET PROTOCOL) protocol.

상기 IP는 망으로 이루어진 인터넷(200)에서 송신자와 수신자를 구분하는 것으로, 숫자와 영문자를 이용하는 주소 정보이다. The IP distinguishes a sender and a receiver in the Internet 200 formed of a network, and is IP address information using numbers and English letters.

상기 TCP는 디지털 데이터 정보를 소정 단위의 패킷(PACKET) 또는 데이터그램(DATAGRAM)(이하 패킷이라 한다)으로 분할하고 각각에 일련번호를 부여하여 신뢰성이 확보된 상태에서 송수신 되도록 하는 것으로, 망 연결에 의하여 분할된 각각의 패킷 정보가 주어진 일련번호에 의한 순서를 바꾸어 수신되는 경우 순서에 맞추어 조립하며, IP에 의하여 지정된 상대방 수신자가 전송된 패킷 데이터를 오류 없이 수신하였는지 확인하고, 오류가 발생한 상태를 확인한 경우는 해당 일련번호의 패킷을 다시 송신하는 방식을 사용하여 전송되는 데이터에 신뢰성을 확보한다. The TCP divides the digital data information into a packet or a datagram (hereinafter referred to as a packet) of a predetermined unit and assigns a serial number to each packet so that the data can be transmitted and received in a reliable state. If the packet information divided by each is received by changing the order by the given serial number, it is assembled according to the order. Check whether the other party's receiver designated by IP received the transmitted packet data without error, and check the status of the error. In this case, the reliability of the transmitted data is ensured by using a method of transmitting the packet of the serial number again.

상기 UDP는 분할된 각 패킷에 순서번호를 부여하지 않은 것이 큰 차이이고, 전송되는 데이터의 양 또는 크기가 매우 작은 경우에 많이 사용된다. The UDP is a big difference in not assigning a sequence number to each divided packet, and is widely used when the amount or size of data to be transmitted is very small.

이러한 UDP는 패킷 데이터를 신속하게 전송하는 장점 등에 의하여 실시간 통신에 의한 미디어 전송에서 주로 사용하지만, 손실이 발생된 패킷을 재전송하는 과정이 없으므로 손실이 발생하는 경우 패킷의 손실을 은닉하는(PACKET LOSS CONCEALMENT: PLC) 기술을 사용한다. Although UDP is mainly used for media transmission by real time communication due to the advantage of rapidly transmitting packet data, there is no process of retransmitting a packet in which loss occurs. Therefore, a packet loss conceals packet loss when a loss occurs. : PLC) technology.

상기 PLC 기술은 손실된 패킷 대신에 이전 패킷을 재생하므로 수신자가 패킷 손실을 느끼지 못하게 하는 기술이다. The PLC technology is a technology that does not allow the receiver to feel packet loss because the previous packet is played instead of the lost packet.

본 발명에서는 기본적으로 UDP 프로토콜을 이용하면서 패킷의 순서를 지정하 기 위하여 RTP(REALTIME TRANSPORT PROTOCOL) 프로토콜을 이용한다. In the present invention, RTP (REALTIME TRANSPORT PROTOCOL) protocol is basically used to specify the order of packets while using the UDP protocol.

도 2 를 참조하여, 본 발명의 일례에 의한 것으로, 음성통신의 패킷 손실을 은닉하는 단말장치를 설명하면, 송신단(110)과 수신단(120)으로 구분된다. Referring to FIG. 2, a terminal device for concealing packet loss of voice communication according to an example of the present invention will be divided into a transmitting end 110 and a receiving end 120.

상기 본 발명에 의한 것으로, 음성통신의 패킷 손실을 은닉하는 단말장치는 전진 에러 정정(FEC) 방식 중에 하나인 잉여 정보 코딩 방식을 이용한다. According to the present invention, the terminal apparatus concealing packet loss of voice communication uses a redundant information coding scheme, which is one of the forward error correction (FEC) schemes.

상기 송신단(110)은 사운드 캡쳐부(111), 오디오 인코딩부(112), 실시간 포맷부(113), 실시간 패킷부(114)를 포함하여 구성되고, 상기 수신단(120)은 실시간 수신부(121), 실시간 분석부(122), 오디오 디코더부(123), 버퍼부(124), 품질측정부(125), 오디오 렌더부(126)를 포함하는 구성이다. The transmitter 110 includes a sound capture unit 111, an audio encoder 112, a real time format unit 113, and a real time packet unit 114, and the receiver 120 includes a real time receiver 121. , A real time analyzer 122, an audio decoder 123, a buffer 124, a quality measurer 125, and an audio renderer 126.

상기 송신단(110)의 사운드 캡쳐부(111)는 입력되는 오디오 신호를 설정된 소정 주파수로 샘플링(SAMPLING)하므로 디지털 신호로 입력한다. The sound capture unit 111 of the transmitter 110 samples the input audio signal with a predetermined frequency and inputs it as a digital signal.

상기 오디오 인코딩부(112)는 상기 사운드 캡쳐부(111)가 입력한 오디오 신호를 인코딩(ENCODING)하는 것으로, 수신단(120)의 실시간분석부(122)로부터 피드백(FEEDBACK)되어 제공되는 확인된 음질평가 결과를 분석하여 가변비트율을 결정하고 결정된 가변비트율에 의하여 입력된 오디오(AUDIO) 신호를 인코딩한다. The audio encoding unit 112 encodes an audio signal input by the sound capture unit 111, and is provided with feedback from the real-time analyzer 122 of the receiver 120 to provide the confirmed sound quality. The evaluation result is analyzed to determine a variable bit rate and to encode an input audio signal according to the determined variable bit rate.

상기 수신단(120)의 실시간분석부(122)는 인터넷(200)을 통하여 접속된 상대방 단말장치의 송신단(110)으로부터 전송된 오디오 패킷을 분석하므로 기록된 음질평가 결과를 추출하여 확인한다. The real-time analyzer 122 of the receiving end 120 analyzes the audio packet transmitted from the transmitting end 110 of the counterpart terminal device connected through the Internet 200, thereby extracting and confirming the recorded sound quality evaluation result.

즉, 상대방이 통보해준 음질평가 결과 값을 이용하여 상기 오디오 인코딩 부(112)는 송신할 오디오 신호의 인코딩할 비트레이트(BIT RATE) 또는 가변비트율(VARIABLE BIT RATE)을 결정한다. That is, the audio encoding unit 112 determines the bit rate or variable bit rate to be encoded of the audio signal to be transmitted using the sound quality evaluation result value informed by the other party.

상기 가변비트율은 일례로, IETF의 RFC(REQUEST FOR COMMENTS) DRAFT에서 정의된 MPEG-2 AAC의 RTP PAYLOAD FORMAT을 근간으로 하는 가변 비트율이 지원되는 코덱에 의하여 적용된다. For example, the variable bit rate is applied by a codec that supports a variable bit rate based on the RTP PAYLOAD FORMAT of MPEG-2 AAC defined in the IETF RFC (REQUEST FOR COMMENTS) DRAFT.

그러므로 송신단은 패킷 손실의 은닉을 위하여 이전 패킷에 대한 잉여정보를 전송하게 되는 것으로, 즉, 현재 패킷과 이전 패킷의 잉여 정보의 전송 비트율을 각각 조절하여 전체 비트율이 증가되지 않고 유지되도록 한다. Therefore, the transmitting end transmits surplus information about the previous packet to conceal packet loss, that is, by adjusting the transmission bit rates of the surplus information of the current packet and the previous packet, respectively, so that the overall bit rate is maintained without increasing.

일례로, 상기 음질평가 결과 값이 소정 기준 이상인 경우, 송신할 패킷의 프레임에 상기 입력된 현재의 오디오 신호를 인코딩하여 첨가하고, 상기 음질평가 결과 값이 설정된 범위의 값인 경우는 전체 비트율이 증가하지 않는 상태에서 비트율을 가변시켜, 상기 현재의 오디오 신호와 이전 패킷에 의한 오디오 신호를 각각 동일한 비트율로 인코딩한다. For example, when the sound quality evaluation result value is greater than or equal to a predetermined criterion, the input current audio signal is encoded and added to a frame of a packet to be transmitted, and when the sound quality evaluation result value is a value within a set range, the overall bit rate does not increase. In this case, the bit rate is varied so that the current audio signal and the audio signal of the previous packet are encoded at the same bit rate.

또한, 상기 음질평가 결과 값이 설정된 소정 기준 이하인 경우에도 전체 비트율이 증가하지 않는 상태에서 비트율을 가변시켜, 상기 현재의 오디오 신호를 위한 패킷(N)에 절반의 비트율을 할당하고 이전 패킷(N-1)과 그 이전 패킷(N-2)에 의한 오디오 신호에 각각 동일한 비트율을 할당하여 인코딩한다. In addition, even when the result of the sound quality evaluation is less than or equal to a predetermined criterion, the bit rate is varied while the overall bit rate does not increase, and half of the bit rate is allocated to the packet N for the current audio signal. The same bit rate is allocated to the audio signal by 1) and the previous packet N-2, and encoded.

상기와 같은 일례는 첨부된 도 6 의 설명에서 상세하게 다시 설명한다. Such an example will be described in detail again in the accompanying description of FIG. 6.

상기 실시간 포맷부(113)는 상기 오디오 인코딩부(112)가 전체 비트율이 증가하지 않는 상태에서 상대방 단말장치가 송신한 음질평가 결과 값을 확인하여 현 재 오디오 패킷으로 인코딩하거나 또는 현재 오디오 패킷과 이전의 오디오 패킷을 인코딩하거나 또는 현재 오디오 패킷과 이전 및 그 이전의 오디오 패킷을 인코딩한 신호를 실시간 포맷(RTP PAYLOAD FORMAT)으로 변환한다. The real-time format unit 113 checks the sound quality evaluation result value transmitted from the counterpart terminal device while the audio encoding unit 112 does not increase the overall bit rate, and encodes the current audio packet or transfers the current audio packet to the previous one. Encode an audio packet or convert a signal that encodes a current audio packet and a previous and previous audio packet into a real-time format (RTP PAYLOAD FORMAT).

또한, 상기의 변환과 동시에 수신단(120)의 품질측정부(125)에서 상대방 단말장치가 송신한 오디오 패킷의 이전 스트림(STREAM) 까지 수신하여 측정한 음질평가 결과 값을 상기 변환된 실시간 포맷의 예측 영역(PREDICTION FIELD) 또는 MOS(MEAN OPINION SCORE) 영역에 포함 기록한다. In addition, at the same time as the above conversion, the quality measurement unit 125 of the receiver 120 receives the previous stream STREAM of the audio packet transmitted from the counterpart terminal device and estimates the sound quality evaluation result value of the converted real-time format. Record in the PREDICTION FIELD or MOS (MEAN OPINION SCORE) area.

상기 실시간 포맷부(113)에 의하여 실시간 전송에 적합한 포맷으로 변환하는 것은 패킷의 연속성 및 패킷 손실의 복원에 필요한 정보와 다중 오디오 패킷의 전송 및 그에 대한 정보를 제공하므로 가능하다. The conversion by the real-time format unit 113 into a format suitable for real-time transmission is possible because it provides information necessary for continuity of packets and restoration of packet loss, and transmission of multiple audio packets and information on the same.

상기 실시간 포맷부(113)가 포맷한 RTP PAYLOAD FORMAT은 상대방에서 수신하여 예측필드를 분석하므로 잉여 정보의 인코딩 상태와 잉여 정보의 유형(TYPE), 즉, 이전 패킷이 하나가 포함되었는지 또는 두 개가 포함되었는지의 유형을 확인하게 된다. Since the RTP payload format formatted by the real time formatter 113 analyzes the prediction field received from the counterpart, the encoding state of the surplus information and the type (type) of the surplus information, that is, includes one or two previous packets. You will be asked to confirm the type.

상기 실시간 패킷부(114)는 상기 실시간 포맷부(113)가 제공하는 패킷의 신호를 최종적으로 RTP 패킷으로 변환하여 인터넷(200)에 전송한다. The real time packet unit 114 finally converts a signal of a packet provided by the real time format unit 113 into an RTP packet and transmits it to the Internet 200.

이때 상기 송신단(110)은 UDP/IP 프로토콜을 적용하고 있으나, 전송되는 패킷의 순서를 위하여 RTP 프로토콜을 다시 적용한다. At this time, the transmitter 110 applies the UDP / IP protocol, but applies the RTP protocol again for the order of the transmitted packets.

상기 수신단(120)의 실시간 수신부(121)는 인터넷(200)으로부터 수신된 RTP 패킷을 수신한다. 이때 송신단(110)과 반대로 RTP 프로토콜을 먼저 적용하고 UDP 프로토콜을 적용한다. The real-time receiver 121 of the receiver 120 receives an RTP packet received from the Internet 200. In this case, as opposed to the transmitter 110, the RTP protocol is first applied and then the UDP protocol is applied.

상기 실시간 분석부(122)는 상기 실시간 수신부(121)가 제공하는 신호를 분석하여 음질평가 결과 값을 추출하여 확인하고, 상기 확인된 음질평가 결과 값을 송신단(110)의 오디오 인코딩부(112)에 피드백한다. The real-time analyzer 122 analyzes a signal provided by the real-time receiver 121 to extract and confirm a sound quality evaluation result value, and checks the checked sound quality evaluation result value in the audio encoding unit 112 of the transmitter 110. Feedback to.

상기 확인된 음질평가 결과 값은 상대방 송신단에서 전송한 것으로, 상대방 송신단이 인코딩한 잉여정보 및 잉여 정보의 유형 등을 분석하여 가변비트율에 적용되는 것으로 MPEG-2 AAC가 지원되는 코덱에 적용되는 값이 된다. The confirmed sound quality evaluation result value is transmitted from the other party's transmitter, and is applied to a variable bit rate by analyzing the excess information encoded by the other party's transmitter and the type of the redundant information, and the value applied to a codec supported by MPEG-2 AAC. do.

상기 오디오 디코더부(123)는 상기 실시간 분석부(122)로부터 확인된 음질평가 결과 값과 수신된 패킷을 인가받고, 상대방 송신단에서 인코딩(ENCODING)한 가변비트율을 확인하며, 상기 확인된 가변비트율로 상기 인가된 패킷을 디코딩(DECODING)한다. The audio decoder 123 receives the sound quality evaluation result value and the received packet received from the real-time analyzer 122, checks a variable bit rate encoded by the other party's transmitter, and checks the variable bit rate. DECODING the applied packet.

상기 버퍼부(124)는 상기 오디오 디코더부(123)가 디코딩한 오디오 패킷을 RTP 프로토콜에 의한 순서번호(SEQUENCE NO.)가 지정하는 순서대로 버퍼에 기록한다. 상기 버퍼부에 기록하므로, 인터넷(200)에서 발생하는 지터(JITTER)의 지연변이를 제거하게 된다. The buffer unit 124 records the audio packet decoded by the audio decoder unit 123 in the buffer in the order designated by the sequence number (SEQUENCE NO.) According to the RTP protocol. Since the data is written to the buffer unit, the delay variation of jitter generated in the Internet 200 is eliminated.

상기 버퍼부(124)에 의하여 지터가 제거된 오디오 패킷은 품질측정부(125)에 제공되어 설정된 소정의 숫자의 오디오 패킷이 반복적으로 누적되는 동시에 상기 오디오 렌더부(126)에는 일정하고 규칙적인 시간 간격으로 제공된다. The jitter has been removed by the buffer unit 124 is provided to the quality measuring unit 125 and a predetermined number of audio packets are repeatedly accumulated, and the audio render unit 126 has a constant and regular time. Are provided at intervals.

상기 품질측정부(125)는 오디오 패킷이 설정된 소정 숫자로 누적되는 경우, 수신된 오디오 패킷에 의한 음질평가를 측정하고, 상기 측정된 음질평가 결과 값을 송신단(110)의 실시간 포맷부(113)에 피드백하여 기록되도록 한다. When the audio packet is accumulated by a predetermined number, the quality measuring unit 125 measures the sound quality evaluation by the received audio packet, and transmits the measured sound quality evaluation result value to the real time format unit 113 of the transmitter 110. Feedback should be recorded to.

상기 품질측정부는 아이티유티(ITU-T: INTERNATIONAL TELECOMMUNICATION UNION-TELECOMMUNICATION STANDARDIZATION SECTOR) 표준인 피 563(P.563)과 같은 논 인트루시브(NON-INTRUSIVE) 방식을 이용하여 객관적으로 음질을 평가한다. The quality measuring unit objectively evaluates sound quality using a non-intrusive method such as P. 563 (P.563), which is an INTERNATIONAL TELECOMMUNICATION UNION-TELECOMMUNICATION STANDARDIZATION SECTOR (ITU-T) standard.

상기 평가의 결과는 1 내지 5 사이의 MOS(MEAN OPINION SCORE) 값으로 표현된다. The result of the evaluation is expressed as a MEAN OPINION SCORE value between 1 and 5.

상기 MOS 값은 0 에서 1 까지는 BAD, 2 까지는 POOR, 3 까지는 FAIR, 4 까지는 GOOD, 5 까지는 EXCELLENT 상태를 의미한다. 일반적으로 3.6의 값이면 음성통신이 가능하다. The MOS value means 0 to 1 BAD, 2 to POOR, 3 to FAIR, 4 to GOOD, and 5 to EXCELLENT. In general, if the value is 3.6, voice communication is possible.

상기 오디오 렌더부(126)는 규칙적으로 인가되는 오디오 패킷을 재생하여 일정하게 연속상태로 출력한다. The audio renderer 126 reproduces the audio packet regularly applied and outputs it in a continuous state constantly.

상기와 같이 오디오 신호를 양방향으로 실시간 전송하므로 통신하는 장비를 오디오 컨퍼런싱 시스템(AUDIO CONFERENCING SYSTEM: ACS) 이라고도 한다. As described above, since the audio signal is transmitted in real time in both directions, the communicating device is also referred to as an audio conferencing system (ACS).

도 3 을 참조하여, 본 발명의 설명을 위한 것으로 일반적인 오디오 패킷의 상세 필드 구성을 설명하면, IETF의 RFC(REQUEST FOR COMMENTS) DRAFT에서 정의된 MPEG-2 및 MPEG-4 AAC 오디오를 위한 것이며, RTP PAYLOAD FORMAT을 기준으로 음질평가 방법을 교환하는 원래의 포맷이다. Referring to FIG. 3, the detailed field configuration of a general audio packet will be described for the purposes of the present invention, and is for MPEG-2 and MPEG-4 AAC audio defined in IETF RFC (REQUEST FOR COMMENTS) DRAFT. This is the original format for exchanging sound quality evaluation methods based on PAYLOAD FORMAT.

상기 포맷(FORMAT)은 하나의 패킷(PACKET) 또는 다중경로를 통하여 전송됨에 의하여 하나 이상의 패킷이 될 수 있으며, 상기 패킷(PACKET)에는 다수의 프레 임(FRAME)을 포함하고 있으나, 이하 설명을 간단하게 하고 용이하게 이해하도록 하기 위하여 프레임으로 구분하여야 할 필요가 없는 경우에는 프레임을 포맷으로 기재 및 설명하기로 한다. The format may be one or more packets by being transmitted through one packet or multipath, and the packet includes a plurality of frames, but the following description is brief. In the case where it is not necessary to distinguish the frame to make it easy and easy to understand, the frame will be described and described in a format.

상기 도 3 의 포맷은 첫째, 프레임이 인터넷을 통하여 전송되는 과정에서 손실되는 경우에 복원하기 위한 정보와 둘째, 프레임이 여러 RTP 패킷에 분할(FRAGMENTATION) 되었는지의 정보와 셋째, 프레임을 그룹화 및 인터리빙(INTERLEAVING) 했는지에 대한 정보와 넷째, 실제 오디오 프레임의 정보 데이터를 포함하여 제공하므로, 패킷의 손실 은닉 및 동기화 능력을 제공하는 구성이다. The format of FIG. 3 includes first, information for restoring when a frame is lost in the process of being transmitted through the Internet, second, information on whether the frame is divided into several RTP packets, and third, grouping and interleaving the frame. Fourth, since it provides the information on whether the INTERLEAVING) and the information data of the actual audio frame is included, it is a configuration that provides the ability to conceal the packet loss and synchronization.

상기 도 3 을 상세히 설명하면, PVS 영역(301)은 32 비트의 PRD VECTOR 영역(305, 306, 307)이 몇 개 인지를 표시한다. 3, the PVS region 301 indicates how many 32-bit PRD VECTOR regions 305, 306, and 307 are present.

상기 PRD VECTOR 영역(305, 306, 307)은 각 프레임에 대한 PQ(PREDICTABILITY QUANTIFIER) 정보를 비트 이동으로 합쳐 놓은 것으로, “0”의 값인 경우는 다른 AAC 프레임을 이용하여 손실된 정보를 예측할 수 없는 상태이고, “1”의 값인 경우는 이전 프레임 또는 이전과 이후의 양 프레임으로부터 예측할 수 있는 상태이며, “2”의 값인 경우에는 이전 프레임 또는 다음 프레임 또는 이전 및 이후의 양 프레임으로부터 예측할 수 없는 상태를 표시한다. The PRD VECTOR regions 305, 306, and 307 combine PQ (PREDICTABILITY QUANTIFIER) information for each frame by bit shift. If the value is “0”, the lost information cannot be predicted using another AAC frame. If the value is “1”, the state can be predicted from the previous frame or both frames before and after. If the value is “2”, the state is unpredictable from the previous frame, the next frame, or both frames before and after. Is displayed.

M 영역(302)은 현재 패킷에 몇 개의 프레임이 연속되어 있는지를 표시하는 것으로, 프레임이 그룹화 또는 인터리빙 되어 있는지를 표시한다. The M area 302 indicates how many frames are contiguous in the current packet, and indicates whether the frames are grouped or interleaved.

F 영역(303)은 현재 패킷에 포함된 프레임의 분할 여부는 표시하는 것으로, 프레임이 여러 RTP 패킷에 분할되어 있는지를 표시한다. The F area 303 indicates whether the frame included in the current packet is divided, and indicates whether the frame is divided into several RTP packets.

TYPE 영역(308)은 데이터의 타입을 표시하는 것으로, ‘0’의 값이면 인코딩된 원본 데이터이고, ‘1’의 값이면 인코딩된 원본 데이터이지만 패킷 복원을 위한 잉여 정보 데이터이며, ‘2’의 값이면 비트율 감소를 위하여 높은 압축률로 압축 되어 인코딩된 데이터를 표시한다. The TYPE area 308 indicates the type of data. If the value is '0', the encoded original data is encoded. If the value is '1', the encoded original data is encoded original data but is redundant information data for packet recovery. A value indicates the encoded data compressed at a high compression rate to reduce the bit rate.

SEQ 영역(309)은 프레임의 순서번호를 표시하고, LENGTH 영역(310)은 프레임의 길이를 바이트(BYTE) 수로 표시한다. The SEQ area 309 indicates the sequence number of the frame, and the LENGTH area 310 indicates the length of the frame in bytes.

도 4 를 참조하여 본 발명의 일례에 의한 것으로 전송되는 오디오 패킷의 상세 필드 구성을 설명한다. With reference to FIG. 4, the detailed field structure of the audio packet transmitted by the example of this invention is demonstrated.

상기 도 4 는 본 발명을 위하여 수정된 RTP PAYLOAD FORMAT이고, 프레임의 예측정보를 기록하는 영역인 PRD VECTOR 영역에 수신단으로부터 주기적으로 측정되는 음질평가 결과를 MOS 영역(410)에 기록하도록 수정한다. 4 is a modified RTP PAYLOAD FORMAT modified for the present invention, and the sound quality evaluation result periodically measured from the receiving end is recorded in the MOS region 410 in the PRD VECTOR region, which is a region for recording prediction information of the frame.

상기 RTP PAYLOAD FORMAT의 PVS 영역에 기록된 값이 “1” 이상이면 상기 MOS 영역(410)에 기록된 값을 참조하고, 상기 PVS 영역에 기록된 값이 “0”이면 상기 MOS 영역(410)이 값을 참조하지 않는다. If the value recorded in the PVS region of the RTP PAYLOAD FORMAT is equal to or greater than "1", the value recorded in the MOS region 410 is referred to. If the value recorded in the PVS region is "0", the MOS region 410 is determined. Does not reference a value.

상기 MOS 영역(410)에 기록되는 값은 1 내지 5 사이의 값으로, 음질평가 결과가 기록되었는지를 확인할 수 있다. The value recorded in the MOS region 410 is a value between 1 and 5, and it may be confirmed whether a sound quality evaluation result is recorded.

즉, 상기 MOS 영역(410)에 기록된 값이 1 이상이면, PRD VECTOR 가 존재하고 그에 따른 음질평가 결과 값이 피드백되었는지를 확인한다. That is, if the value recorded in the MOS region 410 is 1 or more, it is checked whether the PRD VECTOR exists and the sound quality evaluation result value is fed back.

송신단에서 이전 프레임에 대한 잉여 정보를 전송하게 되는 경우에는 메인 프레임(420)의 다음 순서인 오디오 데이터 영역(430, 440)을 그룹화하여 전송한다. When the transmitter transmits the surplus information for the previous frame, the audio data regions 430 and 440 which are the next order of the main frame 420 are grouped and transmitted.

도 5 를 참조하여 본 발명의 일례에 의한 것으로 수신단에서의 실시간 음질평가 방식을 설명한다. Referring to Figure 5 by way of example of the present invention will be described the real-time sound quality evaluation method at the receiving end.

상기 도 5 는 송신단에서 잉여 정보를 인코딩하는 가변비트율을 결정하기 위한 것으로, 상기 가변비트율을 분석할 수 있는 정보로 활용되는 음질평가 결과를 확보하는 것이다. 5 is for determining a variable bit rate for encoding the surplus information at the transmitter, to ensure a sound quality evaluation result used as information for analyzing the variable bit rate.

본 발명에 의한 단말장치의 수신단(120)은 수신된 RTP 패킷을 분석하고(510), RTP PAYLOAD FORMAT을 분석하며(520), 압축되어 전송된 오디오 데이터를 디코딩한다(S530). The receiver 120 of the terminal device according to the present invention analyzes the received RTP packet (510), analyzes the RTP PAYLOAD FORMAT (520), and decodes the compressed and transmitted audio data (S530).

이때 패킷 손실이 발생하는 경우, 디코딩할 데이터가 NULL 상태가 되어 디코딩 및 재생과정이 생략되며, 다음 순서의 음질평가 측정은 손실된 패킷을 포함한 총 N 개의 패킷이 도달하여 생성된 RAW 오디오 데이터가 생성될 때마다 음질평가를 측정한다. In this case, if packet loss occurs, the data to be decoded is NULL, and the decoding and playback process are skipped. In the next step, the sound quality measurement is performed by generating RAW audio data generated by the arrival of N packets including the lost packet. Whenever possible, measure the sound quality.

상기 MOS로 표현되는 음질평가 측정을 위하여 광대역의 오디오 신호를 다운 샘플링하고(540), 상기 다운 샘플링된 오디오 데이터는 실제 음질평가 블록(550)으로 입력된다. The downsampled audio signal of the wideband audio signal is measured 540 for the sound quality evaluation measurement represented by the MOS, and the down sampled audio data is input to the actual sound quality evaluation block 550.

상기 음질평가 방법은 ITU-T 표준 P. 563에 의한 것으로, 참조음원(REFERENCE) 없이 독립적으로 음질평가를 수행할 수 있는 논 인트루시브(NON - INTRUSIVE) 평가방법을 이용한다. The sound quality evaluation method is based on the ITU-T standard P. 563, and uses a non-intrusive evaluation method that can independently perform sound quality evaluation without a reference sound source.

상기 구성은 최종적으로 N 개의 RTP 패킷을 수신하고 하나의 MOS 값을 출력한다. The configuration finally receives N RTP packets and outputs one MOS value.

도 6 을 참조하여 본 발명의 일례에 의한 것으로 송신단이 수신되어 피드백된 음질평가 결과에 따라 잉여 정보 인코딩을 차별적으로 수행하는 방법을 설명한다. Referring to FIG. 6, a method of differentially performing surplus information encoding according to a sound quality evaluation result fed back and received by a transmitting end will be described.

상기 도 6 을 상세히 설명하면, 일례로, N 번째로 전송하는 패킷의 측정된 음질평가 결과 값(MOS m)이 음질평가 제 1 기준 값(MOS 1) 보다 크거나 같은 경우, 채널에 할당된 비트율(BITRATE f)을 모두 사용하는 상태가 위에 도시되어 있다. Referring to FIG. 6 in detail, for example, when the measured sound quality evaluation result value (MOS m) of the Nth transmitted packet is greater than or equal to the sound quality evaluation first reference value (MOS 1), the bit rate allocated to the channel The state of using (BITRATE f) altogether is shown above.

가운데 도시된 것은, 측정된 음질평가 결과 값(MOS m)이 음질평가 제 1 기준 값(MOS 1) 보다 작은 동시에 음질평가 제 2 기준 값(MOS 2)보다 크거나 같은 경우, 채널에 할당된 비트율(BITRATE f)을 N 번째로 전송하는 패킷을 전송할 비트율(BITRATE p)과 잉여정보로 사용되는 N-1 번째 전송한 패킷을 전송할 비트율(BITRATE r)로 양분한다. Shown in the middle is the bit rate assigned to the channel if the measured sound quality evaluation result value MOS m is less than the sound quality first reference value MOS 1 and greater than or equal to the sound quality second reference value MOS 2. It divides the bit rate (BITRATE f) to transmit the N-th packet to the bit rate (BITRATE p) and the bit rate (BITRATE r) to transmit the N-1th transmitted packet used as surplus information.

상기 N 번째로 전송하는 패킷에 할당되는 비트율 “BITRATE f”를 “BITRATE p”로 가변하고, 잉여정보로 사용될 N-1 번째 전송한 패킷을 위하여 “BITRATE r”의 비트율을 할당한다. 즉, BITRATE r = BITRATE f - BITRATE p 가 된다. The bit rate “BITRATE f” allocated to the Nth transmitted packet is changed to “BITRATE p”, and a bit rate of “BITRATE r” is allocated for the N−1th transmitted packet to be used as surplus information. That is, BITRATE r = BITRATE f-BITRATE p.

다시 설명하면, N-1번째 패킷에 손실이 발행한 경우, 패킷을 전송하는 채널에 할당된 전송속도를 양분하여, N 번째 패킷에 1/2의 전송속도를 할당하고 N-1의 패킷에 1/2의 전송속도를 할당하여 동시 전송하므로 수신단에서 발생된 손실을 은 닉하도록 한다. In other words, when a loss occurs in the N-1 th packet, the transmission rate assigned to the channel for transmitting the packet is divided into two, the 1/2 transmission rate is allocated to the N th packet, and the packet is transmitted to the N-1 packet. Since the transmission rate is assigned at the rate of / 2, the transmission loss is concealed.

아래에 도시된 것은, 측정된 N 번째 패킷의 음질평가 결과 값(MOS m)이 음질평가 제 2 기준 값(MOS 2) 보다 작은 경우, 채널에 할당된 비트율(BITRATE f)의 1/2 값인 BITRATE p는 N 번째 패킷을 위하여 할당한다. Shown below is BITRATE, which is one half of the bit rate (BITRATE f) assigned to the channel when the quality evaluation result value (MOS m) of the measured Nth packet is smaller than the quality evaluation second reference value (MOS 2). p is allocated for the Nth packet.

상기 손실된 패킷을 은닉하는 잉여정보를 위하여 할당한 비트율(BITRATE r)은 다시 제 1 비트율(BITRATE r1)과 제 2 비트율(BITRATE r2)로 양분하고, 제 1 비트율(BITRATE r1)은 N-1 번째 패킷을 위하여 할당하며, 제 2 비트율(BITRATE r2)은 N-2 번째 패킷을 위하여 할당한다. The bit rate (BITRATE r) allocated for the redundant information concealing the lost packet is divided into a first bit rate (BITRATE r1) and a second bit rate (BITRATE r2), and the first bit rate (BITRATE r1) is N-1. The second bit rate (BITRATE r2) is allocated for the N-2 th packet.

즉, BITRATE r = BITRATE r1 + BITRATE r2, BITRATE f = BITRATE p + BITRATE r 이 된다. That is, BITRATE r = BITRATE r1 + BITRATE r2, and BITRATE f = BITRATE p + BITRATE r.

상기와 같이 채널에 할당된 전체 비트율 또는 전송속도를 증가키시지 않는 상태에서, 현재의 N 번째 패킷과 손실이 발생한 패킷을 은닉하기 위한 잉여정보로 사용되는 N-1 번째 패킷의 비트율을 가변시켜 동시에 전송한다. 또한, 손실이 크게 발생한 경우는 N 번째와 N-1 번째와 N-2 번째 패킷의 각각의 비트율을 가변시켜 동시 전송하지만, N-1 번째와 N-2 번째에 할당된 비트율보다 N 번째 패킷에 많은 비트율을 할당하여 오디오 신호의 품질을 최적상태로 유지시키는 상태가 도시되어 있다. As described above, without increasing the overall bit rate or transmission rate allocated to the channel, the bit rate of the current N-th packet and the N-1 th packet used as surplus information for concealing a lost packet are varied at the same time. send. In the case where a large loss occurs, the respective bit rates of the N th, N-1 th, and N-2 th packets are changed and transmitted simultaneously, but the N th packet is more than the bit rate allocated to the N th th and N-2 th packets. The state of allocating a large number of bit rates to keep the quality of the audio signal optimal is shown.

도 7 을 참조하여 본 발명의 일례에 의한 것으로 장치 사이에서 송수신 되는 패킷의 버퍼링과 음질평가 결과를 측정하여 피드백하는 상태를 설명한다. Referring to FIG. 7, a state of measuring and feeding back buffering and sound quality evaluation results of packets transmitted and received between devices according to an example of the present invention will be described.

송신측(Tx)은 오디오 패킷을 실시간으로 인코딩하고 RTP 프로토콜을 적용하여 수신측(Rx)으로 전송한다. 수신측(Rx)은 수신된 패킷을 디코딩하여 버퍼에 RTP 프로토콜에서 지정한 순서에 의하여 순차적으로 기록한다. The transmitting side Tx encodes the audio packet in real time and transmits it to the receiving side Rx by applying the RTP protocol. The receiving side Rx decodes the received packets and sequentially writes them in the buffer in the order specified by the RTP protocol.

수신측(Rx)은 버퍼에 기록된 패킷의 숫자가 설정되어 지정된 숫자 N이 되면, 음질평가 결과 값인 MOS를 측정하고, 상기 측정된 음질평가 결과 값(MOS)을 송신측(Tx)에 피드백(FEEDBACK)한다. When the number of packets recorded in the buffer is set and becomes the designated number N, the receiving side Rx measures the MOS which is a sound quality evaluation result value, and feeds back the measured sound quality evaluation result value MOS to the transmitting side Tx. FEEDBACK).

송신측은 피드백된 음질평가 결과 값(MOS)에 의하여 다음 순서의 패킷을 RTP 프로토콜 적용으로 전송하지만, 필요한 경우 잉여정보를 위하여 이전에 전송하였던 패킷을 가변비트율 적용으로 함께 전송할 수 있다. The sender transmits packets of the next order according to the feedback quality evaluation result value (MOS) by applying the RTP protocol, but, if necessary, may transmit the previously transmitted packets together with the variable bit rate by applying the redundant information.

상기와 같은 과정이 설정된 소정 숫자 N 주기 마다 반복되는 상태가 도시되어 있다. A state in which the above process is repeated every predetermined number N periods is shown.

실험 예Experiment example

상기와 같은 본 발명은 다자간 음성협업 소프트웨어인 RAT(ROBUST AUDIO TOOL)와 MPEG-2 AAC 공개소스를 이용하여 실험한다. The present invention as described above is experimented using the multi-party voice collaboration software RAT (ROBUST AUDIO TOOL) and MPEG-2 AAC open source.

상기 RAT 구조에 적합하게 MPEG-2 AAC 인코딩과 디코딩 API(APPLICATION PROGRAM INTERFACE) 라이브러리를 구현하여 RAT 프로그램에 추가하므로 오디오 통신이 가능하게 하여 본 발명에 의한 패킷 손실을 은닉하는 단말장치에 적용한다. Since the MPEG-2 AAC encoding and decoding API (APPLICATION PROGRAM INTERFACE) library is implemented in accordance with the RAT structure and added to the RAT program, audio communication is possible and the present invention is applied to a terminal device that conceals packet loss according to the present invention.

송신단에 입력되는 오디오의 음원은 32초의 여성음성이며, 32 KHz로 샘플링 한 SQAM(SOUND QUALITY ASSESSMENT MATERIAL) 스테레오(STEREO) 신호이고 MPEG - 2 AAC의 인코딩 비트율은 146 Kbit/s 로 설정한다. The source of audio input to the transmitter is 32 seconds of female voice, SQAM (SOUND QUALITY ASSESSMENT MATERIAL) stereo (STEREO) signal sampled at 32 KHz, and the encoding bit rate of MPEG-2 AAC is set to 146 Kbit / s.

음질평가를 위한 다운 샘플링 팩터는 1/4로 하여 8 KHz의 모노(MONO) 음질평가용 수신 음성을 확보한다. The down-sampling factor for sound quality evaluation is set to 1/4 to secure the received voice for 8 KHz mono sound quality evaluation.

수신단의 음질평가 결과 측정 주기 N 은 250 패킷에 해당하는 8초로 설정하고, 음성통화가 가능한 음질평가 기준 값(MOS)은 3.6 수준으로 설정하며, 잉여정보 인코딩을 위한 현재 패킷과 이전 패킷의 가변 비트율은 각각 1/2 로 설정한다. The measurement period N of the receiving end of the sound quality evaluation result is set to 8 seconds corresponding to 250 packets, and the sound quality evaluation reference value (MOS) for voice communication is set to 3.6 level, and the variable bit rate of the current packet and the previous packet for encoding the surplus information Are set to 1/2 each.

상기 음원은 실험을 위하여 무손실 상태와 손실상태로 각각 준비하고, 상기 손실 음원의 패턴은 ITU-T 표준 G.191에 정의된 GILBERT-ELLIOT 모델을 이용하여 7 % 손실로 제작한다. The sound source is prepared in a lossless state and a lossy state, respectively, for the experiment, and the pattern of the lossy sound source is produced with 7% loss using the GILBERT-ELLIOT model defined in ITU-T standard G.191.

상기와 같이 손실된 음원을 종래의 송신측 단말장치와 수신측 단말장치를 이용하여 측정하고, 상기 본 발명의 구성에 의한 송신측 단말장치와 수신측 단말장치를 이용하여 다시 측정한 결과가 첨부된 도 8에 도시되어 있다. As a result, the lost sound source is measured using a conventional transmitting terminal device and a receiving terminal device, and the measurement result is again measured using the transmitting terminal device and the receiving terminal device according to the configuration of the present invention. 8 is shown.

도 8을 다시 설명하면, 4회 실험한 것으로, 본 발명이 적용되지 않은 종래의 단말장치에서는 7 % 손실된 음원을 전송하는 경우, 측정된 음질평가 결과 값(MOS)이 평균 2.7 이고, 본 발명이 적용된 단말장치에서는 평균 3.6 이었다. Referring to FIG. 8 again, the experiment was performed four times. In the conventional terminal device to which the present invention is not applied, the measured sound quality evaluation result (MOS) is 2.7 on average when the sound source lost 7% is transmitted. In this terminal device, the average was 3.6.

즉, 본 발명이 적용되는 경우, 종래 기술대비 약 24 %의 음질이 개선된다. That is, when the present invention is applied, the sound quality of about 24% compared with the prior art is improved.

도 9 를 참조하여 본 발명의 일례에 의한 것으로 음성통신의 패킷 손실을 은닉하는 단말장치에 의한 송신단 운용방법을 설명하면, 송신단은 오디오 신호가 인 가되면 설정된 비트율로 샘플링하여 실시간 캡처하므로 입력한다(S100). Referring to FIG. 9, a transmission terminal operating method by a terminal apparatus concealing packet loss of voice communication according to an example of the present invention will be described. S100).

상기와 같이 입력된 오디오 신호는 수신단으로부터 피드백된 것으로 확인된 음질평가 결과 값을 분석하므로 설정된 가변비트율을 확인하고, 상기 확인된 가변비트율로 오디오 신호를 실시간으로 인코딩한다(S110). The input audio signal as described above analyzes the sound quality evaluation result value confirmed to be fed back from the receiving end, thereby checking the set variable bit rate and encoding the audio signal at the determined variable bit rate in real time (S110).

상기 음질평가 결과 값의 분석에 의하여 현재 패킷만을 인코딩하거나 또는 이전 패킷을 포함하고 MPEG-2 AAC에 의한 가변비트율로 함께 인코딩한다. By analyzing the sound quality evaluation result value, only the current packet is encoded or the previous packet is included and encoded together at a variable bit rate by MPEG-2 AAC.

상기 송신단은 인코딩된 오디오 패킷을 RTP PAYLOAD FORMAT으로 포맷하면서, 수신단으로부터 측정된 음질평가 결과 값을 피드백 받아 상기 포맷에 포함시켜 기록한다(S120). 상기 포맷하는 패킷의 구성은 첨부된 도4에 상세히 도시되어 있다. The transmitting end formats the encoded audio packet into the RTP PAYLOAD FORMAT, receives the sound quality evaluation result measured from the receiving end, and includes the feedback in the format (S120). The configuration of the formatted packet is shown in detail in FIG.

상기와 같이 포맷된 패킷은 RTP 프로토콜을 적용하여 패킷화하고 인터넷에 전송되므로 상대방에 송신된다(S130). The packet formatted as described above is packetized using the RTP protocol and transmitted to the other party (S130).

도 10 을 참조하여 본 발명의 일례에 의한 것으로 음성통신의 패킷 손실을 은닉하는 단말장치에 의한 수신단 운용방법을 설명하면, 인터넷을 통하여 오디오 패킷을 RTP 프로토콜에 의하여 실시간 수신한다(S200). Referring to FIG. 10, a method of operating a receiver by a terminal device for concealing packet loss of voice communication according to an example of the present invention will be described. In real time, an audio packet is received through an RTP protocol through the Internet (S200).

상기와 같이 수신된 오디오 패킷은 실시간으로 분석되어 기록된 음질평가 결과 값을 추출 및 확인하고, 상기 확인된 음질평가 결과 값은 송신단으로 피드백한다(S210). The received audio packet is analyzed and extracted in real time to extract and confirm the recorded sound quality evaluation result value, and the confirmed sound quality evaluation result value is fed back to the transmitter (S210).

상기 확인된 음질평가 결과 값을 분석하여 수신된 오디오 패킷의 가변 비트율을 확인하고, 상기 확인된 가변비트율을 이용하여 상기 수신된 오디오 패킷을 디 코딩한다(S220). 상기 오디오 패킷의 디코딩은 오디오 디코더부에서 처리된다.The variable bit rate of the received audio packet is checked by analyzing the value of the confirmed sound quality evaluation result, and the received audio packet is decoded using the determined variable bit rate (S220). The decoding of the audio packet is processed in the audio decoder section.

상기 디코딩된 오디오 패킷은 해당 순서번호에 의하여 버퍼에 순차적으로 기록하므로 인터넷을 통하여 전송되면서 발생된 지터(JITTER)를 제거한다(S230). The decoded audio packet is sequentially recorded in the buffer according to the corresponding sequence number, thereby removing jitter generated while being transmitted through the Internet (S230).

상기 버퍼에 기록된 오디오 패킷을 인가받아 설정된 소정 숫자 주기 마다 누적된 패킷의 음질평가 결과를 측정하고, 상기 측정된 음질평가 결과 값을 송신단에 피드백하며(S240), 또한, 상기 인가되는 오디오 패킷은 설정된 소정 숫자로 누적됨에 관계없이 규칙적으로 오디오 신호로 변환하여 실시간 재생 및 출력한다(S250). After receiving the audio packet recorded in the buffer, the sound quality evaluation result of the accumulated packets is measured every predetermined number period, and the measured sound quality evaluation result value is fed back to the transmitter (S240). Irrespective of the accumulation of a predetermined number, the audio signal is regularly converted into an audio signal and reproduced and output in real time (S250).

본 발명을 일례로 설명하였으나, 반드시 이러한 일례에 국한되는 것이 아니고, 본 발명의 기술사상을 벗어나지 않는 범위 내에서 다양하게 변형 실시될 수 있다. 따라서 본 발명에 개시된 일예들은 본 발명의 기술 사상을 한정하기 위한 것이 아니라 설명하기 위한 것이고, 이러한 일례에 의하여 본 발명의 기술 사상의 범위가 한정되는 것은 아니다. 본 발명의 보호 범위는 아래의 청구범위에 의하여 해석되어야 하며, 그와 동등한 범위 내에 있는 모든 기술 사상은 본 발명의 권리범위에 포함되는 것으로 해석되어야 한다. Although the present invention has been described as an example, it is not necessarily limited to such an example, and various modifications can be made without departing from the spirit of the present invention. Therefore, the examples disclosed in the present invention are not intended to limit the technical idea of the present invention but to describe the present invention, and the scope of the technical idea of the present invention is not limited by these examples. The scope of protection of the present invention should be interpreted by the following claims, and all technical ideas within the scope equivalent thereto should be construed as being included in the scope of the present invention.

도 1 은 본 발명이 적용되는 인터넷 시스템의 일례에 의한 전체 기능 구성도, 1 is an overall functional configuration diagram according to an example of the Internet system to which the present invention is applied;

도 2 는 본 발명의 일례에 의한 것으로 음성통신의 패킷 손실을 은닉하는 단말장치 기능 구성도, 2 is a diagram illustrating a terminal device function concealing packet loss of voice communication according to an example of the present invention;

도 3 은 본 발명의 설명을 위한 것으로 일반적인 오디오 패킷의 상세 필드 구성도, 3 is a view illustrating a detailed field configuration of a general audio packet for explaining the present invention;

도 4 는 본 발명의 일례에 의한 것으로 전송되는 오디오 패킷의 상세 필드 구성도, 4 is a detailed field configuration diagram of an audio packet transmitted by an example of the present invention;

도 5 는 본 발명의 일례에 의한 것으로 수신단에서의 실시간 음질평가 방식 도시도, 5 is a diagram illustrating a real-time sound quality evaluation method according to an example of the present invention.

도 6 은 본 발명의 일례에 의한 것으로 송신단이 수신되어 피드백된 음질평가 결과에 따라 잉여 정보 인코딩을 차별적으로 수행하는 방식 설명도, FIG. 6 is a diagram illustrating a method of differentially performing surplus information encoding according to a sound quality evaluation result of a transmitter receiving and being fed back according to an example of the present invention;

도 7 은 본 발명의 일례에 의한 것으로 장치 사이에서 송수신 되는 패킷의 버퍼링과 음질평가 결과를 측정하여 피드백하는 상태 설명도, 7 is a diagram illustrating a state of measuring and feeding back buffering and sound quality evaluation results of packets transmitted and received between devices according to an example of the present invention;

도 8 은 본 발명의 일 실험에 의하여 검출된 결과 도시도, 8 is a diagram showing results detected by one experiment of the present invention;

도 9 는 본 발명의 일례에 의하여 음성통신의 패킷 손실을 은닉하는 단말장치에 의한 송신단 운용방법 순서도, 9 is a flowchart illustrating a method for operating a transmitting end by a terminal apparatus concealing packet loss of voice communication according to an example of the present invention;

도 10 은 본 발명의 일례에 의하여 음성통신의 패킷 손실을 은닉하는 단말장치에 의한 수신단 운용방법 순서도. 10 is a flowchart illustrating a method of operating a receiving end by a terminal device for concealing packet loss of voice communication according to an example of the present invention.

** 도면의 주요 부분에 대한 부호 설명 ** ** Explanation of symbols on the main parts of the drawing **

100, 100‘ : 단말장치 110 : 송신단100, 100 ': terminal 110: transmitter

111 : 사운드 캡쳐부 112 : 오디오 인코딩부111: sound capture section 112: audio encoding section

113 : 실시간 포맷부 114 : 실시간 패킷부113: real time format unit 114: real time packet unit

120 : 수신단 121 : 실시간 수신부120: receiver 121: real-time receiver

122 : 실시간 분석부 123 : 오디오 디코더부122: real-time analysis unit 123: audio decoder unit

124 : 버퍼부 125 : 품질측정부124: buffer section 125: quality measurement section

126 : 오디오 렌더부 200 : 인터넷126: audio render unit 200: the Internet

Claims

delete

In a terminal device for real-time communication with audio signals through the Internet,

A transmitter for encoding real-time input audio signals and surplus information at a variable bit rate at which the overall bit rate does not increase, recording a sound quality evaluation result, converting the result into a real-time transmission format, and transmitting the packet to the Internet; And

A receiving end receiving a real-time audio packet through the Internet, checking a sound quality evaluation result and feeding it back to the transmitting end, sequentially recording a packet decoded at a variable bit rate, removing jitter, feeding back a sound quality evaluation result, and outputting a reproduced voice signal; The transmitter includes a sound capture unit that samples and inputs an audio signal applied in real time, and encodes the input audio signal into a digital signal and includes surplus information of a previous packet in a state where the total bit rate is not increased. An audio encoding unit for encoding, a real-time format unit for recording a sound quality measurement result fed back to the receiver by converting the audio signal encoded by the audio encoding unit into a real-time transmission format, and an audio signal formatted by the real-time format unit on the Internet in real time And a real time packet unit for converting and transmitting the packet into a transmission packet.

The method of claim 2, wherein the receiving end,

A real time receiving unit which receives a real time packet transmitted from the opposite terminal device through the Internet;

A real-time analyzer for analyzing a packet received by the real-time receiver to check a sound quality evaluation result and feeding back to the transmitter;

An audio decoder for analyzing a variable bit rate from a signal applied by the real time analyzer and decoding a received packet;

A buffer unit which records the decoded packets in order and removes jitter;

A quality measuring unit which sequentially receives the packets stored in the buffer unit and measures a sound quality evaluation result at each predetermined number period, and feeds back a feedback signal to the transmitting end;

An audio renderer which outputs an audio signal applied from the quality measurer; Terminal for concealing packet loss of voice communication, characterized in that it comprises a.

The method of claim 2, wherein the audio encoding unit,

And a terminal for concealing packet loss of voice communication, characterized in that the audio signal and the surplus information are encoded at a variable bit rate without increasing the overall bit rate.

The method of claim 4, wherein the variable bit rate,

A terminal device for concealing packet loss of voice communication, characterized in that consisting of a configuration that is confirmed by analyzing the sound quality evaluation results confirmed and feedback from the receiving end.

The method of claim 4 or 5, wherein the variable bit rate,

A terminal device for concealing packet loss of voice communication, characterized in that configured to analyze and confirm the sound quality evaluation result received in the packet from the other terminal device.

The method of claim 2, wherein the real-time format unit,

And a packet recording apparatus for concealing packet loss of voice communication, characterized in that the receiver is configured to record the measured sound quality evaluation result fed back to the prediction region (MOS) in a payload format.

The method of claim 3, wherein the audio decoder,

And analyzing the signal of the real-time analyzer to confirm the variable bit rate and to decode the identified variable bit rate.

The signal of claim 3, wherein the signal received by the receiver is

A terminal apparatus for concealing packet loss of voice communication, characterized in that at least one of the surplus information of the current packet and the previous packet is included.

The method of claim 9, wherein the surplus information,

Terminal equipment for concealing packet loss of the voice communication, characterized in that the configuration further comprises a surplus information for the previous packet of the previous packet.

delete

The method of claim 3, wherein the jitter,

And a terminal device for concealing packet loss of voice communication, characterized by a configuration consisting of mutations occurring in the Internet.

The method of claim 3, wherein the buffer unit,

And writing the decoded packet signal into a buffer to remove the jitter.

The method of claim 3, wherein the quality measuring unit,

And a terminal for concealing packet loss of voice communication, characterized in that the sound quality evaluation result is measured after receiving an audio packet signal of a predetermined number period set from the buffer unit.

The method of claim 14, wherein the sound quality evaluation,

Terminal equipment for concealing packet loss of voice communication, comprising objective results evaluated using the NON-INTRUSIVE method of the ITU-T standard P563 (P.563) .

The method of claim 3 or claim 14, wherein the quality measuring unit,

A terminal device for concealing packet loss of voice communication, characterized by expressing the result of the sound quality evaluation in the range of 1 to 5.

The method of claim 16, wherein the result value of the sound quality evaluation,

A terminal apparatus for concealing packet loss of voice communication, characterized in that the configuration is fed back to the real-time format portion of the transmitter.

The method of claim 3, wherein the real-time analysis unit,

And a terminal for concealing packet loss of voice communication, characterized in that said feedback is fed back to said transmitting end.

The audio rendering unit of claim 3, wherein the audio renderer comprises:

And a packet unit for concealing packet loss of voice communication, characterized in that the audio signal of a packet unit applied from the buffer unit is reproduced and output in real time through a sound card.

delete

In the method of concealing packet loss of voice communication,

Inputting and real time capturing an audio signal by a transmitting end of a voice communication terminal device;

Encoding the captured audio signal with surplus information by a variable bit rate analyzed as a result of the confirmed sound quality evaluation;

Converting the encoded audio signal and the measured sound quality evaluation result into a real-time transmission format; And

Converting the audio signal of the real time transmission format into a packet and transmitting it to the Internet;

And the encoding process does not encode the surplus information according to the confirmed sound quality evaluation result.

The method of claim 21, wherein the encoding process,

And encoding one surplus information or two surplus information together without increasing the overall bit rate according to the confirmed sound quality evaluation result.

The method of claim 22, wherein the one surplus information,

A method for concealing packet loss in voice communication, characterized in that it is information about a previous packet of a packet to be currently encoded.

The method of claim 22, wherein the two surplus information,

A method for concealing packet loss in voice communication, characterized in that it is information about a previous packet and a previous packet of the current packet to be encoded.

The method of claim 21, wherein the measured sound quality evaluation results,

A method for concealing packet loss in voice communication, characterized in that the audio quality received from the other party is a result of accumulating the audio packets by a predetermined predetermined number unit.

delete

In the method of concealing packet loss of voice communication,

Receiving and analyzing an audio packet in real time by a receiving end of the voice communication terminal device and feeding back the confirmed sound quality evaluation result to the transmitting end;

Decoding the received packet at a variable bit rate analyzed from the sound quality and the result and storing the received packet in a buffer by a sequence number; And

Feeding back a sound quality evaluation measured from a packet of a predetermined number unit stored in the buffer to a transmitting end and simultaneously converting the sound quality evaluation into a real time audio signal;

And confirming the recorded sound quality evaluation result by analyzing the received packet.

The method of claim 27, wherein the variable bit rate,

And confirming by analyzing from the confirmed voice evaluation result.

The method of claim 27, wherein the sound quality evaluation,

A method for concealing packet loss in voice communication, characterized in that the measurement is performed for each period in which packets are accumulated and recorded in predetermined number units set in the buffer.

The method of claim 27, wherein the output of the audio signal,

And receiving and converting the audio packets sequentially recorded in the buffer regardless of the measurement of the sound quality evaluation.

In the method of concealing packet loss of voice communication,

Analyzing, by the receiving end of the voice communication terminal device, a packet received from the other party and confirming a sound quality evaluation result;

Varying a transmission bit rate according to the analyzed sound quality evaluation result and coding a captured audio signal and redundant information;

Accumulating the received packets on a predetermined predetermined unit of time and measuring an objective sound quality evaluation; And

Feeding back the measured sound quality evaluation result to a transmitting end to record and transmit in a designated area of a format; Method of concealing packet loss of voice communication comprising a.

32. The method of claim 31, wherein coding the surplus information comprises:

The packet loss of the voice communication, characterized in that it is any one selected from the coding of the surplus information, the coding of the surplus information of the previous packet or the coding of the surplus information of the previous packet and the previous packet according to the analyzed sound quality evaluation result. How to conceal.