KR20120068525A

KR20120068525A - Apparatus and method for down mixing of wave field synthesis signal

Info

Publication number: KR20120068525A
Application number: KR1020100130187A
Authority: KR
Inventors: 유재현; 정현주; 전상배; 서정일; 강경옥; 성굉모
Original assignee: 한국전자통신연구원
Priority date: 2010-12-17
Filing date: 2010-12-17
Publication date: 2012-06-27
Also published as: KR101758914B1

Abstract

PURPOSE: An apparatus and method for downmixing a sound field synthesis signal are provided so that a sound field synthesis signal optimized for a higher-order multichannel audio reproduction system can be reproduced on a lower-order multichannel audio reproduction system. CONSTITUTION: A reproduction position information estimator (110) estimates reproduction position information of an input sound field synthesis signal. A primary signal estimator (120) estimates a primary signal of the sound field synthesis signal based on the reproduction position information. A sound image localization information estimator (130) estimates sound image localization information of the sound field synthesis signal based on the primary signal.

Description

Apparatus and method for downmixing a sound field synthesis signal {APPARATUS AND METHOD FOR DOWN MIXING OF WAVE FIELD SYNTHESIS SIGNAL}

The present invention relates to an apparatus and method for downmixing a sound field synthesis signal and, more particularly, to an apparatus and method for downmixing and reproducing a sound field synthesis signal that was generated for a higher-order channel configuration.

Multichannel audio reproduction technology began with two-channel stereo and was extended to 5.1 and 7.1 channels; more recently, a 22.2-channel audio reproduction system using three layers has also been developed.

However, the multichannel audio reproduction systems commonly used in homes today are 5.1-channel or 7.1-channel systems, so it is difficult for them to reproduce a sound field synthesis signal optimized for a 22.2-channel audio system.

Therefore, there is a need for a method that makes a sound field synthesis signal optimized for a higher-order multichannel audio reproduction system, such as 22.2 channels, compatible with a lower-order multichannel audio reproduction system, such as 5.1 or 7.1 channels.

The present invention provides an apparatus and method that downmix a sound field synthesis signal so that a sound field synthesis signal optimized for a higher-order multichannel audio reproduction system can be reproduced on a lower-order multichannel audio reproduction system, thereby providing the sound field to the user.

A downmixing apparatus for a sound field synthesis signal according to an embodiment of the present invention may include: a reproduction position information estimator that estimates reproduction position information of an input sound field synthesis signal; a primary signal estimator that estimates a primary signal of the sound field synthesis signal based on the reproduction position information; and a sound image localization information estimator that estimates sound image localization information of the sound field synthesis signal based on the primary signal.

The reproduction position information estimator of the downmixing apparatus according to an embodiment of the present invention may estimate the reproduction position information based on the amplitude and delay between adjacent channels of the sound field synthesis signal.

The primary signal estimator of the downmixing apparatus according to an embodiment of the present invention may estimate at least one primary signal by performing sound field synthesis rendering in reverse on the sound field synthesis signal.

The primary signal estimator of the downmixing apparatus according to an embodiment of the present invention may define, as an ambient signal, any sound source signal included in the sound field synthesis signal that is not estimated to be a primary signal.

The sound image localization information estimator of the downmixing apparatus according to an embodiment of the present invention may estimate the position of each channel based on the ratio of the primary signal in each channel of the sound field synthesis signal, and may estimate the sound image localization information based on the estimated channel positions and the reproduction position information.

A downmixing method for a sound field synthesis signal according to an embodiment of the present invention may include: estimating reproduction position information of an input sound field synthesis signal; estimating a primary signal of the sound field synthesis signal based on the reproduction position information; and estimating sound image localization information of the sound field synthesis signal based on the primary signal.

According to an embodiment of the present invention, sound image localization information of a sound field synthesis signal is estimated from the sound field synthesis signal and the signal is downmixed accordingly, so that a sound field synthesis signal optimized for a higher-order multichannel audio reproduction system can be reproduced on a lower-order multichannel audio reproduction system.

FIG. 1 is a block diagram illustrating an apparatus for downmixing a sound field synthesis signal according to an embodiment of the present invention.
FIG. 2 is an example of the operation of the apparatus for downmixing a sound field synthesis signal according to an embodiment of the present invention.
FIG. 3 is a flowchart illustrating a method for downmixing a sound field synthesis signal according to an embodiment of the present invention.

Hereinafter, embodiments of the present invention are described in detail with reference to the accompanying drawings. The method for downmixing a sound field synthesis signal according to an embodiment of the present invention may be performed by the apparatus for downmixing a sound field synthesis signal.

FIG. 1 is a block diagram illustrating an apparatus for downmixing a sound field synthesis signal according to an embodiment of the present invention.

Referring to FIG. 1, the apparatus 100 for downmixing a sound field synthesis signal according to an embodiment of the present invention may include a reproduction position information estimator 110, a primary signal estimator 120, a sound image localization information estimator 130, and a rendering unit 140.

The reproduction position information estimator 110 may estimate reproduction position information of a sound field synthesis signal (WFS signal: Wave Field Synthesis signal) input by a user. Specifically, the reproduction position information estimator 110 may estimate the reproduction position information based on the amplitude and delay between adjacent channels of the sound field synthesis signal. The reproduction position information estimated by the reproduction position information estimator 110 may include at least one of the spacing between the output devices on which the channels of the sound field synthesis signal are reproduced, the direction in which the output devices are arranged, and the positions of the output devices. Here, an output device is a device, such as a loudspeaker, that reproduces and outputs the audio signal contained in a channel.
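The amplitude and delay between two adjacent channels can be measured in many ways; the source does not name one. The following is a minimal sketch, assuming a brute-force cross-correlation search for the delay and an RMS ratio for the amplitude. The function name `estimate_delay_and_gain` and the parameter `max_lag` are hypothetical and chosen only for illustration.

```python
import math


def estimate_delay_and_gain(ref, other, max_lag):
    """Return (lag, gain) for two adjacent channels.

    lag  : number of samples by which `other` trails `ref`, found by
           maximizing the cross-correlation over [-max_lag, max_lag].
    gain : RMS amplitude ratio other/ref.
    This is an illustrative assumption, not the estimator of the source.
    """
    best_lag, best_score = 0, float("-inf")
    for lag in range(-max_lag, max_lag + 1):
        score = 0.0
        for n, x in enumerate(ref):
            m = n + lag
            if 0 <= m < len(other):
                score += x * other[m]
        if score > best_score:
            best_lag, best_score = lag, score

    def rms(samples):
        return math.sqrt(sum(v * v for v in samples) / len(samples))

    ref_rms = rms(ref)
    gain = rms(other) / ref_rms if ref_rms > 0 else 0.0
    return best_lag, gain


# Usage: the second channel is the first one, halved and delayed by 2 samples.
ref = [0.0, 0.0, 1.0, 0.5, 0.0, 0.0, 0.0, 0.0]
other = [0.0, 0.0, 0.0, 0.0, 0.5, 0.25, 0.0, 0.0]
lag, gain = estimate_delay_and_gain(ref, other, max_lag=3)
```

How the measured delay and amplitude ratio are converted into loudspeaker spacing, direction, or position is left open here, since the source does not describe that step.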

In addition, when the sound field synthesis signal already includes reproduction position information, the apparatus 100 for downmixing a sound field synthesis signal according to an embodiment of the present invention may skip the operation of the reproduction position information estimator 110 and proceed directly to the primary signal estimator 120.

The primary signal estimator 120 may estimate a primary signal of the sound field synthesis signal based on the reproduction position information estimated by the reproduction position information estimator 110. Specifically, the primary signal estimator 120 may estimate at least one primary signal by performing sound field synthesis rendering in reverse on the sound field synthesis signal.
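The source does not spell out how the rendering is reversed. One simple reading, sketched below under that assumption, treats the rendering of a single source as a per-channel delay and gain, and undoes it by advancing each channel by its delay, dividing by its gain, and averaging the aligned channels. The function name `inverse_render` and its parameters are hypothetical.

```python
def inverse_render(channels, delays, gains):
    """Estimate one source signal from rendered loudspeaker channels.

    channels : list of equal-length sample lists, one per loudspeaker
    delays   : per-channel non-negative integer delays, in samples
    gains    : per-channel positive gains applied by the renderer
    Assumes the renderer only delayed and scaled the source (an assumption).
    """
    length = len(channels[0])
    estimate = [0.0] * length
    for samples, delay, gain in zip(channels, delays, gains):
        # Undo the delay (shift earlier) and the gain (divide).
        for n in range(length - delay):
            estimate[n] += samples[n + delay] / gain
    return [v / len(channels) for v in estimate]


# Usage: a source rendered to two channels with (delay, gain) = (0, 1.0) and (1, 0.5).
rendered = [
    [1.0, 0.5, 0.0, 0.0],
    [0.0, 0.5, 0.25, 0.0],
]
source = inverse_render(rendered, delays=[0, 1], gains=[1.0, 0.5])
```

A real wave field synthesis driving function is frequency dependent, so this time-domain sketch only illustrates the idea of undoing known delays and gains.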

Here, the primary signal estimator 120 may estimate a sound source signal that is present in a specific channel at a high sound pressure to be a primary signal. The specific channel may be one of the channels included in the sound field synthesis signal or may be a preset channel. The primary signal estimator 120 may also estimate a signal whose sound pressure is higher than that of other signals to be a primary signal, or may estimate a signal whose sound pressure exceeds a certain value to be a primary signal.

In addition, the primary signal estimator 120 may define, as an ambient signal, any sound source signal included in the sound field synthesis signal that is not estimated to be a primary signal.
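The threshold variant of this rule can be sketched as follows. This is a minimal illustration, assuming that the separated sound source signals are already available and that level is measured as mean power in decibels; the function name `split_primary_ambient` and the default threshold of -20 dB are hypothetical choices, not values from the source.

```python
import math


def split_primary_ambient(sources, threshold_db=-20.0):
    """Classify source signals as primary or ambient by level.

    sources      : dict mapping a source name to its list of samples
    threshold_db : level above which a source counts as primary
                   (illustrative value, not taken from the source)
    Returns (primary_names, ambient_names).
    """
    def level_db(samples):
        power = sum(v * v for v in samples) / len(samples)
        return 10.0 * math.log10(power) if power > 0 else float("-inf")

    primary, ambient = [], []
    for name, samples in sources.items():
        if level_db(samples) > threshold_db:
            primary.append(name)
        else:
            # Everything not estimated as primary is defined as ambient.
            ambient.append(name)
    return primary, ambient


# Usage: a loud source and a quiet one.
primary, ambient = split_primary_ambient({
    "voice": [0.5, -0.5, 0.5, -0.5],
    "hall": [0.01, -0.01, 0.01, -0.01],
})
```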

The sound image localization information estimator 130 may estimate sound image localization information of the sound field synthesis signal based on the primary signal estimated by the primary signal estimator 120. Specifically, the sound image localization information estimator 130 may estimate the position of each channel based on the ratio of the primary signal contained in each channel of the sound field synthesis signal, and may estimate the sound image localization information of each primary signal based on the estimated channel positions and the reproduction position information.

Here, when a specific primary signal has its highest sound pressure in a specific channel, that primary signal may have a lower sound pressure in the channels adjacent to the specific channel. For example, when a second channel and a third channel are adjacent to the left and right of a first channel, respectively, and one of the primary signals has its highest sound pressure in the first channel, that primary signal may have its second-highest sound pressure in the second and third channels. Accordingly, the sound image localization information estimator 130 may use the sound pressure ratio of the primary signal in each channel to estimate the positions of the channels that correspond to the channel configuration of the audio system that is to reproduce the sound field synthesis signal. For example, when the sound field synthesis signal has 22.2 channels and the audio system that is to reproduce it has 5.1 channels, the sound image localization information estimator 130 may estimate, from the sound pressure ratio of the primary signal in each of the 22.2 channels, which of the 5.1 channels each of the 22.2 channels corresponds to.
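As a rough illustration of using per-channel sound pressure ratios, the sketch below places a primary signal at the level-weighted circular mean of the source-channel azimuths and then picks the nearest channel of the target layout. The weighting scheme, the function names `localize` and `nearest_channel`, and the azimuth values in the usage lines are all assumptions made for the example; the source does not prescribe them.

```python
import math


def localize(levels, azimuths_deg):
    """Level-weighted circular mean of channel azimuths, in degrees."""
    x = sum(l * math.cos(math.radians(a)) for l, a in zip(levels, azimuths_deg))
    y = sum(l * math.sin(math.radians(a)) for l, a in zip(levels, azimuths_deg))
    return math.degrees(math.atan2(y, x))


def nearest_channel(azimuth_deg, layout):
    """Return the name of the layout channel closest in angle.

    layout : dict mapping channel name to azimuth in degrees
             (supplied by the caller; values here are illustrative).
    """
    def angular_distance(a, b):
        return abs((a - b + 180.0) % 360.0 - 180.0)

    return min(layout, key=lambda name: angular_distance(azimuth_deg, layout[name]))


# Usage: the primary signal is strongest in the channel at +30 degrees.
azimuth = localize(levels=[0.0, 1.0, 3.0], azimuths_deg=[-30.0, 0.0, 30.0])
target = nearest_channel(
    azimuth, {"C": 0.0, "L": 30.0, "R": -30.0, "LS": 110.0, "RS": -110.0}
)
```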

In addition, the reproduction position information estimator 110, the primary signal estimator 120, and the sound image localization information estimator 130 according to the present invention may be sub-components of a down mixer.

The rendering unit 140 may render the primary signal estimated by the primary signal estimator 120 and the ambient signal defined by the primary signal estimator 120 in a form suitable for the audio system that is to reproduce the sound field synthesis signal, and may reproduce them. The rendering unit 140 may be a channel mixer.

Specifically, the rendering unit 140 may select, based on the sound image localization information, the channel of the target audio system that corresponds to the primary signal, and may apply panning to the primary signal to reproduce it on the corresponding channel.

For example, when the audio system that is to reproduce the sound field synthesis signal has 5.1 channels and the sound image localization information estimated for a primary signal by the sound image localization information estimator 130 is the C (Center) channel, the rendering unit 140 may reproduce that primary signal on the C channel without any panning. In contrast, when the sound image localization information estimated for a primary signal is the BC (Back Center) channel, the 5.1-channel audio system has no BC channel. In that case, the rendering unit 140 may select the LS (Left Side) and RS (Right Side) channels, which are the channels closest to the BC channel, and apply panning to the primary signal so that it is reproduced on the LS and RS channels.
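The Back Center case can be illustrated with a pan between LS and RS. The source does not say which panning law is used; the sketch below assumes a constant-power (sine/cosine) law, and the function names `constant_power_pan` and `render_back_center` are hypothetical.

```python
import math


def constant_power_pan(position):
    """Return gains (g1, g2) for a pan position in [0, 1].

    0 sends everything to the first channel, 1 to the second.
    The gains satisfy g1**2 + g2**2 == 1 (constant power), which is an
    assumed panning law, not one stated in the source.
    """
    theta = position * math.pi / 2.0
    return math.cos(theta), math.sin(theta)


def render_back_center(samples):
    """Reproduce a Back Center primary signal on LS and RS with equal gains."""
    g_ls, g_rs = constant_power_pan(0.5)
    return {
        "LS": [g_ls * s for s in samples],
        "RS": [g_rs * s for s in samples],
    }


# Usage: a short primary signal localized at Back Center.
out = render_back_center([1.0, -0.5, 0.25])
```

With the pan position at 0.5, both channels receive a gain of about 0.707, so the summed power matches that of the original signal.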

In addition, when the audio system that is to reproduce the sound field synthesis signal has rear channels, the rendering unit 140 may apply the ambient signal to the rear channels of the audio system for reproduction. When the audio system has no rear channel, the rendering unit 140 may apply the ambient signal to all channels of the audio system at a power ratio. For example, the rendering unit 140 may apply the ambient signal to all channels at [expression not reproduced in the source text] when the audio system has two channels, and at [expression not reproduced in the source text] when the audio system has three channels.

FIG. 2 is an example of the operation of the apparatus for downmixing a sound field synthesis signal according to an embodiment of the present invention.

FIG. 2 shows an example of downmixing a sound field synthesis signal composed of M channel signals into a 5.1-channel signal and reproducing it. Here, the sound field synthesis signal may be a signal having more channels than 5.1.

First, as shown in FIG. 2, the down mixer 210, which includes the reproduction position information estimator 110, the primary signal estimator 120, and the sound image localization information estimator 130 according to the present invention, may receive the M channel signals that make up the sound field synthesis signal.

The down mixer 210 may then estimate the primary signal of the sound field synthesis signal and define the ambient signal based on the reproduction position information of the received sound field synthesis signal. When the received sound field synthesis signal does not include reproduction position information, the down mixer 210 may estimate the reproduction position information based on the amplitude and delay between adjacent channels of the sound field synthesis signal.

Next, the down mixer 210 may estimate the sound image localization information of the sound field synthesis signal based on the estimated primary signal.

The down mixer 210 may then transmit the estimated sound image localization information, the primary signal, and the defined ambient signal to the channel mixer 220.

Finally, the channel mixer 220 may render the primary signal and the ambient signal based on the received sound image localization information and reproduce them on the 5.1 channels.

FIG. 3 is a flowchart illustrating a method for downmixing a sound field synthesis signal according to an embodiment of the present invention.

In step S310, the reproduction position information estimator 110 may estimate reproduction position information of a sound field synthesis signal input by a user. Specifically, the reproduction position information estimator 110 may estimate the reproduction position information based on the amplitude and delay between adjacent channels of the sound field synthesis signal. When the sound field synthesis signal input by the user already includes reproduction position information, step S310 may be omitted.

In step S320, the primary signal estimator 120 may estimate the primary signal of the sound field synthesis signal based on the reproduction position information estimated in step S310. Specifically, the primary signal estimator 120 may estimate at least one primary signal by performing sound field synthesis rendering in reverse on the sound field synthesis signal.

In step S330, the primary signal estimator 120 may define, as an ambient signal, any sound source signal included in the sound field synthesis signal received in step S310 that was not estimated to be a primary signal in step S320.

In step S340, the sound image localization information estimator 130 may estimate the sound image localization information of the sound field synthesis signal based on the primary signal estimated in step S320. Specifically, the sound image localization information estimator 130 may estimate the position of each channel based on the ratio of the primary signal contained in each channel of the sound field synthesis signal, and may estimate the sound image localization information of each primary signal based on the estimated channel positions and the reproduction position information.

In step S350, the rendering unit 140 may render the primary signal estimated in step S320 and the ambient signal defined in step S330 in a form suitable for the audio system that is to reproduce the sound field synthesis signal, and may reproduce them. Specifically, the rendering unit 140 may select, based on the sound image localization information estimated in step S340, the channel of the target audio system that corresponds to the primary signal, and may apply panning to the primary signal to reproduce it on the corresponding channel.

Here, when the audio system that is to reproduce the sound field synthesis signal has rear channels, the rendering unit 140 may apply the ambient signal to the rear channels of the audio system for reproduction. When the audio system has no rear channel, the rendering unit 140 may apply the ambient signal to all channels of the audio system for reproduction.
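The routing of the ambient signal can be sketched as follows. The per-channel gain is an assumption: the description mentions a power ratio, but the actual expressions are not reproduced in the text, so this sketch uses an equal-power gain of 1/sqrt(N) over the N target channels. The function name `distribute_ambient` and the default rear-channel names are hypothetical.

```python
import math


def distribute_ambient(ambient, channels, rear_channels=("LS", "RS")):
    """Route an ambient signal to rear channels, or to all channels if none exist.

    ambient       : list of samples
    channels      : channel names of the target audio system
    rear_channels : names treated as rear channels (illustrative default)
    The 1/sqrt(N) gain is an assumed equal-power split, not a value
    taken from the source.
    """
    if not channels:
        raise ValueError("the target audio system needs at least one channel")
    rear = [c for c in channels if c in rear_channels]
    targets = rear if rear else list(channels)
    gain = 1.0 / math.sqrt(len(targets))
    return {c: [gain * s for s in ambient] for c in targets}


# Usage: a stereo system has no rear channel, a 5-channel layout does.
stereo_out = distribute_ambient([1.0, 0.5], ["L", "R"])
surround_out = distribute_ambient([1.0, 0.5], ["L", "R", "C", "LS", "RS"])
```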

The present invention estimates the sound image localization information of a sound field synthesis signal from the sound field synthesis signal and downmixes the signal accordingly, so that a sound field synthesis signal optimized for a higher-order multichannel audio reproduction system can be reproduced on a lower-order multichannel audio reproduction system.

Although the present invention has been described above with reference to limited embodiments and drawings, the present invention is not limited to those embodiments, and a person of ordinary skill in the art to which the present invention pertains can make various modifications and variations based on this description.

Therefore, the scope of the present invention should not be limited to the described embodiments, but should be determined by the claims below and by their equivalents.

110: reproduction position information estimator
120: primary signal estimator
130: sound image localization information estimator
140: rendering unit

Claims

An apparatus comprising:
a reproduction position information estimator that estimates reproduction position information of an input sound field synthesis signal;
a primary signal estimator that estimates a primary signal of the sound field synthesis signal based on the reproduction position information; and
a sound image localization information estimator that estimates sound image localization information of the sound field synthesis signal based on the primary signal.

The apparatus of claim 1,
wherein the reproduction position information estimator
estimates the reproduction position information based on an amplitude and a delay between adjacent channels of the sound field synthesis signal.

The apparatus of claim 2,
wherein the reproduction position information
includes at least one of a spacing between output devices on which the channels of the sound field synthesis signal are reproduced, an arrangement direction of the output devices, and positions of the output devices.

The apparatus of claim 1,
wherein the primary signal estimator
estimates at least one primary signal by performing sound field synthesis rendering in reverse on the sound field synthesis signal.

The apparatus of claim 4,
wherein the primary signal estimator
defines, as an ambient signal, a signal that is not estimated to be the primary signal among the sound source signals included in the sound field synthesis signal.

The apparatus of claim 1,
wherein the sound image localization information estimator
estimates a position of each channel based on a ratio of the primary signal in each channel of the sound field synthesis signal, and estimates the sound image localization information based on the estimated channel positions and the reproduction position information.

The apparatus of claim 5, further comprising:
a rendering unit that renders the primary signal and the ambient signal in a form suitable for an audio system that is to reproduce the sound field synthesis signal, and reproduces them.

The apparatus of claim 7,
wherein the rendering unit
selects, based on the sound image localization information, a channel of the audio system corresponding to the primary signal, and applies panning to the primary signal to reproduce it on the corresponding channel.

The apparatus of claim 7,
wherein the rendering unit,
when the audio system has a rear channel, applies the ambient signal to the rear channel of the sound field reproduction system for reproduction.

The apparatus of claim 7,
wherein the rendering unit,
when the audio system has no rear channel, applies the ambient signal to all channels of the sound field reproduction system for reproduction.

A method comprising:
estimating reproduction position information of an input sound field synthesis signal;
estimating a primary signal of the sound field synthesis signal based on the reproduction position information; and
estimating sound image localization information of the sound field synthesis signal based on the primary signal.

The method of claim 11,
wherein estimating the reproduction position information
comprises estimating the reproduction position information based on an amplitude and a delay between adjacent channels of the sound field synthesis signal.

The method of claim 12,
wherein the reproduction position information
includes at least one of a spacing between output devices on which the channels of the sound field synthesis signal are reproduced, an arrangement direction of the output devices, and positions of the output devices.

The method of claim 11,
wherein estimating the primary signal
comprises estimating at least one primary signal by performing sound field synthesis rendering in reverse on the sound field synthesis signal.

The method of claim 14, further comprising:
defining, as an ambient signal, a signal that is not estimated to be the primary signal among the sound source signals included in the sound field synthesis signal.

The method of claim 11,
wherein estimating the sound image localization information comprises:
estimating a position of each channel based on a ratio of the primary signal in each channel of the sound field synthesis signal; and
estimating the sound image localization information based on the estimated channel positions and the reproduction position information.

The method of claim 15, further comprising:
rendering the primary signal and the ambient signal in a form suitable for an audio system that is to reproduce the sound field synthesis signal, and reproducing them.

The method of claim 17,
wherein the reproducing comprises:
selecting, based on the sound image localization information, a channel of the audio system corresponding to the primary signal; and
applying panning to the primary signal to reproduce it on the corresponding channel.

The method of claim 17,
wherein the reproducing comprises,
when the audio system has a rear channel, applying the ambient signal to the rear channel of the sound field reproduction system for reproduction.

The method of claim 17,
wherein the reproducing comprises,
when the audio system has no rear channel, applying the ambient signal to all channels of the sound field reproduction system for reproduction.