KR101758914B1

KR101758914B1 - Apparatus and method for down mixing of wave field synthesis signal

Info

Publication number: KR101758914B1
Application number: KR1020100130187A
Authority: KR
Inventors: 유재현; 정현주; 전상배; 서정일; 강경옥; 성굉모
Original assignee: 한국전자통신연구원
Priority date: 2010-12-17
Filing date: 2010-12-17
Publication date: 2017-07-17
Also published as: KR20120068525A

Abstract

음장 합성 신호를 다운믹싱하여 재생하는 방법 및 장치가 개시된다. 음장 합성 신호의 다운 믹싱 장치는 입력 받은 음장 합성 신호의 재생 위치 정보를 추정하는 재생 위치 정보 추정부; 상기 재생 위치 정보를 기초로 상기 음장 합성 신호의 주요 신호(Primary signal)를 추정하는 주요 신호 추정부; 및 상기 주요 신호를 기초로 상기 음장 합성 신호의 음상정위 정보를 추정하는 음상정위 정보 추정부를 포함할 수 있다. A method and apparatus for downmixing and reproducing sound field synthesized signals are disclosed. A downmixing apparatus for a sound field synthesis signal includes: a reproduction position information estimation unit for estimating reproduction position information of an input sound field synthesis signal; A main signal estimator for estimating a main signal of the sound field synthesized signal based on the reproduction position information; And a sound localization information estimating unit for estimating sound localization information of the sound field synthesized signal based on the main signal.

Description

TECHNICAL FIELD [0001] The present invention relates to an apparatus and a method for downmixing a sound field synthesis signal,

본 발명은 음장 합성 신호의 다운 믹싱 장치 및 방법에 관한 것으로, 보다 상세하게는 상위 채널에 따라 생성된 음장 합성 신호를 다운 믹스하여 재생하는 장치 및 방법에 관한 것이다. The present invention relates to an apparatus and method for downmixing a sound field synthesis signal, and more particularly, to an apparatus and method for downmixing and reproducing a sound field synthesis signal generated according to an upper channel.

멀티채널 오디오 재생 기술은 2채널 스테레오에서 시작하여 5.1채널, 7.1채널로 확장되었으며, 최근에는 3개 레이어(layer)를 사용하는 22.2채널 오디오 재생 시스템도 개발되었다.Multichannel audio playback technology has been extended to 5.1 channels and 7.1 channels starting from 2-channel stereo, and recently a 22.2-channel audio playback system using three layers has also been developed.

그러나, 현재 가정에서 일반적으로 사용되는 멀티채널 오디오 재생 시스템은 5.1채널이나 7.1 채널이므로 22.2 채널 오디오 시스템에 최적화된 음장 합성 신호를 재생하기 어렵다는 한계가 있었다.However, since the multi-channel audio reproduction system generally used in the home is 5.1 channel or 7.1 channel, it is difficult to reproduce the sound field synthesis signal optimized for the 22.2 channel audio system.

따라서, 22.2 채널과 같이 고위 멀티 채널 오디오 재생 시스템에 최적화된 음장 합성 신호를 5.1채널이나 7.1 채널 같은 하위 멀티 채널 오디오 재생 시스템에서 호환 할 수 있도록 하는 방법이 요구되고 있다.Accordingly, there is a need for a method for making a sound field synthesis signal optimized for a higher multichannel audio reproduction system, such as 22.2 channels, compatible with a lower multichannel audio reproduction system such as 5.1 channel or 7.1 channel.

본 발명은 음장 합성 신호를 다운 믹스함으로써 하위 멀티 채널 오디오 재생 시스템에서도 고위 멀티 채널 오디오 재생 시스템에 최적화된 음장 합성 신호를 재생하여 사용자에게 음장을 제공하는 장치 및 방법을 제공한다. The present invention provides an apparatus and method for providing a sound field to a user by reproducing a sound field synthesis signal optimized for a high-level multi-channel audio reproduction system even in a low-multi-channel audio reproduction system by downmixing a sound field synthesis signal.

본 발명의 일실시예에 따른 음장 합성 신호의 다운 믹싱 장치는 입력 받은 음장 합성 신호의 재생 위치 정보를 추정하는 재생 위치 정보 추정부; 상기 재생 위치 정보를 기초로 상기 음장 합성 신호의 주요 신호(Primary signal)를 추정하는 주요 신호 추정부; 및 상기 주요 신호를 기초로 상기 음장 합성 신호의 음상정위 정보를 추정하는 음상정위 정보 추정부를 포함할 수 있다.An apparatus for downmixing a sound field synthesis signal according to an embodiment of the present invention includes: a reproduction position information estimation unit for estimating reproduction position information of an input sound field synthesis signal; A main signal estimator for estimating a main signal of the sound field synthesized signal based on the reproduction position information; And a sound localization information estimating unit for estimating sound localization information of the sound field synthesized signal based on the main signal.

본 발명의 일실시예에 따른 음장 합성 신호의 다운 믹싱 장치의 재생 위치 정보 추정부는, 음장 합성 신호의 인접 채널간의 진폭(Amplitude)과 지연(Delay)에 기초하여 재생 위치 정보를 추정할 수 있다.The reproduction position information estimation unit of the downmixing apparatus for a sound field synthesis signal according to an embodiment of the present invention can estimate the reproduction position information based on the amplitude and the delay between adjacent channels of the sound field synthesis signal.

본 발명의 일실시예에 따른 음장 합성 신호의 다운 믹싱 장치의 주요 신호 추정부는, 음장 합성 신호에 음장 합성 렌더링을 역으로 수행하여 적어도 하나의 주요 신호를 추정할 수 있다.The main signal estimator of the downmixing apparatus for a sound field synthesis signal according to an embodiment of the present invention can estimate at least one main signal by performing a sound field synthesis rendering in a sound field synthesis signal inversely.

본 발명의 일실시예에 따른 음장 합성 신호의 다운 믹싱 장치의 주요 신호 추정부는 음장 합성 신호에 포함된 음원 신호 중 주요 신호로 추정 되지 않은 신호를 부가 신호(ambient signal)로 정의할 수 있다.The main signal estimator of the downmixing apparatus for a sound field synthesis signal according to an embodiment of the present invention can define a signal that is not estimated as a main signal among the sound source signals included in the sound field synthesis signal as an ambient signal.

본 발명의 일실시예에 따른 음장 합성 신호의 다운 믹싱 장치의 음상정위 정보 추정부는 음장 합성 신호의 각 채널 별 주요 신호의 비율을 기초로 채널의 위치를 추정하고, 추정한 채널의 위치와 재생 위치 정보를 기초로 음상정위 정보를 추정할 수 있다.The sound localization information estimation unit of the downmixing apparatus for a sound field synthesis signal according to an embodiment of the present invention estimates the position of a channel based on the ratio of main signals for each channel of the sound field synthesis signal, It is possible to estimate the image position information based on the information.

본 발명의 일실시예에 따른 음장 합성 신호의 다운 믹싱 방법은 입력 받은 음장 합성 신호의 재생 위치 정보를 추정하는 단계; 상기 재생 위치 정보를 기초로 상기 음장 합성 신호의 주요 신호(Primary signal)를 추정하는 단계; 및 상기 주요 신호를 기초로 상기 음장 합성 신호의 음상정위 정보를 추정하는 단계를 포함할 수 있다.A method of downmixing a sound field synthesis signal according to an embodiment of the present invention comprises: estimating a reproduction position information of an input sound field synthesis signal; Estimating a primary signal of the sound field synthesis signal based on the reproduction position information; And estimating sound image position information of the sound field synthesized signal based on the main signal.

본 발명의 일실시예에 의하면, 음장 합성 신호를 기초로 음장 합성 신호의 음상정위 정보를 추정하여 음장 합성 신호를 다운 믹스함으로써 하위 멀티 채널 오디오 재생 시스템에서도 고위 멀티 채널 오디오 재생 시스템에 최적화된 음장 합성 신호를 재생할 수 있다.According to an embodiment of the present invention, the sound field synthesis information is estimated based on the sound field synthesis signal on the basis of the sound field synthesis signal, thereby downmixing the sound field synthesis signal. Thus, even in a lower multi-channel audio reproduction system, A signal can be reproduced.

도 1은 본 발명의 일실시예에 따른 음장 합성 신호의 다운 믹싱 장치를 도시한 블록 다이어그램이다.
도 2는 본 발명의 일실시예에 따른 음장 합성 신호의 다운 믹싱 장치의 동작 일례이다.
도 3은 본 발명의 일실시예에 따른 음장 합성 신호의 다운 믹싱 방법을 도시한 플로우차트이다.1 is a block diagram illustrating an apparatus for downmixing a sound field synthesis signal according to an embodiment of the present invention.
2 is an example of the operation of an apparatus for downmixing a sound field synthesis signal according to an embodiment of the present invention.
3 is a flowchart illustrating a downmixing method of a sound field synthesis signal according to an embodiment of the present invention.

이하, 본 발명의 실시예를 첨부된 도면을 참조하여 상세하게 설명한다. 본 발명의 일실시예에 따른 음장 합성 신호의 다운 믹싱 방법은 음장 합성 신호의 다운 믹싱 장치에 의해 수행될 수 있다.DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS Hereinafter, embodiments of the present invention will be described in detail with reference to the accompanying drawings. A downmixing method of a sound field synthesis signal according to an embodiment of the present invention may be performed by a downmixing apparatus for a sound field synthesis signal.

도 1은 본 발명의 일실시예에 따른 음장 합성 신호의 다운 믹싱 장치를 도시한 블록 다이어그램이다. 1 is a block diagram illustrating an apparatus for downmixing a sound field synthesis signal according to an embodiment of the present invention.

도 1을 참고하면, 본 발명의 일실시예에 따른 음장 합성 신호의 다운 믹싱 장치(100)는 재생 위치 정보 추정부(110), 주요 신호 추정부(120), 음상정위 정보 추정부(130), 및 랜더링부(140)를 포함할 수 있다. 1, an apparatus 100 for downmixing a sound field synthesis signal according to an embodiment of the present invention includes a playback position information estimation unit 110, a main signal estimation unit 120, a sound phase orientation information estimation unit 130, And a rendering unit 140, as shown in FIG.

재생 위치 정보 추정부(110)는 사용자로부터 입력 받은 음장 합성 신호(WFS Signal: Wave Field Synthesis Signal)의 재생 위치 정보를 추정할 수 있다. 구체적으로 재생 위치 정보 추정부(110)는 음장 합성 신호의 인접 채널간의 진폭(Amplitude)과 지연(Delay)에 기초하여 재생 위치 정보를 추정할 수 있다. 이때, 재생 위치 정보 추정부(110)가 추정하는 재생 위치 정보는 음장 합성 신호의 각 채널이 재생되는 출력 장치 간의 간격, 출력 장치들의 배치 방향 및 출력 장치의 위치 중 적어도 하나를 포함할 수 있다. 이때, 출력 장치는 라우드 스피커와 같이 각 채널에 포함된 오디오 신호를 재생하여 출력하는 장치이다.The reproduction position information estimation unit 110 can estimate reproduction position information of a sound field synthesis signal (WFS signal) input from a user. Specifically, the reproduction position information estimation unit 110 can estimate the reproduction position information based on the amplitudes and delays between adjacent channels of the sound field synthesis signal. At this time, the reproduction position information estimated by the reproduction position information estimation unit 110 may include at least one of an interval between the output devices for reproducing each channel of the sound field synthesis signal, a placement direction of the output devices, and a position of the output device. At this time, the output device is a device such as a loudspeaker that reproduces and outputs an audio signal included in each channel.

또한, 음장 합성 신호에 재생 위치 정보가 포함된 경우, 본 발명의 일실시예에 따른 음장 합성 신호의 다운 믹싱 장치(100)는 재생 위치 정보 추정부(110)의 동작을 생략하고, 바로 주요 신호 추정부(120)를 실행할 수도 있다.In addition, when the sound field synthesis signal includes the reproduction position information, the apparatus 100 for down-mixing the sound field synthesis signal according to the embodiment of the present invention may omit the operation of the reproduction position information estimation unit 110, The estimation unit 120 may be executed.

주요 신호 추정부(120)는 재생 위치 정보 추정부(110)가 추정한 재생 위치 정보를 기초로 음장 합성 신호의 주요 신호(Primary signal)를 추정할 수 있다. 구체적으로 주요 신호 추정부(120)는 음장 합성 신호에 음장 합성 렌더링을 역으로 수행하여 적어도 하나의 주요 신호를 추정할 수 있다. The main signal estimator 120 may estimate the primary signal of the sound field synthesis signal based on the reproduction position information estimated by the reproduction position information estimator 110. [ Specifically, the main signal estimator 120 may estimate at least one main signal by performing a sound field synthesis rendering on the sound field synthesis signal inversely.

이때, 주요 신호 추정부(120)는 특정 채널에 큰 음압으로 포함된 음원 신호를 주요 신호로 추정할 수 있다. 이때, 특정 채널은 음장 합성 신호에 포함된 채널 중 하나일 수도 있고, 기 설정된 채널일 수도 있다. 또한, 주요 신호 추정부(120)는 다른 신호에 비해서 음압이 큰 신호를 주요 신호로 추정할 수도 있고, 일정 값보다 음압이 큰 신호를 주요 신호로 추정할 수도 있다.At this time, the main signal estimator 120 may estimate a sound source signal included in a specific channel with a large sound pressure as a main signal. At this time, the specific channel may be one of the channels included in the sound field synthesis signal, or may be a predetermined channel. Also, the main signal estimator 120 may estimate a signal having a larger sound pressure as a main signal, or a signal having a larger sound pressure than a predetermined value as a main signal, as compared with other signals.

또한, 주요 신호 추정부(120)는 음장 합성 신호에 포함된 음원 신호 중 주요 신호로 추정 되지 않은 신호를 부가 신호(ambient signal)로 정의할 수 있다.Also, the main signal estimator 120 may define a signal that is not estimated as a main signal among the sound source signals included in the sound field synthesis signal as an ambient signal.

음상정위 정보 추정부(130)는 주요 신호 추정부(120)가 추정한 주요 신호를 기초로 음장 합성 신호의 음상정위 정보를 추정할 수 있다. 구체적으로, 음상정위 정보 추정부(130)는 음장 합성 신호의 각 채널에 포함된 주요 신호의 비율을 기초로 각 채널의 위치를 추정하고, 추정한 채널의 위치와 재생 위치 정보를 기초로 각 주요 신호의 음상정위 정보를 추정할 수 있다.The sound localization information estimation unit 130 can estimate the sound localization information of the sound field synthesis signal based on the main signal estimated by the main signal estimation unit 120. [ Specifically, the sound localization information estimation unit 130 estimates the positions of the respective channels based on the ratio of the main signals included in the respective channels of the sound field synthesis signal, and based on the estimated channel position and the reproduction position information, It is possible to estimate the image position information of the signal.

이때, 특정 채널에서 특정 주요 신호가 가장 큰 음압을 가지는 경우, 상기 특정 채널에 인접한 다른 채널에서 상기 특정 주요 신호는 상기 특정 채널보다 적은 음압을 가질 수 있다. 일례로, 제2 채널과 제3 채널이 각각 제1 채널의 좌우에 인접한 경우, 주요 신호 중 하나가 제1 채널에서 가장 큰 음압을 가지면, 상기 주요 신호는 제2 채널과 제3 채널에서 두 번째로 큰 음압을 가질 수 있다. 따라서, 음상정위 정보 추정부(130)는 각 채널 별 주요 신호의 음압 비율에 따라 음장 합성 신호를 재생할 오디오 시스템의 채널 정보에 대응하는 채널들의 위치를 추정할 수 있다. 일례로, 음장 합성 신호가 22.2채널이고, 음장 합성 신호를 재생할 오디오 시스템이 5.1 채널인 경우, 음상정위 정보 추정부(130)는 22.2채널의 각 채널별 주요 신호의 음압 비율에 따라 22.2채널의 각 채널들이 각각 5.1 채널 중 어떤 채널에 대응하는지를 추정할 수 있다.At this time, when a specific main signal has a largest sound pressure in a specific channel, the specific main signal may have a lower sound pressure than the specific channel in another channel adjacent to the specific channel. For example, if the second channel and the third channel are adjacent to the left and right of the first channel, respectively, if one of the key signals has the greatest sound pressure in the first channel, As shown in FIG. Therefore, the sound localization information estimation unit 130 can estimate the positions of the channels corresponding to the channel information of the audio system to reproduce the sound field synthesis signal according to the sound pressure ratio of the main signal for each channel. For example, when the sound field synthesis signal is 22.2 channels and the audio system to reproduce the sound field synthesis signal is 5.1 channels, the sound phase information estimation unit 130 calculates the phase angle of the 22.2 channels according to the sound pressure ratio of the main signal for each channel of 22.2 channels. It is possible to estimate which one of the channels corresponds to which of the 5.1 channels.

또한, 본 발명에 따른 재생 위치 정보 추정부(110), 주요 신호 추정부(120), 및 음상정위 정보 추정부(130)는 다운 믹서(Down mixer)에 포함된 세부 구성일 수 있다.Also, the playback position information estimation unit 110, the main signal estimation unit 120, and the sound image position information estimation unit 130 according to the present invention may have a detailed configuration included in a down mixer.

랜더링부(140)는 음상정위 정보 추정부(130)가 추정한 주요 신호와 음상정위 정보 추정부(130)가 정의한 부가 신호를 음장 합성 신호를 재생할 오디오 시스템에 적합한 형태로 랜더링하여 재생할 수 있다. 이때, 랜더링부(140)는 채널 믹서(Channel mixer)일 수 있다.The rendering unit 140 may render and reproduce the main signal estimated by the sound phase information estimation unit 130 and the additional signal defined by the sound phase information estimation unit 130 in a form suitable for the audio system to reproduce the sound field synthesis signal. At this time, the rendering unit 140 may be a channel mixer.

구체적으로, 랜더링부(140)는 음상정위 정보를 기초로 음장 합성 신호를 재생할 오디오 시스템에서 상기 주요 신호에 대응하는 채널을 선택하고, 상기 주요 신호에 패닝을 적용하여 상기 대응하는 채널에서 재생할 수 있다.Specifically, the rendering unit 140 may select a channel corresponding to the main signal in an audio system for reproducing a sound field synthesized signal based on the sound image position information, and apply panning on the main signal to reproduce the corresponding channel .

일례로, 음장 합성 신호를 재생할 오디오 시스템이 5.1 채널이고, 음상정위 정보 추정부(130)가 추정한 주요 신호의 음상정위 정보가 C(Center) 채널인 경우, 랜더링부(140)는 별다른 패닝 없이 C 채널에서 해당 주요 신호를 재생할 수 있다. 반면, 음상정위 정보 추정부(130)가 추정한 주요 신호의 음상정위 정보가 BC(Back Center) 채널인 경우, 5.1 채널의 오디오 시스템에서 BC 채널은 없다. 따라서, 랜더링부(140)는 BC 채널에서 가장 가까운 채널인 LS(Left Side) 채널과 RS(Right Side) 채널을 선택하고, 주요 신호에 패닝을 적용하여 LS 채널과 RS 채널에서 재생하도록 할 수 있다.For example, when the audio system to reproduce the sound field synthesis signal is 5.1 channels and the sound image position information of the main signal estimated by the sound phase information estimation unit 130 is a C (Center) channel, the rendering unit 140 can generate The main signal can be played back on the C channel. On the other hand, when the sound image position information of the main signal estimated by the sound phase information estimation unit 130 is a BC (Back Center) channel, there is no BC channel in the 5.1 channel audio system. Therefore, the rendering unit 140 can select the LS (Left Side) channel and the RS (Right Side) channel that are closest to the BC channel, and apply panning on the main signal to reproduce the LS channel and the RS channel .

또한, 랜더링부(140)는 음장 합성 신호를 재생할 오디오 시스템에 후방 채널이 있는 경우, 부가 신호를 상기 오디오 시스템의 후방 채널에 인가하여 재생할 수 있다. 그리고, 랜더링부(140)는 상기 오디오 시스템에 후방 채널이 없는 경우, 부가 신호를 상기 오디오 시스템의 모든 채널에 파위(power) 비율로 인가하여 재생할 수 있다. 일례로, 랜더링부(140)는 상기 오디오 시스템이 2채널인 경우에는 모든 채널에 부가 신호를

로 인가하고, 상기 오디오 시스템이 3채널인 경우에는 모든 채널에 부가 신호를

으로 인가할 수 있다.In addition, if there is a rear channel in the audio system to reproduce the sound field synthesis signal, the rendering unit 140 can reproduce the additional signal by applying the additional signal to the rear channel of the audio system. If there is no back channel in the audio system, the rendering unit 140 can reproduce by applying the additional signal to all the channels of the audio system at a power ratio. For example, when the audio system is two channels, the rendering unit 140 outputs an additional signal to all channels

If the audio system has three channels, it applies an additional signal to all channels

As shown in FIG.

도 2는 본 발명의 일실시예에 따른 음장 합성 신호의 다운 믹싱 장치의 동작 일례이다. 2 is an example of the operation of an apparatus for downmixing a sound field synthesis signal according to an embodiment of the present invention.

도 2는 M개의 채널 신호로 구성된 음장 합성 신호를 5.1 채널 신호로 다운믹싱하여 재생하는 과정의 동작 일례이다. 이때, 음장 합성 신호는 5.1 채널보다 많은 채널을 가진 신호일 수 있다.2 is an example of an operation of a process of downmixing a sound field synthesis signal composed of M channel signals into a 5.1 channel signal and reproducing the 5.1 channel signal. At this time, the sound field synthesis signal may be a signal having more channels than 5.1 channels.

먼저, 본 발명에 따른 재생 위치 정보 추정부(110), 주요 신호 추정부(120), 및 음상정위 정보 추정부(130)를 포함하는 다운 믹서(Down mixer)(210)는 도 2에 도시된 바와 같이 음장 합성 신호를 구성하는 M개의 채널 신호를 입력 받을 수 있다.2, the down mixer 210 includes a playback position information estimation unit 110, a main signal estimation unit 120, and a sound image position information estimation unit 130 according to the present invention. M channel signals constituting the sound field synthesis signal can be input as shown in FIG.

이때, 다운 믹서(210)는 수신한 음장 합성 신호의 재생 위치 정보를 기초로 음장 합성 신호의 주요 신호를 추정하고 부가 신호를 정의할 수 있다. 수신한 음장 합성 신호의 재생 위치 정보가 포함되지 않은 경우, 다운 믹서(210)는 음장 합성 신호의 인접 채널간의 진폭(Amplitude)과 지연(Delay)에 기초하여 재생 위치 정보를 추정할 수 있다.At this time, the down mixer 210 can estimate the main signal of the sound field synthesis signal and define the additional signal based on the reproduction position information of the received sound field synthesis signal. If the reproduction position information of the received sound field synthesis signal is not included, the downmixer 210 can estimate the reproduction position information based on the amplitude and delay between adjacent channels of the sound field synthesis signal.

다음으로 다운 믹서(210)는 추정한 주요 신호를 기초로 음장 합성 신호의 음상정위 정보를 추정할 수 있다.Next, the downmixer 210 can estimate the sound localization information of the sound field synthesis signal based on the estimated main signal.

그 다음으로 다운 믹서(210)는 추정한 음상정위 정보와 주요 신호, 및 정의한 부가 신호를 채널 믹서(220)로 전송할 수 있다.Next, the downmixer 210 may transmit the estimated sound image position information, the main signal, and the defined additional signal to the channel mixer 220.

마지막으로 채널 믹서(220)는 수신한 음상정위 정보를 기초로 주요 신호와 부가 신호를 랜더링하여 5.1 채널에서 재생할 수 있다. Finally, the channel mixer 220 can reproduce the main signal and the additional signal on the basis of the received sound image position information and reproduce it on the 5.1 channel.

도 3은 본 발명의 일실시예에 따른 음장 합성 신호의 다운 믹싱 방법을 도시한 플로우차트이다.3 is a flowchart illustrating a downmixing method of a sound field synthesis signal according to an embodiment of the present invention.

단계(S310)에서 재생 위치 정보 추정부(110)는 사용자로부터 입력 받은 음장 합성 신호의 재생 위치 정보를 추정할 수 있다. 구체적으로 재생 위치 정보 추정부(110)는 음장 합성 신호의 인접 채널간의 진폭(Amplitude)과 지연(Delay)에 기초하여 재생 위치 정보를 추정할 수 있다. 또한, 사용자로부터 입력 받은 음장 합성 신호에 재생 위치 정보가 포함된 경우, 단계(S310)는 생략될 수 있다.In operation S310, the reproduction position information estimation unit 110 may estimate the reproduction position information of the sound field synthesis signal received from the user. Specifically, the reproduction position information estimation unit 110 can estimate the reproduction position information based on the amplitudes and delays between adjacent channels of the sound field synthesis signal. Also, if the sound field synthesis signal received from the user includes the reproduction position information, step S310 may be omitted.

단계(S320)에서 주요 신호 추정부(120)는 단계(S310)에서 추정한 재생 위치 정보를 기초로 음장 합성 신호의 주요 신호를 추정할 수 있다. 구체적으로 주요 신호 추정부(120)는 음장 합성 신호에 음장 합성 렌더링을 역으로 수행하여 적어도 하나의 주요 신호를 추정할 수 있다. In step S320, the main signal estimator 120 may estimate the main signal of the sound field synthesis signal based on the reproduction position information estimated in step S310. Specifically, the main signal estimator 120 may estimate at least one main signal by performing a sound field synthesis rendering on the sound field synthesis signal inversely.

단계(S330)에서, 주요 신호 추정부(120)는 단계(S310)에서 수신한 음장 합성 신호에 포함된 음원 신호 중 단계(S320)에서 주요 신호로 추정 되지 않은 신호를 부가 신호(ambient signal)로 정의할 수 있다.In step S330, the main signal estimator 120 multiplies the signal not estimated as the main signal in step S320 among the sound source signals included in the sound field synthesis signal received in step S310 as an ambient signal Can be defined.

단계(S340)에서 음상정위 정보 추정부(130)는 단계(S320)에서 추정한 주요 신호를 기초로 음장 합성 신호의 음상정위 정보를 추정할 수 있다. 구체적으로, 음상정위 정보 추정부(130)는 음장 합성 신호의 각 채널에 포함된 주요 신호의 비율을 기초로 각 채널의 위치를 추정하고, 추정한 채널의 위치와 재생 위치 정보를 기초로 각 주요 신호의 음상정위 정보를 추정할 수 있다.In step S340, the sound localization information estimation unit 130 may estimate the sound localization information of the sound field synthesis signal based on the main signal estimated in step S320. Specifically, the sound localization information estimation unit 130 estimates the positions of the respective channels based on the ratio of the main signals included in the respective channels of the sound field synthesis signal, and based on the estimated channel position and the reproduction position information, It is possible to estimate the image position information of the signal.

단계(S350)에서 랜더링부(140)는 단계(S320)에서 추정한 주요 신호와 단계(S330)에서 정의한 부가 신호를 음장 합성 신호를 재생할 오디오 시스템에 적합한 형태로 랜더링하여 재생할 수 있다. 구체적으로, 랜더링부(140)는 단계(S340)에서 추정한 음상정위 정보를 기초로 음장 합성 신호를 재생할 오디오 시스템에서 상기 주요 신호에 대응하는 채널을 선택하고, 상기 주요 신호에 패닝을 적용하여 상기 대응하는 채널에서 재생할 수 있다.In step S350, the rendering unit 140 may render the main signal estimated in step S320 and the additional signal defined in step S330 by rendering the sound field synthesis signal in a form suitable for the audio system to be reproduced. Specifically, the rendering unit 140 selects a channel corresponding to the main signal in the audio system to reproduce the sound field synthesis signal based on the sound image position information estimated in step S340, and applies panning to the main signal, It can be reproduced on the corresponding channel.

이때, 랜더링부(140)는 음장 합성 신호를 재생할 오디오 시스템에 후방 채널이 있는 경우, 부가 신호를 상기 오디오 시스템의 후방 채널에 인가하여 재생할 수 있다. 그리고, 랜더링부(140)는 상기 오디오 시스템에 후방 채널이 없는 경우, 부가 신호를 상기 오디오 시스템의 모든 채널에 인가하여 재생할 수 있다.At this time, if there is a rear channel in the audio system to reproduce the sound field synthesis signal, the rendering unit 140 can reproduce the additional signal by applying the additional signal to the rear channel of the audio system. If there is no back channel in the audio system, the rendering unit 140 can apply the additional signal to all channels of the audio system to reproduce the audio signal.

본 발명은 음장 합성 신호를 기초로 음장 합성 신호의 음상정위 정보를 추정하여 음장 합성 신호를 다운 믹스함으로써 하위 멀티 채널 오디오 재생 시스템에서도 고위 멀티 채널 오디오 재생 시스템에 최적화된 음장 합성 신호를 재생할 수 있다.The present invention can reproduce a sound field synthesis signal optimized for a high-level multi-channel audio reproduction system even in a lower multi-channel audio reproduction system by down-mixing the sound field synthesis signal by estimating the sound field synthesis information of the sound field synthesis signal based on the sound field synthesis signal.

이상과 같이 본 발명은 비록 한정된 실시예와 도면에 의해 설명되었으나, 본 발명은 상기의 실시예에 한정되는 것은 아니며, 본 발명이 속하는 분야에서 통상의 지식을 가진 자라면 이러한 기재로부터 다양한 수정 및 변형이 가능하다.While the invention has been shown and described with reference to certain preferred embodiments thereof, it will be understood by those of ordinary skill in the art that various changes in form and details may be made therein without departing from the spirit and scope of the invention as defined by the appended claims. This is possible.

그러므로, 본 발명의 범위는 설명된 실시예에 국한되어 정해져서는 아니 되며, 후술하는 특허청구범위뿐 아니라 이 특허청구범위와 균등한 것들에 의해 정해져야 한다.Therefore, the scope of the present invention should not be limited to the described embodiments, but should be determined by the equivalents of the claims, as well as the claims.

110: 재생 위치 정보 추정부
120: 주요 신호 추정부
130: 음상정위 정보 추정부
140: 랜더링부
110: Playback position information estimating unit
120: main signal estimation unit
130: sound phase orientation information estimating unit
140:

Claims

A reproduction position information estimating unit for estimating reproduction position information of the input sound field synthesis signal;
A main signal estimator for estimating a main signal of the sound field synthesized signal based on the reproduction position information; And
A sound localization information estimating section for estimating sound localization information of the sound field synthesis signal based on the main signal,
Lt; / RTI >
Wherein the playback position information estimating unit estimates,
Mixer to estimate the reproduction position information based on an amplitude and a delay between adjacent channels of the sound field synthesis signal.

delete

The method according to claim 1,
The playback position information may include:
Wherein the sound field synthesis signal includes at least one of an interval between output devices through which each channel of the sound field synthesis signal is reproduced, a placement direction of the output devices, and a position of the output device.

A reproduction position information estimating unit for estimating reproduction position information of the input sound field synthesis signal;
A main signal estimator for estimating a main signal of the sound field synthesized signal based on the reproduction position information; And
A sound localization information estimating section for estimating sound localization information of the sound field synthesis signal based on the main signal,
Lt; / RTI >
Wherein the main signal estimator comprises:
Wherein at least one main signal is estimated by performing a sound field synthesis rendering on the sound field synthesis signal in reverse.

5. The method of claim 4,
Wherein the main signal estimator comprises:
Wherein a signal not estimated as the main signal among the sound source signals included in the sound field synthesis signal is defined as an ambient signal.

A reproduction position information estimating unit for estimating reproduction position information of the input sound field synthesis signal;
A main signal estimator for estimating a main signal of the sound field synthesized signal based on the reproduction position information; And
A sound localization information estimating section for estimating sound localization information of the sound field synthesis signal based on the main signal,
Lt; / RTI >
Wherein the sound image position information estimation unit comprises:
Estimating a position of a channel based on a ratio of a main signal for each channel of the sound field synthesis signal and estimating the sound field position information based on the estimated channel position and the reproduction position information, Down mixer.

6. The method of claim 5,
The main signal and the additional signal are rendered and reproduced in a form corresponding to the audio system to reproduce the sound field synthesis signal,
Mixer for down-mixing the sound field synthesized signal.

8. The method of claim 7,
The rendering unit may include:
Wherein the channel selecting unit selects a channel corresponding to the main signal in the audio system on the basis of the sound image position information, and performs panning on the main signal to reproduce the sound signal in the corresponding channel.

8. The method of claim 7,
The rendering unit may include:
And if the audio system has a rear channel, the additional signal is applied to a rear channel of the audio system for reproduction.

8. The method of claim 7,
The rendering unit may include:
Wherein the additional signal is applied to all channels of the audio system and reproduced when the audio system does not have a rear channel.

Estimating reproduction position information of an input sound field synthesis signal;
Estimating a primary signal of the sound field synthesis signal based on the reproduction position information; And
Estimating sound image position information of the sound field synthesized signal based on the main signal
Lt; / RTI >
Wherein the step of estimating the reproduction position information comprises:
And estimating the reproduction position information based on an amplitude and a delay between adjacent channels of the sound field synthesized signal.

delete

12. The method of claim 11,
Wherein the step of estimating the reproduction position information comprises:
Wherein the sound field synthesis signal includes at least one of an interval between output devices through which each channel of the sound field synthesis signal is reproduced, a placement direction of the output devices, and a position of the output device.

Estimating reproduction position information of an input sound field synthesis signal;
Estimating a primary signal of the sound field synthesis signal based on the reproduction position information; And
Estimating sound image position information of the sound field synthesized signal based on the main signal
Lt; / RTI >
Wherein the step of estimating the main signal comprises:
Wherein at least one main signal is estimated by performing a sound field synthesis rendering on the sound field synthesis signal in reverse.

15. The method of claim 14,
A step of defining a signal not estimated as the main signal among the sound source signals included in the sound field synthesis signal as an ambient signal;
Lt; RTI ID = 0.0 > downmixing < / RTI >

Estimating reproduction position information of an input sound field synthesis signal;
Estimating a primary signal of the sound field synthesis signal based on the reproduction position information; And
Estimating sound image position information of the sound field synthesized signal based on the main signal
Lt; / RTI >
Wherein the step of estimating the sound image orientation information comprises:
Estimating a position of a channel based on a ratio of a main signal for each channel of the sound field synthesis signal; And
Estimating the sound image position information based on the estimated channel position and the reproduction position information
Lt; RTI ID = 0.0 > downmixing < / RTI >

16. The method of claim 15,
Rendering the main signal and the additional signal in a form corresponding to an audio system to reproduce the sound field synthesis signal and reproducing
Lt; RTI ID = 0.0 > downmixing < / RTI >

18. The method of claim 17,
The method of claim 1,
Selecting a channel corresponding to the main signal in the audio system based on the sound image position information; And
Applying panning to the main signal to reproduce on the corresponding channel
Lt; RTI ID = 0.0 > downmixing < / RTI >

18. The method of claim 17,
The method of claim 1,
Wherein if the audio system has a rear channel, the additional signal is applied to a rear channel of the audio system for reproduction.

18. The method of claim 17,
The method of claim 1,
And if the audio system does not have a backward channel, the additional signal is applied to all channels of the audio system for reproduction.