KR20190084106A

KR20190084106A - System and method for loudspeaker position estimation

Info

Publication number: KR20190084106A
Application number: KR1020197017051A
Authority: KR
Inventors: 구앙지 시; 마이클 엠. 구뒨; 에드워드 스타인
Original assignee: 디티에스, 인코포레이티드
Priority date: 2016-11-16
Filing date: 2017-11-10
Publication date: 2019-07-15
Also published as: US20180136898A1; CN110192397B; US11622220B2; CN110192397A; US10887716B2; US10575114B2; US20200267489A1; US10313817B2; WO2018093671A1; US20190268710A1; EP3542555B1; US20180249273A1; US20180139560A1; US9986359B1; JP7035048B2; JP2020501419A; EP3542555A1; WO2018093670A1; US10375498B2; KR102456765B1

Abstract

라우드스피커의 위치를 추정하고, 부정확한 라우드스피커 배향 또는 라우드스피커와 마이크로폰 어레이 사이의 경로에서의 막힘과 같은 비정상 조건이 검출되는 경우 청취자에게 통지하기 위한 시스템 및 방법의 실시예가 설명된다. 예를 들어, 다중채널 서라운드 사운드 시스템의 전방 컴포넌트는 마이크로폰 어레이 및 위치 추정 엔진을 포함할 수 있다. 위치 추정 엔진은 라우드스피커와 마이크로폰 어레이 사이의 거리를 추정할 수 있다. 또한, 위치 추정 엔진은 제1 기법을 사용하여 라우드스피커의 각도를 추정할 수 있다. 위치 추정 엔진은 또한 제2 기법을 사용하여 라우드스피커의 각도를 추정할 수 있다. 2개의 각도가 프로세싱되어 비정상 조건이 존재하는지의 여부를 결정할 수 있다. 비정상 조건이 존재하는 경우, 청취자는 통지받을 수 있고, 그래픽 사용자 인터페이스에서 문제를 해결하기 위한 제안을 제공받을 수 있다.There is described an embodiment of a system and method for estimating the position of a loudspeaker and notifying the listener when an abnormal condition such as an incorrect loudspeaker orientation or blockage in the path between the loudspeaker and the microphone array is detected. For example, a front component of a multi-channel surround sound system may include a microphone array and a location estimation engine. The location estimation engine can estimate the distance between the loudspeaker and the microphone array. In addition, the position estimation engine may estimate the angle of the loudspeaker using the first technique. The position estimation engine may also estimate the angle of the loudspeaker using a second technique. Two angles may be processed to determine whether an abnormal condition is present. If there is an abnormal condition, the listener may be notified and may be offered a suggestion to solve the problem in the graphical user interface.

Description

System and method for loudspeaker position estimation

[관련 출원에 대한 상호 참조][Cross reference to related application]

본 출원은 발명의 명칭이 "라우드스피커 위치 추정을 위한 시스템 및 방법(SYSTEM AND METHOD FOR LOUDSPEAKER POSITION ESTIMATION)"이고 2016년 11월 16일 자로 출원된 미국 가특허 출원 제62/423,041호에 대해 35 U.S.C. §119(e) 하에서 우선권을 주장하며, 그 전체가 본원에 참조로 인용되어 있다.This application claims the benefit of US Provisional Application No. 62 / 423,041, filed November 16, 2016, entitled " SYSTEM AND METHOD FOR LOUDSPEAKER POSITION ESTIMATION " Priority is claimed under §119 (e), which is incorporated herein by reference in its entirety.

[배경 기술]BACKGROUND ART [0002]

서라운드 사운드 시스템은 일반적으로 고품질의 청취 경험을 달성하기 위해 청취자에 의한 보정(calibration)을 필요로 한다. 통상적으로, 서라운드 사운드 시스템은 스위트 스팟(sweet spot) 또는 디폴트 청취 위치에 배치된 다중-요소 마이크로폰을 사용하여 수동으로 보정되어 서라운드 사운드 시스템의 각 라우드스피커에서 재생되는 테스트 신호를 수신한다. 다중-요소 마이크로폰은 통상적으로 긴 케이블을 통해 오디오/비디오(A/V) 수신기 또는 프로세서에 테더링(tethering)된다. 그러나, 다중-요소 마이크로폰을 스위트 스팟 또는 디폴트 청취 위치에 물리적으로 배치하는 것은 청취자에게는 번거로울 수 있다.Surround sound systems typically require calibration by a listener to achieve a high quality listening experience. Typically, the surround sound system receives a test signal that is manually corrected using a sweet spot or a multi-element microphone placed in the default listening position and reproduced on each loudspeaker of the surround sound system. Multi-element microphones are typically tethered to an audio / video (A / V) receiver or processor over long cables. However, physically placing the multi-element microphone in a sweet spot or a default listening position can be cumbersome for the listener.

본 개시의 일 양태는 다중채널 서라운드 사운드 시스템에서 라우드스피커의 위치를 추정하기 위한 장치를 제공한다. 본 장치는, 제1 마이크로폰 및 제2 마이크로폰을 포함하는 마이크로폰 어레이를 포함하며, 제1 마이크로폰은 라우드스피커가 테스트 신호를 출력하는 데 응답하여 제1 오디오 신호를 생성하도록 구성되고, 제2 마이크로폰은 라우드스피커가 테스트 신호를 출력하는 데 응답하여 제2 오디오 신호를 생성하도록 구성된다. 본 장치는 마이크로폰 어레이에 커플링된 위치 추정 엔진을 더 포함하고, 위치 추정 엔진은: 제1 오디오 신호 및 제2 오디오 신호에 기초하여 도달 시간차 추정치를 결정하고; 도달 시간차 추정치에 기초하여 제1 각도를 결정하고; 제1 오디오 신호로부터 도출된 임펄스 응답에서 제1 직접 경로 컴포넌트를 식별하고; 제2 오디오 신호로부터 도출된 임펄스 응답에서 제2 직접 경로 컴포넌트를 식별하고; 제1 직접 경로 컴포넌트 및 제2 직접 경로 컴포넌트에 기초하여 제2 각도를 결정하고; 제1 각도와 제2 각도의 비교에 기초하여 비정상 조건이 존재하는지의 여부를 결정하도록 구성된다.One aspect of the present disclosure provides an apparatus for estimating the position of a loudspeaker in a multi-channel surround sound system. The apparatus includes a microphone array including a first microphone and a second microphone, wherein the first microphone is configured to generate a first audio signal in response to the loudspeaker outputting a test signal, And to generate a second audio signal in response to the speaker outputting the test signal. The apparatus further includes a position estimation engine coupled to the microphone array, the position estimation engine comprising: determining an arrival time difference estimate based on the first audio signal and the second audio signal; Determine a first angle based on an arrival time difference estimate; Identify a first direct path component in an impulse response derived from the first audio signal; Identify a second direct path component in an impulse response derived from the second audio signal; Determine a second angle based on the first direct path component and the second direct path component; Based on a comparison of the first angle and the second angle, whether or not an abnormal condition exists.

선행 단락의 장치는 이하의 특징, 즉, 위치 추정 엔진은 또한: 제1 오디오 신호를 하나 이상의 제1 세그먼트로 분할하고, 제2 오디오 신호를 하나 이상의 제2 세그먼트로 분할하고, 제1 푸리에 변환(Fourier transform)을 형성하기 위해 하나 이상의 제1 세그먼트 중 첫 번째 세그먼트의 푸리에 변환을 생성하고, 제2 푸리에 변환을 형성하기 위해 하나 이상의 제2 세그먼트 중 첫 번째 세그먼트의 푸리에 변환을 생성하고, 제1 푸리에 변환 및 제2 푸리에 변환에 기초하여 도달 시간차 추정치를 결정하도록 구성되는 것; 위치 추정 엔진은 또한: 하나 이상의 제1 세그먼트 및 하나 이상의 제2 세그먼트의 생성된 푸리에 변환에 기초하여 복수의 도달 시간차 추정치를 결정하고, 복수의 도달 시간차 추정치를 히스토그램으로 집계하고, 히스토그램에서 가장 많은 발생을 갖는, 복수의 도달 시간차 추정치 중의 도달 시간차 추정치에 기초하여 도달 시간차 추정치를 결정하도록 구성되는 것; 위치 추정 엔진은 또한, 제1 오디오 신호로부터 도출된 임펄스 응답 내의 최고 진폭에 기초하여, 제1 오디오 신호로부터 도출된 임펄스 응답에서 제1 직접 경로 컴포넌트를 식별하도록 구성되는 것; 위치 추정 엔진은 또한: 제1 직접 경로 컴포넌트를 포함하는 제1 시간 윈도우를 선택하고, 제2 직접 경로 컴포넌트를 포함하는 제2 시간 윈도우를 선택하고, 제1 시간 윈도우의 데이터 및 제2 시간 윈도우의 데이터를 사용하여 상호 상관을 결정하고, 결정된 상호 상관을 사용하여 제2 각도를 결정하도록 구성되는 것; 위치 추정 엔진은 또한: 제1 각도와 제2 각도를 비교하고, 제1 각도 및 제2 각도가 임계 각도값 내에 있지 않다는 결정에 응답하여 비정상 조건이 존재한다고 결정하도록 구성되는 것; 임계 각도값은 0°와 15° 사이의 값을 포함하는 것; 본 장치는, 비정상 조건이 존재한다는 결정에 응답하여 네트워크를 통해 사용자 디바이스에 통지를 송신하도록 구성된 통지 생성기를 더 포함하는 것; 통지는 라우드스피커의 각도가 부정확하다는 표시, 물체가 라우드스피커와 마이크로폰 어레이 사이의 경로를 막고 있다는 표시, 라우드스피커의 극성이 부정확하다는 표시, 및 라우드스피커의 위치가 부정확하다는 표시 중 하나를 포함하는 것; 본 장치는 사운드바(soundbar), 오디오/비주얼(A/V: audio/visual) 수신기, 중앙 스피커, 및 텔레비전 중 하나를 포함하는 것; 및 다중채널 서라운드 사운드 시스템은 스테레오, 2.1, 3.1, 5.1, 5.2, 7.1, 7.2, 11.1, 11.2, 및 22.2 스피커 레이아웃 중 하나로 배열되는 것 중 임의의 하위 조합을 포함할 수 있다.The apparatus of the preceding paragraph is characterized in that the position estimation engine further comprises: dividing the first audio signal into one or more first segments, dividing the second audio signal into one or more second segments, and performing a first Fourier transform Generating a Fourier transform of a first one of the one or more first segments to form a first Fourier transform, generating a Fourier transform of a first one of the one or more second segments to form a second Fourier transform, And to determine an arrival time difference estimate based on the second Fourier transform; The location estimation engine may also: determine a plurality of arrival time difference estimates based on the generated Fourier transform of the at least one first segment and the at least one second segment, aggregate the plurality of arrival time difference estimates into a histogram, And to determine an arrival time difference estimate based on arrival time difference estimates in a plurality of arrival time difference estimates; The position estimation engine is also configured to identify a first direct path component in an impulse response derived from the first audio signal, based on the highest amplitude in the impulse response derived from the first audio signal; The location estimation engine may also: select a first time window that includes the first direct path component, a second time window that includes the second direct path component, compare the data of the first time window and the data of the second time window To determine a cross-correlation using the data, and to determine a second angle using the determined cross-correlation; The position estimation engine is further configured to: compare the first angle with the second angle, and to determine that an abnormal condition is present in response to determining that the first and second angles are not within the threshold angle value; The critical angle value includes a value between 0 and 15; The apparatus further comprises a notification generator configured to send a notification to the user device over the network in response to the determination that an abnormal condition is present; The notification includes either an indication that the angle of the loudspeaker is incorrect, an indication that the object is blocking the path between the loudspeaker and the microphone array, an indication that the polarity of the loudspeaker is incorrect, and an indication that the location of the loudspeaker is incorrect ; The apparatus includes one of a soundbar, an audio / visual (A / V) receiver, a center speaker, and a television; And a multi-channel surround sound system may include any subset of those arranged in one of the stereo, 2.1, 3.1, 5.1, 5.2, 7.1, 7.2, 11.1, 11.2, and 22.2 speaker layouts.

본 개시의 다른 양태는 다중채널 서라운드 사운드 시스템에서 라우드스피커의 위치를 추정하는 방법을 제공한다. 본 방법은, 마이크로폰 어레이의 제1 마이크로폰으로부터 제1 오디오 신호를 수신하고 마이크로폰 어레이의 제2 마이크로폰으로부터 제2 오디오 신호를 수신하는 단계; 제1 오디오 신호 및 제2 오디오 신호에 기초하여 도달 시간차 추정치를 결정하는 단계; 도달 시간차 추정치에 기초하여 제1 각도를 결정하는 단계; 제1 오디오 신호로부터 도출된 임펄스 응답에서 제1 직접 경로 컴포넌트를 식별하는 단계; 제2 오디오 신호로부터 도출된 임펄스 응답에서 제2 직접 경로 컴포넌트를 식별하는 단계; 제1 직접 경로 컴포넌트 및 제2 직접 경로 컴포넌트에 기초하여 제2 각도를 결정하는 단계; 및 제1 각도와 제2 각도의 비교에 기초하여 비정상 조건이 존재하는지의 여부를 결정하는 단계를 포함한다.Another aspect of the disclosure provides a method of estimating the position of a loudspeaker in a multi-channel surround sound system. The method includes receiving a first audio signal from a first microphone of a microphone array and receiving a second audio signal from a second microphone of the microphone array; Determining an arrival time difference estimate based on the first audio signal and the second audio signal; Determining a first angle based on an arrival time difference estimate; Identifying a first direct path component in an impulse response derived from the first audio signal; Identifying a second direct path component in an impulse response derived from the second audio signal; Determining a second angle based on the first direct path component and the second direct path component; And determining whether an abnormal condition exists based on a comparison of the first angle and the second angle.

선행 단락의 방법은 이하의 특징, 즉, 도달 시간차 추정치를 결정하는 단계는: 제1 오디오 신호를 하나 이상의 제1 세그먼트로 분할하는 단계, 제2 오디오 신호를 하나 이상의 제2 세그먼트로 분할하는 단계, 제1 푸리에 변환을 형성하기 위해 하나 이상의 제1 세그먼트 중 첫 번째 세그먼트의 푸리에 변환을 생성하는 단계, 제2 푸리에 변환을 형성하기 위해 하나 이상의 제2 세그먼트 중 첫 번째 세그먼트의 푸리에 변환을 생성하는 단계, 및 제1 푸리에 변환 및 제2 푸리에 변환에 기초하여 도달 시간차 추정치를 결정하는 단계를 더 포함하는 것; 제1 오디오 신호로부터 도출된 임펄스 응답에서 제1 직접 경로 컴포넌트를 식별하는 단계는, 제1 오디오 신호로부터 도출된 임펄스 응답 내의 최고 진폭에 기초하여, 제1 오디오 신호로부터 도출된 임펄스 응답에서 제1 직접 경로 컴포넌트를 식별하는 단계를 더 포함하는 것; 비정상 조건이 존재하는지의 여부를 결정하는 단계는: 제1 각도와 제2 각도를 비교하는 단계, 및 제1 각도 및 제2 각도가 임계 각도값 내에 있지 않다는 결정에 응답하여 비정상 조건이 존재한다고 결정하는 단계를 더 포함하는 것; 임계 각도값은 0°와 15° 사이의 값을 포함하는 것; 본 방법은, 비정상 조건이 존재한다는 결정에 응답하여 네트워크를 통해 사용자 디바이스에 통지를 송신하는 단계를 더 포함하고, 통지는 라우드스피커의 각도가 부정확하다는 표시, 물체가 라우드스피커와 마이크로폰 어레이 사이의 경로를 막고 있다는 표시, 라우드스피커의 극성이 부정확하다는 표시, 및 라우드스피커의 위치가 부정확하다는 표시 중 하나를 포함하는 것 중 임의의 하위 조합을 포함할 수 있다.The method of the preceding paragraph comprises the steps of: dividing the first audio signal into one or more first segments, dividing the second audio signal into one or more second segments, Generating a Fourier transform of a first one of the one or more first segments to form a first Fourier transform, generating a Fourier transform of a first one of the one or more second segments to form a second Fourier transform, And determining an arrival time difference estimate based on the first Fourier transform and the second Fourier transform; Wherein identifying a first direct path component in an impulse response derived from a first audio signal comprises comparing a first direct path component in an impulse response derived from a first audio signal to a first direct path component in an impulse response derived from a first audio signal, Further comprising identifying a path component; Determining whether an abnormal condition is present comprises: comparing the first angle with a second angle, and determining that an abnormal condition is present in response to determining that the first and second angles are not within the threshold angle value Further comprising the steps of: The critical angle value includes a value between 0 and 15; The method further comprises transmitting a notification to the user device over the network in response to the determination that an abnormal condition is present, wherein the indication is an indication that the angle of the loudspeaker is incorrect, the object is a path between the loudspeaker and the microphone array , An indication that the polarity of the loudspeaker is incorrect, and an indication that the location of the loudspeaker is inaccurate.

본 개시의 다른 양태는, 실행가능 명령어가 저장된 비일시적 물리적 컴퓨터 저장 장치를 제공하며, 실행가능 명령어는, 하드웨어 프로세서에 의해 실행될 때, 적어도: 테스트 신호를 송신하도록 라우드스피커에 명령하고; 제1 기법을 사용하여, 마이크로폰 어레이 내의 제1 마이크로폰으로부터 기록된 제1 오디오 신호 및 마이크로폰 어레이 내의 제2 마이크로폰으로부터 기록된 제2 오디오 신호에 기초하여 제1 각도를 결정하고; 제2 기법을 사용하여 제1 오디오 신호 및 제2 오디오 신호에 기초하여 제2 각도를 결정하고; 제1 각도와 제2 각도의 비교에 기초하여 비정상 조건이 존재하는지의 여부를 결정하도록 구성된다.Another aspect of the present disclosure provides a non-volatile physical computer storage device in which executable instructions are stored, wherein the executable instructions, when executed by a hardware processor, instruct at least: a loudspeaker to transmit a test signal; Determine a first angle based on a first audio signal recorded from a first microphone in the microphone array and a second audio signal recorded from a second microphone in the microphone array using the first technique; Determine a second angle based on the first audio signal and the second audio signal using a second technique; Based on a comparison of the first angle and the second angle, whether or not an abnormal condition exists.

선행 단락의 비일시적 물리적 컴퓨터 저장 장치는 이하의 특징, 즉, 제1 기법은 일반화된 상호 상관 및 위상 변환(GCC-PHAT: Generalized Cross Correlation And Phase Transform) 기법을 포함하는 것; 및 제2 기법은 직접 경로 컴포넌트(DPC: direct path component) 기법을 포함하는 것 중 임의의 하위 조합을 포함할 수 있다.The non-volatile physical computer storage of the preceding paragraph includes the following features: the first technique includes a Generalized Cross Correlation and Phase Transform (GCC-PHAT) technique; And the second technique may include any subset of those that include direct path component (DPC) techniques.

본 개시의 요약을 목적으로, 본 발명의 특정 양태, 이점 및 신규한 특징들이 본원에 설명되어 있다. 이러한 모든 이점이 본원에 개시된 본 발명의 임의의 특정 실시예에 따라 반드시 달성될 수 있는 것은 아니라는 것을 이해해야 한다. 따라서, 본원에 개시된 본 발명은 본원에서 교시되거나 제시될 수 있는 다른 이점을 반드시 달성하지 않고도 본원에서 교시된 바와 같은 이점 또는 이점의 그룹을 달성하거나 최적화하는 방식으로 실시되거나 수행될 수 있다.For purposes of summarizing the disclosure, certain aspects, advantages, and novel features of the invention are described herein. It is to be understood that not all of these advantages may necessarily be achieved in accordance with any particular embodiment of the invention disclosed herein. Accordingly, the invention disclosed herein may be practiced or carried out in a manner that accomplishes or optimizes a group of benefits or advantages as taught herein without necessarily achieving other benefits which may be or will be taught herein.

도면 전체에서, 참조 번호는 참조된 요소 간의 대응 관계를 나타내기 위해 재사용된다. 도면은 본원에 설명된 본 발명의 실시예를 예시하기 위해서 제공되는 것이지 그 범위를 제한하기 위해 제공되는 것은 아니다.
도 1a는 일 실시예에 따라 라우드스피커 위치를 추정하고 다중채널 서라운드 사운드 시스템을 보정하기 위한 예시적인 실내 환경을 도시하는 하이-레벨 블록도를 나타낸다.
도 1b는 일 실시예에 따라 네트워크를 통해 사운드바와 통신하는 사용자 디바이스를 도시하는 블록도를 나타낸다.
도 2a는 예시적인 라우드스피커 위치 추정 프로세스를 나타낸다.
도 2b는 예시적인 직접 경로 컴포넌트(DPC) 선택 프로세스를 나타낸다.
도 2c는 DPC를 사용하는 예시적인 라우드스피커 각도 결정 프로세스를 나타낸다.
도 2d는 예시적인 라우드스피커 비정상 조건 검출 프로세스를 나타낸다.
도 3은 도 1a의 마이크로폰 어레이 중 제1 마이크로폰에 대한 임펄스 응답 및 도 1a의 마이크로폰 어레이 중 제2 마이크로폰에 대한 임펄스 응답을 도시하는 예시적인 그래프를 나타낸다.
도 4는 결정된 제1 각도와 결정된 제2 각도가 유사한 값을 갖는 상황을 도시하는 예시적인 그래프를 나타낸다.
도 5는 결정된 제1 각도 및 결정된 제2 각도가 유사한 값을 갖지 않는 상황을 도시하는 예시적인 그래프를 나타낸다.
도 6은 도 1b의 사용자 디바이스에 의해 디스플레이될 수 있는 예시적인 그래픽 사용자 인터페이스(GUI: graphical user interface)를 나타낸다.
도 7은 다른 예시적인 라우드스피커 위치 추정 프로세스를 나타낸다.
도 8은 일 실시예에 따라 도 1a의 예시적인 실내 환경에서의 제1 각도의 예를 도시하는 하이-레벨 블록도를 나타낸다.Throughout the drawings, reference numerals are reused to indicate the correspondence between the referenced elements. The drawings are provided to illustrate embodiments of the invention described herein and are not provided to limit the scope thereof.
FIG. 1A illustrates a high-level block diagram illustrating an exemplary indoor environment for estimating a loudspeaker position and correcting a multi-channel surround sound system in accordance with one embodiment.
1B illustrates a block diagram illustrating a user device communicating with a sound bar over a network in accordance with one embodiment.
2A illustrates an exemplary loudspeaker position estimation process.
Figure 2B illustrates an exemplary direct path component (DPC) selection process.
Figure 2C shows an exemplary loudspeaker angle determination process using DPC.
Figure 2d shows an exemplary loudspeaker abnormal condition detection process.
Figure 3 shows an exemplary graph showing the impulse response for a first microphone in the microphone array of Figure la and the impulse response for a second one of the microphone arrays of Figure 1a.
Figure 4 shows an exemplary graph illustrating a situation where the determined first angle and the determined second angle have similar values.
Figure 5 shows an exemplary graph illustrating a situation where the determined first angle and the determined second angle do not have a similar value.
FIG. 6 shows an exemplary graphical user interface (GUI) that may be displayed by the user device of FIG. 1B.
7 illustrates another exemplary loudspeaker position estimation process.
FIG. 8 shows a high-level block diagram illustrating an example of a first angle in the exemplary indoor environment of FIG. 1A in accordance with one embodiment.

도입Introduction

배경 기술에서 전술한 바와 같이, 보정의 목적을 위해 스위트 스팟 또는 디폴트 청취 위치에 다중-요소 마이크로폰을 물리적으로 배치하도록 청취자에게 요구하는 것은 번거로울 수 있다. 따라서, 하나 이상의 마이크로폰은 중앙의 알려진 위치에 통합될 수 있다. 예를 들어, 하나 이상의 마이크로폰은, 사운드바, A/V 수신기, 중앙 스피커, 텔레비전, 및/또는 텔레비전 아래 또는 위의 디바이스 등과 같은 다중채널 서라운드 사운드 시스템의 전방 컴포넌트에 통합될 수 있다.As described in the background, it may be cumbersome to require the listener to physically place the multi-element microphone at the sweet spot or default listening position for the purpose of correction. Thus, one or more microphones may be integrated at a known location in the center. For example, the one or more microphones may be integrated into a front component of a multi-channel surround sound system, such as a sound bar, an A / V receiver, a center speaker, a television, and / or a device below or above the television.

표준 레이아웃은 최적 또는 거의 최적의 청취 경험을 달성하기 위해, 하나 이상의 마이크로폰의 중앙의 알려진 위치 또는 다른 기준점(예를 들어, 청취 위치)에 대해 상대적으로 하나 이상의 라우드스피커를 배치할 위치(들)를 나타낼 수 있다. 그러나, 청취자가 최적의 위치에 라우드스피커를 배치하는 것은 종종 어렵다. 예를 들어, 벽, 문, 가구, 및/또는 다른 물체는, 청취자가 하나 이상의 라우드스피커를 최적의 위치에 배치하지 못하게 할 수 있다. 라우드스피커가 최적의 위치에 배치될 수 없는 경우, 오디오 이미지의 왜곡이 발생할 수 있으며 청취자는 컨텐츠 제작자가 의도한 사운드 신(sound scene)을 경험하지 못할 수 있다.The standard layout may include position (s) to place one or more loudspeakers relative to a known location in the center of one or more microphones or other reference points (e.g., listening positions) to achieve an optimal or near optimal listening experience . However, it is often difficult for the listener to place the loudspeaker in the optimal position. For example, walls, doors, furniture, and / or other objects may prevent the listener from placing the one or more loudspeakers in an optimal position. If the loudspeaker can not be placed in an optimal position, distortion of the audio image may occur and the listener may not experience the sound scene intended by the content creator.

라우드스피커가 최적의 위치에 배치될 수 없는 경우, 라우드스피커로 전송되는 오디오 신호가 수정되어 공간 보정 및 보상을 통해 왜곡을 최소화할 수 있다. 공간 보정의 경우, 하나 이상의 마이크로폰의 중앙의 알려진 위치에 대해 상대적이거나 청취 위치에 대해 상대적인 라우드스피커의 위치를 결정하는 것이 중요할 수 있다.If the loudspeaker can not be placed in an optimal position, the audio signal transmitted to the loudspeaker can be modified to minimize distortion through spatial correction and compensation. In the case of spatial correction, it may be important to determine the position of the loudspeaker relative to the known position in the center of one or more microphones or relative to the listening position.

통상적으로, 라우드스피커의 위치는 라우드스피커를 통해 보정 신호를 재생하고, 보정 마이크로폰(들)으로 결과적인 음향 신호를 수신하고, 마이크로폰 출력 신호(들)를 기록하고, 기록된 신호(들)를 위상 변환 가중화를 갖는 일반화된 상호 상관(GCC-PHAT: Generalized Cross Correlation with Phase Transform weighting)과 같은 기법으로 분석함으로써 추정된다. GCC-PHAT 기법은, 라우드스피커가 하나 이상의 마이크로폰에 직접 대향할 때 신뢰성 있는 결과를 생성할 수 있다. 그러나, GCC-PHAT 기법은 다른 상황에서는 신뢰성 없는 결과를 생성할 수 있다. 예를 들어, GCC-PHAT 기법은, 물체가 라우드스피커와 하나 이상의 마이크로폰 사이의 경로를 막는 경우 신뢰성 없는 결과를 생성할 수 있다. 다른 예로서, GCC-PHAT 기법은, 라우드스피커가 하나 이상의 마이크로폰을 향하는 방향 이외의 방향으로 지향되는 경우, 신뢰성 없는 결과를 생성할 수 있다.Typically, the position of the loudspeaker reproduces the correction signal through the loudspeaker, receives the resulting acoustic signal with the correction microphone (s), records the microphone output signal (s), and outputs the recorded signal (s) (GCC-PHAT: Generalized Cross Correlation with Phase Transform Weighting). The GCC-PHAT technique can produce reliable results when a loudspeaker is directly opposite one or more microphones. However, the GCC-PHAT technique can produce unreliable results in other situations. For example, the GCC-PHAT technique can produce unreliable results if an object blocks the path between the loudspeaker and one or more microphones. As another example, the GCC-PHAT technique may produce unreliable results if the loudspeaker is oriented in a direction other than the direction toward one or more microphones.

따라서, 라우드스피커의 위치 및/또는 배향을 추정하고, 라우드스피커 배향이 부정확하거나 물체가 라우드스피커와 마이크로폰 사이의 경로를 막고 있는 경우 청취자에게 통지하는 라우드스피커 위치 추정 시스템의 실시예가 본원에 설명된다. 예를 들어, 라우드스피커 위치 추정 시스템은 2개 이상의 마이크로폰을 포함하는 디바이스(예를 들어, AV 수신기, 사운드바, 중앙 스피커, 텔레비전 등) 내에 통합될 수 있다. 라우드스피커 위치 추정 시스템은 테스트 신호(예를 들어, 최대 길이 시퀀스)를 출력하도록 라우드스피커에 명령할 수 있다. 각각의 마이크로폰은 출력된 테스트 신호의 결과로서 생성된 음향 신호를 수신할 수 있다. 각각의 마이크로폰은 음향 신호를 오디오 신호로 변환할 수 있으며, 오디오 신호는 저장 디바이스에 기록될 수 있다. 라우드스피커 위치 추정 시스템은 라우드스피커의 거리를 결정할 수 있다. 또한, 라우드스피커 위치 추정 시스템은 GCC-PHAT 기법 및 기록된 오디오 신호를 사용하여 라우드스피커의 추정된 각도를 결정할 수 있다. 라우드스피커 위치 추정 시스템은 또한, 기록된 오디오 신호로부터 얻어진 직접 경로 컴포넌트(DPC)를 사용하여 라우드스피커의 추정된 각도를 결정할 수 있다.Thus, an embodiment of a loudspeaker location estimation system is described herein that estimates the location and / or orientation of the loudspeaker and notifies the listener if the loudspeaker orientation is incorrect or the object is blocking the path between the loudspeaker and the microphone. For example, a loudspeaker position estimation system may be integrated within a device (e.g., an AV receiver, a sound bar, a center speaker, a television, etc.) that includes two or more microphones. The loudspeaker position estimation system may command the loudspeaker to output a test signal (e.g., a maximum length sequence). Each microphone can receive the resulting acoustic signal of the output test signal. Each microphone can convert an acoustic signal into an audio signal, and the audio signal can be written to the storage device. The loudspeaker position estimation system can determine the distance of the loudspeaker. The loudspeaker position estimation system can also use the GCC-PHAT technique and the recorded audio signal to determine the estimated angle of the loudspeaker. The loudspeaker position estimation system may also use a direct path component (DPC) obtained from the recorded audio signal to determine an estimated angle of the loudspeaker.

라우드스피커 위치 추정 시스템은 2개의 추정된 각도를 비교할 수 있다. 2개의 추정된 각도가 임계 각도값 내에 있는 경우, 라우드스피커 위치 추정 시스템은 청취자에게 마이크로폰과 라우드스피커 사이의 막힘이 검출되지 않았음을 통지할 수 있다. 라우드스피커 위치 추정 시스템은 마찬가지로 청취자에게 라우드스피커가 청취 원의 중심을 향하여 정확하게 배향되어 있음을 통지할 수 있다. 선택적으로, 라우드스피커 위치 추정 시스템은, 2개의 추정된 각도가 임계 각도값 내에 있는 경우 추정된 각도 중 적어도 하나에 기초하여 보상 컴포넌트에 대한 파라미터를 도출할 수 있으며, 파라미터는 장래에 비이상적인 라우드스피커 배치 또는 다른 비정상 조건(예를 들어, 라우드스피커를 통해 재생하기 전에 오디오를 프로세싱할 때)을 보상하는 데 사용될 수 있다. 그러나, 2개의 추정된 각도가 임계 각도값 내에 있는 경우, 라우드스피커의 부정확하거나 비이상적인 각도 및/또는 부정확하거나 비이상적인 라우드스피커 위치가 존재할 수 있다. 라우드스피커 위치 추정 시스템은 하나의 추정된 각도 또는 두 추정된 각도 다를 이상적인 각도(예를 들어, 이상적인 라우드스피커 레이아웃에서 특정된 각도)와 비교할 수 있고, 비교된 각도가 다른 임계 각도값 내에 있지 않으면, 라우드스피커 위치 추정 시스템은 라우드스피커의 부정확하거나 비이상적인 각도 및/또는 부정확하거나 비이상적인 라우드스피커 위치가 존재하는 것으로 결정할 수 있다. 2개의 추정된 각도가 임계 각도값 내에 있지 않으면, 라우드스피커 위치 추정 시스템은 청취자에게 비정상 조건이 검출되었음을 통지할 수 있다. 비정상 조건의 예는 마이크로폰과 라우드스피커 사이의 막힘, 라우드스피커의 부정확하거나 비이상적인 각도, 부정확한 라우드스피커 극성 및/또는 부정확하거나 비이상적인 라우드스피커 위치를 포함할 수 있다. 모바일 컴퓨팅 디바이스와 같은 사용자 디바이스 상에서 실행되는 애플리케이션은 라우드스피커 위치 추정 시스템과 통신하고 통지를 디스플레이하는 인터페이스를 생성할 수 있다. 대안적으로 또는 추가적으로, 라우드스피커 위치 추정 시스템은 비이상적인 라우드스피커 배치 또는 다른 비정상 조건을 보상하기 위해 라우드스피커로 전송된 신호를 수정하는 보상 컴포넌트에 정보를 제공할 수 있다. 라우드스피커 위치 추정 시스템의 부가적인 상세 사항은 도 1a 내지 도 8과 관련하여 아래에서 설명된다.The loudspeaker position estimation system can compare the two estimated angles. If the two estimated angles are within the critical angle value, the loudspeaker position estimation system may notify the listener that clogging between the microphone and the loudspeaker has not been detected. The loudspeaker position estimation system can likewise notify the listener that the loudspeaker is correctly oriented towards the center of the listening circle. Alternatively, the loudspeaker position estimation system may derive a parameter for the compensation component based on at least one of the estimated angles when the two estimated angles are within the threshold angle value, And may be used to compensate for placement or other abnormal conditions (e.g., when processing audio before playing through a loudspeaker). However, if the two estimated angles are within the critical angle value, there may be an inaccurate or non-ideal angle of the loudspeaker and / or an incorrect or unrealistic loudspeaker position. The loudspeaker position estimation system can compare one estimated angle or two estimated angles with an ideal angle (e.g., an angle specified in an ideal loudspeaker layout), and if the compared angle is not within a different threshold angle value, The loudspeaker position estimation system may determine that there is an incorrect or non-ideal angle of the loudspeaker and / or an incorrect or unrealistic loudspeaker position. If the two estimated angles are not within the critical angle value, the loudspeaker position estimation system may notify the listener that an abnormal condition has been detected. Examples of abnormal conditions may include clogging between the microphone and the loudspeaker, incorrect or unrealistic angle of the loudspeaker, incorrect loudspeaker polarity, and / or incorrect or unrealistic loudspeaker position. An application running on a user device, such as a mobile computing device, may communicate with the loudspeaker location system and create an interface to display the notification. Alternatively or additionally, the loudspeaker position estimation system may provide information to a compensation component that modifies the signal transmitted to the loudspeaker to compensate for non-ideal loudspeaker placement or other abnormal conditions. Additional details of the loudspeaker location estimation system are described below with respect to Figures 1A-8.

예시적인 라우드스피커 위치 추정 시스템의 개요An overview of an exemplary loudspeaker position estimation system

개요의 방식으로, 도 1a는 일 실시예에 따라 라우드스피커 위치를 추정하고 다중채널 서라운드 사운드 시스템을 보정하기 위한 예시적인 실내 환경(100)을 도시하는 하이-레벨 블록도를 나타낸다. 다중채널 서라운드 사운드 시스템은 스테레오, 2.1, 3.1, 5.1, 5.2, 7.1, 7.2, 11.1, 11.2 또는 22.2와 같은 표준화된 라우드스피커 레이아웃에 따라 종종 배열된다. 웨이브 필드 합성(WFS: wave field synthesis) 어레이 또는 다른 객체-기반 렌더링 레이아웃과 같은 다른 라우드스피커 레이아웃 또는 어레이가 또한 사용될 수 있다. 사운드바는 모니터 또는 텔레비전과 같은 디스플레이 디바이스 위 또는 아래에 장착될 수 있는 특수 라우드스피커 인클로저이다. 최근의 사운드바 모델은 종종 선택적인 중앙 스피커 및/또는 서브우퍼(subwoofer)도 갖는 좌측 및 우측 채널 스피커를 통합하는 스피커 어레이를 포함하는 전력을 공급받는 시스템이다. 사운드바는 유선 또는 무선 서라운드 스피커 및/또는 서브우퍼와 접속되는 경우 독립형 서라운드 사운드 시스템 또는 홈 시어터 시스템의 주요 전방 컴포넌트 중 어느 하나에 대한 유연한 해결책이 되었다.1 a shows a high-level block diagram illustrating an exemplary indoor environment 100 for estimating a loudspeaker position and calibrating a multi-channel surround sound system in accordance with one embodiment. Multi-channel surround sound systems are often arranged according to a standardized loudspeaker layout such as stereo, 2.1, 3.1, 5.1, 5.2, 7.1, 7.2, 11.1, 11.2 or 22.2. Other loudspeaker layouts or arrays such as wave field synthesis (WFS) arrays or other object-based render layouts may also be used. The soundbar is a special loudspeaker enclosure that can be mounted above or below a display device such as a monitor or television. Modern sound bar models are powered systems that often include an array of speakers incorporating left and right channel speakers with optional center speakers and / or subwoofers. The soundbar has become a flexible solution to either a standalone surround sound system or a major front component of a home theater system when connected to wired or wireless surround speakers and / or a subwoofer.

도 1a에서, 실내 환경(100)은 3.1 라우드스피커 배열(예를 들어, 사운드바(110), 좌측 서라운드 라우드스피커(106), 우측 서라운드 라우드스피커(108) 및 서브우퍼(104)), 텔레비전(102)(또는 모니터 또는 비디오 스크린), 청취자(120) 및 카우치(122)를 포함한다. 사운드바(110)는 그 인클로저에 통합된 스피커 어레이(112), 마이크로폰 어레이(114), 위치 추정 엔진(116), 보정 엔진(도시하지 않음) 및 A/V 프로세싱 엔진(도시하지 않음)을 포함할 수 있다. 다른 실시예에서, 사운드바(110)는 도 1a에 나타낸 것과 상이한, 더 적은, 또는 더 많은 컴포넌트를 포함한다.In FIG. 1A, the indoor environment 100 includes three loudspeaker arrangements (e. G., Soundbar 110, left surround loudspeaker 106, right surround loudspeaker 108 and subwoofer 104) 102 (or a monitor or video screen), a listener 120, and a couch 122. The sound bar 110 includes a speaker array 112 integrated into the enclosure, a microphone array 114, a position estimation engine 116, a correction engine (not shown) and an A / V processing engine (not shown) can do. In another embodiment, the soundbar 110 includes fewer or more components than the one shown in FIG. 1A.

DVD, Blu-ray® 및 스트리밍 컨텐츠의 등장과 확산은 다중채널 사운드트랙의 광범위한 이용 가능성으로 이어졌다. 대부분의 현대적인 서라운드 사운드 포맷은 이러한 컨텐츠를 적절하게 재생하기 위한 이상적인 라우드스피커 배치를 특정한다. 서라운드 사운드 시스템을 소유하고 있는 통상적인 청취자는 실내 레이아웃이나 가구 배치와 같은 실용적인 이유 때문에 라우드스피커 설정에 대한 이러한 사양을 종종 따를 수 없다. 이는 종종 컨텐츠 제작자의 의도와 청취자의 공간 오디오 경험 사이의 불일치로 이어진다. 예를 들어, 권장되는 배열 원(130)을 따라 라우드스피커를 배치하고 청취자가 권장되는 배열 원(130)의 중심 또는 그 근처에 앉는 것이 최상의 관례로서 종종 권장된다. 권장되는 라우드스피커 배열에 대한 더욱 상세한 사항은 국제 전기 통신 연합(ITU: International Telecommunication Union) 보고서 ITU-R BS.2159-4(05/2012) "가정 및 방송 애플리케이션에서의 다중채널 사운드 기술"에서 볼 수 있으며, 그 전체가 참조로 통합된다. 그러나, 실내 환경(100) 또는 사용자 선호도에서의 실내 제약으로 인해, 우측 서라운드 라우드스피커(108)는 그 권장되는 위치(109)에 배치되지 않고, 청취자(120)는 권장되는 배열 원(130)의 중심으로부터 떨어진 카우치(122)에 앉아 있다.The advent and spread of DVD, Blu-ray® and streaming content has led to the wide availability of multi-channel soundtracks. Most modern surround sound formats specify the ideal loudspeaker placement for proper reproduction of such content. A typical listener who owns a surround sound system often can not follow these specifications for loudspeaker settings for practical reasons such as interior layout or furniture placement. This often leads to inconsistencies between the content producer's intent and the listener's spatial audio experience. For example, it is often best practice to place a loudspeaker along the recommended array source 130 and sit at or near the center of the array source 130 where the listener is recommended. More details on the recommended loudspeaker arrangement can be found in the International Telecommunication Union (ITU) Report ITU-R BS.2159-4 (05/2012) "Multichannel Sound Technology in Home and Broadcast Applications" And is incorporated by reference in its entirety. However, due to room constraints in the indoor environment 100 or user preferences, the right surround loudspeaker 108 is not placed at its recommended position 109, and the listener 120 is not in the recommended array source 130 And seated on the couch 122 away from the center.

일반적으로 공간 보정으로 알려진 이러한 문제에 대한 한 가지 해결책은 통상적으로 청취자가 디폴트 청취 위치(또는 스위트 스팟)에 마이크로폰 어레이를 배치할 것을 요구한다. 그 다음, 시스템은 각각의 라우드스피커를 통해 테스트 신호를 재생하고, 대응하는 마이크로폰 신호(예를 들어, 마이크로폰 어레이에 의해 캡처된 음향 신호의 변환된 버전)를 기록하고, 기록된 신호를 분석하여 각각의 라우드스피커의 위치를 근사화한다. 각각의 라우드스피커(106 및 108)의 위치를 근사화함으로써, 시스템은 보상 프로세스를 사용하여 다중채널 사운드트랙을 실제 스피커 레이아웃으로 공간적으로 재포맷한다. 명확히 설명하면, 공간 보정 프로세스는 일반적으로 각각의 라우드스피커의 위치를 근사화하고, 라우드스피커 설정 에러를 결정하기 위해 근사화된 위치를 이용하고, 설정 에러에 대한 해결책을 제시하고, 그리고/또는 보상 프로세스에 의해 사용될 수 있는 보상 파라미터(예를 들어, 라우드스피커 위치 추정치)를 추정하는 것을 포함한다. 보상 프로세스는 (예를 들어, 라우드스피커가 비이상적인 시스템 레이아웃에 있을 때) 최적의 재생을 위해 오디오 신호를 수정하기 위해 추정된 보상 파라미터를 사용하는 것을 포함할 수 있다. 그러나, 이러한 공간 보정 프로세스는 통상적인 청취자에게 위협적이거나 불편할 수 있다. 청취자(120)가 다른 위치로 이동할 때, 이러한 기존의 방법은 이러한 변화를 검출하고 보상할 방법이 없으며, 청취자(120)는 새로운 청취 위치에 배치된 마이크로폰으로 수동으로 전체 보정 프로세스를 거쳐야 한다. 대조적으로, 사운드바(110) 내의 통합된 마이크로폰 어레이(114)를 사용하여, 그 전체가 본원에 참조로 통합되고, 발명의 명칭이 "청취자 위치 추정을 포함하는 서라운드 사운드 시스템의 공간 보정(SPATIAL CALIBRATION OF SURROUND SOUND SYSTEMS INCLUDING LISTENER POSITION ESTIMATION)"인 미국 특허 공보 제2015/0016642호에 더욱 상세하게 설명되어 있는 바와 같이, 사운드바(110) 내의 보정 엔진(도시하지 않음)이 라우드스피커(106 및 108)에 대한 공간 보정을 수행할 수 있을 뿐만 아니라 최소한의 사용자 개입으로 청취자(120)의 위치를 추정할 수 있다. 일부 실시예에서, 그 전체가 본원에 참조로 통합되고, 2016년 11월 21일 자로 출원되고 발명의 명칭이 "서라운드 사운드 시스템을 보정하기 위한 그래픽 사용자 인터페이스(GRAPHICAL USER INTERFACE FOR CALIBRATING A SURROUND SOUND SYSTEM)"인 미국 특허 출원 제15/357,971호에 보다 상세히 설명되어 있는 바와 같이, 청취자 위치는 그래픽 사용자 인터페이스를 통해 특정될 수 있다. 공간 보정에 대한 또 다른 접근법은 각각의 라우드스피커(106 및 108)에 마이크로폰을 통합하는 것으로, 이는 중앙 컴포넌트에 통합된 소형 마이크로폰 어레이를 사용하는 것에 비해 엄청나게 비쌀 수 있다.One solution to this problem, commonly known as spatial correction, typically requires the listener to place the microphone array in the default listening position (or sweet spot). The system then reproduces the test signal through each loudspeaker, records the corresponding microphone signal (e.g., the converted version of the acoustic signal captured by the microphone array), analyzes the recorded signal, Of the loudspeaker. By approximating the location of each loudspeaker 106 and 108, the system spatially reforms the multi-channel soundtrack into a real speaker layout using a compensation process. Specifically, the spatial correction process generally approximates the position of each loudspeaker, utilizes the approximated position to determine the loudspeaker setup error, presents a solution to the setup error, and / (E.g., a loudspeaker position estimate) that can be used by the receiver. The compensation process may include using the estimated compensation parameters to modify the audio signal for optimal playback (e.g., when the loudspeaker is in a non-ideal system layout). However, such a spatial correction process may be both intimidating and inconvenient to a typical listener. When the listener 120 moves to another location, this conventional method has no way to detect and compensate for such a change, and the listener 120 has to go through the entire calibration process manually with the microphone placed at the new listening position. In contrast, using an integrated microphone array 114 in the sound bar 110, the entirety of which is incorporated herein by reference and is entitled "SPATIAL CALIBRATION OF A SURROUND SOUND SYSTEM CONTAINING LISTENER LOCATION INSPECTION (Not shown) within the sound bar 110 is provided to the loudspeakers 106 and 108, as described in more detail in US Patent Publication No. 2015/0016642, entitled " OF SURROUND SOUND SYSTEMS INCLUDING LISTENER POSITION ESTIMATION & As well as estimate the location of the listener 120 with minimal user intervention. In some embodiments, the entirety of which is incorporated herein by reference, and is entitled " GRAPHICAL USER INTERFACE FOR CALIBRATING A SURROUND SOUND SYSTEM ", filed on November 21, 2016, &Quot;, as described in more detail in U. S. Patent Application Serial No. < RTI ID = 0.0 > 15 / 357,971, < / RTI > Another approach to spatial correction is to incorporate a microphone into each of the loudspeakers 106 and 108, which can be prohibitively expensive compared to using a miniature microphone array integrated in the central component.

그러나, 보상 파라미터의 정확한 추정치는 라우드스피커(106 및 108)의 위치의 정확한 추정치에 의존할 수 있다. 부정확한 라우드스피커(106 및 108) 위치 추정치는 열등한 오디오 신호 수정 및/또는 차선의 청취 경험으로 이어질 수 있다. 예를 들어, 라우드스피커(106 또는 108)가 마이크로폰 어레이(114)로부터 멀어지게 향하거나 마이크로폰 어레이(114)와 라우드스피커(106 또는 108) 사이의 경로가 물체에 의해 차단되면, 종래의 라우드스피커 위치 추정 기법은 실패할 수 있다. 예를 들어, 좌측 서라운드 스피커(106)의 전면(front surface)이 마이크로폰 어레이(114)를 지향하는 방향으로 배향되지 않기 때문에, 통상의 라우드스피커 위치 추정 기법은 좌측 서라운드 라우드스피커(106)의 위치를 부정확하게 추정할 수 있다. 다른 예로서, 카우치(122)가 우측 서라운드 라우드스피커(108)와 마이크로폰 어레이(114) 사이의 경로에 놓여 있기 때문에, 우측 서라운드 스피커(108)의 전면이 마이크로폰 어레이(114)를 지향하는 방향으로 배향되더라도 통상적인 라우드스피커 위치 추정 기법이 우측 서라운드 스피커(108)의 위치를 부정확하게 추정할 수 있다.However, an accurate estimate of the compensation parameter may depend on an accurate estimate of the location of the loudspeakers 106 and 108. Incorrect loudspeaker 106 and 108 position estimates may lead to inferior audio signal modification and / or lane listening experience. For example, if the loudspeaker 106 or 108 is moved away from the microphone array 114 or the path between the microphone array 114 and the loudspeaker 106 or 108 is blocked by the object, Estimation techniques can fail. For example, since the front surface of the left surround loudspeaker 106 is not oriented in the direction that points toward the microphone array 114, a conventional loudspeaker position estimation technique may be used to determine the position of the left surround loudspeaker 106 Can be inaccurately estimated. As another example, since the couch 122 is in the path between the right surround loudspeaker 108 and the microphone array 114, the front of the right surround loudspeaker 108 is oriented in the direction toward the microphone array 114 A conventional loudspeaker position estimation technique may inaccurately estimate the position of the right surround speaker 108.

따라서, 라우드스피커(106 또는 108)가 마이크로폰 어레이(114)의 방향으로 배향되지 않거나 라우드스피커(106 또는 108)와 마이크로폰 어레이(114) 사이의 경로가 차단된 경우에도, 위치 추정 엔진(118)이 라우드스피커(106 및 108) 위치 추정을 향상시키는 기법을 구현할 수 있다. 예를 들어, 위치 추정 엔진(116)은 제1 기법을 사용하여 라우드스피커(106 또는 108)의 위치를 추정할 수 있다. 제1 기법을 사용하여 추정된 위치는 베이스라인 추정치로 고려될 수 있다. 위치 추정 엔진(116)은 또한 제2 기법을 사용하여 라우드스피커(106 또는 108)의 위치를 추정할 수 있다. 제2 기법을 사용하여 추정된 위치는 막힌 경로 또는 부정확한 라우드스피커 배향과 같은 비정상 상황을 검출하는 데 활용될 수 있다. 위치 추정 엔진(116)은 두 위치 추정치를 다 사용하여 보다 신뢰성 있는 라우드스피커 위치 추정치를 도출할 수 있다. 라우드스피커 위치 추정치는 사용자 통지를 생성하고 그리고/또는 비이상적인 라우드스피커(106 또는 108) 배치 또는 다른 비정상 조건을 보상하는 데 사용될 수 있다. 제1 및 제2 기법에 대한 추가적인 상세 사항이 도 2a 내지 도 2d에 대하여 후술된다. 위치 추정 엔진(116)은 본원에서 2개의 위치 추정 기법을 사용하는 것으로 설명되었지만, 이는 한정적인 것으로 의도된 것이 아니다. 위치 추정 엔진(116)은 임의의 수의 위치 추정 기법(예를 들어, 3, 4, 5, 6 등)의 결과를 조합하여 보다 신뢰성 있는 라우드스피커 위치 추정치를 도출할 수 있다. 신경망과 같은 인공 지능 또는 휴리스틱 접근법이 사용되어 위치 추정 기법의 결과를 조합하여 보다 신뢰성 있는 라우드스피커 위치 추정치를 도출할 수 있다.Thus, even if the loudspeaker 106 or 108 is not oriented in the direction of the microphone array 114 or the path between the loudspeaker 106 or 108 and the microphone array 114 is blocked, A technique may be implemented that improves loudspeaker 106 and 108 position estimation. For example, the position estimation engine 116 may estimate the position of the loudspeaker 106 or 108 using a first technique. The estimated position using the first technique can be considered as a baseline estimate. The location estimation engine 116 may also estimate the location of the loudspeaker 106 or 108 using a second technique. The estimated position using the second technique can be utilized to detect an abnormal situation such as a clogged path or incorrect loudspeaker orientation. The position estimation engine 116 may use both position estimates to derive a more reliable loudspeaker position estimate. The loudspeaker position estimate may be used to generate user notifications and / or compensate for non-ideal loudspeaker 106 or 108 placement or other abnormal conditions. Additional details of the first and second techniques are described below with respect to Figures 2A-2D. Although the location estimation engine 116 is described herein as using two position estimation techniques, it is not intended to be limiting. The position estimation engine 116 may combine the results of any number of position estimation techniques (e.g., 3, 4, 5, 6, etc.) to yield a more reliable loudspeaker position estimate. An artificial intelligence or heuristic approach such as a neural network may be used to derive more reliable loudspeaker position estimates by combining the results of the location estimation technique.

또한, 사운드바(110)는 비정상 상황이 검출될 때 청취자(120)에 대한 통지를 생성할 수 있다. 도 1b는 일 실시예에 따라 네트워크(115)를 통해 사운드바(110)와 통신하는 사용자 디바이스(140)를 도시하는 블록도를 나타낸다. 네트워크(115)는 근거리 네트워크(LAN: local area network), 광역 네트워크(WAN: wide area network), 인터넷, 또는 이들의 조합을 포함할 수 있다. 도 1b에 나타낸 바와 같이, 예시적인 사운드바(110)는 위치 추정 엔진(116) 및 통지 생성기(118)를 포함한다.In addition, the soundbar 110 may generate a notification to the listener 120 when an abnormal condition is detected. 1B illustrates a block diagram illustrating a user device 140 in communication with a sound bar 110 via a network 115 in accordance with one embodiment. The network 115 may include a local area network (LAN), a wide area network (WAN), the Internet, or a combination thereof. As shown in FIG. 1B, an exemplary sound bar 110 includes a position estimation engine 116 and a notification generator 118.

사용자 디바이스(140)는 청취자(120)에 의해 동작되는 데스크톱 컴퓨터, 랩톱, 태블릿, 퍼스널 디지털 기기(PDA: personal digital assistant), 텔레비전, (스마트폰과 같은) 무선 휴대용 디바이스, 사운드바, 셋-톱 박스, A/V 수신기, 홈 시어터 시스템 컴포넌트, 및/또는 이들의 조합 등일 수 있다. 사용자 디바이스(140)는 사용자 디바이스(140)로 하여금 사용자 인터페이스를 디스플레이하게 하는 애플리케이션을 실행할 수 있다. 예를 들어, 사용자 인터페이스는 사운드바(110), 서브우퍼(104), 좌측 서라운드 라우드스피커(106) 및/또는 우측 서라운드 라우드스피커(108)의 추정된 위치를 디스플레이할 수 있다. 사용자 인터페이스는 또한 청취자(120)로 하여금 보정(예를 들어, 공간 보정 및/또는 비(non)공간 보정)을 개시할 수 있게 하는 기능을 포함할 수 있다. 청취자(120)가 보정을 개시하면, 사용자 디바이스(140)는 보정 동작을 개시하도록 네트워크(115)를 통해 사운드바(110)(예를 들어, 도시하지 않은, 사운드바(110)에 매립된 보정 엔진)에 지시할 수 있다. 보정 동작의 일부로서, 사운드바(110)는 본원에 설명된 방식으로 하나 이상의 라우드스피커(106 및 108)의 위치를 추정하도록 위치 추정 엔진(116)에 명령한다. 위치 추정의 결과에 기초하여, 위치 추정 엔진(116)은, 비정상 조건이 존재하는지의 여부를 결정할 수 있다. 비정상 조건의 예는 마이크로폰 어레이(114)와 라우드스피커(106 또는 108) 사이의 경로를 막고 있는 물체, 부정확한 라우드스피커(106 또는 108) 각도, 부정확한 라우드스피커(106 또는 108) 극성 및/또는 부정확하거나 비이상적인 라우드스피커(106 또는 108) 위치를 포함할 수 있다. 이러한 결정은 위치 추정 엔진(116)에 의해 통지 생성기(118)로 전송될 수 있다. 또한, 위치 추정치(또는 위치 추정치로부터 도출된 파라미터)는, 본원에 설명되는 바와 같이, 배치된 라우드스피커(106 및 108)를 통한 최적의 재생을 위해 다중채널 오디오를 수정하는, 사운드바(110) 내의 보상 컴포넌트(예를 들어, 본원에 설명되는 보상 프로세스를 실행하는 컴포넌트)로 전송될 수 있다.User device 140 may be a desktop computer, a laptop, a tablet, a personal digital assistant (PDA), a television, a wireless portable device (such as a smart phone), a sound bar, a set- Box, an A / V receiver, a home theater system component, and / or a combination thereof. User device 140 may execute an application that causes user device 140 to display a user interface. For example, the user interface may display the estimated position of the sound bar 110, the subwoofer 104, the left surround loudspeaker 106 and / or the right surround loudspeaker 108. The user interface may also include functionality that allows the listener 120 to initiate corrections (e.g., spatial correction and / or non-spatial correction). When the listener 120 initiates the calibration, the user device 140 performs a calibration operation on the sound bar 110 (e.g., a calibration embedded in the sound bar 110 Engine). As part of the correction operation, the soundbar 110 commands the position estimation engine 116 to estimate the position of one or more loudspeakers 106 and 108 in the manner described herein. Based on the result of the position estimation, the position estimation engine 116 can determine whether or not an abnormal condition exists. Examples of abnormal conditions include an object blocking the path between the microphone array 114 and the loudspeaker 106 or 108, an incorrect loudspeaker 106 or 108 angle, an incorrect loudspeaker 106 or 108 polarity, and / May include incorrect or unrealistic loudspeaker 106 or 108 locations. This determination may be sent by the location estimation engine 116 to the notification generator 118. [ In addition, the position estimate (or parameters derived from the position estimate) may be modified by the sound bar 110, which modifies the multi-channel audio for optimal playback through the placed loudspeakers 106 and 108, (E. G., A component that implements the compensation process described herein).

통지 생성기(118)는 위치 추정 엔진(116)에 의해 제공되는 결정에 기초하여 통지를 생성할 수 있다. 예를 들어, 비정상 조건이 검출되지 않으면, 통지 생성기(118)는 비정상 조건이 검출되지 않는다는 통지를 생성할 수 있거나, 어떠한 통지도 생성하지 않을 수 있다. 대안적으로, 비정상 조건이 검출되면, 통지 생성기(118)는 어떠한 비정상 조건이 검출되었는지를 나타내는 통지를 생성하고, 청취자(120)에게 어떻게 문제를 고치는지를 명령할 수 있고 그리고/또는 문제가 보정되는 보정 옵션을 청취자(120)에게 다시 한번 선택하도록 명령할 수 있다. 선택적으로, 사운드바(110)는 더욱 상세하게 후술되는 바와 같이, 문제를 자동으로 고칠 수 있다.The notification generator 118 may generate a notification based on the determination provided by the location estimation engine 116. [ For example, if an abnormal condition is not detected, the notification generator 118 may generate a notification that an abnormal condition is not detected, or may not generate any notification. Alternatively, if an abnormal condition is detected, the notification generator 118 may generate a notification that indicates what abnormal conditions have been detected, instruct the listener 120 how to fix the problem, and / And instruct the listener 120 to select the correction option again. Optionally, the soundbar 110 can automatically correct the problem, as described in more detail below.

통지 생성기(118)는 네트워크(115)를 통해 사용자 디바이스(140)에 통지를 송신할 수 있다. 일단 통지 생성기(118)로부터 통지가 수신되면, 애플리케이션은 사용자 디바이스(140)로 하여금 사용자 인터페이스에 통지를 디스플레이하도록 할 수 있다.The notification generator 118 may send a notification to the user device 140 via the network 115. Once a notification is received from the notification generator 118, the application may cause the user device 140 to display a notification in the user interface.

도 1a 및 도 1b는 서라운드 사운드 시스템 배열의 일례만을 나타낸다는 점에 유의한다. 다른 실시예는 더 많거나 더 적은 라우드스피커를 갖는 상이한 라우드스피커 레이아웃을 포함할 수 있다. 예를 들어, 사운드바(110)는 중앙 채널 스피커, 2개의 전방 채널 스피커(좌측 하나 및 우측 하나) 및 A/V 수신기로 대체되어 통상의 5.1 배열을 형성할 수 있다. 본 예에서, 마이크로폰 어레이(114)는 중앙 채널 스피커 또는 A/V 수신기에 통합될 수 있고, 보정 엔진, 위치 추정 엔진(116) 및 통지 생성기(118)는 A/V 수신기의 일부일 수 있고, 마이크로폰 어레이(114)는 보정 엔진, 위치 추정 엔진(116) 및/또는 통지 생성기(118)에 커플링될 수 있다. 사운드바(110)는 또한 텔레비전 또는 임의의 다른 A/V 컴포넌트로 대체될 수 있으며, 대체 AV 컴포넌트는 보정 엔진, 위치 추정 엔진(116), 통지 생성기(118) 및/또는 마이크로폰 어레이(114)를 포함한다.It should be noted that Figs. 1A and 1B show only one example of a surround sound system arrangement. Other embodiments may include different loudspeaker layouts with more or fewer loudspeakers. For example, the soundbar 110 may be replaced with a center channel speaker, two front channel speakers (left one and right one) and an A / V receiver to form a conventional 5.1 array. In this example, the microphone array 114 may be integrated into a center channel speaker or A / V receiver, and the correction engine, the position estimation engine 116 and the notification generator 118 may be part of an A / V receiver, The array 114 may be coupled to a correction engine, a position estimation engine 116, and / or a notification generator 118. The sound bar 110 may also be replaced by a television or any other A / V component and the alternate AV component may include a correction engine, a position estimation engine 116, a notification generator 118, and / or a microphone array 114 .

도 1a는 2개의 마이크로폰을 포함하는 것으로 마이크로폰 어레이(114)를 나타내지만, 한정적인 것으로 의도되는 것은 아니다. 보다 양호한 측정 및 위치 추정을 위해 상단 라우드스피커, 좌측 전방 라우드스피커, 우측 전방 라우드스피커 및/또는 다른 라우드스피커를 지향하도록 여분의 마이크로폰 또는 마이크로폰 어레이가 설치될 수 있다. 예를 들어, 제3 마이크로폰이 다른 2개의 마이크로폰과 상이한 깊이 및/또는 다른 높이(예를 들어, 다른 2개의 마이크로폰과 직선이 아님)로 마이크로폰 어레이(114)에 포함될 수 있다. 다른 2개의 마이크로폰에 의해 형성된 라인 외부에 제3 마이크로폰을 위치시키는 것은 위치 추정 엔진(116)으로 하여금 3차원(예를 들어, 방위각에 추가하여 천정각)으로 라우드스피커의 각도를 추정하도록 할 수 있다.1A shows a microphone array 114 that includes two microphones, but is not intended to be limiting. An extra microphone or microphone array may be installed to direct the upper loudspeaker, the left front loudspeaker, the right front loudspeaker, and / or other loudspeakers for better measurement and position estimation. For example, the third microphone may be included in the microphone array 114 at a different depth and / or height than the other two microphones (e.g., not with the other two microphones). Placing the third microphone outside the line formed by the other two microphones may cause the position estimation engine 116 to estimate the angle of the loudspeaker in three dimensions (e.g., zenith angle in addition to azimuth).

또한, 도 1a 및 도 1b는 사운드바(110)에 포함된 단일 위치 추정 엔진(116)을 나타내지만, 이는 한정적인 것으로 의도된 것은 아니다. 사운드바(110)는 임의의 수의 위치 추정 엔진(116)(예를 들어, 마이크로폰 어레이(114) 내의 복수의 상이한 서브셋에 대해 각각 하나의 위치 추정 엔진(116))을 포함할 수 있다. 예를 들어, 위치 추정 엔진(116)은 컴퓨터 프로세싱 유닛(CPU), 메모리, 및/또는 다른 하드웨어 컴포넌트(예를 들어, 아날로그 대 디지털 변환기(ADC: analog to digital converter))를 포함하는 하드웨어 디바이스일 수 있다. 버스가 마이크로폰 어레이(114) 내의 마이크로폰에 CPU를 직접 커플링하여, CPU가 마이크로폰으로부터 기록된 오디오 신호를 프로세싱할 수 있다. 사운드바(110)가 단일 위치 추정 엔진(116)을 포함하는 경우, 버스는 마이크로폰 어레이(114) 내의 각각의 마이크로폰을 각각의 마이크로폰으로부터 기록된 오디오 신호를 일시적으로 저장하는 버퍼에 커플링시킬 수 있다. 스위칭 메커니즘은 프로세싱을 위해 버퍼로부터 CPU로 데이터를 보내는 데 사용될 수 있다. 대안적으로, 단일 위치 추정 엔진(116)은 복수의 CPU를 포함할 수 있고 버스는 마이크로폰 어레이(114) 내의 각각의 마이크로폰을 프로세싱을 위해 개별적인 CPU에 커플링시킬 수 있다.1A and 1B show a single location estimation engine 116 included in the sound bar 110, but this is not intended to be limiting. The soundbar 110 may include any number of position estimation engines 116 (e.g., one location estimation engine 116 for each of a plurality of different subsets within the microphone array 114). For example, the position estimation engine 116 may be a hardware device that includes a computer processing unit (CPU), memory, and / or other hardware components (e.g., an analog to digital converter . A bus couples the CPU directly to a microphone in the microphone array 114 so that the CPU can process the recorded audio signal from the microphone. If the soundbar 110 includes a single location estimation engine 116, the bus may couple each microphone in the microphone array 114 to a buffer that temporarily stores the recorded audio signal from each microphone . The switching mechanism can be used to send data from the buffer to the CPU for processing. Alternatively, the single location estimation engine 116 may comprise a plurality of CPUs and the bus may couple each microphone in the microphone array 114 to a separate CPU for processing.

예시적인 라우드스피커 위치 추정 프로세스An exemplary loudspeaker position estimation process

도 2a는 예시적인 라우드스피커 위치 추정 프로세스(200)를 나타낸다. 일 실시예에서, 프로세스(200)는 도 1a 및 도 1b에 대하여 논의한 사운드바(110)(예를 들어, 위치 추정 엔진(116) 및/또는 통지 생성기(118))를 포함하여, 본원에 설명되는 시스템들 중 임의의 것에 의해 수행될 수 있다. 실시예에 따라, 프로세스(200)는 더 적은 그리고/또는 추가적인 블록을 포함할 수 있거나, 블록은 나타낸 것과 다른 순서로 수행될 수 있다.FIG. 2A shows an exemplary loudspeaker position estimation process 200. FIG. In one embodiment, the process 200 is described herein, including the sound bar 110 (e.g., the position estimation engine 116 and / or the notification generator 118) discussed with respect to FIGS. 1A and 1B. Lt; / RTI > may be performed by any of the following systems. Depending on the embodiment, process 200 may include fewer and / or additional blocks, or blocks may be performed in a different order than shown.

블록(204, 206, 208, 210, 212, 214, 216)은 라우드스피커의 위치를 추정하기 위한 제1 예시적인 기법에 대응한다. 예를 들어, 제1 기법은 GCC-PHAT 기법일 수 있으며, 제1 각도 추정치를 생성할 수 있다. 또한, 도 2b 및 도 2c에서 설명되는 프로세스(218 및 220)는 각각 라우드스피커의 위치를 추정하기 위한 제2 기법에 대응한다. 예를 들어, 제2 기법은 DPC 기법일 수 있고, 제2 각도 추정치를 생성할 수 있다.Blocks 204, 206, 208, 210, 212, 214, 216 correspond to a first exemplary technique for estimating the location of a loudspeaker. For example, the first technique may be a GCC-PHAT technique and may generate a first angle estimate. In addition, processes 218 and 220 described in Figures 2B and 2C respectively correspond to a second technique for estimating the position of the loudspeaker. For example, the second technique may be a DPC technique and may generate a second angle estimate.

프로세스(200)는 청취자가 보정을 개시한 후에 블록(202)에서 개시될 수 있고, 보정을 수행하기 위해 사운드바(110)에 의해 구현되는 몇몇 프로세스 중 하나일 수 있다. 본원에 설명되는 프로세스(200)는 단일 라우드스피커의 위치를 추정하고 그리고/또는 추정된 위치에 대한 통지를 생성하기 위해 사운드바(110)(예를 들어, 위치 추정 엔진(116) 및/또는 통지 생성기(118))에 의해 구현될 수 있다. 사운드바(110)는 다중채널 서라운드 사운드 시스템에서 하나 이상의 라우드스피커에 대해 프로세스(200)를 반복할 수 있다. 예를 들어, 사운드바(110)는 좌측 서라운드 라우드스피커(106)에 대해 한 번, 우측 서라운드 스피커(108)에 대해 한 번 프로세스(200)를 수행할 수 있다.Process 200 may be initiated at block 202 after the listener initiates calibration and may be one of several processes implemented by sound bar 110 to perform calibration. The process 200 described herein may be performed by a soundbar 110 (e.g., a location estimation engine 116 and / or a notification device) to estimate a location of a single loudspeaker and / Generator 118). &Lt; / RTI > The sound bar 110 may repeat the process 200 for one or more loudspeakers in a multi-channel surround sound system. For example, the soundbar 110 may perform the process 200 once for the left surround loudspeaker 106 and once for the right surround loudspeaker 108. [

블록(202)에서, 라우드스피커는 테스트 신호를 송신하도록 명령을 받는다. 예를 들어, 테스트 신호는 최대 길이 시퀀스(예를 들어, 의사랜덤 바이너리 시퀀스)일 수 있다. 라우드스피커는 유선 또는 무선 접속을 통해 위치 추정 엔진(116)에 의해 테스트 신호를 송신하도록 명령받을 수 있다. 일 실시예에서, 테스트 신호는 500ms 동안 출력된다. 라우드스피커는 루프백 지연(예를 들어, 하드웨어 버퍼링, 하드웨어 필터링, 디지털에서 아날로그로의 신호 변환, 및/또는 아날로그에서 디지털로의 신호 변환 등에 의해 초래되는 야기된 지연과 같은, 사운드바(110) 및/또는 라우드스피커의 하드웨어 컴포넌트에 의해 야기된 지연)과 같은 지연 후에 테스트 신호를 출력할 수 있다. 마이크로폰 어레이(114) 내의 마이크로폰은 출력된 테스트 신호의 결과로서 생성된 음향 신호를 각각 수신할 수 있다. 그 다음, 마이크로폰 어레이(114) 내의 마이크로폰은 또한, 음향 신호를 오디오 신호에 각각 대응하는 전기 신호로 변환할 수 있다. 오디오 신호는 후속적으로 저장 디바이스(도시하지 않음)에 기록될 수 있다. 예를 들어, 오디오 신호는 출력된 테스트 신호의 직접 경로 컴포넌트뿐만 아니라 실내 환경(100)에서의 물체(예를 들어, 벽, 물체, 등)에 의해 야기되는 반향도 포함할 수 있다. 블록(202)을 완료한 후, 프로세스(200)는 블록(204) 및 프로세스(218)로 계속된다.At block 202, the loudspeaker is commanded to transmit a test signal. For example, the test signal may be a full length sequence (e.g., a pseudorandom binary sequence). The loudspeaker may be instructed to transmit a test signal by the location estimation engine 116 via a wired or wireless connection. In one embodiment, the test signal is output for 500ms. The loudspeaker may be coupled to the soundbar 110 and / or the microphone 110, such as a delay introduced by a loopback delay (e.g., hardware buffering, hardware filtering, signal conversion from digital to analog, and / / RTI > and / or the delay caused by the hardware components of the loudspeaker). The microphones in the microphone array 114 can each receive acoustic signals resulting from the output test signals. The microphone in microphone array 114 may then also convert the acoustic signal into an electrical signal corresponding to the audio signal, respectively. The audio signal may subsequently be written to a storage device (not shown). For example, the audio signal may include echo caused by an object (e.g., a wall, an object, etc.) in the indoor environment 100 as well as a direct path component of the output test signal. After completing block 202, process 200 continues at block 204 and process 218.

블록(204)에서, 변수 n은 마이크로폰 어레이(114) 내의 마이크로폰의 수와 동일하게 설정된다. 예를 들어, 변수 n은 2, 3, 4, 5, 6 등으로 설정될 수 있다.At block 204, the variable n is set equal to the number of microphones in the microphone array 114. For example, the variable n may be set to 2, 3, 4, 5, 6, and so on.

블록(206)에서, 변수는 i는 1과 동일하게 설정된다. 변수 i는 마이크로폰 어레이(114) 내의 특정 마이크로폰을 식별할 수 있다.In block 206, the variable i is set equal to one. Variable i may identify a particular microphone in the microphone array 114. [

블록(208)에서, 마이크로폰 i로부터 기록된 오디오 신호의 푸리에 변환이 수행되어 푸리에 변환 i를 생성한다. 마이크로폰 i로부터의 오디오 신호는 설정된 시간량 동안 기록될 수 있다. 예를 들어, 설정된 시간량은 사운드바(110)로부터의 라우드스피커의 최대 거리 및 테스트 신호의 시간 길이에 기초할 수 있다. 예로서, 사운드바(110)로부터의 라우드스피커의 최대 거리는 15m와 20m 사이일 것으로 예측될 수 있다. 따라서, 공기 중 소리의 속도가 근사적으로 342m/s인 경우, 출력된 테스트 신호가 라우드스피커로부터 마이크로폰 i에 도달하는 예측되는 최대 전파 시간량은 근사적으로 43.9ms와 58.5ms 사이일 수 있다. 마이크로폰 i로부터의 오디오 신호는 적어도 예측되는 최대 전파 시간량에 테스트 신호의 시간 길이를 더한 시간(예를 들어, 적어도 근사적으로 558.5ms) 동안 기록될 수 있다. 푸리에 변환은 전체 기록된 오디오 신호(예를 들어, 오디오 신호의 근사적으로 558.5ms)에 대해 수행될 수 있다. 대안적으로, 푸리에 변환은 기록된 오디오 신호의 일부(예를 들어, 기록된 오디오 신호의 10ms 내지 30ms 세그먼트)에 대해 수행될 수 있고 그리고/또는 복수의 푸리에 변환이 기록된 오디오 신호의 상이한 일부에 대해 수행될 수 있다(예를 들어, 푸리에 변환은 기록된 오디오 신호의 각각의 30ms 세그먼트에 대해 수행될 수 있다).At block 208, a Fourier transform of the audio signal recorded from microphone i is performed to produce a Fourier transform i . The audio signal from the microphone i can be recorded for a set amount of time. For example, the set amount of time may be based on the maximum distance of the loudspeaker from the sound bar 110 and the length of time of the test signal. By way of example, the maximum distance of the loudspeaker from the sound bar 110 may be expected to be between 15 m and 20 m. Thus, when the speed of sound in air is approximately 342 m / s, the expected maximum amount of propagation time that the output test signal reaches the microphone i from the loudspeaker may be approximately between 43.9 ms and 58.5 ms. The audio signal from the microphone i can be recorded at least for a time (for example, at least approximately 558.5 ms) plus the estimated maximum time of propagation time plus the time length of the test signal. The Fourier transform may be performed on the entire recorded audio signal (e.g., approximately 558.5 ms of the audio signal). Alternatively, the Fourier transform may be performed on a portion of the recorded audio signal (e.g., a 10 ms to 30 ms segment of the recorded audio signal) and / or a plurality of Fourier transforms may be performed on a different portion of the recorded audio signal (E.g., the Fourier transform can be performed for each 30 ms segment of the recorded audio signal).

블록(210)에서, 프로세스(200)는 변수 i가 변수 n과 동일한지의 여부를 결정한다. 변수 i 및 n이 동일하다면, 프로세스(200)는 마이크로폰 어레이(114) 내의 각각의 마이크로폰으로부터 기록된 오디오 신호에 대해 푸리에 변환을 수행하였고 블록(214)으로 진행할 수 있다. 그렇지 않으면, 프로세스(200)는 마이크로폰 어레이(114) 내의 각각의 마이크로폰으로부터 기록된 오디오 신호에 대해 푸리에 변환을 수행하지 않았고, 블록(212)으로 진행할 수 있다.At block 210, the process 200 determines whether the variable i is equal to the variable n . If the variables i and n are the same, then the process 200 may perform a Fourier transform on the recorded audio signal from each microphone in the microphone array 114 and proceed to block 214. Otherwise, the process 200 has not performed a Fourier transform on the recorded audio signal from each microphone in the microphone array 114, and may proceed to block 212. [

블록(212)에서, 변수 i는 1씩 증가된다. 변수 i를 1씩 증가시킨 후, 프로세스(200)는 다시 블록(208)으로 되돌아간다.At block 212, the variable i is incremented by one. After incrementing the variable i by 1, the process 200 returns to block 208 again.

블록(214)에서, 푸리에 변환에 기초하여 도달 시간차가 결정된다. 예를 들어, 마이크로폰 어레이(114) 내의 마이크로폰의 상이한 위치를 고려하면, 출력된 테스트 신호(예를 들어, 음향 신호의 형태)는, 상이한 시간에 마이크로폰 어레이(114) 내의 각각의 마이크로폰에 도달할 수 있다. 도달 시간차는 이 시간차를 나타낼 수 있다. 마이크로폰 어레이(114)가 2개의 마이크로폰을 포함하는 경우, 도달 시간차는 다음과 같이 결정될 수 있다:In block 214, the arrival time difference is determined based on the Fourier transform. Considering the different locations of the microphones in the microphone array 114, for example, the output test signals (e.g., in the form of acoustic signals) can reach each microphone in the microphone array 114 at different times have. The arrival time difference can represent this time difference. If the microphone array 114 includes two microphones, the arrival time difference can be determined as follows:

(1)

(One)

여기서,

는 2개의 마이크로폰 사이의 도달 시간차이고,

및

는 각각 2개의 마이크로폰으로부터 기록된 오디오 신호의 푸리에 변환이며,

는 가중 함수이다. 가중 함수는 다음과 같이 정의될 수 있다:here,

Is the arrival time difference between the two microphones,

And

Is a Fourier transform of an audio signal recorded from two microphones, respectively,

Is a weighting function. The weight function can be defined as:

(2)

대안적인 실시예에서, 위치 추정 엔진(116)은 가능한 도달 시간차 추정치 세트를 결정하고, 제1 각도를 결정하는 데 사용하기 위해 하나의 도달 시간차 추정치를 도달 시간차 추정치인 것으로 선택한다. 예를 들어, 전술한 바와 같이, 푸리에 변환은 기록된 오디오 신호의 10ms 내지 30ms 세그먼트와 같은 기록된 오디오 신호의 일부에 대해 수행될 수 있다. 테스트 신호는 더 긴 기간(예를 들어, 500ms) 동안 지속될 수 있기 때문에, 기록된 오디오 신호는 유사한 시간 길이를 가질 수 있고, 복수의 동일하거나 거의 동일한 세그먼트로 분할 또는 파티셔닝될 수 있다. 그 후 도달 시간차는 각 세그먼트에 대해 결정될 수 있다. 예를 들어, 제1 도달 시간차가 마이크로폰 어레이(114) 내의 제1 마이크로폰으로부터 기록된 첫 번째 세그먼트 및 마이크로폰 어레이(114) 내의 제2 마이크로폰으로부터 기록된 첫 번째 세그먼트에 대해 결정될 수 있고, 제2 도달 시간차가 마이크로폰 어레이(114) 내의 제1 마이크로폰으로부터 기록된 두 번째 세그먼트 및 마이크로폰 어레이(114) 내의 제2 마이크로폰으로부터 기록된 두 번째 세그먼트에 대해 결정될 수 있는 등이다. 위치 추정 엔진(116)은 다양한 도달 시간차 추정치를 히스토그램으로 조직화할 수 있다. 예를 들어, 위치 추정 엔진(116)은 도달 시간차 추정치를 양자화한 다음, 양자화된 도달 시간차 추정치를 비닝(binning)할 수 있다. 가장 높은 발생 횟수(예를 들어, 가장 높은 수의 양자화된 도달 시간차 추정치를 갖는 빈)를 갖는 도달 시간차 추정치는 제1 각도를 결정하는 데 사용할 도달 시간차 추정치로서 선택될 수 있다. 선택적으로, 각각의 도달 시간차 추정치는 서브샘플 정확도를 달성하기 위해 보간된 상호-상관으로부터 도출될 수 있다. 각각의 도달 시간차 추정치는 정수부 및 소수부를 포함할 수 있다. 도달 시간차 추정치 정수부는 히스토그램으로 조직화될 수 있다. 도달 시간차의 정수부는 히스토그램에 기초하여 선택될 수 있고(예를 들어, 가장 높은 발생 횟수를 갖는 도달 시간차 추정치에 대응하는 정수부가 도달 시간차의 정수부로서 선택될 수 있음), 선택된 도달 시간차 추정치에 대응하는 소수부가 정수부에 다시 더해질 수 있다. 예로서, 선택된 도달 시간차 추정치에 대응하는 소수부는 그 정수부가 선택된 빈 내에 있었던 원래 도달 시간차 추정치의 소수부의 평균을 취함으로써 도출될 수 있다.In an alternative embodiment, the position estimation engine 116 determines a set of possible arrival time difference estimates and chooses one arrival time difference estimate to be the arrival time difference estimate for use in determining the first angle. For example, as described above, the Fourier transform can be performed on a portion of the recorded audio signal, such as a 10 ms to 30 ms segment of the recorded audio signal. Since the test signal may last for a longer period (e.g., 500 ms), the recorded audio signal may have a similar time length and may be partitioned or partitioned into a plurality of identical or nearly identical segments. The arrival time difference thereafter can be determined for each segment. For example, a first arrival time difference may be determined for a first segment recorded from a first microphone in the microphone array 114 and a first segment recorded from a second microphone in the microphone array 114, May be determined for a second segment recorded from a first microphone in the microphone array 114 and a second segment recorded from a second microphone in the microphone array 114, and so on. The location estimation engine 116 may organize the various arrival time difference estimates into a histogram. For example, the position estimation engine 116 may quantize the arrival time difference estimate and then bin the quantized arrival time difference estimate. The arrival time difference estimate with the highest occurrence count (e.g., the bin with the highest number of quantized arrival time difference estimates) may be selected as the arrival time difference estimate to use to determine the first angle. Optionally, each arrival time difference estimate can be derived from interpolated cross-correlation to achieve sub-sample accuracy. Each arrival time difference estimate may include an integer part and a fractional part. The arrival time difference estimate integer part can be organized into a histogram. The integer part of the arrival time difference may be selected based on the histogram (for example, the integer part corresponding to the arrival time difference estimate with the highest occurrence count may be selected as the integer part of the arrival time difference) The fractional part can be added back to the integer part. As an example, the fractional part corresponding to the selected arrival time difference estimate can be derived by taking the average of the fractional part of the original arrival time difference estimate whose integer part was in the selected bin.

블록(216)에서, 라우드스피커에 대한 제1 각도는 도달 시간차에 기초하여 결정된다. 예를 들어, 마이크로폰 어레이(114) 내의 마이크로폰 간의 거리가 알려질 수 있다. 그러면 제1 각도는 다음과 같이 결정될 수 있다:At block 216, the first angle to the loudspeaker is determined based on the arrival time difference. For example, the distance between the microphones in the microphone array 114 may be known. The first angle can then be determined as follows:

(3)

여기서, c는 공기 중의 소리의 속도이고, d는 마이크로폰 어레이(114) 내의 마이크로폰 간의 거리이고,

는 도달 시간차이다. 제1 각도는 마이크로폰 어레이(114) 내의 마이크로폰을 통과하는 라인에 대해 마이크로폰 어레이(114)의 중심에 라우드스피커의 중심을 연결하는 라인의 각도를 나타낼 수 있다. 예로서, 마이크로폰 어레이(14) 내의 마이크로폰이 x-y 좌표 평면에서 y-성분을 갖지 않는 라인으로 정렬되고, 라우드스피커의 중심과 마이크로폰 어레이(114)의 중심을 연결하는 라인이 x-y 좌표 평면에서 x-성분을 갖지 않도록 라우드스피커의 중심이 위치되는 경우, 결정되는 제1 각도는 90°일 수 있다. 도 8은 일 실시예에 따라 도 1a의 예시적인 실내 환경(100)에서의 제1 각도(810)의 예를 도시하는 하이-레벨 블록도를 나타낸다. 도 8에 나타낸 바와 같이, 라인(820)은 좌측 서라운드 라우드스피커(106)의 중심을 마이크로폰 어레이(114)의 중심에 연결하고, 라인(830)은 마이크로폰 어레이(114) 내의 마이크로폰을 통과하는 라인을 나타낸다. 제1 각도(810)는 라인(820)과 라인(830) 사이의 각도를 나타낸다. 제1 각도가 결정된 후에, 프로세스(200)는 도 2d와 관련하여 보다 상세히 설명되는 프로세스(222)로 진행한다. 일 실시예에서, 제1 각도는 2개 초과의 마이크로폰을 포함하는 마이크로폰 어레이(114)에 대해 (1) 각 쌍의 마이크로폰에 대해 제1 각도를 결정한 후(예를 들어, 각 쌍의 마이크로폰에 대해 블록(214 및 216)을 반복함), 결과를 융합하고, 그리고/또는 (2) 도달 방향을 결정하기 위해 선형 대수학 공식을 사용함으로써 결정될 수 있다.Where c is the velocity of sound in the air, d is the distance between the microphones in the microphone array 114,

Is the time difference of arrival. The first angle may indicate the angle of the line connecting the center of the loudspeaker to the center of the microphone array 114 relative to the line passing through the microphone in the microphone array 114. As an example, a microphone in the microphone array 14 is aligned in a line having no y-component in the xy coordinate plane, and a line connecting the center of the loudspeaker and the center of the microphone array 114 is an x- The center of the loudspeaker is located, the first angle determined may be 90 degrees. FIG. 8 shows a high-level block diagram illustrating an example of a first angle 810 in the exemplary indoor environment 100 of FIG. 1A according to one embodiment. 8, line 820 connects the center of left surround loudspeaker 106 to the center of microphone array 114 and line 830 connects the center of microphone array 114 to a line passing through the microphone in microphone array 114 . The first angle 810 represents the angle between line 820 and line 830. After the first angle is determined, the process 200 proceeds to the process 222 described in more detail with respect to FIG. 2D. In one embodiment, the first angle is determined for a microphone array 114 that includes more than two microphones (1) after determining a first angle for each pair of microphones (e.g., for each pair of microphones Blocks 214 and 216), fusing the results, and / or (2) using the linear algebraic formulas to determine the direction of arrival.

도 2b는 예시적인 DPC 선택 프로세스(218)를 나타낸다. 일 실시예에서, 프로세스(218)는 도 1a 및 도 1b와 관련하여 전술한 사운드바(110)(예를 들어, 위치 추정 엔진(116))를 포함하여, 본원에 설명된 시스템들 중 임의의 것에 의해 수행될 수 있다. 실시예에 따라, 프로세스(218)는 더 적은 및/또는 추가적인 블록을 포함할 수 있거나 블록은 나타낸 것과 다른 순서로 수행될 수 있다.FIG. 2B illustrates an exemplary DPC selection process 218. FIG. In one embodiment, the process 218 may include any of the systems described herein, including the sound bar 110 (e.g., the position estimation engine 116) described above with respect to FIGS. 1A and 1B &Lt; / RTI > Depending on the embodiment, process 218 may include fewer and / or additional blocks, or blocks may be performed in a different order than shown.

프로세스(218)는, 프로세스(200)의 블록(202)이 완료된 후 블록(224)에서 개시될 수 있다. 블록(224)에서, 변수 n은 마이크로폰 어레이(114) 내의 마이크로폰의 수와 동일하게 설정된다. 예를 들어, 변수 n은 2, 3, 4, 5, 6 등으로 설정될 수 있다.Process 218 may be initiated at block 224 after block 202 of process 200 is completed. At block 224, the variable n is set equal to the number of microphones in the microphone array 114. For example, the variable n may be set to 2, 3, 4, 5, 6, and so on.

블록(226)에서, 변수 i는 1과 동일하게 설정된다. 변수 i는 마이크로폰 어레이(114) 내의 특정 마이크로폰을 식별할 수 있다.At block 226, the variable i is set equal to one. Variable i may identify a particular microphone in the microphone array 114. [

블록(228)에서, 마이크로폰 i로부터 기록된 오디오 신호 및 테스트 신호로부터 도출된 임펄스 응답 내의 최대 피크가 결정된다. 일례로서, 위치 추정 엔진(116)은 기록된 오디오 신호의 푸리에 변환을 취하고 그 결과를 테스트 신호의 푸리에 변환으로 나눔으로써 임펄스 응답을 도출할 수 있다. 나눔은 전달 함수의 푸리에 변환으로 귀결된다. 그 다음, 위치 추정 엔진(116)은 임펄스 응답을 도출하기 위해 전달 함수의 푸리에 변환의 역 푸리에 변환을 취할 수 있다. 일 실시예에서, 위치 추정 엔진(116)은 라우드스피커가 테스트 신호를 출력하도록 명령된 후 하나의 루프백 지연에 대응하는 시간에서 개시하고 테스트 신호를 수신하는 예측되는 최대 시간량에 테스트 신호의 시간 길이를 더한 시간에 대응하는 시간에서 종료하는 시간 윈도우 내에서 최대 피크를 식별한다.At block 228, the maximum peak in the impulse response derived from the audio signal and test signal recorded from microphone i is determined. As an example, the position estimation engine 116 may derive an impulse response by taking a Fourier transform of the recorded audio signal and dividing the result by the Fourier transform of the test signal. Sharing results in a Fourier transform of the transfer function. The position estimation engine 116 may then take an inverse Fourier transform of the Fourier transform of the transfer function to derive the impulse response. In one embodiment, the position estimation engine 116 determines whether the loudspeaker starts at a time corresponding to one loopback delay after being instructed to output a test signal, Lt; RTI ID = 0.0 > time window < / RTI >

블록(230)에서, 결정된 최대 피크에 기초하여 임계 진폭이 결정된다. 예를 들어, 임계 진폭은 결정된 최대 피크의 세트 퍼센티지(예를 들어, 50%, 60%, 70%, 80%, 90% 등)일 수 있다.At block 230, the critical amplitude is determined based on the determined maximum peak. For example, the critical amplitude may be a set percentage (e.g., 50%, 60%, 70%, 80%, 90%, etc.) of the maximum peak determined.

블록(232)에서, 최대 피크의 시간 이전의 시간에 대응하고 임계 진폭보다 큰, 임펄스 응답 내의 피크가 존재하는지의 여부가 결정된다. 일부 경우에, 실내 환경(100)에서의 반향으로 인해, 최대 피크는 DPC가 아니다(예를 들어, 하나 이상의 표면에서 반사된 후에 마이크로폰 i에 도달할 수 있는 다른 오디오 신호와 반대로 라우드스피커로부터 마이크로폰 i로의 직접 경로를 진행하는 오디오 신호). 따라서, 위치 추정 엔진(116)은 최대 피크 이전에 발생하고 특정 임계값 초과의 진폭을 갖는 다른 피크를 발견함으로써 DPC를 결정할 수 있다.At block 232, it is determined whether there is a peak in the impulse response that corresponds to the time before the maximum peak time and that is greater than the threshold amplitude. In some cases, due to echo in the indoor environment 100, the maximum peak is not a DPC (e.g., from a loudspeaker to a microphone i (as opposed to other audio signals that may reach microphone i after being reflected at one or more surfaces) The audio signal proceeding directly to the path of the audio signal). Thus, the position estimation engine 116 can determine the DPC by finding another peak that occurs before the maximum peak and has an amplitude above a certain threshold.

블록(234)에서, 임계 진폭보다 큰 진폭을 갖는, 최대 피크 이전의 피크가 존재하면 프로세스(218)는 블록(238)으로 계속된다. 그렇지 않으면, 프로세스(218)는 블록(236)으로 계속된다.At block 234, process 218 continues to block 238 if there is a peak before the maximum peak, with an amplitude greater than the critical amplitude. Otherwise, process 218 continues to block 236. [

블록(236)에서, 마이크로폰 i의 DPC는 최대 피크로 설정된다. 예를 들어, DPC는 최대 피크로 설정될 수 있는데, 그 이유는 DPC로 간주될 만큼 충분히 높은 진폭을 가진 최대 피크 이전의 다른 피크가 발생하지 않았기 때문이다. 도 3은 마이크로폰 어레이(114) 내의 마이크로폰(314A)에 대한 임펄스 응답 및 마이크로폰 어레이(114) 내의 마이크로폰(314B)에 대한 임펄스 응답을 도시하는 예시적인 그래프(300)를 나타낸다. 도 3에서 나타낸 바와 같이, 피크(310A)는 마이크로폰(314A)의 임펄스 응답에 대한 최대 피크이고, 피크(310B)는 마이크로폰(314B)의 임펄스 응답에 대한 최대 피크이다. 피크(310A 및 310B) 이전에 발생하는 어떠한 피크도 임계 진폭을 초과할 수 없으므로 피크(310A)가 마이크로폰(314A)에 대한 임펄스 응답의 DPC로 설정될 수 있고 피크(310B)가 마이크로폰(314B)에 대한 임펄스 응답의 DPC로 설정될 수 있다.At block 236, the DPC of microphone i is set to the maximum peak. For example, the DPC can be set to the maximum peak because no other peaks have occurred before the maximum peak with an amplitude high enough to be considered a DPC. 3 shows an exemplary graph 300 illustrating the impulse response to the microphone 314A in the microphone array 114 and the impulse response to the microphone 314B in the microphone array 114. [ As shown in FIG. 3, peak 310A is the maximum peak for the impulse response of microphone 314A and peak 310B is the maximum peak for the impulse response of microphone 314B. Peak 310A can be set to the DPC of the impulse response to microphone 314A and peak 310B can be set to microphone 314B because no peak that occurs prior to peaks 310A and 310B can exceed the critical amplitude Can be set to the DPC of the impulse response.

블록(238)에서, 마이크로폰 i의 DPC는, 최대 피크의 시간 전의 시간에 대응하고 임계 진폭보다 큰, 임펄스 응답 내의 제1 피크가 되도록 설정된다. 예를 들어, 최대 피크 이전에 발생하는 복수의 피크는 임계 진폭을 초과할 수 있다. 그러나, 임계 진폭을 초과하는 제1 피크가 DPC로서 선택될 수 있다.At block 238, the DPC of microphone i is set to be the first peak in the impulse response, corresponding to the time before the maximum peak and greater than the threshold amplitude. For example, the plurality of peaks occurring before the maximum peak may exceed the critical amplitude. However, a first peak exceeding the critical amplitude may be selected as the DPC.

블록(240)에서, 프로세스(218)는 변수 i가 변수 n과 동일한지의 여부를 결정한다. 변수 i 및 n이 동일하다면, 프로세스(218)는 마이크로폰 어레이(114) 내의 각각의 마이크로폰에 대한 DPC를 결정하고 프로세스(220)의 블록(244)으로 진행할 수 있다. 그렇지 않으면, 프로세스(218)는 마이크로폰 어레이(114) 내의 각각의 마이크로폰에 대한 DPC를 결정하지 않고 블록(242)으로 진행할 수 있다.At block 240, process 218 determines whether variable i is equal to variable n . If the variables i and n are the same, the process 218 may determine the DPC for each microphone in the microphone array 114 and proceed to block 244 of the process 220. Otherwise, the process 218 may proceed to block 242 without determining the DPC for each microphone in the microphone array 114.

블록(242)에서, 변수 i는 1씩 증가된다. 변수 i를 1씩 증가시킨 후, 프로세스(218)는 블록(228)으로 되돌아간다.At block 242, the variable i is incremented by one. After incrementing the variable i by one, the process 218 returns to block 228. [

도 2c는 DPC를 사용하는 예시적인 라우드스피커 각도 결정 프로세스(220)를 나타낸다. 일 실시예에서, 프로세스(220)는 도 1a 및 도 1b와 관련하여 전술한 사운드바(110)(예를 들어, 위치 추정 엔진(116))를 포함하여, 본원에 설명된 시스템들 중 임의의 것에 의해 수행될 수 있다. 실시예에 따라, 프로세스(220)는 더 적은 및/또는 추가적인 블록을 포함할 수 있거나, 블록은 나타낸 것과 다른 순서로 수행될 수 있다.2C shows an exemplary loudspeaker angle determination process 220 using DPC. In one embodiment, the process 220 may include any of the systems described herein, including the sound bar 110 (e.g., the position estimation engine 116) described above with respect to FIGS. 1A and 1B &Lt; / RTI > Depending on the embodiment, process 220 may include fewer and / or additional blocks, or blocks may be performed in a different order than shown.

프로세스(220)는, 프로세스(218)의 블록(240)이 완료된 후 블록(244)에서 개시될 수 있다. 블록(244)에서, 각각의 DPC 주변의 시간 윈도우가 선택된다. 예를 들어, 위치 추정 엔진(116)은 각각의 마이크로폰 i DPC에 대해, 각각의 DPC 주변의 시간 윈도우를 선택할 수 있다. 시간 윈도우는 DPC 피크의 시간 전에 수 ms(예를 들어, 10ms, 20ms 등)로 시작할 수 있고, DPC 피크의 시간 이후의 수 ms(예를 들어, 10ms, 20ms 등)로 종료할 수 있다.Process 220 may be initiated at block 244 after block 240 of process 218 is complete. At block 244, a time window around each DPC is selected. For example, the location estimation engine 116 may, for each microphone i DPC, select a time window around each DPC. The time window may start a few ms (e.g., 10 ms, 20 ms, etc.) before the time of the DPC peak and may end with a few ms (e.g., 10 ms, 20 ms, etc.) after the time of the DPC peak.

블록(246)에서, 선택된 시간 윈도우 간의 상호 상관은 시간 지연을 추정하도록 결정된다. 예를 들어, 위치 추정 엔진(116)은 선택된 시간 윈도우에 포함된 데이터의 상호 상관을 결정할 수 있다. 추정된 시간 지연은 상호-상관된 데이터의 시작(예를 들어, 마이크로폰 i 시간 윈도우의 시작에 대응하는 시간)과 상호-상관된 데이터가 가장 높은 진폭을 갖는 시간 사이의 시간 길이에 대응할 수 있다. 선택적으로, 상호 상관 출력에 대해 보간이 수행되어 시간 지연 추정의 정확도를 더 향상시킬 수 있다. 대안적인 실시예에서, 위치 추정 엔진(116)은 제2 마이크로폰에 대한 DPC 피크에 대응하는 시간으로부터 제1 마이크로폰에 대한 DPC 피크에 대응하는 시간을 감산함으로써 추정된 시간 지연을 결정할 수 있다. 그러나, 이 접근법은 상호 상관을 수행하는 것보다 더 높은 잡음을 산출할 수 있는데, 그 이유는 1 샘플의 에러도 일부 실시예에서 결과적인 각도 결정에 상당한 영향을 미칠 수 있기 때문이다.At block 246, cross-correlation between the selected time windows is determined to estimate the time delay. For example, the location estimation engine 116 may determine the cross-correlation of the data contained in the selected time window. The estimated time delay may correspond to the length of time between the start of the inter-correlated data (e.g., the time corresponding to the start of the microphone i time window) and the time when the mutually-correlated data has the highest amplitude. Alternatively, interpolation may be performed on the cross correlation output to further improve the accuracy of the time delay estimation. In an alternative embodiment, the position estimation engine 116 may determine the estimated time delay by subtracting the time corresponding to the DPC peak for the first microphone from the time corresponding to the DPC peak for the second microphone. However, this approach can yield higher noise than performing cross-correlation, since the error of one sample can also have a significant impact on the resulting angle determination in some embodiments.

블록(248)에서, 추정된 시간 지연에 기초하여 제2 각도가 결정된다. 예를 들어, 식 (3)이 추정된 시간 지연(예를 들어, 도달 시간차를 대체)과 함께 사용되어 제2 각도를 결정할 수 있다. 제2 각도는 마이크로폰 어레이(114) 내의 마이크로폰을 통과하는 라인에 대해 마이크로폰 어레이(114)의 중심에 라우드스피커의 중심을 연결하는 선의 각도를 나타낼 수 있다. 프로세스(220)가 제2 각도를 결정하면, 프로세스(220)는 프로세스(222)의 블록(250)으로 계속될 수 있다.At block 248, a second angle is determined based on the estimated time delay. For example, equation (3) may be used in conjunction with an estimated time delay (e.g., replacing the arrival time difference) to determine a second angle. The second angle may represent the angle of the line connecting the center of the loudspeaker to the center of the microphone array 114 relative to the line passing through the microphone in the microphone array 114. If process 220 determines a second angle, process 220 may continue at block 250 of process 222.

도 2d는 예시적인 라우드스피커 비정상 조건 검출 프로세스(222)를 나타낸다. 일 실시예에서, 프로세스(222)는 도 1a 및 도 1b와 관련하여 전술한 사운드바(110)(예를 들어, 위치 추정 엔진(116) 및/또는 통지 생성기(118))를 포함하여 본원에 설명된 임의의 시스템에 의해 수행될 수 있다. 실시예에 따라, 프로세스(222)는 더 적은 및/또는 추가적인 블록을 포함할 수 있거나, 블록은 나타낸 것과 다른 순서로 수행될 수 있다.FIG. 2D illustrates an exemplary loudspeaker abnormal condition detection process 222. FIG. In one embodiment, the process 222 may be implemented in any of a variety of ways, including the sound bar 110 (e.g., the position estimation engine 116 and / or the notification generator 118) described above with respect to FIGS. Can be performed by any of the systems described. Depending on the embodiment, process 222 may include fewer and / or additional blocks, or blocks may be performed in a different order than shown.

프로세스(222)는, 프로세스(220)의 블록(248)이 완료된 후 블록(250)에서 개시될 수 있다. 블록(250)에서, 제1 각도는 제2 각도와 비교된다. 일부 실시예에서, 블록(202, 204, 206, 208, 210, 212, 214 및 216), 프로세스(218) 및 프로세스(220)는 수회 반복된다. 따라서, 제1 각도와 제2 각도는 일련의 테스트에 걸쳐 비교될 수 있다.Process 222 may be initiated at block 250 after block 248 of process 220 is complete. At block 250, the first angle is compared to the second angle. In some embodiments, blocks 202, 204, 206, 208, 210, 212, 214 and 216, process 218 and process 220 are repeated a number of times. Thus, the first angle and the second angle can be compared over a series of tests.

라우드스피커의 출력이 마이크로폰 어레이(114)에 직접 대향하고 있을 때 결정된 제1 각도는 정확할 수 있다. 그러나, 라우드스피커의 출력이 마이크로폰 어레이(114)를 지향하는 방향 이외의 방향을 향하고 있을 때, 또는 마이크로폰 어레이(114)와 라우드스피커 사이의 경로에 막힘이 있을 때, 결정된 제1 각도는 정확하지 않을 수 있다. 결정된 제2 각도는 또한, 라우드스피커의 출력이 마이크로폰 어레이(114)를 직접 향하고 있을 때 정확할 수 있고, 라우드스피커가 마이크로폰 어레이(114)를 향한 방향 이외의 방향을 향하고 있거나 또는 마이크로폰 어레이(114)와 라우드스피커 사이의 경로에 막힘이 있을 때 결정된 제1 각도보다 더욱 정확할 수 있다.The first angle determined when the output of the loudspeaker is directly opposite the microphone array 114 may be accurate. However, when the output of the loudspeaker is directed in a direction other than the direction toward the microphone array 114, or when there is clogging in the path between the microphone array 114 and the loudspeaker, the determined first angle is not accurate . The determined second angle may also be accurate when the output of the loudspeaker is pointing directly at the microphone array 114 and the loudspeaker may be pointing in a direction other than the direction towards the microphone array 114, May be more accurate than the first angle determined when there is clogging in the path between the loudspeakers.

따라서, 블록(252)에서, 비교에 기초하여 제1 각도와 제2 각도 사이의 차이가 임계 각도값보다 큰지의 여부가 결정된다. 두 각도 사이의 차이가 임계 각도값을 초과하면, 이는 비정상 조건이 존재함을 나타낼 수 있다. 일례로서, 임계 각도값은 0°와 15° 사이일 수 있다. 라우드스피커가 마이크로폰 어레이(114)를 향하지 않거나 물체가 라우드스피커와 마이크로폰 어레이(114) 사이의 경로를 막을 때의 DPC 기법의 비교적 일관된 정확도 및 GCC-PHAT 기법의 부정확성을 고려하면, 위치 추정 엔진(116)은 이러한 상황에서 DPC 방법에 의해 얻어진 결과를 제공할 수 있다. 그러나, 두 각도 사이의 차이가 임계 각도값을 초과하지 않더라도, 라우드스피커의 부정확하거나 비이상적인 각도 및/또는 부정확하거나 비이상적인 라우드스피커 위치가 여전히 존재할 수 있다.Thus, at block 252, it is determined based on the comparison whether the difference between the first angle and the second angle is greater than the threshold angle value. If the difference between the two angles exceeds the threshold angle value, this may indicate that an abnormal condition exists. As an example, the critical angle value may be between 0 and 15 degrees. Considering the relatively consistent accuracy of the DPC technique and the inaccuracy of the GCC-PHAT technique when the loudspeaker is not facing the microphone array 114 or when the object blocks the path between the loudspeaker and the microphone array 114, ) Can provide results obtained by the DPC method in this situation. However, even if the difference between the two angles does not exceed the critical angle value, there may still be an incorrect or unrealistic angle of the loudspeaker and / or an incorrect or unrealistic loudspeaker position.

블록(254)에서, 프로세스(222)는, 2개의 각도가 임계 각도값 내에 있는 경우, 블록(256)으로 계속된다. 그렇지 않으면, 프로세스(222)는 블록(262)으로 계속된다. 도 4는 결정된 제1 각도(410)(예를 들어, GCC-PHAT 기법을 사용하여 결정됨) 및 결정된 제2 각도(420)(예를 들어, DPC 기법을 사용하여 결정됨)가 유사한 값을 갖는 상황을 도시하는 예시적인 그래프(400)를 나타낸다. 예를 들어, 결정된 제1 각도(410) 및 결정된 제2 각도(420)의 값은 임계 각도값 내에 있을 수 있다(예를 들어, 각도는 근사적으로 0.3° 이내임). 복수의 제1 및 제2 각도(410 및 420)가 결정되도록 복수의 테스트가 실행될 수 있다. 도 5는, 결정된 제1 각도(510)(예를 들어, GCC-PHAT 기법을 사용하여 결정됨) 및 결정된 제2 각도(450)(예를 들어, DPC 기법을 사용하여 결정됨)가 유사한 값을 갖지 않는 상황을 도시하는 예시적인 그래프(500)를 나타낸다. 예를 들어, 결정된 제1 각도(510) 및 결정된 제2 각도(520)의 값은 임계 각도값 내에 있지 않을 수 있다(예를 들어, 각도는 근사적으로 11° 떨어져 있음). 복수의 제1 및 제2 각도(510, 520)가 결정되도록 복수의 테스트가 실행될 수 있다.At block 254, process 222 continues at block 256 if the two angles are within the threshold angle value. Otherwise, process 222 continues to block 262. [ 4 illustrates a situation where the determined first angle 410 (e.g., determined using the GCC-PHAT technique) and the determined second angle 420 (determined using the DPC technique) have similar values Lt; RTI ID = 0.0 > 400 < / RTI > For example, the values of the determined first angle 410 and the determined second angle 420 may be within a critical angle value (e.g., the angle is within approximately 0.3 degrees). A plurality of tests may be performed such that a plurality of first and second angles 410 and 420 are determined. 5 illustrates that the determined first angle 510 (e.g., determined using the GCC-PHAT technique) and the determined second angle 450 (determined, for example, using the DPC technique) Gt; 500 < / RTI > For example, the values of the determined first angle 510 and the determined second angle 520 may not be within the critical angle value (e.g., the angle is approximately 11 degrees away). A plurality of tests may be performed such that a plurality of first and second angles 510, 520 are determined.

블록(256)에서, 제1 및/또는 제2 각도는 이상적인 각도와 비교된다. 이상적인 각도는 이상적인 라우드스피커 레이아웃으로부터 도출되거나 제공된 각도일 수 있다. 비교는, 라우드스피커의 부정확하거나 비이상적인 각도 및/또는 부정확하거나 비이상적인 라우드스피커 위치가 존재하는지의 여부를 결정하기 위해 수행될 수 있다.At block 256, the first and / or second angles are compared with an ideal angle. The ideal angle may be an angle derived or provided from an ideal loudspeaker layout. The comparison may be performed to determine whether there is an incorrect or unrealistic angle of the loudspeaker and / or whether there is an inaccurate or unrealistic loudspeaker position.

블록(258)에서, 프로세스(222)는, 제1 및/또는 제2 각도가 (예를 들어, 라우드스피커의 부정확하거나 비이상적인 각도 및/또는 부정확하거나 비이상적인 라우드스피커 위치가 검출되지 않음을 나타내는) 이상적인 각도의 임계 각도값(블록(252 및 254)에서와 같은 임계 각도값 또는 다른 것 중 어느 하나) 내에 존재한다면 블록(260)으로 계속된다. 그렇지 않으면, 프로세스(222)는 블록(262)으로 계속된다.At block 258, the process 222 determines whether the first and / or second angles are indicative of an inaccurate or non-ideal angle of the loudspeaker and / or an inaccurate or unrealistic loudspeaker position is not detected (Either a critical angle value such as in blocks 252 and 254 or some other) of the ideal angle. Otherwise, process 222 continues to block 262. [

블록(260)에서, 에러는 검출되지 않는다. 위치 추정 엔진(116)은 어떠한 에러도 검출되지 않았음을 통지 생성기(118)에 나타낼 수 있다. 통지 생성기(118)는 에러가 검출되지 않았음, 라우드스피커의 각도가 정확함 및/또는 라우드스피커와 사운드바(110)(예를 들어, 마이크로폰 어레이(114)) 사이의 경로를 막고 있는 물체가 없음을 나타내는 통지를 생성할 수 있다. 대안적으로, 통지 생성기(118)는 통지를 생성하지 않을 수도 있다. 사운드바(110)는 결정된 각도 및/또는 DPC 피크에 기초하여 결정된 거리 중 어느 하나를 사용하여 보정의 나머지 부분을 수행할 수 있다.At block 260, no error is detected. The location estimation engine 116 may indicate to the notification generator 118 that no errors have been detected. The notification generator 118 may determine that no error is detected, the angle of the loudspeaker is correct and / or there is no object blocking the path between the loudspeaker and the sound bar 110 (e.g., microphone array 114) Lt; / RTI > Alternatively, the notification generator 118 may not generate a notification. The sound bar 110 may perform the remainder of the correction using either the determined angle and / or the distance determined based on the DPC peak.

블록(262)에서, 비정상 조건이 검출된다. 위치 추정 엔진(116)은 비정상 조건이 검출되었음을 통지 생성기(118)에 나타낼 수 있다. 통지 생성기(118)는 물체가 마이크로폰 어레이(114)와 라우드스피커 사이의 경로를 막고 있음 및/또는 해당 물체가 제거되어야 함, 라우드스피커의 부정확하거나 비이상적인 각도가 검출됨, 부정확하거나 비이상적인 라우드스피커 위치가 검출됨, 및/또는 부정확한 극성이 검출됨을 나타내는 통지를 생성할 수 있다.At block 262, an abnormal condition is detected. The location estimation engine 116 may indicate to the notification generator 118 that an abnormal condition has been detected. The notification generator 118 may be used to determine if an object is blocking the path between the microphone array 114 and the loudspeaker and / or the object must be removed, an incorrect or unrealistic angle of the loudspeaker is detected, A position is detected, and / or an incorrect polarity is detected.

대안적으로 또는 추가적으로, 통지는 라우드스피커가 부정확하게 배향되었음을 나타낼 수 있다. 예를 들어, 통지는, 청취자(120)에게 라우드스피커의 배향을 확인(예를 들어, 라우드스피커가 원(130)의 중심을 향하여 배향되는지의 여부를 확인)하도록 청취자(120)에게 통지할 수 있다.Alternatively or additionally, the notification may indicate that the loudspeaker is incorrectly oriented. For example, the notification may notify the listener 120 to confirm the orientation of the loudspeaker to the listener 120 (e.g., to determine whether the loudspeaker is oriented towards the center of the circle 130) have.

대안적으로 또는 추가적으로, 통지는 라우드스피커가 부정확한 위치를 가지고 있음(예를 들어, 라우드스피커가 배선된 출력이 주어졌을 때 라우드스피커가 잘못된 측에 배치되어 있음)을 나타낼 수 있다. 예를 들어, 보정이 개시될 때, 사운드바(110)는 테스트 신호를 출력하도록 하나 이상의 라우드스피커에 명령한다. 그러므로, 사운드바(110)는 예를 들어 주어진 시간에 어떠한 서라운드 라우드스피커가 테스트 신호를 출력하고 있는지를 안다. 좌측 서라운드 라우드스피커(106)가 우측 서라운드 잭에 연결되고 우측 서라운드 라우드스피커(108)가 좌측 서라운드 잭에 연결되도록 청취자(120)가 좌측 서라운드 우측 서라운드 라우드스피커를 배선하면, 사운드바(110)는, 우측 서라운드 라우드스피커(108)가 테스트 신호를 출력하고 있을 것으로 예측될 때, 테스트 신호를 출력하도록 좌측 서라운드 라우드스피커(106)에 명령할 수 있다. 결정된 제2 각도(및/또는 결정된 제1 각도)는, 우측 서라운드 라우드스피커(108)가 아니라 좌측 서라운드 라우드스피커(106)가 테스트 신호를 출력하고 있었던 경우 예측되는 값을 가질 수 있다. 따라서, 대안적인 지정의 라우드스피커에 대해 예측되는 값과 유사한 제2 각도(및/또는 결정된 제1 각도)의 값(예를 들어, 우측 서라운드 라우드스피커에 대해 예측되는 값과 유사한 좌측 서라운드 라우드스피커에 대해 결정된 제2 각도의 값)이 위치 추정 엔진(116)에 의해 인식될 수 있고, 청취자(120)는 적절하게 통지받을 수 있다. 추가적인 실시예에서, 사운드바(110)는, 청취자(120)가 라우드스피커를 수동으로 교환할 필요가 없는 상황에서 사운드바(110)의 내부 배선을 재라우팅할 수 있다.Alternatively or additionally, the notification may indicate that the loudspeaker has an incorrect location (e.g., the loudspeaker is located on the wrong side when the wired output is given). For example, when calibration is initiated, the soundbar 110 commands one or more loudspeakers to output a test signal. Therefore, the sound bar 110 knows, for example, which surround loudspeaker is outputting a test signal at a given time. If the listener 120 wires the left surround right surround loudspeaker so that the left surround loudspeaker 106 is connected to the right surround jack and the right surround loudspeaker 108 is connected to the left surround jack, When the right surround loudspeaker 108 is predicted to be outputting a test signal, it can command the left surround loudspeaker 106 to output a test signal. The determined second angle (and / or determined first angle) may have a value that is predicted when the left surround loudspeaker 106 is not a right surround loudspeaker 108 but is outputting a test signal. Thus, a value of a second angle (and / or a determined first angle) similar to the value predicted for an alternate designation loudspeaker (e.g., a value similar to the value predicted for the right surround loudspeaker) And the listener 120 may be properly notified of the position of the second angle. In a further embodiment, the sound bar 110 may re-route the internal wiring of the sound bar 110 in a situation where the listener 120 does not need to manually replace the loudspeaker.

대안적으로 또는 추가적으로, 통지는 라우드스피커가 부정확한 극성을 가지고 있음을 나타낼 수 있다. 예를 들어, 사용자는 라우드스피커의 양(positive)의 입력을 음(negative)의 잭(jack)에 연결하고 라우드스피커의 음의 입력을 양의 잭에 연결했을 수 있다. 이러한 상황에서, 마이크로폰 어레이(114) 내의 마이크로폰으로부터 기록된 오디오 신호는 테스트 신호에서 위상이 벗어날 수 있다(예를 들어, 180° 위상차). 따라서, 위치 추정 엔진(116)은 기록된 오디오 신호와 테스트 신호를 비교할 수 있다. 두 신호가 특정 값(예를 들어, 180°)의 임계값 내에서 서로 위상차가 나는 경우, 부정확한 극성이 위치 추정 엔진(116)에 의해 인식될 수 있고 청취자(120)가 적절하게 통지받을 수 있다. 추가적인 실시예에서, 사운드바(110)는, 청취자(120)가 라우드스피커를 수동으로 재배선할 필요가 없는 상황에서 극성을 반전시키기 위해 사운드바(110)의 내부 배선을 재라우팅할 수 있다.Alternatively or additionally, the notification may indicate that the loudspeaker has an incorrect polarity. For example, a user may have connected a positive input of a loudspeaker to a negative jack and a negative input of a loudspeaker to a positive jack. In this situation, the audio signal recorded from the microphone in the microphone array 114 may be out of phase in the test signal (e.g., 180 degrees out of phase). Therefore, the position estimation engine 116 can compare the recorded audio signal with the test signal. If the two signals are out of phase with respect to one another within a threshold of a particular value (e.g., 180), incorrect polarity can be recognized by the position estimation engine 116 and the listener 120 can be properly notified have. In a further embodiment, the soundbar 110 may re-route the internal wiring of the sound bar 110 to reverse the polarity in a situation where the listener 120 does not need to manually re-route the loudspeaker.

대안적으로 또는 추가적으로, 통지는 라우드스피커 위치가 부정확하거나 비이상적이라는 것 및/또는 라우드스피커가 재위치지정되어야 하는 위치를 나타낼 수 있다. 예를 들어, DPC 피크는 라우드스피커와 마이크로폰 어레이(114) 사이의 거리에 대응할 수 있다. DPC 피크는 라우드스피커로부터 마이크로폰 어레이(114)로의 오디오 신호의 직접 경로에 대응하기 때문에, 루프백 지연을 라우드스피커가 테스트 신호를 출력하도록 명령받는 시간에 더하고, DPC 피크가 발생하는 시간으로부터 결합된 시간을 빼는 것은 오디오 신호가 라우드스피커로부터 마이크로폰 어레이(114)로 진행한 시간에 대응한다. 이 시간은, 해당 시간을 공기 중의 소리의 속도와 곱함으로써 위치 추정 엔진(116)에 의해 거리 추정치로 변환될 수 있다. 거리 추정치는 라우드스피커와 마이크로폰 어레이(114) 사이의 거리를 나타낼 수 있다. 거리 추정치는 라우드스피커의 가능한 위치를 결정하기 위해 각도 추정치(예를 들어, 제1 각도)와 결합될 수 있다. 위치 추정 엔진(116)은 결정된 가능한 위치에 기초하여 최적의 재생을 위해 오디오 신호를 수정하도록 보상 컴포넌트에 의한 사용을 위해 보상 파라미터를 도출할 수 있고 그리고/또는 결정된 가능한 위치는 사용자 디바이스(140) 상의 그래픽 사용자 인터페이스 상에 디스플레이될 수 있다. 위치 추정 엔진(116)은 또한 라우드스피커를 재위치지정하기 위한 제안된 위치로서 가능한 위치(예를 들어, 표준 레이아웃에서 가장 가까운 위치)에 가까운 권장되는 배열 원(130)을 따라 위치를 결정할 수 있다. 일부 실시예에서, 보정은 라우드스피커를 권장되는 위치로 재위치지정한 후에 다시 실행될 수 있다. 일부 실시예에서, 라우드스피커의 하나 이상의 가능한 위치가 사용자 디바이스(140) 상의 그래픽 사용자 인터페이스 상에 디스플레이되고, 청취자(120)는 라우드스피커의 물리적 위치와 가장 잘 매칭되는 위치를 선택할 수 있다(예를 들어, 청취자(120)가 라우드스피커를 재위치지정하도록 요청받지 않음).Alternatively or additionally, the notification may indicate that the loudspeaker position is inaccurate or non-ideal and / or where the loudspeaker should be repositioned. For example, the DPC peak may correspond to the distance between the loudspeaker and the microphone array 114. Since the DPC peak corresponds to the direct path of the audio signal from the loudspeaker to the microphone array 114, the loopback delay is added to the time at which the loudspeaker is commanded to output a test signal, and the combined time from the time the DPC peak occurs Subtracting corresponds to the time the audio signal traveled from the loudspeaker to the microphone array 114. This time can be converted to a distance estimate by the position estimation engine 116 by multiplying the time with the velocity of sound in the air. The distance estimate may represent the distance between the loudspeaker and the microphone array 114. The distance estimate may be combined with an angle estimate (e.g., a first angle) to determine a possible position of the loudspeaker. The location estimation engine 116 may derive a compensation parameter for use by the compensation component to modify the audio signal for optimal playback based on the determined possible location and / or determine a possible location on the user device 140 May be displayed on the graphical user interface. The location estimation engine 116 may also determine a location along the recommended array source 130 that is close to a possible location (e.g., the closest location in a standard layout) as a suggested location for relocating the loudspeaker . In some embodiments, the correction may be performed again after repositioning the loudspeaker to the recommended position. In some embodiments, one or more possible locations of the loudspeaker are displayed on the graphical user interface on the user device 140, and the listener 120 may select a location that best matches the physical location of the loudspeaker (e.g., For example, the listener 120 is not asked to reposition the loudspeakers).

블록(260 또는 262)에서 생성된 통지는 통지 생성기(118)에 의해 네트워크(115)를 통해 사용자 디바이스(140)로 송신될 수 있다. 사용자 디바이스(140)는 통지를 디스플레이할 수 있다.The notification generated in block 260 or 262 may be transmitted by notification generator 118 to user device 140 via network 115. [ User device 140 may display a notification.

대안적인 실시예에서, 결정된 각도 사이의 차이가 임계 각도값을 초과하면, 위치 추정 엔진(116)은 보정을 수행하는 데 사용하기 위한 DPC 피크에 기초하여 결정된 제2 각도 및/또는 결정된 거리를 선택한다. 따라서, 통지가 생성되지 않고 보정이 완료될 수 있다.In an alternative embodiment, if the difference between the determined angles exceeds a threshold angle value, the position estimation engine 116 may select a second angle and / or a determined distance based on the DPC peak for use in performing the calibration do. Therefore, correction can be completed without generating a notification.

예시적인 그래픽 사용자 인터페이스An exemplary graphical user interface

도 6은 도 1b의 사용자 디바이스(140)에 의해 디스플레이될 수 있는 예시적인 그래픽 사용자 인터페이스(GUI)(600)를 나타낸다. 예를 들어, 사용자 디바이스(140)는 사용자 디바이스(140)로 하여금 GUI(600)를 디스플레이하게 하는 애플리케이션을 실행할 수 있다. GUI(600)는 보정 버튼(660), 보상 버튼(662) 및 사운드바(610), 청취자(620), 좌측 서라운드 라우드스피커(606) 및 우측 서라운드 라우드스피커(608)의 그래픽 표현을 포함할 수 있다.FIG. 6 shows an exemplary graphical user interface (GUI) 600 that may be displayed by the user device 140 of FIG. 1B. For example, the user device 140 may execute an application that causes the user device 140 to display the GUI 600. GUI 600 may include a graphical representation of a correction button 660, a compensation button 662 and a sound bar 610, a listener 620, a left surround loudspeaker 606 and a right surround loudspeaker 608 have.

보정 버튼(660)의 선택은 사용자 디바이스(140)로 하여금 공간적 및/또는 비공간적(예를 들어, 부정확한 극성 검출) 보정을 수행하도록 사운드바(610)(예를 들어, GUI(600)에 도시된 표현이 아닌 물리적 디바이스)에 명령하게 할 수 있다. 보정의 일부로서, 사운드바(610)는 본원에 설명되는 바와 같이 통지를 생성할 수 있다. 통지가 생성되면, 통지는 사용자 디바이스(140)에 송신되어 GUI(600)에 디스플레이될 수 있다. 대안적으로, 사운드바(610)는 보정의 결과를 사용자 디바이스(140)에 송신할 수 있고, 사용자 디바이스(140)는, 통지가 생성되어야 하는지의 여부를 결정할 수 있다.The selection of the correction button 660 may be performed by the user device 140 in the sound bar 610 (e.g., in the GUI 600) to perform spatial and / or non-spatial (e.g., incorrect polarity detection) A physical device other than the depicted representation). As part of the correction, the soundbar 610 may generate a notification as described herein. Once the notification is generated, the notification may be sent to the user device 140 and displayed on the GUI 600. [ Alternatively, the soundbar 610 may send the result of the correction to the user device 140, and the user device 140 may determine whether a notification should be generated.

보상 버튼(662)의 선택은 보상 프로세싱을 온(on) 및 오프(off)로 토글링(toggling)할 수 있다. 또한, 청취자(620) 아이콘이 청취자(620)의 실제 위치를 나타내기 위해 GUI 내에서 선택되고 드래깅(dragging)될 수 있다. GUI(600) 내에서, 청취자(620)로 하여금 라우드스피커(606 및 608) 레이아웃, 청취자(620) 위치, 및/또는 라우드스피커(606 및 608) 위치 등을 변경할 수 있게 하는 추가적인 옵션이 선택될 수 있다(도시하지 않음). 예를 들어, 통지가 청취자(620)에 제공되고 그리고/또는 보정이 1회 이상 개시된 후에 비정상 조건이 계속 검출된다면, 이러한 추가적인 옵션(예를 들어, 청취자(620)로 하여금 몇몇 가능한 디스플레이된 옵션으로부터 GUI(600) 내의 라우드스피커의 위치를 선택할 수 있게 하는 것과 같은 수동 보정 옵션)이 청취자(620)에게 제시될 수 있다.The selection of the compensation button 662 may toggle compensation processing on and off. Also, the listener 620 icon may be selected and dragged within the GUI to indicate the actual location of the listener 620. Within the GUI 600, additional options are selected to allow the listener 620 to change the layout of the loudspeakers 606 and 608, the location of the listener 620, and / or the locations of the loudspeakers 606 and 608, (Not shown). For example, if a notification is provided to the listener 620 and / or an abnormal condition is continuously detected after the correction has been initiated more than once, then this additional option (e.g., enabling the listener 620 to select from several possible displayed options A manual correction option, such as to be able to select the location of the loudspeaker in the GUI 600) may be presented to the listener 620.

다른 예시적인 라우드스피커 위치 추정 프로세스Other exemplary loudspeaker position estimation processes

도 7은 다른 예시적인 라우드스피커 위치 추정 프로세스(700)를 나타낸다. 일 실시예에서, 프로세스(700)는 도 1a 및 도 1b와 관련하여 논의된 사운드바(110)(예를 들어, 위치 추정 엔진(116) 및/또는 통지 생성기(118))를 포함하여, 본원에 설명되는 시스템들 중 임의의 것에 의해 수행될 수 있다. 실시예에 따라, 프로세스(700)는 더 적은 및/또는 추가적인 블록을 포함할 수 있거나, 블록은 나타낸 것과 다른 순서로 수행될 수 있다.FIG. 7 shows another exemplary loudspeaker position estimation process 700. In one embodiment, the process 700 may include the sound bar 110 (e.g., the position estimation engine 116 and / or the notification generator 118) discussed with respect to FIGS. 1A and 1B, May be performed by any of the systems described in U.S. Pat. Depending on the embodiment, process 700 may include fewer and / or additional blocks, or blocks may be performed in a different order than shown.

블록(702)에서, 라우드스피커는 테스트 신호를 송신하도록 명령받는다. 예를 들어, 테스트 신호는 최대 길이 시퀀스일 수 있다. 라우드스피커는 유선 또는 무선 접속을 통해 위치 추정 엔진(116)에 의해 테스트 신호를 송신하도록 명령받을 수 있다. 마이크로폰 어레이(114) 내의 마이크로폰은 출력된 테스트 신호의 결과로서 오디오 신호를 각각 생성할 수 있다. 오디오 신호는 저장 디바이스(도시하지 않음)에 기록될 수 있다.At block 702, the loudspeaker is commanded to transmit a test signal. For example, the test signal may be a maximum length sequence. The loudspeaker may be instructed to transmit a test signal by the location estimation engine 116 via a wired or wireless connection. The microphone in the microphone array 114 may each generate an audio signal as a result of the output test signal. The audio signal may be recorded in a storage device (not shown).

블록(704)에서, 제1 기법을 사용하여, 마이크로폰 어레이(114) 내의 제1 마이크로폰으로부터 기록된 제1 오디오 신호 및 마이크로폰 어레이(114) 내의 제2 마이크로폰으로부터 기록된 제2 오디오 신호에 기초하여 제1 각도가 결정된다. 제1 기법은 라우드스피커의 위치를 추정하는 데 사용되는 임의의 기법일 수 있다. 예를 들어, 제1 기법은 본원에 설명되는 GCC-PHAT 기법일 수 있다.At block 704, using a first technique, a first audio signal recorded from a first microphone in the microphone array 114 and a second audio signal recorded from a second microphone in the microphone array 114, 1 The angle is determined. The first technique may be any technique used to estimate the position of the loudspeaker. For example, the first technique may be the GCC-PHAT technique described herein.

블록(706)에서, 제2 기법을 사용하여 제1 오디오 신호 및 제2 오디오 신호에 기초하여 제2 각도가 결정된다. 제2 기법은 제1 기법 이외의 다른 라우드스피커의 위치를 추정하는 데 사용되는 임의의 기법일 수 있다. 예를 들어, 제2 기법은 본원에 설명되는 DPC 기법일 수 있다.At block 706, a second angle is determined based on the first audio signal and the second audio signal using a second technique. The second technique may be any technique used to estimate the position of a loudspeaker other than the first technique. For example, the second technique may be the DPC technique described herein.

도 7에 나타낸 바와 같이, 블록(704 및 706)이 순서대로 수행되고, 블록(704)이 먼저 수행된다. 그러나 이것은 한정적인 것으로 의도된 것은 아니다. 예를 들어, 블록(704 및 706)이 순차적으로 수행될 수 있고, 블록(706)이 먼저 수행될 수 있다. 다른 예로서, 블록(704 및 706)은 동시에 수행될 수 있다.As shown in FIG. 7, blocks 704 and 706 are performed in order, and block 704 is performed first. However, this is not intended to be limiting. For example, blocks 704 and 706 may be performed sequentially, and block 706 may be performed first. As another example, blocks 704 and 706 may be performed simultaneously.

블록(708)에서, 제1 각도와 제2 각도의 비교에 기초하여 비정상 조건이 존재하는지의 여부가 결정된다. 예를 들어, 제1 각도 및 제2 각도가 임계 각도값 초과만큼 차이가 나는 경우, 본원에 설명되는 하나 이상의 비정상 조건과 같은 비정상 조건이 존재할 수 있다. 그렇지 않으면, 비정상 조건이 존재하지 않을 수 있다.At block 708, it is determined whether there is an abnormal condition based on a comparison of the first angle and the second angle. For example, if the first and second angles differ by more than the threshold angle value, there may be an abnormal condition, such as one or more abnormal conditions described herein. Otherwise, an abnormal condition may not exist.

추가적인 실시예Additional Examples

본 개시의 일 양태는 다중채널 서라운드 사운드 시스템에서 라우드스피커의 위치를 추정하기 위한 장치를 제공한다. 본 장치는, 제1 마이크로폰 및 제2 마이크로폰을 포함하는 마이크로폰 어레이를 포함하며, 제1 마이크로폰은 라우드스피커가 테스트 신호를 출력하는 데 응답하여 제1 오디오 신호를 생성하도록 구성되고, 제2 마이크로폰은 라우드스피커가 테스트 신호를 출력하는 데 응답하여 제2 오디오 신호를 생성하도록 구성된다. 본 장치는 마이크로폰 어레이에 커플링된 위치 추정 엔진을 더 포함하고, 위치 추정 엔진은: 제1 오디오 신호 및 제2 오디오 신호에 기초하여 도달 시간차를 결정하고; 도달 시간차에 기초하여 제1 각도를 결정하고; 제1 오디오 신호로부터 도출된 임펄스 응답에서 제1 직접 경로 컴포넌트를 식별하고; 제2 오디오 신호로부터 도출된 임펄스 응답에서 제2 직접 경로 컴포넌트를 식별하고; 제1 직접 경로 컴포넌트 및 제2 직접 경로 컴포넌트에 기초하여 제2 각도를 결정하고; 제1 각도와 제2 각도의 비교에 기초하여 비정상 조건이 존재하는지의 여부를 결정하도록 구성된다.One aspect of the present disclosure provides an apparatus for estimating the position of a loudspeaker in a multi-channel surround sound system. The apparatus includes a microphone array including a first microphone and a second microphone, wherein the first microphone is configured to generate a first audio signal in response to the loudspeaker outputting a test signal, And to generate a second audio signal in response to the speaker outputting the test signal. The apparatus further includes a position estimation engine coupled to the microphone array, the position estimation engine comprising: determining an arrival time difference based on the first audio signal and the second audio signal; Determine a first angle based on a time difference of arrival; Identify a first direct path component in an impulse response derived from the first audio signal; Identify a second direct path component in an impulse response derived from the second audio signal; Determine a second angle based on the first direct path component and the second direct path component; Based on a comparison of the first angle and the second angle, whether or not an abnormal condition exists.

선행 단락의 장치는 이하의 특징, 즉, 위치 추정 엔진은 또한: 제1 푸리에 변환을 형성하기 위해 제1 오디오 신호의 푸리에 변환을 생성하고, 제2 푸리에 변환을 형성하기 위해 제2 오디오 신호의 푸리에 변환을 생성하고, 제1 푸리에 변환 및 제2 푸리에 변환에 기초하여 도달 시간차를 결정하도록 구성되는 것; 위치 추정 엔진은 또한: 제1 오디오 신호의 상이한 부분 및 제2 오디오 신호의 상이한 부분에 기초하여 다른 도달 시간차를 결정하고, 도달 시간차 및 다른 도달 시간차를 집계하고, 집계된 도달 시간차에서의 도달 시간차에 대응하는 값의 발생 횟수에 기초하여 제1 각도를 결정하기 위한 도달 시간차를 선택하도록 구성되는 것; 위치 추정 엔진은 또한, 제1 오디오 신호로부터 도출된 임펄스 응답 내의 최고 진폭에 기초하여, 제1 오디오 신호로부터 도출된 임펄스 응답에서 제1 직접 경로 컴포넌트를 식별하도록 구성되는 것; 위치 추정 엔진은 또한, 제1 오디오 신호로부터 도출되는 임펄스 응답의 최고 진폭 전에 발생하고 최고 진폭의 임계값 내에 있는 제1 오디오 신호로부터 도출되는 임펄스 응답 내의 제1 진폭에 기초하여 제1 오디오 신호로부터 도출되는 임펄스 응답에서 제1 직접 경로 컴포넌트를 식별하도록 구성되는 것; 위치 추정 엔진은 또한: 제1 직접 경로 컴포넌트를 포함하는 제1 시간 윈도우를 선택하고, 제2 직접 경로 컴포넌트를 포함하는 제2 시간 윈도우를 선택하고, 제1 시간 윈도우의 데이터 및 제2 시간 윈도우의 데이터를 사용하여 상호 상관을 결정하고, 결정된 상호 상관을 사용하여 제2 각도를 결정하도록 구성되는 것; 위치 추정 엔진은 또한: 제1 각도와 제2 각도를 비교하고, 제1 각도 및 제2 각도가 임계 각도값 내에 있지 않다는 결정에 응답하여 비정상 조건이 존재하는 것으로 결정하도록 구성되는 것; 위치 추정 엔진은 또한, 제1 각도 및 제2 각도가 임계 각도값 내에 있다는 결정에 응답하여 비정상 조건이 존재하지 않는 것으로 결정하도록 구성되는 것; 임계 각도값은 0°와 15° 사이를 포함하는 것; 본 장치는 비정상 조건이 존재한다는 결정에 응답하여 네트워크를 통해 사용자 디바이스에 통지를 송신하도록 구성된 통지 생성기를 더 포함하는 것; 통지는 라우드스피커의 각도가 부정확하다는 표시, 물체가 라우드스피커와 마이크로폰 어레이 사이의 경로를 막고 있다는 표시, 라우드스피커의 극성이 부정확하다는 표시, 및 라우드스피커의 위치가 부정확하거나 비이상적이라는 표시 중 하나를 포함하는 것; 위치 추정 엔진은 또한, 테스트 신호를 출력하도록 라우드스피커에 명령하도록 구성되는 것; 본 장치는 사운드바, 오디오/비주얼(A/V: audio/visual) 수신기, 중앙 스피커, 및 텔레비전 중 하나를 포함하는 것; 다중채널 서라운드 사운드 시스템은 스테레오, 2.1, 3.1, 5.1, 5.2, 7.1, 7.2, 11.1, 11.2, 및 22.2 스피커 레이아웃 중 하나로 배열되는 것 중 임의의 하위 조합을 포함할 수 있다.The apparatus of the preceding paragraph has the following features: the position estimation engine also generates a Fourier transform of the first audio signal to form a first Fourier transform, and a Fourier transform of the second audio signal to form a second Fourier transform, And to determine a time difference of arrival based on the first Fourier transform and the second Fourier transform; The location estimation engine may also be configured to: determine another arrival time difference based on different portions of the first audio signal and the different portion of the second audio signal, aggregate arrival time differences and other arrival time differences, And to select a time difference of arrival for determining a first angle based on the number of occurrences of the corresponding value; The position estimation engine is also configured to identify a first direct path component in an impulse response derived from the first audio signal, based on the highest amplitude in the impulse response derived from the first audio signal; The position estimation engine may also be adapted to derive from the first audio signal based on a first amplitude in the impulse response that occurs before the highest amplitude of the impulse response derived from the first audio signal and that is within the threshold of the highest amplitude, Wherein the first direct path component is configured to identify a first direct path component in an impulse response; The location estimation engine may also: select a first time window that includes the first direct path component, a second time window that includes the second direct path component, compare the data of the first time window and the data of the second time window To determine a cross-correlation using the data, and to determine a second angle using the determined cross-correlation; The position estimation engine is also configured to: compare the first angle with the second angle, and to determine that an abnormal condition is present in response to determining that the first and second angles are not within the threshold angle value; The position estimation engine is further configured to determine that an abnormal condition is not present in response to determining that the first angle and the second angle are within the threshold angle value; Critical angle values include between 0 ° and 15 °; The apparatus further comprises a notification generator configured to send a notification to the user device over the network in response to the determination that an abnormal condition is present; The notification can be one of an indication that the angle of the loudspeaker is incorrect, an indication that the object is blocking the path between the loudspeaker and the microphone array, an indication that the loudspeaker polarity is incorrect, and an indication that the loudspeaker is inaccurate or not ideal. Include; The location estimation engine is also configured to command the loudspeaker to output a test signal; The apparatus includes one of a sound bar, an audio / visual (A / V) receiver, a center speaker, and a television; A multi-channel surround sound system may include any subset of those arranged in one of stereo, 2.1, 3.1, 5.1, 5.2, 7.1, 7.2, 11.1, 11.2, and 22.2 speaker layouts.

용어Terms

본원에 설명된 것 이외의 많은 다른 변형이 본 개시로부터 명백할 것이다. 예를 들어, 실시예에 따라, 본원에 설명되는 임의의 알고리즘의 특정 동작, 이벤트 또는 기능은 상이한 순서로 수행될 수 있고, 추가, 병합, 또는 함께 생략될 수 있다(예를 들어, 설명된 모든 행위 또는 이벤트가 알고리즘 실시에 필요한 것은 아님). 또한, 특정 실시예에서, 동작 또는 이벤트는 예를 들어, 순차적이 아니라 멀티-스레드 프로세싱, 인터럽트 프로세싱, 또는 다중 프로세서 또는 프로세서 코어를 통해 또는 다른 병렬 아키텍처 상에서 동시에 수행될 수 있다. 또한, 다른 작업 또는 프로세스가 함께 기능할 수 있는 상이한 머신 및/또는 컴퓨팅 시스템에 의해 수행될 수 있다.Many other variations other than those described herein will be apparent from the present disclosure. For example, according to an embodiment, certain actions, events, or functions of any algorithm described herein may be performed in a different order and may be added, merged, or omitted together (e.g., Action or event is not required for algorithm implementation). Also, in certain embodiments, operations or events may be performed, for example, not sequentially, but multi-threaded processing, interrupt processing, or multiple processors or processor cores, or concurrently on other parallel architectures. Also, other work or processes may be performed by different machines and / or computing systems that may function together.

본원에 개시된 실시예와 관련하여 설명된 다양한 예시적인 논리 블록, 모듈 및 알고리즘 단계는 전자 하드웨어, 컴퓨터 소프트웨어, 또는 둘 다의 조합으로 구현될 수 있다. 하드웨어 및 소프트웨어의 이러한 상호 교환 가능성을 명확하게 나타내기 위해, 다양한 예시적인 컴포넌트, 블록, 모듈 및 단계가 일반적으로 기능의 관점에서 전술되었다. 이러한 기능이 하드웨어 또는 소프트웨어로 구현되는지의 여부는 전체 시스템에 부과된 특정 애플리케이션 및 설계 제약 사항에 따른다. 설명된 기능은 각각의 특정 애플리케이션에 대해 다양한 방식으로 구현될 수 있지만, 이러한 구현 결정은 본 개시의 범위를 벗어나게 하는 것으로 해석되어서는 안 된다.The various illustrative logical blocks, modules, and algorithm steps described in connection with the embodiments disclosed herein may be implemented as electronic hardware, computer software, or combinations of both. To clearly illustrate this interchangeability of hardware and software, various illustrative components, blocks, modules, and steps have been described above generally in terms of functionality. Whether such functionality is implemented as hardware or software depends upon the particular application and design constraints imposed on the overall system. The described functionality may be implemented in various ways for each particular application, but such implementation decisions should not be interpreted as causing a departure from the scope of the present disclosure.

본원에 개시된 실시예와 관련하여 설명된 다양한 예시적인 논리 블록 및 모듈은 디지털 논리 회로, 범용 프로세서, 디지털 신호 프로세서(DSP: digital signal processor), 애플리케이션 특유 집적 회로(ASIC: application specific integrated circuit), 필드 프로그래머블 게이트 어레이(FPGA: field programmable gate array) 또는 다른 프로그래머블 논리 디바이스, 이산 게이트 또는 트랜지스터 논리, 이산 하드웨어 컴포넌트, 또는 본원에 설명된 기능을 수행하도록 설계된 이들의 임의의 조합을 포함하는 하드웨어 프로세서와 같은 머신에 의해 구현 또는 수행될 수 있다. 범용 프로세서는 마이크로프로세서일 수 있지만, 대안적으로, 프로세서는 컨트롤러, 마이크로컨트롤러 또는 상태 머신, 또는 이들의 조합 등일 수 있다. 프로세서는 컴퓨터-실행가능 명령어를 프로세싱하도록 구성된 전기 회로를 포함할 수 있다. 다른 실시예에서, 프로세서는 컴퓨터-실행가능 명령어를 프로세싱하지 않고 논리 동작을 수행하는 FPGA 또는 다른 프로그래머블 디바이스를 포함한다. 프로세서는 또한 컴퓨팅 디바이스의 조합, 예를 들어, DSP와 마이크로프로세서, 복수의 마이크로프로세서, DSP 코어와 관련된 하나 이상의 마이크로프로세서, 또는 임의의 다른 이러한 구성의 조합으로서 구현될 수 있다. 컴퓨팅 환경은 몇 개를 예로 들면, 마이크로프로세서, 메인프레임 컴퓨터, 디지털 신호 프로세서, 휴대용 컴퓨팅 디바이스, 디바이스 제어기, 또는 기기 내의 연산 엔진에 한정되지 않지만 이를 포함하는 임의의 유형의 컴퓨터 시스템을 포함할 수 있다.The various illustrative logical blocks and modules described in connection with the embodiments disclosed herein may be implemented or performed with a digital logic circuit, a general purpose processor, a digital signal processor (DSP), an application specific integrated circuit (ASIC) Such as a hardware processor including a programmable logic device, a field programmable gate array (FPGA) or other programmable logic device, discrete gate or transistor logic, discrete hardware components, or any combination thereof designed to perform the functions described herein As shown in FIG. A general purpose processor may be a microprocessor, but in the alternative, the processor may be a controller, microcontroller or state machine, or a combination thereof. The processor may comprise an electrical circuit configured to process computer-executable instructions. In another embodiment, the processor includes an FPGA or other programmable device that performs logic operations without processing computer-executable instructions. A processor may also be implemented as a combination of computing devices, e.g., a combination of a DSP and a microprocessor, a plurality of microprocessors, one or more microprocessors associated with a DSP core, or any other such configuration. The computing environment may include any type of computer system, including, but not limited to, a microprocessor, mainframe computer, digital signal processor, handheld computing device, device controller, or computing engine within the device .

본원에 개시된 실시예와 관련하여 설명된 방법, 프로세스 또는 알고리즘의 단계는 하드웨어, 하나 이상의 메모리 디바이스에 저장되고 하나 이상의 프로세서에 의해 실행되는 소프트웨어 모듈, 또는 이들의 조합으로 직접 구현될 수 있다. 소프트웨어 모듈은 RAM 메모리, 플래시 메모리, ROM 메모리, EPROM 메모리, EEPROM 메모리, 레지스터, 하드 디스크, 이동식 디스크, CD-ROM 또는 임의의 다른 형태의 비일시적 컴퓨터-판독 가능 저장 매체, 매체 또는 본 기술 분야에 알려진 물리적 컴퓨터 저장 장치에 상주할 수 있다. 예시적인 저장 매체는, 프로세서가 저장 매체로부터 정보를 판독하고 저장 매체에 정보를 기입할 수 있도록 프로세서에 커플링될 수 있다. 대안적으로, 저장 매체는 프로세서에 통합될 수 있다. 저장 매체는 휘발성 또는 비휘발성일 수 있다. 프로세서 및 저장 매체는 ASIC에 상주할 수 있다.The steps of a method, process, or algorithm described in connection with the embodiments disclosed herein may be embodied directly in hardware, in a software module stored in one or more memory devices and executed by one or more processors, or a combination of both. A software module may reside in RAM, flash memory, ROM memory, EPROM memory, EEPROM memory, registers, hard disk, removable disk, CD-ROM or any other form of non-volatile computer-readable storage medium, May reside in a known physical computer storage device. An exemplary storage medium may be coupled to the processor such that the processor can read information from, and write information to, the storage medium. Alternatively, the storage medium may be integrated into the processor. The storage medium may be volatile or non-volatile. The processor and the storage medium may reside in an ASIC.

달리 명시하지 않는 한, 또는 사용된 문맥 내에서 달리 이해되지 않는 한, 본원에서 사용되는, 특히, "할 수 있다(can)", "할 수도 있다(might)", "할 수도 있다(may)", "예를 들어(e.g.)"와 같은 조건어는 일반적으로 특정 실시예는 특정의 특징, 요소 및/또는 상태를 포함하지 않지만 다른 실시예는 포함하지 않는다는 것을 전달하도록 의도된다. 따라서, 이러한 조건어는 일반적으로 하나 이상의 실시예에 대해 특징, 요소 및/또는 상태가 어떤 방식으로든 요구되거나 하나 이상의 실시예가 필자 입력 또는 프롬프팅으로 또는 이들 없이, 이러한 특징, 요소 및/또는 상태가 임의의 특정 실시예에 포함되거나 임의의 특정 실시예에서 수행되어야 하는지의 여부를 결정하기 위해 반드시 논리를 포함한다는 것을 암시하는 것으로 의도되지는 않는다. "포함하는(comprising)", "포괄하는(including)", "갖는(having)" 등의 용어는 동의어이며 개방형으로 포괄적으로 사용되며 추가적인 요소, 특징, 동작(act), 동작(operation) 등을 배제하지 않는다. 또한, "또는"이라는 용어는 포괄적 의미로 사용되며(배타적 의미로 사용되지는 않음), 예를 들어, 요소의 목록을 연결하는 데 사용될 때, "또는"이라는 용어는 목록의 요소 중 하나, 일부 또는 전부를 의미한다. 또한, 본원에서 사용되는 "각각의"라는 용어는 그 통상적인 의미를 갖는 것 이외에, "각각"이라는 용어가 적용되는 요소들의 세트의 임의의 서브셋을 의미할 수 있다.May "," may "," may "," may ", and the like, as used herein, unless the context clearly indicates otherwise. Quot; and "(e. G.)" Generally are intended to convey that a particular embodiment does not include a particular feature, element, and / or state but does not include other embodiments. Thus, it will be appreciated that such terms are generally used herein to refer to features, elements, and / or conditions in any manner whatsoever of one or more embodiments, or that one or more embodiments may be implemented with input, Quot; is intended to be inclusive or not to imply that it is intended to be < RTI ID = 0.0 > embracing < / RTI > logic to determine whether or not it should be performed in any particular embodiment. The terms "comprising", "including", "having", and the like are synonymous and are used in an open fashion and include additional elements, features, acts, operations, etc. Do not exclude. Also, the term "or" is used in a generic sense (not used in an exclusive sense), for example, when used to link a list of elements, the term "or" Or all of them. Also, as used herein, the term "each" may mean any subset of the set of elements to which the term "each" applies, in addition to having its ordinary meaning.

"X, Y 및 Z 중 적어도 하나"라는 문구와 같은 택일어는 달리 명시하지 않는 한 일반적으로 항목, 용어 등이 X, Y 또는 Z 또는 그 조합을 전달하는 것으로 사용되는 문맥으로 이해되어야 한다. 따라서, 이러한 택일어는 일반적으로 특정 실시예가 각각에 대해 존재하도록 X 중 적어도 하나, Y 중 적어도 하나 및 Z 중 적어도 하나를 필요로 함을 암시하는 것으로 의도되지 않는다.Alternative terms such as "at least one of X, Y, and Z" should be understood as the context in which items, terms, etc. are generally used to convey X, Y or Z or a combination thereof unless otherwise specified. Thus, such alternatives are generally not intended to imply that at least one of X, at least one of Y, and at least one of Z exist for a particular embodiment to exist for each.

달리 명시하지 않는 한, "어느(a)" 또는 "어떤(an)"과 같은 관사는 일반적으로 하나 이상의 설명된 항목을 포함하도록 해석되어야 한다. 따라서, "~하도록 구성되는 디바이스"와 같은 문구는 하나 이상의 인용된 디바이스를 포함하도록 의도된다. 이러한 하나 이상의 인용된 디바이스는 또한 명시된 기재 사항을 수행하도록 총괄적으로 구성될 수 있다. 예를 들어, "기재 사항 A, B 및 C를 수행하도록 구성된 프로세서"는 기재 사항 B 및 C를 수행하도록 구성된 제2 프로세서와 함께 동작하는 기재 사항 A를 수행하도록 구성된 제1 프로세서를 포함할 수 있다.Unless otherwise indicated, articles such as "a" or "an" are to be construed to generally include one or more of the described items. Thus, a phrase such as " a device configured to "is intended to include one or more cited devices. Such one or more cited devices may also be collectively configured to perform the stated entries. For example, a "processor configured to perform subscriptions A, B, and C" may include a first processor configured to perform subscription A that operates in conjunction with a second processor configured to perform subscriptions B and C .

전술한 상세한 설명이 다양한 실시예에 적용되는 새로운 특징들을 나타내고, 설명하고, 적시하였지만, 나타낸 디바이스 또는 알고리즘의 형태 및 상세 사항에서의 다양한 생략, 치환 및 변화가 본 개시의 사상을 벗어나지 않고도 이루어질 수 있음이 이해될 것이다. 이해되는 바와 같이, 본원에 설명되는 본 발명의 특정 실시예는, 일부 특징이 다른 것들과 별도로 사용될 수 있거나 실시될 수 있기 때문에, 본원에서 개진된 모든 특징 및 이점을 제공하지 않는 형태로 구현될 수 있다.Although the foregoing detailed description has shown, described, and pointed out novel features which are applied to various embodiments, various omissions, substitutions and changes in the form and details of the devices or algorithms depicted may be made without departing from the spirit of the present disclosure. Will be understood. As will be appreciated, certain embodiments of the invention described herein may be implemented in a form that does not provide all the features and advantages recited in this disclosure, as some features may be used or practiced separately from others have.

Claims

An apparatus for estimating the position of a loudspeaker in a multi-channel surround sound system, comprising:
A microphone array comprising a first microphone and a second microphone, wherein the first microphone is configured to generate a first audio signal in response to the loudspeaker outputting a test signal, And to generate a second audio signal in response to outputting the test signal; And
A position estimation engine coupled to the microphone array, the position estimation engine comprising:
Determine an arrival time difference estimate based on the first audio signal and the second audio signal;
Determine a first angle based on the arrival time difference estimate;
Identify a first direct path component in an impulse response derived from the first audio signal;
Identify a second direct path component in an impulse response derived from the second audio signal;
Determine a second angle based on the first direct path component and the second direct path component;
And to determine whether an abnormal condition exists based on a comparison of the first angle and the second angle.

The method according to claim 1,
The location estimation engine also includes:
Dividing the first audio signal into one or more first segments;
Divide the second audio signal into one or more second segments;
Generate a Fourier transform of a first one of the one or more first segments to form a first Fourier transform;
Generate a Fourier transform of a first one of the one or more second segments to form a second Fourier transform;
And to determine the arrival time difference estimate based on the first Fourier transform and the second Fourier transform. &Lt; Desc / Clms Page number 19 >

3. The method of claim 2,
The location estimation engine also includes:
Determine a plurality of arrival time difference estimates based on the generated Fourier transform of the at least one first segment and the at least one second segment;
Aggregate the plurality of arrival time difference estimates into a histogram;
And determine arrival time difference estimates based on arrival time difference estimates in the plurality of arrival time difference estimates with the most occurrence in the histogram.

The method according to claim 1,
Wherein the position estimation engine is further configured to identify the first direct path component in an impulse response derived from the first audio signal based on a highest amplitude in an impulse response derived from the first audio signal, A device for estimating the position of a loudspeaker in a sound system.

The method according to claim 1,
The location estimation engine also includes:
Selecting a first time window including the first direct path component;
Selecting a second time window including the second direct path component;
Determine cross-correlation using data in the first time window and data in the second time window;
And determine the second angle using the determined cross-correlation. &Lt; Desc / Clms Page number 19 >

The method according to claim 1,
The location estimation engine also includes:
Compare the first angle with the second angle;
And determine that the abnormal condition is present in response to determining that the first angle and the second angle are not within a threshold angle value. &Lt; Desc / Clms Page number 21 >

The method according to claim 6,
Wherein the threshold angle value comprises a value between 0 and 15 degrees. &Lt; Desc / Clms Page number 13 >

The method according to claim 1,
Further comprising a notification generator configured to send a notification to the user device over the network in response to the determination that the abnormal condition is present.

9. The method of claim 8,
The notification may include an indication that the angle of the loudspeaker is incorrect, an indication that the object is blocking the path between the loudspeaker and the microphone array, an indication that the polarity of the loudspeaker is incorrect, and an indication that the location of the loudspeaker is incorrect Wherein the at least one loudspeaker is one of a plurality of loudspeakers.

The method according to claim 1,
An apparatus for estimating the position of a loudspeaker in a multi-channel surround sound system, the system comprising one of a soundbar, an audio / visual (A / V) receiver, a center speaker, and a television.

The method according to claim 1,
Wherein the multi-channel surround sound system is arranged in one of a stereo, 2.1, 3.1, 5.1, 5.2, 7.1, 7.2, 11.1, 11.2, and 22.2 speaker layout.

A method for estimating the position of a loudspeaker in a multi-channel surround sound system, comprising:
Receiving a first audio signal from a first microphone of the microphone array and receiving a second audio signal from a second microphone of the microphone array;
Determining an arrival time difference estimate based on the first audio signal and the second audio signal;
Determining a first angle based on the arrival time difference estimate;
Identifying a first direct path component in an impulse response derived from the first audio signal;
Identifying a second direct path component in an impulse response derived from the second audio signal;
Determining a second angle based on the first direct path component and the second direct path component; And
And determining whether an abnormal condition exists based on a comparison of the first angle and the second angle. &Lt; Desc / Clms Page number 19 >

13. The method of claim 12,
Wherein the determining the arrival time difference estimate comprises:
Dividing the first audio signal into one or more first segments;
Dividing the second audio signal into one or more second segments;
Generating a Fourier transform of a first one of the one or more first segments to form a first Fourier transform;
Generating a Fourier transform of a first one of the one or more second segments to form a second Fourier transform; And
Further comprising determining the arrival time difference estimate based on the first Fourier transform and the second Fourier transform. &Lt; RTI ID = 0.0 > 8. < / RTI >

13. The method of claim 12,
Wherein identifying a first direct path component in an impulse response derived from the first audio signal comprises determining a first direct path component in an impulse response derived from the first audio signal based on a maximum amplitude in an impulse response derived from the first audio signal, Further comprising identifying the first direct path component. &Lt; RTI ID = 0.0 > 8. < / RTI >

13. The method of claim 12,
Wherein the step of determining whether the abnormal condition is present comprises:
Comparing the first angle with the second angle; And
Further comprising determining that the abnormal condition is present in response to determining that the first angle and the second angle are not within a threshold angle value. &Lt; Desc / Clms Page number 17 >

16. The method of claim 15,
Wherein the threshold angle value comprises a value between 0 and 15 degrees.

13. The method of claim 12,
Further comprising transmitting a notification to the user device over the network in response to the determination that the abnormal condition is present, the notification indicating that the angle of the loudspeaker is incorrect, Wherein the loudspeaker comprises one of an indication that it is blocking the path, an indication that the polarity of the loudspeaker is incorrect, and an indication that the location of the loudspeaker is inaccurate.

18. A non-volatile physical computer storage device in which executable instructions are stored, the executable instructions comprising instructions that, when executed by a hardware processor,
Command a loudspeaker to transmit a test signal;
Determine a first angle based on a first audio signal recorded from a first microphone in the microphone array and a second audio signal recorded from a second microphone in the microphone array using the first technique;
Determine a second angle based on the first audio signal and the second audio signal using a second technique;
And determine whether an abnormal condition is present based on a comparison of the first angle and the second angle.

19. The method of claim 18,
Wherein the first technique comprises a Generalized Cross Correlation and Phase Transform (GCC-PHAT) technique.

19. The method of claim 18,
Wherein the second technique comprises a direct path component (DPC) technique.