TW202324372A

TW202324372A - Audio system with dynamic target listening spot and ambient object interference cancelation

Info

Publication number: TW202324372A
Application number: TW111130512A
Authority: TW
Inventors: 周開祥
Original assignee: 瑞昱半導體股份有限公司
Priority date: 2021-12-10
Filing date: 2022-08-13
Publication date: 2023-06-16
Also published as: TW202324373A; CN116261094A; TW202324375A; CN116261095A; CN116261093A; TW202324374A; CN116261096A

Abstract

A sound system is proposed, dynamically playing optimized audio signals based on user position. A sensor circuits dynamically senses a target space to generate a field environment information. First speaker and second speaker are arranged for audio playback. A host device recognizes a user in the field environment information, and determine the user position corresponding to the target space, and adaptively designate the user position as a target listening spot. A sensor circuit contains a camera capturing a field environment image out of the target space. A recognizer circuit analyzes the field environment image to obtain from the target space, an ambient object with location, size and acoustic attribute information. The control circuit performs a channel based compensation operation on the target listening spot to generate optimized first channel audio signal and second channel audio signal.

Description

Audio system that dynamically adjusts to the target listening point and eliminates distractions from ambient objects

本申請涉及音訊處理技術，其實是一種可依據音場空間中的狀況變化而動態調整播放效果的音響系統。This application relates to audio processing technology, which is actually an audio system that can dynamically adjust the playback effect according to the changes in the sound field space.

現有的音響系統，包含多個喇叭布局在一目標空間周圍而形成一環繞音場環境。每一喇叭可分別輸出對應的聲道音訊。在配置環繞音場環境時，音響系統的安裝人員通常會指派該目標空間的一中心區域為最佳聆聽點，做為安裝多個喇叭的依據。當多個喇叭同時播放多個聲道音訊時，位於該最佳聆聽點的使用者可獲得身歷其境的聆聽效果。The existing sound system includes a plurality of loudspeakers arranged around a target space to form a surround sound field environment. Each speaker can output corresponding channel audio respectively. When configuring the surround sound field environment, the installer of the sound system will usually designate a central area of the target space as the best listening point, as a basis for installing multiple speakers. When multiple speakers play multiple channels of audio at the same time, the user at the sweet spot can obtain an immersive listening effect.

然而，在現實環境中，使用者的聆聽效果很容易受到各種變數的影響。舉例來說，在傳統的音響系統中，最佳聆聽點的範圍是區域限定的。當使用者移動至該最佳聆聽點以外的區域時，雖然還是聽得到音響系統輸出的多個聲道音訊，但多個聲道音訊在使用者位置上產生的聆聽效果可能已經大打折扣或完全失效。此外，目標空間中的房間格局、傢俱位置和材質，都是可能干擾聆聽效果的環境物件。舉例來說，沙發、窗戶、和窗簾會吸收或反射一部份的聲音能量，而扭曲了最佳聆聽點上收到的各聲道音訊。However, in a real environment, the user's listening effect is easily affected by various variables. For example, in a traditional audio system, the range of the sweet spot is zone-bound. When the user moves to an area outside the sweet spot, although the multi-channel audio output by the audio system can still be heard, the listening effect of the multi-channel audio at the user's position may have been greatly reduced or completely eliminated. invalidated. In addition, the room layout, furniture position and materials in the target space are all environmental objects that may interfere with the listening effect. For example, couches, windows, and drapes absorb or reflect some of the sound energy, distorting the individual channel audio received at the sweet spot.

換句話說，傳統的音響系統無法動態調整最佳聆聽點的位置，使用者被迫限制移動以遷就最佳聆聽點的位置，實屬不便。另一方面，各聲道音訊可能受到環境物件的干擾而扭曲失真，使最佳聆聽點的範圍更加侷限甚至是消失。這麼一來，以高昂成本布建的音場環境就失去意義。In other words, the traditional audio system cannot dynamically adjust the position of the sweet spot, and the user is forced to restrict movement to accommodate the position of the sweet spot, which is really inconvenient. On the other hand, the audio of each channel may be distorted and distorted by the interference of environmental objects, so that the range of the sweet spot is further limited or even disappeared. In this way, the sound field environment built at high cost is meaningless.

有鑑於此，如何使音響系統隨著使用者移動而動態調整最佳聆聽點，並消除目標空間中的環境物件的干擾，是有待解決的問題。In view of this, how to make the audio system dynamically adjust the sweet spot as the user moves and eliminate the interference of environmental objects in the target space is a problem to be solved.

本說明書提供一種音響系統的實施例，可動態地依據使用者位置優化播放效果，其中音響系統包含一感測器電路，一第一喇叭和一第二喇叭，以及一主機裝置。該感測器電路設置為可動態地感測一目標空間而產生一音場環境資訊。第一喇叭和一第二喇叭，設置為可播放音訊。主機裝置耦接該感測器電路、該第一喇叭和該第二喇叭，包含一辨識電路，一控制電路以及一音訊傳輸電路。辨識電路設置為可從該音場環境資訊中辨識出一使用者並判斷該使用者在該目標空間中的一使用者位置。控制電路耦接該辨識電路，設置為可將該使用者位置動態地指派為一目標聆聽點。音訊傳輸電路耦接該控制電路、該第一喇叭和該第二喇叭，設置為可傳輸音訊。其中，該感測器電路包含一攝影機，設置為可捕捉該目標空間的一音場環境影像。該辨識電路分析該音場環境影像而獲取該目標空間中一環境物件的空間配置資訊和聲學屬性資訊。該控制電路依據該目標聆聽點，及該環境物件的空間配置資訊及聲學屬性資訊，進行通道基底補償運作以產生對該目標聆聽點優化的一第一聲道音訊和一第二聲道音訊。最後，該控制電路透過該音訊傳輸電路分別地輸出該第一聲道音訊和該第二聲道音訊至對應的該第一喇叭和該第二喇叭。This specification provides an embodiment of an audio system, which can dynamically optimize the playback effect according to the user's position, wherein the audio system includes a sensor circuit, a first speaker and a second speaker, and a host device. The sensor circuit is configured to dynamically sense a target space to generate sound field environment information. A first speaker and a second speaker are set to play audio. The host device is coupled to the sensor circuit, the first speaker and the second speaker, and includes an identification circuit, a control circuit and an audio transmission circuit. The identification circuit is configured to identify a user from the sound field environment information and determine a user position of the user in the target space. The control circuit is coupled to the identification circuit and configured to dynamically assign the user's position as a target listening point. The audio transmission circuit is coupled to the control circuit, the first speaker and the second speaker, and is configured to transmit audio. Wherein, the sensor circuit includes a camera configured to capture a sound field environment image of the target space. The identification circuit analyzes the sound field environment image to obtain spatial configuration information and acoustic attribute information of an environmental object in the target space. The control circuit performs channel floor compensation operation according to the target listening point, spatial configuration information and acoustic property information of the environmental object to generate a first channel audio and a second channel audio optimized for the target listening point. Finally, the control circuit respectively outputs the first channel audio and the second channel audio to the corresponding first speaker and the second speaker through the audio transmission circuit.

本說明書提供一種音響系統的實施例，可動態地依據使用者位置優化播放效果，其中音響系統包含一感測器電路，一第一喇叭和一第二喇叭，以及一主機裝置。該感測器電路設置為可動態地感測一目標空間而產生一音場環境資訊。一第一喇叭和一第二喇叭，設置為可播放音訊。主機裝置耦接該感測器電路、該第一喇叭和該第二喇叭，包含一辨識電路、一控制電路和一音訊傳輸電路。辨識電路設置為可從該音場環境資訊中辨識出一使用者並判斷該使用者在該目標空間中的一使用者位置。控制電路耦接該辨識電路，設置為可將該使用者位置動態地指派為一目標聆聽點。音訊傳輸電路耦接該控制電路、該第一喇叭和該第二喇叭，設置為可傳輸音訊。其中，該感測器電路包含一攝影機，設置為可捕捉該目標空間的一音場環境影像。該辨識電路分析該音場環境影像而獲取該目標空間中一環境物件的空間配置資訊和聲學屬性資訊。該控制電路將該目標空間對應至一物件基底空間，並依據該環境物件對應地在該物件基底空間中建立一補償音源物件。該補償音源物件的一中繼資料包含：該環境物件的座標位置、大小，以及對聲音的反射率和吸收率。該控制電路依據該目標聆聽點和該中繼資料進行一物件基底補償運作，抵消該環境物件對該目標聆聽點的干擾而產生對該目標聆聽點優化的一第一聲道音訊和一第二聲道音訊。該控制電路透過該音訊傳輸電路分別地輸出該第一聲道音訊和該第二聲道音訊至對應的該第一喇叭和該第二喇叭。This specification provides an embodiment of an audio system, which can dynamically optimize the playback effect according to the user's position, wherein the audio system includes a sensor circuit, a first speaker and a second speaker, and a host device. The sensor circuit is configured to dynamically sense a target space to generate sound field environment information. A first speaker and a second speaker are set to play audio. The host device is coupled to the sensor circuit, the first speaker and the second speaker, and includes an identification circuit, a control circuit and an audio transmission circuit. The identification circuit is configured to identify a user from the sound field environment information and determine a user position of the user in the target space. The control circuit is coupled to the identification circuit and configured to dynamically assign the user's position as a target listening point. The audio transmission circuit is coupled to the control circuit, the first speaker and the second speaker, and is configured to transmit audio. Wherein, the sensor circuit includes a camera configured to capture a sound field environment image of the target space. The identification circuit analyzes the sound field environment image to obtain spatial configuration information and acoustic attribute information of an environmental object in the target space. The control circuit corresponds the target space to an object base space, and correspondingly creates a compensation sound source object in the object base space according to the environmental object. A metadata of the compensation sound source object includes: the coordinate position, size, and reflection rate and absorption rate of the environmental object. The control circuit performs an object base compensation operation based on the target listening point and the relay data to offset the interference of the environmental object on the target listening point to generate a first channel audio and a second channel optimized for the target listening point channel audio. The control circuit respectively outputs the first channel audio and the second channel audio to the corresponding first speaker and the second speaker through the audio transmission circuit.

本說明書提供一種音響系統的實施例，可動態地依據使用者位置優化播放效果，其中音響系統包含一感測器電路，一第一喇叭和一第二喇叭，以及一主機裝置。該感測器電路設置為可動態地感測一目標空間而產生一音場環境資訊。該第一喇叭和該第二喇叭設置為可播放音訊。該主機裝置耦接該感測器電路、該第一喇叭和該第二喇叭，包含一辨識電路、一控制電路、一音訊傳輸電路和一人機介面電路。該辨識電路設置為可從該音場環境資訊中辨識出一使用者並判斷該使用者在該目標空間中的一使用者位置。該控制電路耦接該辨識電路，設置為可將該使用者位置動態地指派為一目標聆聽點。該音訊傳輸電路耦接該控制電路、該第一喇叭和該第二喇叭，設置為可傳輸音訊。該人機介面電路耦接該控制電路，設置為可運行一配置程序，而獲取該目標空間中一環境物件的空間配置資訊和聲學屬性資訊。其中，該控制電路依據該目標聆聽點，及該環境物件的空間配置資訊及聲學屬性資訊，進行一通道基底補償運作以產生對該目標聆聽點優化的一第一聲道音訊和一第二聲道音訊。該控制電路透過該音訊傳輸電路分別地輸出該第一聲道音訊和該第二聲道音訊至對應的該第一喇叭和該第二喇叭。This specification provides an embodiment of an audio system, which can dynamically optimize the playback effect according to the user's position, wherein the audio system includes a sensor circuit, a first speaker and a second speaker, and a host device. The sensor circuit is configured to dynamically sense a target space to generate sound field environment information. The first speaker and the second speaker are configured to play audio. The host device is coupled to the sensor circuit, the first speaker and the second speaker, and includes an identification circuit, a control circuit, an audio transmission circuit and a man-machine interface circuit. The identifying circuit is configured to identify a user from the sound field environment information and determine a user position of the user in the target space. The control circuit is coupled to the identification circuit and configured to dynamically assign the user's position as a target listening point. The audio transmission circuit is coupled to the control circuit, the first speaker and the second speaker, and is configured to transmit audio. The human-machine interface circuit is coupled to the control circuit, and is configured to run a configuration program to obtain spatial configuration information and acoustic attribute information of an environmental object in the target space. Wherein, the control circuit performs a channel floor compensation operation according to the target listening point, the spatial configuration information and the acoustic property information of the environmental object to generate a first channel audio and a second sound optimized for the target listening point Road audio. The control circuit respectively outputs the first channel audio and the second channel audio to the corresponding first speaker and the second speaker through the audio transmission circuit.

本說明書提供一種音響系統的實施例，可動態地依據使用者位置優化播放效果，其中音響系統包含一感測器電路，一第一喇叭和一第二喇叭，以及一主機裝置。該感測器電路設置為可動態地感測一目標空間而產生一音場環境資訊。該第一喇叭和該第二喇叭設置為可播放音訊。該主機裝置耦接該感測器電路、該第一喇叭和該第二喇叭，包含一辨識電路、一控制電路、一音訊傳輸電路和一人機介面電路。該辨識電路設置為可從該音場環境資訊中辨識出一使用者並判斷該使用者在該目標空間中的一使用者位置。該控制電路耦接該辨識電路，設置為可將該使用者位置動態地指派為一目標聆聽點。該音訊傳輸電路耦接該控制電路、該第一喇叭和該第二喇叭，設置為可傳輸音訊。該人機介面電路耦接該控制電路，設置為可運行一配置程序，而獲取該目標空間中一環境物件的空間配置資訊和聲學屬性資訊。其中，該控制電路將該目標空間對應至一物件基底空間，並依據該環境物件對應地在該物件基底空間中建立一補償音源物件。該補償音源物件的一中繼資料包含：該環境物件的座標位置、大小，以及對聲音的反射率和吸收率。該控制電路依據該目標聆聽點和該中繼資料進行一物件基底補償運作，抵消該環境物件對該目標聆聽點的干擾而產生對該目標聆聽點優化的一第一聲道音訊和一第二聲道音訊。該控制電路透過該音訊傳輸電路分別地輸出該第一聲道音訊和該第二聲道音訊至對應的該第一喇叭和該第二喇叭。This specification provides an embodiment of an audio system, which can dynamically optimize the playback effect according to the user's position, wherein the audio system includes a sensor circuit, a first speaker and a second speaker, and a host device. The sensor circuit is configured to dynamically sense a target space to generate sound field environment information. The first speaker and the second speaker are configured to play audio. The host device is coupled to the sensor circuit, the first speaker and the second speaker, and includes an identification circuit, a control circuit, an audio transmission circuit and a man-machine interface circuit. The identifying circuit is configured to identify a user from the sound field environment information and determine a user position of the user in the target space. The control circuit is coupled to the identification circuit and configured to dynamically assign the user's position as a target listening point. The audio transmission circuit is coupled to the control circuit, the first speaker and the second speaker, and is configured to transmit audio. The human-machine interface circuit is coupled to the control circuit, and is configured to run a configuration program to obtain spatial configuration information and acoustic attribute information of an environmental object in the target space. Wherein, the control circuit corresponds the target space to an object base space, and correspondingly creates a compensation sound source object in the object base space according to the environmental object. A metadata of the compensation sound source object includes: the coordinate position, size, and reflection rate and absorption rate of the environmental object. The control circuit performs an object base compensation operation based on the target listening point and the relay data to offset the interference of the environmental object on the target listening point to generate a first channel audio and a second channel optimized for the target listening point channel audio. The control circuit respectively outputs the first channel audio and the second channel audio to the corresponding first speaker and the second speaker through the audio transmission circuit.

上述實施例的優點之一，是音響系統可透過感測器動態地追蹤使用者位置，並持續地針對使用者位置優化播放效果。使用者不需要為了獲得最佳體驗而去遷就固定的聆聽位置。One of the advantages of the above embodiment is that the audio system can dynamically track the user's location through the sensor, and continuously optimize the playback effect according to the user's location. Users do not need to settle for a fixed listening position for the best experience.

上述實施例的另一優點，是音響系統可辨識目標空間中的環境物件，並據以調整聲道音訊以抵消環境物件的干擾。Another advantage of the above-mentioned embodiment is that the sound system can identify the environmental objects in the target space, and accordingly adjust the channel audio to counteract the interference of the environmental objects.

本發明的其他優點將搭配以下的說明和圖式進行更詳細的解說。Other advantages of the present invention will be explained in more detail with the following description and drawings.

以下將配合相關圖式來說明本發明的實施例。在圖式中，相同的標號表示相同或類似的元件或方法流程。Embodiments of the present invention will be described below in conjunction with related figures. In the drawings, the same reference numerals indicate the same or similar elements or method flows.

圖1為本發明一實施例的音響系統100的功能方塊圖。FIG. 1 is a functional block diagram of an audio system 100 according to an embodiment of the present invention.

音響系統100主要由一主機裝置130和多個喇叭構成。主機裝置130可控制多個喇叭而播放音訊。主機裝置130可以是電腦主機、準系統、嵌入式系統，或客製化的數位音訊處理設備。主機裝置130中包含一通信電路136，使主機裝置130可與一用戶設備150進行有線或無線連接，而做為音源訊號或資料的輸入管道。The audio system 100 is mainly composed of a host device 130 and a plurality of speakers. The host device 130 can control multiple speakers to play audio. The host device 130 can be a host computer, a barebone system, an embedded system, or a customized digital audio processing device. The host device 130 includes a communication circuit 136 , so that the host device 130 can be wired or wirelessly connected to a user equipment 150 as an input channel for audio signals or data.

用戶設備150可以是手機、電腦、電視棒，遊戲機，或其他的音訊源提供裝置，透過通信電路136提供音樂或聲音串流給主機裝置130。更進一步地說，音響系統100可利用通信電路136與用戶設備150或其他多媒體設備協同運作，而形成一套同時具有視頻功能和音頻功能的家庭劇院系統。舉例來說，目標空間170中還可包含一投影幕、螢幕或顯示器（未繪示），受到用戶設備150的控制而顯示畫面。又例如，用戶設備150可以是一頭戴式虛擬實境設備。使用者180可站在目標空間170中並透過用戶設備150看到畫面，而主機裝置130可受到用戶設備150的控制，而隨著畫面同步地播放音頻。本實施例中的通信電路136，可以是，但不限定於是高畫質多媒體介面（High Definition Multimedia Interface；HDMI）、數位傳輸介面（Sony/Philips Digital Interface Format；SPDIF）、無線區域網路模組，乙太網路模組，短波射頻收發器、或藍牙低功耗（Bluetooth Low Energy；BLE）第4版或第5版的演進應用、或通用序列埠（Universal Serial Bus）。The user equipment 150 can be a mobile phone, a computer, a TV stick, a game console, or other audio source providing devices, which provide music or audio streams to the host device 130 through the communication circuit 136 . Furthermore, the audio system 100 can use the communication circuit 136 to cooperate with the user equipment 150 or other multimedia equipment to form a home theater system with both video and audio functions. For example, the target space 170 may further include a projection screen, screen or display (not shown), which is controlled by the user equipment 150 to display images. For another example, the user device 150 may be a head-mounted virtual reality device. The user 180 can stand in the target space 170 and see the picture through the user equipment 150 , and the host device 130 can be controlled by the user equipment 150 to play audio synchronously with the picture. The communication circuit 136 in this embodiment may be, but not limited to, a high definition multimedia interface (High Definition Multimedia Interface; HDMI), a digital transmission interface (Sony/Philips Digital Interface Format; SPDIF), a wireless local area network module , Ethernet module, short-wave RF transceiver, or Bluetooth Low Energy (Bluetooth Low Energy; BLE) version 4 or version 5 evolution application, or Universal Serial Bus (Universal Serial Bus).

主機裝置130中還包含一音訊傳輸電路135，用於連接多個喇叭，並分別輸出多個聲道音訊使喇叭播放。主機裝置130透過音訊傳輸電路135控制多個喇叭的方式，可以是單向的數位或類比輸出，也可以是雙向的同步通信協議。音訊傳輸電路135和每一喇叭之間的連結方式，可以是有線介面、無線介面或兩者的混合應用。有線介面可以是，但不限定是複合影音端子、數位傳輸介面，或高畫質多媒體介面。無線介面可以是，但不限定是無線區域網路、短波射頻收發器、或藍牙低功耗第4版或第5版的演進應用。在進一步衍生的實施例中，由於音訊傳輸電路135和通信電路136在功能定位上皆為與外部元件連接的介面，在衍生的實作中可以是整合在一起的多功能雙向傳輸介面模組。音訊傳輸電路135和通信電路136採用各種公開的標準傳輸技術來實現元件之間的連接和傳輸，可增加音響系統100的未來功能擴充性，並減少元件損壞時的替換成本。The host device 130 also includes an audio transmission circuit 135 for connecting multiple speakers, and outputting multiple channel audio respectively for the speakers to play. The manner in which the host device 130 controls multiple speakers through the audio transmission circuit 135 may be a one-way digital or analog output, or a two-way synchronous communication protocol. The connection between the audio transmission circuit 135 and each speaker can be a wired interface, a wireless interface or a hybrid application of the two. The wired interface can be, but not limited to, a composite audio-visual terminal, a digital transmission interface, or a high-definition multimedia interface. The wireless interface can be, but is not limited to, a WLAN, a shortwave radio frequency transceiver, or an evolution of Bluetooth low energy version 4 or 5. In a further derivative embodiment, since the audio transmission circuit 135 and the communication circuit 136 are functionally positioned as interfaces for connecting with external components, they may be an integrated multifunctional bidirectional transmission interface module in a derivative implementation. The audio transmission circuit 135 and the communication circuit 136 use various public standard transmission technologies to realize the connection and transmission between components, which can increase the future functional expandability of the audio system 100 and reduce the replacement cost when components are damaged.

圖1中的目標空間170可以理解為一個可供使用者180使用音響系統100的三維立體空間。每個喇叭可配置在目標空間170中的不同位置，對應地播放一個聲道音訊。多個喇叭的圍繞配置，可以在一目標空間170中創造一個環繞音場環境。喇叭的數量和配置方式存在多種標準規格。舉例來說，在一個5.1聲道的環繞音響系統中，包含了兩個前置喇叭、一個中置喇叭、兩個環繞聲道喇叭，以及一個重低音喇叭，以包圍一目標聆聽點的方式創造出一環繞音場空間，共同向該目標聆聽點播放聲音。在一個7.1聲道的環境音響系統中，進一步地在該目標聆聽點的後面多配置一對後環繞聲道喇叭，可提供更加立體的音場效果。近年來還出現5.1.2聲道和7.2.2聲道等新規格，包含了更多的喇叭數量和特定方向的聲道配置，可達到更逼真的「全景聲」、「天空音效」或「地板音效」等效果。為了方便說明本實施例的音響系統100的技術特徵，圖1中僅繪示第一喇叭110和第二喇叭120為代表。其中，第一喇叭110接收並播放由主機裝置130提供的第一聲道音訊112，而第二喇叭120接收並播放由主機裝置130提供的第二聲道音訊122。必須理解的是，在實作中，本實施例的音響系統100並不限定於只能應用兩個喇叭，而是可應用於2.1聲道、4.1聲道、5.1聲道、7.2聲道或更多聲道的規格配置。目標空間170中的每個喇叭，可分別具有不同的音頻輸出規格。舉例來說，有的喇叭擅長輸出重低音，有的喇叭擅長輸出中高音。主機裝置130可依照不同喇叭規格而在目標空間170中規畫出各種不同特性的音場環境。The target space 170 in FIG. 1 can be understood as a three-dimensional space where the user 180 can use the sound system 100 . Each speaker can be arranged at a different position in the target space 170 to play a channel of audio correspondingly. The surround configuration of multiple speakers can create a surround sound field environment in a target space 170 . There are many standard specifications for the number and configuration of horns. For example, in a 5.1-channel surround sound system, two front speakers, a center speaker, two surround channel speakers, and a subwoofer are included to create a sound system that surrounds a target listening point. Create a surround sound field space, and play the sound to the target listening point together. In a 7.1-channel ambient sound system, a pair of rear surround channel speakers are further arranged behind the target listening point to provide a more three-dimensional sound field effect. In recent years, new specifications such as 5.1.2-channel and 7.2.2-channel have emerged, including more speakers and channel configurations in specific directions, which can achieve more realistic "panoramic sound", "sky sound" or " Floor Sound Effect” and other effects. To facilitate description of the technical features of the audio system 100 of this embodiment, only the first speaker 110 and the second speaker 120 are shown in FIG. 1 as representatives. Wherein, the first speaker 110 receives and plays the first channel audio 112 provided by the host device 130 , and the second speaker 120 receives and plays the second channel audio 122 provided by the host device 130 . It must be understood that, in practice, the audio system 100 of this embodiment is not limited to only two speakers, but can be applied to 2.1 channels, 4.1 channels, 5.1 channels, 7.2 channels or more Multi-channel specification configuration. Each speaker in the target space 170 may have different audio output specifications. For example, some speakers are good at outputting heavy bass, and some speakers are good at outputting mid-high. The host device 130 can plan various sound field environments with different characteristics in the target space 170 according to different speaker specifications.

在說明書及申請專利範圍中所指稱的「聲道」一詞，泛指各種實體聲道和邏輯聲道。邏輯聲道指的是在系統內部傳輸的音訊資料流，而實體聲道指的是每一喇叭藉以播放的訊號來源。在本實施例中，每個喇叭對應播放出來的第一聲道音訊112和第二聲道音訊122，屬於實體聲道，可以是一或多個邏輯聲道向下混音（down-mix）而產生的結果。舉例來說，一副耳機只有兩個喇叭，但可以同時聽見多個應用程式產生的音效。換句話說，多個應用程式的音效資料，可由系統向下混音為兩個實體聲道，並透過兩個喇叭播放為可聽聲音。因此，本實施例中的第一聲道音訊112和第二聲道音訊122，不限定為只包含單一邏輯聲道的音訊信號，也可以是多個邏輯聲道依據預定的比例混合產生的音訊信號。The term "sound channel" referred to in the specification and scope of patent application generally refers to various physical sound channels and logical sound channels. The logical channel refers to the audio data stream transmitted within the system, while the physical channel refers to the source of the signal played by each speaker. In this embodiment, each speaker corresponds to the first-channel audio 112 and the second-channel audio 122, which belong to physical channels and can be down-mixed by one or more logical channels. And the result produced. For example, a pair of headphones has only two speakers, but you can hear the sound effects generated by multiple applications at the same time. In other words, the audio data of multiple applications can be down-mixed into two physical audio channels by the system, and played as audible sound through two speakers. Therefore, the first-channel audio 112 and the second-channel audio 122 in this embodiment are not limited to audio signals that only include a single logical channel, but may also be audio generated by mixing multiple logical channels according to a predetermined ratio. Signal.

在圖1中，第一喇叭110和第二喇叭120配置在一目標空間170的兩側，對該目標空間170中的一目標聆聽點播放聲音。目標聆聽點可以理解為該音響系統100的播放效果最優化的位置。在一些音響系統中，又將目標聆聽點稱為聆聽甜蜜點（Listening Sweet Spot）。在大部份的情況下，目標聆聽點通常位於目標空間170的特定區域，例如中心點、軸線上、切平面上，或是多個喇叭的等效音量中心。在圖1中，以使用者180所在的第一位置171，來表示該目標空間170的目標聆聽點。當使用者180從第一位置171沿著移動軌跡173移動到第二位置172時，由於使用者180遠離了第一喇叭110而接近了第二喇叭120，使用者180所接收到的聆聽效果產生了偏差。傳統的音響系統無法追蹤使用者180的移動而對應地調整第二位置172收到的聆聽效果，而本實施例提議的解決方案將於後詳述。In FIG. 1 , the first speaker 110 and the second speaker 120 are arranged on two sides of a target space 170 , and play sound to a target listening point in the target space 170 . The target listening point can be understood as the position where the playback effect of the sound system 100 is optimized. In some audio systems, the target listening point is called the listening sweet spot (Listening Sweet Spot). In most cases, the target listening point is usually located in a specific area of the target space 170 , such as the center point, on the axis, on the tangent plane, or the equivalent volume center of multiple speakers. In FIG. 1 , the target listening point of the target space 170 is represented by the first position 171 where the user 180 is located. When the user 180 moves from the first position 171 to the second position 172 along the moving track 173, since the user 180 moves away from the first speaker 110 and approaches the second speaker 120, the listening effect received by the user 180 is produced. Deviation. The traditional audio system cannot track the movement of the user 180 and accordingly adjust the listening effect received by the second position 172 , but the solution proposed in this embodiment will be described in detail later.

另一方面，目標空間170中通常會包含一些環境物件175，例如沙發、桌子、窗廉、牆壁、天花板、地板。這些環境物件175隨著材質、大小和位置的不同，會對第一喇叭110和第二喇叭120播放的聲音產生不同程度的干擾反應。舉例來說，布質沙發或窗廉會吸收聲音，大理石地板或牆壁會反射聲音。換句話說，環境物件175的存在，會影響目標聆聽點收到的第一聲道音訊112和第二聲道音訊122。傳統的音響系統不具備辨識目標空間170中的環境物件175的能力，也不具備依據環境物件175的大小、材質、位置而補償第一聲道音訊112和第二聲道音訊122的功能。本實施例的音響系統100可計算並消除目標空間170中所有的環境物件175對第一聲道音訊112和第二聲道音訊122的干擾。為了方便說明，本實施例的圖1中僅繪示一個環境物件175來解釋音響系統100的運作方式。然而，必須理解的是，圖1的目標空間170中並非用以限定只能有一個環境物件175。解決環境物件175干擾的方案將於後詳述。On the other hand, the target space 170 usually includes some environmental objects 175 , such as sofas, tables, windowsills, walls, ceilings, and floors. Depending on the material, size and location of these environmental objects 175 , they will produce different degrees of interference to the sounds played by the first speaker 110 and the second speaker 120 . For example, a fabric sofa or window sills will absorb sound, and marble floors or walls will reflect sound. In other words, the existence of the environmental object 175 will affect the first channel audio 112 and the second channel audio 122 received by the target listening point. The traditional audio system does not have the ability to identify the environmental objects 175 in the target space 170 , nor does it have the function of compensating the first channel audio 112 and the second channel audio 122 according to the size, material, and position of the environmental objects 175 . The sound system 100 of the present embodiment can calculate and eliminate the interference of all environmental objects 175 in the target space 170 on the first channel audio 112 and the second channel audio 122 . For convenience of description, only one environment object 175 is shown in FIG. 1 of this embodiment to explain the operation of the sound system 100 . However, it must be understood that the target space 170 in FIG. 1 is not limited to only one environmental object 175 . The solution to the interference of the environmental object 175 will be described in detail later.

本實施例的主機裝置130中還包含一儲存電路131。儲存電路131可包含非揮發性記憶體，用於儲存主機裝置130運作所需的相關作業系統、應用軟體或韌體。儲存電路131中也可包含揮發性記憶體，用於做為控制電路132的運算記憶體。本實施例的主機裝置130中還包含一控制電路132。控制電路132可以是中央處理器、數位訊號處理器、或微控制器。控制電路132可從儲存電路131中讀取預存的作業系統、軟體或韌體而控制主機裝置130、第一喇叭110和第二喇叭120，以執行音訊播放運作。更進一步地說，本實施例的主機裝置130利用控制電路132進行一系列的音場補償運算，以動態地優化播放效果，解決傳統音響系統無法克服的缺點。The host device 130 of this embodiment further includes a storage circuit 131 . The storage circuit 131 may include a non-volatile memory for storing related operating systems, application software or firmware required for the operation of the host device 130 . The storage circuit 131 may also include a volatile memory, which is used as an operation memory of the control circuit 132 . The host device 130 of this embodiment further includes a control circuit 132 . The control circuit 132 can be a central processing unit, a digital signal processor, or a microcontroller. The control circuit 132 can read the pre-stored operating system, software or firmware from the storage circuit 131 to control the host device 130 , the first speaker 110 and the second speaker 120 to perform an audio playback operation. Furthermore, the host device 130 of this embodiment uses the control circuit 132 to perform a series of sound field compensation calculations to dynamically optimize the playback effect and solve the insurmountable shortcomings of traditional audio systems.

為了動態地優化目標聆聽點上的播放效果，本實施例的音響系統100包含了一感測器電路140，設置為可動態地感測一目標空間170而產生一音場環境資訊。感測器電路140可以是位於主機裝置130外部的元件，耦接至主機裝置130。感測器電路140可以是由攝影機610、紅外線感測器620、無線偵測器630其中之一或多者的搭配組合。感測器電路140所捕捉的音場環境資訊的型式，可隨著感測器電路140的實作方式而有不同組合。舉例來說，音場環境資訊可以是包含使用者和環境物件的影像、圖片、熱成像、無線電波造影其中之一或多者的搭配組合。在一實施例中，感測器電路140設置在目標空間170的周圍。可以理解的是，雖然圖1中只繪示了一個感測器電路140，但在實作時，音響系統100可包含多組感測器電路140，分別配置在目標空間170周圍的不同位置，以獲得更精準的音場環境資訊。In order to dynamically optimize the playback effect at the target listening point, the audio system 100 of this embodiment includes a sensor circuit 140 configured to dynamically sense a target space 170 to generate sound field environment information. The sensor circuit 140 may be a component located outside the host device 130 and coupled to the host device 130 . The sensor circuit 140 may be a combination of one or more of the camera 610 , the infrared sensor 620 , and the wireless detector 630 . The types of the sound field environment information captured by the sensor circuit 140 can be combined in different ways according to the implementation of the sensor circuit 140 . For example, the sound field environment information may include one or a combination of images, pictures, thermal imaging, and radio wave imaging of the user and the environmental objects. In one embodiment, the sensor circuit 140 is disposed around the target space 170 . It can be understood that, although only one sensor circuit 140 is shown in FIG. 1 , in practice, the audio system 100 may include multiple sets of sensor circuits 140 respectively arranged in different positions around the target space 170 . In order to obtain more accurate sound field environment information.

在本實施例的主機裝置130中，包含一辨識電路134，耦接該感測器電路140。辨識電路134可從該音場環境資訊中辨識出影響音場的關鍵資訊，使控制電路132據以動態地調整從第一喇叭110和第二喇叭120播放的第一聲道音訊112和第二聲道音訊122。舉例來說，辨識電路134可從該音場環境資訊中辨識出一使用者並判斷該使用者在該目標空間中的一使用者位置。由於感測器電路140提供的音場環境資訊可以有多種不同型式的組合，辨識電路134也可以對應地實作不同的辨識技術方案。舉例來說，當音場環境資訊是影像時，辨識電路134可採用人工智慧的辨識技術，來分辨影像中的使用者。透過人工智慧的應用，辨識電路134分析出影像中的使用者後，還可進一步地定位使用者頭部、臉部、甚至耳朵位置。若是感測器電路140可提供具有空間深度的三維影像、紅外線熱成像、或無線訊號等多元化資訊，將有助於辨識電路134獲得更精準的辨識結果。In this embodiment, the host device 130 includes an identification circuit 134 coupled to the sensor circuit 140 . The identification circuit 134 can identify the key information affecting the sound field from the sound field environment information, so that the control circuit 132 can dynamically adjust the first channel audio 112 and the second channel audio played from the first speaker 110 and the second speaker 120 accordingly. Channel Audio 122. For example, the identification circuit 134 can identify a user from the sound field environment information and determine a user position of the user in the target space. Since the sound field environment information provided by the sensor circuit 140 can be combined in various types, the identification circuit 134 can also correspondingly implement different identification technical solutions. For example, when the sound field environment information is an image, the identification circuit 134 can use artificial intelligence identification technology to identify the user in the image. Through the application of artificial intelligence, after the identification circuit 134 analyzes the user in the image, it can further locate the user's head, face, and even the position of the ears. If the sensor circuit 140 can provide multiple information such as three-dimensional images with spatial depth, infrared thermal imaging, or wireless signals, it will help the identification circuit 134 to obtain more accurate identification results.

主機裝置130為了計算環境物件175對音場環境造成的干擾程度，需要環境物件175的空間配置資訊和聲學屬性資訊。空間配置資訊可以包含環境物件175的大小、位置、形狀、以及各種外型特徵。聲學屬性資訊可包含對聲音的吸收率、反射率、及共振頻率等材質相關特徵。在一實施例中，辨識電路134可在辨識音場環境資訊時，進一步從音場環境資訊中辨識出目標空間170中的環境物件175的空間配置資訊，並查找聲學屬性資訊。為了辨識環境物件，需要物件資料庫。在一實施例中，主機裝置130中的儲存電路131還可用來儲存一物件資料庫。物件資料庫可包含用於辨識環境物件的各種外型特徵資訊，以及每個環境物件對應的各種聲學屬性資訊。例如，當主機裝置130需要計算一環境物件175對音場環境造成的干擾程度時，可先透過辨識電路134分析出環境物件175的物件名稱，再由主機裝置130讀取儲存電路131而查找該環境物件175對應的吸收率及反射率。In order to calculate the degree of interference caused by the environmental object 175 to the sound field environment, the host device 130 needs spatial configuration information and acoustic attribute information of the environmental object 175 . The spatial configuration information may include the size, position, shape, and various appearance characteristics of the environmental object 175 . The acoustic property information may include material-related characteristics such as sound absorption rate, reflectance rate, and resonance frequency. In one embodiment, when identifying the sound field environment information, the identification circuit 134 may further identify the spatial configuration information of the environmental objects 175 in the target space 170 from the sound field environment information, and search for the acoustic attribute information. In order to identify environmental objects, an object database is required. In one embodiment, the storage circuit 131 in the host device 130 can also be used to store an object database. The object database may include various appearance feature information for identifying environmental objects, and various acoustic property information corresponding to each environmental object. For example, when the host device 130 needs to calculate the degree of interference caused by an environmental object 175 to the sound field environment, it can first analyze the object name of the environmental object 175 through the identification circuit 134, and then the host device 130 reads the storage circuit 131 to find the Absorptivity and reflectivity corresponding to the environmental object 175 .

在實作中，辨識電路134可以是客製作的處理器晶片，搭配儲存電路131中既存的作業系統、軟體或韌體而執行人工智慧的辨識功能。辨識電路134也可以是控制電路132的其中一個核心或執行緒電路，執行儲存電路131中既存人工智慧軟體產品而實現辨識功能。辨識電路134也可以是特定人工智慧軟體產品的記憶體模塊，受到控制電路132的執行而完成辨識的功能。In practice, the identification circuit 134 can be a processor chip made by a customer, and cooperate with the existing operating system, software or firmware in the storage circuit 131 to perform the identification function of artificial intelligence. The identification circuit 134 can also be one of the core or thread circuits of the control circuit 132 , which executes the existing artificial intelligence software product in the storage circuit 131 to realize the identification function. The identification circuit 134 can also be a memory module of a specific artificial intelligence software product, which is executed by the control circuit 132 to complete the identification function.

主機裝置130中的人機介面電路133，可供使用者控制主機裝置130的運作。人機介面電路133可包含顯示屏幕、按鍵、轉盤、或觸控屏，可供使用者進行基本的音響系統100控制功能，例如調整音量、播放、及快進倒退等。在一實施例中，控制電路132也可透過人機介面電路133執行一配置程序，以供使用者設定各種音場情境，或將目標空間170中的環境物件175空間配置資訊告訴主機裝置130。舉例來說，在該配置程序中，控制電路132利用人機介面電路133接收使用者輸入的物件配置資料，例如一或多個環境物件175的物件名稱、類型、大小和位置。在控制電路132獲得這些空間配置資訊後，再從儲存於儲存電路131中的物件資料庫中查找對應的吸收率和反射率，以便後續的音場補償運作。在進一步衍生的實施例中，人機介面電路133也可以是由用戶設備150提供。使用者可利用用戶設備150操作該配置程序，最後用戶設備150透過通信電路136將設置結果傳送給控制電路132。The human-machine interface circuit 133 in the host device 130 can allow the user to control the operation of the host device 130 . The man-machine interface circuit 133 may include a display screen, buttons, dials, or a touch screen, allowing the user to perform basic control functions of the audio system 100, such as adjusting volume, playing, and fast forward and reverse. In one embodiment, the control circuit 132 can also execute a configuration program through the man-machine interface circuit 133 for the user to set various sound field situations, or inform the host device 130 of the spatial configuration information of the environmental objects 175 in the target space 170 . For example, in the configuration procedure, the control circuit 132 utilizes the human-machine interface circuit 133 to receive object configuration data input by the user, such as the object name, type, size and location of one or more environmental objects 175 . After the control circuit 132 obtains the spatial configuration information, it searches the corresponding absorptivity and reflectivity from the object database stored in the storage circuit 131 for subsequent sound field compensation operations. In a further derivative embodiment, the human-machine interface circuit 133 may also be provided by the user equipment 150 . The user can use the user equipment 150 to operate the configuration program, and finally the user equipment 150 transmits the setting result to the control circuit 132 through the communication circuit 136 .

主機裝置130還可透過通信電路136連接至一遠端資料庫160。在進一步的實施例中，原本利用儲存電路131儲存的物件資料庫，也可以透過遠端資料庫160來儲存。當主機裝置130需要計算一環境物件175對音場環境造成的干擾程度時，可先透過辨識電路134分析音場環境資訊而獲得一物件特徵值，再利用通信電路136接入遠端資料庫160，查找出符合該物件特徵值的一環境物件175，並獲得該環境物件175的音場屬性資訊。遠端資料庫160可以是位於雲端或其他系統中的服務器，與主機裝置130之間通過有線或無線的雙向網路通信技術而連線。遠端資料庫160除了可提供查找功能，也可以接受更新資料的上傳，以持續擴充資料庫內容。舉例來說，主機裝置130可利用結構式查詢語法（Structured Query Language；SQL）與遠端資料庫160溝通。The host device 130 can also be connected to a remote database 160 through the communication circuit 136 . In a further embodiment, the object database originally stored by the storage circuit 131 can also be stored through the remote database 160 . When the host device 130 needs to calculate the degree of interference caused by an environmental object 175 to the sound field environment, it can first analyze the sound field environment information through the identification circuit 134 to obtain an object characteristic value, and then use the communication circuit 136 to access the remote database 160 , find out an environmental object 175 that matches the characteristic value of the object, and obtain the sound field attribute information of the environmental object 175 . The remote database 160 can be a server located in the cloud or other systems, and is connected with the host device 130 through a wired or wireless two-way network communication technology. In addition to providing a search function, the remote database 160 can also accept uploads of updated data, so as to continuously expand the content of the database. For example, the host device 130 can use Structured Query Language (SQL) to communicate with the remote database 160 .

基於圖1的系統架構，本申請所提出的音響系統100可實現至少下列技術效果。首先，音響系統100可動態追蹤使用者位置，做為目標聆聽點。音響系統100還可動態地獲取環境物件的空間配置資訊，做為優化音場效果的依據。最後，音響系統100動態地根據使用者位置和環境物件的空間配置資訊，補償喇叭輸出，以消除物件干擾，優化目標聆聽點上的聆聽效果。動態追蹤使用者位置的實施方式可採用攝影機、紅外線感測器、或無線定位等多元技術方案。獲取環境物件的空間配置資訊的實施方式可以是自動進行或手動進行。舉例來說，音響系統100可利用攝影機捕捉影像並進行人工智慧辨識、或透過一配置程序讓使用者手動輸入現場的環境狀況。補償喇叭輸出的實施方式可以基於幾種不同演算法。舉例來說，本說明書介紹了通道基底（Channel Base）算法和物件基底（Object Base）算法。Based on the system architecture of FIG. 1 , the audio system 100 proposed in this application can achieve at least the following technical effects. Firstly, the audio system 100 can dynamically track the user's location as the target listening point. The sound system 100 can also dynamically acquire the spatial configuration information of the environmental objects as a basis for optimizing the sound field effect. Finally, the audio system 100 dynamically compensates the speaker output according to the user's position and the spatial configuration information of the environmental objects, so as to eliminate object interference and optimize the listening effect at the target listening point. The implementation of dynamic tracking of the user's location can adopt multiple technical solutions such as cameras, infrared sensors, or wireless positioning. The implementation of obtaining the spatial configuration information of the environmental objects can be performed automatically or manually. For example, the sound system 100 can use a camera to capture images and perform artificial intelligence recognition, or allow users to manually input the environmental conditions of the scene through a configuration program. The implementation of compensating the horn output can be based on several different algorithms. For example, this specification introduces the channel base (Channel Base) algorithm and the object base (Object Base) algorithm.

以下以圖2說明音響系統100動態追蹤使用者位置，利用攝影機獲取音場環境配置，並以通道基底補償運作補償喇叭輸出的實施例。FIG. 2 below illustrates an embodiment in which the sound system 100 dynamically tracks the user's position, uses a camera to obtain the configuration of the sound field environment, and uses channel floor compensation to compensate the speaker output.

圖2為本發明一實施例的動態音效優化方法流程圖。FIG. 2 is a flowchart of a method for optimizing dynamic sound effects according to an embodiment of the present invention.

在圖2的流程圖中，位於一特定裝置所屬欄位中的流程，即代表由該特定裝置所進行的流程。例如，標記在「感測器電路」欄位中的部分，是由感測器電路140所進行的流程；標記在「主機裝置」欄位中的部分，是由主機裝置130所進行的流程；標記在「喇叭」欄位中的部分，則是由第一喇叭110和/或第二喇叭120所進行的流程；其餘依此類推。前述的邏輯也適用於後續的其他流程圖中。In the flow chart of FIG. 2 , the process in the column of a specific device represents the process performed by the specific device. For example, the part marked in the column of "sensor circuit" is the process performed by the sensor circuit 140; the part marked in the column of "host device" is the process performed by the host device 130; The part marked in the "speaker" column is the process performed by the first speaker 110 and/or the second speaker 120; and so on for the rest. The aforementioned logic is also applicable to other subsequent flow charts.

在流程202中，由感測器電路140動態地感測目標空間而產生音場環境資訊。在本實施例中，音場環境資訊可以是目標空間170中的光學、熱學、或電磁波資訊。舉例來說，感測器電路140可以包含一攝影機，以錄影的方式持續拍攝目標空間170的視頻，或是以拍照的方式週期性捕捉目標空間170的靜態照片。在另一實施例中，該感測器電路140還可包含一紅外線感測器，設置為可捕捉該目標空間中的一熱成像資料。紅外線感測器所產生的熱成像資料，除了包含空間深度的資訊，對溫度變化也極為敏感，因此特別適合用來追蹤使用者位置。在另一實施例中，該感測器電路140還可包含一無線偵測器，設置在該目標空間中，並偵測一電子裝置的無線訊號。當一使用者手持著一個電子裝置時，無線偵測器可偵測該電子裝置的信標時間差或無線訊號強弱，做為追蹤使用者位置的輔助手段。該電子裝置可以是使用者自己的手機，特製的信標產生器，頭戴式虛擬實境設備、遊戲手把、或音響系統100的遙控器。可以理解的是，本實施例並不限定感測器電路140的數量，也不限定一次只能使用一種感測方案。舉例來說，本實施例的音響系統100可採用多個感測器電路140從不同的位置協同運作，或是同時採用一或多個攝影機、紅外線感測器和無線偵測器。藉此，主機裝置130可以獲得更完整的音場環境資訊，並在後續程序中獲得更精準的辨識結果。In the process 202 , the sensor circuit 140 dynamically senses the target space to generate sound field environment information. In this embodiment, the sound field environment information may be optical, thermal, or electromagnetic wave information in the target space 170 . For example, the sensor circuit 140 may include a camera, which continuously captures video of the target space 170 in a video mode, or periodically captures still photos of the target space 170 in a photographic mode. In another embodiment, the sensor circuit 140 may further include an infrared sensor configured to capture a thermal imaging data in the target space. The thermal imaging data generated by the infrared sensor, in addition to containing spatial depth information, is also extremely sensitive to temperature changes, so it is especially suitable for tracking the user's location. In another embodiment, the sensor circuit 140 may further include a wireless detector, which is disposed in the target space and detects a wireless signal of an electronic device. When a user holds an electronic device, the wireless detector can detect the beacon time difference or the strength of the wireless signal of the electronic device as an auxiliary means for tracking the user's location. The electronic device can be the user's own mobile phone, a special beacon generator, a head-mounted virtual reality device, a game handle, or a remote control of the audio system 100 . It can be understood that this embodiment does not limit the number of sensor circuits 140 , nor does it limit that only one sensing scheme can be used at a time. For example, the audio system 100 of this embodiment can use multiple sensor circuits 140 to work together from different locations, or use one or more cameras, infrared sensors and wireless detectors at the same time. In this way, the host device 130 can obtain more complete sound field environment information, and obtain more accurate identification results in subsequent procedures.

在流程204中，感測器電路140將感測到的音場環境資訊傳送給主機裝置130。感測器電路140可以是持續性的傳送資料，例如視頻，或是週期性的回傳靜態資料。感測器電路140傳送資料的頻率可依據音場環境資訊的信息量、追蹤精確度要求、和主機裝置130的計算能力而自適應地決定。感測器電路140和主機裝置130之間，可以是透過專屬線路連接，或是透過通信電路136連線。在進一步衍生的實施例中，感測器電路140可與喇叭共用音訊傳輸電路135，藉以透過音訊傳輸電路135傳送音場環境資訊給主機裝置130。In the process 204 , the sensor circuit 140 transmits the sensed sound field environment information to the host device 130 . The sensor circuit 140 can continuously transmit data, such as video, or periodically return static data. The frequency at which the sensor circuit 140 transmits data can be adaptively determined according to the information volume of the sound field environment information, the tracking accuracy requirement, and the computing capability of the host device 130 . The connection between the sensor circuit 140 and the host device 130 can be through a dedicated line or through the communication circuit 136 . In a further derivative embodiment, the sensor circuit 140 can share the audio transmission circuit 135 with the speaker, so as to transmit the sound field environment information to the host device 130 through the audio transmission circuit 135 .

在流程206中，主機裝置130依據從感測器電路140收到的音場環境資訊判斷使用者位置。主機裝置130中的辨識電路134可對音場環境資訊執行辨識程序，例如應用人工智慧。隨著感測器電路140的感測方案不同，辨識電路134的辨識演算法也對應地的不同。可以理解的是，目標空間170和使用者位置可以二維空間或三維空間來表示。若是音響系統100中只實作了單一感測器電路140，至少可以感知二維空間的位置信息。若是音響系統100在實作中增加感測器電路140的數量或混合多元感測方案，可獲得三維空間的深度資訊而更精確的判斷使用者位置或使用者頭部位置。在一實施例中，辨識電路134可依據攝影機捕捉的音場環境影像，動態地辨識該使用者的頭部位置、臉部方向、或耳朵位置。在另一實施例中，辨識電路134可分析紅外線感測器產生的熱成像資料的移動軌跡，以動態地判斷使用者180的位置。又例如，辨識電路134可依據無線偵測器偵測到的無線訊號的特徵，動態地定位電子裝置在目標空間170中的一座標值。藉著該座標值，控制電路132可進一步地推測使用者耳朵位置。In the process 206 , the host device 130 determines the location of the user according to the sound field environment information received from the sensor circuit 140 . The recognition circuit 134 in the host device 130 can execute a recognition program on the sound field environment information, such as applying artificial intelligence. As the sensing schemes of the sensor circuit 140 are different, the identification algorithms of the identification circuit 134 are also correspondingly different. It can be understood that the target space 170 and the user position can be expressed in two-dimensional space or three-dimensional space. If only a single sensor circuit 140 is implemented in the audio system 100 , at least the position information in a two-dimensional space can be sensed. If the audio system 100 increases the number of sensor circuits 140 or mixes multiple sensing schemes in practice, the depth information of the three-dimensional space can be obtained to more accurately determine the position of the user or the position of the user's head. In one embodiment, the recognition circuit 134 can dynamically recognize the user's head position, face direction, or ear position according to the sound field environment image captured by the camera. In another embodiment, the identification circuit 134 can analyze the movement track of the thermal imaging data generated by the infrared sensor to dynamically determine the location of the user 180 . For another example, the identification circuit 134 can dynamically locate the coordinate value of the electronic device in the target space 170 according to the characteristics of the wireless signal detected by the wireless detector. According to the coordinate value, the control circuit 132 can further estimate the position of the user's ear.

在流程208中，主機裝置130中的辨識電路134分析出使用者位置後，主機裝置130中的控制電路132動態地將使用者位置指派為目標聆聽點。為了便於描述後續的實施例，在此將目標空間170描述為一個二維座標空間或三維座標空間，而目標聆聽點可以表示為目標空間170中的一座標值。隨著多個喇叭的布局方式不同，目標聆聽點的範圍可以不止是單一點，也可以是一個面、或具有長寬高的立體區域範圍。舉例來說，在辨識電路134分析出使用者頭部位置或耳朵位置後，控制電路132可將使用頭部位置或耳朵位置指派為目標聆聽點。控制電路132會透過後續的補償運作，使目標聆聽點獲得的播放效果不受使用者移動的影響。在實作中，控制電路132是透過調整第一聲道音訊112和第二聲道音訊122來補償目標聆聽獲得的聆聽效果。可以理解的是，流程208可能是隨著使用者位置的改變而動態的執行。因此，流程208並不限定是照著圖2所繪示的順序執行。換句話說，目標聆聽點可隨著使用者位置改變而即時更新。具體的調整運算將於後述。In the process 208 , after the identification circuit 134 in the host device 130 analyzes the user's location, the control circuit 132 in the host device 130 dynamically assigns the user's location as the target listening point. For the convenience of describing the following embodiments, the target space 170 is described here as a two-dimensional coordinate space or a three-dimensional coordinate space, and the target listening point can be expressed as a coordinate value in the target space 170 . Depending on the layout of multiple speakers, the range of the target listening point can be not only a single point, but also a plane, or a three-dimensional area with length, width and height. For example, after the identification circuit 134 analyzes the user's head position or ear position, the control circuit 132 can assign the user's head position or ear position as the target listening point. The control circuit 132 will make the playback effect obtained at the target listening point not be affected by the user's movement through subsequent compensation operations. In practice, the control circuit 132 compensates the listening effect obtained by the target listening by adjusting the first-channel audio 112 and the second-channel audio 122 . It can be understood that the process 208 may be executed dynamically as the user's location changes. Therefore, the process 208 is not limited to be executed in the order shown in FIG. 2 . In other words, the target listening point can be updated in real time as the user's position changes. The specific adjustment operation will be described later.

在流程210中，主機裝置130中的辨識電路134還對感測器電路140提供的音場環境資訊進行進一步的辨識，而獲取目標空間170中的環境物件的空間配置資訊。換句話說，感測器電路140提供的音場環境資訊，不止是可用來判斷使用者位置，也可用來判斷目標空間170中存在的各種環境物件175。在一實施例中，該感測器電路140中的攝影機捕捉該目標空間170的一音場環境影像後，該辨識電路134分析該音場環境影像，從該目標空間170中辨識出一或多個環境物件175，以及這些環境物件175的空間配置資訊。空間配置資訊包含環境物件175的大小、位置、形狀、外觀特徵。辨識電路134還可透過人工智慧的演算或資料庫的檢索而判斷每一環境物件175的聲學屬性資訊，例如對聲音的吸收率和反射率。在進一步衍生的實施例中，辨識電路134還可根據音場環境影像判斷目標空間170的應用場景類別。應用場景類別可以包含劇院、客廳、浴室、戶外等。如果主機裝置130知道目標空間170的應用場景類別，可以更快速地辨識目標空間170中的環境物件175而減少誤判。相關實施例將在圖9中說明。In the process 210 , the recognition circuit 134 in the host device 130 further recognizes the sound field environment information provided by the sensor circuit 140 to obtain the spatial configuration information of the environmental objects in the target space 170 . In other words, the sound field environment information provided by the sensor circuit 140 can not only be used to determine the user's position, but also can be used to determine various environmental objects 175 existing in the target space 170 . In one embodiment, after the camera in the sensor circuit 140 captures a sound field environment image of the target space 170, the identification circuit 134 analyzes the sound field environment image, and recognizes one or more sound field environment images from the target space 170. environmental objects 175, and the spatial configuration information of these environmental objects 175. The spatial configuration information includes the size, position, shape, and appearance characteristics of the environmental objects 175 . The identification circuit 134 can also determine the acoustic attribute information of each environmental object 175 through artificial intelligence calculation or database search, such as the absorption rate and reflectance rate of sound. In a further derivative embodiment, the identification circuit 134 can also determine the application scene category of the target space 170 according to the sound field environment image. Application scenario categories can include theater, living room, bathroom, outdoor, etc. If the host device 130 knows the application scene category of the target space 170 , it can identify the environmental objects 175 in the target space 170 more quickly and reduce misjudgment. A related embodiment will be illustrated in FIG. 9 .

在流程212中，主機裝置130中的控制電路132可計算喇叭對目標聆聽點的播放效果受環境物件影響的程度。一喇叭對目標聆聽點的播放效果，可以定義為該目標聆聽點上從該喇叭所接收到的等效音量（Equal loudness）或聲壓值（Sound Pressure Level；SPL）。在ISO226標準中定義了一個等響曲線（Fletcher-Munson Curve），說明使用者在不同的子頻帶下感知到的等效音量，其實對應的是不同的聲壓值。在一實施例中，控制電路132可採用等響曲線做為播放效果的標準參考基準，計算各種情況對目標聆聽點所接收到的聲壓值。控制電路132可利用環境物件175的空間配置資訊和聲學屬性資訊來評估環境物件175對目標聆聽點造成的干擾，以便進一步計算消除干擾的方法。環境物件175的空間配置資訊和屬性資訊的影響包含許多種情境。舉例來說，環境物件175體積越大，對目標聆聽點的干擾係數可能越大。環境物件175的位置是否阻擋使用者180和喇叭，也決定了喇叭受影響的程度。環境物件175隨著材質不同，可能會吸收聲音或反彈聲音。因此控制電路132需要針對不同的聲學屬性選用對應的參數或公式來計算喇叭受影響的程度。In the process 212, the control circuit 132 in the host device 130 can calculate the degree to which the playback effect of the speaker on the target listening point is affected by the environmental objects. The playback effect of a speaker on the target listening point can be defined as the equivalent volume (Equal loudness) or sound pressure level (Sound Pressure Level; SPL) received from the speaker at the target listening point. An equal loudness curve (Fletcher-Munson Curve) is defined in the ISO226 standard, indicating that the equivalent volume perceived by the user under different sub-bands actually corresponds to different sound pressure values. In one embodiment, the control circuit 132 can use the equal loudness curve as a standard reference for the playback effect, and calculate the received sound pressure values for the target listening point in various situations. The control circuit 132 can use the spatial configuration information and the acoustic property information of the environmental object 175 to evaluate the interference caused by the environmental object 175 to the target listening point, so as to further calculate a method for eliminating the interference. The influence of the spatial configuration information and attribute information of the environment object 175 includes many kinds of situations. For example, the larger the volume of the environmental object 175 , the larger the interference coefficient to the target listening point may be. Whether the location of the environmental object 175 blocks the user 180 and the speaker also determines the degree to which the speaker is affected. The environmental object 175 may absorb sound or bounce sound depending on the material. Therefore, the control circuit 132 needs to select corresponding parameters or formulas for different acoustic properties to calculate the degree of speaker impact.

在流程214中，主機裝置130中的控制電路132採用通道基底補償運作，分別計算每一喇叭的聲道音訊需要的輸出補償值。通道基底補償運作在判斷對目標聆聽點的播放效果時，是以每一聲道音訊為單位分開計算的。以多個喇叭中的一第一喇叭110所播放的一第一聲道音訊112為例，在第一聲道音訊112透過空氣傳送到目標聆聽點之前，可能受到一環境物件175干擾而損失能量。目標聆聽點的位置改變，也會影響到第一聲道音訊112在目標聆聽點上產生的聲壓值。通過通道基底補償運作，控制電路132可算出第一聲道音訊112在目標聆聽點上的聲壓值改變量。本實施例的控制電路132為第一聲道音訊112加入輸出補償值，以抵消所述的聲壓值改變量，使目標聆聽點收到的第一聲道音訊112還原至受到影響之前的狀態。換句話說，輸出補償值具有與聲壓值改變量相同的數值，但是相反的正負極性。In the process 214 , the control circuit 132 in the host device 130 adopts the channel floor compensation operation to calculate the required output compensation value of the channel audio of each speaker. When the channel floor compensation operation judges the playback effect on the target listening point, it is calculated separately based on the audio of each channel. Taking a first channel audio 112 played by a first speaker 110 among multiple speakers as an example, before the first channel audio 112 is transmitted to the target listening point through the air, it may be disturbed by an environmental object 175 and lose energy . The change of the position of the target listening point will also affect the sound pressure value generated by the first channel audio 112 at the target listening point. Through the channel floor compensation operation, the control circuit 132 can calculate the change amount of the sound pressure value of the first channel audio 112 at the target listening point. The control circuit 132 of this embodiment adds an output compensation value to the first channel audio 112 to offset the change in the sound pressure value, so that the first channel audio 112 received by the target listening point returns to the state before being affected . In other words, the output compensation value has the same numerical value as the change amount of the sound pressure value, but opposite positive and negative polarities.

在流程216中，控制電路132依據輸出補償值調整並輸出聲道音訊給喇叭。由於調整後的聲道音訊已抵消使用者180在目標空間170中的位移影響和環境物件175造成的干擾，使用者180感受到的聆聽效果保持一致。以目標空間170中的第一喇叭110和第二喇叭120為例，控制電路132計算並調整第一聲道音訊112和第二聲道音訊122中的不同子頻帶的聲壓值，藉此抵消使用者180因移動感受到的等效音量偏差。另一方面，該控制電路（132）依據環境物件175的位置、大小、及聲學屬性資訊對該目標聆聽點造成的聲壓值改變量，對應地補償第一聲道音訊112和第二聲道音訊122。In the process 216, the control circuit 132 adjusts and outputs the channel audio to the speaker according to the output compensation value. Since the adjusted channel audio has offset the effect of the displacement of the user 180 in the target space 170 and the interference caused by the environmental objects 175 , the listening effect experienced by the user 180 remains consistent. Taking the first speaker 110 and the second speaker 120 in the target space 170 as an example, the control circuit 132 calculates and adjusts the sound pressure values of different sub-bands in the first channel audio 112 and the second channel audio 122, thereby canceling The equivalent volume deviation felt by the user 180 due to movement. On the other hand, the control circuit (132) compensates the first-channel audio 112 and the second-channel audio 112 correspondingly according to the amount of change in the sound pressure value of the target listening point caused by the position, size, and acoustic attribute information of the environmental object 175. Audio 122.

在流程218中，由每一喇叭透過音訊傳輸電路135對應地從主機裝置130接收聲道音訊。以目標空間170中的第一喇叭110和第二喇叭120為例，控制電路132透過音訊傳輸電路135分別地輸出第一聲道音訊112和第二聲道音訊122至對應的第一喇叭110和第二喇叭120。於是，第一喇叭110和第二喇叭120對應地播放調整後的第一聲道音訊112和第二聲道音訊122，使使用者180所在的目標聆聽點獲得優化的聆聽效果。為便於說明，圖1的目標空間170的實施例中僅繪示了兩個喇叭和一個環境物件175。然而，可以理解的是，在實作中，主機裝置130中可包含不止兩個喇叭，而環境物件175的數量也不限於一個。在進一步衍生的實施例中，每一喇叭擅長輸出的音頻範圍可能是不同的。例如有的喇叭是中高音喇叭，有的喇叭是重低音喇叭。控制電路132在調整聲道音訊的時候，還可進一步的根據不同喇叭的特性而調整對應輸出的第一聲道音訊112和第二聲道音訊122。In the process 218 , each speaker correspondingly receives channel audio from the host device 130 through the audio transmission circuit 135 . Taking the first speaker 110 and the second speaker 120 in the target space 170 as an example, the control circuit 132 respectively outputs the first channel audio 112 and the second channel audio 122 to the corresponding first speaker 110 and The second horn 120. Therefore, the first speaker 110 and the second speaker 120 play the adjusted first-channel audio 112 and second-channel audio 122 correspondingly, so that the target listening point where the user 180 is located can obtain an optimized listening effect. For ease of illustration, only two speakers and one environmental object 175 are shown in the embodiment of the target space 170 in FIG. 1 . However, it can be understood that, in practice, the host device 130 may include more than two speakers, and the number of environmental objects 175 is not limited to one. In a further derivative embodiment, the audio frequency range that each speaker is good at outputting may be different. For example, some speakers are high-pitched speakers, and some speakers are subwoofers. When the control circuit 132 adjusts the channel audio, it can further adjust the corresponding output first channel audio 112 and second channel audio 122 according to the characteristics of different speakers.

以下以圖3說明音響系統100動態追蹤使用者位置，利用攝影機獲取音場環境配置，並以物件基底補償運作補償喇叭輸出的實施例。The following uses FIG. 3 to illustrate an embodiment in which the sound system 100 dynamically tracks the user's position, uses a camera to acquire the configuration of the sound field environment, and compensates the output of the speaker through the operation of object floor compensation.

圖3為本發明一實施例的動態音效優化方法流程圖。FIG. 3 is a flowchart of a method for optimizing dynamic sound effects according to an embodiment of the present invention.

在圖3的流程圖中，位於一特定裝置所屬欄位中的流程，即代表由該特定裝置所進行的流程。例如，標記在「感測器電路」欄位中的部分，是由感測器電路140所進行的流程；標記在「主機裝置」欄位中的部分，是由主機裝置130所進行的流程；標記在「喇叭」欄位中的部分，則是由第一喇叭110和/或第二喇叭120所進行的流程；其餘依此類推。前述的邏輯也適用於後續的其他流程圖中。In the flow chart of FIG. 3 , the process in the column of a specific device represents the process performed by the specific device. For example, the part marked in the column of "sensor circuit" is the process performed by the sensor circuit 140; the part marked in the column of "host device" is the process performed by the host device 130; The part marked in the "speaker" column is the process performed by the first speaker 110 and/or the second speaker 120; and so on for the rest. The aforementioned logic is also applicable to other subsequent flow charts.

圖3中的流程202、204、206、208及210與前實施例相同，為節省篇幅，不再重複說明。The processes 202 , 204 , 206 , 208 and 210 in FIG. 3 are the same as those in the previous embodiment, and are not repeated for space saving.

在本實施例的音響系統100完成流程210時，控制電路132已追蹤使用者180的位置並指派為目標聆聽點，並且也獲得了目標空間170中的一或多個環境物件175的空間配置資訊。接著以後續流程說明物件基底補償運作，來調整每一喇叭的聲道音訊。When the audio system 100 of this embodiment completes the process 210, the control circuit 132 has tracked the position of the user 180 and assigned it as the target listening point, and also obtained the spatial configuration information of one or more environmental objects 175 in the target space 170 . Then, the operation of the object floor compensation is described in the following procedure to adjust the channel audio of each speaker.

物件基底（Object Based）聲學系統起源於虛擬實境的混音技術，能利用有限數量的實體喇叭模擬出音源物件移動的效果。現存的一些軟體產品，例如杜比音場產品（Dolby Atmos）、空間音訊工作站（Spatial Audio Workstation）、或數位空間實境（DSpatial Reality）等都屬於物件基底的聲學系統。使用者可透過一人機介面在一虛擬空間中定義音源物件的移動軌跡。而物件基底系統可利用實體喇叭模擬出該虛擬空間中的音源物件的聲音效果。位於目標聆聽點的使用者，藉此可真實地感受到音源物件在空間中移動。The Object Based acoustic system originated from the mixing technology of virtual reality, which can use a limited number of physical speakers to simulate the effect of moving sound source objects. Some existing software products, such as Dolby Atmos, Spatial Audio Workstation, or DSpatial Reality, are object-based acoustic systems. The user can define the moving track of the sound source object in a virtual space through a man-machine interface. The object-based system can use physical speakers to simulate the sound effects of the sound source objects in the virtual space. Users at the target listening point can truly feel the sound source object moving in space.

物件基底聲學系統是建立在大量聲學參數的陣列運算上。每一音源物件具有一中繼資料，用於描述該音源物件的類型、位置、大小（長寬高）、發散度（divergence）等。經過物件基底的陣列運算後，一音源物件所代表的聲音將會被指派至一或多個喇叭而共同播放，每一喇叭相對播放該音源件的一部份的聲音。換句話說，物件基底的陣列運算可利用多個喇叭來模擬一個音源物件的空間效果。圖3的實施例提出基於物件基底聲學系統的一物件基底補償運作，來解決傳統的播放效果問題。The object base acoustic system is based on the array operation of a large number of acoustic parameters. Each audio source object has a metadata for describing the type, location, size (length, width, height), divergence, etc. of the audio source object. After the object-based array operation, the sound represented by an audio source object will be assigned to one or more speakers to play together, and each speaker will play a part of the sound of the audio source object. In other words, the object-based array operation can use multiple speakers to simulate the spatial effect of an audio source object. The embodiment of FIG. 3 proposes an object base compensation operation based on the object base acoustic system to solve the traditional playback effect problem.

在流程312中，主機裝置130中的控制電路132依據環境物件175建立物件基底的補償音源物件。在實作中，該控制電路132會先將目標空間170對應至虛擬實境的一物件基底空間中，再依據環境物件175對應地在該物件基底空間中建立一補償音源物件，用於產生抵消該環境物件175的音源效果。對位於目標聆聽點上的使用者180而言，環境物件175的存在也可以類比為一個音源物件。在實際應用場合中，環境物件175可能將一喇叭發出的聲音反射至該目標聆聽點。環境物件175也可能阻擋或吸收一部份聲音，使一喇叭對該目標聆聽點發出的聲音受到衰減。換句話說，本實施例的控制電路132將環境物件175類比為音源物件後，就能對應地在該物件基底空間中建立具有相反音源效果的負音源物件，做為抵消干擾的手段。在本實施例所述的音源效果，可以是針對目標聆聽點產生的聲壓值、等效音量，或增益值。In the process 312 , the control circuit 132 in the host device 130 creates an object-based compensation sound source object according to the environment object 175 . In practice, the control circuit 132 first maps the target space 170 to an object base space of the virtual reality, and then correspondingly creates a compensation sound source object in the object base space according to the environment object 175 for generating offset The sound source effect of the environment object 175 . For the user 180 at the target listening point, the existence of the environmental object 175 can also be compared to an audio source object. In practical applications, the environmental object 175 may reflect the sound from a speaker to the target listening point. Environmental objects 175 may also block or absorb some sound, attenuating the sound emitted by a speaker to the target listening point. In other words, after the control circuit 132 of this embodiment compares the environmental object 175 to a sound source object, it can correspondingly create a negative sound source object with an opposite sound source effect in the base space of the object as a means of canceling the interference. The sound source effect described in this embodiment may be a sound pressure value, an equivalent volume, or a gain value generated for a target listening point.

在流程314中，主機裝置130將補償音源物件代入物件基底補償運作而產生聲道音訊。物件基底補償運作可利用現有的物件基底聲學產品中的物件基底陣列運算模組，依據音源物件的中繼資料，進行大量與聲學交互作用相關的陣列運算。舉例來說，該補償音源物件的一中繼資料包含：該環境物件175的座標位置、大小，以及對聲音的反射率和吸收率。控制電路132依據該目標聆聽點和該中繼資料進行一物件基底補償運作，抵消環境物件175對該目標聆聽點的干擾而產生對該目標聆聽點優化的第一聲道音訊112和第二聲道音訊122。In the process 314, the host device 130 substitutes the compensated audio source object into the object base compensation operation to generate channel audio. The object-based compensation operation can use the object-based array calculation module in the existing object-based acoustic products to perform a large number of array calculations related to the acoustic interaction based on the metadata of the sound source object. For example, a metadata of the compensating sound source object includes: the coordinate position, size, and reflection rate and absorption rate of the environmental object 175 . The control circuit 132 performs an object base compensation operation according to the target listening point and the relay data to cancel the interference of the environmental object 175 on the target listening point and generate the first channel audio 112 and the second sound optimized for the target listening point. Road News 122.

在一實施例中，物件基底補償運作是分別在多個子頻帶上進行的。由於聲音傳遞的特性，每一子頻帶上的聲壓值對等效音量的影響是不同的。以第一喇叭110產生的第一聲道音訊112對環境物件175的影響為例，本實施例的控制電路132可依據環境物件175的座標位置、大小，以及對聲音的反射率和吸收率，分別在多個子頻帶上計算環境物件175受第一聲道音訊112影響而被動產生的一音源效果。接著控制電路132依據該音源效果建立該補償音源物件。在本實施例中，補償音源物件是依據環境物件175而對應建立，其中繼資料具有與該環境物件175相同的座標位置、大小，以及對聲音的反射率和吸收率，但是產生的音源效果的正負號與環境物件175的相反。In one embodiment, the object floor compensation operation is performed on a plurality of frequency sub-bands respectively. Due to the characteristics of sound transmission, the impact of the sound pressure value on each sub-band on the equivalent volume is different. Taking the influence of the first channel audio 112 generated by the first speaker 110 on the environmental object 175 as an example, the control circuit 132 of this embodiment can base on the coordinate position, size, and reflectivity and absorption rate of the environmental object 175, A sound source effect passively generated by the environmental object 175 affected by the first channel audio 112 is calculated on the plurality of frequency sub-bands respectively. Then the control circuit 132 creates the compensated sound source object according to the sound source effect. In this embodiment, the compensating sound source object is correspondingly established according to the environmental object 175, and its metadata has the same coordinate position, size, and sound reflection rate and absorption rate as the environmental object 175, but the generated sound source effect is The sign is the opposite of that of the environment object 175 .

已知人耳可聽範圍在20赫茲（Hz）到20000Hz之間。本實施例可將人耳可聽範圍切成多個子頻帶區間而分別補償。每個子頻帶的區間大小，可以是指數區間。例如，以10為底的指數區間，可將聲音訊號區分為10Hz到100Hz、100Hz到1000Hz、1000Hz到10000Hz等多個子頻帶範圍。在其他的實施例中，也可以依據播放品質的精細度的需求，以2為底或4為底來切分指數區間。在音訊處理領域的等化器（Equalizer）中已存在切割多個子頻帶的處理技術，在此不再深入解釋。The audible range of the human ear is known to be between 20 Hertz (Hz) and 20000 Hz. In this embodiment, the audible range of the human ear can be divided into a plurality of sub-band intervals for compensation respectively. The interval size of each sub-band may be an exponential interval. For example, the index range with the base 10 can divide the sound signal into multiple sub-band ranges such as 10 Hz to 100 Hz, 100 Hz to 1000 Hz, and 1000 Hz to 10000 Hz. In other embodiments, the exponent interval may be divided into base 2 or base 4 according to the fineness requirement of playback quality. In the field of audio processing, the equalizer (Equalizer) already has a processing technology for cutting multiple sub-bands, so it will not be explained in depth here.

在控制電路132獲得該補償音源物件的該負音源效果後，運行一物件基底補償運作，將該負音源效果按照該混音運作結果決定的比例，對應地混入第一聲道音訊112和第二聲道音訊122中，而藉此抵消環境物件175對該目標聆聽點的干擾。關於物件基底補償運作，將於圖11至圖13的實施例中詳述。After the control circuit 132 obtains the negative sound source effect of the compensated sound source object, an object base compensation operation is performed, and the negative sound source effect is correspondingly mixed into the first channel audio 112 and the second sound source according to the ratio determined by the mixing operation result. channel audio 122, thereby canceling the interference of the environmental object 175 to the target listening point. The operation of object base compensation will be described in detail in the embodiments of FIG. 11 to FIG. 13 .

在流程316中，主機裝置130依照流程314的運算結果，將第一聲道音訊112和第二聲道音訊122對應地輸出給第一喇叭110和第二喇叭120。圖3的流程316與圖2實施例的流程216不同。圖2是針對既有的聲道音訊計算補償值，而去調整既有的聲道音訊。圖控制電路132在進行物件基底補償運作時，直接依照所有的中繼資料而一次計算出每一喇叭對應的聲道音訊。物件基底補償運作，將需要被抵消或補償的干擾成份，以補償音源的型式混入聲道音訊中。換句話說，因為聲道音訊中包含了補償音源物件發出的補償音源，使用者180在目標聆聽點上感受不到環境物件175的存在所造成的影響。In the process 316 , the host device 130 outputs the first channel audio 112 and the second channel audio 122 to the first speaker 110 and the second speaker 120 according to the operation result of the process 314 . The process 316 in FIG. 3 is different from the process 216 in the embodiment in FIG. 2 . FIG. 2 is to calculate the compensation value for the existing channel audio to adjust the existing channel audio. When the map control circuit 132 performs the object floor compensation operation, it directly calculates the channel audio corresponding to each speaker according to all the relay data at one time. The object base compensation operation mixes the interference components that need to be canceled or compensated into the channel audio in the form of the compensation source. In other words, because the channel audio includes the compensation sound source emitted by the compensation sound source object, the user 180 cannot feel the influence caused by the existence of the environmental object 175 at the target listening point.

由流程316可知，物件基底補償運作將目標聆聽點和環境物件轉譯為物件基底聲學系統的中繼資料，並建立補償音源物件，簡化了消除干擾及優化播放效果的運算過程。需要理解的是，本實施例的音響系統100可利用感測器電路140即時或週期性地追蹤使用者180的位置而動態地更新目標聆聽點。控制電路132所進行的物件基底補償運作，也可隨著目標聆聽點的改變而同步更新目標空間170中所有和該目標聆聽點的相對位置有關的中繼資料。It can be seen from the process 316 that the object base compensation operation translates the target listening point and the environment object into the relay data of the object base acoustic system, and creates a compensation sound source object, which simplifies the calculation process of eliminating interference and optimizing the playback effect. It should be understood that the audio system 100 of this embodiment can use the sensor circuit 140 to track the position of the user 180 in real time or periodically to dynamically update the target listening point. The object floor compensation operation performed by the control circuit 132 can also synchronously update all the metadata related to the relative position of the target listening point in the target space 170 as the target listening point changes.

圖3中的流程218，與前實施例相同，為節省篇幅，不再重複說明。The process 218 in FIG. 3 is the same as that of the previous embodiment, and will not be described repeatedly in order to save space.

以下以圖4說明音響系統100動態追蹤使用者位置，運行配置程序獲取音場環境配置，並以通道基底補償運作補償喇叭輸出的實施例。The following uses FIG. 4 to illustrate an embodiment in which the audio system 100 dynamically tracks the user's position, runs the configuration program to acquire the configuration of the sound field environment, and uses channel floor compensation to compensate the speaker output.

圖4為本發明一實施例的動態音效優化方法流程圖。FIG. 4 is a flowchart of a method for optimizing dynamic sound effects according to an embodiment of the present invention.

在圖4的流程圖中，位於一特定裝置所屬欄位中的流程，即代表由該特定裝置所進行的流程。例如，標記在「感測器電路」欄位中的部分，是由感測器電路140所進行的流程；標記在「主機裝置」欄位中的部分，是由主機裝置130所進行的流程；標記在「喇叭」欄位中的部分，則是由第一喇叭110和/或第二喇叭120所進行的流程；其餘依此類推。前述的邏輯也適用於後續的其他流程圖中。In the flow chart of FIG. 4 , the process in the column of a specific device represents the process performed by the specific device. For example, the part marked in the column of "sensor circuit" is the process performed by the sensor circuit 140; the part marked in the column of "host device" is the process performed by the host device 130; The part marked in the "speaker" column is the process performed by the first speaker 110 and/or the second speaker 120; and so on for the rest. The aforementioned logic is also applicable to other subsequent flow charts.

圖4中的流程202、204、206、及208與前實施例相同，為節省篇幅，不再重複說明。The processes 202 , 204 , 206 , and 208 in FIG. 4 are the same as those in the previous embodiment, and will not be described again to save space.

在本實施例的音響系統100完成流程210時，控制電路132已追蹤使用者180的位置並指派為目標聆聽點，並且也獲得了目標空間170中的一或多個環境物件175的空間配置資訊。接著要進行流程的是，使用物件基底演算法來調整每一喇叭的聲道音訊。When the audio system 100 of this embodiment completes the process 210, the control circuit 132 has tracked the position of the user 180 and assigned it as the target listening point, and also obtained the spatial configuration information of one or more environmental objects 175 in the target space 170 . The next step in the process is to use the object-based algorithm to adjust the channel audio of each speaker.

為了消除音場環境中的干擾，音響系統100需要取得目標空間170中各種環境物件175的空間配置資訊。In order to eliminate the interference in the sound field environment, the audio system 100 needs to obtain spatial configuration information of various environmental objects 175 in the target space 170 .

在流程410中，主機裝置130中的控制電路132可運行一配置程序而獲取目標空間170中的一或多個環境物件175的空間配置資訊。前實施例中，主機裝置130採用感測器電路140捕捉的音場環境資訊而自動辨識出環境物件175的空間配置資訊。在運行配置程序時，主機裝置130可利用一人機介面電路133與使用者互動，允許使用者手動輸入環境物件175的空間配置資訊。人機介面電路133可以提供一個畫面和一種輸入方式，讓使用者將目標空間170中各種物件的空間配置資訊定義在一個二維平面圖或三維立體圖中。環境物件175的空間配置資訊，可以包含環境物件175在目標空間170中的相對位置、大小、名稱、和材質種類。在進一步衍生的實施例中，使用者180可透過人機介面電路133告訴主機裝置130當下的目標空間170所屬的應用場景類別。在不同的應用場景中，例如開闊的室外空間、劇院空間、或浴室等，常見的環境物件175類型也不盡相同，使用者感受到的音場氛圍也不同。針對應用場景的不同而優化音場，也是音響系統100的重要功能之一。In the process 410 , the control circuit 132 in the host device 130 can run a configuration program to obtain spatial configuration information of one or more environmental objects 175 in the target space 170 . In the previous embodiment, the host device 130 uses the sound field environment information captured by the sensor circuit 140 to automatically recognize the spatial configuration information of the environment object 175 . When running the configuration program, the host device 130 can use a man-machine interface circuit 133 to interact with the user, allowing the user to manually input the spatial configuration information of the environmental object 175 . The man-machine interface circuit 133 can provide a screen and an input method, allowing the user to define the spatial configuration information of various objects in the target space 170 in a two-dimensional plan view or a three-dimensional stereo view. The spatial configuration information of the environmental object 175 may include the relative position, size, name, and material type of the environmental object 175 in the target space 170 . In a further derivative embodiment, the user 180 can inform the host device 130 of the type of application scenario to which the current target space 170 belongs through the man-machine interface circuit 133 . In different application scenarios, such as open outdoor space, theater space, or bathroom, etc., the types of common environmental objects 175 are also different, and the sound field atmosphere felt by users is also different. Optimizing the sound field according to different application scenarios is also one of the important functions of the sound system 100 .

不同的材質種類，具有不同的聲學屬性。主機裝置130運行該配置程序時，還進一步地依據使用者輸入的物件名稱或材質種類，查詢一物件資料庫而獲得環境物件175的聲學屬性資訊，例如對聲音的吸收率或反射率。藉此，主機裝置130可在後續的流程212中，依據前述空間配置資訊和聲學屬性資訊，計算每一喇叭對目標聆聽點的播放效果受環境物件175影響的程度。在進一步衍生的實施例中，主機裝置130可依據目標空間170的應用場景類別，優先使用對應的物件資料庫而更快速地辨識目標空間170中的環境物件175。相關實施例將在圖9中說明。Different types of materials have different acoustic properties. When the host device 130 runs the configuration program, it further searches an object database according to the object name or material type input by the user to obtain the acoustic attribute information of the environmental object 175 , such as the absorption rate or reflectivity of sound. In this way, the host device 130 can calculate the degree to which the playback effect of each speaker on the target listening point is affected by the environmental object 175 according to the aforementioned spatial configuration information and acoustic attribute information in the subsequent process 212 . In a further derivative embodiment, the host device 130 can preferentially use the corresponding object database according to the application scenario category of the target space 170 to more quickly identify the environmental objects 175 in the target space 170 . A related embodiment will be illustrated in FIG. 9 .

圖4中的流程212，214，216及218與前實施例相同，為節省篇幅，不再重複說明。The processes 212, 214, 216 and 218 in FIG. 4 are the same as those in the previous embodiment, and will not be repeated for space saving.

圖4的實施例說明了音響系統100除了可動態追蹤使用者位置，還允許使用者180透過一配置程序而設定目標空間170中的環境物件175的空間配置資訊。該配置程序提供了手動輸入的管道，以彌補辨識功能不足之處。使用者除了透過主動輸入來協助主機裝置130進行更準確的判斷，還有機會可依據自己的偏好，刻意指定不同的應用場景類別，或是刻意設置想像中的虛擬音源物件來改變播放效果。主機裝置130會以通道基底補償運作，依據目標空間170中的環境物件175的空間配置資訊，計算每一喇叭對應的輸出補償值。The embodiment in FIG. 4 illustrates that the sound system 100 can not only dynamically track the user's position, but also allow the user 180 to set the spatial configuration information of the environmental objects 175 in the target space 170 through a configuration program. The configurator provides a channel for manual input to make up for the insufficiency of the identification function. In addition to assisting the host device 130 to make more accurate judgments through active input, the user also has the opportunity to deliberately designate different application scenarios according to his own preferences, or deliberately set imaginary virtual sound source objects to change the playback effect. The host device 130 operates with channel floor compensation, and calculates the output compensation value corresponding to each speaker according to the spatial configuration information of the environmental object 175 in the target space 170 .

以下以圖5說明音響系統100動態追蹤使用者位置，運行配置程序獲取音場環境配置，並以物件基底補償運作補償喇叭輸出的實施例。The following uses FIG. 5 to illustrate an embodiment in which the audio system 100 dynamically tracks the user's position, runs the configuration program to obtain the configuration of the sound field environment, and compensates the output of the speaker through the object base compensation operation.

圖5為本發明一實施例的動態音效優化方法流程圖。FIG. 5 is a flowchart of a method for optimizing dynamic sound effects according to an embodiment of the present invention.

在圖5的流程圖中，位於一特定裝置所屬欄位中的流程，即代表由該特定裝置所進行的流程。例如，標記在「感測器電路」欄位中的部分，是由感測器電路140所進行的流程；標記在「主機裝置」欄位中的部分，是由主機裝置130所進行的流程；標記在「喇叭」欄位中的部分，則是由第一喇叭110和/或第二喇叭120所進行的流程；其餘依此類推。前述的邏輯也適用於後續的其他流程圖中。In the flow chart of FIG. 5 , the process in the column of a specific device represents the process performed by the specific device. For example, the part marked in the column of "sensor circuit" is the process performed by the sensor circuit 140; the part marked in the column of "host device" is the process performed by the host device 130; The part marked in the "speaker" column is the process performed by the first speaker 110 and/or the second speaker 120; and so on for the rest. The aforementioned logic is also applicable to other subsequent flow charts.

圖5中的流程202、204、206、208及210與前實施例相同，為節省篇幅，不再重複說明。The processes 202 , 204 , 206 , 208 and 210 in FIG. 5 are the same as those in the previous embodiment, and will not be repeated for space saving.

與圖4的實施例類似，圖5的實施例為了消除音場環境中的干擾，運行了與圖4相同的流程410。Similar to the embodiment of FIG. 4 , the embodiment of FIG. 5 runs the same process 410 as that of FIG. 4 in order to eliminate interference in the sound field environment.

在流程410中，主機裝置130運行配置程序而獲取目標空間170中的一或多個環境物件175的空間配置資訊。在圖4的實施例中，說明了主機裝置130可透過一人機介面電路133，接收使用者手動輸入環境物件175的空間配置資訊。在進一步的衍生實施例中，主機裝置130也可利用通信電路136接收用戶設備150或其他裝置傳送而來的空間配置資訊。舉例來說，用戶設備150可以是一手機，運行有一應用程式，用以提供類似人機介面電路133的功能。該應用程式允許使用者定義目標空間170的範圍和大小、各喇叭相對該目標空間170的位置、各種環境物件175的位置、大小、名稱和類型，至是使用者180本身所在的位置。該應用程式還可透過通信電路136與控制電路132溝通，而進行各種播放運作，例如播放、暫停、快轉、調整音量等。此外，使用者可透過人機介面電路133設定目標空間170的應用場景類別，使主機裝置130對目標空間170產生多元化的播放效果。In the process 410 , the host device 130 runs a configuration program to obtain spatial configuration information of one or more environmental objects 175 in the target space 170 . In the embodiment of FIG. 4 , it is illustrated that the host device 130 can receive the spatial configuration information of the environmental object 175 manually input by the user through a man-machine interface circuit 133 . In a further derivative embodiment, the host device 130 can also use the communication circuit 136 to receive the spatial configuration information transmitted from the user equipment 150 or other devices. For example, the user equipment 150 may be a mobile phone running an application program to provide functions similar to the human-machine interface circuit 133 . The application allows the user to define the scope and size of the target space 170 , the position of each speaker relative to the target space 170 , the position, size, name and type of various environmental objects 175 , and even the location of the user 180 himself. The application program can also communicate with the control circuit 132 through the communication circuit 136 to perform various playback operations, such as playing, pausing, fast forwarding, adjusting volume, and the like. In addition, the user can set the application scene category of the target space 170 through the man-machine interface circuit 133 , so that the host device 130 can produce diversified playback effects on the target space 170 .

在進一步衍生的實施例中，主機裝置130所連接的用戶設備150可能是一個虛擬實境裝置或遊戲機。用戶設備150產生音源訊號而使主機裝置130播放。而該音源訊號中可能包含在一虛擬實境空間中游走移動的虛擬物件，例如飛機或噴火龍。用戶設備150可將這些虛擬物件的中繼資料傳送至主機裝置130中，成為目標空間170的環境物件空間配置資訊的一部份。換句話說，主機裝置130可採用物件基底的聲學系統一視同仁地處理虛擬物件和實體物件。透過物件基底補償運作，主機裝置130可讓使用者感受到目標空間170中存在一虛擬物件，也可讓使用者感受不到目標空間170存在一實體物件的干擾。關於物件基底補償運作的實作，在圖11至13的實施例中有進一步的說明。In a further derivative embodiment, the user equipment 150 connected to the host device 130 may be a virtual reality device or a game console. The user equipment 150 generates an audio signal to be played by the host device 130 . And the audio signal may include a virtual object moving in a virtual reality space, such as an airplane or a fire-breathing dragon. The user equipment 150 can transmit the metadata of these virtual objects to the host device 130 to become a part of the space configuration information of the environment objects in the target space 170 . In other words, the host device 130 can use the object-based acoustic system to treat virtual objects and physical objects equally. Through the object base compensation operation, the host device 130 can make the user feel that there is a virtual object in the target space 170 , and can also make the user not feel the interference of a physical object in the target space 170 . The implementation of the object base compensation operation is further described in the embodiments of FIGS. 11 to 13 .

在本實施例的音響系統100完成流程410時，控制電路132已追蹤使用者180的位置並指派為目標聆聽點，並且也獲得了目標空間170中的一或多個環境物件175的空間配置資訊。接著在流程312至316中，主機裝置130使用物件基底演算法來調整每一喇叭的聲道音訊。由於流程312至316，以及流程218，與前實施例相同，為節省篇幅，不再重複說明。When the audio system 100 of this embodiment completes the process 410, the control circuit 132 has tracked the position of the user 180 and assigned it as the target listening point, and also obtained the spatial configuration information of one or more environmental objects 175 in the target space 170 . Then in the processes 312 to 316, the host device 130 uses the object-based algorithm to adjust the channel audio of each speaker. Since the processes 312 to 316 and the process 218 are the same as those in the previous embodiment, the description will not be repeated in order to save space.

圖5的實施例說明了音響系統100除了可動態追蹤使用者位置，還允許使用者180透過一配置程序而設定目標空間170中的環境物件175的空間配置資訊。該配置程序可與既有的虛擬實境技術整合，接收虛擬物件的空間配置資訊。音響系統100將實體的環境物件和虛擬物件轉換為格式一致的中繼資料，再將所有中繼資料套用至既有的物件基底聲學系統的物件基底陣列運算模組中，以進行物件基底補償運作。藉此，控制電路132不需要為不同物件額外開發運算模組，可降低成本，提高執行效率。The embodiment in FIG. 5 illustrates that the audio system 100 can not only dynamically track the user's position, but also allow the user 180 to set the spatial configuration information of the environmental objects 175 in the target space 170 through a configuration program. The configuration program can be integrated with existing virtual reality technology to receive spatial configuration information of virtual objects. The audio system 100 converts physical environmental objects and virtual objects into metadata with the same format, and then applies all the metadata to the object-based array computing module of the existing object-based acoustic system to perform object-based compensation operations . In this way, the control circuit 132 does not need to develop additional computing modules for different objects, which can reduce costs and improve execution efficiency.

以下以圖6說明感測器電路的幾種實施態樣，並說明通道基底的補償算法。The following uses FIG. 6 to illustrate several implementation aspects of the sensor circuit and the compensation algorithm of the channel base.

圖6為本發明一目標空間600示意圖，用於說明依據最佳聆聽點的位置計算音訊調整量的實施例。FIG. 6 is a schematic diagram of a target space 600 according to the present invention, which is used to illustrate an embodiment of calculating an audio adjustment amount based on the position of the best listening point.

本申請的音響系統100採用感測器電路140動態地感測目標空間600而產生音場環境資訊。音場環境資訊主要包含使用者180的位置，也可包含環境物件的空間配置資訊。動態感測的技術方案可以有多種選項。舉例來說，感測器電路140可以是由攝影機610、紅外線感測器620、無線偵測器630其中之一或多者的搭配組合，分別配置在目標空間600周圍的不同位置，提供具有空間深度的音場環境資訊以助於主機裝置130中的辨識電路134和控制電路132更有效率地追蹤使用者180位置。藉此，辨識電路134利用感測器電路140提供的音場環境資訊，不止是可以辨識出使用者180位置，還可辨識出臉部面對方向、耳朵位置，甚至手勢或身體姿態。可應用於調整音場的控制因素，因此而變得更加豐富。例如，專注偵測、睡眠偵測、手勢控制等。The sound system 100 of the present application uses the sensor circuit 140 to dynamically sense the target space 600 to generate sound field environment information. The sound field environment information mainly includes the location of the user 180, and may also include spatial configuration information of the environment objects. There are many options for the technical solution of dynamic sensing. For example, the sensor circuit 140 may be a combination of one or more of the camera 610, the infrared sensor 620, and the wireless detector 630, respectively arranged in different positions around the target space 600, providing a space with The in-depth sound field environment information helps the identification circuit 134 and the control circuit 132 in the host device 130 to track the position of the user 180 more efficiently. In this way, the recognition circuit 134 uses the sound field environment information provided by the sensor circuit 140 to not only recognize the position of the user 180, but also recognize the face facing direction, ear position, and even gestures or body postures. The control factors that can be applied to adjust the sound stage are thus enriched. For example, focus detection, sleep detection, gesture control, etc.

在圖6的目標空間600中，配置有一第一喇叭110和一第二喇叭120。通道基底補償運作可分別針對每一喇叭而計算輸出補償值。在預設的情況下，目標聆聽點位於目標空間600的中心，即圖6中的第一位置601。第一位置601與第一喇叭110和第二喇叭120的距離同樣為R1。這時的第一喇叭110和第一聲道音訊112所播放的第一聲道音訊112和第二聲道音訊122也是處於預設狀態，不需要針對位置進行任何補償處理。In the target space 600 of FIG. 6 , a first speaker 110 and a second speaker 120 are disposed. The channel floor compensation operation can calculate the output compensation value for each speaker separately. In a preset situation, the target listening point is located at the center of the target space 600 , that is, the first position 601 in FIG. 6 . The distance between the first position 601 and the first horn 110 and the second horn 120 is also R1. At this time, the first-channel audio 112 and the second-channel audio 122 played by the first speaker 110 and the first-channel audio 112 are also in a default state, and no compensation processing is required for the position.

當使用者180從第一位置601沿著移動軌跡173移動到第二位置602時，感測器電路140偵測到使用者180的新位置，而將音響系統100的目標聆聽點指派為第二位置602。這時使用者180與第一喇叭110的距離改變為R2，而使用者180與第二喇叭120的距離改變為R2’。對使用者180而言，第一喇叭110變遠了，所以接收到的第一聲道音訊112因距離而衰減。相對的，第二喇叭120變近了，接收到的第二聲道音訊122增強了。換句話說，第二位置602上接收到的第一聲道音訊112和第二聲道音訊122的強度已經失去平衡。本實施例利用通道基底的算法，使第二位置602接收到的聆聽效果還原至與第一位置601相同的預設狀態。換句話說，控制電路132透過補償第一喇叭110和第二喇叭120所輸出的第一聲道音訊112和第二聲道音訊122，以抵消使用者180因移動而產生的聆聽效果偏差。圖6顯示的目標空間600並不限定於只適用於水平配置的多喇叭環境。在配置有上喇叭和下喇叭的三維音場環境中，也同樣會出現距離偏差的問題。舉例來說，如果使用者從站姿變成坐姿，就會遠離上喇叭，而接近下喇叭。When the user 180 moves from the first position 601 to the second position 602 along the moving track 173, the sensor circuit 140 detects the new position of the user 180, and assigns the target listening point of the audio system 100 as the second position. Location 602. At this time, the distance between the user 180 and the first speaker 110 is changed to R2, and the distance between the user 180 and the second speaker 120 is changed to R2'. For the user 180, the first speaker 110 becomes far away, so the received first channel audio 112 is attenuated due to the distance. On the contrary, the second speaker 120 gets closer, and the received second channel audio 122 is enhanced. In other words, the intensities of the first channel audio 112 and the second channel audio 122 received at the second location 602 are out of balance. In this embodiment, the channel-based algorithm is used to restore the listening effect received by the second location 602 to the same preset state as that of the first location 601 . In other words, the control circuit 132 compensates the first-channel audio 112 and the second-channel audio 122 outputted by the first speaker 110 and the second speaker 120 , so as to offset the listening effect deviation caused by the movement of the user 180 . The target space 600 shown in FIG. 6 is not limited to a multi-speaker environment in a horizontal configuration. In a three-dimensional sound field environment with upper speakers and lower speakers, the problem of distance deviation also occurs. For example, if the user changes from a standing position to a sitting position, the user moves away from the upper horn and approaches the lower horn.

為了獲得較佳的補償效果，本實施例採用等效音量（Equal Loudness）做為計算標準。例如，本實施例可依據ISO226;2003協議所定義的等響曲線，計算目標聆聽點上需要補償的聲壓值。每一聲道音訊是切分成多個子頻帶分別處理。此外，隨著使用者180和喇叭的距離不同，採用的音場公式也不同。由於等響曲線中定義的是等效音量與聲壓值的線性關係，而等效音量的和以「分貝」為單位的「增益值」又有線性對應關係。因此本實施例中不限定是以等效音量、聲壓值、或增益值其中任一者為單位而進行調整。In order to obtain a better compensation effect, this embodiment uses an equivalent volume (Equal Loudness) as a calculation standard. For example, this embodiment can calculate the sound pressure value to be compensated at the target listening point according to the equal loudness curve defined in the ISO226;2003 protocol. Each channel audio is divided into multiple sub-bands and processed separately. In addition, as the distance between the user 180 and the speaker is different, the adopted sound field formula is also different. Since the equal loudness curve defines the linear relationship between the equivalent volume and the sound pressure value, and the equivalent volume and the "gain value" in "decibels" have a linear correspondence. Therefore, it is not limited in this embodiment to adjust with any one of the equivalent volume, the sound pressure value, or the gain value as the unit.

在音響系統100中，因空氣振動而傳遞聲音的空間稱為音場。由於反射作用存在，聲音在密閉的房間內，音場可區分為多種類型。（1）近音場(Near Field)：當使用者180位於相對接近音源的位置，該音源的物理影響（如壓力、位移、振動）會使聲音增強作用。（2）反射音場(Reverberant Field)：聲音經過物體反射後而造成波疊加效果。（3）自由音場(Free Field)：不受到前述近音場和反射音場干擾的音場。以上反射音場及自由音場又可統稱為遠音場(Far Field)。In the acoustic system 100, the space in which sound is transmitted due to air vibration is called a sound field. Due to the existence of reflection, the sound field can be divided into many types in a closed room. (1) Near Field: When the user 180 is located relatively close to the sound source, the physical influence of the sound source (such as pressure, displacement, vibration) will enhance the sound. (2) Reverberant Field: The wave superposition effect is caused by the sound reflected by the object. (3) Free Field (Free Field): A sound field that is not disturbed by the aforementioned near sound field and reflected sound field. The above reflected sound field and free sound field can be collectively referred to as far field (Far Field).

在現今的許多音響系統中，近音場和遠音場的定義方式各有不同。舉例來說，假設R是喇叭和使用者180的距離（米），L是喇叭的面寬（米），λ是一子頻帶訊號的代表波長（米），則遠音場的滿足條件包含下列幾種類型：In many audio systems today, near and far soundstages are defined in different ways. For example, assuming that R is the distance between the speaker and the user (meters), L is the surface width of the speaker (meters), and λ is the representative wavelength (meters) of a sub-band signal, the conditions for satisfying the far sound field include the following types:

R＞＞λ/2π （1）R＞＞λ/2π （1）

R＞＞L （2）R＞＞L （2）

R＞＞πL ²/2λ （3） R>>πL ² /2λ (3)

以圖1的第一喇叭110為例。當該目標聆聽點與該第一喇叭110的距離大於該子頻帶訊號的波長或該第一喇叭110的大小的一特定比例以上時，音響系統100判斷該音場類型為一遠音場。當該目標聆聽點與該第一喇叭110的距離小於該子頻帶訊號的波長或該第一喇叭110的大小的該特定比例時，判斷該音場類型為一近音場。在一較簡易的實作中，音響系統100可將一子頻帶訊號的中央頻率所對應的波長的兩倍值（2λ），定義為該子頻帶訊號的遠音場和近音場的分界點。Take the first speaker 110 in FIG. 1 as an example. When the distance between the target listening point and the first speaker 110 is greater than the wavelength of the sub-band signal or a certain ratio of the size of the first speaker 110 , the audio system 100 determines that the sound field type is a far sound field. When the distance between the target listening point and the first speaker 110 is smaller than the wavelength of the sub-band signal or the specific ratio of the size of the first speaker 110 , it is determined that the sound field type is a near sound field. In a relatively simple implementation, the audio system 100 can define the double value (2λ) of the wavelength corresponding to the central frequency of a sub-band signal as the boundary point between the far sound field and the near sound field of the sub-band signal.

在遠音場中，使用者180從喇叭接收到的一子頻帶訊號的聲壓值變化與距離變化的關係如下：In the far sound field, the relationship between the sound pressure value change and the distance change of a sub-band signal received by the user 180 from the speaker is as follows:

SPL2 = SPL1 - 20 log ₁₀(R2/R1) （4） SPL2 = SPL1 - 20 log ₁₀ (R2/R1) (4)

其中，SPL2是新位置所收到的該子頻帶訊號的聲壓值，SPL1是原位置所收到的該子頻帶訊號的聲壓值，R2是新位置與喇叭的距離，R1是原位置與喇叭的距離。Among them, SPL2 is the sound pressure value of the sub-band signal received at the new position, SPL1 is the sound pressure value of the sub-band signal received at the original position, R2 is the distance between the new position and the speaker, and R1 is the distance between the original position and the speaker. Speaker distance.

從公式（4）可知，SPL1和SPL2的差值就是該喇叭需要被補償回來的部份。It can be seen from formula (4) that the difference between SPL1 and SPL2 is the part that needs to be compensated for the speaker.

SPL2’ = SPL2 + 20 log ₁₀(R2/R1) = SPL1 （5） SPL2' = SPL2 + 20 log ₁₀ (R2/R1) = SPL1 (5)

其中，SPL2’是補償後的新位置所收到的該子頻帶訊號的聲壓值。由公式（5）可知本實施例是將改變的部份補償回來。Wherein, SPL2' is the sound pressure value of the sub-band signal received at the new position after compensation. It can be known from formula (5) that this embodiment compensates for the changed part.

在近音場中，使用者180從喇叭接收到的該子頻帶訊號的聲壓值變與距離變化的關係如下：In the near sound field, the relationship between the sound pressure value of the sub-band signal received by the user 180 from the speaker and the distance change is as follows:

SPL2 = SPL1 - 10 log ₁₀(R2/R1) （6） SPL2 = SPL1 - 10 log ₁₀ (R2/R1) (6)

SPL2’ = SPL2 + 20 log ₁₀(R2/R1) = SPL1 （7） SPL2' = SPL2 + 20 log ₁₀ (R2/R1) = SPL1 (7)

由公式（6）和（7）可知，近音場的聲音衰減變化率較遠音場緩和，而其他計算邏輯相同。It can be seen from formulas (6) and (7) that the sound attenuation change rate of the near sound field is slower than that of the far sound field, and other calculation logics are the same.

可以理解的是，上述公式遇到一些特殊情況時可能會有例外。舉例來說，當使用者180從第一位置601移動至第二位置602而貼近第二喇叭120，使用者180與第二喇叭120的距離從R1變小為R2’，可能會使公式（7）的計算結果變成負值。但是第二喇叭120輸出的子頻帶訊號不可能是負值，最小只能降為人耳最低可聽值。例如，使第二喇叭120輸出的該子頻帶訊號的聲壓值為零。另一方面，當使用者180從第一位置601移動到第二位置602而遠離第一喇叭110時，使用者180與第一喇叭110的距離從R1拉大為R2。第一喇叭110的最大輸出極限有可能沒辦法滿足公式（5）。這時，可由音響系統100對使用者180發出超限提示。It is understandable that there may be exceptions to the above formula in some special cases. For example, when the user 180 moves from the first position 601 to the second position 602 and gets close to the second speaker 120, the distance between the user 180 and the second speaker 120 decreases from R1 to R2', which may make the formula (7 ) becomes a negative value. However, the sub-band signal output by the second speaker 120 cannot be a negative value, and the minimum can only be reduced to the lowest audible value for human ears. For example, make the sound pressure value of the sub-band signal output by the second speaker 120 zero. On the other hand, when the user 180 moves from the first position 601 to the second position 602 away from the first speaker 110 , the distance between the user 180 and the first speaker 110 increases from R1 to R2 . The maximum output limit of the first speaker 110 may not be able to satisfy formula (5). At this time, the sound system 100 may issue a prompt of exceeding the limit to the user 180 .

圖6的實施例突顯了下列的優點。透過通道基底的補償算法，使用者的最佳聆聽點不受到移動的影響。通道基底的計算方式簡易且效率高，在大部份的目標空間600中皆可適用。The embodiment of Fig. 6 highlights the following advantages. Through the channel-based compensation algorithm, the user's sweet spot is not affected by movement. The calculation method of the channel basis is simple and efficient, and is applicable in most object spaces 600 .

圖6已說明了依據使用者180移動的聲音補償方式。以下以圖7說明依據環境物件175的聲音補償方式。環境物件175的聲學屬性資訊包含對聲音的反射率和吸收率。本實施例依據環境物件175的空間配置資訊而對應地使用適當的計算方式來計算環境物件的聲學影響。FIG. 6 has illustrated the sound compensation method according to the movement of the user 180 . The sound compensation method according to the environmental object 175 will be described below with FIG. 7 . The acoustic property information of the environmental object 175 includes reflectance and absorption of sound. In this embodiment, according to the spatial configuration information of the environmental object 175 , an appropriate calculation method is used to calculate the acoustic impact of the environmental object.

圖7為本發明一目標空間700示意圖，用於說明依據環境物件的吸收率計算音訊調整量的實施例。FIG. 7 is a schematic diagram of a target space 700 according to the present invention, which is used to illustrate an embodiment of calculating an audio adjustment amount according to an absorption rate of an environmental object.

圖7顯示了在一目標空間700中，一環境物件175位於一第一喇叭110和一使用者180的中間。舉例來說，環境物件175可以是沙發或柱子。這種情況下，環境物件175可能因遮擋而造成使用者180的聆聽效果衰減。換句話說，使用者180從第一喇叭110收到的聲壓值會被遮擋或吸收。當控制電路132透過空間配置資訊而解讀出這種布局狀況時，就採用該環境物件175的吸收率來計算該第一喇叭110在目標聆聽點（使用者180的位置）上的播放效果受到環境物件175影響的程度，以決定該第一聲道音訊112需要輸出的等效音量、聲壓值或增益值。FIG. 7 shows an environmental object 175 located between a first speaker 110 and a user 180 in a target space 700 . For example, environmental object 175 may be a sofa or a pillar. In this case, the environmental object 175 may attenuate the listening effect of the user 180 due to occlusion. In other words, the sound pressure received by the user 180 from the first speaker 110 will be blocked or absorbed. When the control circuit 132 interprets this layout situation through the spatial configuration information, it uses the absorption rate of the environmental object 175 to calculate the playback effect of the first speaker 110 at the target listening point (the position of the user 180) affected by the environment. The degree of influence of the object 175 is used to determine the equivalent volume, sound pressure value or gain value that the first channel audio 112 needs to output.

在一實施例中，可依據環境物件175從第一喇叭110所接收到的聲壓值來計算環境物件175吸收掉的聲音耗損：In one embodiment, the sound loss absorbed by the environmental object 175 can be calculated according to the sound pressure value received by the environmental object 175 from the first speaker 110:

A _t[n]=R[n]*SPL _t（8） A _t [n]=R[n]*SPL _t (8)

其中，n代表子頻帶的編號。即，第一喇叭110輸出的第一聲道音訊112可切割成多個子頻帶分別計算。A _t[n]代表在時間點t上偵測到的第n個子頻帶的增益值。R[n]代表第n個子頻帶的吸收率。SPL _t代表環境物件175在第t個時間點所受到的來自第一喇叭110的聲壓值。時間點t可代表聲音從第一喇叭110傳送到環境物件175的時間差。 Wherein, n represents the number of the sub-band. That is, the first channel audio 112 output by the first speaker 110 can be divided into a plurality of sub-bands and calculated separately. _At [n] represents the gain value of the nth sub-band detected at time point t. R[n] represents the absorption rate of the nth sub-band. SPL _t represents the sound pressure value received by the environmental object 175 from the first speaker 110 at the tth time point. The time point t may represent the time difference when the sound is transmitted from the first speaker 110 to the environmental object 175 .

由公式（8）可知，A _t[n]代表一第一聲道音訊112在第n個子頻帶上被環境物件175吸收掉的增益值，也代表該第一聲道音訊112的第n個子頻帶所需要的輸出補償值。因此，控制電路132在透過第一喇叭110產生第一聲道音訊112時，使第一聲道音訊112的第n個子頻帶增益值增加該增益值A _t[n]。 It can be known from the formula (8) that A _t [n] represents the gain value of a first channel audio 112 absorbed by the environmental object 175 on the nth sub-band, and also represents the nth subband of the first channel audio 112 desired output offset value. Therefore, when the first speaker 110 generates the first channel audio 112 , the control circuit 132 increases the gain value of the nth sub-band of the first channel audio 112 by the gain value _At [n].

環境物件175位於第一喇叭110和使用者180的中間情況可能存在多種情境。本實施例以第一喇叭110和使用者180的可視線是否被遮擋為主要依據，或是進一步以第一喇叭110和使用者180的耳朵的可視線為判斷標準。可以理解的是，SPL _t本身是一個和環境物件175與第一喇叭110的距離和時間相關的函數，而計算出來的A _t[n]對使用者180造成的影響程度是一個和環境物件175與使用者180的距離和時間相關的函數。在加上不同角度的排列狀況和遠近關係的考量後，牽涉多樣化的非線性關聯性。本申請不限定公式（8）的衍生變化，例如視實作情況而加入其他的權重係數、參數、及偏移修正值。舉例來說，使用者180和第一喇叭110之間可能放置有沙發。雖然沙發沒有遮擋可視線，但還是有可能影響使用者180從第一喇叭110接收到的聲壓值。控制電路132可依據公式（8）搭配內插法或其他修正公式使補償結果更符合需求。 There may be many situations where the environmental object 175 is located between the first speaker 110 and the user 180 . In this embodiment, whether the line of sight of the first speaker 110 and the user 180 is blocked is the main basis, or further, the line of sight of the first speaker 110 and the ear of the user 180 is used as the judging criterion. It can be understood that SPL _t itself is a function related to the distance and time between the environmental object 175 and the first speaker 110, and the calculated A _t [n] has a degree of influence on the user 180 that is related to the environmental object 175. A function related to the distance and time of the user 180 . After considering the arrangement of different angles and the relationship between distance and distance, a variety of non-linear correlations are involved. The present application does not limit the derivative changes of formula (8), such as adding other weight coefficients, parameters, and offset correction values depending on the actual situation. For example, a sofa may be placed between the user 180 and the first speaker 110 . Although the sofa does not block the visible line, it may affect the sound pressure received by the user 180 from the first speaker 110 . The control circuit 132 can use the interpolation method or other correction formulas according to formula (8) to make the compensation result more in line with requirements.

圖8為本發明一目標空間800示意圖，用於說明依據環境物件的反射率計算音訊調整量的實施例。FIG. 8 is a schematic diagram of a target space 800 according to the present invention, which is used to illustrate an embodiment of calculating an audio adjustment amount based on reflectivity of environmental objects.

圖8顯示了在一目標空間800中，一使用者180位於一第一喇叭110和一環境物件175的中間。環境物件175可能是一面牆壁、天花板或地板。這種情況下，環境物件175會反彈第一喇叭110輸出的第一聲道音訊112給使用者180。換句話說，使用者180從第一喇叭110收到的聲壓值會被疊加或干擾。當控制電路132透過空間配置資訊而解讀出這種布局狀況時，就採用該環境物件175的反射率來計算該第一喇叭110在目標聆聽點（使用者180的位置）上的播放效果受到環境物件175影響的程度，以決定該第一聲道音訊112需要輸出的等效音量、聲壓值或增益值。FIG. 8 shows a user 180 located between a first speaker 110 and an environmental object 175 in a target space 800 . Environmental object 175 may be a wall, ceiling or floor. In this case, the environmental object 175 will bounce the first channel audio 112 output by the first speaker 110 to the user 180 . In other words, the sound pressure received by the user 180 from the first speaker 110 will be superimposed or interfered. When the control circuit 132 interprets the layout situation through the spatial configuration information, it uses the reflectivity of the environmental object 175 to calculate the playback effect of the first speaker 110 at the target listening point (the position of the user 180) affected by the environment. The degree of influence of the object 175 is used to determine the equivalent volume, sound pressure value or gain value that the first channel audio 112 needs to output.

在本實施例中，同樣可依據公式（8）計算環境物件175造成的影響，但將R[n]改為代表該環境物件175在第n個子頻帶上的反射率。In this embodiment, the influence caused by the environmental object 175 can also be calculated according to formula (8), but R[n] is changed to represent the reflectivity of the environmental object 175 on the nth sub-band.

公式（8）的運算結果A _t[n]可代表一第一聲道音訊112在第n個子頻帶上被環境物件175反射給使用者180的成份。因此，控制電路132在透過第一喇叭110產生第一聲道音訊112時，可適當地減少第一聲道音訊112的增益值，使使用者180從第一喇叭110和環境物件175接收的總聲壓值維持在預設的位準值。 The operation result A _t [n] of the formula (8) may represent the component of the first channel audio 112 reflected by the environmental object 175 to the user 180 on the nth sub-band. Therefore, when the control circuit 132 generates the first channel audio 112 through the first speaker 110, it can appropriately reduce the gain value of the first channel audio 112, so that the total amount received by the user 180 from the first speaker 110 and the environmental object 175 The sound pressure value is maintained at the preset level value.

與圖7的實施例類似，圖8中的使用者180位於第一喇叭110和環境物件175的中間情況可能存在多種變化情境。本實施例以第一喇叭110和環境物件175的可視線是否被使用者180遮擋為主要依據。然而，在實作中，牆壁、天花板、地板不論是位於任何角度都具有反射作用。因此本實施例的運算公式不限定於公式（8），還可能依據的排列狀況和遠近關係進一步衍生其他非線性的補償計算方式。舉例來說，目標空間800因為牆壁、天花板、地板材質和房間大小形狀格局等特徵，可被分類為不同的應用場景，例如客廳、書房、浴室、劇院、或戶外等。主機裝置130可先將目標空間800所屬的應用場景進行分類，再分別採用對應的參數或公式。Similar to the embodiment in FIG. 7 , the situation in FIG. 8 where the user 180 is located between the first speaker 110 and the environmental object 175 may have many changing scenarios. In this embodiment, it is mainly based on whether the visible line of the first speaker 110 and the environmental object 175 is blocked by the user 180 . However, in practice, walls, ceilings, and floors are reflective at any angle. Therefore, the calculation formula of this embodiment is not limited to the formula (8), and other nonlinear compensation calculation methods may be further derived based on the arrangement status and the distance relationship. For example, the target space 800 can be classified into different application scenarios, such as living room, study room, bathroom, theater, or outdoor, due to characteristics such as wall, ceiling, floor material, and room size and shape. The host device 130 can first classify the application scenarios to which the target space 800 belongs, and then adopt corresponding parameters or formulas respectively.

圖7和圖8的實施例突顯了下列的優點。透過通道基底補償運作，扺消環境物件175對使用者180的聆聽效果造成的影響。通道基底補償運作可依據環境物件的配置狀況而靈活套用不同的物件聲學屬性，可有效應付多種複雜環境的優化問題。The embodiment of Figures 7 and 8 highlights the following advantages. The effect of the environmental object 175 on the listening effect of the user 180 is eliminated through the channel floor compensation operation. The channel base compensation operation can flexibly apply different object acoustic properties according to the configuration of environmental objects, and can effectively deal with the optimization problems of various complex environments.

綜上所述，辨識電路134可接收感測器電路140的資料而辨識出目標空間170中的使用者180位置，為由控制電路132動態地將使用者180的位置指派為目標聆聽點。控制電路132針對目標聆聽點移動所做出的補償，已在圖6的實施例和公式（4）至（7）中說明。控制電路132針對環境物件175的干擾所做出的補償，已在圖7至8和公式（8）中說明。這兩種補償運算可以是分別進行並施加於聲道音訊上。換句話說，最終輸出的優化聲道音訊包含針對目標聆聽點移動所做出的補償值，也包含針對環境物件175的干擾所做出的補償。To sum up, the identification circuit 134 can receive the data from the sensor circuit 140 to identify the location of the user 180 in the target space 170 , so that the control circuit 132 dynamically assigns the location of the user 180 as the target listening point. The compensation made by the control circuit 132 for the movement of the target listening point has been described in the embodiment of FIG. 6 and formulas (4) to (7). The compensation made by the control circuit 132 for the disturbance of the environmental object 175 has been described in FIGS. 7 to 8 and formula (8). These two compensation operations can be performed separately and applied to the channel audio. In other words, the final output optimized channel audio includes the compensation value for the movement of the target listening point, and also includes the compensation for the interference of the environmental object 175 .

辨識電路134依據感測器電路140捕捉的音場環境資訊，進行使用者180的位置辨識。辨識的過程還可包含對應用場景的辨識，以助於加速控制電路132的後續運算。以下以圖9說明主機裝置130依據應用場景類別而辨識物件的過程。The identification circuit 134 identifies the location of the user 180 according to the sound field environment information captured by the sensor circuit 140 . The identification process may also include the identification of the application scene, so as to help speed up the subsequent operation of the control circuit 132 . The following uses FIG. 9 to illustrate the process of the host device 130 identifying objects according to the application scenario category.

圖9為本發明一實施例的主機裝置130辨識物件的流程圖。在不同的應用場景中出現的環境物件，其聲學屬性通常有顯著的族群關聯性，周圍環境材質或房間大小造成的音場反射係數也不同。因此，事先區別應用場景類別，有助於音響系統100提升音場優化的效率。可以理解的是，圖9中的每一流程是主機裝置130執行，但不限定是其中的單一電路或模塊所執行，也可以是多個電路的協同運作。FIG. 9 is a flow chart of identifying an object by the host device 130 according to an embodiment of the present invention. The acoustic properties of environmental objects that appear in different application scenarios usually have significant group correlations, and the sound field reflection coefficients caused by the surrounding environment materials or room sizes are also different. Therefore, distinguishing the types of application scenarios in advance helps the audio system 100 to improve the efficiency of sound field optimization. It can be understood that, each process in FIG. 9 is executed by the host device 130 , but is not limited to be executed by a single circuit or module therein, and may also be a coordinated operation of multiple circuits.

在流程902中，主機裝置130獲取目標空間170的應用場景類別。主機裝置130可透過幾種不同的方式獲取應用場景類別。在一實施例中，主機裝置130中的辨識電路134可在辨識感測器電路140提供的音場環境資訊時，依據該音場環境資訊而判斷適用的一應用場景類別。在另一實施例中，主機裝置130中的控制電路132在透過人機介面電路133運行一配置程序而獲得環境物件的空間配置資訊時，還同時透過該配置程序獲得由使用者180定義的應用場景類別。在進一步衍生的實施例中，主機裝置130中的控制電路132，可透過通信電路136而從一用戶裝置150獲得該應用場景類別的相關信息。In the process 902 , the host device 130 acquires the application scenario category of the target space 170 . The host device 130 can obtain the application scenario category in several different ways. In one embodiment, the identification circuit 134 in the host device 130 can determine an applicable application scenario category according to the sound field environment information when identifying the sound field environment information provided by the sensor circuit 140 . In another embodiment, when the control circuit 132 in the host device 130 obtains the spatial configuration information of the environmental objects by running a configuration program through the man-machine interface circuit 133, it also obtains the application defined by the user 180 through the configuration program. Scene category. In a further derivative embodiment, the control circuit 132 in the host device 130 can obtain information related to the application scenario type from a user device 150 through the communication circuit 136 .

在流程904中，為了加速環境物件的查詢並提升正確性，主機裝置130依據應用場景類別優先選用相關的物件資料庫。物件資料庫通常是事先建立好的資料集合，可由多種不同的管道提供。舉例來說，主機裝置130中的儲存電路131可預先儲存一或多個對應不同應用場景的物件資料庫。在另一實施例中，主機裝置130可利用通信電路136連接至一遠端資料庫160。遠端資料庫160中可包含對應不同應用場景的多個物件資料庫。每個物件資料庫中，包含多個環境物件的外型特徵資訊，以及聲學屬性資訊。In the process 904, in order to speed up the query of the environmental objects and improve the accuracy, the host device 130 preferentially selects the relevant object database according to the type of the application scenario. Object databases are usually pre-built collections of data that can be provided by a variety of different channels. For example, the storage circuit 131 in the host device 130 can pre-store one or more object databases corresponding to different application scenarios. In another embodiment, the host device 130 can use the communication circuit 136 to connect to a remote database 160 . The remote database 160 may include multiple object databases corresponding to different application scenarios. Each object database contains appearance feature information and acoustic attribute information of multiple environmental objects.

當主機裝置130在流程902中獲得應用場景類別後，可優先從儲存電路131中或遠端資料庫160中選擇使用與該應用場景類別相關的一物件資料庫，用於後續環境物件的辨識。在一實施例中，辨識電路134分析感測器電路140提供的音場環境資訊而獲得一或多個物件外型特徵資訊，並依據該物件外型特徵資訊檢索該物件資料庫，便可辨識出符合該物件外型特徵資訊的環境物件，包含名稱，吸收率和反射率。在另一實施例中，控制電路132執行配置程序，利用人機介面電路133而獲得一環境物件的名稱。控制電路132依據該環境物件175的名稱查找該物件資料庫，以獲得該環境物件對應的吸收率和反射率。After the host device 130 obtains the application scenario category in the process 902, it can preferentially select and use an object database related to the application scenario category from the storage circuit 131 or the remote database 160 for subsequent identification of environmental objects. In one embodiment, the recognition circuit 134 analyzes the sound field environment information provided by the sensor circuit 140 to obtain one or more object shape feature information, and searches the object database according to the object shape feature information to identify Output the environmental objects that match the shape characteristic information of the object, including name, absorption rate and reflectance rate. In another embodiment, the control circuit 132 executes the configuration program, and uses the man-machine interface circuit 133 to obtain the name of an environmental object. The control circuit 132 searches the object database according to the name of the environmental object 175 to obtain the corresponding absorptivity and reflectivity of the environmental object.

在進一步衍生的實施例中，查找的過程所使用的參數可以多元組合。例如，辨識電路134在分析音場環境資訊的過程中，可獲得環境物件175的材質、大小、形狀等外在特徵。辨識電路134將這些外在特徵資訊傳送至物件資料庫進行多條件交叉比對，而獲得依照媒合分數排序的一候選物件列表。若是在查找物件資料庫的過程中，搭配應用場景類別的資訊做為查找條件，將有助於縮小可能範圍，加速辨識，並提高正確性。In a further derivative embodiment, the parameters used in the search process may be combined in multiples. For example, the identification circuit 134 can obtain external characteristics such as material, size, and shape of the environmental object 175 during the process of analyzing the environmental information of the sound field. The identification circuit 134 sends the external feature information to the object database for multi-conditional cross-comparison to obtain a list of candidate objects sorted according to matching scores. If in the process of searching the object database, the information of the application scene category is used as the search condition, it will help to narrow the possible range, speed up the identification, and improve the accuracy.

在流程906中，控制電路132從流程904中選用的物件資料庫查找環境物件的吸收率和反射率。在實作中，物件資料庫中儲存的環境物件的聲學屬性資訊，不限定是分割成多個獨立的物件資料庫而儲存。物件資料庫可以是關聯式資料庫，包含多種欄位以相關係數的方式連接在一起。舉例來說，物件資料庫的欄位可包含物件名稱、應用場景類別、材質、吸收率、反射率、甚至是形狀、顏色、光澤等外在特徵。而每一個環境物件對應的欄位值並不限定是一對一的關係，而可以是一對多、或多對一。每一欄位中儲存的數值也未必是絕對的數值，而是範圍值或機率值。在進一步衍生的實施中，物件資料庫可以是一個可機器學習而不斷迭代修正的自適應資料庫。使用者180可透過人機介面電路133回饋偏好的設定值而訓練物件資料庫。In the process 906 , the control circuit 132 searches the absorptivity and reflectivity of the environmental object from the object database selected in the process 904 . In practice, the acoustic property information of the environmental objects stored in the object database is not limited to be divided into multiple independent object databases for storage. The object database can be an associative database, which contains various fields connected together in the form of correlation coefficients. For example, the fields of the object database may include object name, application scene type, material, absorption rate, reflectance rate, and even external characteristics such as shape, color, and luster. The field value corresponding to each environmental object is not limited to a one-to-one relationship, but can be one-to-many or many-to-one. The value stored in each column is not necessarily an absolute value, but a range value or a probability value. In a further derivative implementation, the object database can be an adaptive database that can be continuously iteratively modified by machine learning. The user 180 can feed back the preferred setting value through the man-machine interface circuit 133 to train the object database.

在流程908中，控制電路132依據環境物件的查找結果和配置狀況，在多個子頻帶中分別調整聲道音訊。環境物件175的聲學屬性，在不同的頻帶上可能有顯著的差異。舉例來說，沙發可能吸收大量高頻訊號，但是不影響低頻訊號的穿透。因此，從物件資料庫中查找出來的吸收率或反射率，可以是對應多個子頻帶的陣列值，或是一頻率響應曲線。關於子頻帶的區間大小或區隔方式，可隨設計需求而定，在本實施例中並未限定。控制電路132在多個子頻帶中分別調整聲道音訊的增益值，在實作上可類比為等化器或濾波器的概念。換句話說，控制電路132可為音響系統100中的每一喇叭實作一等化器，並依據前述實施例計算的輸出補償值客製化該等化器，使對應的聲道音訊受到調整。關於計算輸出補償值的進一步實施例，將在圖10中說明。In the process 908, the control circuit 132 adjusts the channel audio in the multiple sub-bands according to the search result and the configuration of the environmental objects. The acoustic properties of environmental objects 175 may vary significantly across different frequency bands. For example, a sofa may absorb a lot of high-frequency signals, but it does not affect the penetration of low-frequency signals. Therefore, the absorptivity or reflectivity found from the object database can be array values corresponding to multiple sub-bands, or a frequency response curve. The size of the sub-bands or the manner of partitioning may be determined according to design requirements, and is not limited in this embodiment. The control circuit 132 separately adjusts the gain values of the channel audio in the multiple sub-bands, which can be compared to the concept of an equalizer or a filter in practice. In other words, the control circuit 132 can implement an equalizer for each speaker in the audio system 100, and customize the equalizer according to the output compensation value calculated in the foregoing embodiment, so that the corresponding channel audio can be adjusted. . A further embodiment for calculating the output compensation value will be illustrated in FIG. 10 .

在流程910中，控制電路132將調整後的聲道音訊透過音訊傳輸電路135輸出至對應喇叭。關於音訊傳輸電路135的實施方式已於圖1中介紹，在此不再贅述。In the process 910 , the control circuit 132 outputs the adjusted channel audio to the corresponding speaker through the audio transmission circuit 135 . The implementation of the audio transmission circuit 135 has been introduced in FIG. 1 , and will not be repeated here.

圖9的實施例突顯了下列的優點。物件辨識的運作可參照應用場景類別（自動辨識或手動輸入）以增加辨識效率。物件資料庫採用具有擴充性的架構，在雲端大數據服務和機器學習的回饋下而持續長期地增強辨識能力。音響系統100可應用等化器的概念將聲道音訊分為多個子頻帶分別處理，使最終合成的音質有效提升。The embodiment of Fig. 9 highlights the following advantages. The operation of object recognition can refer to the application scenario category (automatic recognition or manual input) to increase the recognition efficiency. The object database adopts a scalable architecture, and the recognition ability can be continuously enhanced for a long time under the feedback of cloud big data services and machine learning. The sound system 100 can apply the concept of an equalizer to divide the channel audio into a plurality of sub-bands to be processed separately, so that the final synthesized sound quality can be effectively improved.

下列以圖10進一步說明控制電路132如何根據環境物件175的空間配置資訊而計算每個聲道的輸出補償值。The following uses FIG. 10 to further illustrate how the control circuit 132 calculates the output compensation value of each channel according to the spatial configuration information of the environmental object 175 .

圖10為本發明一實施例的音訊處理方法流程圖，說明依據環境物件的位置關係計算輸出補償值的實施例。圖10的流程主要由主機裝置130中的控制電路132所執行。FIG. 10 is a flowchart of an audio processing method according to an embodiment of the present invention, illustrating an embodiment of calculating an output compensation value according to the positional relationship of environmental objects. The process in FIG. 10 is mainly executed by the control circuit 132 in the host device 130 .

在流程1002中，控制電路132判斷環境物件、目標聆聽點與喇叭的相對位置關係。目標空間170中的多個喇叭和多個環境物件175，可與目標聆聽點排列組合出多組位置關係。每一組位置關係包含一喇叭、一環境物件175，與目標聆聽點。控制電路132為目標空間170中的每一位置關係組合進行檢查判斷並計算對應的輸出補償值。以下以音響系統100中的其中一組位置關係為例，說明控制電路132針對一環境物件175對一喇叭在目標聆聽點造成的干擾所做出的補償方式。In the process 1002, the control circuit 132 determines the relative positional relationship of the environmental object, the target listening point and the speaker. Multiple speakers and multiple environmental objects 175 in the target space 170 can be arranged and combined with the target listening point to form multiple sets of positional relationships. Each set of positional relationships includes a speaker, an environmental object 175, and a target listening point. The control circuit 132 checks and judges each positional relationship combination in the target space 170 and calculates a corresponding output compensation value. The following takes one set of positional relationships in the sound system 100 as an example to illustrate how the control circuit 132 compensates for the interference caused by an environmental object 175 to a speaker at a target listening point.

在流程1004中，控制電路132判斷環境物件175是否在目標聆聽點與喇叭中間。目標空間170中的環境物件175的位置也可由辨識電路134獲得，或是由人機介面電路133經過一配置程序而取得。控制電路132綜合上述資訊後可判斷每一環境物件175、目標聆聽點、與每一喇叭的相對位置關係，並針對每一喇叭分別進行對應的補償運算。流程1004所要判斷的情況就是如圖7所示的狀況。如果情況符合，則進行流程1008。如果情況不符合，則進行流程1006。In the process 1004, the control circuit 132 determines whether the environmental object 175 is between the target listening point and the speaker. The position of the environmental object 175 in the target space 170 can also be obtained by the identification circuit 134 , or obtained by the man-machine interface circuit 133 through a configuration program. After synthesizing the above information, the control circuit 132 can determine the relative positional relationship between each environmental object 175 , the target listening point, and each speaker, and perform corresponding compensation calculations for each speaker. The situation to be judged in the process 1004 is the situation shown in FIG. 7 . If the condition is met, go to process 1008. If not, go to process 1006.

在流程1006中，控制電路132判斷目標聆聽點是否位於環境物件175與喇叭中間。流程1006所要判斷的情況就是如圖8所示的狀況。如果情況符合，則進行流程1010。如果情況不符合，則進行流程1012。In the process 1006, the control circuit 132 determines whether the target listening point is located between the environmental object 175 and the speaker. The situation to be judged in the process 1006 is the situation shown in FIG. 8 . If the conditions are met, go to process 1010. If the situation is not met, go to process 1012.

在流程1008中，控制電路132使用環境物件175的吸收率計算聲道音訊的輸出補償值。在一較佳的實施例中，該喇叭的聲道音訊的輸出補償值是分為多個子頻帶分別計算的。詳細的計算可參考圖7的目標空間700和公式（8）。控制電路132可從物件資料庫中查找環境物件175的吸收率，並代入公式（8）中而求得輸出補償值。In the process 1008, the control circuit 132 uses the absorption rate of the environmental object 175 to calculate the output compensation value of the channel audio. In a preferred embodiment, the output compensation value of the channel audio of the speaker is divided into a plurality of sub-bands and calculated respectively. For detailed calculations, refer to the target space 700 in FIG. 7 and formula (8). The control circuit 132 can search the absorption rate of the environmental object 175 from the object database, and substitute it into the formula (8) to obtain the output compensation value.

在流程1010中，控制電路132使用環境物件175的反射率計算聲道音訊的輸出補償值。參考圖8的目標空間800和公式（8），控制電路132可從物件資料庫中查找環境物件175的反射率，並代入公式（8）中而求得輸出補償值。In the process 1010 , the control circuit 132 calculates the output compensation value of the channel audio by using the reflectivity of the environmental object 175 . Referring to the target space 800 and formula (8) in FIG. 8 , the control circuit 132 can search the reflectivity of the environmental object 175 from the object database, and substitute it into the formula (8) to obtain the output compensation value.

可以理解的是，依據環境物件175的吸收率計算出來的輸出補償值，可能使調整後的聲道音訊的增益值、聲壓值、或等效音量放大，來彌補被吸收掉的能量。相對地，依據環境物件175的反射率計算出來的輸出補償值，可能使調整後的聲道音訊的增益值、聲壓值、或等效音量降低，來平衡被反射回來的能量。換句話說，依據吸收率和反射率所計算的輸出補償值，其正負號通常是彼此相反的。It can be understood that the output compensation value calculated according to the absorption rate of the environmental object 175 may amplify the adjusted gain value, sound pressure value, or equivalent volume of the channel audio to compensate for the absorbed energy. In contrast, the output compensation value calculated according to the reflectivity of the environmental object 175 may reduce the adjusted gain value, sound pressure value, or equivalent volume of the channel audio to balance the reflected energy. In other words, the signs of the output compensation values calculated according to the absorptivity and reflectivity are usually opposite to each other.

在流程1012中，如果環境物件175不符合流程1004的條件，也不符合流程1006的條件，則控制電路132可判斷環境物件175位在一個不會影響到該喇叭對目標聆聽點播放的位置。在這種情況下，控制電路132可不為該組位置關係計算該環境物件175對該喇叭和目標聆聽點造成的影響。然而，需要理解的是，一目標空間170中通常包含多個喇叭。環境物件175不會影響到其中一個喇叭對目標聆聽點的播放，但還是可能會影響到其他喇叭對目標聆聽點的播放。換句話說，控制電路132需要為目標空間170中每一組位置關係分別進行圖10的流程。In the process 1012, if the environmental object 175 does not meet the conditions of the process 1004 and does not meet the conditions of the process 1006, then the control circuit 132 can determine that the environmental object 175 is in a position that will not affect the playback of the speaker to the target listening point. In this case, the control circuit 132 may not calculate the impact of the environmental object 175 on the speaker and the target listening point for the set of positional relationships. However, it should be understood that a target space 170 usually contains multiple speakers. The environmental object 175 will not affect the playback of one of the speakers to the target listening point, but may still affect the playback of other speakers to the target listening point. In other words, the control circuit 132 needs to perform the process shown in FIG. 10 for each set of positional relationships in the target space 170 .

在一些特定的情況下，環境物件175的存在可被直接忽略。舉例來說，如果環境物件175對聲音的反射率或吸收率小於一特定閾值，表示其在目標空間170中的存在可以忽略。另一方面，如果控制電路132判斷環境物件175的體積小於一特定大小，也可忽略環境物件175的存在。In some specific cases, the existence of the environment object 175 can be ignored directly. For example, if the reflection rate or absorption rate of the environmental object 175 to sound is less than a certain threshold, it means that its existence in the target space 170 can be ignored. On the other hand, if the control circuit 132 determines that the volume of the environmental object 175 is smaller than a certain size, the existence of the environmental object 175 may also be ignored.

在進一步衍生的實施例中，如果目標空間170中偵測到一個以上的使用者，則目標聆聽點的判斷，可依據多個使用者的位置中心點，也可以選擇性地依據其中一個使用者的位置。至於未被選擇為目標聆聽點的使用者，主機裝置130可將其類比為環境物件，依照圖7至圖8的實施例式處理。In a further derivative embodiment, if more than one user is detected in the target space 170, then the judgment of the target listening point can be based on the location centers of multiple users, or selectively based on one of the users s position. As for the users who are not selected as target listening points, the host device 130 can compare them to environmental objects, and process them according to the embodiments in FIGS. 7 to 8 .

圖10的實施例突顯了下列的優點。圖10的實施例延續圖7和圖8的處理方式，將複雜環境問題簡化為多個線性關係的問題而分別解決。針對特定情況的環境物件175，還可忽略不計以簡化計算的複雜度。The embodiment of Fig. 10 highlights the following advantages. The embodiment in FIG. 10 continues the processing methods in FIG. 7 and FIG. 8 , and simplifies complex environmental problems into multiple linearly related problems and solves them respectively. The environment object 175 for a specific situation can also be ignored to simplify the calculation complexity.

圖11為本發明一目標空間1100示意圖，用於說明以物件基底補償運作優化音場的實施例。FIG. 11 is a schematic diagram of a target space 1100 of the present invention, which is used to illustrate an embodiment of optimizing a sound field by operating object floor compensation.

在目標空間1100中，包含多個喇叭，例如第一喇叭1110、第二喇叭1120、第三喇叭1130和第四喇叭1140。在音響系統100是基於物件基底補償運作而運作的情況下，控制電路132在邏輯上將目標空間1100視為一個空間標座系統。該空間標座可以是二維平面座標或三維平面座標。為了便於說明，圖11以包含一X軸和一Y軸的二維平面座標的方式繪示說明。In the target space 1100 , a plurality of speakers are included, such as a first speaker 1110 , a second speaker 1120 , a third speaker 1130 and a fourth speaker 1140 . In the case that the audio system 100 operates based on the object basis compensation operation, the control circuit 132 logically regards the target space 1100 as a spatial coordinate system. The spatial coordinates may be two-dimensional plane coordinates or three-dimensional plane coordinates. For ease of description, FIG. 11 is illustrated in a two-dimensional plane coordinate manner including an X axis and a Y axis.

在目標空間1100中，使用者180位於原點P0。控制電路132將使用者180指派為目標聆聽點。如圖3的實施例所述，物件基底聲學系統是建立在大量聲學參數的陣列運算上。每一音源物件具有一中繼資料，用於描述該音源物件的類型、位置、大小（長寬高）、發散度（divergence）等。經過物件基底運算後，一音源物件所代表的聲音將會被指派至一或多個喇叭而共同播放，每一喇叭相對播放該音源件的一部份的聲音。換句話說，物件基底聲學系統可利用多個喇叭來模擬一個音源物件的實體存在感。舉例來說，透過物件基底補償運作，在目標聆聽點上的使用者180，可聽到一虛擬音源物件1105沿著移動軌跡1103從第一位置P1移動到新第一位置P1’。In the target space 1100, the user 180 is located at the origin P0. The control circuit 132 assigns the user 180 as the target listening point. As described in the embodiment of FIG. 3 , the object-based acoustic system is based on the array operation of a large number of acoustic parameters. Each audio source object has a metadata for describing the type, location, size (length, width, height), divergence, etc. of the audio source object. After the object-based calculation, the sound represented by an audio source object will be assigned to one or more speakers to play together, and each speaker will play a part of the sound of the audio source object. In other words, object-based acoustic systems can utilize multiple speakers to simulate the physical presence of an audio source object. For example, through the object base compensation operation, the user 180 at the target listening point can hear a virtual sound source object 1105 moving from the first position P1 to the new first position P1' along the movement track 1103 .

本實施例的物件基底補償運作可使所有喇叭輸出的聲道音訊針對目標聆聽點優化。物件基底補償運作利用既有物件基底聲學系統中的陣列運算模組，將各種距離因素和音場類別參數化，並可進行類似公式（4）到（7）的運算。對音響系統100而言，主機裝置130只需要將使用者180的位置資訊套用至物件基底補償運作中，就能使所有喇叭輸出的聲道音訊針對目標聆聽點優化。The object floor compensation operation of this embodiment can optimize the channel audio output from all the speakers for the target listening point. The object base compensation operation utilizes the array calculation module in the existing object base acoustic system to parameterize various distance factors and sound field types, and perform calculations similar to formulas (4) to (7). For the audio system 100, the host device 130 only needs to apply the location information of the user 180 to the object floor compensation operation, so that the channel audio output by all the speakers can be optimized for the target listening point.

在一實施例中，控制電路132可將目標聆聽點定義為整個空間座標系統的原點。當使用者180移動時，整個空間座標系統隨著原點而移動。換句話說，虛擬音源物件1105相對原點的位置保持不變。控制電路132透過物件基底補償運作而播放虛擬音源物件1105的效果時，使用者180感受到的虛擬音源物件1105的相對位置不會隨著使用者180的移動而改變。In one embodiment, the control circuit 132 can define the target listening point as the origin of the entire spatial coordinate system. When the user 180 moves, the entire spatial coordinate system moves along with the origin. In other words, the position of the virtual sound source object 1105 relative to the origin remains unchanged. When the control circuit 132 plays the effect of the virtual sound source object 1105 through the object base compensation operation, the relative position of the virtual sound source object 1105 felt by the user 180 will not change as the user 180 moves.

在本實施例的目標空間1100中，可能會存在有環境物件175，會對使用者180的聆聽效果造成實質影響。控制電路132可透過圖9的流程902而獲得目標空間1100中的環境物件的空間配置資訊，而得知環境物件175位在第二位置P2上。當使用者180移動時，整個空間座標系統的原點隨著使用者180而改變。環境物件175雖然沒有移動，但是與原點的相對位置改變了。因此可以理解的是，在移動後的空間座標系統中，是環境物件175的座標值往反方向移動了。In the target space 1100 of this embodiment, there may be environmental objects 175 that will substantially affect the listening effect of the user 180 . The control circuit 132 can obtain the spatial configuration information of the environmental objects in the target space 1100 through the process 902 in FIG. 9 , and know that the environmental object 175 is located at the second position P2. When the user 180 moves, the origin of the entire spatial coordinate system changes with the user 180 . Although the environment object 175 has not moved, its relative position to the origin has changed. Therefore, it can be understood that in the moved spatial coordinate system, the coordinate value of the environmental object 175 is moved in the opposite direction.

為了抵消該環境物件175對使用者180產生的干擾，本實施例的控制電路132依據環境物件175建立物件基底的補償音源物件。該補償音源物件的中繼資料包含：該環境物件175的座標位置、大小，以及對聲音的反射率和吸收率。環境物件175對聲音的反射率和吸收率，可以是由圖9的流程906所獲得。補償音源物件會被視為環境物件175的負音源物件而被套用至物件基底補償運作中，成為可抵消環境物件175的虛擬音源。In order to counteract the interference caused by the environmental object 175 to the user 180 , the control circuit 132 of this embodiment establishes an object-based compensation sound source object according to the environmental object 175 . The metadata of the compensation sound source object includes: the coordinate position, the size, and the reflection rate and absorption rate of the sound source of the environmental object 175 . The reflection rate and absorption rate of the environmental object 175 to sound can be obtained by the process 906 in FIG. 9 . The compensated sound source object will be regarded as the negative sound source object of the environmental object 175 and applied to the object base compensation operation to become a virtual sound source capable of offsetting the environmental object 175 .

可以理解的是，補償音源物件的本質是環境物件175對應的負音源物件，其所在的位置與環境物件175重疊，因此在圖11中不另標示。另外，目標空間1100的四喇叭配置只是示例。在音響系統100的實際應用中，喇叭配置數量可以更多，甚至包含上喇叭和下喇叭的立體配置。本說明不限定其他可能的配置方式。It can be understood that the essence of the compensation sound source object is the negative sound source object corresponding to the environment object 175 , and its position overlaps with the environment object 175 , so it is not marked in FIG. 11 . Additionally, the four-speaker configuration of target space 1100 is just an example. In the actual application of the sound system 100 , the number of speaker configurations can be more, even including the stereo configuration of the upper speaker and the lower speaker. This description does not limit other possible configurations.

圖11的實施例說明了物件基底補償運作的優點。控制電路132將目標空間1100的資訊轉換為空間座標系統的形式，可將複雜的多物件互動運算簡化為中繼資料的陣列運算。將移動中的使用者180的位置設定為空間座標系統的原點，使虛擬物件的處理完全不受使用者180的移動的影響，而簡化了運算流程。本實施例還提出了補償音源物件的概念，直接套用物件基底補償運作來抵消環境物件干擾，免去了複雜的多通道交互運算。The embodiment of Fig. 11 illustrates the advantages of object floor compensation operation. The control circuit 132 converts the information of the target space 1100 into the form of the space coordinate system, which can simplify the complex multi-object interaction operation into the array operation of the relay data. The position of the moving user 180 is set as the origin of the spatial coordinate system, so that the processing of the virtual object is not affected by the movement of the user 180 at all, and the calculation process is simplified. This embodiment also proposes the concept of compensating the sound source object, and directly applies the object base compensation operation to offset the interference of environmental objects, eliminating the need for complex multi-channel interactive calculations.

以下以圖12說明物件基底補償運作的簡便之處和可能的衍生應用。The following uses FIG. 12 to illustrate the convenience and possible derivative applications of the object base compensation operation.

圖12為本發明一目標空間1200示意圖，用於說明以物件基底補償運作優化音場的實施例。FIG. 12 is a schematic diagram of a target space 1200 of the present invention, which is used to illustrate an embodiment of optimizing a sound field by performing object floor compensation operation.

目標空間1200中可能包含多個喇叭，例如第一喇叭1210、第二喇叭1220、第三喇叭1230、第四喇叭1240、第五喇叭1250和第六喇叭1260，排列為一個長條形音場。每個喇叭對應一個ID。當使用者180位於第一位置P1時，一個虛擬音源物件（未繪示）的中繼資料映射至第一喇叭1210和第二喇叭1220的ID。控制電路132進行物件基底補償運作後，就會使第一喇叭1210和第二喇叭1220播放第一聲道輸出1212和第二聲道輸出1222，讓使用者180感受到該虛擬音源物件的存在。當使用者180沿著移動軌跡1203移動到第二位置P2時，控制電路132經過目標聆聽點的重新計算，將該虛擬音源物件的中繼資料映射至第五喇叭1250和第六喇叭1260。控制電路132進行物件基底補償運作後，就會使第五喇叭1250和第五喇叭1250播放第五聲道輸出1252和第六聲道輸出1262，讓使用者180感受到該虛擬音源物件依然存在於使用者180的左右，不隨著使用者180的移動而離開。The target space 1200 may contain multiple speakers, such as the first speaker 1210 , the second speaker 1220 , the third speaker 1230 , the fourth speaker 1240 , the fifth speaker 1250 and the sixth speaker 1260 , arranged as a strip sound field. Each horn corresponds to an ID. When the user 180 is at the first position P1, the metadata of a virtual sound source object (not shown) is mapped to the IDs of the first speaker 1210 and the second speaker 1220 . After the control circuit 132 performs the object base compensation operation, the first speaker 1210 and the second speaker 1220 will play the first channel output 1212 and the second channel output 1222, so that the user 180 can feel the existence of the virtual audio source object. When the user 180 moves to the second position P2 along the moving track 1203 , the control circuit 132 maps the metadata of the virtual sound source object to the fifth speaker 1250 and the sixth speaker 1260 through recalculation of the target listening point. After the control circuit 132 performs the object base compensation operation, the fifth speaker 1250 and the fifth speaker 1250 will play the fifth channel output 1252 and the sixth channel output 1262, so that the user 180 can feel that the virtual sound source object still exists The left and right sides of the user 180 do not move away with the movement of the user 180 .

本實施例主要說明，物件基底補償運作的彈性應用以及簡便之處。在許多特殊的情況下，只需要進行少量的運算，就能完成音場的優化。例如，如果使用者180位於一個球體狀的音場中，則控制電路132只需要進行旋轉座標的運算，就能讓使用者180面對各種方向時，感受一致的音場效果。This embodiment mainly illustrates the flexible application and convenience of the object base compensation operation. In many special cases, only a small amount of calculation is required to complete the optimization of the sound field. For example, if the user 180 is located in a sphere-shaped sound field, the control circuit 132 only needs to perform calculations on rotating coordinates, so that the user 180 can experience consistent sound field effects when facing various directions.

以下以圖13總結控制電路132在執行物件基底補償運作時的基本邏輯。The following summarizes the basic logic of the control circuit 132 when performing the object base compensation operation with FIG. 13 .

圖13為本發明一實施例的物件基底補償運作流程圖，說明建立補償音源物件的概念。FIG. 13 is a flow chart of object base compensation operation according to an embodiment of the present invention, illustrating the concept of creating a compensated audio source object.

在流程1304中，控制電路132依據環境物件175建立對應的補償音源物件。對位於目標聆聽點上的使用者180而言，環境物件175的存在是一個實體音源。環境物件175可能將一喇叭發出的聲音反射至該目標聆聽點。環境物件175也可能阻擋或吸收一部份聲音，使一喇叭對該目標聆聽點發出的聲音受到衰減。補償音源物件即為針對環境物件175建立的負音源物件. 在主機裝置130將補償音源物件代入物件基底補償運作而產生聲道音訊時，可將環境物件175的存在感消除。物件基底運算本身的具體細節，可延用現有的物件基底聲學產品的計算方式，利用音源物件的中繼資料進行大量相關的陣列運算。舉例來說，該補償音源物件的一中繼資料包含：該環境物件175的座標位置、大小，以及對聲音的反射率和吸收率。In the process 1304 , the control circuit 132 establishes a corresponding compensation sound source object according to the environmental object 175 . For the user 180 at the target listening point, the presence of the environmental object 175 is a physical sound source. Environmental objects 175 may reflect sound from a speaker to the target listening point. Environmental objects 175 may also block or absorb some sound, attenuating the sound emitted by a speaker to the target listening point. The compensation sound source object is a negative sound source object created for the environment object 175. When the host device 130 substitutes the compensation sound source object into the object base compensation operation to generate channel audio, the presence of the environment object 175 can be eliminated. The specific details of the object-based calculation itself can continue to use the existing object-based acoustic product calculation method, and use the metadata of the sound source object to perform a large number of related array calculations. For example, a metadata of the compensating sound source object includes: the coordinate position, size, and reflection rate and absorption rate of the environmental object 175 .

在流程1306中，控制電路132計算補償音源物件的音源效果。在本實施例中，補償音源物件是依據環境物件175而對應建立，其中繼資料具有與該環境物件175相同的座標位置、大小，以及對聲音的反射率和吸收率，但是產生的音源效果是環境物件175的反增益值。In the process 1306, the control circuit 132 calculates the sound source effect of the compensated sound source object. In this embodiment, the compensating sound source object is correspondingly established according to the environmental object 175, and its metadata has the same coordinate position, size, and sound reflection rate and absorption rate as the environmental object 175, but the generated sound source effect is Inverse buff value for environment object 175.

圖13的實施例，也可參照類似圖7和圖8的計算。公式（8）可衍生為公式（9），依據環境物件175從第一喇叭110所接收到的聲壓值來計算環境物件175被動產生的增益值：The embodiment in FIG. 13 may also refer to calculations similar to those in FIG. 7 and FIG. 8 . Formula (8) can be derived into formula (9), and the gain value passively generated by the environmental object 175 is calculated according to the sound pressure value received by the environmental object 175 from the first speaker 110 :

A _t[m][n]=R[n]*SPL _t[m] （9） A _t [m][n]=R[n]*SPL _t [m] (9)

其中，m代表喇叭編號，n代表子頻帶的編號。A _t[m][n]代表第n個子頻帶受到第m個喇叭影響而產生的增益值。R[n]代表第n個子頻帶的吸收率。SPL _t[m]代表環境物件175在第t個時間點所受到的來自第m個喇叭的聲壓值。時間點t可代表聲音從喇叭傳送到環境物件175的時間差。如果時間差大於一不可忽略的範圍，則表示目標空間170中存在有回音的狀況。 Wherein, m represents the number of the horn, and n represents the number of the sub-band. _At [m][n] represents the gain value of the nth sub-band affected by the mth speaker. R[n] represents the absorption rate of the nth sub-band. SPL _t [m] represents the sound pressure value received by the environmental object 175 from the mth speaker at the tth time point. The time point t may represent the time difference when the sound is transmitted from the speaker to the environmental object 175 . If the time difference is greater than a non-negligible range, it indicates that there is an echo in the target space 170 .

由公式（9）可知，每一環境物件對應的計算結果，包含多個喇叭及多個子頻帶在一時間點上的一增益值陣列。而補償音源物件的音源效果，就是該增益值陣列的負數值。換句話說，基於公式（9）而進行的物件基底補償運作，包含多個維度的參數交互排列組合的陣列運算。以下為便於說明起見，以其中一喇叭及其中一子頻帶於一時間點上對應的增益值來說明。It can be known from formula (9) that the calculation result corresponding to each environmental object includes an array of gain values of multiple speakers and multiple sub-bands at a time point. The sound source effect of the compensation sound source object is the negative value of the gain value array. In other words, the object base compensation operation based on the formula (9) includes an array operation in which parameters of multiple dimensions are alternately arranged and combined. For the sake of illustration, the gain value corresponding to one of the speakers and one of the sub-bands at a time point is used for illustration.

圖13的實施例與圖7和圖8的實施例類似的是，本實施例可依據環境物件175的空間配置資訊而對應地使用適當的計算方式來計算環境物件的聲學影響。例如，如果該目標聆聽點位於一喇叭和環境物件175的可視線之間時，控制電路132依據環境物件175的反射率計算補償音源物件的音源效果。相對的，如果環境物件175位於該目標聆聽點和該喇叭的可視線之間時，控制電路132依據環境物件175的吸收率計算補償音源物件的音源效果。The embodiment of FIG. 13 is similar to the embodiments of FIG. 7 and FIG. 8 in that this embodiment can use an appropriate calculation method to calculate the acoustic impact of the environmental object according to the spatial configuration information of the environmental object 175 . For example, if the target listening point is located between a speaker and the visual line of the environmental object 175 , the control circuit 132 calculates and compensates the sound source effect of the sound source object according to the reflectivity of the environmental object 175 . In contrast, if the environmental object 175 is located between the target listening point and the line of sight of the speaker, the control circuit 132 calculates and compensates the sound source effect of the sound source object according to the absorption rate of the environmental object 175 .

舉例來說，當一環境物件175吸收了一喇叭發出的聲音，使得目標聆聽點收到的音量效果減少。這時控制電路132在環境物件175的座標位置上建立一個會產生對應音量效果的虛擬音源物件做為補償。相對地，如果一環境物件175反射了一喇叭的聲音，使目標聆聽點收到過多音量。這時控制電路132在環境物件175的座標位置上建立一個具有負增益值的虛擬音源物件。For example, when an environmental object 175 absorbs the sound from a speaker, the volume effect received by the target listening point is reduced. At this time, the control circuit 132 creates a virtual sound source object that can generate a corresponding volume effect at the coordinate position of the environmental object 175 as compensation. Conversely, if an environmental object 175 reflects the sound of a speaker, the target listening point receives too much volume. At this time, the control circuit 132 creates a virtual sound source object with a negative gain value at the coordinate position of the environmental object 175 .

可以理解的是，可視線定義為兩個物件在空間中的直線連線。由於物件具有一定的體積和面積，體積可能很大，而可視線被遮擋的情況可能包含部份遮擋和完全遮擋。本實施例可依據公式（9）為基礎，再視各種情境而乘上不同的權重係數或加上不同的偏移修正量。It can be understood that the visible line is defined as a straight line connecting two objects in space. Since the object has a certain volume and area, the volume may be large, and the situation where the visible line is blocked may include partial occlusion and complete occlusion. In this embodiment, based on the formula (9), different weight coefficients or different offset corrections may be added depending on various situations.

在流程1308中，控制電路132將補償音源物件的音源效果混入聲道音訊，使對應的喇叭播放。控制電路132執行物件基底補償運作時可處理複雜的物件對應陣列運算，將每一喇叭被分配播放的多個音源訊號混合成對應的一聲道音訊。在套用物件基底補償運作之後，目標聆聽點上收到的音量效果會包含補償音源物件產生的音源效果。藉此，環境物件175造成的干擾可有效地被補償音源物件抵消。In the process 1308, the control circuit 132 mixes the sound source effect of the compensation sound source object into the channel audio, and makes the corresponding speaker play it. When the control circuit 132 performs the object floor compensation operation, it can handle complex object-to-matrix calculations, and mix multiple audio source signals assigned to each speaker into corresponding one-channel audio. After applying the object base compensation operation, the volume effect received at the target listening point will include the sound source effect produced by the compensated sound source object. In this way, the interference caused by the environmental object 175 can be effectively offset by the compensating sound source object.

在流程1310中，控制電路132判斷目標聆聽點是否移動至新位置。如流程208所述，音響系統100可持續追蹤使用者180的移動而更新目標聆聽點。如果目標聆聽點移動了，則進行流程1312。反之則持續流程1308的播放運作。In the process 1310, the control circuit 132 determines whether the target listening point is moved to a new position. As described in the process 208 , the audio system 100 can continuously track the movement of the user 180 to update the target listening point. If the target listening point has moved, go to process 1312 . Otherwise, continue the playback operation of the process 1308 .

在流程1312中，控制電路132更新補償音源物件的中繼資料。在本實施例中，控制電路132會以目標聆聽點為一座標原點而建立物件基底空間。如果目標聆聽點移動至一新位置，控制電路132將該新位置指派為物件基底空間的新座標原點。該新座標原點和原座標原點位置差，可表示為一移動向量。環境物件175相對於目標聆聽點的空間座標值也會隨著該移動向量而反向改變。控制電路132於是依據該移動向量更新環境物件175對應的補償音源物件的中繼資料。在進一步的實施例中，物件基底空間中所有喇叭也可視為一種物件，具有對應的ID、中繼資料和座標值。In the process 1312, the control circuit 132 updates the metadata of the compensated audio source object. In this embodiment, the control circuit 132 establishes the object base space with the target listening point as the coordinate origin. If the target listening point moves to a new location, the control circuit 132 assigns the new location as the new coordinate origin of the object base space. The position difference between the new coordinate origin and the original coordinate origin can be expressed as a moving vector. The spatial coordinates of the environmental object 175 relative to the target listening point will also change inversely with the moving vector. The control circuit 132 then updates the metadata of the compensated sound source object corresponding to the environmental object 175 according to the motion vector. In a further embodiment, all the speakers in the object base space can also be regarded as an object, and have corresponding IDs, metadata and coordinates.

在另一實施例中，音響系統100不限定要以目標聆聽點為座標原點。音響系統100也可以採用一固定參考點為物件基底空間的原點。當物件基底空間中的音源物件出現相對位置的變化時，控制電路132對應地更新音源物件的中繼資料中的座標值。In another embodiment, the audio system 100 is not limited to take the target listening point as the coordinate origin. The audio system 100 can also adopt a fixed reference point as the origin of the object base space. When the relative position of the sound source object in the object base space changes, the control circuit 132 correspondingly updates the coordinate value in the metadata of the sound source object.

當流程1312完成後，控制電路132重複流程1308。After the process 1312 is completed, the control circuit 132 repeats the process 1308 .

圖13的實施例說明了物件基底補償運作的優點。控制電路132將目標空間1100的資訊轉換為空間座標系統的形式，可將複雜的多物件互動運算簡化為中繼資料的陣列運算。將移動中的使用者180的位置設定為空間座標系統的原點，使虛擬物件的處理完全不受使用者180的移動的影響，而簡化了運算流程。本實施例還提出了補償音源物件的概念，直接套用物件基底補償運作來抵消環境物件干擾，免去了複雜的多通道交互運算。The embodiment of Fig. 13 illustrates the advantages of object floor compensation operation. The control circuit 132 converts the information of the target space 1100 into the form of the space coordinate system, which can simplify the complex multi-object interaction operation into the array operation of the relay data. The position of the moving user 180 is set as the origin of the spatial coordinate system, so that the processing of the virtual object is not affected by the movement of the user 180 at all, and the calculation process is simplified. This embodiment also proposes the concept of compensating the sound source object, and directly applies the object base compensation operation to offset the interference of environmental objects, eliminating the need for complex multi-channel interactive calculations.

在進一步衍生的實施例中，如果主機裝置130本身不具備物件基底的混音運算能力，控制電路132可透過執行軟件的方式提供通道映射(Channel mapping)的功能，使物件基底的運算結果能夠正確地對應至每個喇叭。In a further derivative embodiment, if the host device 130 itself does not have the object-based mixing operation capability, the control circuit 132 can provide the function of channel mapping (Channel mapping) by executing software, so that the object-based operation result can be correct Corresponds to each horn.

綜上所述，本申請提出了一種音響系統100，可動態追蹤使用者位置而優化音場，也可智能地消除環境物件造成的干擾。追蹤使用者位置的手段，可以是攝影機、紅外線、或無線偵測器等多種方式的各別運用或組合運用。目標空間170中的環境物件175的空間配置資訊可以是由攝影機捕捉的影像經過辨識後獲得，也可以是透過使用者手動輸入。優化音場的方式可以是通道基底補償運作或物件基底補償運作。計算環境物件175對目標聆聽點的影響時，還可考慮環境物件175和喇叭、和目標聆聽點的相對位置關係，而採用不同的計算方式。使用物件基底補償運作時，控制電路132為每個環境物件175建立對應的補償音源物件，使得最後混音產生的聲道音訊，消除了環境物件175對目標聆聽點造成的干擾。To sum up, the present application proposes an audio system 100 that can dynamically track the user's position to optimize the sound field, and can also intelligently eliminate the interference caused by environmental objects. The means of tracking the user's location can be the separate use or combination of various methods such as cameras, infrared rays, or wireless detectors. The spatial configuration information of the environmental objects 175 in the target space 170 can be obtained through recognition of images captured by the camera, or can be manually input by the user. The way to optimize the sound field can be channel floor compensation operation or object floor compensation operation. When calculating the impact of the environmental object 175 on the target listening point, the relative positional relationship between the environmental object 175 and the loudspeaker and the target listening point can also be considered, and different calculation methods can be adopted. When using the object base compensation operation, the control circuit 132 establishes a corresponding compensation audio source object for each environmental object 175 , so that the channel audio generated by the final mixing eliminates the interference caused by the environmental object 175 to the target listening point.

在說明書及申請專利範圍中使用了某些詞彙來指稱特定的元件，而本領域內的技術人員可能會用不同的名詞來稱呼同樣的元件。本說明書及申請專利範圍並不以名稱的差異來做為區分元件的方式，而是以元件在功能上的差異來做為區分的基準。在說明書及申請專利範圍中所提及的「包含」爲開放式的用語，應解釋成「包含但不限定於」。另外，「耦接」一詞在此包含任何直接及間接的連接手段。因此，若文中描述第一元件耦接於第二元件，則代表第一元件可通過電性連接或無線傳輸、光學傳輸等信號連接方式而直接地連接於第二元件，或通過其它元件或連接手段間接地電性或信號連接至第二元件。Certain words are used to refer to specific elements in the specification and scope of claims, but those skilled in the art may use different terms to refer to the same element. This specification and the scope of the patent application do not use the difference in name as a way to distinguish components, but use the difference in function of components as a basis for differentiation. The "comprising" mentioned in the specification and scope of patent application is an open term and should be interpreted as "including but not limited to". In addition, the term "coupled" herein includes any direct and indirect means of connection. Therefore, if it is described that the first element is coupled to the second element, it means that the first element can be directly connected to the second element through electrical connection or signal connection means such as wireless transmission or optical transmission, or through other elements or connections. The means is indirectly electrically or signally connected to the second element.

在說明書中所使用的「和/或」的描述方式，包含所列舉的其中一個項目或多個項目的任意組合。另外，除非說明書中特別指明，否則任何單數格的用語都同時包含複數格的含義。The description of "and/or" used in the specification includes any combination of one or more of the listed items. In addition, unless otherwise specified in the specification, any singular term also includes a plural meaning.

以上僅為本發明的較佳實施例，凡依本發明請求項所做的等效變化與修改，皆應屬本發明的涵蓋範圍。The above are only preferred embodiments of the present invention, and all equivalent changes and modifications made according to the claims of the present invention shall fall within the scope of the present invention.

100:音響系統（audio system）100: Audio system (audio system)

110:第一喇叭（first speaker）110: first speaker

112:第一聲道音訊（first channel signal）112: first channel audio (first channel signal)

120:第二喇叭（second speaker）120: second speaker

122:第二聲道音訊（second channel signal）122: Second channel audio (second channel signal)

130:主機裝置（host device）130: host device (host device)

131:儲存電路（storage circuit）131: Storage circuit (storage circuit)

132:控制電路（control circuit）132: Control circuit (control circuit)

133:人機介面電路（user interface circuit）133: Human-machine interface circuit (user interface circuit)

134:辨識電路（recognizer circuit）134: Identification circuit (recognizer circuit)

135:音訊傳輸電路（audio transmission circuit）135: audio transmission circuit (audio transmission circuit)

136:通信電路（communication circuit）136: Communication circuit (communication circuit)

140:感測器電路（sensor circuit）140: sensor circuit (sensor circuit)

150:用戶設備（user equipment）150: user equipment (user equipment)

160:遠端資料庫（remote database）160: remote database

170:目標空間（target space）170: target space

171:第一位置（first position）171: first position (first position)

172:第二位置（second position）172: Second position (second position)

173:移動軌跡（movement trail）173:Movement trail

175:環境物件（ambient object）175:ambient object

180:使用者（user）180: user (user)

202～218:流程（operation）202～218: Process (operation)

312～316:流程（operation）312～316: Process (operation)

410:流程（operation）410: Process (operation)

600:目標空間（target space）600: Target space (target space)

601:第一位置（first location）601: first location

602:第二位置（first location）602: second location (first location)

610:攝影機（camera）610: camera

620:紅外線感測器（infrared sensor）620: infrared sensor (infrared sensor)

630:無線偵測器（radio detector）630: wireless detector (radio detector)

700:目標空間（target space）700: Target space (target space)

800:目標空間（target space）800: Target space (target space)

902～910:流程（operation）902～910: Process (operation)

1002～1012:流程（operation）1002～1012: Process (operation)

1100:目標空間（target space）1100: Target space (target space)

1103:物件移動軌跡（movement trail）1103: Object movement track (movement trail)

1105:虛擬音源物件（virtual audio object）1105: Virtual audio object (virtual audio object)

1110:第一喇叭（first speaker）1110: first speaker (first speaker)

1120:第二喇叭（second speaker）1120: Second speaker (second speaker)

1130:第三喇叭（third speaker）1130: third speaker (third speaker)

1140:第四喇叭（fourth speaker）1140: fourth speaker

P0:原點（original point）P0: origin (original point)

P1:第一位置（first position）P1: first position (first position)

P1’:新第一位置（new first position）P1': new first position

P2:第二位置（second position）P2: second position (second position)

1200:目標空間（target space）1200: Target space (target space)

1201:目標聆聽點（target listening spot）1201: target listening spot

1203:移動軌跡（movement trail）1203: Movement trail (movement trail)

1210:第一喇叭（first speaker）1210: first speaker (first speaker)

1212:第一聲道輸出（first channel output）1212: first channel output (first channel output)

1220:第二喇叭（second speaker）1220: Second speaker (second speaker)

1222:第二聲道輸出（second channel output）1222: Second channel output (second channel output)

1230:第三喇叭（third speaker）1230: third speaker (third speaker)

1240:第四喇叭（fourth speaker）1240: fourth speaker (fourth speaker)

1250:第五喇叭（fifth speaker）1250: fifth speaker (fifth speaker)

1252:第五聲道輸出（fifth channel output）1252: Fifth channel output (fifth channel output)

1260:第六喇叭（sixth speaker）1260: sixth speaker (sixth speaker)

1262:第六聲道輸出（sixth channel output）1262: sixth channel output (sixth channel output)

1304～1312:流程（operation）1304～1312: Process (operation)

圖1為本發明一實施例的音響系統的功能方塊圖。FIG. 1 is a functional block diagram of an audio system according to an embodiment of the present invention.

圖6為本發明一目標空間示意圖，用於說明依據最佳聆聽點的位置計算音訊調整量的實施例。FIG. 6 is a schematic diagram of a target space according to the present invention, which is used to illustrate an embodiment of calculating an audio adjustment amount based on the position of the best listening point.

圖7為本發明一目標空間示意圖，用於說明依據環境物件的吸收率計算音訊調整量的實施例。FIG. 7 is a schematic diagram of a target space of the present invention, which is used to illustrate an embodiment of calculating an audio adjustment amount based on the absorption rate of an environmental object.

圖8為本發明一目標空間示意圖，用於說明依據環境物件的反射率計算音訊調整量的實施例。FIG. 8 is a schematic diagram of a target space according to the present invention, which is used to illustrate an embodiment of calculating an audio adjustment amount according to the reflectivity of an environmental object.

圖9為本發明一實施例的主機裝置辨識物件的流程圖。FIG. 9 is a flow chart of identifying an object by a host device according to an embodiment of the invention.

圖10為本發明一實施例的音訊處理方法流程圖，說明依據環境物件的位置關係計算輸出補償值的實施例。FIG. 10 is a flowchart of an audio processing method according to an embodiment of the present invention, illustrating an embodiment of calculating an output compensation value according to the positional relationship of environmental objects.

圖11為本發明一目標空間示意圖，用於說明以物件基底補償運作優化音場的實施例。FIG. 11 is a schematic diagram of a target space of the present invention, which is used to illustrate an embodiment of optimizing a sound field by operating object floor compensation.

圖12為本發明一目標空間示意圖，用於說明以物件基底補償運作優化音場的實施例。FIG. 12 is a schematic diagram of a target space of the present invention, which is used to illustrate an embodiment of optimizing a sound field by operating object floor compensation.

圖13為本發明一實施例的物件基底補償運作流程圖。FIG. 13 is a flow chart of an object base compensation operation according to an embodiment of the present invention.

100:音響系統 100: Audio system

110:第一喇叭 110: The first horn

112:第一聲道音訊 112: First channel audio

120:第二喇叭 120: second horn

122:第二聲道音訊 122:Second channel audio

130:主機裝置 130: host device

131:儲存電路 131: storage circuit

132:控制電路 132: control circuit

133:人機介面電路 133: Human-machine interface circuit

134:辨識電路 134: Identification circuit

135:音訊傳輸電路 135: audio transmission circuit

136:通信電路 136: Communication circuit

140:感測器電路 140: Sensor circuit

150:用戶設備 150: user equipment

160:遠端資料庫 160:Remote database

170:目標空間 170: target space

171:第一位置 171: First position

172:第二位置 172: second position

173:移動軌跡 173:Movement track

175:環境物件 175: Environmental objects

180:使用者 180: user

Claims

An audio system (100), which can dynamically optimize the playback effect according to the user's position, comprising: A sensor circuit (140), configured to dynamically sense a target space (170) to generate sound field environmental information; A first speaker (110) and a second speaker (120), set to play audio; A host device (130), coupled to the sensor circuit (140), the first speaker (110) and the second speaker (120), comprising: An identification circuit (134), configured to identify a user from the sound field environment information and determine a user position of the user in the target space; a control circuit (132), coupled to the identification circuit (134), configured to dynamically assign the user's position as a target listening point; and An audio transmission circuit (135), coupled to the control circuit (132), the first speaker (110) and the second speaker (120), configured to transmit audio; Wherein, the sensor circuit (140) includes a camera (610), configured to capture a sound field environment image of the target space (170); Wherein, the identification circuit (134) analyzes the sound field environment image to obtain spatial configuration information and acoustic attribute information of an environmental object (175) in the target space (170); Wherein, the control circuit (132) performs a channel floor compensation operation to generate a first channel optimized for the target listening point according to the target listening point, and the spatial configuration information and acoustic property information of the environmental object (175) audio (112) and a second channel audio (122). Wherein, the control circuit (132) respectively outputs the first channel audio (112) and the second channel audio (122) to the corresponding first speaker (110) and the first speaker (110) through the audio transmission circuit (135). Second horn (120).

The audio system (100) as claimed in claim 1, wherein the channel floor compensation operation includes: splitting the first channel audio (112) emitted by the first speaker (110) into a plurality of sub-band signals; According to the wavelength of a sub-band signal among the plurality of sub-band signals and the distance between the target listening point and the first speaker (110), it is judged that a sound field type generated by the sub-band signal at the target listening point belongs to a near sound field or a far sound field; When the distance between the target listening point and the first speaker (110) is greater than the wavelength of the sub-band signal or a specific ratio of the size of the first speaker (110), the sound field type is determined to be a far sound field; and When the distance between the target listening point and the first speaker (110) is smaller than the wavelength of the sub-band signal or the specific ratio of the size of the first speaker (110), it is judged that the sound field type is a near sound field.

The sound system (100) according to claim 2, wherein the channel floor compensation operation further includes, when the control circuit (132) determines the distance between the position of the target listening point and the first speaker (110) from a first When a distance (R1) is changed to a second distance (R2), and the sound field type belongs to the far sound field, a far sound field formula is used to calculate the sound pressure value of the sub-band signal; wherein, the far sound field formula includes: SPL ' = SPL + 20 log ₁₀ (R2/R1) wherein, SPL is the sound pressure value of the sub-band signal before adjustment, SPL' is the sound pressure value of the sub-band signal after adjustment, R1 is the first distance, R2 is the second distance; and wherein, if the adjusted sound pressure value of the sub-band signal is less than zero, the control circuit (132) makes the adjusted sound pressure value of the sub-band signal zero.

The sound system (100) according to claim 2, wherein the channel floor compensation operation further includes, when the control circuit (132) determines the distance between the position of the target listening point and the first speaker (110) from a first When a distance (R1) is changed to a second distance (R2), and the sound field type belongs to the near sound field, a near sound field formula is used to calculate the sound pressure value of the sub-band signal; wherein, the near sound field formula Including: SPL' = SPL + 10 log ₁₀ (R2/R1) Among them, SPL is the sound pressure value of the sub-band signal before adjustment, SPL' is the sound pressure value of the sub-band signal after adjustment, and R1 is the sound pressure value of the sub-band signal A distance, R2 is the second distance; and wherein, if the adjusted sound pressure value of the sub-band signal is less than zero, the control circuit (132) makes the adjusted sound pressure value of the sub-band signal zero.

The sound system (100) according to claim 2, wherein the spatial configuration information of the environmental object includes the location, size and appearance characteristics of the environmental object, and the acoustic property information of the environmental object includes the reflection rate and absorption rate of sound ; Wherein, when the control circuit (132) generates the first channel audio, if the target listening point is between the first speaker (110) and the visual line of the environmental object (175), the control circuit (132 ) Calculate the degree to which the playback effect of the first speaker (110) at the target listening point is affected by the environmental object (175) according to the reflectivity of the environmental object, so as to determine the sound pressure of the first channel audio (112) value; and Wherein, when the control circuit (132) generates the first channel audio, if the environmental object (175) is located between the target listening point and the visible line of the first speaker (110), the control circuit (132 ) Calculate the degree to which the playback effect of the first speaker (110) at the target listening point is affected by the environmental object (175) according to the absorption rate of the environmental object, so as to determine the sound pressure of the first channel audio (112) value.

The sound system (100) according to claim 2, wherein the recognition circuit (134) dynamically recognizes the user's head position, face direction, or ear position to determine the user's position.

The sound system (100) according to claim 2, wherein the sensor circuit (140) further includes an infrared sensor (620), configured to capture a thermal imaging data in the target space; the identification The circuit (134) analyzes the movement track of the thermal imaging data to dynamically determine the user's position.

The audio system (100) according to claim 2, wherein the sensor circuit (140) further includes a wireless detector (630), which is arranged in the target space and detects a wireless signal of an electronic device ; Wherein, the identification circuit (134) dynamically locates the position of the electronic device according to the characteristics of the wireless signal detected by the wireless detector (630); and Wherein, the identification circuit (134) dynamically judges the user's position according to the position of the electronic device.

The audio system (100) according to claim 2, wherein the host device (130) further includes a storage circuit (131), coupled to the control circuit (132), configured to store one or more object databases, Each of the object databases corresponds to an application scenario category, and includes appearance feature information and acoustic property information of multiple environmental objects; Wherein, when the identification circuit (134) analyzes the sound field environment information, it identifies the sound field environment information to determine an applicable application scenario category; and Wherein, the control circuit (132) preferentially selects and uses an object database related to the application scenario category from the storage circuit (131) according to the application scenario category to identify the environmental object and search for the acoustic properties of the environmental object Information.

The audio system (100) according to claim 2, wherein the host device (130) further includes a communication circuit (136), coupled to the control circuit (132), configured to be controlled by the control circuit (132) Controlled and connected to a remote database (160) corresponding to the application scenario category; the remote database (160) is configured to store one or more object databases, wherein each object database corresponds to an application scenario category , including appearance feature information and acoustic attribute information of multiple environmental objects; Wherein, when the identification circuit (134) analyzes the sound field environment information, it identifies the sound field environment information to determine an applicable application scenario category; and Wherein, the control circuit (132) preferentially selects and uses an object database related to the application scenario category from the remote database (160) according to the application scenario category to identify the environment object and search for the environment object. Acoustic property information.