TW202242856A

TW202242856A - Open-back headphones

Info

Publication number: TW202242856A
Application number: TW111114832A
Authority: TW
Inventors: 肖樂; 鄭金波; 張承乾; 廖風雲; 齊心
Original assignee: 大陸商深圳市韶音科技有限公司
Priority date: 2021-04-25
Filing date: 2022-04-19
Publication date: 2022-11-01
Also published as: CN115240697A; WO2022226696A1; WO2022227056A1; CN117501710A

Abstract

The present disclosure may disclose an open-back headphone, including: a fixed structure, configured to fix the headphone at the position near the user's ears and does not block the ear channel; and a shell structure, configured to bear: a first microphone array configured to collect environmental noise; at least one speaker array; and a signal processor configured to: estimate the noise of a first space position based on the collected environmental noise, the first space position is closer to the user's ear channel than any microphone in the first microphone array; and generate denoise signal based on the noise of the first space position, so that the at least one speaker array may output a denoise wave according to the denoise signal, and the denoise wave may be used to eliminate the environmental noise transmitted to the user's ear channel.

Description

open earphones

本申請案涉及聲學領域，特別涉及一種開放式耳機。This application relates to the field of acoustics, in particular to an open earphone.

本申請案主張於2021年4月25日提交之申請號為PCT/CN2021/089670的國際專利申請案的優先權，其全部內容通過引用的方式併入本文。This application claims priority to International Patent Application No. PCT/CN2021/089670 filed on April 25, 2021, the entire contents of which are incorporated herein by reference.

耳機設備允許使用者在收聽音訊內容、進行語音通話的同時保證使用者互動內容的私密性，且收聽時不打擾到周圍人群。耳機設備通常可以分為入耳式耳機設備和開放式耳機設備兩類。入耳式耳機設備在使用過程中會堵塞使用者耳部，且使用者在長時間佩戴時容易產生堵塞、異物、脹痛等感受。開放式耳機設備可以開放使用者耳部，有利於長期佩戴，但當外界雜訊較大時，其降噪效果不明顯，使得使用者收聽體驗較差。The headset device allows users to listen to audio content and make voice calls while ensuring the privacy of the user's interactive content, and does not disturb the surrounding people when listening. Headphone devices can generally be classified into two types: in-ear headphone devices and open-type headphone devices. The in-ear earphone device will block the user's ear during use, and the user is prone to experience blockage, foreign objects, swelling pain and the like when wearing it for a long time. The open earphone device can open the user's ear, which is beneficial to long-term wearing, but when the external noise is large, its noise reduction effect is not obvious, making the user's listening experience poor.

因此，希望提供一種開放式耳機，可以開放使用者雙耳且提高使用者聽感體驗。Therefore, it is desirable to provide an open earphone that can open both ears of the user and improve the user's listening experience.

本發明實施例提供一種開放式耳機，包括：固定結構，被配置為將所述耳機固定在使用者耳朵附近且不堵塞使用者耳道的位置；殼體結構，被配置為承載：第一麥克風陣列，被配置為拾取環境雜訊；至少一個揚聲器陣列；以及信號處理器，被配置為：基於拾取的所述環境雜訊估計第一空間位置的雜訊，所述第一空間位置比所述第一麥克風陣列中任一麥克風更加靠近所述使用者耳道；且基於所述第一空間位置的雜訊產生降噪信號，使得所述至少一個揚聲器陣列根據所述降噪信號輸出降雜訊波，所述降雜訊波用於消除傳遞到所述使用者耳道的所述環境雜訊。An embodiment of the present invention provides an open earphone, comprising: a fixing structure configured to fix the earphone near the user's ear without blocking the user's ear canal; a shell structure configured to carry: a first microphone an array configured to pick up ambient noise; at least one loudspeaker array; and a signal processor configured to: estimate noise at a first spatial location based on the picked up ambient noise, the first spatial location being smaller than the Any microphone in the first microphone array is closer to the user's ear canal; and a noise reduction signal is generated based on the noise at the first spatial position, so that the at least one speaker array outputs a noise reduction signal according to the noise reduction signal waves, the noise reduction waves are used to eliminate the environmental noise transmitted to the user's ear canal.

在一些實施例中，所述殼體結構被配置為容納第二麥克風陣列，所述第二麥克風陣列被配置為拾取所述環境雜訊和所述降雜訊波，所述第二麥克風陣列至少部分區別於所述第一麥克風陣列；且所述信號處理器被配置為基於所述第二麥克風陣列拾取的聲音信號來更新所述降噪信號。In some embodiments, the housing structure is configured to accommodate a second microphone array configured to pick up the environmental noise and the de-noise wave, the second microphone array being at least partially distinct from the first microphone array; and the signal processor is configured to update the noise reduction signal based on sound signals picked up by the second microphone array.

在一些實施例中，基於所述第二麥克風陣列拾取的所述聲音信號來更新所述降噪信號包括：基於所述第二麥克風陣列拾取的所述聲音信號，對所述使用者耳道處的聲場進行估計；且根據所述使用者耳道處的所述聲場，調整所述降噪信號的參數資訊。In some embodiments, updating the noise reduction signal based on the sound signal picked up by the second microphone array includes: based on the sound signal picked up by the second microphone array, Estimating the sound field of the user; and adjusting parameter information of the noise reduction signal according to the sound field at the ear canal of the user.

在一些實施例中，所述第二麥克風陣列包括一個比所述第一麥克風陣列中任意麥克風更加靠近所述使用者耳道的麥克風。In some embodiments, the second array of microphones includes a microphone that is closer to the user's ear canal than any microphone in the first array of microphones.

在一些實施例中，所述信號處理器基於拾取的所述環境雜訊估計所述第一空間位置的雜訊包括：根據拾取的所述環境雜訊進行信號分離，獲取所述環境雜訊對應的參數資訊，基於所述參數資訊產生所述降噪信號。In some embodiments, the signal processor estimating the noise at the first spatial position based on the picked-up environmental noise includes: performing signal separation according to the picked-up environmental noise, and obtaining the environmental noise corresponding Parameter information of , generating the noise reduction signal based on the parameter information.

在一些實施例中，所述信號處理器基於拾取的所述環境雜訊估計所述第一空間位置的雜訊包括：確定一個或多個與拾取的所述環境雜訊有關的空間噪音源；且基於所述空間噪音源，估計所述第一空間位置的雜訊。In some embodiments, the signal processor estimating noise at the first spatial location based on the picked-up environmental noise comprises: determining one or more spatial noise sources related to the picked-up environmental noise; And based on the spatial noise source, the noise of the first spatial location is estimated.

在一些實施例中，確定一個或多個與拾取的所述環境雜訊有關的所述空間噪音源包括：將拾取的所述環境雜訊劃分為多個子帶，每個子帶對應不同的頻率範圍；以及在至少一個子帶上，確定與其對應的所述空間噪音源。In some embodiments, determining one or more of the spatial noise sources related to the picked-up environmental noise includes: dividing the picked-up environmental noise into a plurality of sub-bands, each sub-band corresponding to a different frequency range ; and on at least one subband, determining the spatial noise source corresponding thereto.

在一些實施例中，所述第一麥克風陣列包括第一子麥克陣列和第二子麥克風陣列，所述第一子麥克風陣列和所述第二子麥克風陣列分別位於使用者的左耳和右耳處，確定與所述至少一個子帶對應的所述空間噪音源包括：獲取使用者頭函數，所述使用者頭函數反映使用者頭部對聲音的反射或吸收情況；以及在所述至少一個子帶上，結合所述第一子麥克風陣列拾取的所述環境雜訊、所述第二子麥克風陣列拾取的所述環境雜訊，以及所述使用者頭函數，確定與其對應的所述空間噪音源。In some embodiments, the first microphone array includes a first sub-microphone array and a second sub-microphone array, and the first sub-microphone array and the second sub-microphone array are respectively located on the left ear and the right ear of the user. where, determining the spatial noise source corresponding to the at least one sub-band includes: obtaining a user head function, the user head function reflects the reflection or absorption of the sound by the user's head; and in the at least one On the sub-band, combine the environmental noise picked up by the first sub-microphone array, the environmental noise picked up by the second sub-microphone array, and the user head function to determine the corresponding spatial noise source.

在一些實施例中，所述第一麥克風陣列包括一個雜訊麥克風，所述至少一個揚聲器陣列形成至少一組聲學偶極子，且所述雜訊麥克風位於偶極子輻射聲場的聲學零點處。In some embodiments, the first microphone array includes a noise microphone, the at least one loudspeaker array forms at least one set of acoustic dipoles, and the noise microphone is located at an acoustic null of the dipole radiation sound field.

在一些實施例中，所述至少一個麥克風陣列包括骨傳導麥克風，所述骨傳導麥克風被配置以拾取使用者的說話聲音，所述信號處理器基於拾取的所述環境雜訊估計所述第一空間位置的雜訊包括：從拾取的所述環境雜訊中去除與所述骨傳導麥克風拾取的信號相關聯的成分，以更新所述環境雜訊；以及根據更新後的所述環境雜訊估計所述第一空間位置的雜訊。In some embodiments, the at least one microphone array includes a bone conduction microphone configured to pick up the voice of a user, and the signal processor estimates the first The noise of the spatial position includes: removing components associated with the signal picked up by the bone conduction microphone from the picked-up environmental noise, so as to update the environmental noise; and estimating the environmental noise based on the updated environmental noise Noise at the first spatial location.

在一些實施例中，所述至少一個麥克風陣列包括骨傳導麥克風和氣傳導麥克風，所述信號處理器基於所述耳機的工作狀態控制所述骨傳導麥克風和所述氣傳導麥克風的開關狀態。In some embodiments, the at least one microphone array includes a bone conduction microphone and an air conduction microphone, and the signal processor controls switching states of the bone conduction microphone and the air conduction microphone based on the working state of the earphone.

為了更清楚地說明本發明實施例的技術方案，下面將對實施例描述中所需要使用的附圖作簡單的介紹。顯而易見地，下面描述中的附圖僅僅是本發明的一些示例或實施例，對於所屬技術領域中具有通常知識者來講，在不付出進步性努力的前提下，還可以根據這些附圖將本發明應用於其它類似情景。除非從語言環境中顯而易見或另做說明，圖式中相同的元件符號代表相同結構或操作。In order to more clearly illustrate the technical solutions of the embodiments of the present invention, the following briefly introduces the drawings that need to be used in the description of the embodiments. Apparently, the accompanying drawings in the following description are only some examples or embodiments of the present invention, and for those skilled in the art, the present invention can also be translated according to these drawings without making progressive efforts. The invention applies to other similar scenarios. Unless otherwise apparent from context or otherwise indicated, like reference symbols in the drawings represent like structures or operations.

應當理解，本文使用的“系統”、“裝置”、“單元”和/或“模組”是用於區分不同級別的不同元件、組件、部件、部分或裝配的一種方法。然而，如果其他詞語可實現相同的目的，則可通過其他表達來替換所述詞語。It should be understood that "system", "device", "unit" and/or "module" as used herein is a method for distinguishing different elements, components, parts, parts or assemblies of different levels. However, the words may be replaced by other expressions if other words can achieve the same purpose.

如本發明和申請專利範圍中所示，除非上下文明確提示例外情形，“一”、“一個”、“一種”和/或“該”等詞並非特指單數，也可包括複數。一般說來，術語“包括”與“包含”僅提示包括已明確標識的步驟和元素，而這些步驟和元素不構成一個排它性的羅列，方法或者設備也可能包含其它的步驟或元素。As shown in the present invention and scope of patent claims, words such as "a", "an", "an" and/or "the" are not specific to the singular and may also include the plural, unless the context clearly indicates an exception. Generally speaking, the terms "comprising" and "comprising" only suggest the inclusion of clearly identified steps and elements, and these steps and elements do not constitute an exclusive list, and the method or device may also contain other steps or elements.

本發明中使用了流程圖用來說明根據本發明的實施例的系統所執行的操作。應當理解的是，前面或後面操作不一定按照順序來精確地執行。相反地，可以按照倒序或同時處理各個步驟。同時，也可以將其他操作添加到這些流程中，或從這些流程移除某一步驟或幾個步驟的操作。The flow chart is used in the present invention to illustrate the operations performed by the system according to the embodiment of the present invention. It should be understood that the preceding or following operations are not necessarily performed in the exact order. Instead, various steps may be processed in reverse order or simultaneously. At the same time, other operations can also be added to these processes, or operations of a certain step or several steps can be removed from these processes.

開放式耳機是一種可以開放使用者耳部的耳機設備。開放式耳機可以通過固定結構（例如，耳掛、頭掛等）將揚聲器固定於使用者耳朵附近且不堵塞使用者耳道的位置。當使用者使用開放式耳機時，外界環境噪音也可以被使用者聽到，這就使得使用者的聽感體驗較差。例如，在外界環境噪音較大的場所（例如，街道、景區等），使用者在使用開放式耳機進行音樂播放時，外界環境的噪音會直接進入使用者耳道，使得使用者聽到較大的環境噪音，環境噪音會干擾使用者的聽音樂體驗。又例如，當使用者佩戴開放式耳機進行通話時，麥克風不僅會拾取使用者自身的說話聲音，也會拾取環境噪音，使得使用者通話體驗較差。An open-back headset is a headphone device that opens the ear of the user. Open earphones can fix the speaker near the user's ear without blocking the user's ear canal through a fixed structure (eg, ear hook, head hook, etc.). When the user uses the open earphone, the external environment noise can also be heard by the user, which makes the user's listening experience poor. For example, in places where the external environment is noisy (for example, streets, scenic spots, etc.), when the user uses open earphones to play music, the noise of the external environment will directly enter the user's ear canal, so that the user can hear a loud noise. Ambient noise. Ambient noise will interfere with the user's listening experience. For another example, when a user wears an open earphone for a call, the microphone will not only pick up the user's own voice, but also pick up ambient noise, making the user's call experience poor.

基於上述問題，本說明書實施例中提供一種開放式耳機，在一些實施例中，開放式耳機可以包括固定結構、殼體結構、第一麥克風陣列、至少一個揚聲器陣列且信號處理器。其中，固定結構被配置為將開放式耳機固定在使用者耳朵附近且不堵塞使用者耳道的位置。殼體結構被配置為容納第一麥克風陣列、至少一個揚聲器陣列且信號處理器。第一麥克風陣列被配置為拾取環境雜訊。信號處理器被配置為基於第一麥克風陣列拾取的環境雜訊估計第一空間位置的雜訊，並基於第一空間位置的雜訊產生降噪信號，這裡的第一空間位置比第一麥克風陣列中任一麥克風更加靠近使用者耳道。這裡可以理解為，第一麥克風陣列中的各麥克風可以分佈於使用者耳道附近的不同位置，通過對第一麥克風陣列中的各麥克風採集的環境雜訊信號來估計靠近使用者耳道位置處（例如，第一空間位置）的雜訊。至少一個揚聲器陣列被配置為基於降噪信號輸出降雜訊波，該降雜訊波可以用於消除傳遞到使用者耳道的環境雜訊。在一些實施例中，開放式耳機還可以包括第二麥克風陣列，第二麥克風陣列被配置為拾取環境雜訊和至少一個揚聲器陣列輸出的降雜訊波。在一些實施例中，信號處理器可以基於第二麥克風陣列拾取的聲音信號更新降噪信號。例如，信號處理器可以基於第二麥克風陣列拾取的聲音信號對使用者耳道處的聲場進行估計，並根據使用者耳道處的聲場調整降噪信號的相位或幅值，實現降噪信號的更新。本說明書的實施例中，通過上述方法利用降雜訊波消除使用者耳道處的環境雜訊，實現了開放式耳機的主動降噪，提高了使用者在使用開放式耳機過程中的聽覺體驗。Based on the above problems, the embodiments of this specification provide an open earphone. In some embodiments, the open earphone may include a fixed structure, a shell structure, a first microphone array, at least one loudspeaker array and a signal processor. Wherein, the fixing structure is configured to fix the open earphone near the user's ear and not block the user's ear canal. The housing structure is configured to house a first microphone array, at least one speaker array, and a signal processor. The first microphone array is configured to pick up ambient noise. The signal processor is configured to estimate noise at a first spatial position based on environmental noise picked up by the first microphone array, and generate a noise-reduced signal based on the noise at the first spatial position, where the first spatial position is larger than the first microphone array Either microphone is brought closer to the user's ear canal. It can be understood here that the microphones in the first microphone array can be distributed in different positions near the user's ear canal, and the location close to the user's ear canal can be estimated by collecting environmental noise signals from the microphones in the first microphone array. (eg, the first spatial position) of the noise. The at least one loudspeaker array is configured to output a noise reduction wave based on the noise reduction signal, and the noise reduction wave can be used to cancel ambient noise delivered to the user's ear canal. In some embodiments, the open earphone may further include a second microphone array configured to pick up environmental noise and noise reduction waves output by the at least one speaker array. In some embodiments, the signal processor may update the noise reduction signal based on the sound signal picked up by the second microphone array. For example, the signal processor can estimate the sound field at the user's ear canal based on the sound signal picked up by the second microphone array, and adjust the phase or amplitude of the noise reduction signal according to the sound field at the user's ear canal to achieve noise reduction Signal updates. In the embodiment of this specification, the noise reduction wave is used to eliminate the environmental noise at the user's ear canal through the above method, realizing the active noise reduction of the open earphone and improving the user's listening experience in the process of using the open earphone .

圖1是根據本發明的一些實施例提供的開放式耳機的示例性方塊圖。如圖1所示，開放式耳機100可以包括殼體結構110、第一麥克風陣列130、信號處理器140和揚聲器陣列150，其中，第一麥克風陣列130、信號處理器140和揚聲器陣列150位於殼體結構110處。開放式耳機100可以通過固定結構120將耳機固定於使用者耳朵附近且不堵塞使用者耳道。在一些實施例中，位於殼體結構110處的第一麥克風陣列130可以拾取使用者耳道處的環境雜訊，並將拾取到的環境雜訊信號轉換為電信號傳遞至信號處理器140進行信號處理。信號處理器140耦接第一麥克風陣列130和揚聲器陣列150，信號處理器140接收第一麥克風陣列130拾取的環境雜訊信號並對其進行信號處理以獲取環境雜訊的參數資訊（例如，幅值資訊、相位資訊等）。信號處理器140可以基於環境雜訊的參數資訊（例如，幅值資訊、相位資訊等）估計使用者第一空間位置處的雜訊，並基於第一空間位置處的雜訊產生降噪信號。該降噪信號的參數資訊與環境雜訊的參數資訊相對應，例如，降噪信號的幅值大小與環境雜訊的幅值大小近似相等，降噪信號的相位與環境雜訊的相位近似相反。信號處理器140將產生的降噪信號傳遞至揚聲器陣列150。揚聲器陣列150可以根據信號處理器140產生的降噪信號輸出降雜訊波，該降雜訊波可以與使用者耳道位置處的環境雜訊相互抵消，從而實現開放式耳機100的主動降噪，提高使用者在使用開放式耳機100過程中的聽覺體驗。FIG. 1 is an exemplary block diagram of an open-back earphone provided according to some embodiments of the present invention. As shown in FIG. 1 , the open earphone 100 may include a shell structure 110, a first microphone array 130, a signal processor 140 and a speaker array 150, wherein the first microphone array 130, the signal processor 140 and the speaker array 150 are located in the shell body structure 110. The open earphone 100 can fix the earphone near the user's ear through the fixing structure 120 without blocking the user's ear canal. In some embodiments, the first microphone array 130 located at the shell structure 110 can pick up the environmental noise at the ear canal of the user, and convert the picked-up environmental noise signal into an electrical signal and send it to the signal processor 140 for further processing. signal processing. The signal processor 140 is coupled to the first microphone array 130 and the speaker array 150. The signal processor 140 receives the environmental noise signal picked up by the first microphone array 130 and performs signal processing on it to obtain parameter information of the environmental noise (for example, amplitude value information, phase information, etc.). The signal processor 140 can estimate the noise at the first spatial position of the user based on parameter information of the environmental noise (eg, amplitude information, phase information, etc.), and generate a noise reduction signal based on the noise at the first spatial position. The parameter information of the noise reduction signal corresponds to the parameter information of the environmental noise. For example, the amplitude of the noise reduction signal is approximately equal to the amplitude of the environmental noise, and the phase of the noise reduction signal is approximately opposite to that of the environmental noise. . The signal processor 140 transmits the generated noise reduction signal to the speaker array 150 . The loudspeaker array 150 can output a noise reduction wave according to the noise reduction signal generated by the signal processor 140, and the noise reduction wave can cancel out the environmental noise at the position of the user's ear canal, thereby realizing the active noise reduction of the open earphone 100 , to improve the listening experience of the user in the process of using the open earphone 100 .

殼體結構110可以被配置為承載第一麥克風陣列130、信號處理器140且揚聲器陣列150。在一些實施例中，殼體結構110可以是內部中空的封閉式或半封閉式殼體結構，且第一麥克風陣列130、信號處理器140且揚聲器陣列150位於殼體結構110處。在一些實施例中，殼體結構110的形狀可以為長方體、圓柱體、圓臺等規則或不規則形狀的立體結構。當使用者佩戴開放式耳機100時，殼體結構110可以位於靠近使用者耳朵附近的位置，例如，殼體結構110可以位於使用者耳廓的周側（例如，前側或後側），或者位於使用者耳朵上但不堵塞或覆蓋使用者的耳道。在一些實施例中，開放式耳機100可以為骨傳導耳機，殼體結構110的至少一側可以與使用者的頭部皮膚接觸。骨傳導耳機中聲學驅動器（例如，振動揚聲器）將音訊信號轉換為機械振動，該機械振動可以通過殼體結構110且使用者的骨骼傳遞至使用者的聽覺神經。在一些實施例中，開放式耳機100可以為氣傳導耳機，殼體結構110的至少一側可以與使用者的頭部皮膚接觸或不接觸。殼體結構110的側壁上包括至少一個導聲孔，氣傳導耳機中的揚聲器將音訊信號轉換為氣導聲音，該氣導聲音可以通過導聲孔向使用者耳朵的方向進行輻射。The housing structure 110 may be configured to carry the first microphone array 130 , the signal processor 140 and the speaker array 150 . In some embodiments, the housing structure 110 may be a closed or semi-closed housing structure with a hollow inside, and the first microphone array 130 , the signal processor 140 and the speaker array 150 are located at the housing structure 110 . In some embodiments, the shape of the housing structure 110 may be a regular or irregular three-dimensional structure such as a cuboid, cylinder, or truncated cone. When the user wears the open earphone 100, the shell structure 110 can be located near the user's ear, for example, the shell structure 110 can be located on the peripheral side (for example, the front side or the back side) of the user's auricle, or on the On the user's ear but does not block or cover the user's ear canal. In some embodiments, the open earphone 100 may be a bone conduction earphone, and at least one side of the shell structure 110 may be in contact with the user's head skin. The acoustic driver (eg, vibrating speaker) in the bone conduction earphone converts the audio signal into mechanical vibration, which can be transmitted to the user's auditory nerve through the shell structure 110 and the user's bones. In some embodiments, the open earphone 100 may be an air conduction earphone, and at least one side of the housing structure 110 may or may not be in contact with the user's head skin. The side wall of the housing structure 110 includes at least one sound guide hole. The speaker in the air conduction earphone converts the audio signal into air conduction sound, and the air conduction sound can radiate toward the user's ear through the sound guide hole.

第一麥克風陣列130可以被配置為拾取環境雜訊。在一些實施例中，環境雜訊是指使用者所處環境中的多種外界聲音的組合。在一些實施例中，環境雜訊可以包括交通雜訊、工業雜訊、建築施工雜訊、社會雜訊等中的一種或多種。在一些實施例中，交通雜訊可以包括但不限於機動車輛的行駛雜訊、鳴笛雜訊等。工業雜訊可以包括但不限於工廠動力機械運轉雜訊等。建築施工雜訊可以包括但不限於動力機械挖掘雜訊、打洞雜訊、攪拌雜訊等。社會生活環境雜訊可以包括但不限於群眾集會雜訊、文娛宣傳雜訊、人群喧鬧雜訊、家用電器雜訊等。在一些實施例中，第一麥克風陣列130可以設置於使用者耳道附近位置，用於拾取傳遞至使用者耳道處的環境雜訊，第一麥克風陣列130可以將拾取的環境雜訊信號轉換為電信號並傳遞至信號處理器140進行信號處理。在一些實施例中，環境雜訊也可以包括使用者講話的聲音。例如，當開放式耳機100為未通話狀態時，使用者自身說話產生的聲音也可以視為環境雜訊，第一麥克風陣列130可以拾取使用者自身說話的聲音且其他環境雜訊，並將使用者說話產生的聲音信號和其他環境雜訊轉化為電信號傳遞至信號處理器140進行信號處理。在一些實施例中，第一麥克風陣列130可以分佈于使用者的左耳或右耳處。在一些實施例中，第一麥克風陣列130還可以位於使用者的左耳和右耳處。例如，第一麥克風陣列130可以包括第一子麥克風陣列和第二子麥克風陣列，其中，第一子麥克風陣列位於使用者的左耳處，第二子麥克風陣列位於使用者的右耳處，第一子麥克風陣列和第二子麥克風陣列可以同時進入工作狀態或二者中的一個進入工作狀態。The first microphone array 130 may be configured to pick up environmental noise. In some embodiments, environmental noise refers to a combination of various external sounds in the environment where the user is located. In some embodiments, environmental noise may include one or more of traffic noise, industrial noise, building construction noise, social noise, and the like. In some embodiments, the traffic noise may include, but not limited to, the driving noise of motor vehicles, the honking noise, and the like. Industrial noise may include, but is not limited to, factory power machinery running noise, etc. Building construction noise may include but not limited to power machinery excavation noise, hole drilling noise, stirring noise, etc. Social and living environment noise may include, but is not limited to, mass gathering noise, entertainment publicity noise, crowd noise, household appliance noise, etc. In some embodiments, the first microphone array 130 can be arranged near the user's ear canal to pick up the environmental noise transmitted to the user's ear canal, and the first microphone array 130 can convert the picked-up environmental noise signal It is an electrical signal and transmitted to the signal processor 140 for signal processing. In some embodiments, the environmental noise may also include the voice of the user speaking. For example, when the open earphone 100 is not in a call state, the sound generated by the user's own speech can also be regarded as environmental noise, and the first microphone array 130 can pick up the user's own speech and other environmental noise, and will use The voice signal and other environmental noise generated by the speaker's speech are converted into electrical signals and transmitted to the signal processor 140 for signal processing. In some embodiments, the first microphone array 130 may be distributed at the left ear or the right ear of the user. In some embodiments, the first microphone array 130 may also be located at the left and right ears of the user. For example, the first microphone array 130 may include a first sub-microphone array and a second sub-microphone array, wherein the first sub-microphone array is located at the user's left ear, the second sub-microphone array is located at the user's right ear, and the second sub-microphone array is located at the user's right ear. The first sub-microphone array and the second sub-microphone array can enter the working state at the same time or one of them can enter the working state.

在一些實施例中，第一麥克風陣列130可以包括氣傳導麥克風和/或骨傳導麥克風。例如，在一些實施例中，第一麥克風陣列130可以包括一個或多個氣傳導麥克風。例如，使用者在使用開放式耳機100聽取音樂時，氣傳導麥克風可以同時獲取外界環境的雜訊和使用者說話時的聲音並將其作為環境雜訊轉換為電信號傳輸至信號處理器140中進行處理。在一些實施例中，第一麥克風陣列130還可以包括一個或多個骨傳導麥克風。在一些實施例中，骨傳導麥克風可以直接與使用者的頭部皮膚接觸，使用者說話時面部骨骼或肌肉產生的振動信號可以直接傳遞給骨傳導麥克風，進而骨傳導麥克風將振動信號轉換為電信號，並將電信號傳遞至信號處理器140進行信號處理。在一些實施例中，骨傳導麥克風也可以不與人體直接接觸，使用者說話時面部骨骼或肌肉產生的振動信號可以先傳遞至殼體結構110，再由殼體結構110傳遞至骨傳導麥克風，骨傳導麥克風進一步將該人體振動信號轉換為包含語音資訊的電信號。例如，使用者在通話狀態時，信號處理器140可以將氣傳導麥克風採集的聲音信號作為環境雜訊進行降噪處理，骨傳導麥克風採集的聲音信號作為語音信號進行保留，從而保證使用者通話時的通話品質。在一些實施例中，根據麥克風的工作原理作為分類，第一麥克風陣列130可以包括動圈式麥克風、帶式麥克風、電容式麥克風、駐極體式麥克風、電磁式麥克風、碳粒式麥克風等，或其任意組合。在一些實施例中，第一麥克風陣列130的陣列排布方式可以是線性陣列（例如，直線形、曲線形）、平面陣列（例如，十字形、圓形、環形、多邊形、網狀形等規則和/或不規則形狀）或立體陣列（例如，圓柱狀、球狀、半球狀、多面體等），關於第一麥克風陣列130的排布方式具體可以參考本說明書圖8及其相關內容。In some embodiments, the first microphone array 130 may include air conduction microphones and/or bone conduction microphones. For example, in some embodiments, first microphone array 130 may include one or more air conduction microphones. For example, when the user listens to music with the open earphone 100, the air conduction microphone can simultaneously acquire the noise of the external environment and the voice of the user when speaking, and convert it into an electrical signal as the environmental noise and transmit it to the signal processor 140. to process. In some embodiments, the first microphone array 130 may also include one or more bone conduction microphones. In some embodiments, the bone conduction microphone can be directly in contact with the user's head skin, and the vibration signal generated by the facial bones or muscles of the user can be directly transmitted to the bone conduction microphone when the user speaks, and then the bone conduction microphone converts the vibration signal into an electric current. signal, and transmit the electrical signal to the signal processor 140 for signal processing. In some embodiments, the bone conduction microphone may not be in direct contact with the human body. When the user speaks, the vibration signal generated by the facial bones or muscles may be transmitted to the shell structure 110, and then transmitted to the bone conduction microphone by the shell structure 110. A bone conduction microphone further converts the human body vibration signal into an electrical signal containing voice information. For example, when the user is in a call state, the signal processor 140 can use the sound signal collected by the air conduction microphone as environmental noise for noise reduction processing, and the sound signal collected by the bone conduction microphone as a voice signal, so as to ensure call quality. In some embodiments, according to the working principle of the microphones, the first microphone array 130 may include dynamic microphones, ribbon microphones, condenser microphones, electret microphones, electromagnetic microphones, carbon microphones, etc., or any combination thereof. In some embodiments, the array arrangement of the first microphone array 130 can be a linear array (for example, linear, curved), planar array (for example, cross-shaped, circular, circular, polygonal, mesh-shaped, etc. and/or irregular shape) or a three-dimensional array (for example, cylindrical, spherical, hemispherical, polyhedral, etc.), for the arrangement of the first microphone array 130, please refer to FIG. 8 and its related contents in this specification.

信號處理器140被配置為基於第一麥克風陣列130拾取的環境雜訊估計第一空間位置的雜訊，並基於第一空間位置的雜訊產生降噪信號。第一空間位置是指靠近使用者耳道特定距離的空間位置，該第一空間位置比第一麥克風陣列130中任一麥克風更加靠近使用者耳道。這裡的特定距離可以是固定的距離，例如，0.5 cm、1cm、2cm、3cm等。在一些實施例中，第一空間位置與第一麥克風陣列130中各麥克風相對於使用者耳朵的分佈位置、數量相關，通過調整第一麥克風陣列130中各麥克風相對於使用者耳朵的分佈位置和/或數量可以對第一空間位置進行調整。例如，通過增加第一麥克風陣列130中麥克風的數量可以使第一空間位置更加靠近使用者耳道。The signal processor 140 is configured to estimate the noise at the first spatial location based on the environmental noise picked up by the first microphone array 130 , and generate a noise reduction signal based on the noise at the first spatial location. The first spatial position refers to a spatial position close to the user's ear canal by a certain distance, and the first spatial position is closer to the user's ear canal than any microphone in the first microphone array 130 . The specific distance here may be a fixed distance, for example, 0.5 cm, 1 cm, 2 cm, 3 cm and so on. In some embodiments, the first spatial position is related to the distribution position and quantity of each microphone in the first microphone array 130 relative to the user's ear, by adjusting the distribution position and quantity of each microphone in the first microphone array 130 relative to the user's ear and/or the quantity can be adjusted for the first spatial position. For example, increasing the number of microphones in the first microphone array 130 can make the first spatial position closer to the user's ear canal.

信號處理器140可以對接收到的環境雜訊信號進行信號處理，估計第一空間位置的雜訊。在一些實施例中，信號處理器140可以耦合到第一麥克風陣列130和揚聲器陣列150，信號處理器140可以接收第一麥克風陣列130拾取的環境雜訊以估計第一空間位置的雜訊。例如，信號處理器140可以確定一個或多個與拾取的環境雜訊有關的空間噪音源。又例如，信號處理器140可以對環境雜訊有關的空間噪音源進行方位估計、相位資訊估計、幅值資訊估計等。信號處理器140可以根據第一空間位置的雜訊估計（例如，相位資訊、幅值資訊）產生降噪信號。降噪信號是指與第一空間位置的雜訊的幅值大小近似相等、相位近似相反的聲音信號。The signal processor 140 may perform signal processing on the received environmental noise signal to estimate the noise at the first spatial location. In some embodiments, the signal processor 140 may be coupled to the first microphone array 130 and the speaker array 150, and the signal processor 140 may receive the environmental noise picked up by the first microphone array 130 to estimate the noise at the first spatial location. For example, signal processor 140 may determine one or more sources of spatial noise related to picked-up ambient noise. For another example, the signal processor 140 may perform orientation estimation, phase information estimation, amplitude information estimation, etc. on the spatial noise source related to the environmental noise. The signal processor 140 can generate a noise-reduced signal according to the noise estimate (eg, phase information, amplitude information) of the first spatial location. The noise reduction signal refers to a sound signal that is approximately equal in amplitude and opposite in phase to the noise at the first spatial position.

揚聲器陣列150被配置為基於降噪信號輸出降雜訊波，該降雜訊波用於消除傳遞到使用者耳道的環境雜訊。在一些實施例中，揚聲器陣列150可以設置於殼體結構110處，當使用者佩戴開放式耳機100時，揚聲器陣列150可以位於使用者耳部的附近位置。在一些實施例中，揚聲器陣列150可以基於降噪信號輸出降雜訊波以與第一空間位置的環境雜訊相抵消。僅作為示例性說明，例如，信號處理器140控制揚聲器陣列150輸出與第一空間位置的雜訊的幅值大小近似相等、相位近似相反的聲音信號以抵消第一空間位置的雜訊。在一些實施例中，第一空間位置與使用者耳道之間的間距較小，第一空間位置的雜訊可以近似視為傳遞到使用者耳朵處的雜訊，揚聲器陣列150基於降噪信號輸出的降雜訊波可以與第一空間位置的雜訊相互抵消，可以近似為傳遞至使用者耳道的環境雜訊被消除。在一些實施例中，根據揚聲器的工作原理進行分類，揚聲器陣列150可以包括電動式揚聲器（例如，動圈式揚聲器）、磁式揚聲器、離子揚聲器、靜電式揚聲器（或電容式揚聲器）、壓電式揚聲器等中的一種或多種。在一些實施例中，根據揚聲器輸出的聲音的傳播方式進行分類，揚聲器陣列150可以包括氣傳導揚聲器或骨傳導揚聲器。例如，揚聲器陣列150只包括氣傳導揚聲器時，揚聲器陣列150中的部分氣傳導揚聲器可以用於輸出降雜訊波以消除雜訊，揚聲器陣列150中的其他氣傳導揚聲器可以用於向使用者傳遞使用者需要聽取的聲音資訊（例如，設備媒體音訊、通話遠端音訊）。在一些實施例中，揚聲器陣列150中用於向使用者傳遞使用者需要聽取的聲音資訊的揚聲器也可以用於輸出降雜訊波。又例如，揚聲器陣列150包括骨傳導揚聲器和氣傳導揚聲器時，氣傳導揚聲器可以用於輸出降雜訊波以消除雜訊，骨傳導揚聲器可以用於向使用者傳遞使用者需要聽取的聲音資訊，相比於氣傳導揚聲器，骨傳導揚聲器將機械振動直接通過使用者的身體（例如，骨骼、皮膚組織等）傳遞至使用者的聽覺神經，在此過程中對於拾取環境雜訊的氣傳導麥克風的干擾較小。骨傳導揚聲器在將機械振動傳遞至使用者的過程中會造成殼體結構110產生機械振動，殼體結構110產生的機械振動作用於空氣產生氣傳導聲音，在一些實施例中，殼體結構110產生的氣傳導聲音也可以作為降雜訊波。需要注意的是，在一些實施例中，揚聲器陣列150可以是獨立的功能器件，也可以是能夠實現多個功能的單個器件的一部分。在一些實施例中，信號處理器140可以和揚聲器陣列150集成在一起和/或形成為一體。在一些實施例中，揚聲器陣列150的排布方式可以是線性陣列（例如，直線形、曲線形）、平面陣列（例如，十字形、網狀形、圓形、環形、多邊形等規則和/或不規則形狀）或立體陣列（例如，圓柱狀、球狀、半球狀、多面體等），本說明書在此不做限定。The speaker array 150 is configured to output a noise reduction wave based on the noise reduction signal, and the noise reduction wave is used to eliminate environmental noise transmitted to the user's ear canal. In some embodiments, the speaker array 150 may be disposed at the shell structure 110 , and when the user wears the open earphone 100 , the speaker array 150 may be located near the ear of the user. In some embodiments, the speaker array 150 may output a noise reduction wave based on the noise reduction signal to cancel out the environmental noise at the first spatial location. As an example only, for example, the signal processor 140 controls the speaker array 150 to output a sound signal having approximately equal amplitude and approximately opposite phase to the noise at the first spatial position to cancel the noise at the first spatial position. In some embodiments, the distance between the first spatial position and the user's ear canal is relatively small, the noise at the first spatial position can be approximately regarded as the noise transmitted to the user's ear, and the speaker array 150 is based on the noise reduction signal The outputted noise-reducing wave can cancel out the noise at the first spatial position, and it can be approximated that the environmental noise transmitted to the user's ear canal is eliminated. In some embodiments, the speaker array 150 may include electrodynamic speakers (for example, dynamic speakers), magnetic speakers, ion speakers, electrostatic speakers (or capacitive speakers), piezoelectric One or more of the speaker, etc. In some embodiments, the speaker array 150 may include air conduction speakers or bone conduction speakers according to the propagation mode of the sound output by the speakers. For example, when the speaker array 150 only includes air conduction speakers, part of the air conduction speakers in the speaker array 150 can be used to output noise reduction waves to eliminate noise, and other air conduction speakers in the speaker array 150 can be used to transmit The audio information that the user needs to hear (for example, device media audio, call remote audio). In some embodiments, the speakers in the speaker array 150 that are used to deliver the audio information that the user needs to hear to the user may also be used to output noise reduction waves. For another example, when the speaker array 150 includes a bone conduction speaker and an air conduction speaker, the air conduction speaker can be used to output noise reduction waves to eliminate noise, and the bone conduction speaker can be used to transmit the sound information that the user needs to hear to the user. Compared with air conduction speakers, bone conduction speakers transmit mechanical vibrations directly through the user's body (such as bones, skin tissue, etc.) to the user's auditory nerves. smaller. When the bone conduction speaker transmits the mechanical vibration to the user, it will cause the shell structure 110 to generate mechanical vibration. The mechanical vibration generated by the shell structure 110 acts on the air to generate air-conducted sound. In some embodiments, the shell structure 110 The resulting air-conducted sound also acts as a noise reduction wave. It should be noted that, in some embodiments, the speaker array 150 may be an independent functional device, or a part of a single device capable of implementing multiple functions. In some embodiments, the signal processor 140 may be integrated and/or formed into a single body with the speaker array 150 . In some embodiments, the speaker array 150 can be arranged in a linear array (for example, linear, curved), planar array (for example, cross-shaped, mesh-shaped, circular, circular, polygonal, etc.) and/or Irregular shape) or three-dimensional array (for example, cylindrical, spherical, hemispherical, polyhedron, etc.), this description is not limited here.

在一些實施例中，開放式耳機100還可以包括固定結構120，固定結構120被配置為將開放式耳機100固定在使用者耳朵附近且不堵塞使用者耳道的位置。在一些實施例中，固定結構120可以包括耳掛、頭梁或彈性帶等，使得開放式耳機100可以更好地固定在使用者耳朵附近位置，防止使用者在使用時發生掉落。僅作為示例性說明，例如，固定結構120可以為耳掛，耳掛可以被配置為圍繞耳部區域佩戴。又例如，固定結構120可以為頸帶，被配置為圍繞頸/肩區域佩戴。在一些實施例中，耳掛可以是連續的鉤狀物，並可以被彈性地拉伸以佩戴在使用者的耳部，同時耳掛還可以對使用者的耳廓施加壓力，使得開放式耳機100牢固地固定在使用者的耳部或頭部的特定位置上。在一些實施例中，耳掛可以是不連續的帶狀物。例如，耳掛可以包括剛性部分和柔性部分，其中，剛性部分可以由剛性材料（例如，塑膠或金屬）製成，剛性部分可以與聲學輸出裝置的殼體結構通過物理連接（例如，卡接、螺紋連接等）的方式進行固定。柔性部分可以由彈性材料製成（例如，布料、複合材料或/和氯丁橡膠）。In some embodiments, the open-back earphone 100 may further include a fixing structure 120 configured to fix the open-back earphone 100 near the user's ear without blocking the user's ear canal. In some embodiments, the fixing structure 120 may include ear hooks, headbands or elastic bands, etc., so that the open earphone 100 can be better fixed near the user's ear and prevent the user from falling during use. By way of illustration only, for example, securing structure 120 may be an earhook that may be configured to be worn around the ear region. As another example, securing structure 120 may be a neck strap configured to be worn around the neck/shoulder area. In some embodiments, the earhook can be a continuous hook that can be elastically stretched to fit on the user's ear, while the earhook can also apply pressure to the user's auricle, making the open earphone 100 is firmly fixed on the user's ear or head at a specific location. In some embodiments, the earhook may be a discontinuous strip. For example, the earhook may include a rigid part and a flexible part, wherein the rigid part may be made of a rigid material (for example, plastic or metal), and the rigid part may be physically connected to the housing structure of the acoustic output device (for example, clipping, screw connection, etc.) to fix. The flexible portion may be made of elastic material (eg, cloth, composite, or/and neoprene).

應當注意的是，以上關於圖1和圖2的描述僅僅是出於說明的目的而提供的，並不旨在限制本發明的範圍。對於所屬技術領域中具有通常知識者來說，根據本案揭露內容的指導可以做出多種變化和修改。然而，這些變化和修改不會背離本發明的範圍。例如，開放式耳機100中的一個或多個元件（例如，固定結構120等）可以省略。在一些實施例中，一個元件可以被其他能實現類似功能的元件替代。例如，在一些實施例中，開放式耳機100可以不包括固定結構120，殼體結構110可以為具有人體耳朵適配形狀的殼體結構，例如圓環形、橢圓形、多邊形（規則或不規則）、U 型、V 型、半圓形，以便殼體結構110可以掛靠在使用者的耳朵附近。在一些實施例中，一個元件可以拆分成多個子元件，或者多個元件可以合併為單個元件。It should be noted that the above descriptions with respect to FIGS. 1 and 2 are provided for illustrative purposes only, and are not intended to limit the scope of the present invention. Numerous changes and modifications may be made by those skilled in the art in light of the teachings of the present disclosure. However, these changes and modifications do not depart from the scope of the present invention. For example, one or more elements of open-back earphone 100 (eg, securing structure 120 , etc.) may be omitted. In some embodiments, an element may be replaced by another element that performs a similar function. For example, in some embodiments, the open earphone 100 may not include the fixing structure 120, and the housing structure 110 may be a housing structure having a shape suitable for a human ear, such as a ring, an ellipse, a polygon (regular or irregular) ), U-shaped, V-shaped, semi-circular, so that the shell structure 110 can hang near the user's ear. In some embodiments, an element may be split into multiple sub-elements, or multiple elements may be combined into a single element.

圖2是根據本發明的一些實施例提供的開放式耳機的示例性原理流程圖。如圖2所示，流程200可以包括：Fig. 2 is an exemplary schematic flowchart of an open earphone provided according to some embodiments of the present invention. As shown in Figure 2, the process 200 may include:

在步驟210中，拾取環境雜訊。In step 210, environmental noise is picked up.

在一些實施例中，該步驟可以由第一麥克風陣列130執行。在一些實施例中，環境雜訊是指使用者所處環境中的多種外界聲音的組合。在一些實施例中，環境雜訊可以包括交通雜訊、工業雜訊、建築施工雜訊、社會雜訊等中的一種或多種。在一些實施例中，交通雜訊可以包括但不限於機動車輛的行駛雜訊、鳴笛雜訊等。工業雜訊可以包括但不限於工廠動力機械運轉雜訊等。建築施工雜訊可以包括但不限於動力機械挖掘雜訊、打洞雜訊、攪拌雜訊等。社會生活環境雜訊可以包括但不限於群眾集會雜訊、文娛宣傳雜訊、人群喧鬧雜訊、家用電器雜訊等。在一些實施例中，第一麥克風陣列130可以位於使用者耳道的附近位置，用於拾取傳遞至使用者耳道處的環境雜訊，第一麥克風陣列130可以將拾取的環境雜訊信號轉換為電信號並傳遞至信號處理器140進行信號處理。在一些實施例中，環境雜訊也可以包括使用者講話的聲音。例如，當開放式耳機100為未通話狀態時（例如收聽音訊或觀看視頻時），使用者自身說話產生的聲音也可以視為環境雜訊，第一麥克風陣列130可以拾取使用者自身說話的聲音且其他環境雜訊，並將使用者說話產生的聲音信號和其他環境雜訊轉化為電信號傳遞至信號處理器140進行信號處理。In some embodiments, this step may be performed by the first microphone array 130 . In some embodiments, environmental noise refers to a combination of various external sounds in the environment where the user is located. In some embodiments, environmental noise may include one or more of traffic noise, industrial noise, building construction noise, social noise, and the like. In some embodiments, the traffic noise may include, but not limited to, the driving noise of motor vehicles, the honking noise, and the like. Industrial noise may include, but is not limited to, factory power machinery running noise, etc. Building construction noise may include but not limited to power machinery excavation noise, hole drilling noise, stirring noise, etc. Social and living environment noise may include, but is not limited to, mass gathering noise, entertainment publicity noise, crowd noise, household appliance noise, etc. In some embodiments, the first microphone array 130 may be located near the user's ear canal for picking up environmental noise transmitted to the user's ear canal, and the first microphone array 130 may convert the picked-up environmental noise signal It is an electrical signal and transmitted to the signal processor 140 for signal processing. In some embodiments, the environmental noise may also include the voice of the user speaking. For example, when the open earphone 100 is not in a call state (for example, when listening to audio or watching a video), the sound generated by the user's own speech can also be regarded as environmental noise, and the first microphone array 130 can pick up the sound of the user's own speech and other environmental noises, and convert the sound signal generated by the user's speech and other environmental noises into electrical signals and transmit them to the signal processor 140 for signal processing.

在步驟220中，基於拾取的環境雜訊估計第一空間位置的雜訊。In step 220, the noise of the first spatial location is estimated based on the picked-up ambient noise.

在一些實施例中，該步驟可以由信號處理器140執行。第一空間位置是指靠近使用者耳道特定距離的空間位置。這裡的特定距離可以是固定的距離，例如，0.5 cm、1cm、2 cm、3 cm等，可以根據實際應用情況進行適應性調整。第一麥克風陣列130拾取的環境雜訊可以是來自不同方位、不同種類的空間噪音源，因而每一個空間噪音源對應的參數資訊（例如，相位資訊、幅值資訊）是不同的。在一些實施例中，信號處理器140可以根據不同類型的雜訊在不同維度（例如，空域、時域、頻域等）的統計分佈和結構化特徵將第一空間位置的雜訊進行信號分離提取，從而估計不同類型（例如不同頻率、不同相位等）的雜訊，並估計每種雜訊所對應的參數資訊（例如，幅值資訊、相位資訊等）。在一些實施例中，信號處理器140還可以將根據第一空間位置處不同類型雜訊對應的參數資訊確定第一空間位置的雜訊的整體參數資訊。在一些實施例中，基於拾取的環境雜訊估計第一空間位置的雜訊還可以包括確定一個或多個與拾取的環境雜訊有關的空間噪音源，基於空間噪音源估計第一空間位置的雜訊。例如，將拾取的環境雜訊劃分為多個子帶，每個子帶對應不同的頻率範圍，在至少一個子帶上，確定與其對應的空間噪音源。需要注意的是，這裡通過子帶估計的空間噪音源是與外界真實噪音源對應的虛擬噪音源。關於基於拾取的環境雜訊估計第一空間位置的雜訊的具體內容可以參考本發明說明書其它地方，例如，圖5至圖7及其相應描述。In some embodiments, this step may be performed by the signal processor 140 . The first spatial position refers to a spatial position close to the user's ear canal at a specific distance. The specific distance here can be a fixed distance, for example, 0.5 cm, 1 cm, 2 cm, 3 cm, etc., and can be adjusted adaptively according to actual application conditions. The environmental noise picked up by the first microphone array 130 may come from different azimuths and different types of spatial noise sources, so the parameter information (eg, phase information, amplitude information) corresponding to each spatial noise source is different. In some embodiments, the signal processor 140 may separate the noise at the first spatial position according to the statistical distribution and structural characteristics of different types of noise in different dimensions (for example, space domain, time domain, frequency domain, etc.) Extraction, thereby estimating different types of noises (such as different frequencies, different phases, etc.), and estimating the parameter information (such as amplitude information, phase information, etc.) corresponding to each noise. In some embodiments, the signal processor 140 may also determine the overall parameter information of the noise at the first spatial location according to the parameter information corresponding to different types of noise at the first spatial location. In some embodiments, estimating the noise of the first spatial position based on the picked-up environmental noise may further include determining one or more spatial noise sources related to the picked-up environmental noise, and estimating the noise of the first spatial position based on the spatial noise source. noise. For example, the picked-up environmental noise is divided into multiple subbands, each subband corresponds to a different frequency range, and the corresponding spatial noise source is determined on at least one subband. It should be noted that the spatial noise source estimated by the sub-band here is a virtual noise source corresponding to the real noise source outside. For details about estimating the noise of the first spatial position based on the picked-up environmental noise, reference may be made to other places in the specification of the present invention, for example, FIG. 5 to FIG. 7 and their corresponding descriptions.

開放式耳機100不堵塞使用者耳道，無法通過在耳道處設置麥克風的方式獲取環境雜訊，第一空間位置是第一麥克風陣列130所構造的用於模擬使用者耳道位置的空間區域，為了更加精確地估計使用者耳道處傳遞的環境雜訊，在一些實施例中，第一空間位置比第一麥克風陣列130中任一麥克風更加靠近使用者耳道。在一些實施例中，第一空間位置與第一麥克風陣列130中各麥克風相對於使用者耳朵的分佈位置、數量相關，通過調整第一麥克風陣列130中各麥克風相對於使用者耳朵的分佈位置或數量可以對第一空間位置進行調整。例如，通過增加第一麥克風陣列130中麥克風的數量可以使第一空間位置更加靠近使用者耳道。又例如，還可以通過減小第一麥克風陣列130中各麥克風的間距使第一空間位置更加靠近使用者耳道。再例如，還可以通過改變第一麥克風陣列130中各麥克風的排列方式使第一空間位置更加靠近使用者耳道。The open earphone 100 does not block the user's ear canal, and cannot obtain environmental noise by installing a microphone at the ear canal. The first spatial position is the spatial area constructed by the first microphone array 130 to simulate the position of the user's ear canal , in order to more accurately estimate the environmental noise transmitted by the user's ear canal, in some embodiments, the first spatial position is closer to the user's ear canal than any microphone in the first microphone array 130 . In some embodiments, the first spatial position is related to the distribution position and quantity of each microphone in the first microphone array 130 relative to the user's ear. By adjusting the distribution position or quantity of each microphone in the first microphone array 130 relative to the user's ear The quantity can be adjusted for the first space position. For example, increasing the number of microphones in the first microphone array 130 can make the first spatial position closer to the user's ear canal. For another example, the first spatial position may be closer to the user's ear canal by reducing the distance between the microphones in the first microphone array 130 . For another example, the first spatial position may be closer to the user's ear canal by changing the arrangement of the microphones in the first microphone array 130 .

在步驟230中，基於第一空間位置的雜訊產生降噪信號。In step 230, a noise reduction signal is generated based on the noise at the first spatial location.

在一些實施例中，該步驟可以由信號處理器140執行。在一些實施例中，信號處理器140可以基於步驟220中獲得的第一空間位置的雜訊的參數資訊（例如，幅值資訊、相位資訊等）產生降噪信號。例如，降雜訊波的相位可以與第一空間位置的雜訊的相位近似相反。又例如，降雜訊波的相位可以與第一空間位置的雜訊的相位近似相反，並且該降噪信號的幅值大小可以與第一空間位置的雜訊的幅值大小近似相等。在一些實施例中，揚聲器陣列150可以基於信號處理器140產生的降噪信號輸出降雜訊波，降雜訊波可以與第一空間位置的雜訊相互抵消。在一些實施例中，第一空間位置的雜訊可以近似視為使用者耳道位置的雜訊，因此，降噪信號與第一空間位置的雜訊相互抵消，可以近似為傳遞至使用者耳道的環境雜訊被消除。在一些實施例中，開放式耳機100可以通過圖2中所描述的方法步驟消除使用者耳道位置處的環境雜訊，實現開放式耳機100的主動降噪。In some embodiments, this step may be performed by the signal processor 140 . In some embodiments, the signal processor 140 may generate the noise-reduced signal based on the parameter information (eg, amplitude information, phase information, etc.) of the noise at the first spatial position obtained in step 220 . For example, the phase of the denoised wave may be approximately opposite to the phase of the noise at the first spatial location. For another example, the phase of the noise reduction wave may be approximately opposite to the phase of the noise at the first spatial position, and the amplitude of the noise reduction signal may be approximately equal to the amplitude of the noise at the first spatial position. In some embodiments, the speaker array 150 may output a noise reduction wave based on the noise reduction signal generated by the signal processor 140, and the noise reduction wave may cancel out the noise at the first spatial position. In some embodiments, the noise at the first spatial position can be approximately regarded as the noise at the position of the user's ear canal. Therefore, the noise reduction signal and the noise at the first spatial position cancel each other, and can be approximately transmitted to the user's ear The ambient noise of the channel is eliminated. In some embodiments, the open earphone 100 can eliminate the environmental noise at the position of the user's ear canal through the method steps described in FIG. 2 , so as to realize the active noise reduction of the open earphone 100 .

應當注意的是，上述有關流程200的描述僅僅是為了示例和說明，而不限定本說明書的適用範圍。對於所屬技術領域中具有通常知識者來說，在本說明書的指導下可以對流程200進行各種修正和改變。然而，這些修正和改變仍在本說明書的範圍之內。例如，還可以增加、省略或合併流程200中的步驟，比如，還可以對環境雜訊進行信號處理（例如，濾波處理等）。It should be noted that the above description about the process 200 is only for illustration and description, and does not limit the scope of application of this specification. For those skilled in the art, various modifications and changes can be made to the process 200 under the guidance of this description. However, such modifications and changes are still within the scope of this specification. For example, steps in the process 200 may also be added, omitted or combined, for example, signal processing (for example, filtering processing, etc.) may also be performed on environmental noise.

繼續參照圖1，在一些實施例中，開放式耳機100還可以包括第二麥克風陣列160。第二麥克風陣列160可以位於殼體結構110的內部。在一些實施例中，第二麥克風陣列160至少部分區別於第一麥克風陣列130。例如，第二麥克風陣列160中的麥克風與第一麥克風陣列130中的麥克風的數量、種類、位置、排布方式等中的一種或多種不同。在一些實施例中，例如，第一麥克風陣列130中麥克風排布方式可以是線形的，第二麥克風陣列160中麥克風的排布方式可以是圓形的。又例如，第二麥克風陣列160中的麥克風可以只包括氣傳導麥克風，第一麥克風陣列130中可以包括氣傳導麥克風和骨傳導麥克風。在一些實施例中，第二麥克風陣列160中的麥克風可以是第一麥克風陣列130中包括的任意一個或多個麥克風，第二麥克風陣列160中的麥克風也可以獨立於第一麥克風陣列130的麥克風。在一些實施例中，第二麥克風陣列160可以被配置為拾取環境雜訊和揚聲器陣列150輸出的降雜訊波。第二麥克風陣列160拾取的環境雜訊和降雜訊波可以傳遞至信號處理器140。在一些實施例中，信號處理器140可以基於第二麥克風陣列160拾取的聲音信號更新降噪信號。例如，信號處理器140可以根據第二麥克風陣列160拾取的聲音信號的參數資訊（例如，頻率資訊、幅值資訊、相位資訊等）調整降噪信號的參數資訊，使得調整後的降噪信號的幅值能夠與第一空間位置的雜訊的幅值更加吻合，或調整後的降噪信號的相位能夠與第一空間位置的雜訊的相位的反相位更加吻合，從而使得更新後的降雜訊波與第一空間位置的雜訊可以更全面的抵消。關於基於第二麥克風陣列160拾取的聲音信號更新降噪信號的具體內容可以參考本說明書圖3及其相關描述。Continuing to refer to FIG. 1 , in some embodiments, the open-back earphone 100 may further include a second microphone array 160 . The second microphone array 160 may be located inside the housing structure 110 . In some embodiments, the second microphone array 160 is at least partially distinct from the first microphone array 130 . For example, the microphones in the second microphone array 160 are different from the microphones in the first microphone array 130 in one or more of quantity, type, position, arrangement, and the like. In some embodiments, for example, the arrangement of the microphones in the first microphone array 130 may be linear, and the arrangement of the microphones in the second microphone array 160 may be circular. For another example, the microphones in the second microphone array 160 may only include air conduction microphones, and the first microphone array 130 may include air conduction microphones and bone conduction microphones. In some embodiments, the microphones in the second microphone array 160 can be any one or more microphones included in the first microphone array 130, and the microphones in the second microphone array 160 can also be independent of the microphones in the first microphone array 130. . In some embodiments, the second microphone array 160 may be configured to pick up environmental noise and the noise reduction wave output by the speaker array 150 . The ambient noise and noise reduction waves picked up by the second microphone array 160 can be transmitted to the signal processor 140 . In some embodiments, the signal processor 140 may update the noise reduction signal based on the sound signal picked up by the second microphone array 160 . For example, the signal processor 140 can adjust the parameter information of the noise reduction signal according to the parameter information (for example, frequency information, amplitude information, phase information, etc.) of the sound signal picked up by the second microphone array 160, so that the adjusted noise reduction signal The amplitude can be more consistent with the amplitude of the noise at the first spatial position, or the phase of the adjusted noise reduction signal can be more consistent with the opposite phase of the phase of the noise at the first spatial position, so that the updated noise reduction The noise wave and the noise of the first spatial position can be more comprehensively canceled. For specific content about updating the noise reduction signal based on the sound signal picked up by the second microphone array 160, reference may be made to FIG. 3 and its related descriptions in this specification.

圖3是根據本發明一些實施例提供的更新降噪信號的示例性流程圖。如圖3所示，流程300可以包括：Fig. 3 is an exemplary flow chart of updating a noise reduction signal according to some embodiments of the present invention. As shown in Figure 3, the process 300 may include:

在步驟310中，基於第二麥克風陣列160拾取的聲音信號，對使用者耳道處的聲場進行估計。In step 310 , based on the sound signal picked up by the second microphone array 160 , the sound field at the ear canal of the user is estimated.

在一些實施例中，該步驟可以由信號處理器140執行。在一些實施例中，第二麥克風陣列160拾取的聲音信號包括環境雜訊和揚聲器陣列150輸出的降雜訊波。在一些實施例中，環境雜訊和揚聲器陣列150輸出的降雜訊波相抵消後，使用者耳道附近可能仍會存在一部分未相互抵消掉的聲音信號，這些未抵消掉的聲音信號可以是殘餘的環境雜訊和/殘餘的降雜訊波，因此使得環境雜訊和降雜訊波抵消後使用者耳道處仍存在一定的雜訊。信號處理器140可以根據第二麥克風陣列160拾取的聲音信號（例如，環境雜訊、降雜訊波）進行信號處理，得到使用者耳道處的聲場的參數資訊，例如，頻率資訊、幅值資訊和相位資訊等，從而實現對使用者耳道處的聲場估計。In some embodiments, this step may be performed by the signal processor 140 . In some embodiments, the sound signal picked up by the second microphone array 160 includes environmental noise and the noise reduction wave output by the speaker array 150 . In some embodiments, after the environmental noise and the noise reduction wave output by the speaker array 150 are canceled, there may still be some uncancelled sound signals near the user's ear canal, and these uncancelled sound signals can be The residual environmental noise and/or the residual noise reduction wave, therefore, after the cancellation of the environmental noise and the noise reduction wave, there is still a certain amount of noise in the user's ear canal. The signal processor 140 can perform signal processing according to the sound signal picked up by the second microphone array 160 (for example, environmental noise, noise reduction wave), to obtain parameter information of the sound field at the user's ear canal, for example, frequency information, amplitude Value information and phase information, etc., so as to realize the estimation of the sound field at the user's ear canal.

在步驟320中，根據使用者耳道處的聲場，調整降噪信號的參數資訊。In step 320, the parameter information of the noise reduction signal is adjusted according to the sound field at the ear canal of the user.

在一些實施例中，步驟320可以由信號處理器140執行。在一些實施例中，信號處理器140可以根據步驟310中得到的使用者耳道處的聲場的參數資訊，調整降噪信號的參數資訊（例如，頻率資訊、幅值資訊和/或相位資訊），使得更新後降噪信號的幅值資訊、頻率資訊與使用者耳道處的環境雜訊的幅值資訊、頻率資訊更加吻合，且更新後降噪信號的相位資訊與使用者耳道處的環境雜訊的反相位資訊更加吻合，從而使得更新後降噪信號可以更加精準的消除環境雜訊。In some embodiments, step 320 may be performed by the signal processor 140 . In some embodiments, the signal processor 140 can adjust the parameter information (for example, frequency information, amplitude information and/or phase information) of the noise reduction signal according to the parameter information of the sound field at the user's ear canal obtained in step 310 ), so that the amplitude information and frequency information of the updated noise reduction signal are more consistent with the amplitude information and frequency information of the environmental noise at the user's ear canal, and the phase information of the updated noise reduction signal is more consistent with the user's ear canal The anti-phase information of the environmental noise is more consistent, so that the updated noise reduction signal can more accurately eliminate the environmental noise.

應當注意的是，上述有關流程300的描述僅僅是為了示例和說明，而不限定本說明書的適用範圍。對於所屬技術領域中具有通常知識者來說，在本說明書的指導下可以對流程300進行各種修正和改變。然而，這些修正和改變仍在本說明書的範圍之內。例如，拾取使用者耳道處的聲場的麥克風陣列不限於第二麥克風陣列，還可以包括其它麥克風陣列，例如第三麥克風陣列、第四麥克風陣列等，可以將多個麥克風陣列拾取的使用者耳道處的聲場的相關參數資訊以平均或加權演算法等方式對使用者耳道處的聲場進行估計。It should be noted that the above description about the process 300 is only for illustration and description, and does not limit the scope of application of this description. For those skilled in the art, various modifications and changes can be made to the process 300 under the guidance of this description. However, such modifications and changes are still within the scope of this specification. For example, the microphone array that picks up the sound field at the user's ear canal is not limited to the second microphone array, and may also include other microphone arrays, such as the third microphone array, the fourth microphone array, etc. The relevant parameter information of the sound field at the ear canal is used to estimate the sound field at the user's ear canal by means of averaging or weighting algorithm.

在一些實施例中，為了更加精準地獲取使用者耳道處的聲場，第二麥克風陣列160包括一個比第一麥克風陣列130中任意麥克風更加靠近使用者耳道的麥克風。在一些實施例中，第一麥克風陣列130拾取的聲音信號是環境雜訊，第二麥克風陣列160拾取的聲音信號是環境雜訊和降雜訊波。在一些實施例中，信號處理器140可以根據第二麥克風陣列160拾取的聲音信號對使用者耳道處的聲場進行估計，以更新降噪信號。第二麥克風陣列160需要對降噪信號與環境雜訊抵消後使用者耳道處的聲場進行監測，第二麥克風陣列160包括一個比第一麥克風陣列130中任意麥克風更加靠近使用者耳道的麥克風可以更加準確的表徵使用者聽到的聲音信號，通過第二麥克風陣列160的聲場進行估計以更新降噪信號，可以進一步提高降噪效果和使用者的聽覺體驗感。In some embodiments, in order to obtain the sound field at the user's ear canal more accurately, the second microphone array 160 includes a microphone that is closer to the user's ear canal than any microphone in the first microphone array 130 . In some embodiments, the sound signal picked up by the first microphone array 130 is environmental noise, and the sound signal picked up by the second microphone array 160 is the environmental noise and the noise reduction wave. In some embodiments, the signal processor 140 may estimate the sound field at the user's ear canal according to the sound signal picked up by the second microphone array 160, so as to update the noise reduction signal. The second microphone array 160 needs to monitor the sound field at the user's ear canal after the noise reduction signal and the environmental noise are cancelled. The second microphone array 160 includes a microphone closer to the user's ear canal than any microphone in the first microphone array 130 The microphone can more accurately represent the sound signal heard by the user, and the sound field of the second microphone array 160 is used to estimate the sound field to update the noise reduction signal, which can further improve the noise reduction effect and the user's sense of hearing experience.

在一些實施例中，第一麥克風陣列130和第二麥克風陣列160的排布方式可以相同。這裡的第一麥克風陣列130和第二麥克風陣列160的排布方式相同可以理解為二者的排布形狀近似相同。圖4A是根據本發明說明書一些實施例提供的第一麥克風陣列和第二麥克風陣列的排布方式和位置關係的示例性分佈圖。如圖4A所示，第一麥克風陣列130以半圓形排布的排布方式設置於人耳處，第二麥克風陣列160也是以半圓形排布的排布方式設置於人耳處，第二麥克風陣列160中的麥克風比第一麥克風陣列130中任意麥克風更加靠近使用者耳道。在一些實施例中，第一麥克風陣列130中的麥克風可以與第二麥克風陣列160中的麥克風獨立設置。例如，圖4A中的第一麥克風陣列130中的麥克風以半圓形的排布方式進行排布，第二麥克風陣列160中的麥克風以半圓形的排布方式進行排布，第一麥克風陣列130中的麥克風與第二麥克風陣列160中的麥克風未出現重疊或交叉。在一些實施例中，第一麥克風陣列130中的麥克風可以與第二麥克風陣列160中的麥克風可以部分重疊或交叉。In some embodiments, the arrangements of the first microphone array 130 and the second microphone array 160 may be the same. Here, the same arrangement manner of the first microphone array 130 and the second microphone array 160 may be understood as that the arrangement shapes of the two are approximately the same. Fig. 4A is an exemplary distribution diagram of the arrangement and positional relationship of the first microphone array and the second microphone array provided according to some embodiments of the specification of the present invention. As shown in FIG. 4A, the first microphone array 130 is arranged at the human ear in a semicircular arrangement, and the second microphone array 160 is also arranged at the human ear in a semicircular arrangement. The microphones in the second microphone array 160 are closer to the user's ear canal than any microphone in the first microphone array 130 . In some embodiments, the microphones in the first microphone array 130 may be independently configured from the microphones in the second microphone array 160 . For example, the microphones in the first microphone array 130 in FIG. 4A are arranged in a semicircular arrangement, the microphones in the second microphone array 160 are arranged in a semicircular arrangement, and the first microphone array The microphones in 130 do not overlap or cross the microphones in the second microphone array 160 . In some embodiments, the microphones in the first microphone array 130 may partially overlap or cross the microphones in the second microphone array 160 .

在一些實施例中，第一麥克風陣列130和第二麥克風陣列160的排布方式可以不同。圖4B是根據本發明說明書另一些實施例提供的第一麥克風陣列和第二麥克風陣列的排布方式的示例性分佈圖。如圖4B所示，第一麥克風陣列130以半圓形排布的排布方式設置於人耳處，第二麥克風陣列160以線形排布的排布方式是設置於人耳處，其中，第二麥克風陣列160中的麥克風相對於第一麥克風陣列130中任意麥克風更加靠近使用者耳道。在一些實施例中，第一麥克風陣列130和第二麥克風陣列160還可以為組合排布的方式進行排列。例如，圖4B中第二麥克風陣列160包括線形排布的部分和半圓形排布的部分，第二麥克風陣列160中半圓形排布的部分為第一麥克風陣列130的組成部分。需要說明的是，第一麥克風陣列130和第二麥克風陣列160的排布方式不限於圖4A和圖4B中所示的半圓形和線形，這裡的半圓形和線形只出於說明的目的，關於麥克風陣列的排布方式可以參見本說明書圖8及其相關描述。In some embodiments, the arrangements of the first microphone array 130 and the second microphone array 160 may be different. Fig. 4B is an exemplary distribution diagram of an arrangement manner of the first microphone array and the second microphone array provided according to some other embodiments of the specification of the present invention. As shown in FIG. 4B, the first microphone array 130 is arranged at the human ear in a semicircular arrangement, and the second microphone array 160 is arranged at the human ear in a linear arrangement. The microphones in the second microphone array 160 are closer to the user's ear canal than any microphone in the first microphone array 130 . In some embodiments, the first microphone array 130 and the second microphone array 160 may also be arranged in combination. For example, in FIG. 4B , the second microphone array 160 includes linearly arranged parts and semicircularly arranged parts, and the semicircularly arranged part in the second microphone array 160 is a component of the first microphone array 130 . It should be noted that the arrangement of the first microphone array 130 and the second microphone array 160 is not limited to the semicircular and linear shapes shown in FIGS. 4A and 4B , and the semicircular and linear shapes here are only for illustration purposes. , for the arrangement of the microphone array, please refer to FIG. 8 and related descriptions in this specification.

在一些實施例中，還可以根據使用者的手動輸入更新降噪信號。例如，在一些實施例中，使用者在比較嘈雜的外界環境中佩戴開放式耳機100進行音樂播放時，使用者自身的聽覺體驗效果不理想，使用者可以根據自身的聽覺效果手動調整降噪信號的參數資訊（例如，頻率資訊、相位資訊或者幅值資訊）。又例如，特殊使用者（例如，聽力受損使用者或者年齡較大使用者）在使用開放式耳機100的過程中，特殊使用者的聽力能力與普通使用者的聽力能力存在差異，開放式耳機100本身產生的降雜訊波與特殊人群的聽力能力不匹配，導致特殊使用者的聽覺體驗較差。這種情況下，特殊使用者可以根據自身的聽覺效果手動調整降噪信號的頻率資訊、相位資訊或者幅值資訊，從而更新降噪信號以提高特殊使用者的聽覺體驗。在一些實施例中，使用者手動調整降噪信號的方式可以是通過開放式耳機100上的鍵位進行手動調整。在一些實施例中，使用者手動調整降噪信號的方式也可以是通過終端設備進行手動輸入調整。在一些實施例中，開放式耳機100或者與開放式耳機100通信連接的手機、平板電腦、電腦等電子產品上可以顯示使用者耳道處的聲場，並回饋給使用者建議的降噪信號的頻率資訊範圍、幅值資訊範圍或相位資訊範圍，使用者可以根據建議的降噪信號的參數資訊進行手動輸入，然後再根據自身的聽覺體驗情況進行參數資訊的微調。In some embodiments, the noise reduction signal can also be updated according to the user's manual input. For example, in some embodiments, when the user wears the open earphone 100 to play music in a relatively noisy external environment, the user's own hearing experience is not ideal, and the user can manually adjust the noise reduction signal according to his own hearing effect. Parameter information (for example, frequency information, phase information, or amplitude information) of . For another example, when a special user (for example, a hearing-impaired user or an older user) uses the open earphone 100, the hearing ability of the special user is different from that of an ordinary user. The noise reduction wave generated by 100 itself does not match the hearing ability of special people, resulting in poor hearing experience for special users. In this case, the special user can manually adjust the frequency information, phase information or amplitude information of the noise reduction signal according to his own hearing effect, so as to update the noise reduction signal to improve the hearing experience of the special user. In some embodiments, the way for the user to manually adjust the noise reduction signal may be through the keys on the open earphone 100 for manual adjustment. In some embodiments, the way for the user to manually adjust the noise reduction signal may also be to manually input and adjust through the terminal device. In some embodiments, the sound field at the user's ear canal can be displayed on the open earphone 100 or an electronic product such as a mobile phone, a tablet computer, or a computer that is communicatively connected with the open earphone 100, and the noise reduction signal suggested by the user can be fed back. The frequency information range, amplitude information range or phase information range, users can manually input the parameter information of the proposed noise reduction signal, and then fine-tune the parameter information according to their own hearing experience.

在一些實施例中，信號處理器140基於拾取的環境雜訊估計第一空間位置的雜訊可以包括：根據拾取的環境雜訊進行信號分離，獲取環境雜訊對應的參數資訊，基於環境雜訊對應的參數資訊產生降噪信號。在一些實施例中，麥克風陣列（例如，第一麥克風陣列130、第二麥克風陣列160）拾取的環境雜訊可以包括雜訊、使用者人聲、揚聲器陣列150輸出的音訊等。在一些實施例中，揚聲器陣列150輸出的音訊可以包括揚聲器陣列150輸出的通話遠端音訊、設備媒體音訊、降雜訊波等。在一些實施例中，信號處理器140可以對麥克風陣列拾取的環境雜訊進行信號分析，將環境雜訊所包括的各種聲音信號進行信號分離，得到雜訊、使用者人聲、降雜訊波、設備媒體音訊、通話遠端音訊等多種單一的聲音信號。具體地，信號處理器140可以根據雜訊、使用者人聲、降雜訊波、設備媒體音訊、通話遠端音訊等在空間、時域、頻域等不同維度的統計分佈特性及結構化特徵，自我調整調整濾波器組參數，估計環境雜訊中各個聲音信號（例如，雜訊、使用者人聲、降雜訊波、設備媒體音訊、通話遠端音訊等）的參數資訊，並根據不同的參數資訊完成信號分離過程。例如，在一些實施例中，麥克風陣列可以將拾取的雜訊、使用者人聲、降雜訊波分別轉換成對應的第一信號、第二信號、第三信號。信號處理器140獲取第一信號、第二信號、第三信號在空間差異（例如，信號所處位置）、時域差異（例如，延遲）、頻域差異（例如，幅值、相位），並根據三種維度上的差異將第一信號、第二信號、第三信號進行信號分離，得到相對純淨的第一信號、第二信號、第三信號。分離後的第一信號、第二信號、第三信號分別對應純淨的雜訊、使用者人聲、降雜訊波，信號處理器140完成信號分離過程。在一些實施例中，信號處理器140可以根據信號分離得到的雜訊、降雜訊波、設備媒體音訊、通話遠端音訊等的參數資訊更新降雜訊波，更新後的降雜訊波通過揚聲器陣列150輸出。In some embodiments, the signal processor 140 estimating the noise at the first spatial position based on the picked-up environmental noise may include: performing signal separation according to the picked-up environmental noise, obtaining parameter information corresponding to the environmental noise, and based on the environmental noise The corresponding parameter information generates a noise-reduced signal. In some embodiments, the environmental noise picked up by the microphone array (eg, the first microphone array 130 and the second microphone array 160 ) may include noise, user voice, audio output from the speaker array 150 , and the like. In some embodiments, the audio output by the speaker array 150 may include remote audio, device media audio, and noise reduction waves output by the speaker array 150 . In some embodiments, the signal processor 140 can perform signal analysis on the environmental noise picked up by the microphone array, and separate various sound signals included in the environmental noise to obtain noise, user voice, noise reduction wave, A variety of single sound signals such as device media audio, call remote audio, etc. Specifically, the signal processor 140 can be based on the statistical distribution characteristics and structural characteristics of noise, user voice, noise reduction wave, device media audio, and call remote audio in different dimensions such as space, time domain, and frequency domain, Self-tuning Adjust the parameters of the filter bank, estimate the parameter information of each sound signal in the environmental noise (for example, noise, user voice, noise reduction wave, device media audio, call remote audio, etc.), and according to different parameters Information completes the signal separation process. For example, in some embodiments, the microphone array can convert the picked-up noise, the user's voice, and the noise reduction wave into corresponding first signals, second signals, and third signals, respectively. The signal processor 140 acquires the first signal, the second signal, and the third signal in terms of spatial difference (for example, signal location), time domain difference (for example, delay), and frequency domain difference (for example, amplitude, phase), and The first signal, the second signal, and the third signal are separated according to the differences in the three dimensions to obtain relatively pure first signal, second signal, and third signal. The separated first signal, second signal, and third signal respectively correspond to pure noise, user's voice, and noise-reduced wave, and the signal processor 140 completes the signal separation process. In some embodiments, the signal processor 140 can update the noise reduction wave according to the parameter information of the noise obtained from signal separation, the noise reduction wave, the device media audio, the call remote audio, etc., and the updated noise reduction wave is passed through Speaker array 150 output.

在一些實施例中，雜訊的結構化特徵可以包括雜訊分佈、雜訊強度、全域雜訊強度、雜訊率等，或其任意組合。在一些實施例中，雜訊強度可以指雜訊圖元的值，反映雜訊圖元中的雜訊幅度，因此，雜訊分佈可以反映圖像中具有不同雜訊強度的雜訊的概率密度。全域雜訊強度可以反映圖像中的平均雜訊強度或加權平均雜訊強度。雜訊率可以反映出雜訊分佈的分散程度。在一些實施例中，雜訊的統計分佈特性可以包括但不限於概率分佈密度、功率譜密度、自相關函數、概率密度函數、方差、數學期望等。在一些實施例中，通過信號分離得到的使用者人聲、設備媒體音訊、通話遠端音訊等也可以傳輸至通話遠端。例如，使用者佩戴開放式耳機100進行語音通話時，使用者人聲可以傳輸至通話遠端。In some embodiments, the structural features of noise may include noise distribution, noise intensity, global noise intensity, noise rate, etc., or any combination thereof. In some embodiments, the noise intensity may refer to the value of the noise primitive, which reflects the magnitude of the noise in the noise primitive. Therefore, the noise distribution may reflect the probability density of noise with different noise intensities in the image. . The global noise intensity can reflect the average noise intensity or the weighted average noise intensity in the image. The noise ratio can reflect the degree of dispersion of the noise distribution. In some embodiments, the statistical distribution characteristics of noise may include but not limited to probability distribution density, power spectral density, autocorrelation function, probability density function, variance, mathematical expectation and so on. In some embodiments, the user's voice, device media audio, and audio from the remote end of the call obtained through signal separation can also be transmitted to the remote end of the call. For example, when the user wears the open earphone 100 for a voice call, the user's voice can be transmitted to the remote end of the call.

圖5是根據本發明說明書一些實施例提供的估計第一空間位置的雜訊的示例性流程圖。如圖5所示，流程500可以包括：FIG. 5 is an exemplary flow chart of estimating noise at a first spatial location according to some embodiments of the present invention. As shown in Figure 5, the process 500 may include:

在步驟510中，確定一個或多個與拾取的環境雜訊有關的空間噪音源。In step 510, one or more spatial noise sources related to the picked up ambient noise are determined.

在一些實施例中，該步驟可以由信號處理器140執行。在一些實施例中，與環境雜訊有關的空間噪音源是指其聲波可傳遞至使用者耳道處或靠近使用者耳道處（例如，第一空間位置）的噪音源。在一些實施例中，空間噪音源可以為使用者身體不同方向（例如，前方、後方等）的噪音源。例如，使用者身體前方存在人群喧鬧雜訊，使用者身體左方存在車輛鳴笛雜訊，這種情況下，空間噪音源即為使用者身體前方的人群喧鬧噪音源和使用者身體左方的車輛鳴笛噪音源。在一些實施例中，第一麥克風陣列130可以拾取使用者身體各個方向的空間雜訊，並將空間雜訊轉化為電信號傳遞至信號處理器140，信號處理器140可以將空間雜訊對應的電信號進行信號分析，得到各個方向的空間雜訊的參數資訊（例如，方位資訊、幅值資訊、相位資訊等）。信號處理器140根據各個方向的空間雜訊的參數資訊確定各個方向的空間噪音源，例如，空間噪音源的方位、空間噪音源的相位且空間噪音源的幅值等。在一些實施例中，信號處理器140可以通過雜訊定位演算法確定空間噪音源。在一些實施例中，雜訊定位演算法可以包括波束形成、超解析度空間譜估計、到達時差等中的一種或多種。在一些實施例中，信號處理器140可以將拾取的環境雜訊按照特定的頻帶寬度（例如，每500 Hz作為一個頻帶）劃分為多個子帶，每個子帶可以分別對應不同的頻率範圍，並在至少一個子帶上確定與該子帶對應的空間噪音源。關於空間噪音源的定位方法具體可以參考本說明書其他地方，在此不做贅述。關於確定一個或多個與拾取的環境雜訊有關的空間噪音源的詳細描述可以參考本說明書圖6及其相關描述。In some embodiments, this step may be performed by the signal processor 140 . In some embodiments, the spatial noise source related to environmental noise refers to a noise source whose sound waves can be transmitted to the user's ear canal or close to the user's ear canal (eg, the first spatial position). In some embodiments, the spatial noise sources may be noise sources from different directions (eg, front, rear, etc.) of the user's body. For example, there is crowd noise noise in front of the user's body, and there is vehicle horn noise noise on the left side of the user's body. In this case, the spatial noise source is the crowd noise source in front of the user body and the noise source on the left side of the user body Vehicle horn noise source. In some embodiments, the first microphone array 130 can pick up the spatial noise in all directions of the user's body, convert the spatial noise into an electrical signal and transmit it to the signal processor 140, and the signal processor 140 can convert the spatial noise corresponding to Signal analysis is performed on the electrical signal to obtain parameter information of spatial noise in various directions (for example, orientation information, amplitude information, phase information, etc.). The signal processor 140 determines the spatial noise source in each direction according to the parameter information of the spatial noise in each direction, for example, the orientation of the spatial noise source, the phase of the spatial noise source, and the amplitude of the spatial noise source. In some embodiments, the signal processor 140 can determine the spatial noise source through a noise localization algorithm. In some embodiments, the noise localization algorithm may include one or more of beamforming, super-resolution spatial spectrum estimation, time difference of arrival, and the like. In some embodiments, the signal processor 140 may divide the picked-up environmental noise into multiple subbands according to a specific frequency bandwidth (for example, every 500 Hz is regarded as a frequency band), each subband may correspond to a different frequency range, and A spatial noise source corresponding to the subband is determined on at least one subband. Regarding the location method of the spatial noise source, reference may be made to other places in this specification, and details are not repeated here. For a detailed description of determining one or more spatial noise sources related to the picked-up environmental noise, reference may be made to FIG. 6 and its related descriptions in this specification.

在步驟520中，基於空間噪音源，估計第一空間位置的雜訊。In step 520, the noise at the first spatial location is estimated based on the spatial noise source.

在一些實施例中，該步驟可以由信號處理器140執行。在一些實施例中，信號處理器140可以基於步驟510中得到的位於使用者身體各個方向的空間噪音源的參數資訊（例如，頻率資訊、幅值資訊、相位資訊等），估計各個空間噪音源分別傳遞至第一空間位置的雜訊的參數資訊，從而估計出第一空間位置的雜訊。例如，在一些實施例中，使用者身體前方和後方分別有一個空間噪音源，信號處理器140可以根據前方空間噪音源的頻率資訊、相位資訊或幅值資訊，估計前方空間噪音源傳遞到第一空間位置時，前方空間噪音源的頻率資訊、相位資訊或幅值資訊。信號處理器140根據後方空間噪音源的頻率資訊、相位資訊或幅值資訊，估計後方空間噪音源傳遞到第一空間位置時，後方空間噪音源的頻率資訊、相位資訊或幅值資訊。信號處理器140基於前方空間噪音源的頻率資訊、相位資訊或幅值資訊和後方空間噪音源的頻率資訊、相位資訊或幅值資訊，估計第一空間位置的雜訊資訊，從而估計第一空間位置的雜訊。在一些實施例中，在一些實施例中，可以通過特徵提取的方法從麥克風陣列拾取的空間噪音源的頻率回應曲線提取空間噪音源的參數資訊。在一些實施例中，提取空間噪音源的參數資訊的方法可以包括但不限於主成分分析（Principal Components Analysis, PCA）、獨立成分分析（Independent Component Algorithm, ICA）、線性判別分析（Linear Discriminant Analysis, LDA）、奇異值分解（Singular Value Decomposition, SVD）等。In some embodiments, this step may be performed by the signal processor 140 . In some embodiments, the signal processor 140 can estimate each spatial noise source based on the parameter information (for example, frequency information, amplitude information, phase information, etc.) The parameter information of the noise at the first spatial position is respectively transmitted to estimate the noise at the first spatial position. For example, in some embodiments, there is a spatial noise source at the front and rear of the user's body, and the signal processor 140 can estimate that the front spatial noise source is transmitted to the second For a spatial position, the frequency information, phase information or amplitude information of the front spatial noise source. The signal processor 140 estimates the frequency information, phase information or amplitude information of the rear spatial noise source when the rear spatial noise source transmits to the first spatial position according to the frequency information, phase information or amplitude information of the rear spatial noise source. The signal processor 140 estimates the noise information at the first spatial position based on the frequency information, phase information or amplitude information of the front spatial noise source and the frequency information, phase information or amplitude information of the rear spatial noise source, thereby estimating the first spatial location noise. In some embodiments, in some embodiments, the parameter information of the spatial noise source may be extracted from the frequency response curve of the spatial noise source picked up by the microphone array through a feature extraction method. In some embodiments, methods for extracting parameter information of spatial noise sources may include but not limited to principal component analysis (Principal Components Analysis, PCA), independent component analysis (Independent Component Algorithm, ICA), linear discriminant analysis (Linear Discriminant Analysis, LDA), singular value decomposition (Singular Value Decomposition, SVD), etc.

在一些實施例中，確定一個或多個與拾取的環境雜訊有關的空間噪音源可以包括通過波束形成、超解析度空間譜估計或到達時差中的一種或多種方式定位一個或多個空間噪音源。波束形成定位方式是一種基於最大輸出功率的可控波束形成的聲源定位方法。在一些實施例中，波束形成聲源定位方法可以將麥克風陣列中的各個麥克風陣元拾取的聲音信號進行加權求和形成波束，通過搜索空間噪音源的可能位置來引導該波束，修改權值使得麥克風陣列的輸出信號功率最大。需要說明的是，波束形成聲源定位方法既可以在時域中使用，也可以在頻域中使用。波束形成在時域中的時間平移等價於在頻域中的相位延遲。在一些實施例中，超解析度空間譜估計的聲源定位方法可以包括自回歸AR 模型、最小方差譜估計（MV）和特徵值分解方法（例如，Music 演算法）等，這些方法都可以通過獲取麥克風陣列的聲音信號來計算空間譜的相關矩陣，並對空間噪音源的方向進行有效估計。在一些實施例中，到達時差聲源定位方法可以先進行聲達時間差估計，並從中獲取麥克風陣列中陣元間的聲延遲(TDOA)，再利用獲取的聲達時間差，結合已知的麥克風陣列的空間位置進一步定位出空間噪音源的位置。In some embodiments, determining the one or more sources of spatial noise related to the picked up environmental noise may include localizing the one or more spatial noise sources by one or more of beamforming, super-resolution spatial spectral estimation, or time difference of arrival source. The beamforming localization method is a sound source localization method based on the controllable beamforming of the maximum output power. In some embodiments, the beamforming sound source localization method can perform weighted summation of the sound signals picked up by each microphone array element in the microphone array to form a beam, guide the beam by searching for possible locations of spatial noise sources, and modify the weights so that The output signal power of the microphone array is maximum. It should be noted that the beamforming sound source localization method can be used in both the time domain and the frequency domain. The time shift of beamforming in the time domain is equivalent to the phase delay in the frequency domain. In some embodiments, the sound source localization method of super-resolution spatial spectrum estimation can include autoregressive AR model, minimum variance spectrum estimation (MV) and eigenvalue decomposition method (for example, Music algorithm), etc., these methods can be obtained by Acquire the sound signal of the microphone array to calculate the correlation matrix of the spatial spectrum, and effectively estimate the direction of the spatial noise source. In some embodiments, the time difference of arrival sound source localization method can firstly estimate the time difference of arrival, and obtain the acoustic delay (TDOA) between array elements in the microphone array, and then use the obtained time difference of arrival, combined with the known microphone array The spatial location of the noise further locates the location of the spatial noise source.

為了更加清楚的說明空間噪音源的定位原理，下面以波束形成聲源定位方法為例具體說明空間噪音源的定位是如何實現的。以麥克風陣列為直線形陣列作為示例，空間噪音源可以為遠場聲源，此時認為空間噪音源入射到麥克風陣列的入射聲波是平行的。在平行的聲場中，空間噪音源入射聲波的入射角度與麥克風陣列（例如，第一麥克風陣列130或第二麥克風陣列160）中的麥克風平面垂直時，入射聲波可以同時達到麥克風陣列（例如，第一麥克風陣列130或第二麥克風陣列160）中的各個麥克風。在一些實施例中，平行聲場中的空間噪音源入射聲波的入射角度與麥克風陣列（例如，第一麥克風陣列130或第二麥克風陣列160）中的麥克風平面不垂直時，入射聲波到達麥克風陣列（例如，第一麥克風陣列130或第二麥克風陣列160）中的每個麥克風會有延時，該延時可以由入射角度決定。在一些實施例中，不同的入射角度，疊加之後的雜訊波形強度是不一樣的。例如，入射角度為0°時，雜訊信號強度較弱，入射角度為45°時，雜訊信號強度最強。入射角度不同時，雜訊波形疊加後的波形疊加強度不同，由此使得麥克風陣列具有極性，從而可以得到麥克風陣列的極性圖。在一些實施例中，麥克風陣列（例如，第一麥克風陣列130或第二麥克風陣列160）可以是一個方向陣，該方向陣的指向性可以通過時域演算法或頻域相位延遲演算法實現，例如，延遲、疊加等。在一些實施例中，通過控制不同的延遲，可以實現不同方向的指向。在一些實施例中，方向陣指向可控相當於一個空間濾波器，先把雜訊定位區域進行網格劃分，再通過各個網格點的延遲時間對各個麥克風進行時域延遲，最終將各個麥克風的時域延遲疊加起來，計算得到每個網格的聲壓，從而得到每個網格的相對聲壓，最終實現空間噪音源定位。In order to illustrate the principle of spatial noise source localization more clearly, the beamforming sound source localization method is used as an example to illustrate how the spatial noise source localization is realized. Taking the microphone array as a linear array as an example, the spatial noise source may be a far-field sound source. In this case, it is considered that the incident sound waves incident on the microphone array by the spatial noise source are parallel. In a parallel sound field, when the incident angle of the incident sound wave from the spatial noise source is perpendicular to the plane of the microphones in the microphone array (for example, the first microphone array 130 or the second microphone array 160), the incident sound waves can simultaneously reach the microphone array (for example, Each microphone in the first microphone array 130 or the second microphone array 160). In some embodiments, when the incident angle of the incident sound wave from the spatial noise source in the parallel sound field is not perpendicular to the microphone plane in the microphone array (for example, the first microphone array 130 or the second microphone array 160), the incident sound wave reaches the microphone array Each microphone in (for example, the first microphone array 130 or the second microphone array 160 ) has a delay, which may be determined by the angle of incidence. In some embodiments, for different incident angles, the intensity of the noise waveform after superimposition is different. For example, when the incident angle is 0°, the noise signal strength is weak, and when the incident angle is 45°, the noise signal strength is the strongest. When the incident angles are different, the waveform superposition intensity after the noise waveform superposition is different, which makes the microphone array have polarity, so that the polarity diagram of the microphone array can be obtained. In some embodiments, the microphone array (for example, the first microphone array 130 or the second microphone array 160) can be a directional array, and the directivity of the directional array can be realized by a time domain algorithm or a frequency domain phase delay algorithm, For example, delay, overlay, etc. In some embodiments, pointing in different directions can be achieved by controlling different delays. In some embodiments, the controllable direction of the direction array is equivalent to a spatial filter. Firstly, the noise location area is divided into grids, and then each microphone is delayed in time domain by the delay time of each grid point, and finally each microphone The time domain delay is superimposed to calculate the sound pressure of each grid, so as to obtain the relative sound pressure of each grid, and finally realize the location of the spatial noise source.

應當注意的是，上述有關流程500的描述僅僅是為了示例和說明，而不限定本說明書的適用範圍。對於所屬技術領域中具有通常知識者來說，在本說明書的指導下可以對流程500進行各種修正和改變。例如，流程500還可以包括對空間噪音源進行定位，提取空間噪音源的參數資訊等。又例如，步驟510和步驟520可以合併為一個步驟。然而，這些修正和改變仍在本說明書的範圍之內。It should be noted that the above description about the process 500 is only for illustration and description, and does not limit the scope of application of this specification. For those skilled in the art, various modifications and changes can be made to the process 500 under the guidance of this description. For example, the process 500 may also include locating the spatial noise source, extracting parameter information of the spatial noise source, and the like. For another example, step 510 and step 520 may be combined into one step. However, such modifications and changes are still within the scope of this specification.

圖6是根據本發明說明書一些實施例提供的確定空間噪音源的示例性流程圖。如圖6所示，流程600可以包括：Fig. 6 is an exemplary flow chart of determining a spatial noise source according to some embodiments of the present invention. As shown in Figure 6, the process 600 may include:

在步驟610中，將拾取的環境雜訊劃分為多個子帶，每個子帶對應不同的頻率範圍。In step 610, the picked-up environmental noise is divided into a plurality of sub-bands, and each sub-band corresponds to a different frequency range.

在一些實施例中，該步驟可以由信號處理器140執行。在一些實施例中，來自使用者身體不同方向的環境雜訊的頻率可以是不同的，信號處理器140在對環境雜訊信號進行信號處理時，可以將環境雜訊頻帶劃分為多個子帶，每個子帶對應不同的頻率範圍。這裡每個子帶對應的頻率範圍可以是預先設定好的頻率範圍，例如，80Hz-100Hz、100Hz-300Hz、300Hz-800Hz等。在一些實施例中，每個子帶中都包含了對應頻段的環境雜訊的參數資訊。例如，信號處理器140可以將拾取的環境雜訊劃分為80 Hz-100 Hz、100 Hz-300 Hz、300 Hz-800 Hz、800 Hz-1000 Hz四個子帶，這四個子帶中分別對應80 Hz-100 Hz、100 Hz-300 Hz、300 Hz-800 Hz、800 Hz-1000 Hz的環境雜訊的參數。In some embodiments, this step may be performed by the signal processor 140 . In some embodiments, the frequency of the environmental noise from different directions of the user's body may be different, and the signal processor 140 may divide the environmental noise frequency band into a plurality of sub-bands when performing signal processing on the environmental noise signal, Each subband corresponds to a different frequency range. Here, the frequency range corresponding to each subband may be a preset frequency range, for example, 80 Hz-100 Hz, 100 Hz-300 Hz, 300 Hz-800 Hz, and so on. In some embodiments, each subband includes parameter information of the environmental noise of the corresponding frequency band. For example, the signal processor 140 can divide the picked-up environmental noise into four sub-bands of 80 Hz-100 Hz, 100 Hz-300 Hz, 300 Hz-800 Hz, and 800 Hz-1000 Hz, and these four sub-bands correspond to 80 Parameters of environmental noise of Hz-100 Hz, 100 Hz-300 Hz, 300 Hz-800 Hz, 800 Hz-1000 Hz.

在步驟620中，在至少一個子帶上，確定與其對應的空間噪音源。In step 620, on at least one sub-band, the corresponding spatial noise source is determined.

在一些實施例中，該步驟可以由信號處理器140執行。在一些實施例中，信號處理器140可以對環境雜訊劃分的子帶進行信號分析，得到每個子帶對應的環境雜訊的參數資訊，並根據參數資訊確定與每個子帶對應的空間噪音源。例如，在300 Hz-800 Hz這個子帶上，信號處理器140可以獲取該子帶中包含的對應的環境雜訊的參數資訊（例如，頻率資訊、幅值資訊、相位資訊等），信號處理器140根據獲取的參數資訊確定與300 Hz-800 Hz這個子帶對應的空間噪音源。In some embodiments, this step may be performed by the signal processor 140 . In some embodiments, the signal processor 140 may perform signal analysis on the subbands divided by the environmental noise, obtain the parameter information of the environmental noise corresponding to each subband, and determine the spatial noise source corresponding to each subband according to the parameter information . For example, on the subband of 300 Hz-800 Hz, the signal processor 140 can obtain the parameter information (for example, frequency information, amplitude information, phase information, etc.) of the corresponding environmental noise contained in the subband, and the signal processing The unit 140 determines the spatial noise source corresponding to the subband of 300 Hz-800 Hz according to the obtained parameter information.

應當注意的是，上述有關流程600的描述僅僅是為了示例和說明，而不限定本說明書的適用範圍。對於所屬技術領域中具有通常知識者來說，在本說明書的指導下可以對流程600進行各種修正和改變。例如，將步驟610和步驟620進行合併。又例如，在流程600中增加其他步驟。然而，這些修正和改變仍在本說明書的範圍之內。It should be noted that the above description about the process 600 is only for example and description, and does not limit the scope of application of this specification. For those skilled in the art, various modifications and changes can be made to the process 600 under the guidance of this description. For example, step 610 and step 620 are combined. For another example, other steps are added in the process 600 . However, such modifications and changes are still within the scope of this specification.

圖7是根據本發明說明書一些實施例提供的確定空間噪音源的另一示例性流程圖。如圖7所示，流程700可以包括：Fig. 7 is another exemplary flow chart for determining a spatial noise source according to some embodiments of the present invention. As shown in FIG. 7, the process 700 may include:

在步驟710中，獲取使用者頭函數，使用者頭函數反映使用者頭部對聲音的反射或吸收情況。In step 710, the user's head function is obtained, which reflects the reflection or absorption of the sound by the user's head.

在一些實施例中，該步驟可以由信號處理器140執行。在一些實施例中，第一麥克風陣列130可以包括第一子麥克風陣列和第二子麥克風陣列，第一子麥克風陣列和第二子麥克風陣列分別位於使用者的左耳和右耳處。在一些實施例中，第一麥克風陣列130可以是雙邊模式排布，即第一子麥克風陣列和第二子麥克風陣列同時啟用的雙邊模式排布。在一些實施例中，第一子麥克風陣列位於使用者左耳位置，第二子麥克風陣列位於使用者右耳位置的雙邊模式排布時，在聲音信號的傳遞過程中，使用者頭部會對聲音信號進行反射或吸收，致使第一子麥克風陣列和第二子麥克風陣列對同一環境雜訊的拾取有所差異。在一些實施例中，信號處理器140可以基於第一子麥克風陣列拾取的環境雜訊的參數資訊與第二子麥克風陣列拾取的同一環境雜訊的參數資訊之間的差異，構建使用者頭函數，該使用者頭函數可以反映使用者頭部對聲音的反射和吸收情況。In some embodiments, this step may be performed by the signal processor 140 . In some embodiments, the first microphone array 130 may include a first sub-microphone array and a second sub-microphone array, and the first sub-microphone array and the second sub-microphone array are respectively located at the user's left ear and right ear. In some embodiments, the first microphone array 130 may be arranged in a double-sided mode, that is, a double-sided mode arrangement in which the first sub-microphone array and the second sub-microphone array are simultaneously enabled. In some embodiments, when the first sub-microphone array is located at the position of the user's left ear, and the second sub-microphone array is located at the position of the user's right ear, when the bilateral pattern is arranged, during the transmission of the sound signal, the user's head will The sound signal is reflected or absorbed, so that the first sub-microphone array and the second sub-microphone array pick up the noise of the same environment differently. In some embodiments, the signal processor 140 may construct the user head function based on the difference between the parameter information of the environmental noise picked up by the first sub-microphone array and the parameter information of the same environmental noise picked up by the second sub-microphone array , the user head function can reflect the reflection and absorption of sound by the user's head.

在步驟720中，在至少一個子帶上，結合第一子麥克風陣列拾取的環境雜訊，第二子麥克風陣列拾取的環境雜訊，且使用者頭函數，確定與其對應的空間噪音源。In step 720 , on at least one sub-band, the ambient noise picked up by the first sub-microphone array, the ambient noise picked up by the second sub-microphone array, and the user head function are used to determine the corresponding spatial noise source.

在一些實施例中，該步驟可以由信號處理器140執行。在一些實施例中，第一麥克風陣列130在雙邊模式下，第一子麥克風陣列拾取的環境雜訊信號的幅度資訊和相位資訊與第二子麥克風陣列拾取的環境雜訊信號的幅度資訊和相位資訊之間存在幅值差和相位差。信號處理器140可以根據第一子麥克風陣列拾取的環境雜訊、第二子麥克風陣列拾取的環境雜訊，且步驟710中信號處理器140獲取的使用者頭函數，在環境雜訊的至少一個子帶上進行頻點合成，即將頭函數作為先驗資訊，在環境雜訊的至少一個子帶上將第一子麥克風陣列拾取的對應子帶上的環境雜訊的頻點與第二子麥克風陣列拾取的對應子帶上的環境雜訊的頻點進行合成。完成頻點合成後的子帶中包含的參數資訊對應於重構的虛擬噪音源的參數資訊。信號處理器140基於重構的虛擬聲源的參數資訊確定空間噪音源，進而完成空間噪音源定位。In some embodiments, this step may be performed by the signal processor 140 . In some embodiments, when the first microphone array 130 is in bilateral mode, the amplitude information and phase information of the environmental noise signal picked up by the first sub-microphone array and the amplitude information and phase information of the environmental noise signal picked up by the second sub-microphone array There is an amplitude difference and a phase difference between the information. The signal processor 140 may, according to the environmental noise picked up by the first sub-microphone array, the environmental noise picked up by the second sub-microphone array, and the user head function obtained by the signal processor 140 in step 710, in at least one of the environmental noise Frequency point synthesis is performed on the sub-band, that is, the head function is used as prior information, and the frequency point of the environmental noise on the corresponding sub-band picked up by the first sub-microphone array is combined with the second sub-microphone on at least one sub-band of the environmental noise. The frequency points of the environmental noise on the corresponding sub-bands picked up by the array are synthesized. The parameter information contained in the sub-bands after frequency point synthesis corresponds to the parameter information of the reconstructed virtual noise source. The signal processor 140 determines the spatial noise source based on the reconstructed parameter information of the virtual sound source, and then completes the positioning of the spatial noise source.

在一些實施例中，第一麥克風陣列130也可以是單邊模式排布。例如，只啟用第一子麥克風陣列或者第二子麥克風陣列。在一些實施例中，第一麥克風陣列130為單邊模式排布，啟用位於使用者左耳處的第一子麥克風陣列時，信號處理器140可以將使用者頭函數作為先驗資訊，在環境雜訊的至少一個子帶上將第一子麥克風陣列拾取的對應子帶上的環境雜訊的頻點進行合成。完成頻點合成後的子帶中包含的參數資訊對應重構的虛擬噪音源參數資訊。信號處理器140基於重構的虛擬聲源的參數資訊確定空間噪音源，進而完成空間噪音源定位。In some embodiments, the first microphone array 130 may also be arranged in a single-sided mode. For example, only the first sub-microphone array or the second sub-microphone array is enabled. In some embodiments, the first microphone array 130 is arranged in a unilateral mode, and when the first sub-microphone array located at the user's left ear is activated, the signal processor 140 can use the user's head function as prior information, and in the environment In at least one sub-band of the noise, the frequency points of the environmental noise in the corresponding sub-band picked up by the first sub-microphone array are synthesized. The parameter information included in the sub-bands after frequency point synthesis corresponds to the reconstructed virtual noise source parameter information. The signal processor 140 determines the spatial noise source based on the reconstructed parameter information of the virtual sound source, and then completes the positioning of the spatial noise source.

在一些實施例中，第一子麥克風陣列可以拾取到達使用者左耳處的環境雜訊，信號處理器140還可以基於該環境雜訊參數資訊通過使用者頭函數估計出該環境雜訊到達使用者右耳處時的參數資訊。信號處理器140根據估計的環境雜訊到達使用者右耳處時的參數資訊，由此可以更加精準的完成空間噪音源定位。在一些實施例中，第一麥克風陣列130的單邊模式也可以是只設置一個子麥克風陣列，關於此類單邊模式下進行空間噪音源定位過程與只啟用第一子麥克風陣列（或第二子麥克風陣列）的單邊模式空間噪音源定位過程類似，在此不做贅述。In some embodiments, the first sub-microphone array can pick up the environmental noise reaching the user's left ear, and the signal processor 140 can also estimate the environmental noise reaching the user's head function based on the environmental noise parameter information. The parameter information of the patient's right ear. The signal processor 140 can locate the spatial noise source more accurately according to the estimated parameter information when the environmental noise reaches the user's right ear. In some embodiments, the unilateral mode of the first microphone array 130 may also be that only one sub-microphone array is set, regarding the spatial noise source localization process in such a unilateral mode and only enabling the first sub-microphone array (or the second The unilateral mode spatial noise source localization process of sub-microphone array) is similar, and will not be repeated here.

應當注意的是，上述有關流程700的描述僅僅是為了示例和說明，而不限定本說明書的適用範圍。對於所屬技術領域中具有通常知識者來說，在本說明書的指導下可以對流程300進行各種修正和改變。然而，這些修正和改變仍在本說明書的範圍之內。It should be noted that the above description about the process 700 is only for illustration and description, and does not limit the scope of application of this specification. For those skilled in the art, various modifications and changes can be made to the process 300 under the guidance of this specification. However, such modifications and changes are still within the scope of this specification.

在一些實施例中，第一子麥克風陣列或者第二子麥克風陣列的排布方式可以是規則幾何形狀的陣列。圖8A是根據本發明說明書一些實施例提供的第一子麥克風陣列的排布方式的示意圖。如圖8A所示，第一子麥克風陣列成線形陣列。在一些實施例中，第一子麥克風陣列或者第二子麥克風陣列的排布方式也可以是其他形狀的陣列，例如，圖8B是根據本發明說明書另一些實施例提供的第一子麥克風陣列的排布方式的示意圖。如圖8B所示，第一子麥克風陣列成十字形陣列。又例如，圖8C是根據本發明說明書另一些實施例提供的第一子麥克風陣列的排布方式的示意圖。如圖8C所示，第一子麥克風陣列成圓形陣列。需要說明的是，第一子麥克風陣列或者第二子麥克風陣列的排布方式不限於圖8A、圖8B、圖8C所示的線形陣列、十字形陣列、圓形陣列，也可以是其他形狀的陣列圖形，例如，三角形陣列、螺旋形陣列、平面陣列、立體陣列等，本說明書對此不做限定。需要說明的是，圖8A至8D中的每一條短實線可以視為一個麥克風或一組麥克風。在一些實施例中，每一條短實線為一組麥克風時，每組麥克風的數量可以相同或不同，每組麥克風的種類可以相同或不同，每組麥克風的朝向可以相同或不同，關於麥克風中種類、數量、朝向且間距可以根據實際應用情況進行適應性調整。In some embodiments, the arrangement of the first sub-microphone array or the second sub-microphone array may be an array of regular geometric shapes. Fig. 8A is a schematic diagram of an arrangement of a first sub-microphone array according to some embodiments of the present invention. As shown in FIG. 8A , the first sub-microphone array forms a linear array. In some embodiments, the arrangement of the first sub-microphone array or the second sub-microphone array can also be an array of other shapes. For example, FIG. Schematic diagram of the layout. As shown in FIG. 8B , the first sub-microphone array forms a cross-shaped array. For another example, FIG. 8C is a schematic diagram of an arrangement manner of the first sub-microphone array provided according to other embodiments of the specification of the present invention. As shown in FIG. 8C , the first sub-microphone array forms a circular array. It should be noted that the arrangement of the first sub-microphone array or the second sub-microphone array is not limited to the linear array, cross-shaped array, and circular array shown in FIG. 8A, FIG. 8B, and FIG. 8C, and may also be of other shapes. Array patterns, for example, triangular arrays, spiral arrays, planar arrays, three-dimensional arrays, etc., are not limited in this specification. It should be noted that each short solid line in FIGS. 8A to 8D can be regarded as a microphone or a group of microphones. In some embodiments, when each short solid line is a group of microphones, the number of each group of microphones can be the same or different, the types of each group of microphones can be the same or different, and the orientation of each group of microphones can be the same or different. The type, quantity, orientation and spacing can be adjusted adaptively according to actual application conditions.

在一些實施例中，第一子麥克風陣列或者第二子麥克風陣列的排布方式也可以是不規則幾何形狀的陣列。例如，圖8D是根據本發明說明書另一些實施例提供的第一子麥克風陣列的排布方式的示意圖。如圖8D所示，第一子麥克風陣列成不規則陣列。需要說明的是，第一子麥克風陣列或者第二子麥克風陣列的不規則形狀陣列排布不限於圖8D中所示的形狀，還可以是其他形狀的不規則圖形等陣列排布，例如，不規則多邊形等，本說明書對此不做限定。In some embodiments, the arrangement of the first sub-microphone array or the second sub-microphone array may also be an array of irregular geometric shapes. For example, FIG. 8D is a schematic diagram of an arrangement manner of the first sub-microphone array provided according to other embodiments of the specification of the present invention. As shown in FIG. 8D , the first sub-microphones are arrayed in an irregular array. It should be noted that the irregular shaped array arrangement of the first sub-microphone array or the second sub-microphone array is not limited to the shape shown in FIG. Regular polygons, etc., are not limited in this specification.

在一些實施例中，第一子麥克風陣列（或第二子麥克風陣列）中的麥克風之間可以是均勻分佈，這裡的均勻分佈是指第一子麥克風陣列（或第二子麥克風陣列）中的麥克風之間的間距相同。在一些實施例中，第一子麥克風陣列（或第二子麥克風陣列）中的麥克風也可以是非均勻分佈，這裡的非均勻分佈是指第一子麥克風陣列（或第二子麥克風陣列）中的麥克風之間的間距不同。關於子麥克風陣列中的麥克風陣元之間的間距可以根據實際情況做適應性調整，本說明書對此不做限定。In some embodiments, the microphones in the first sub-microphone array (or the second sub-microphone array) may be uniformly distributed, and the uniform distribution here refers to the first sub-microphone array (or the second sub-microphone array). The spacing between the microphones is the same. In some embodiments, the microphones in the first sub-microphone array (or the second sub-microphone array) may also be non-uniformly distributed, and the non-uniform distribution here refers to the microphones in the first sub-microphone array (or the second sub-microphone array). The spacing between the microphones varies. The distance between the microphone array elements in the sub-microphone array can be adjusted adaptively according to the actual situation, which is not limited in this specification.

圖9A是根據本發明說明書一些實施例提供的第一子麥克風陣列和第二子麥克風陣列的位置關係的示意圖。如圖9A所示，第一子麥克風陣列911位於使用者左耳處，第一子麥克風陣列911呈近似三角形排布。第二子麥克風陣列912位於使用者右耳處，第二子麥克風陣列912也呈近似三角形排布，第二子麥克風陣列912與第一子麥克風陣列911的排布方式相同且關於使用者頭部對稱分佈。參照圖9A，第一子麥克風陣列911沿陣列方向的延長線與第二子麥克風陣列912沿陣列方向的延長線相交，可以構成四邊形結構。Fig. 9A is a schematic diagram of the positional relationship between the first sub-microphone array and the second sub-microphone array according to some embodiments of the present invention. As shown in FIG. 9A , the first sub-microphone array 911 is located at the left ear of the user, and the first sub-microphone array 911 is approximately triangularly arranged. The second sub-microphone array 912 is located at the right ear of the user, and the second sub-microphone array 912 is also arranged in an approximately triangular shape. Symmetrical distribution. Referring to FIG. 9A , the extension line of the first sub-microphone array 911 along the array direction intersects the extension line of the second sub-microphone array 912 along the array direction to form a quadrilateral structure.

圖9B是根據本發明說明書另一些實施例提供的第一子麥克風陣列和第二子麥克風陣列的位置關係的示意圖。如圖9B所示，第一子麥克風陣列921位於使用者左耳處，第一子麥克風陣列921呈線形排布。第二子麥克風陣列922位於使用者右耳處，第二子麥克風陣列922呈近似三角形排布，第二子麥克風陣列922與第一子麥克風陣列921的排布方式不同且關於使用者頭部非對稱分佈。參照圖9B，第一子麥克風陣列921沿陣列方向的延長線與第二子麥克風陣列922沿陣列方向的延長線相交，可以構成三角形結構。Fig. 9B is a schematic diagram of the positional relationship between the first sub-microphone array and the second sub-microphone array according to other embodiments of the present invention. As shown in FIG. 9B , the first sub-microphone array 921 is located at the user's left ear, and the first sub-microphone array 921 is arranged in a line. The second sub-microphone array 922 is located at the right ear of the user. The second sub-microphone array 922 is arranged in an approximately triangular shape. The arrangement of the second sub-microphone array 922 is different from that of the first sub-microphone array 921. Symmetrical distribution. Referring to FIG. 9B , the extension line of the first sub-microphone array 921 along the array direction intersects the extension line of the second sub-microphone array 922 along the array direction to form a triangular structure.

在一些實施例中，第一子麥克風陣列921和第二子麥克風陣列922除了可以構成圖9A中所示的四邊形、圖9B中所示的三角形外，還可以構成八字形、圓形、橢圓形、環形、多邊形等規則和/或不規則形狀。第一子麥克風陣列和第二子麥克風陣列以特定的形狀或立體空間分佈，可以全方位的獲取使用者各個方向的環境雜訊，通過各麥克風獲取的環境雜訊的參數資訊可以對空間噪音源進行更加精準的定位，進而更加精準地模擬處使用者耳道處的雜訊聲場，以達到更好的降噪效果。第一子麥克風陣列和第二子麥克風陣列不同的排布方式具有不同的空域濾波性能。僅作為示例性說明，空域濾波性能可以包括主瓣寬度、旁瓣（也被稱為副瓣）寬度。主瓣寬度是指聲波輻射的最大輻射波束。旁瓣寬度是指除最大輻射波束之外的輻射波束。其中，主瓣寬度越窄，麥克風陣列解析度越高、指向性越好。旁瓣高度越低，麥克風陣列抗干擾性能越好，旁瓣高度越高，麥克風陣列抗干擾性能越差。例如，十字形陣列的波束圖對應的主瓣寬度比圓形、矩形或螺旋形的波陣圖對應的主瓣寬度要窄，也就是說同樣陣元個數的條件下，十字形陣列具有較高的空間解析度和更好指向性。而從旁瓣高度來看，十字形陣列的波束圖對應的旁瓣寬度比圓形、矩形或螺旋形的波陣圖對應的旁瓣寬度高，也就是說，十字形陣列的抗干擾能力較差。關於第一子麥克風陣列和第二子麥克風陣列的排布方式可以根據實際應用情況進行適應性調整，在此不做進一步限定。需要說明的是，圖9A和圖9B中的每一條短實線可以視為一個麥克風或一組麥克風。在一些實施例中，每一條短實線為一組麥克風時，每組麥克風的數量可以相同或不同，每組麥克風的種類可以相同或不同，每組麥克風的朝向可以相同或不同，關於麥克風中種類、數量、朝向且間距可以根據實際應用情況進行適應性調整。在一些實施例中，還可以通過合成孔徑、稀疏恢復、互素陣列等方法形成環境雜訊的空間超解析度圖像，該空間超解析度圖像可以用於反映環境雜訊的信號反射圖，以進一步提高空間噪音源的定位精度。在一些實施例中，基於空間噪音源的定位精度的回饋情況可以調整麥克風陣列（例如，第一麥克風陣列130、第二麥克風陣列160）的中麥克風的位置、間距、啟閉狀態等。In some embodiments, the first sub-microphone array 921 and the second sub-microphone array 922 may form a figure-eight, a circle, or an ellipse in addition to the quadrilateral shown in FIG. 9A and the triangle shown in FIG. 9B. , rings, polygons and other regular and/or irregular shapes. The first sub-microphone array and the second sub-microphone array are distributed in a specific shape or three-dimensional space, which can obtain the environmental noise in all directions of the user in all directions, and the parameter information of the environmental noise obtained by each microphone can be used to analyze the spatial noise source Perform more precise positioning, and then more accurately simulate the noise sound field at the user's ear canal to achieve better noise reduction effects. Different arrangements of the first sub-microphone array and the second sub-microphone array have different spatial filtering performances. By way of example only, spatial filtering performance may include main lobe width, side lobe (also referred to as sidelobe) width. The main lobe width refers to the largest radiation beam of acoustic radiation. Side lobe width refers to radiation beams other than the largest radiation beam. Among them, the narrower the width of the main lobe, the higher the resolution and the better the directivity of the microphone array. The lower the side lobe height, the better the anti-interference performance of the microphone array, and the higher the side lobe height, the worse the anti-interference performance of the microphone array. For example, the width of the main lobe corresponding to the beam pattern of the cross-shaped array is narrower than that corresponding to the circular, rectangular or spiral wave pattern, that is to say, under the same number of array elements, the cross-shaped array has a smaller High spatial resolution and better directivity. From the perspective of side lobe height, the width of the side lobe corresponding to the beam pattern of the cross-shaped array is higher than that corresponding to the circular, rectangular or spiral wave array, that is to say, the anti-interference ability of the cross-shaped array is poor . The arrangement of the first sub-microphone array and the second sub-microphone array can be adaptively adjusted according to actual application conditions, which is not further limited here. It should be noted that each short solid line in FIG. 9A and FIG. 9B can be regarded as a microphone or a group of microphones. In some embodiments, when each short solid line is a group of microphones, the number of each group of microphones can be the same or different, the types of each group of microphones can be the same or different, and the orientation of each group of microphones can be the same or different. The type, quantity, orientation and spacing can be adjusted adaptively according to actual application conditions. In some embodiments, the spatial super-resolution image of environmental noise can also be formed by methods such as synthetic aperture, sparse recovery, and mutual prime array, and the spatial super-resolution image can be used to reflect the signal reflection map of environmental noise , to further improve the localization accuracy of the spatial noise source. In some embodiments, the position, spacing, opening and closing status, etc. of the microphones in the microphone array (eg, the first microphone array 130 and the second microphone array 160 ) can be adjusted based on the feedback of the positioning accuracy of the spatial noise source.

在一些實施例中，第一麥克風陣列130可以包括一個雜訊麥克風，第一麥克風陣列130中的雜訊麥克風用於拾取使用者耳道處的空間雜訊，而雜訊麥克風拾取使用者耳道處的空間雜訊時，也會拾取揚聲器陣列150輸出的降雜訊波，該降雜訊波是不期望被雜訊麥克風拾取的。因此，可以將雜訊麥克風設置於揚聲器陣列150中形成的聲學偶極子的聲學零點處，以使得雜訊麥克風拾取的降雜訊波最小。在一些實施例中，至少一個揚聲器陣列150形成至少一組聲學偶極子，且雜訊麥克風位於偶極子輻射聲場的聲學零點處。在一些實施例中，揚聲器陣列150中的任意兩個揚聲器輸出的聲音信號可以看作是兩個向外輻射聲音的點聲源，其輻射聲音的幅值相同，相位相反。所述兩個揚聲器可以構成聲學偶極子或類似聲學偶極子，向外輻射聲音具有明顯的指向性，形成一個“8”字形聲音輻射區域。在所述兩個揚聲器連線所在的直線方向，揚聲器處輻射的聲音最大，其餘方向輻射聲音明顯減小，兩個揚聲器連線的中垂線處輻射的聲音最小。在一些實施例中，揚聲器陣列150中的一個揚聲器輸出的聲音信號也可以視為一個偶極子。例如，揚聲器陣列150中的一個揚聲器振膜正面和振膜背面輸出的一組相位近似相反、幅值近似相同的聲音信號可以視為兩個點聲源。In some embodiments, the first microphone array 130 may include a noise microphone, the noise microphone in the first microphone array 130 is used to pick up the spatial noise at the user's ear canal, and the noise microphone picks up the user's ear canal When there is spatial noise at the location, the noise reduction wave output by the speaker array 150 will also be picked up, and the noise reduction wave is not expected to be picked up by the noise microphone. Therefore, the noise microphone can be disposed at the acoustic null point of the acoustic dipole formed in the speaker array 150 , so that the noise wave picked up by the noise microphone is minimized. In some embodiments, at least one speaker array 150 forms at least one set of acoustic dipoles, and the noise microphone is located at the acoustic null of the dipole radiating sound field. In some embodiments, the sound signals output by any two speakers in the speaker array 150 can be regarded as two point sound sources that radiate sound outward, and the amplitude of the radiated sound is the same, but the phase is opposite. The two loudspeakers can form an acoustic dipole or similar to an acoustic dipole, and the outwardly radiated sound has obvious directivity, forming an "8"-shaped sound radiation area. In the straight line direction where the two speakers are connected, the sound radiated from the speaker is the largest, the sound radiated in other directions is obviously reduced, and the sound radiated at the perpendicular line between the two speakers is the smallest. In some embodiments, the sound signal output by one speaker in the speaker array 150 can also be regarded as a dipole. For example, a set of sound signals with approximately opposite phases and approximately the same amplitude output from the front and back of the diaphragm of a speaker in the speaker array 150 can be regarded as two point sound sources.

在一些實施例中，還可以通過演算法獲取麥克風陣列（例如，第一麥克風陣列130）在聲學零點位置處拾取的環境雜訊信號。例如，在一些實施例中，可以預先將第一麥克風陣列130中的一個或多個麥克風設置於特定頻帶的揚聲器陣列150構成的聲學偶極子的聲學零點位置。特定頻帶可以是對語音可懂度起關鍵作用的頻帶，例如，500 Hz-1500 Hz。在一些實施例中，信號處理器140根據聲學偶極子位置（即，構成聲學偶極子的兩個揚聲器的位置）和聲傳遞函數，計算特定頻帶的補償參數並預先儲存。信號處理器140可以根據預先儲存的補償參數對第一麥克風陣列130中的其餘麥克風（即，未設置在聲學零點位置的麥克風）拾取的環境雜訊基進行幅值補償和/或相位補償，補償後的環境雜訊信號等效於設置在聲學零點位置雜訊麥克風拾取的環境雜訊信號。需要注意的是，麥克風陣列（例如，第一麥克風陣列130）中的麥克風還可以不設置於揚聲器陣列150構成的聲學偶極子的聲學零點處，例如，在一些實施例中，信號處理器140可以根據不同類型的雜訊在不同維度（例如，空域、時域、頻域等）的統計分佈和結構化特徵將麥克風陣列拾取的第一空間位置的雜訊進行信號分離提取，從而獲取不同類型（例如不同頻率、不同相位等）的雜訊，並通過信號處理器將麥克風陣列拾取的揚聲器陣列150發出的降雜訊波進行消除。In some embodiments, the environmental noise signal picked up by the microphone array (for example, the first microphone array 130 ) at the position of the acoustic null point can also be acquired through an algorithm. For example, in some embodiments, one or more microphones in the first microphone array 130 may be pre-set at the acoustic zero point position of the acoustic dipole formed by the speaker array 150 of a specific frequency band. The specific frequency band may be a frequency band that plays a key role in speech intelligibility, for example, 500 Hz-1500 Hz. In some embodiments, the signal processor 140 calculates and pre-stores the compensation parameters for a specific frequency band according to the acoustic dipole position (ie, the positions of the two loudspeakers forming the acoustic dipole) and the acoustic transfer function. The signal processor 140 can perform amplitude compensation and/or phase compensation on the environmental noise base picked up by the remaining microphones in the first microphone array 130 (that is, the microphones not set at the acoustic zero position) according to the pre-stored compensation parameters. The final environmental noise signal is equivalent to the environmental noise signal picked up by the noise microphone set at the acoustic zero position. It should be noted that the microphones in the microphone array (for example, the first microphone array 130) may not be arranged at the acoustic zero point of the acoustic dipole formed by the speaker array 150, for example, in some embodiments, the signal processor 140 may According to the statistical distribution and structural characteristics of different types of noise in different dimensions (for example, space domain, time domain, frequency domain, etc.), the noise at the first spatial position picked up by the microphone array is separated and extracted to obtain different types ( For example, noises of different frequencies, different phases, etc.) are eliminated by the signal processor to eliminate the noise-reducing waves emitted by the speaker array 150 picked up by the microphone array.

圖10是根據本發明說明書一些實施例提供的估計第一空間位置的雜訊的示例性流程圖。如圖10所示，流程1000可以包括：FIG. 10 is an exemplary flow chart of estimating noise at a first spatial location according to some embodiments of the present invention. As shown in Figure 10, the process 1000 may include:

在步驟1010中，從拾取的環境雜訊中去除與骨傳導麥克風拾取的信號相關聯的成分，以便更新環境雜訊。In step 1010, components associated with the signal picked up by the bone conduction microphone are removed from the picked up ambient noise, so as to update the ambient noise.

在一些實施例中，該步驟可以由信號處理器140執行。在一些實施例中，麥克風陣列（例如，第一麥克風陣列130、第二麥克風陣列160）在拾取環境雜訊時，使用者自身的說話聲音也會被麥克風陣列拾取，即，使用者自身說話的聲音也被視為環境雜訊的一部分。這種情況下，揚聲器陣列150輸出的降雜訊波會將使用者自身說話的聲音抵消。在一些實施例中，特定場景下，使用者自身說話的聲音需要被保留，例如，使用者進行語音通話、發送語音訊息等場景中。在一些實施例中，使用者佩戴開放式耳機100進行語音通話或錄製語音資訊時，骨傳導麥克風可以通過拾取使用者說話時面部骨骼或肌肉產生的振動信號來拾取使用者說話的聲音信號，並傳遞至信號處理器140。信號處理器140獲取來自骨傳導麥克風拾取的聲音信號的參數資訊，信號處理器140從麥克風陣列（例如，第一麥克風陣列130、第二麥克風陣列160）拾取的環境雜訊中找到並去除與骨傳導麥克風拾取的聲音信號相關聯的聲音信號成分。信號處理器140根據剩餘的麥克風陣列拾取的環境雜訊的參數資訊更新環境雜訊。更新後的環境雜訊中不再包含使用者自身說話的聲音信號，即在使用者進行語音通話時保留了使用者自身說話的聲音信號。In some embodiments, this step may be performed by the signal processor 140 . In some embodiments, when the microphone array (for example, the first microphone array 130 and the second microphone array 160) picks up environmental noise, the user's own voice will also be picked up by the microphone array, that is, the user's own voice Sound is also considered part of environmental noise. In this case, the noise reduction wave output by the speaker array 150 will cancel the voice of the user himself. In some embodiments, in certain scenarios, the voice of the user's own speech needs to be preserved, for example, in scenarios where the user makes a voice call or sends a voice message. In some embodiments, when the user wears the open earphone 100 to make a voice call or record voice information, the bone conduction microphone can pick up the sound signal of the user's speech by picking up the vibration signal generated by the facial bones or muscles when the user speaks, and passed to the signal processor 140. The signal processor 140 obtains the parameter information of the sound signal picked up by the bone conduction microphone, and the signal processor 140 finds and removes bone-related noise from the environmental noise picked up by the microphone array (for example, the first microphone array 130, the second microphone array 160). The sound signal component associated with the sound signal picked up by a conduction microphone. The signal processor 140 updates the ambient noise according to the parameter information of the ambient noise picked up by the remaining microphone arrays. The updated environmental noise no longer includes the voice signal of the user's own speech, that is, the voice signal of the user's own speech is retained when the user makes a voice call.

在步驟1020中，根據更新後的環境雜訊估計第一空間位置的雜訊。In step 1020, the noise at the first spatial location is estimated according to the updated environmental noise.

在一些實施例中，該步驟可以由信號處理器140執行。在一些實施例中，信號處理器140可以根據更新後的環境雜訊估計第一空間位置的雜訊。關於根據環境雜訊估計第一空間位置的雜訊的詳細描述可以參考本說明書圖2及其相關描述，在此不做贅述。In some embodiments, this step may be performed by the signal processor 140 . In some embodiments, the signal processor 140 may estimate the noise at the first spatial location according to the updated ambient noise. For a detailed description of estimating the noise of the first spatial position according to the environmental noise, reference may be made to FIG. 2 and related descriptions in this specification, and details are not repeated here.

應當注意的是，上述有關流程1000的描述僅僅是為了示例和說明，而不限定本說明書的適用範圍。對於本領域技術人員來說，在本說明書的指導下可以對流程1000進行各種修正和改變。例如，還可以對骨傳導麥克風拾取的信號相關聯的成分進行預處理，並將骨傳導麥克風拾取的信號作為音訊信號傳輸至終端設備。然而，這些修正和改變仍在本說明書的範圍之內。It should be noted that the above description about the process 1000 is only for illustration and description, and does not limit the scope of application of this description. For those skilled in the art, various modifications and changes can be made to the process 1000 under the guidance of this description. For example, it is also possible to preprocess components associated with the signal picked up by the bone conduction microphone, and transmit the signal picked up by the bone conduction microphone to the terminal device as an audio signal. However, such modifications and changes are still within the scope of this specification.

在一些實施例中，至少一個麥克風陣列可以包括骨傳導麥克風和氣傳導麥克風，信號處理器140可以基於開放式耳機100的工作狀態控制骨傳導麥克風和氣傳導麥克風的開關狀態。在一些實施例中，開放式耳機100的工作狀態可以是指使用者佩戴開放式耳機100時所使用的用途狀態。在一些實施例中，開放式耳機100的工作狀態可以包括但不限於音樂播放狀態、語音通話狀態、語音發送狀態等。在一些實施例中，麥克風陣列拾取環境雜訊時，麥克風陣列中的骨傳導麥克風的開關狀態和氣傳導麥克風的開關狀態可以根據開放式耳機100的工作狀態決定。例如，使用者佩戴開放式耳機100進行音樂播放時，骨傳導麥克風的開關狀態可以為待機狀態，氣傳導麥克風的開關狀態可以為工作狀態。又例如，使用者佩戴開放式耳機100進行語音發送時，骨傳導麥克風的開關狀態可以為工作狀態，氣傳導麥克風的開關狀態可以為工作狀態。在一些實施例中，信號處理器140耦接麥克風陣列，信號處理器140可以通過發送控制信號控制麥克風陣列中的麥克風（例如，骨傳導麥克風、氣傳導麥克風）的開關狀態。In some embodiments, at least one microphone array may include a bone conduction microphone and an air conduction microphone, and the signal processor 140 may control the on/off status of the bone conduction microphone and the air conduction microphone based on the working state of the open earphone 100 . In some embodiments, the working state of the open-style earphone 100 may refer to the use state used by the user when wearing the open-style earphone 100 . In some embodiments, the working state of the open earphone 100 may include, but not limited to, a music playing state, a voice call state, a voice sending state, and the like. In some embodiments, when the microphone array picks up environmental noise, the on/off state of the bone conduction microphone and the air conduction microphone in the microphone array can be determined according to the working state of the open earphone 100 . For example, when the user wears the open earphone 100 to play music, the on/off state of the bone conduction microphone may be in the standby state, and the on/off state of the air conduction microphone may be in the working state. For another example, when the user wears the open earphone 100 for voice transmission, the on-off state of the bone conduction microphone may be the working state, and the on-off state of the air conduction microphone may be the working state. In some embodiments, the signal processor 140 is coupled to the microphone array, and the signal processor 140 can control the switch state of the microphones (eg, bone conduction microphone, air conduction microphone) in the microphone array by sending a control signal.

在一些實施例中，開放式耳機100的工作狀態可以包括通話狀態和未通話狀態。在一些實施例中，開放式耳機100的工作狀態為未通話狀態時，信號處理器140可以控制骨傳導麥克風為待機狀態。例如，開放式耳機100在未通話狀態下，使用者自身說話的聲音信號可以視為環境雜訊，這種情況下，麥克風陣列拾取的環境雜訊中包括的使用者自身說話的聲音信號可以不被濾除，從而使得使用者自身說話的聲音信號也可以與揚聲器陣列150輸出的降雜訊波相抵消。In some embodiments, the working state of the open earphone 100 may include a talking state and a non-talking state. In some embodiments, when the working state of the open earphone 100 is not in a call state, the signal processor 140 may control the bone conduction microphone to be in a standby state. For example, when the open earphone 100 is not in a call state, the voice signal of the user's own speech can be regarded as environmental noise. In this case, the voice signal of the user's own speech included in the environmental noise picked up by the microphone array may not is filtered out, so that the voice signal of the user's own speech can also be canceled out by the noise reduction wave output by the speaker array 150 .

在一些實施例中，開放式耳機100的工作狀態為通話狀態時，信號處理器140可以控制骨傳導麥克風為工作狀態。例如，開放式耳機100在通話狀態下，使用者自身說話的聲音信號需要保留，這種情況下，信號處理器140可以發送控制信號控制骨傳導麥克風為工作狀態，骨傳導麥克風拾取使用者說話的聲音信號，信號處理器140從麥克風陣列拾取的環境雜訊中找到並去除與骨傳導麥克風拾取的聲音信號相關聯的聲音信號成分，以使使用者自身說話的聲音信號不與揚聲器陣列150輸出的降雜訊波相抵消，從而保證使用者正常的通話狀態。In some embodiments, when the working state of the open earphone 100 is the talking state, the signal processor 140 may control the bone conduction microphone to be in the working state. For example, when the open earphone 100 is in a call state, the voice signal of the user's own speech needs to be preserved. In this case, the signal processor 140 can send a control signal to control the bone conduction microphone to be in the working state, and the bone conduction microphone picks up the user's voice. sound signal, the signal processor 140 finds and removes the sound signal components associated with the sound signal picked up by the bone conduction microphone from the environmental noise picked up by the microphone array, so that the sound signal of the user's own speech does not differ from the sound signal output by the speaker array 150 The clutter-reducing waves cancel each other out, thus ensuring the normal communication status of the user.

在一些實施例中，開放式耳機100的工作狀態為通話狀態時，若環境雜訊的聲壓級大於預設閾值時，信號處理器140可以控制骨傳導麥克風保持工作狀態。在一些實施例中，環境雜訊的聲壓級可以反映環境雜訊的強度。這裡的預設閾值可以是預先儲存在開放式耳機100中的數值，例如，50 dB、60 dB或70 dB等其它任意數值。在一些實施例中，當環境雜訊的聲壓級大於預設閾值時，環境雜訊會影響使用者的通話品質。信號處理器140可以通過發送控制信號控制骨傳導麥克風保持工作狀態，骨傳導麥克風可以獲取使用者講話時的面部肌肉的振動信號，而基本不會拾取外部環境雜訊，此時將骨傳導麥克風拾取的振動信號作為通話時的語音信號，從而保證使用者的正常通話。In some embodiments, when the working state of the open earphone 100 is the talking state, if the sound pressure level of the ambient noise is greater than a preset threshold, the signal processor 140 may control the bone conduction microphone to keep the working state. In some embodiments, the sound pressure level of the environmental noise can reflect the intensity of the environmental noise. The preset threshold here may be a value pre-stored in the open earphone 100, for example, 50 dB, 60 dB or 70 dB and other arbitrary values. In some embodiments, when the sound pressure level of the environmental noise is greater than a preset threshold, the environmental noise will affect the user's call quality. The signal processor 140 can control the bone conduction microphone to keep working by sending a control signal. The bone conduction microphone can obtain the vibration signal of the facial muscles of the user when speaking, and basically will not pick up external environmental noise. At this time, the bone conduction microphone will pick up The vibration signal is used as the voice signal during the call, so as to ensure the normal call of the user.

在一些實施例中，開放式耳機100的工作狀態為通話狀態時，若環境雜訊的聲壓級小於預設閾值時，信號處理器140可以控制骨傳導麥克風由工作狀態切換至待機狀態。在一些實施例中，當環境雜訊的聲壓級小於預設閾值時，環境雜訊的聲壓級相對於使用者說話產生的聲音信號的聲壓級較小，使用者說話產生的聲音信號被揚聲器陣列150輸出的降雜訊波抵消一部分後，剩餘的使用者說話產生的聲音信號仍然可以達到通話標準，足以保證使用者的正常通話。這種情況下，信號處理器140可以通過發送控制信號控制骨傳導麥克風由工作狀態切換至待機狀態，進而降低信號處理複雜度且開放式耳機100的功率損耗。In some embodiments, when the working state of the open earphone 100 is the talking state, if the sound pressure level of the ambient noise is lower than the preset threshold, the signal processor 140 can control the bone conduction microphone to switch from the working state to the standby state. In some embodiments, when the sound pressure level of the environmental noise is less than the preset threshold, the sound pressure level of the environmental noise is smaller than the sound pressure level of the sound signal generated by the user's speech, and the sound signal generated by the user's speech After being partially offset by the noise reduction wave output by the loudspeaker array 150, the remaining sound signals generated by the user's speech can still meet the call standard, which is enough to ensure the user's normal call. In this case, the signal processor 140 can control the bone conduction microphone to switch from the working state to the standby state by sending a control signal, thereby reducing signal processing complexity and power consumption of the open earphone 100 .

在一些實施例中，開放式耳機100還可以包括用於調整降雜訊波聲壓級的調節模組。在一些實施例中，調節模組可以包括按鈕、語音助手、手勢感測器等。使用者通過控制調節模組可以調整開放式耳機100的降噪模式。具體地，使用者通過控制調節模組可以調整（例如，放大或縮小）降噪信號的幅值資訊，以改變揚聲器陣列發出的降雜訊波的聲壓級，進而達到不同的降噪效果。僅作為示例性說明，在一些實施例中，降噪模式可以包括強降噪模式、中級降噪模式、弱降噪模式等。例如，使用者在室內佩戴開放式耳機100時，外界環境雜訊較小，使用者可以通過調節模組將開放式耳機的降噪模式關閉或調整為弱降噪模式。又例如，當使用者在街邊等公共場合行走時佩戴開放式耳機100，使用者需要在收聽音訊信號（例如，音樂、語音資訊）的同時，保持對周圍環境的一定感知能力，以應對突發狀況，此時使用者可以通過調節模組（例如，按鈕或語音助手）選擇中級降噪模式，以保留周圍環境雜訊（如警報聲、撞擊聲、汽車鳴笛聲等）。再例如，使用者在乘坐地鐵或飛機等交通工具時，使用者可以通過調節模組選擇強降噪模式，以進一步降低周圍環境雜訊。在一些實施例中，信號處理器140還可以基於環境雜訊強度範圍向開放式耳機100或與開放式耳機100通信連接的終端設備（例如，手機、智慧手錶等）發出提示資訊，以提醒使用者調整降噪模式。In some embodiments, the open-back earphone 100 may further include an adjustment module for adjusting the sound pressure level of the noise reduction wave. In some embodiments, the adjustment module may include buttons, voice assistants, gesture sensors, and the like. The user can adjust the noise reduction mode of the open earphone 100 by controlling the adjustment module. Specifically, the user can adjust (for example, amplify or reduce) the amplitude information of the noise reduction signal by controlling the adjustment module, so as to change the sound pressure level of the noise reduction wave emitted by the speaker array, thereby achieving different noise reduction effects. As an example only, in some embodiments, the noise reduction mode may include a strong noise reduction mode, a medium noise reduction mode, a weak noise reduction mode, and the like. For example, when the user wears the open-type earphone 100 indoors, the noise of the external environment is small, and the user can turn off the noise reduction mode of the open-type earphone or adjust it to a weak noise reduction mode through the adjustment module. For another example, when the user wears the open earphone 100 while walking in a public place such as the street, the user needs to maintain a certain awareness of the surrounding environment while listening to audio signals (for example, music, voice information) to cope with sudden At this time, the user can select a medium-level noise reduction mode by adjusting the module (for example, a button or a voice assistant) to preserve the surrounding noise (such as sirens, crashes, car horns, etc.). For another example, when a user is taking a subway or an airplane, the user can select a strong noise reduction mode by adjusting the module to further reduce the noise of the surrounding environment. In some embodiments, the signal processor 140 can also send prompt information to the open earphone 100 or a terminal device (such as a mobile phone, a smart watch, etc.) or adjust the noise canceling mode.

上文已對基本概念做了描述，顯然，對於所屬技術領域中具有通常知識者來說，上述詳細揭露內容僅僅作為示例，而並不構成對本發明的限定。雖然此處並沒有明確說明，所屬技術領域中具有通常知識者可以對本發明進行各種修改、改進和修正。該類修改、改進和修正在本發明中被建議，所以該類修改、改進、修正仍屬於本發明示範實施例的精神和範圍。The basic concept has been described above, obviously, for those with ordinary knowledge in the technical field, the above detailed disclosure is only an example, and does not constitute a limitation to the present invention. Although not explicitly described herein, various modifications, improvements and amendments to the present invention can be made by those skilled in the art. Such modifications, improvements and corrections are suggested in the present invention, so such modifications, improvements and corrections still belong to the spirit and scope of the exemplary embodiments of the present invention.

同時，本發明使用了特定詞語來描述本發明的實施例。如“一個實施例”、“一實施例”、和/或“一些實施例”意指與本發明至少一個實施例相關的某一特徵、結構或特點。因此，應強調並注意的是，本說明書中在不同位置兩次或多次提及的“一實施例”或“一個實施例”或“一個替代性實施例”並不一定是指同一實施例。此外，本發明的一個或多個實施例中的某些特徵、結構或特點可以進行適當的組合。Meanwhile, the present invention uses specific words to describe the embodiments of the present invention. For example, "one embodiment", "an embodiment", and/or "some embodiments" means a certain feature, structure or characteristic related to at least one embodiment of the present invention. Therefore, it should be emphasized and noted that two or more references to "an embodiment" or "an embodiment" or "an alternative embodiment" in different places in this specification do not necessarily refer to the same embodiment . In addition, certain features, structures or characteristics of one or more embodiments of the present invention may be properly combined.

此外，所屬技術領域中具有通常知識者可以理解，本發明的各方面可以通過若干具有可專利性的種類或情況進行說明和描述，包括任何新的和有用的工序、機器、產品或物質的組合，或對他們的任何新的和有用的改進。相應地，本發明的各個方面可以完全由硬體執行、可以完全由軟體（包括韌體、常駐軟體、微碼等）執行、也可以由硬體和軟體組合執行。以上硬體或軟體均可被稱為“資料塊”、“模組”、“引擎”、“單元”、“元件”或“系統”。此外，本發明的各方面可能表現為位於一個或多個電腦可讀取媒體中的電腦產品，該產品包括電腦可讀取程式碼。Furthermore, it will be appreciated by those skilled in the art that various aspects of the invention may be illustrated and described in several patentable varieties or situations, including any new and useful process, machine, product or combination of matter , or any new and useful improvements to them. Correspondingly, various aspects of the present invention may be entirely executed by hardware, may be entirely executed by software (including firmware, resident software, microcode, etc.), or may be executed by a combination of hardware and software. The above hardware or software may be referred to as "data block", "module", "engine", "unit", "component" or "system". Additionally, aspects of the present invention may be embodied as a computer product comprising computer readable program code on one or more computer readable media.

電腦儲存媒體可能包含一個內含有電腦程式碼的傳播資料信號，例如在基帶上或作為載波的一部分。該傳播信號可能有多種表現形式，包括電磁形式、光形式等，或合適的組合形式。電腦儲存媒體可以是除電腦可讀取儲存媒體之外的任何電腦可讀取媒體，該媒體可以通過連接至一個指令執行系統、裝置或設備以實現通訊、傳播或傳輸供使用的程式。位於電腦儲存媒體上的程式編碼可以通過任何合適的媒體進行傳播，包括無線電、電纜、光纖電纜、RF、或類似媒體，或任何上述媒體的組合。A computer storage medium may contain a propagated data signal containing computer program code, for example in baseband or as part of a carrier wave. The propagated signal may have various manifestations, including electromagnetic form, optical form, etc., or a suitable combination. A computer storage medium may be any computer-readable medium, other than a computer-readable storage medium, that can communicate, broadcast, or transfer programs for use by being connected to an instruction execution system, device, or device. Program code located on computer storage media may be transmitted over any suitable medium, including radio, electric cable, fiber optic cable, RF, or the like, or any combination of the foregoing.

此外，除非申請專利範圍中明確說明，本發明所述處理元素和序列的順序、數字字母的使用、或其他名稱的使用，並非用於限定本發明流程和方法的順序。儘管上述揭露內容中通過各種示例討論了一些目前認為有用的發明實施例，但應當理解的是，該類細節僅起到說明的目的，附加的申請專利範圍並不僅限於揭露的實施例，相反地，申請專利範圍旨在覆蓋所有符合本發明實施例實質和範圍的修正和均等的組合。例如，雖然以上所描述的系統元件可以通過硬體設備實現，但是也可以只通過軟體的解決方案得以實現，如在現有的伺服器或行動設備上安裝所描述的系統。In addition, unless clearly stated in the scope of the patent application, the sequence of processing elements and sequences, the use of numbers and letters, or the use of other names in the present invention are not used to limit the sequence of the process and methods of the present invention. While the foregoing disclosure discusses, by way of various examples, some embodiments of the invention presently believed to be useful, it should be understood that such details are for illustrative purposes only and that the appended claims are not limited to the disclosed embodiments, but rather , the scope of the patent application is intended to cover all modifications and equal combinations that conform to the essence and scope of the embodiments of the present invention. For example, although the above-described system components can be implemented by hardware devices, they can also be implemented by only software solutions, such as installing the described system on an existing server or mobile device.

同理，應當注意的是，為了簡化本發明揭露內容的表述，從而幫助對一個或多個發明實施例的理解，前文對本發明實施例的描述中，有時會將多種特徵歸併至一個實施例、附圖或對其的描述中。但是，這種揭露方法並不意味著本發明物件所需要的特徵比申請專利範圍中提及的特徵多。實際上，實施例的特徵要少於上述揭露的單個實施例的全部特徵。In the same way, it should be noted that in order to simplify the expression of the disclosure of the present invention and help the understanding of one or more embodiments of the invention, in the foregoing description of the embodiments of the present invention, sometimes multiple features are combined into one embodiment , drawings or descriptions thereof. However, this method of disclosure does not imply that the object of the invention requires more features than those mentioned in the claims. Indeed, embodiment features are less than all features of a single foregoing disclosed embodiment.

一些實施例中使用了描述成分、屬性數量的數字，應當理解的是，此類用於實施例描述的數字，在一些示例中使用了修飾詞“大約”、“近似”或“大體上”來修飾。除非另外說明，“大約”、“近似”或“大體上”表明所述數字允許有±20%的變化。相應地，在一些實施例中，說明書和申請專利範圍中使用的數值參數均為近似值，該近似值根據個別實施例所需特點可以發生改變。在一些實施例中，數值參數應考慮規定的有效位數並採用一般位數保留的方法。儘管本發明一些實施例中用於確認其範圍廣度的數值域和參數為近似值，在具體實施例中，此類數值的設定在可行範圍內盡可能精確。In some embodiments, numbers describing the quantity of components and attributes are used. It should be understood that such numbers used in the description of the embodiments use the modifiers "about", "approximately" or "substantially" in some examples. grooming. Unless otherwise stated, "about", "approximately" or "substantially" indicates that the stated figure allows for a variation of ±20%. Accordingly, in some embodiments, the numerical parameters used in the specification and claims are approximations that may vary depending upon the desired characteristics of individual embodiments. In some embodiments, numerical parameters should take into account the specified number of significant digits and adopt the general digit reservation method. Although the numerical ranges and parameters used to demonstrate the breadth of scope in some embodiments of the invention are approximations, in specific embodiments such numerical values are set as precisely as practicable.

針對本案引用的每個專利、專利申請案、專利申請案的公開本和其他材料，如文章、書籍、說明書、出版物、文件等，特此將其全部內容併入本發明作為參考。與本發明內容不一致或產生衝突的申請歷史文件除外，對本發明申請專利範圍最廣範圍有限制的文件（當前或之後附加於本申請案中的）也除外。需要說明的是，如果本申請案附屬材料中的描述、定義、和/或術語的使用與本申請案所述內容有不一致或衝突的地方，以本申請案的描述、定義和/或術語的使用為準。Every patent, patent application, publication of a patent application, and other materials, such as articles, books, specifications, publications, documents, etc., cited in this case are hereby incorporated by reference in their entirety. Application history documents inconsistent with or conflicting with the content of the present invention are excluded, as well as documents (currently or later appended to this application) that limit the broadest scope of the claimed invention. It should be noted that if there is any inconsistency or conflict between the descriptions, definitions, and/or terms used in the attached materials of the application and the contents of the application, the descriptions, definitions and/or terms of the application shall be Use shall prevail.

最後，應當理解的是，本發明中所述實施例僅用以說明本發明實施例的原則。其他的變形也可能屬於本發明的範圍。因此，作為示例而非限制，本發明實施例的替代配置可視為與本發明的教導一致。相應地，本發明的實施例不僅限於本發明明確介紹和描述的實施例。Finally, it should be understood that the embodiments described in the present invention are only used to illustrate the principles of the embodiments of the present invention. Other modifications are also possible within the scope of the present invention. Accordingly, by way of illustration and not limitation, alternative configurations of the embodiments of the present invention may be considered consistent with the teachings of the present invention. Accordingly, the embodiments of the present invention are not limited to the embodiments of the present invention that are explicitly shown and described.

100:開放式耳機 110:殼體結構 120:固定結構 130:第一麥克風陣列 140:信號處理器 150:揚聲器陣列 160:第二麥克風陣列 200:流程 210:步驟 220:步驟 230:步驟 300:流程 310:步驟 320:步驟 500:流程 510:步驟 520:步驟 600:流程 610:步驟 620:步驟 700:流程 710:步驟 720:步驟 911:第一子麥克風陣列 912:第二子麥克風陣列 921:第一子麥克風陣列 922:第二子麥克風陣列 1000:流程 1010:步驟 1020:步驟 100: Open-back headphones 110: shell structure 120: fixed structure 130: the first microphone array 140: signal processor 150: speaker array 160: second microphone array 200: Process 210: step 220: step 230: step 300: Process 310: step 320: Step 500: Process 510: step 520: step 600: process 610: Step 620: Step 700: process 710: Step 720: step 911: the first sub-microphone array 912: the second sub-microphone array 921: the first sub-microphone array 922: the second sub-microphone array 1000: process 1010: step 1020: Steps

本發明將以示例性實施例的方式進一步說明，這些示例性實施例將通過圖式進行詳細描述。這些實施例並非限制性的，在這些實施例中，相同的元件符號表示相同的結構，其中：The invention will be further illustrated by way of exemplary embodiments which will be described in detail by means of the drawings. These embodiments are not limiting, and in these embodiments, the same reference numerals represent the same structure, wherein:

[圖1]係根據本發明的一些實施例提供的開放式耳機的示例性方塊圖；[ FIG. 1 ] is an exemplary block diagram of an open earphone provided according to some embodiments of the present invention;

[圖2]係根據本發明的一些實施例提供的開放式耳機的示例性原理流程圖；[Fig. 2] is an exemplary schematic flow chart of an open earphone provided according to some embodiments of the present invention;

[圖3]係根據本發明一些實施例提供的更新降噪信號的示例性流程圖；[Fig. 3] is an exemplary flow chart of updating the noise reduction signal according to some embodiments of the present invention;

[圖4A]係根據本發明說明書一些實施例提供的第一麥克風陣列和第二麥克風陣列的排布方式和位置關係的示例性分佈圖；[FIG. 4A] is an exemplary distribution diagram of the arrangement and positional relationship between the first microphone array and the second microphone array provided according to some embodiments of the specification of the present invention;

[圖4B]係根據本發明說明書另一些實施例提供的第一麥克風陣列和第二麥克風陣列的排布方式的示例性分佈圖；[FIG. 4B] is an exemplary distribution diagram of the arrangement of the first microphone array and the second microphone array provided according to other embodiments of the specification of the present invention;

[圖5]係根據本發明說明書一些實施例提供的估計第一空間位置的雜訊的示例性流程圖；[Fig. 5] is an exemplary flow chart for estimating noise at a first spatial position according to some embodiments of the present invention;

[圖6]係根據本發明說明書一些實施例提供的確定空間噪音源的示例性流程圖；[FIG. 6] is an exemplary flow chart for determining a spatial noise source provided according to some embodiments of the specification of the present invention;

[圖7]係根據本發明說明書一些實施例提供的確定空間噪音源的另一示例性流程圖；[Fig. 7] is another exemplary flow chart for determining a spatial noise source according to some embodiments of the present invention;

[圖8A]係根據本發明說明書一些實施例提供的第一子麥克風陣列的排布方式的示意圖；[Fig. 8A] is a schematic diagram of the arrangement of the first sub-microphone array provided according to some embodiments of the specification of the present invention;

[圖8B]係根據本發明說明書另一些實施例提供的第一子麥克風陣列的排布方式的示意圖；[FIG. 8B] is a schematic diagram of the arrangement of the first sub-microphone array provided according to other embodiments of the specification of the present invention;

[圖8C]係根據本發明說明書另一些實施例提供的第一子麥克風陣列的排布方式的示意圖；[FIG. 8C] is a schematic diagram of the arrangement of the first sub-microphone array provided according to other embodiments of the specification of the present invention;

[圖8D]係根據本發明說明書另一些實施例提供的第一子麥克風陣列的排布方式的示意圖；[FIG. 8D] is a schematic diagram of the arrangement of the first sub-microphone array provided according to other embodiments of the specification of the present invention;

[圖9A]係根據本發明說明書一些實施例提供的第一子麥克風陣列和第二子麥克風陣列的位置關係的示意圖；[FIG. 9A] is a schematic diagram of the positional relationship between the first sub-microphone array and the second sub-microphone array provided according to some embodiments of the specification of the present invention;

[圖9B]係根據本發明說明書另一些實施例提供的第一子麥克風陣列和第二子麥克風陣列的位置關係的示意圖；[Fig. 9B] is a schematic diagram of the positional relationship between the first sub-microphone array and the second sub-microphone array according to other embodiments of the present invention;

[圖10]係根據本發明說明書一些實施例提供的估計第一空間位置的雜訊的示例性流程圖。[ FIG. 10 ] is an exemplary flow chart of estimating noise at a first spatial location according to some embodiments of the present invention.

200:流程 200: Process

210:步驟 210: step

220:步驟 220: step

230:步驟 230: step

Claims

An open earphone comprising: a fixing structure configured to fix the earphone near the user's ear and not block the user's ear canal; Housing structure, configured to carry: a first microphone array configured to pick up environmental noise; at least one loudspeaker array; and A signal handler, configured as: estimating noise at a first spatial location closer to the user's ear canal than any microphone in the first microphone array based on the picked-up environmental noise; and A noise reduction signal is generated based on the noise at the first spatial position, so that the at least one speaker array outputs a noise reduction wave according to the noise reduction signal, and the noise reduction wave is used to eliminate transmission to the user's ear. The environmental noise of the road.

Such as the open earphone of claim 1, wherein, The housing structure is configured to house a second microphone array configured to pick up the environmental noise and the noise reduction wave, the second microphone array being at least partially distinct from the first microphone array a microphone array; and The signal processor is configured to update the noise reduction signal based on sound signals picked up by the second microphone array.

The open earphone according to claim 2, wherein updating the noise reduction signal based on the sound signal picked up by the second microphone array includes: estimating a sound field at the ear canal of the user based on the sound signal picked up by the second microphone array; and The parameter information of the noise reduction signal is adjusted according to the sound field at the ear canal of the user.

The open earphone according to claim 2, wherein the second microphone array includes a microphone closer to the user's ear canal than any microphone in the first microphone array.

The open earphone according to claim 1, wherein the signal processor estimating the noise at the first spatial position based on the picked-up environmental noise includes: Signal separation is performed according to the picked-up environmental noise, parameter information corresponding to the environmental noise is obtained, and the noise reduction signal is generated based on the parameter information.

The open earphone according to claim 1, wherein the signal processor estimating the noise at the first spatial position based on the picked-up environmental noise includes: determining one or more spatial noise sources associated with the picked-up ambient noise; and Noise at the first spatial location is estimated based on the spatial noise sources.

The open earphone according to claim 6, wherein determining one or more of the spatial noise sources related to the picked-up environmental noise comprises: dividing the picked-up environmental noise into a plurality of subbands, each subband corresponding to a different frequency range; and On at least one subband, the spatial noise source corresponding thereto is determined.

The open earphone according to claim 7, wherein the first microphone array includes a first sub-microphone array and a second sub-microphone array, and the first sub-microphone array and the second sub-microphone array are respectively located at the user’s At the left ear and the right ear, determining the spatial noise source corresponding to the at least one subband comprises: Obtaining a user head function, which reflects the reflection or absorption of sound by the user's head; and On the at least one sub-band, in combination with the environmental noise picked up by the first sub-microphone array, the environmental noise picked up by the second sub-microphone array, and the user head function, determine the corresponding to the spatial noise source.

The open earphone according to claim 1, wherein the first microphone array includes a noise microphone, the at least one loudspeaker array forms at least one set of acoustic dipoles, and the noise microphone is located at the side of the dipole radiation sound field Acoustic zero point.

The open earphone according to claim 1, wherein the at least one microphone array includes a bone conduction microphone, the bone conduction microphone is configured to pick up the voice of the user, and the signal processor is based on the picked up environmental noise Estimating noise at the first spatial location includes: removing components associated with the signal picked up by the bone conduction microphone from the picked up ambient noise to update the ambient noise; and Estimating noise at the first spatial location according to the updated environmental noise.

The open earphone according to claim 1, wherein the at least one microphone array includes a bone conduction microphone and an air conduction microphone, and the signal processor controls the power of the bone conduction microphone and the air conduction microphone based on the working state of the earphone switch status.