TWM626673U

TWM626673U - Device for determining direction of acoustic source

Info

Publication number: TWM626673U
Application number: TW110205169U
Authority: TW
Inventors: 簡達益; 郭佳瑜; 賴瀅如; 范翔富; 白春成
Original assignee: 宏茂光電股份有限公司
Priority date: 2021-05-07
Filing date: 2021-05-07
Publication date: 2022-05-11

Abstract

本創作係一種聲源定位裝置，用以偵測一原始聲源相對於所述聲源定位裝置之原始方位角，其係包含：一頂蓋，設有一個或多個空氣流通孔，且其分布長度有一入口口徑；一底座，設有一容置空間，並設計與該頂蓋緊密結合；一麥克風電路板，其上有複數個麥克風，能受空氣壓力變化產生聲電訊號，並與該一個或多個空氣流通孔距一垂直距離置於該容置空間中，其中，該垂直距離係大於該入口口徑至少三倍；以及一運算單元，置於該容置空間中與該麥克風電路板電氣連接，用以接收該等聲電訊號、偵測該等聲電訊號強度、並根據該等聲電訊號強度在該等麥克風方位角間的對稱中心輸出該原始方位角。其中，該等麥克風間最大距離係大於等於該垂直距離的1.3倍。 The present invention is a sound source localization device for detecting the original azimuth angle of an original sound source relative to the sound source localization device, which comprises: a top cover provided with one or more air circulation holes, and its The distribution length has an inlet diameter; a base is provided with an accommodating space and is designed to be closely combined with the top cover; a microphone circuit board has a plurality of microphones on it, which can generate acoustic and electrical signals under the change of air pressure, and communicate with the one or a plurality of air circulation holes are placed in the accommodating space at a vertical distance, wherein the vertical distance is at least three times greater than the diameter of the inlet; and an arithmetic unit is placed in the accommodating space and electrically connected to the microphone circuit board The connection is used for receiving the acoustic and electrical signals, detecting the intensity of the acoustic and electrical signals, and outputting the original azimuth at the center of symmetry between the azimuths of the microphones according to the intensity of the acoustic and electrical signals. Wherein, the maximum distance between the microphones is greater than or equal to 1.3 times the vertical distance.

Description

Sound source localization device

本創作係關於一種聲源定位裝置；特別關於一種利用都卜勒效應(Doppler effect)的聲源定位裝置，主要在其高適應性。 This creation relates to a sound source localization device; especially a sound source localization device utilizing the Doppler effect, mainly in its high adaptability.

隨著半導體產業的發展，麥克風(microphone)從以往使用磁力的動圈式麥克風(dynamic microphone/moving coil microphone)，到駐極體式麥克風(electret condenser microphone/ECM)，甚至是新興起的微機電式麥克風(MEMS microphone)，體積與耗電量都在一步步地降低。日漸縮小的麥克風機構，甚至得以與放大器或訊號處理器整合在同一塊電路板上，這都促使麥克風脫離被動的收音角色，踏入主動的操作介面。 With the development of the semiconductor industry, microphones have changed from dynamic microphone/moving coil microphone, which used magnetic force, to electret condenser microphone/ECM, and even the newly emerging micro-electromechanical type. Microphone (MEMS microphone), the volume and power consumption are decreasing step by step. The ever-shrinking microphone mechanism can even be integrated with the amplifier or signal processor on the same circuit board, which pushes the microphone to move away from the passive sound-receiving role and step into the active operating interface.

因此，不論是美國亞馬遜公司推出的Amazon Echo，或是谷歌公司推出的Google Home，都採用微機電麥克風組成的矩陣(array)，主動偵測說話者的方向，進而強化聲音指令的辨識。藉由聲波到達角(Angle of Arrive，AoA)在麥克風間形成相位差(phase)的原理，比對任兩麥克風產生的聲電訊號時間差，便能從最大時間差的兩麥克風連線得出說話者的方位角(azimuth)。 Therefore, whether it is the Amazon Echo launched by Amazon in the United States or the Google Home launched by Google, an array of micro-electromechanical microphones is used to actively detect the direction of the speaker, thereby enhancing the recognition of voice commands. According to the principle of the angle of arrival (AoA) of the sound wave to form a phase difference between the microphones, by comparing the time difference of the acoustic and electrical signals generated by any two microphones, the speaker can be obtained from the connection of the two microphones with the largest time difference. azimuth.

若以20℃時聲速34,300cm/s來計算，互延遲1ms代表34.3cm的聲程差Lm=Dm．cos(A)，其中，Dm為兩麥克風距離以及A為到達角(請參閱第1圖)。然而，像Amazon Echo或Google Home這類家用裝置最大直徑僅有7cm左右，就需要至少0.2ms精度的互延遲分析能力才得以找出說話者的方位角。 If the speed of sound at 20℃ is 34,300cm/s, the mutual delay of 1ms represents the sound path difference of 34.3cm, Lm=Dm. cos(A), where Dm is the distance between the two microphones and A is the angle of arrival (see Figure 1). However, home devices like the Amazon Echo or Google Home have a maximum diameter of only At about 7cm, a mutual delay analysis capability of at least 0.2ms accuracy is required to find the azimuth of the speaker.

更重要的是，為了基礎防塵防水，麥克風多置於裝置內部，用來收音的小孔遠小於人聲波長範圍10-110cm。對這類裝置而言，聲波係以新生點源的方式穿過收音的小孔，同時也失去原始聲源的傳播方向，因此無法再適用習知的單一聲源平面波模型。 More importantly, in order to be basically dustproof and waterproof, the microphones are mostly placed inside the device, and the small holes used for sound collection are much smaller than the human voice wavelength range of 10-110cm. For this type of device, the sound wave passes through the small hole of the sound receiver in the form of a new point source, and at the same time loses the propagation direction of the original sound source, so the conventional single sound source plane wave model can no longer be applied.

介紹一種聲源定位方法，由本創作之聲源定位裝置實施，至少藉一麥克風陣列相對於一入口之位置以及該麥克風陣列所產生之複數聲電訊號強度，決定一原始聲源相對於該入口之原始方位角，其係包含：決定該麥克風陣列之於該入口所在平面方向之一個或多個極角以及複數方位角；依所述複數方位角以及在一選定範圍內之所述極角從該複數聲電訊號強度決定一組強度分布；依該組強度分布決定一對稱中心；以及決定該原始方位角位於該對稱中心上偏向該組強度分布較弱之一側，其中，該選定範圍係自大於等於33°以及小於90°內選擇。 Introduce a sound source localization method, which is implemented by the sound source localization device of the present creation, at least by the position of a microphone array relative to an entrance and the intensity of a plurality of acoustic and electrical signals generated by the microphone array to determine the position of an original sound source relative to the entrance. The original azimuth, which includes: determining one or more polar angles and a complex azimuth of the microphone array in the plane direction of the entrance; from the complex azimuth and the polar angle within a selected range The intensity of the complex acoustic and electrical signals determines a set of intensity distributions; determines a center of symmetry according to the set of intensity distributions; and determines that the original azimuth is located on the center of symmetry and deviates from the weaker side of the set of intensity distributions, wherein the selected range is from Select within 33° or more and less than 90°.

本創作實際應用時，當該原始聲源相對於該入口之到達角無特殊限定時，該選定範圍係自大於等於53°以及小於90°內選擇；當該原始聲源相對於該入口之到達角

35°，該選定範圍係自大於等於50°以及小於90°內選擇，且隨著所述到達角愈趨向低角度，該選定範圍則愈趨向高角度；以及當該原始聲源相對於該入口之到達角

65°，該選定範圍係自大於等於33°以及小於等於63°選擇，且隨著所述到達角愈趨向高角度，該選定範圍則愈趨向低角度。 In the practical application of this creation, when the arrival angle of the original sound source relative to the entrance is not particularly limited, the selected range is selected from greater than or equal to 53° and less than 90°; when the original sound source arrives at the entrance relative to the entrance horn

35°, the selected range is selected from greater than or equal to 50° and less than 90°, and as the angle of arrival becomes lower, the selected range becomes higher; and when the original sound source is relative to the inlet angle of arrival

65°, the selected range is selected from greater than or equal to 33° and less than or equal to 63°, and as the angle of arrival tends to a higher angle, the selected range tends to a lower angle.

在其一實施例中，所述依該組強度分布決定一對稱中心之意義，係以該組強度分布為權重作加權平均數，並據此決定該對稱中心。 In one embodiment, the meaning of determining a center of symmetry according to the set of intensity distributions is to use the set of intensity distributions as weights to make a weighted average, and determine the center of symmetry accordingly.

在其一實施例中，所述依該組強度分布決定一對稱中心之意義，係包含：依該複數聲電訊號之雜訊位準(noise level)決定該組強度分布之一無聲區域；以及找出該無聲區域外緣兩端之該組強度分布比例，並據此決定該對稱中心。 In one embodiment, the meaning of determining a center of symmetry according to the set of intensity distributions includes: determining a silent region of the set of intensity distributions according to the noise level of the complex acoustic and electrical signals; and Find out the intensity distribution ratios at both ends of the outer edge of the silent area, and determine the center of symmetry accordingly.

在其一實施例中，所述依該組強度分布決定一對稱中心之意義，係包含：決定該組強度分布之二極值以及該二極值間低谷，其中，該二極值與該低谷落差係大於一閥值，反之，則判定該二極值決定失敗；於該二極值決定成功時，決定該對稱中心係位於該低谷內；以及於該二極值決定失敗時，依所述複數方位角以及在一終極範圍內之所述極角從該複數聲電訊號強度決定一組終極分布，並依該組終極分布決定一對稱中心，其中，該終極範圍內最小角係大於該選定範圍內最小角，以及該閥值係大於等於該二極值強度的五分之一。 In one embodiment, the meaning of determining a center of symmetry according to the set of intensity distributions includes: determining two extremes of the set of intensity distributions and a valley between the two extremes, wherein the two extremes and the valley The drop is greater than a threshold, otherwise, it is determined that the determination of the two extremes fails; when the determination of the two extremes is successful, it is determined that the symmetry center is located in the valley; and when the determination of the two extremes fails, according to the above The complex azimuth and the polar angle within a final range determine a set of final distributions from the complex acoustic and electrical signal strength, and a center of symmetry is determined according to the set of final distributions, wherein the minimum angle in the final range is greater than the selected The minimum angle in the range, and the threshold value is greater than or equal to one-fifth of the intensity of the diode value.

一種聲源定位裝置，用以偵測一原始聲源相對於所述聲源定位裝置之原始方位角，其係包含：一頂蓋，設有一個或多個空氣流通孔，且其分布長度有一入口口徑；一底座，設有一容置空間，並設計與該頂蓋緊密結合；一麥克風電路板，其上有複數個麥克風，能受空氣壓力變化產生聲電訊號，並與該一個或多個空氣流通孔距一垂直距離置於該容置空間中，其中，該垂直距離係大於該入口口徑至少三倍；以及一運算單元，置於該容置空間中與該麥克風電路板電氣連接，用以接收該等聲電訊號、偵測該等聲電訊號強度、並根據該等聲電訊號強度在該等麥克風方位角間的對稱中心輸出該原始方位角，其中，該等麥克風間最大距離係大於等於該垂直距離的1.3倍。為求全方位的該原始方位角精度，該等麥克風間最大距離係大於等於該垂直距離的2.38倍；或是為判斷來自正前方的該原始聲源，該等麥克風間最大距離係小於等於該垂直距離的3.92倍。 A sound source locating device for detecting the original azimuth angle of an original sound source relative to the sound source locating device, which comprises: a top cover, which is provided with one or more air circulation holes, and its distribution length has a The inlet diameter; a base with an accommodating space and designed to be closely combined with the top cover; a microphone circuit board with a plurality of microphones on it, which can be subjected to changes in air pressure to generate acoustic and electrical signals, and communicate with the one or more microphones The air circulation hole is placed in the accommodating space at a vertical distance, wherein the vertical distance is at least three times greater than the diameter of the inlet; and an arithmetic unit is placed in the accommodating space and electrically connected to the microphone circuit board, using to receive the acousto-electric signals, detect the intensity of the acousto-electric signals, and determine the azimuth angle between the microphones according to the intensity of the acousto-electric signals The center of symmetry outputs the original azimuth angle, wherein the maximum distance between the microphones is greater than or equal to 1.3 times the vertical distance. In order to obtain the original azimuth angle accuracy in all directions, the maximum distance between the microphones is greater than or equal to 2.38 times the vertical distance; or to determine the original sound source from directly in front, the maximum distance between the microphones is less than or equal to the vertical distance. 3.92 times the distance.

一種聲源定位裝置，用以偵測一原始聲源相對於所述聲源定位裝置之原始方位角，其係包含：一頂蓋，設有一滑動空腔，其內側布有滑動軌道；一入口部件，置於該滑動空腔中，能受手動或受電力致動於該滑動軌道上移動，並具有空氣流通的一個或多個貫穿道，其中，該一個或多個貫穿道分布長度有一入口口徑；一底座，設有一容置空間，並設計與該頂蓋緊密結合；一麥克風電路板，其上有複數個麥克風，能受空氣壓力變化產生聲電訊號，並與該一個或多個貫穿道距一垂直距離置於該容置空間中；以及一運算單元，置於該容置空間中與該麥克風電路板電氣連接，用以接收該等聲電訊號、偵測該等聲電訊號強度、並根據該等聲電訊號強度在該等麥克風方位角間的對稱中心輸出該原始方位角，其中，該入口部件於該滑動軌道上移動會造成該垂直距離增減，但該入口部件得固定在該滑動軌道上至少兩處，且與所述兩處對應的該垂直距離有一最小值與一最大值，而該最小值係大於該入口口徑至少三倍。 A sound source locating device for detecting the original azimuth angle of an original sound source relative to the sound source locating device, which comprises: a top cover with a sliding cavity, the inner side of which is provided with sliding tracks; an entrance A part, placed in the sliding cavity, can be manually or electrically actuated to move on the sliding track, and has one or more through-channels for air circulation, wherein the one or more through-channels have an inlet distributed along the length Diameter; a base with an accommodating space and designed to be closely combined with the top cover; a microphone circuit board with a plurality of microphones on it, which can be subjected to changes in air pressure to generate acoustic and electrical signals and communicate with the one or more microphones. The track is placed in the accommodating space at a vertical distance; and an arithmetic unit is placed in the accommodating space and electrically connected to the microphone circuit board for receiving the acoustic and electrical signals and detecting the intensity of the acoustic and electrical signals , and output the original azimuth angle at the center of symmetry between the microphone azimuth angles according to the strength of the acoustic and electrical signals, wherein, the movement of the entrance part on the sliding track will cause the vertical distance to increase or decrease, but the entrance part must be fixed at There are at least two places on the sliding track, and the vertical distance corresponding to the two places has a minimum value and a maximum value, and the minimum value is at least three times larger than the inlet diameter.

在其一實施例中，所述聲源定位裝置更包含：一馬達，電氣連接該滑動軌道，並置於該容置空間內，用以受控制訊號調整該入口部件相對於該滑動軌道之高低。該控制訊號係一週期性脈衝訊號，交替地出現以調整該入口部件高低。該運算單元係根據該控制訊號交替出現的時間，在該等聲電訊號強度分布出現兩極值時，輸出較高該入口部件時之結果，反之，則輸出較低該入口部件時之結果。 In one embodiment, the sound source localization device further includes: a motor electrically connected to the sliding track and placed in the accommodating space for adjusting the height of the entrance part relative to the sliding track by a control signal. The control signal is a periodic pulse signal which alternately appears to adjust the height of the inlet part. According to the time when the control signal appears alternately, when the intensity distribution of the acoustic and electrical signals has two extreme values, the computing unit outputs the result when the inlet component is higher, On the contrary, the result when the entry part is lower is output.

A:到達角 A: Arrival angle

A₁:第一到達角 A ₁ : First Arrival Angle

A₂:第一到達角 A ₂ : First Arrival Angle

B:外殼 B: Shell

Bin:內腔 Bin: lumen

C30:α=30°在麥克風陣列平面上之路徑 C30: The path of α=30° on the plane of the microphone array

C60:α=60°在麥克風陣列平面上之路徑 C60: α=60° path on the plane of the microphone array

C90:α=90°在麥克風陣列平面上之路徑 C90: α=90° path on the plane of the microphone array

C120:α=120°在麥克風陣列平面上之路徑 C120: α=120° path on the plane of the microphone array

C150:α=30°在麥克風陣列平面上之路徑 C150: α=30° path on the plane of the microphone array

COM:麥克風陣列平面上同一極角所圍路徑 COM: Path surrounded by the same polar angle on the plane of the microphone array

c:聲速 c: speed of sound

Dm:兩麥克風距離 Dm: distance between two microphones

D₁:分布寬度 D ₁ : Distribution width

D₂:對稱圓直徑 D ₂ : Symmetric circle diameter

dp:深度 dp: depth

dur:持續期 dur: duration

Lm:兩麥克風距離造成之聲程差 Lm: The sound path difference caused by the distance between the two microphones

L₁:第一聲程差 L ₁ : The first sound path difference

M:麥克風電路板 M: Microphone circuit board

MU:運算單元 MU: arithmetic unit

O₁:第一入口 O ₁ : The first entrance

O₂:第二入口 O ₂ : Second entrance

OM:傳播位移 OM: Propagation Displacement

p:行走時間 p: walking time

SO:入射方向 SO: Incident direction

th:垂直距離 th: vertical distance

U:頂蓋 U: top cover

UB:底座 UB: base

V₁:第一速度 V ₁ : first speed

α:內角 α: interior angle

β:投射角 β: projection angle

γ:都卜勒因子 γ: Doppler factor

δ:等效聲源瞬時寬度 δ: Instantaneous width of equivalent sound source

θ:麥克風所在之極角 θ: polar angle where the microphone is located

Φ:麥克風所在之方位角 Φ: The azimuth where the microphone is located

1:左點源傳到正下方之時間脈衝 1: The time pulse from the left point source to the directly below

2:右點源傳到正下方之時間脈衝 2: The right point source is transmitted to the time pulse directly below

3:左點源傳到隔壁之時間脈衝 3: The time pulse from the left point source to the next door

4:右點源傳到隔壁之時間脈衝 4: The time pulse from the right point source to the next door

335:以A₁=35°入射在θ=30°的COM上強度分布 335: Intensity distribution on COM at θ=30° incident at A ₁ = 35°

355:以A₁=55°入射在θ=30°的COM上強度分布 355: Intensity distribution on COM at θ=30° incident at A ₁ = 55°

375:以A₁=75°入射在θ=30°的COM上強度分布 375: Intensity distribution on COM with A ₁ = 75° incident at θ = 30°

435:以A₁=35°入射在θ=45°的COM上強度分布 435: Intensity distribution on COM at θ=45° incident at A ₁ = 35°

455:以A₁=55°入射在θ=45°的COM上強度分布 455: Intensity distribution on COM at θ=45° incident at A ₁ = 55°

475:以A₁=75°入射在θ=45°的COM上強度分布 475: Intensity distribution on COM at θ=45° incident at A ₁ = 75°

735:以A₁=35°入射在θ=70°的COM上強度分布 735: Intensity distribution on COM at θ=70° incident at A ₁ = 35°

755:以A₁=55°入射在θ=70°的COM上強度分布 755: Intensity distribution on COM at θ=70° incident at A ₁ = 55°

775:以A₁=75°入射在θ=70°的COM上強度分布 775: Intensity distribution on COM at θ=70° incident at A ₁ = 75°

90:活動支柱 90: Activity Pillar

901:上表面 901: Upper surface

902:滑動側面 902: Sliding sides

95:顯示單元 95: Display unit

第1圖係先前技術之平面波模型示意圖 Figure 1 is a schematic diagram of the plane wave model of the prior art

第2圖係先前技術遇兩透氣小孔之困境示意圖 Figure 2 is a schematic diagram of the predicament of the prior art encountering two ventilation holes

第3圖係本創作之第一實施例配置示意圖 Figure 3 is a schematic diagram of the configuration of the first embodiment of the present creation

第4圖係本創作之第一實施例聲電訊號強度分布圖 Figure 4 shows the intensity distribution of acoustic and electrical signals in the first embodiment of the present invention

第5圖係本創作之第二、三實施例聲電訊號強度分布圖 Figure 5 shows the intensity distribution of acoustic and electrical signals in the second and third embodiments of this creation

第6圖係本創作之第二、三實施例之誤差示意圖 Figure 6 is a schematic diagram of the errors of the second and third embodiments of this creation

第7圖係本創作之第四實施例外觀示意圖 Figure 7 is a schematic diagram of the appearance of the fourth embodiment of the present invention

為闡述本創作之獨特功效，前兩段先以習知技藝，搭配二收音小孔的收音配置示範習知面臨的困境；接著，再提出本創作之實施例。由於本說明書係以裝置內部所感測到的聲電訊號做示範，故，後續出現所述聲電訊號間互延遲(mutual delay)之分析結果，係無法等同先前技術提到的聲程差除以聲速。 In order to illustrate the unique effect of this creation, the first two paragraphs first demonstrate the difficulties faced by conventional techniques with the use of conventional techniques and the configuration of the two radio holes; then, an embodiment of this creation is presented. Since this specification uses the acoustic and electrical signals sensed inside the device as an example, the subsequent analysis results of the mutual delay between the acoustic and electrical signals cannot be equivalent to dividing the sound path difference mentioned in the prior art by speed of sound.

請參照「第2圖」，當來自左側的原始聲源前後接觸到該二收音小孔時，由於該二收音小孔的孔隙遠小於人聲波長，故聲波通過該二收音小孔後，再無法以原來行進方向繼續傳播至麥克風，反而是以新生點源的方式向外輻射傳播。原始聲源的行進方向與該二收音小孔沿線所夾之到達角造成的聲程差為c．p，其中，c為當時的聲速，以及p為行走時間(traveling time)代表該二收音小孔新生的左點源比右點源早的時間差。該二收音小孔下方的兩個麥克風，除了收到正上方傳來的聲波外，還會收到隔壁傳來的聲波。以圖示1、4分別代表該左點源傳到正下方以及該右點源傳到隔壁的到時；同時，以圖示2、3分別代表該右點源傳到正下方以及該左點源傳到隔壁的到時。 Please refer to "Picture 2", when the original sound source from the left touches the two sound-receiving holes before and after, because the pores of the second sound-receiving holes are much smaller than the wavelength of human voice, after the sound wave passes through the two sound-receiving holes, it can no longer be Continue to propagate to the microphone in the original direction of travel, but instead radiate outward in the form of a new point source. The sound path difference caused by the traveling direction of the original sound source and the angle of arrival between the two radio holes along the line is c. p, where c is the speed of sound at that time, and p is the traveling time, which represents the time difference between the left point source and the right point source of the newly-born left point source of the two radio holes. The two radio holes The two microphones below, in addition to receiving sound waves from directly above, will also receive sound waves from next door. Figures 1 and 4 represent the time when the left point source is transmitted directly below and the right point source is transmitted to the next door; at the same time, Figures 2 and 3 represent the right point source is transmitted directly below and the left point respectively. The source is transmitted to the time next door.

當該左右點源傳到隔壁的時間略少於一個該行走時間p，該左點源因為早於該右點源一個該行走時間p，所以反而比該右點源早傳到該右點源下方的麥克風。如同「第2圖」左上所示，左側麥克風的到時(實線)與右側麥克風的到時(點線)並不一致，故兩麥克風產生的聲電訊號波形相當地不同；這與同調量測法(coherent measurement,CM)或交叉功率頻譜相位法(CSP)的先決條件-「彼此相似的波形」，相當牴觸。正由於兩麥克風產生的聲電訊號的波形並不相似，習知技藝分析出的互延遲將與p分歧，也就無從得知該原始聲源的方位角。 When the time of the left and right point sources to the next door is slightly less than one of the walking time p, the left point source is transmitted to the right point source earlier than the right point source because the left point source is one the walking time p earlier than the right point source. microphone below. As shown in the upper left of "Figure 2", the arrival time of the left microphone (solid line) and the arrival time of the right microphone (dotted line) are not consistent, so the waveforms of the acoustic and electrical signals generated by the two microphones are quite different; this is different from the coherence measurement. A prerequisite for coherent measurement (CM) or cross-power spectral phase (CSP) - "waveforms that are similar to each other", is quite confusing. Since the waveforms of the acoustic and electrical signals generated by the two microphones are not similar, the mutual delay analyzed by the prior art will be different from p, and the azimuth angle of the original sound source cannot be known.

請參照「第3圖」，係本創作第一實施例，一外殼B設置有一第一入口O₁；且該第一入口O₁設有一個或數個空氣貫穿孔沿x-y平面分布，且沿x軸之分布寬度D₁小於人聲最小波長。有一麥克風陣列設置於該外殼B所圍之內腔Bin，其所在平面至該外殼B的深度dp，係大於該分布寬度D₁至少三倍以上。當原始聲源由左側沿一入射方向SO襲來，即在該第一入口O₁形成以第一速度V₁移動之等效聲源，且V₁=D₁/(L₁/c)=c/cos(A₁)，其中，L₁為第一聲程差，A₁為第一到達角以及V₁方向與x軸方向同。若以一脈衝波示意該等效點源之發生，則若干個該脈衝波隨時間依序出現即示意該等效聲源的移動，且該若干個脈衝波出現的期間稱為持續期(duration)dur。由於該深度dp遠大於該分布寬度D₁，故該等效聲源到達該麥克風陣列的時間，僅與該第一入口O₁至該麥克風陣列之傳播位移OM長度有關，並無法從中推算出該原始聲源之方位角。 Please refer to "Fig. 3", which is the first embodiment of the present invention. A casing B is provided with a first inlet O ₁ ; and the first inlet O ₁ is provided with one or more air through holes distributed along the xy plane, and along the The distribution width D ₁ of the x-axis is smaller than the minimum wavelength of human voice. A microphone array is disposed in the cavity Bin surrounded by the casing B, and the depth dp from the plane where the microphone array is located to the casing B is at least three times greater _than the distribution width D1. When the original sound source strikes from the left along an incident direction SO, that is, an equivalent sound source moving at the first velocity V ₁ is formed at the first inlet O ₁ , and V ₁ =D ₁ /(L ₁ /c)= c/cos(A ₁ ), wherein L ₁ is the first sound path difference, A ₁ is the first angle of arrival, and the direction of V ₁ is the same as the direction of the x-axis. If a pulse wave is used to indicate the occurrence of the equivalent point source, several pulse waves appear in sequence over time to indicate the movement of the equivalent sound source, and the period during which the pulse waves appear is called a duration. )dur. Since the depth dp is much larger than the distribution width D ₁ , the time for the equivalent sound source to reach the microphone array is only related to the length of the propagation displacement OM from the first inlet O ₁ to the microphone array, and the The azimuth of the original sound source.

然而，該持續期dur卻與該第一到達角A₁明顯相關。對該傳播位移OM偏離該第一速度V₁較多的麥克風而言，較早發生的該等效聲源具有較短的該傳播位移OM，以及較晚發生的該等效聲源有較長的該傳播位移OM；反之，對於較靠近該第一速度V₁的麥克風而言，較早發生的該等效聲源有較長的該傳播位移OM，以及較晚發生的該等效聲源有較短的該傳播位移OM。這與都卜勒效應產生的原因一樣，其結果就是當移動中的聲源朝我們而來，我們會聽到較高頻(該等效聲源密集出現)的聲音；反之，則聽到較低頻(該等效聲源分開出現)的聲音。 However, the duration dur is clearly related to the _first angle of arrival A1. For the microphone whose propagation displacement OM deviates more from the first velocity V ₁ , the equivalent sound source that occurs earlier has a shorter propagation displacement OM, and the equivalent sound source that occurs later has a longer propagation displacement OM. the propagation displacement OM _of There is a shorter OM for this propagation displacement. This is the same as the Doppler effect, the result is that when a moving sound source is coming towards us, we hear the higher frequency (the equivalent sound source is densely present) sound; otherwise, we hear the lower frequency sound (the equivalent sound source appears separately).

該持續期dur(α)，即遵守都卜勒效應的關係式：dur(α)=δ+γ．(L₁/c)...(1)，其中，α為該第一速度V₁與該傳播位移OM所成之內角，δ為該等效聲源瞬時寬度以及都卜勒因子γ=|1-cos(α)/cos(A₁)|。該等效聲源瞬時寬度δ與該第一入口O₁沿y軸分布寬度有關，並正比於該傳播位移OM與y軸所夾之投射角β之正弦函數；因此，麥克風在x-z平面上的該傳播位移OM對應的即是最短的δ，以及麥克風在y-z平面上的該傳播位移OM對應的即是最長的δ。根據能量守恆定律，該傳播位移OM上通過的聲源強度應反比於該傳播位移OM長度平方，且該傳播位移OM上麥克風之聲電訊號強度應反比該持續期dur。 The duration dur(α) is the relational formula that obeys the Doppler effect: dur(α)=δ+γ. (L ₁ /c)...(1), where α is the inner angle formed by the first velocity V ₁ and the propagation displacement OM, δ is the instantaneous width of the equivalent sound source and the Doppler factor γ= |1-cos(α)/cos(A ₁ )|. The instantaneous width δ of the equivalent sound source is related to the distribution width of the first entrance O ₁ along the y-axis, and is proportional to the sine function of the projection angle β between the propagation displacement OM and the y-axis; therefore, the microphone on the xz plane The propagation displacement OM corresponds to the shortest δ, and the propagation displacement OM of the microphone on the yz plane corresponds to the longest δ. According to the law of conservation of energy, the intensity of the sound source passing through the propagation displacement OM should be inversely proportional to the square of the length of the propagation displacement OM, and the intensity of the acoustic signal of the microphone on the propagation displacement OM should be inversely proportional to the duration dur.

請參照「第4圖」，該麥克風陣列所在平面上，對應該內角α=30°、60°、90°、120°以及150°之曲線C30、C60、C90、C120以及C150。若沿著一等長路徑COM(極角θ=70°所圍之圓)，取該麥克風陣列之聲電訊號強度，以該麥克風陣列所在之方位角Φ排序(右圖橫軸)，並設0°為該第一速度V₁代表的方位角，便能看出該等聲電訊號強度間的對稱性(如右圖)。正方形、三角形以及圓形分別標示當該第一到達角A₁為35°、55°以及75°時，該等聲電訊號強度隨該麥克風陣列之方位角變化(735、755、775)。因此，以該等聲電訊號強度對該麥克風陣列所在之方位角Φ做加權平均數，便能得出該第一速度V₁方位角預估值，例如：右圖即得出3.7°、5.4°以及6.7°的結果，其平均誤差弳度大約是0.09。以六公尺遠的距離估算，此平均誤差弳度導致的最大誤判範圍約為55cm。 Please refer to "Fig. 4", on the plane where the microphone array is located, the curves C30, C60, C90, C120 and C150 corresponding to the inner angles α=30°, 60°, 90°, 120° and 150°. If you follow an equal-length path COM (a circle surrounded by a polar angle θ=70°), take the acoustic and electrical signal strength of the microphone array, sort by the azimuth angle Φ of the microphone array (horizontal axis in the right figure), and set 0° is the azimuth angle represented by the first velocity V ₁ , and the symmetry between the intensities of the acoustic and electrical signals can be seen (as shown in the figure on the right). Squares, triangles, and circles indicate that when the _first angle of arrival A1 is 35°, 55°, and 75°, the intensity of the acoustic and electrical signals varies with the azimuth of the microphone array (735, 755, 775). Therefore, by taking the weighted average of the azimuth angle Φ where the microphone array is located, the estimated value of the azimuth angle of the first velocity V ₁ can be obtained. ° and 6.7°, the average error is about 0.09. Estimated at a distance of six meters, the maximum misjudgment range caused by this average error is about 55cm.

除此之外，由於該內角α大於80°後，該等聲電訊號強度會弱到融入背景噪音至無法判斷；故，可以利用無法判斷出該等聲電訊號強度之無聲區域來決定該第一速度V₁方位角預估值。更進一步地，還能利用該無聲區域外緣的該等聲電訊號強度比例，來計算該第一速度V₁方位角預估值，例如：該無聲區域外緣+100°與-100°各有該聲電訊號強度1與0.6，則[(+100°．0.6)+(-100°．1)]/1.6=-25°即該第一速度V₁方位角預估值。 In addition, since the internal angle α is greater than 80°, the intensity of the acoustic and electrical signals will be so weak that it is merged into the background noise and cannot be judged; therefore, the silent area where the intensity of the acoustic and electrical signals cannot be judged can be used to determine the The estimated value of the azimuth angle of the _first velocity V1. Furthermore, the ratio of the intensity of the acoustic and electrical signals at the outer edge of the silent area can also be used to calculate the estimated value of the azimuth angle of the first velocity V ₁ , for example: the outer edge of the silent area is +100° and -100° respectively. With the acoustic and electrical signal strengths 1 and 0.6, then [(+100°.0.6)+(-100°.1)]/1.6=-25°, which is the estimated value of the azimuth angle of the _first velocity V1.

請參照「第5圖」，本創作第二實施例以及第三實施例係分別沿不同極角所圍出的該等長路徑(極角θ=45°與30°)，取該麥克風陣列之聲電訊號強度，以該麥克風陣列所在之方位角Φ排序(右圖橫軸)，並設0°為該第一速度V₁代表的方位角。正方形、三角形以及圓形分別標示當該第一到達角A₁為35°、55°以及75°時，該等聲電訊號強度隨該麥克風陣列之方位角變化；其中，圖示435、455以及475分別表示三種該第一到達角在極角θ=45°所圍出的該等長路徑上的麥克風，以及圖示335、355以及375分別表示三種該第一到達角在極角θ=30°所圍出的該等長路徑上的麥克風。同樣地，以該等聲電訊號強度對該麥克風陣列所在之方位角Φ做加權平均數，所得出的該第一速度V₁方位角平均誤差弳度分別為0.10(θ=45°)以及0.12(θ=30°)。雖然，本創作之第二實施例與第三實施例在平均誤差的表現上並不如第一實施例；第二實施例對該第一到達角A₁=55°卻有較佳的方位角解析能力，其誤差弳度只有0.07，第三實施例則是針對該第一到達角A₁=75°有較佳的方位角解析能力，其誤差弳度來到更低的0.05。 Please refer to "Fig. 5", the second embodiment and the third embodiment of the present invention are respectively along the equal-length paths (polar angles θ=45° and 30°) surrounded by different polar angles, taking the distance of the microphone array The acoustic and electrical signal strengths are sorted by the azimuth angle Φ where the microphone array is located (horizontal axis in the right figure), and 0° is set as the azimuth angle represented by the first velocity V ₁ . Squares, triangles and circles respectively indicate that when the _first angle of arrival A1 is 35°, 55° and 75°, the intensity of the acoustic and electrical signals varies with the azimuth angle of the microphone array; among them, the figures 435, 455 and 475 respectively represent the three microphones on the equal-length path surrounded by the first angle of arrival at the polar angle θ=45°, and the diagrams 335, 355 and 375 respectively represent the three types of the first angle of arrival at the polar angle θ=30 ° the microphones on the path of equal length. Similarly, taking the weighted average of the azimuth angle Φ where the microphone array is located with the strengths of the acousto-electric signals, the average azimuth angle error Φ of the first velocity V ₁ obtained is 0.10 (θ=45°) and 0.12, respectively. (θ=30°). Although, the average error performance of the second embodiment and the third embodiment of the present invention is not as good as that of the first embodiment; the second embodiment has a better azimuth angle analysis for the first angle of arrival A ₁ =55° The third embodiment has better azimuth resolution capability for the first angle of arrival A ₁ =75°, and the error ratio is lower than 0.05.

因此，若本創作應用的場合中，該第一到達角A₁有較集中的範圍，則應就其範圍選定該麥克風陣列所在的位置。例如：當該第一到達角A₁集中在72°附近，則適合選定在極角θ在35°-40°附近所圍出的該等長路徑布置該麥克風陣列；以及當該第一到達角A₁集中在20°附近，則適合選定在極角θ大於等於60°所圍出的該等長路徑布置該麥克風陣列。實際應用時，可以藉由調整該深度的深淺，例如：墊高該麥克風陣列所在之電路，即能達到針對該第一到達角A₁最佳化的配置。 Therefore, if the first angle of arrival A1 has _a relatively concentrated range in this creative application, the position of the microphone array should be selected for the range. For example: when the _first angle of arrival A1 is concentrated around 72°, it is suitable to select the equal-length path surrounded by the polar angle θ around 35°-40° to arrange the microphone array; and when the first angle of arrival is around 35°-40° If A ₁ is concentrated around 20°, it is suitable to choose to arrange the microphone array on the equal-length path surrounded by the polar angle θ greater than or equal to 60°. In practical application, by adjusting the depth of the depth, for example, raising the circuit where the microphone array is located, the configuration optimized for the _first angle of arrival A1 can be achieved.

請參照「第6圖」，若該麥克風陣列在該等長路徑(以極角θ=70°來看)上的分布並不是以該第一速度V₁為對稱軸時，例如：上圖顯示偏移10°的結果以及下圖顯示偏移30°的結果；所述誤差弳度最高將來到該第一到達角A₁=75°的0.19(上圖偏移10°的圓形標結果)，其他偏移情形之所述誤差弳度最差約為0.14(下圖偏移30°的三角形標結果)。若應用時當該第一到達角A₁集中範圍大致大於50°，將該麥克風陣列分布在該等長路徑(θ=30°-45°)，則可控制該第一速度V₁預估值之誤差弳度不致超過0.15；反之，將該麥克風陣列分布在該等長路徑(θ>45°)，則可控制該第一速度V₁預估值之誤差弳度不致超過0.09。 Please refer to "Fig. 6", if the distribution of the microphone array on the equal-length path (viewed by the polar angle θ=70°) is not the axis of symmetry of the first velocity V ₁ , for example: the above figure shows The result with a 10° offset and the following figure shows the result with a 30° offset; the maximum error will come to 0.19 of the first angle of arrival A ₁ =75° (the circular result with a 10° offset in the upper figure) , and the worst case of the other offsets is about 0.14 (the result of the triangular mark with a 30° offset in the figure below). If the concentrated range of the first angle of arrival A ₁ is approximately greater than 50° in application, and the microphone array is distributed on the path of equal length (θ=30°-45°), the estimated value of the first velocity V ₁ can be controlled On the contrary, by distributing the microphone array on the same length path (θ>45°), the error ratio of the estimated value of the first velocity V ₁ can be controlled not to exceed 0.09.

請參照「第7圖」，本創作之第四實施例，一聽聲辨位裝置以一頂蓋U朝上，並設置有一第二入口O₂；再以一底座UB，設有一容置空間，設計與該頂蓋U緊密結合。該第二入口O₂設有一個或數個空氣貫穿孔沿x-y平面大致呈圓對稱分布，其對稱圓直徑D₂係小於人聲最小波長。該第二入口O₂係位於一活動支柱90之上表面901上，藉由該活動支柱90之滑動側面902相對於該頂蓋U的上下滑動，調整該第二入口O₂與一麥克風電路板M的垂直距離th，且該垂直距離th係大於該對稱圓直徑D₂至少三倍。該聽聲辨位裝置中亦配置有一運算單元MU以及與其電氣連接一顯示單元95。該運算單元MU係與該麥克風電路板M電氣相連，用以處理麥克風產生的聲電訊號，以及分析出該等聲電訊號強度所代表的方位角對稱性，最後將其輸出至該顯示單元95，令使用者於外觀上即能清楚看到說話者的方位，例如：將說話者的方向上的顯示燈亮起。 Please refer to "Fig. 7", the fourth embodiment of the present creation, a sound location identification device is provided with a top cover U facing upward, and a second inlet O ₂ is provided; and a base UB is provided with a accommodating space , the design is closely combined with the top cover U. The second inlet O ₂ is provided with one or more air through holes distributed approximately in a circular symmetry along the xy plane, and the diameter D ₂ of the symmetry circle is smaller than the minimum wavelength of human voice. The second inlet O ₂ is located on the upper surface 901 of a movable pillar 90 , and the second inlet O ₂ and a microphone circuit board are adjusted by sliding the sliding side 902 of the movable pillar 90 up and down relative to the top cover U. The vertical distance th of M, and the vertical distance th is at least three times greater than the diameter D ₂ of the circle of symmetry. There is also an arithmetic unit MU and a display unit 95 electrically connected to the listening and positioning device. The computing unit MU is electrically connected to the microphone circuit board M, and is used to process the acoustic and electrical signals generated by the microphone, analyze the azimuth symmetry represented by the strength of the acoustic and electrical signals, and finally output it to the display unit 95 , so that the user can clearly see the speaker's orientation on the outside, for example: turn on the indicator light on the speaker's direction.

為因應較低說話者位置的使用情形，該上表面901可受壓迫，或經馬達操作滑動該第二入口O₂至較低的位置，也就是使該垂直距離th較小，以調大所述極角的數值。 In response to the use situation of the lower speaker position, the upper surface 901 can be pressed, or the second inlet O ₂ can be slid to a lower position by the motor operation, that is, the vertical distance th can be made smaller, so as to increase the size of the speaker. the value of the polar angle.

另一方面，該聽聲辨位裝置係置於一牆面或一講台斜面上，並以該頂蓋U朝向大部分說話者的方向，且說話者聲波接觸到該第二入口O₂之第二到達角A₂皆大於等於65°；故該麥克風電路板M上複數個麥克風對該第二入口O₂所圍之極角θ最好是介於33°到63°之間。因應這樣的使用情形，該上表面901可受壓彈跳，或經馬達操作滑動該第二入口O₂至較高的位置，也就是該垂直距離th較大，以調小所述極角的數值。 On the other hand, the listening position identification device is placed on a wall or on the slope of a podium, with the top cover U facing the direction of most of the speakers, and the speaker's sound waves contact the second entrance _O2 . The two arrival angles A ₂ are both greater than or equal to 65°; therefore, the polar angle θ surrounded by the plurality of microphones on the microphone circuit board M to the second inlet O ₂ is preferably between 33° and 63°. In response to such a situation, the upper surface 901 can bounce under pressure, or slide the second inlet O ₂ to a higher position through motor operation, that is, the vertical distance th is larger, so as to reduce the value of the polar angle .

本說明說書提及之聲電訊號係麥克風因應空氣壓力變化所產生的電力相關訊號，並不限定其數位或是類比形式；以及該聲電訊號強度可指該聲電訊號中首度上下波形的振幅，也可以指該聲電訊號在一時間範圍內的最高數值，更可以是該聲電訊號做完快速傅立葉轉換後的頻譜在特定頻率的能量密度，例如：該特定頻率為人聲最低或次低的主要頻率。 The audio signal mentioned in this manual is the result of the microphone's response to changes in air pressure. The generated power-related signal is not limited to its digital or analog form; and the acousto-electric signal strength can refer to the amplitude of the first up and down waveform in the acousto-electric signal, or it can refer to the highest amplitude of the acousto-electric signal within a time range. The value can be the energy density of the frequency spectrum of the acoustic signal after the fast Fourier transformation is completed, for example, the specific frequency is the lowest or second lowest main frequency of the human voice.

本創作之第五實施例係在兩個不同極角所圍之圓上設置一內麥克風陣列(θ=33°)以及一外麥克風陣列(θ=68°)，並各自分析出其聲電訊號強度之對稱軸所在之內方位角與外方位角。接著，偵測該聲電訊號是否存在兩個明顯極值，並在所述兩個明顯極值存在時，採用該內方位角為結果；反之，則採用該外方位角為結果。 In the fifth embodiment of the present invention, an inner microphone array (θ=33°) and an outer microphone array (θ=68°) are arranged on a circle surrounded by two different polar angles, and their acoustic and electrical signals are analyzed separately. The inner and outer azimuths where the symmetry axis of the intensity is located. Next, it is detected whether there are two obvious extreme values in the acoustic and electrical signal, and when the two obvious extreme values exist, the inner azimuth angle is used as the result; otherwise, the outer azimuth angle is used as the result.

綜上所述，本創作之聲源定位方法及其裝置，確已符合專利申請之要件，爰依法提出專利申請。惟以上所述者，僅為本創作之較佳實施例，當不能以此限定本創作實施之範圍；故，凡依本創作申請專利範圍及說明書內容所作之簡單的等效變化與修飾，皆應仍屬本創作專利涵蓋之範圍內。 To sum up, the sound source localization method and device of this creation have indeed met the requirements of the patent application, and a patent application can be filed in accordance with the law. However, the above are only the preferred embodiments of this creation, and should not limit the scope of implementation of this creation; therefore, any simple equivalent changes and modifications made according to the scope of the patent application for this creation and the contents of the description are all applicable. shall remain within the scope of this creative patent.

A₁:第一到達角 A ₁ : First Arrival Angle

B:外殼 B: Shell

Bin:內腔 Bin: lumen

D₁:分布寬度 D ₁ : Distribution width

dp:深度 dp: depth

dur:持續期 dur: duration

O₁:第一入口 O ₁ : The first entrance

OM:傳播位移 OM: Propagation Displacement

SO:入射方向 SO: Incident direction

V₁:第一速度 V ₁ : first speed

α:內角 α: interior angle

β:投射角 β: projection angle

θ:麥克風所在之極角 θ: polar angle where the microphone is located

Claims

A sound source locating device for detecting the original azimuth angle of an original sound source relative to the sound source locating device, which comprises: a top cover, which is provided with one or more air circulation holes, and its distribution length has a The inlet diameter; a base with an accommodating space and designed to be closely combined with the top cover; a microphone circuit board with a plurality of microphones on it, which can be subjected to changes in air pressure to generate acoustic and electrical signals, and communicate with the one or more microphones The air circulation hole is placed in the accommodating space at a vertical distance, wherein the vertical distance is at least three times greater than the diameter of the inlet; and an arithmetic unit is placed in the accommodating space and electrically connected to the microphone circuit board, using to receive the acoustic and electrical signals, detect the intensity of the acoustic and electrical signals, and output the original azimuth at the center of symmetry between the azimuth angles of the microphones according to the intensity of the acoustic and electrical signals, wherein the maximum distance between the microphones is greater than is equal to 1.3 times this vertical distance.

The sound source localization device according to claim 1, wherein the maximum distance between the microphones is greater than or equal to 2.38 times the vertical distance.

The sound source localization device according to claim 1, wherein the maximum distance between the microphones is less than or equal to 3.92 times the vertical distance.

A sound source locating device for detecting the original azimuth angle of an original sound source relative to the sound source locating device, which comprises: a top cover with a sliding cavity, the inner side of which is provided with sliding tracks; an entrance A part, placed in the sliding cavity, can be manually or electrically actuated to move on the sliding track, and has one or more through-channels for air circulation, wherein the one or more through-channels have an inlet distributed along the length caliber; a base with an accommodating space and designed to be closely combined with the top cover; a microphone circuit board with a plurality of microphones on it, which can be subjected to changes in air pressure to generate acoustic and electrical signals and communicate with the one or more through-track distances A vertical distance is placed in the accommodating space; and an arithmetic unit is placed in the accommodating space and electrically connected to the microphone circuit board for receiving the acoustic and electrical signals, detecting the intensity of the acoustic and electrical signals, and The original azimuth angle is output at the center of symmetry between the microphone azimuth angles according to the strength of the acoustic and electrical signals, wherein the vertical distance increases or decreases when the entrance part moves on the sliding track, but the entrance part must be fixed on the sliding track. There are at least two places on the track, and the vertical distance corresponding to the two places has a minimum value and a maximum value, and the minimum value is at least three times larger than the entrance aperture, used to detect an original sound source relative to the The original azimuth of the sound source localization device.

The sound source positioning device according to claim 4, further comprising: a motor electrically connected to the sliding track and placed in the accommodating space for adjusting the height of the entrance part relative to the sliding track by a control signal, wherein , the operation unit also receives the control signal, and when the intensity distribution of the acoustic and electrical signals has two extreme values, it outputs the result when the inlet component is higher, and vice versa, outputs the result when the inlet component is lower.