CN201639751U - Fixed-direction and fixed-distance voice collecting system based on multi-microphone array - Google Patents
Fixed-direction and fixed-distance voice collecting system based on multi-microphone array Download PDFInfo
- Publication number
- CN201639751U CN201639751U CN2010201291853U CN201020129185U CN201639751U CN 201639751 U CN201639751 U CN 201639751U CN 2010201291853 U CN2010201291853 U CN 2010201291853U CN 201020129185 U CN201020129185 U CN 201020129185U CN 201639751 U CN201639751 U CN 201639751U
- Authority
- CN
- China
- Prior art keywords
- microphone array
- collecting system
- wave beam
- system based
- target
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
Images
Abstract
The utility model discloses a fixed-direction and fixed-distance voice collecting system based on a multi-microphone array, which comprises a camera, the multi-microphone array, and a voice output device that is connected with the multi-microphone array; a wave-beam forming and calculating device and a wave-beam forming and processing device are sequentially arranged between the multi-microphone array and the voice output device; the camera is connected with the wave-beam forming and calculating device through a target positioning device. The voice collecting system provided by the utility model adopting the multi-microphone array technology can receive voice signals of the assigned target in a fixed-direction and fixed-distance way through forming directional wave beam, thereby leading the user to accurately monitor the voice signals of the suspicious target without interference and increasing monitoring efficiency.
Description
Technical field
The utility model relates to audio signal and strengthens the field, specifically is meant a kind of directed spacing speech collecting system based on the multi-microphone array.
Background technology
General outputting video signal of existing supervisory control system generally taked with the direct form of gathering of single microphone for audio signal.If in the scene of more complicated, as the railway station, the square, flow of the people is big, the situation that many people speak simultaneously, the audio frequency of taking common audio signal sample mode to export will make the user can't tell the different people word.If the monitor staff finds that the suspicious figure is whispering to each other simultaneously, want to monitor their talk, at this moment common audio collection mode can't satisfy this demand.
The utility model content
The purpose of this utility model is to overcome the shortcoming and defect of above-mentioned prior art, a kind of directed spacing speech collecting system based on the multi-microphone array is provided, this speech collecting system adopts the multi-microphone array technique, can be by forming directional wave beam, the liftoff voice signal of accepting intended target of directed spacing, thereby can make the user get rid of the audio signal that interference listens to suspicious object exactly, improve monitoring efficiency.
The purpose of this utility model is achieved through the following technical solutions: a kind of directed spacing speech collecting system based on the multi-microphone array, comprise camera, multi-microphone array and the instantaneous speech power that links to each other with the multi-microphone array, be disposed with wave beam between described multi-microphone array and the instantaneous speech power and form calculation element and wave beam formation processing unit, camera forms calculation element by target locating set and wave beam and links to each other.
The operation principle of above-mentioned target locating set is: after the monitor staff finds suspicious object, single-frame images according to the camera shooting, target locating set positions the intended target on the image, this target locating set is made up of target range calculator and target direction calculator, wherein, the target range calculator can be according to known camera setting height(from bottom), luffing angle, and suspicious object picture size and empirical data calculate the distance from camera of target; The target direction calculator can be according to the camera deviation angle, and luffing angle and fixed setting calculate the angle of the relative camera calibration direction of target.Wave beam forms calculation element and utilizes information such as target range that target locating set calculates and target direction, calculates needed directional wave beam coefficient.
The operation principle that wave beam forms processing unit is: after wave beam formation calculation element calculates needed directional wave beam coefficient, wave beam forms processing unit and utilizes these directional wave beam coefficients, filter the multi-path voice signal that the multi-microphone array collects, this wave beam coefficient can provide high-gain by a voice signal to target direction and target place distance, suppress the voice signal of other directions and distance simultaneously, thereby reach the purpose of only accepting the intended target voice.
As further improvement of the utility model, described wave beam forms between processing unit and the instantaneous speech power and is provided with noise elimination apparatus, form the voice signal that processing unit is handled through wave beam, further take out noise in the voice signal by noise elimination apparatus, thereby can further improve quality of speech signal, be more conducive to the monitor staff and do not hear voice; Noise elimination apparatus is connected to instantaneous speech power at last, is used to export voice.
For guaranteeing to need to check most complete monitor data under the extreme case, described instantaneous speech power is connected with holder, is used to store the HD video video recording that camera collection arrives; Described instantaneous speech power is connected with holder, is used for storaged voice.
Above-mentioned target locating set is made up of target range calculator and target direction calculator.
It is the wave beam coefficient calculator that above-mentioned wave beam forms calculation element.
It is the sef-adapting filter group that above-mentioned wave beam forms processing unit.
Microphone array number of columns in the above-mentioned multi-microphone array is at least 2.
Above-mentioned instantaneous speech power is loud speaker, earphone or network.
In sum, the beneficial effects of the utility model are: utilize advanced directed spacing voice collecting technology, can get rid of other interference signals, effectively collect the voice signal of intended target, realize the task that the traditional voice collection can't be finished.Can help the monitor staff to monitor the suspicious object dialogue effectively, judge whether threat, great application value be arranged at public safety field.
Description of drawings
Fig. 1 is a structural representation of the present utility model.
Embodiment
Below in conjunction with embodiment and accompanying drawing, the utility model is described in further detail, but execution mode of the present utility model is not limited only to this.
Embodiment:
As shown in Figure 1, the utility model comprises camera, multi-microphone array and the instantaneous speech power that links to each other with the multi-microphone array, be provided with wave beam between described multi-microphone array and the instantaneous speech power and form calculation element, camera forms calculation element by target locating set and wave beam and links to each other, and wave beam forms and is connected with wave beam formation processing unit, noise elimination apparatus between calculation element and the instantaneous speech power in turn.
For guaranteeing to need to check most complete monitor data under the extreme case, described instantaneous speech power is connected with holder, is used to store the HD video video recording that camera collection arrives; Described instantaneous speech power is connected with holder, is used for storaged voice.
The course of work of the present utility model is: the intended target on the image that target locating set photographs camera positions, this target locating set is made up of target range calculator and target direction calculator, wherein, the target range calculator can be according to known camera setting height(from bottom), luffing angle, suspicious object picture size and empirical data calculate the distance from camera of target, concrete computational methods are: the luffing angle of known camera, known camera setting height(from bottom) of while, according to the triangle principle, just can calculate the distance of target to camera, we also utilize the empirical data that one group of repetition test test draws simultaneously, result of calculation is revised, thereby the distance of calculating that is is more accurate; The target direction calculator can be according to the camera deviation angle, luffing angle and fixed setting calculate the angle of the relative camera calibration direction of target, it is supreme that camera is erected at The Cloud Terrace, so we can know the deviation angle of camera, according to the distance of target at image centre to centre heart point, we just can calculate the angle of target to fixed setting then.
Through behind the target localization, wave beam forms direction and the distance of calculation element according to target, utilizes improved MUSIC algorithm computation to go out the coefficient of needed directional wave beam; After coefficient calculations is finished, wave beam forms processing unit can utilize these coefficients, wave beam form processing unit be one based on the FIR filter, while is in conjunction with the sef-adapting filter group of ICA Blind Signal Separation algorithm, the coefficient that this bank of filters utilizes wave beam formation calculation element to calculate is initial value, the voice signal that the multi-microphone array is collected carries out filtering, the result of filtering separates through the audio signal that ICA Blind Signal Separation algorithm further mixes then, the voice signal that result's meeting after the separation and original multi-microphone array collect compares and obtains an error signal, utilize this error signal constantly to adjust filter coefficient again by improved NLMS algorithm, thereby finally export stable, the voice signal of destination object clearly.
At last, form the voice signal that processing unit is exported, can handle by noise elimination apparatus by wave beam.The purpose of noise elimination apparatus is in order further to remove the noise information in the voice signal.Noise elimination apparatus can judge whether voice signal is arranged in the current input signal, if do not have, then be judged to be noise, and accumulation and calculating noise spectrum (the spectrum here promptly is the energy value that time-domain signal is converted to each frequency behind the frequency-region signal), as judge that current demand signal is voice, and then utilize the spectrum of current demand signal to deduct the noise spectrum that historical accumulation is calculated, then the result is converted to gain coefficient, again this gain coefficient is used for primary speech signal, thereby reaches the purpose of eliminating noise; Noise elimination apparatus is connected to instantaneous speech power at last, and instantaneous speech power is loud speaker, earphone or network, is used to export voice.
As mentioned above, just can realize the utility model preferably.
Claims (9)
1. based on the directed spacing speech collecting system of multi-microphone array, comprise camera, multi-microphone array and the instantaneous speech power that links to each other with the multi-microphone array, it is characterized in that, be disposed with wave beam between described multi-microphone array and the instantaneous speech power and form calculation element and wave beam formation processing unit, camera forms calculation element by target locating set and wave beam and links to each other.
2. the directed spacing speech collecting system based on the multi-microphone array according to claim 1 is characterized in that, described wave beam forms between processing unit and the instantaneous speech power and is provided with noise elimination apparatus.
3. the directed spacing speech collecting system based on the multi-microphone array according to claim 1 is characterized in that described camera is connected with holder.
4. the directed spacing speech collecting system based on the multi-microphone array according to claim 1 is characterized in that described instantaneous speech power is connected with holder.
5. the directed spacing speech collecting system based on the multi-microphone array according to claim 1 is characterized in that described target locating set is made up of target range calculator and target direction calculator.
6. the directed spacing speech collecting system based on the multi-microphone array according to claim 1 is characterized in that, it is the wave beam coefficient calculator that described wave beam forms calculation element.
7. the directed spacing speech collecting system based on the multi-microphone array according to claim 1 is characterized in that, it is the sef-adapting filter group that described wave beam forms processing unit.
8. the directed spacing speech collecting system based on the multi-microphone array according to claim 1 is characterized in that the microphone array number of columns in the described multi-microphone array is at least 2.
9. the directed spacing speech collecting system based on the multi-microphone array according to claim 1 is characterized in that described instantaneous speech power is loud speaker, earphone or network.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN2010201291853U CN201639751U (en) | 2010-03-11 | 2010-03-11 | Fixed-direction and fixed-distance voice collecting system based on multi-microphone array |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN2010201291853U CN201639751U (en) | 2010-03-11 | 2010-03-11 | Fixed-direction and fixed-distance voice collecting system based on multi-microphone array |
Publications (1)
Publication Number | Publication Date |
---|---|
CN201639751U true CN201639751U (en) | 2010-11-17 |
Family
ID=43084258
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN2010201291853U Expired - Fee Related CN201639751U (en) | 2010-03-11 | 2010-03-11 | Fixed-direction and fixed-distance voice collecting system based on multi-microphone array |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN201639751U (en) |
Cited By (19)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103680512A (en) * | 2012-09-03 | 2014-03-26 | 现代摩比斯株式会社 | Speech recognition level improving system and method for vehicle array microphone |
CN103716724A (en) * | 2012-09-28 | 2014-04-09 | 联想(北京)有限公司 | Sound collection method and electronic device |
CN104301664A (en) * | 2013-07-19 | 2015-01-21 | 松下电器产业株式会社 | Directivity control system, directivity control method, sound collection system and sound collection control method |
CN104777450A (en) * | 2015-04-29 | 2015-07-15 | 西安电子科技大学 | Two-stage MUSIC microphone array direction finding method |
WO2015168901A1 (en) * | 2014-05-08 | 2015-11-12 | Intel Corporation | Audio signal beam forming |
CN106328156A (en) * | 2016-08-22 | 2017-01-11 | 华南理工大学 | Microphone array voice reinforcing system and microphone array voice reinforcing method with combination of audio information and video information |
CN106887236A (en) * | 2015-12-16 | 2017-06-23 | 宁波桑德纳电子科技有限公司 | A kind of remote speech harvester of sound image combined positioning |
CN103873978B (en) * | 2012-12-17 | 2017-06-27 | 联想(北京)有限公司 | The method and electronic equipment of information processing |
CN106982316A (en) * | 2017-05-03 | 2017-07-25 | 张德明 | A kind of sound field collection based on ball base camera renders supervising device |
CN107153796A (en) * | 2017-03-30 | 2017-09-12 | 联想(北京)有限公司 | A kind of information processing method and electronic equipment |
CN107290969A (en) * | 2016-03-30 | 2017-10-24 | 芋头科技(杭州)有限公司 | A kind of distributed sound control system |
CN107343094A (en) * | 2017-06-30 | 2017-11-10 | 联想(北京)有限公司 | A kind of processing method and electronic equipment |
CN103873998B (en) * | 2012-12-17 | 2018-07-03 | 联想(北京)有限公司 | Electronic equipment and sound collection method |
CN108712694A (en) * | 2018-05-18 | 2018-10-26 | 四川湖山电器股份有限公司 | The method for forming directive property in space using the evenly distributed microphone of planar array |
CN109286790A (en) * | 2018-10-17 | 2019-01-29 | 杭州兆华电子有限公司 | A kind of orientation monitoring system and its monitor method based on auditory localization |
CN109605373A (en) * | 2018-12-21 | 2019-04-12 | 重庆大学 | Voice interactive method based on robot |
CN111785032A (en) * | 2020-06-22 | 2020-10-16 | 杭州海康威视数字技术股份有限公司 | Audio signal positioning method and device, electronic equipment and intelligent traffic system |
WO2020238954A1 (en) * | 2019-05-31 | 2020-12-03 | 华为技术有限公司 | Apnea monitoring method and device |
CN113640805A (en) * | 2021-07-16 | 2021-11-12 | 中国科学院声学研究所 | Omnibearing automatic monitoring and tracking device and method for acoustic image of field birds |
-
2010
- 2010-03-11 CN CN2010201291853U patent/CN201639751U/en not_active Expired - Fee Related
Cited By (26)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103680512B (en) * | 2012-09-03 | 2018-02-27 | 现代摩比斯株式会社 | The horizontal lifting system of speech recognition and its method of vehicle array microphone |
CN103680512A (en) * | 2012-09-03 | 2014-03-26 | 现代摩比斯株式会社 | Speech recognition level improving system and method for vehicle array microphone |
CN103716724B (en) * | 2012-09-28 | 2017-05-24 | 联想(北京)有限公司 | Sound collection method and electronic device |
CN103716724A (en) * | 2012-09-28 | 2014-04-09 | 联想(北京)有限公司 | Sound collection method and electronic device |
CN103873978B (en) * | 2012-12-17 | 2017-06-27 | 联想(北京)有限公司 | The method and electronic equipment of information processing |
CN103873998B (en) * | 2012-12-17 | 2018-07-03 | 联想(北京)有限公司 | Electronic equipment and sound collection method |
CN104301664A (en) * | 2013-07-19 | 2015-01-21 | 松下电器产业株式会社 | Directivity control system, directivity control method, sound collection system and sound collection control method |
CN104301664B (en) * | 2013-07-19 | 2018-01-23 | 松下电器产业株式会社 | Directive property control system, directivity control method, radio system and radio reception control method |
WO2015168901A1 (en) * | 2014-05-08 | 2015-11-12 | Intel Corporation | Audio signal beam forming |
US9913027B2 (en) | 2014-05-08 | 2018-03-06 | Intel Corporation | Audio signal beam forming |
CN104777450A (en) * | 2015-04-29 | 2015-07-15 | 西安电子科技大学 | Two-stage MUSIC microphone array direction finding method |
CN106887236A (en) * | 2015-12-16 | 2017-06-23 | 宁波桑德纳电子科技有限公司 | A kind of remote speech harvester of sound image combined positioning |
CN107290969A (en) * | 2016-03-30 | 2017-10-24 | 芋头科技(杭州)有限公司 | A kind of distributed sound control system |
CN106328156A (en) * | 2016-08-22 | 2017-01-11 | 华南理工大学 | Microphone array voice reinforcing system and microphone array voice reinforcing method with combination of audio information and video information |
CN106328156B (en) * | 2016-08-22 | 2020-02-18 | 华南理工大学 | Audio and video information fusion microphone array voice enhancement system and method |
CN107153796A (en) * | 2017-03-30 | 2017-09-12 | 联想(北京)有限公司 | A kind of information processing method and electronic equipment |
CN107153796B (en) * | 2017-03-30 | 2020-08-25 | 联想(北京)有限公司 | Information processing method and electronic equipment |
CN106982316A (en) * | 2017-05-03 | 2017-07-25 | 张德明 | A kind of sound field collection based on ball base camera renders supervising device |
CN107343094A (en) * | 2017-06-30 | 2017-11-10 | 联想(北京)有限公司 | A kind of processing method and electronic equipment |
CN108712694A (en) * | 2018-05-18 | 2018-10-26 | 四川湖山电器股份有限公司 | The method for forming directive property in space using the evenly distributed microphone of planar array |
CN109286790A (en) * | 2018-10-17 | 2019-01-29 | 杭州兆华电子有限公司 | A kind of orientation monitoring system and its monitor method based on auditory localization |
CN109286790B (en) * | 2018-10-17 | 2021-05-07 | 杭州兆华电子有限公司 | Directional monitoring system based on sound source positioning and monitoring method thereof |
CN109605373A (en) * | 2018-12-21 | 2019-04-12 | 重庆大学 | Voice interactive method based on robot |
WO2020238954A1 (en) * | 2019-05-31 | 2020-12-03 | 华为技术有限公司 | Apnea monitoring method and device |
CN111785032A (en) * | 2020-06-22 | 2020-10-16 | 杭州海康威视数字技术股份有限公司 | Audio signal positioning method and device, electronic equipment and intelligent traffic system |
CN113640805A (en) * | 2021-07-16 | 2021-11-12 | 中国科学院声学研究所 | Omnibearing automatic monitoring and tracking device and method for acoustic image of field birds |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN201639751U (en) | Fixed-direction and fixed-distance voice collecting system based on multi-microphone array | |
CN110010147A (en) | A kind of method and system of Microphone Array Speech enhancing | |
CN106846803B (en) | Traffic event detection device and method based on audio frequency | |
US9439019B2 (en) | Sound signal processing method and apparatus | |
CN103873977B (en) | Recording system and its implementation based on multi-microphone array beam forming | |
EP2725819B1 (en) | Method and implementation apparatus for intelligently controlling volume of electronic device | |
CN104053107B (en) | One kind is for Sound seperation and localization method under noise circumstance | |
US9226070B2 (en) | Directional sound source filtering apparatus using microphone array and control method thereof | |
CN108109617B (en) | Remote pickup method | |
US20170365255A1 (en) | Far field automatic speech recognition pre-processing | |
CN111044973B (en) | MVDR target sound source directional pickup method for microphone matrix | |
CN107919133A (en) | For the speech-enhancement system and sound enhancement method of destination object | |
CN206349145U (en) | Audio signal processing apparatus | |
CN101807404B (en) | Pretreatment system for strengthening directional voice at front end of electronic cochlear implant | |
WO2015196760A1 (en) | Microphone array speech detection method and device | |
CN104053088A (en) | Microphone array adjustment method, microphone array and electronic device | |
JP6211716B2 (en) | Method, apparatus, and hands-free call device for improving call quality of hands-free call device | |
CN111429939B (en) | Sound signal separation method of double sound sources and pickup | |
CN205139359U (en) | Based on indoor sound localization system of FPGA microphone array | |
CN103856871A (en) | Device and method for collecting multi-channel sound through microphone array | |
WO2015106401A1 (en) | Speech processing method and speech processing apparatus | |
CN110875056B (en) | Speech transcription device, system, method and electronic device | |
JP6977448B2 (en) | Device control device, device control program, device control method, dialogue device, and communication system | |
EP3222053B1 (en) | Surround sound recording for mobile devices | |
CN107274910A (en) | The supervising device and audio/video linkage method of a kind of audio/video linkage |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20101117 Termination date: 20140311 |