CN202068548U - Three-dimensional space high-definition voice acquisition subsystem of video sensing system - Google Patents

Three-dimensional space high-definition voice acquisition subsystem of video sensing system Download PDF

Info

Publication number
CN202068548U
CN202068548U CN2011200124345U CN201120012434U CN202068548U CN 202068548 U CN202068548 U CN 202068548U CN 2011200124345 U CN2011200124345 U CN 2011200124345U CN 201120012434 U CN201120012434 U CN 201120012434U CN 202068548 U CN202068548 U CN 202068548U
Authority
CN
China
Prior art keywords
audio
input
microphone
output
signal
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Lifetime
Application number
CN2011200124345U
Other languages
Chinese (zh)
Inventor
穆科明
王兴国
方汝松
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
NANJING JIEMAI VIDEO TECHNOLOGY CO., LTD.
Original Assignee
穆科明
王兴国
方汝松
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 穆科明, 王兴国, 方汝松 filed Critical 穆科明
Priority to CN2011200124345U priority Critical patent/CN202068548U/en
Application granted granted Critical
Publication of CN202068548U publication Critical patent/CN202068548U/en
Anticipated expiration legal-status Critical
Expired - Lifetime legal-status Critical Current

Links

Images

Landscapes

  • Circuit For Audible Band Transducer (AREA)

Abstract

The utility model discloses a three-dimensional space high-definition voice acquisition subsystem of a video sensing system. The structure of the subsystem comprises that the output of each microphone is connected with the input of a microphone programmable gain amplifier, the signal output end of the microphone programmable gain amplifier is connected with the input of an analog-to-digital conversion module (A/D), the output of the analog-to-digital conversion module (A/D) is connected with the input of an audio preprocessing, the output of the audio preprocessing is connected with the input end of a three-dimensional space high-definition voice processing system, the output end of the three-dimensional space high-definition voice processing system is connected with the input interface of an audio and video signal processor, the output interface of the audio and video signal processor is connected with the input interface of an audio decompression, and the output interface of the audio decompression is connected with a standard audio input interface. Advantages comprise that the subsystem employs a microphone array to carry out acquisition, identification, and confirmation to a surrounding environment sound source, thus sound reecho is reduced; self-setting voice amplification control is realized so as to carry out processing to wideband voice signals; and quality and effect achieve optimum.

Description

Video sensor-based system three dimensions high definition voice collecting subsystem
Background technology
The utility model relates to a kind of video sensor-based system three dimensions high definition voice collecting subsystem, belong to safety monitoring system, intelligent transportation system, HD video conference system, high definition medical video diagnostic device, application technology technical fields such as long-distance educational system.
Background technology
In common rig camera, video conferencing system, Medical Devices, its audio frequency generally all adopts single microphone to gather voice signal at present.Its principle as shown in Figure 1.Language in the conventional video sensor-based system adopts module to be made up of 4 parts basically: 1. the audio signal sample front end is general adopts single microphone to finish.Microphone changes into analog signal with voice signal, in the power amplifier connecting system.2. analog digital conversion A/D module analog signal conversion that the audio signal sample front end is transmitted is a digital signal, is transferred to the audio frequency pre-processing module then.3. the audio frequency pre-treatment mainly is that the digital signal of input is carried out processing such as noise reduction to input signal in numeric field.4. audio compression is that after treatment voice signal form is on request compressed, such as MP3.Being input to audio video processor then stores and transmits.In traditional video sensor-based system, there are two serious problems in the audio collection subsystem: at first, owing to adopt single signal input device, the system requirements microphone is as far as possible near speech source, to obtain quality signal preferably.But in actual applications, being difficult to require the user is near microphone in a minute.Such as, the video camera that is used to monitor, often be placed in higher position or stash, monitored object is difficult near video camera.Do not have the good data source like this, the audio collection front end just can't obtain high-quality signal.Secondly, there is numerous audio signals some application scenario, as outdoor, and exhibit halls etc.For traditional audio signal sample system is can't the identifying purpose signal source, can only mechanically all signals collecting be come in, and compresses.The voice signal of Huo Deing is very noisy like this, is difficult to obtain the signal of theme voice object, can not satisfy any situation subaudio frequency acquisition system and can both obtain best sound effect.
Summary of the invention
The utility model proposes a kind of video sensor-based system three dimensions high definition voice collecting subsystem, its purpose is intended to overcome the existing in prior technology defective, solves the audio-quality problems in the video sensor-based system.Utilize microphone array to realize the collection of high definition voice, the select target sound source is amplified and is handled targetedly in three dimensions.For noise, echo, etc. other non-target sound suppress and eliminate.Microphone array is one group of microphone that the position arranged in proximity is orderly, and microphone array utilizes sound wave to time difference of different microphones and obtain better directivity.
Technical solution of the present utility model: its feature comprises microphone array, the microphone programmable gain amplifier, analog-to-digital conversion module (A/D), the audio frequency pre-treatment, the three dimensions speech processing system, the audio-video signal processor, audio decompression and standard audio output interface, wherein the input of the output of each microphone and microphone programmable gain amplifier is joined, the input of the signal output part of microphone programmable gain amplifier and analog-to-digital conversion module (A/D) joins, the output of analog-to-digital conversion module (A/D) and the input of audio frequency pre-treatment join, the output of audio frequency pre-treatment and three dimensions high definition speech processing system input join, the input interface of three dimensions high definition speech processing system output and audio-video signal processor joins, the output interface of audio-video signal processor and the input interface of audio decompression join, and the output interface of audio decompression and standard audio input interface join.
Advantage of the present utility model: adopted microphone array, by of the collection of a plurality of microphones to the surrounding environment sound source.The echo signal source is discerned.Signal source through confirming has dedicated microphone that it is carried out signals collecting in the microphone array.Reducing sound echoes.Realize that voice amplify control from adjusting, and can handle wideband section voice signal.Three dimensions high definition speech collecting system adopts a plurality of microphones.Sound collection ability, quality and best results have been increased to different frequency range.
Description of drawings
Accompanying drawing 1 is the voice collecting subsystem structure schematic diagram in the conventional video sensor-based system.
Accompanying drawing 2 is video sensor-based system three dimensions high definition voice collecting subsystem structure figure.
Accompanying drawing 3 is structural representations of three dimensions speech processing system.
Embodiment
Contrast accompanying drawing 2, its structure comprises microphone array, the microphone programmable gain amplifier, analog-to-digital conversion module (A/D), the audio frequency pre-treatment, the three dimensions speech processing system, the audio-video signal processor, audio decompression and standard audio output interface, wherein the input of the output of each microphone and microphone programmable gain amplifier is joined, the input of the signal output part of microphone programmable gain amplifier and analog-to-digital conversion module (A/D) joins, the output of analog-to-digital conversion module (A/D) and the input of audio frequency pre-treatment join, the output of audio frequency pre-treatment and three dimensions high definition speech processing system input join, the input interface of three dimensions high definition speech processing system output and audio-video signal processor joins, the output interface of audio-video signal processor and the input interface of audio decompression join, and the output interface of audio decompression and standard audio input interface join.
Three dimensions high definition speech collecting system is to utilize microphone array to realize the collection of high definition voice in visual sensing system.It can be in three dimensions targetedly the select target sound source amplify and handle.For noise, echo, etc. other non-target sound suppress and eliminate.Microphone array is one group of microphone that the position arranged in proximity is orderly.Compare with the microphone that tradition is single, microphone array utilizes sound wave to time difference of different microphones and obtain better directivity.Three dimensions high definition speech collecting system has mainly been realized three key technologies: 1. the formation of wave beam, utilize the signal of different microphone inputs in the microphone array, microphone array can wait the microphone that is used for a high orientation, and the voice that can form a high orientation are pounced on and caught wave beam.The wave beam of microphone array can controlled definite object sound source direction.The search engine of microphone array can be searched for target sound source in real time and pouncing on of it caught positioning of beam in current position.The microphone array of this high directivity has reduced the noise of surrounding environment and entering of response signal to a great extent.2. the directivity of array because the noise of microphone array output and echo more much smaller than single microphone output, so also good than single microphone to the inhibition of steady noise.Pounce on the typical pattern of beam direction of catching such as the microphone array voice of a 1000Hz.This pattern is much better than the effect of the microphone of a high price, high-quality, super individual event.In the voice collecting process, the microphone array Control Software is searched for target sound source, and will pounce on and catch the direction of positioning of beam in target sound source.If target sound source is moved, pounces on and catch wave beam and can follow the tracks of sound source.This mechanism is equal to the microphone of two high directivities.A microphone is used for not stopping the input that each voice signal is tested in the scanning three-dimensional space.Another one is that voice are pounced on and caught microphone, and it is oriented to the sound source of descant matter, target sound source that Here it is.3. constant wave beamwidth, normal voice collecting work frequency range is that 200Hz is to 7000Hz.Wavelength fluctuation has 35 times.Be difficult to find a constant microphone or a microphone array of frequency range to satisfy top whole working band like this.In typical working environment, the noise of the overwhelming majority generally is lower than 750Hz all in the lower part of frequency ratio but fortunately.Also be present in low-frequency range and echo, exist hardly for the frequency range that is higher than 4000Hz.The microphone array of such linearity can provide 300Hz constant wave beam frequency range to 5000Hz, the working frequency range of the basic voice collecting that satisfies.Adopt a plurality of microphones to form microphone arrays, can be automatically, recognition objective speech source effectively, and can dynamically lock, follow the tracks of this sound source.We have adopted 4 displays that microphone combination forms in visual sensing system.By the real-time control to the preposition amplification of each microphone, analog-to-digital conversion (A/D), audio frequency pre-treatment, system finally obtains the voice signal of high definition.This real-time control is to be finished by the three dimensions speech processing system.
Contrast accompanying drawing 3, the structure of three dimensions speech processing system comprises microphone programmable gain amplifier, analog-to-digital conversion module, high definition voice controller and standard audio output interface, wherein the signal output part of microphone programmable gain amplifier joins by the digital filter signal input part in A/D analog-to-digital conversion module and the high definition voice controller, and parameter programmable I IR filter signal output and standard audio output interface in the high definition voice controller join.
The structure of high definition voice controller comprises digital filter, 5 band equalizer, central processing unit, high pass filter able to programme, automatic gain controller, parameter programmable I IR filter, wherein the signal input part of the signal output part of digital filter and 5 band equalizer joins, first signal output part of 5 band equalizer and first signal input part of automatic gain controller join, the secondary signal output of 5 band equalizer and first signal input part of high pass filter able to programme join, first signal output part of central processing unit and the secondary signal input of high pass filter able to programme join, the secondary signal output of central processing unit and the secondary signal input of automatic gain controller join, first signal input part of the 3rd signal output part of central processing unit and parameter programmable I IR filter joins, and the secondary signal input of the signal output part of high pass filter able to programme and parameter programmable I IR filter joins.
1.) microphone programmable gain amplifier
The microphone programmable gain amplifier comprises programmable microphone gain amplifier and fixed gain amplifier.By the effect of automatic gain controller, it is constant that the microphone programmable gain amplifier keeps outputing to the analog voice signal of analog-to-digital conversion module.
2.) analog-to-digital conversion module
The analog-to-digital conversion control module adopts many bits higher order signal sampling architectures.It supports this employing frequency, is the sample frequency of 8ks/s to the high definition voice signal from the received pronunciation sample frequency, 48ks/s.
3.) high definition voice controller
The high definition voice controller is made up of 6 modules:
The ■ digital filter
The digital decimation of employing Sigma-Delta structure, interpolation filter can be at the voice digital signal of sample frequency 8ks/s to output high definition between the 48ks/s.It can suppress extraordinary noise, such as wind noise of outdoor environment etc.
■ 5 band equalizer
The volume relative size that adopts the dynamic volume equalizer to regulate indivedual wave bands, voice are sounded more 3D effect.5 band equalizer by to the sound (20Hz-16KHz) of different frequency by the center cut-off frequency to signal carry out-12dB is to gain or the inhibition of+12dB.Voice signal by equalizer is clear, and is melodious, not thin.
■ high pass filter able to programme
The ■ high pass filter can pass through high-frequency signal.Its attenuation amplitude is lower than the cut-off frequency of general filter.The attenuation of each frequency is able to programme.The high pass filter of native system is supported two kinds of patterns: cut-off frequency is at first order IIR filtering device and the programmable bivalent high-pass filter of cut-off frequency of 3.7Hz.Parameter programmable I IR filter
Iir filter is used for eliminating the narrow-band noise in the assigned frequency voice signal, not as the noise jamming of 50Hz-60Hz.Iir filter has different centre frequencies and bandwidth settings.These settings all are to finish by programmable parameter setting.
The ■ automatic gain controller
Automatic gain controller is controlled the programmable microphone gain amplifier in real time according to the input signal after being exaggerated.Contain a digital peak value detector in the automatic gain controller, the time compares input signal and the threshold value that configures.
The ■ central processing unit
Central processing unit is regulated each module, parameter in real time according to the output of each module and presetting of system.
4.) standard audio output
The high definition voice controller adopts the voice output interface of standard.The agreement of its dateout is able to programme.Can support I2S, DSP Mode, MSB-First L and MSB-First R etc.It may operate in holotype or under pattern.

Claims (3)

1. video sensor-based system three dimensions high definition voice collecting subsystem, its feature comprises microphone array, the microphone programmable gain amplifier, analog-to-digital conversion module A/D, the audio frequency pre-treatment, the three dimensions speech processing system, the audio-video signal processor, audio decompression and standard audio output interface, wherein the input of the output of each microphone and microphone programmable gain amplifier is joined, the input of the signal output part of microphone programmable gain amplifier and analog-to-digital conversion module A/D joins, the input of the output of analog-to-digital conversion module A/D and audio frequency pre-treatment joins, the output of audio frequency pre-treatment and three dimensions high definition speech processing system input join, the input interface of three dimensions high definition speech processing system output and audio-video signal processor joins, the output interface of audio-video signal processor and the input interface of audio decompression join, and the output interface of audio decompression and standard audio input interface join.
2. video sensor-based system three dimensions high definition voice collecting subsystem according to claim 1, the structure that it is characterized in that the three dimensions speech processing system comprises microphone programmable gain amplifier, analog-to-digital conversion module, high definition voice controller and standard audio output interface, wherein the signal output part of microphone programmable gain amplifier joins by the digital filter signal input part in A/D analog-to-digital conversion module and the high definition voice controller, and parameter programmable I IR filter signal output and standard audio output interface in the high definition voice controller join.
3. video sensor-based system three dimensions high definition voice collecting subsystem according to claim 2, the structure that it is characterized in that the high definition voice controller comprises digital filter, 5 band equalizer, central processing unit, high pass filter able to programme, automatic gain controller, parameter programmable I IR filter, wherein the signal input part of the signal output part of digital filter and 5 band equalizer joins, first signal output part of 5 band equalizer and first signal input part of automatic gain controller join, the secondary signal output of 5 band equalizer and first signal input part of high pass filter able to programme join, first signal output part of central processing unit and the secondary signal input of high pass filter able to programme join, the secondary signal output of central processing unit and the secondary signal input of automatic gain controller join, first signal input part of the 3rd signal output part of central processing unit and parameter programmable I IR filter joins, and the secondary signal input of the signal output part of high pass filter able to programme and parameter programmable I IR filter joins.
CN2011200124345U 2011-01-17 2011-01-17 Three-dimensional space high-definition voice acquisition subsystem of video sensing system Expired - Lifetime CN202068548U (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN2011200124345U CN202068548U (en) 2011-01-17 2011-01-17 Three-dimensional space high-definition voice acquisition subsystem of video sensing system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN2011200124345U CN202068548U (en) 2011-01-17 2011-01-17 Three-dimensional space high-definition voice acquisition subsystem of video sensing system

Publications (1)

Publication Number Publication Date
CN202068548U true CN202068548U (en) 2011-12-07

Family

ID=45062396

Family Applications (1)

Application Number Title Priority Date Filing Date
CN2011200124345U Expired - Lifetime CN202068548U (en) 2011-01-17 2011-01-17 Three-dimensional space high-definition voice acquisition subsystem of video sensing system

Country Status (1)

Country Link
CN (1) CN202068548U (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105228057A (en) * 2015-10-27 2016-01-06 无锡中感微电子股份有限公司 The voicefrequency circuit improved
CN107277690A (en) * 2017-08-02 2017-10-20 北京地平线信息技术有限公司 Sound processing method, device and electronic equipment
CN110326309A (en) * 2017-09-01 2019-10-11 深圳市台电实业有限公司 A kind of pick up facility and system

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105228057A (en) * 2015-10-27 2016-01-06 无锡中感微电子股份有限公司 The voicefrequency circuit improved
CN105228057B (en) * 2015-10-27 2019-01-22 无锡中感微电子股份有限公司 Improved voicefrequency circuit
CN107277690A (en) * 2017-08-02 2017-10-20 北京地平线信息技术有限公司 Sound processing method, device and electronic equipment
CN107277690B (en) * 2017-08-02 2020-07-24 北京地平线信息技术有限公司 Sound processing method and device and electronic equipment
CN110326309A (en) * 2017-09-01 2019-10-11 深圳市台电实业有限公司 A kind of pick up facility and system
CN110326309B (en) * 2017-09-01 2021-04-09 深圳市台电实业有限公司 Pickup equipment and system

Similar Documents

Publication Publication Date Title
CN106448722B (en) The way of recording, device and system
CN103873977B (en) Recording system and its implementation based on multi-microphone array beam forming
CN205249484U (en) Microphone linear array reinforcing directive property adapter
KR101519768B1 (en) Method, device and system for eliminating noises with multi-microphone array
CN201426153Y (en) Intelligent camera control system for video conference
CN108109617B (en) Remote pickup method
DE102019129330A1 (en) Conference system with a microphone array system and method for voice recording in a conference system
CN202068548U (en) Three-dimensional space high-definition voice acquisition subsystem of video sensing system
CN207869389U (en) A kind of voice de-noising sound pick-up based on Homogeneous Circular microphone array
CN205881451U (en) Device of making an uproar is removed to intelligence house
CN203193889U (en) Sound pick-up device based on microphone array voice noise reduction technology
CN104994456B (en) A kind of earphone and its method improving call tone quality
CN109104683B (en) Method and system for correcting phase measurement of double microphones
CN104185116B (en) A kind of method for automatically determining acoustically radiating emission mode
CN107274910A (en) The supervising device and audio/video linkage method of a kind of audio/video linkage
CN106340306A (en) Method and device for improving speech recognition degree
CN109192219A (en) The method for improving microphone array far field pickup based on keyword
CN109286790B (en) Directional monitoring system based on sound source positioning and monitoring method thereof
CN206077668U (en) A kind of Audio Processing control system for possessing feedback suppression and decrease of noise functions
CN105898649B (en) A kind of sound collector suitable under remote high-noise environment
CN110517704A (en) A kind of speech processing system based on microphone array beamforming algorithm
CN201708909U (en) Super-directivity microphone pickup processing device
CN107948870A (en) Portable audio noise reduction system based on stereo microphone array
DE102020117299A1 (en) Microphone array, conference system with microphone array and method for controlling a microphone array
CN204258950U (en) Sound identification location cloud platform camera system

Legal Events

Date Code Title Description
C14 Grant of patent or utility model
GR01 Patent grant
ASS Succession or assignment of patent right

Owner name: GM-INNOVATION INC.

Free format text: FORMER OWNER: MU KEMING

Effective date: 20130208

Free format text: FORMER OWNER: WANG XINGGUO FANG RUSONG

Effective date: 20130208

C41 Transfer of patent application or patent right or utility model
TR01 Transfer of patent right

Effective date of registration: 20130208

Address after: Guanghua Road, Baixia District Nanjing city Jiangsu province 210014 No. 1 Innovation Park Incubator Building 5 floor C District

Patentee after: NANJING JIEMAI VIDEO TECHNOLOGY CO., LTD.

Address before: Guanghua Road, Baixia District Nanjing city Jiangsu province 210014 No. 1 Baixia high-tech park 5 floor C District of Nanjing Jammah Video Science and Technology Co Ltd

Patentee before: Mu Keming

Patentee before: Wang Xingguo

Patentee before: Fang Rusong

CX01 Expiry of patent term

Granted publication date: 20111207

CX01 Expiry of patent term