CN103576126A - Four-channel array sound source positioning system based on neural network - Google Patents

Four-channel array sound source positioning system based on neural network Download PDF

Info

Publication number
CN103576126A
CN103576126A CN201210264336.XA CN201210264336A CN103576126A CN 103576126 A CN103576126 A CN 103576126A CN 201210264336 A CN201210264336 A CN 201210264336A CN 103576126 A CN103576126 A CN 103576126A
Authority
CN
China
Prior art keywords
sound source
neural network
time delay
array
sound
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201210264336.XA
Other languages
Chinese (zh)
Inventor
姜楠
赛音
傅洋
张超
Original Assignee
姜楠
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 姜楠 filed Critical 姜楠
Priority to CN201210264336.XA priority Critical patent/CN103576126A/en
Publication of CN103576126A publication Critical patent/CN103576126A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G01MEASURING; TESTING
    • G01SRADIO DIRECTION-FINDING; RADIO NAVIGATION; DETERMINING DISTANCE OR VELOCITY BY USE OF RADIO WAVES; LOCATING OR PRESENCE-DETECTING BY USE OF THE REFLECTION OR RERADIATION OF RADIO WAVES; ANALOGOUS ARRANGEMENTS USING OTHER WAVES
    • G01S5/00Position-fixing by co-ordinating two or more direction or position line determinations; Position-fixing by co-ordinating two or more distance determinations
    • G01S5/18Position-fixing by co-ordinating two or more direction or position line determinations; Position-fixing by co-ordinating two or more distance determinations using ultrasonic, sonic, or infrasonic waves
    • G01S5/22Position of source determined by co-ordinating a plurality of position lines defined by path-difference measurements

Landscapes

  • Physics & Mathematics (AREA)
  • Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Radar, Positioning & Navigation (AREA)
  • Remote Sensing (AREA)
  • Circuit For Audible Band Transducer (AREA)

Abstract

The invention discloses a sound positioning system design which can be used for human-computer interaction to replace keyboard input and is based on an array microphone. The sound source positioning technology of a microphone array refers to an array consisting of a plurality of microphones which are arranged according to a certain geometric structure. The same sound source arrives at different microphones at different time. Time delay when the sound source arrives at the different microphones is collected so as to calculate the position of the sound source and represent different function commands; an upper computer sends a corresponding action response to realize human-computer interaction. Sounds produced when fingers rap a desktop are collected; on the basis of a classic time delay estimation algorithm, a time delay estimation result is processed in combination with a BP neural network, a positioning position is determined, and an input key value, namely the keys the fingers click on, is obtained. As a result, a traditional PC keyboard can be replaced to some extent, and human-computer interaction of command input is realized.

Description

Four-way array sonic location system based on neural network
Technical field
The present invention relates to digital signal processing, Embedded System Design etc., relate in particular to multi channel signals source location technology.
Background technology
In the past few decades, human-computer interaction technology makes rapid progress, especially along with the universal of automatic electronic equipment with extend, and the fast development of having brought exponential type to the exploitation of built-in human-machine interaction mode.How by multiple input-output device and computing machine, to be the important content of multimedia technology research alternately.Development acoustic keyboard and acoustics input technology contribute to promote the progress of computer entry device, follow new technology continue to optimize and perfect, the low cost of commercialization input equipment, the environmental protection characteristic of low-power consumption and facilitate conveniently, will bring comparatively considerable economic benefit and social benefit.Auditory localization technology based on microphone array refers to by certain geometry arranges the array that several microphones of forming form.It is different that same sound source arrives time of different microphones.By gathering sound source, arrive the time delay of different microphones, calculate sound generation source position, characterize different command functions, it is corresponding that host computer sends corresponding action, to realize man-machine interaction.
System of the present invention has realized a kind of like this sound localization model based on array microphone, and on classical Time Delay Estimation Algorithms basis, the result of estimating in conjunction with BP Processing with Neural Network time delay, determines position location, realizes the human-computer interaction function of order input.
Summary of the invention
Technical matters to be solved by this invention is: under lower sampling rate, classical signals disposal route, for being positioned with very large error, can not realize the interpersonal mutual of dexterity.Under higher sampling rate, the data stream that signal is processed is huge, computing power is had relatively high expectations simultaneously, and the uninterrupted running cost of high load capacity is higher, is not suitable for input equipment.In order not increase under the prerequisite of computational load and sampling rate, utilize signal processing technology, overcome the factors such as noise, low-power consumption and obtain meticulousr sound source localization as much as possible, realize order input function, replace conventional keyboard as man-machine interaction mode, we study and use synchronized sampling fidelity, and time delay is estimated and BP nerual network technique is realized this system.
The present invention is for solving above technical matters, and the technical scheme adopting is: take data acquisition and sound fidelity hardware circuit as basis, by multichannel collecting, arrive sound, voice signal obtains Multidimensional numerical after analog to digital conversion, and then calculates sound source location.The signal that knocks tone signal and be a kind of similar shock pulse has compared with steep front when knocking signal while occurring, and belongs to non-stationary signal.Signal is produced by acoustical vibration, therefore generate and fluctuation knocking signal while occurring, amplitude strengthens rapidly, starts decay after generally experiencing 2 to 3 peak values, and the average velocity that sound is propagated in air is estimated as 340m/s conventionally.The range difference of 0.01m can cause the mistiming of 2.94 * 10-5s.And middling speed digital acquisition circuit can reach the frequency acquisition of 44Khz-200KHz, guaranteed to collect comparatively exactly the relative time that sound arrives different array element points, therefore calculate phase differential, just can reflect truly the relation between multiple signals, thereby calculating sound source position, obtain corresponding key assignments, realize the meaning of input equipment.
Ideal situation, desktop infinity and isotropy, in array, each array element is not exist under the assumed condition of impact of the factors such as passage is inconsistent, mutual coupling.But in actual engineering application, various errors are inevitable.The error of microphone array, the performance that is mainly reflected in sonic transducer is inconsistent, array element distance does not strictly equate the array structure error that causes and each array element interchannel amplitude, the phase error of whole microphone array data acquisition system (DAS).Desktop situation is complicated, and not of uniform size, shape differs, and the fine and close loose situation of wooden structures is different, the uncertain stack of multiple transmitted wave.Tradition Time Delay Estimation Algorithms runs into very large challenge on registration, and the present invention, in conjunction with neural network model, has effectively solved the universal applicability problem of algorithm.
Described hardware circuit design comprises acoustic pickup array, signal synchronization adjustment circuit, and signal fidelity Circuit tuning, multi-channel data acquisition board, PCI input is comfortable
Described Software for Design comprises, discrete signal processing, and denoising wave filter, broad sense cross correlation function method time delay is estimated, BP neural network.
Beneficial effect of the present invention is as follows:
The sound localization key assignments position that the present invention uses finger tapping desktop to produce, replaces conventional keyboard, and environmental protection is convenient.
The present invention is by adopting above-mentioned hardware circuit design, can be synchronously and fidelity collection multi-channel sound signal.
The present invention, by adopting above-mentioned software algorithm to design, can realize the location of sound signal source, for replacing keyboard identification key assignments.
Accompanying drawing explanation
Fig. 1 is overall system diagram of the present invention
Fig. 2 is sampled voice microphone distribution array figure of the present invention
Fig. 3 is hardware circuit diagram of the present invention
Fig. 4 is that hardware actual acquisition of the present invention is to the signal of Multi-path synchronous fidelity
Fig. 5 is the Time Delay Estimation Algorithms process flow diagram adopting in the present invention
Fig. 6 is the neural network algorithm network design figure adopting in the present invention
Fig. 7 is test result of the present invention and accuracy rate situation
Embodiment
Below in conjunction with drawings and the specific embodiments, the present invention is described in further details.
The present invention is based on the sampling of acoustic pickup array, and the synchronous fidelity of circuit is processed, and Data Acquisition Card carries out analog to digital conversion, by obtained a cover system of sound source position information by software algorithm processing signals, as shown in Figure 1.
Whole system mainly realizes by hardware and software.
Hardware circuit:
Main target is to gather near field, multichannel broadband sound signal (as Fig. 3).For the realization of following model provides physical basis.Wherein emphasis lays particular emphasis on the raising of multi-channel synchronous and two aspect performances of signal phase fidelity.Lower meter of the signals collecting of sensor is passed to sound wave in realization, and through fidelity circuit, synchronous multichannel collecting also amplifies, and then carries out passing to host computer after digital-to-analog conversion.
Wherein the distribution of acoustic pickup adopts linear array to distribute (as Fig. 2), conveniently in different occasions, moves and installation system.
Software section:
This algorithm process, input signal is for gathering desktop knocking as the hyperchannel microphone array of describing in Fig. 4
The key assignments knocking when design object is input signal collection is consistent with the key assignments that calculates output, is divided into two basic steps:
1. occurrence positions → window checking and debugging → band-pass filter → ask for cross-power spectrum → dimensionality reduction to neural network input sample is knocked in the intercepting of multichannel collecting signal → windowing
2. → create neural network (target output Chu Shiquanzhi Shen Jingyuangeshuo network Ceng Shuo Chuan Dihanshuo Xue Xisushuai performance function) → training network → network test → identification to export
Wherein, neural network creates and training process only calculates when system is moved for the first time, after this, preserves this neural network model parameter, and directly computational grid recognition result, guarantees lower operand, to realize system real-time response.
The software algorithm first that the present invention uses is broad sense cross correlation function method (GCC, Generalized Cross-Correlation), in the method for estimating, is most widely used in classical time delay.Broad sense cross correlation function method is by asking the cross-power spectrum between two signals, and in frequency domain, give certain weighting and suppress noise, in change, arrive time domain, thereby obtain the cross correlation function between two signals, relative time delay between corresponding two signals of peak of this cross correlation function, as shown in Figure 5.Via processing, tentatively obtain delay inequality, as the data of next stage neural network.
The software algorithm second portion that the present invention uses is the multilayer feedforward neural network based on error backpropagation algorithm, and BP neural network is the multilayer feedforward neural network based on error backpropagation algorithm (BP algorithm).That D.E.Rumelhart and J.L.McCelland and research group thereof studied and designed in 1986.BP algorithm is current most widely used Learning Algorithm, has nearly 90% Application of Neural Network based on BP algorithm.The popularization of WINDROW-HOFF algorithm in multilayer feedforward neural network.The adjustment of weights and threshold value adopts error back propagation, and carries out along the negative gradient direction of error change, and pattern is saequential transmission broadcast with error Back-Propagation and intersected and carry out, and finally makes network error reach minimal value or minimum value.
Error Tolerance wherein, stronger adaptability makes on the nonlinearity errons such as resolution system, environment, there is positive effect, and input sample is had to very strong recognition and classification ability.Adopt two layers of BP network hidden layer to adopt S type transport functions herein, output layer also adopts logarithm S type transport function, to limit output area, input dimension by classical time delay, estimated to sequence signature vector dimension size determine.The neuron number of output layer is determined according to the key assignments number that will identify.Be that L-M optimized algorithm is trained, this is a kind of batch mode algorithm, and after all inputs are all submitted, network is just updated.It has used Levenberg-Marquardt optimization method, thereby makes learning time shorter.The mse function of acquiescence is done Performance Evaluation function, and the square error between the network of usining output and target output t is as Performance Evaluation standard.Network structure as shown in Figure 6.
Finally, via hardware and software designing two portions structure, recognition result is: 45 groups of samples, and 1 group of failure, 44 groups of successes, discrimination 97.7%, as shown in Figure 7.

Claims (3)

1. the four-way array sonic location system based on neural network, is characterized in that: use is pointed the position difference of knocking desktop and represented different key assignments key mappings, replaces the input mode of conventional keyboard.By multichannel collecting knocking sound, via the estimation of broad sense cross correlation function time delay and neural computing, obtain sound source position.
2. multichannel collecting system according to claim 1, is characterized in that: synchronous acquisition multi channel signals, amplification, fidelity audio system.
3. the auditory localization algorithm that broad sense cross correlation function according to claim 1 is combined with neural network, it is characterized in that, under middle low sampling rate condition, calculate accurately sound source position, opposing noise, the factors such as echo reflection, with test repeatedly under objective environment condition, discrimination reaches 95%.
CN201210264336.XA 2012-07-27 2012-07-27 Four-channel array sound source positioning system based on neural network Pending CN103576126A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201210264336.XA CN103576126A (en) 2012-07-27 2012-07-27 Four-channel array sound source positioning system based on neural network

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201210264336.XA CN103576126A (en) 2012-07-27 2012-07-27 Four-channel array sound source positioning system based on neural network

Publications (1)

Publication Number Publication Date
CN103576126A true CN103576126A (en) 2014-02-12

Family

ID=50048308

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201210264336.XA Pending CN103576126A (en) 2012-07-27 2012-07-27 Four-channel array sound source positioning system based on neural network

Country Status (1)

Country Link
CN (1) CN103576126A (en)

Cited By (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104535965A (en) * 2014-12-29 2015-04-22 江苏科技大学 Parallelized sound source positioning system based on embedded GPU system and method
CN106899302A (en) * 2015-12-17 2017-06-27 亚德诺半导体集团 The adaptive digital quantizing noise of MADS ADCS eliminates wave filter
CN107195305A (en) * 2017-07-21 2017-09-22 合肥联宝信息技术有限公司 A kind of information processing method and electronic equipment
CN107452402A (en) * 2016-07-05 2017-12-08 深圳大学 The system and method for content is tapped using voice signal detection keyboard
CN107507625A (en) * 2016-06-14 2017-12-22 讯飞智元信息科技有限公司 Sound source distance determines method and device
CN107782977A (en) * 2017-08-31 2018-03-09 苏州知声声学科技有限公司 Multiple usb data capture card input signal Time delay measurement devices and measuring method
CN109686381A (en) * 2017-10-19 2019-04-26 恩智浦有限公司 Signal processor and correlation technique for signal enhancing
CN111240633A (en) * 2019-08-06 2020-06-05 杭州爱华仪器有限公司 Giant sound array system based on FPGA and Ethernet transmission and control method thereof
CN111239687A (en) * 2020-01-17 2020-06-05 浙江理工大学 Sound source positioning method and system based on deep neural network
CN113702719A (en) * 2021-08-03 2021-11-26 北京科技大学 Broadband near-field electromagnetic positioning method and device based on neural network

Cited By (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104535965A (en) * 2014-12-29 2015-04-22 江苏科技大学 Parallelized sound source positioning system based on embedded GPU system and method
CN106899302A (en) * 2015-12-17 2017-06-27 亚德诺半导体集团 The adaptive digital quantizing noise of MADS ADCS eliminates wave filter
CN107507625A (en) * 2016-06-14 2017-12-22 讯飞智元信息科技有限公司 Sound source distance determines method and device
CN107452402B (en) * 2016-07-05 2019-02-12 深圳大学 The system and method for content is tapped using voice signal detection keyboard
CN107452402A (en) * 2016-07-05 2017-12-08 深圳大学 The system and method for content is tapped using voice signal detection keyboard
WO2018006797A1 (en) * 2016-07-05 2018-01-11 深圳大学 System and method for detecting keyboard pressing content by using acoustic signal
CN107195305A (en) * 2017-07-21 2017-09-22 合肥联宝信息技术有限公司 A kind of information processing method and electronic equipment
CN107782977A (en) * 2017-08-31 2018-03-09 苏州知声声学科技有限公司 Multiple usb data capture card input signal Time delay measurement devices and measuring method
CN109686381A (en) * 2017-10-19 2019-04-26 恩智浦有限公司 Signal processor and correlation technique for signal enhancing
CN109686381B (en) * 2017-10-19 2024-01-19 汇顶科技(香港)有限公司 Signal processor for signal enhancement and related method
CN111240633A (en) * 2019-08-06 2020-06-05 杭州爱华仪器有限公司 Giant sound array system based on FPGA and Ethernet transmission and control method thereof
CN111239687A (en) * 2020-01-17 2020-06-05 浙江理工大学 Sound source positioning method and system based on deep neural network
CN111239687B (en) * 2020-01-17 2021-12-14 浙江理工大学 Sound source positioning method and system based on deep neural network
CN113702719A (en) * 2021-08-03 2021-11-26 北京科技大学 Broadband near-field electromagnetic positioning method and device based on neural network

Similar Documents

Publication Publication Date Title
CN103576126A (en) Four-channel array sound source positioning system based on neural network
CN109559758B (en) Method for converting texture image into tactile signal based on deep learning
CN108182418B (en) Keystroke identification method based on multi-dimensional sound wave characteristics
CN107526437A (en) A kind of gesture identification method based on Audio Doppler characteristic quantification
CN102163427A (en) Method for detecting audio exceptional event based on environmental model
CN107450724A (en) A kind of gesture identification method and system based on dual-channel audio Doppler effect
CN108109617A (en) A kind of remote pickup method
WO2017092213A1 (en) Methods, systems, and media for recognition of user interaction based on acoustic signals
Küçükbay et al. Audio-based event detection in office live environments using optimized MFCC-SVM approach
CN110031827B (en) Gesture recognition method based on ultrasonic ranging principle
CN109584904B (en) Video-song audio-song name recognition modeling method applied to basic music video-song education
CN105976827A (en) Integrated-learning-based indoor sound source positioning method
CN106255027A (en) The tonequality Small Enclosure appraisal procedure of a kind of non-linear audio system and system
Al-Hattab et al. Rethinking environmental sound classification using convolutional neural networks: optimized parameter tuning of single feature extraction
CN112394324A (en) Microphone array-based remote sound source positioning method and system
CN102509548B (en) Audio indexing method based on multi-distance sound sensor
CN112397090B (en) Real-time sound classification method and system based on FPGA
US20070076899A1 (en) Audio collecting device by audio input matrix
CN116027911B (en) Non-contact handwriting input recognition method based on audio signal
CN110136741B (en) Single-channel speech enhancement method based on multi-scale context
Watcharasupat et al. Improving Polyphonic Sound Event Detection on Multichannel Recordings with the S {\o} rensen-Dice Coefficient Loss and Transfer Learning
CN113314127B (en) Bird song identification method, system, computer equipment and medium based on space orientation
CN104766610A (en) Voice recognition system and method based on vibration
CN110784788A (en) Gesture recognition method based on microphone
Ma et al. Acoustic scene classification based on binaural deep scattering spectra with neural network

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C02 Deemed withdrawal of patent application after publication (patent law 2001)
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20140212