CN108198567A

CN108198567A - A kind of novel voice is except system of making an uproar

Info

Publication number: CN108198567A
Application number: CN201810153081.7A
Authority: CN
Inventors: 陈思应; 高君效; 何云鹏; 孙振奎; 陈跃华; 余杰
Original assignee: Chengdu Leader Technology Co Ltd
Current assignee: Chengdu Leader Technology Co Ltd; Chipintelli Technology Co Ltd
Priority date: 2018-02-22
Filing date: 2018-02-22
Publication date: 2018-06-22

Abstract

The present invention relates to field of speech recognition, a kind of novel voice is especially related to except system of making an uproar, it is acquired external voice data by sound acquisition module and is transferred to sound identification module and be identified, sound identification module employs the big noise that big noise denoising pattern rejects burst, the big noise eliminating of burst can be ensure that the accuracy of identification of sound source using this method.

Description

A kind of novel voice is except system of making an uproar

Technical field

The present invention relates to intelligent sound identification technology fields, and big noise squelch circuit can be utilized by especially relating to one kind To reject the novel voice of burst noise except system of making an uproar.

Background technology

Constantly increase with the improvement of people ' s living standards and to the demand of electric appliance, household electrical appliance are by constantly changing Leather and innovation, have the function of more using, such as micro-wave oven, and in the past only simple is used to heat, and by now, micro-wave oven increases Added the functions such as boiling, barbecue, hot milk and become more intelligent, the intelligent microwave oven for the various brands that market occurs, mainly by This four most of composition of control panel, observation window, fire door safety lock system, power cord and plug, control panel are mainly functional The functions such as setting, time setting, weight set, function setting mainly by function menu realize, such as directly press steamed fish, The buttons such as steamed spareribs, cooking rice, realize different mode of heatings automatically, and the intelligent microwave oven of all kinds of different brands uses step all It is similar.

Interactive voice can help user that the various terminal equipment in family is seamless as most effective communication control mode It connects, intelligent sound micro-wave oven is exactly one of them, and user is carried out by the i.e. controllable micro-wave oven of simple voice command Different work, in terms of speech recognition, in order to enhance the accuracy rate of the experience sense of user and speech recognition, research staff passes through Technology cross-correlation time delay scheduling algorithm obtains the position that people speaks, and then locks this position, inhibits the sound source of other positions, improves Signal-to-noise ratio is ensured for high phonetic recognization rate, although sound source locking can improve signal-to-noise ratio, works as in environment and occurs big noise suddenly When, sound source focus can be shifted, phonetic order can not be recognized by electronic equipment after leading to big noise, and this reduces user's Intelligent experience sense and the accuracy rate of speech recognition.

Invention content

In order to solve the speech recognition problem of above-mentioned emergent big noise, burst can effectively be rejected by having invented one kind The novel voice of big noise is except system of making an uproar.

A kind of novel voice except making an uproar system, the equipment control circuit being electrically connected including equipment, with the equipment, with it is described The sound identification module and voice playing module that equipment control circuit is electrically connected are electrically connected with the sound identification module Voice acquisition module；

The sound identification module is made of speech processing circuit and sound source lock-in circuit, described in the speech processing circuit processing The voice data that voice acquisition module acquisition comes, the sound source lock-in circuit is according to the processing knot of the speech analysis circuit Fruit locks the position of sound source, and the speech processing circuit is by big noise squelch circuit, conventional squelch circuit and phonetic decision circuit Form, the phonetic decision circuit connect with the equipment control circuit, the phonetic decision circuit respectively with the big noise Squelch circuit is connected with conventional squelch circuit, and the big noise squelch circuit and conventional squelch circuit lock respectively with the sound source Circuit connects, and the sound source lock-in circuit is connect with the voice playing module.

As the preferred embodiment of the present invention, speech recognition engine is embedded in the sound identification module, the voice is known Other engine carries out speech recognition using DNN algorithms.

As the preferred embodiment of the present invention, the voice acquisition module includes N number of voice capture device, and the N is big In the positive integer equal to 2, the voice playing module includes M voice playing equipment, and the M is just whole more than or equal to 1 Number.

The DNN algorithms include voice pretreatment, feature extraction, form Pronounceable dictionary and establish speech model etc. four Process, wherein voice preprocessing process are included to the sampling of voice signal or voice data, anti-confusion filtering, speech enhan-cement and end Point detection, the effect of characteristic extraction procedure be one group is extracted from the waveform of voice signal or voice data being capable of description message Number or voice data feature parameter, to train and to identify, it is then phoneme according to pronunciation to form Pronounceable dictionary, is obtained corresponding Text collection be Pronounceable dictionary, it is then to utilize knowledge of grammar adjustment not conforming to of being identified of acoustic model to establish speech model The word of logic.

In order to which audio data is made easily by Processing with Neural Network, complicated sound wave to be needed to resolve into composition portion one by one Point, to realize that sound wave decomposes, need to use Fourier transformation, complicated sound wave is decomposed into simple sound by Fourier transform Then the energy that every a frequency range is included is added together by wave, obtained result is a frequency spectrum from bass to high pitch, The frequency spectrum is inputted into deep neural network again, each small audio is sliced, neural network will all be attempted to find out currently The initial consonant or simple or compound vowel of a Chinese syllable corresponding to sound said, after our entire audio clips are run through by neural network, finally obtain These, wherein designating each audio block and its most possible corresponding initial consonant or simple or compound vowel of a Chinese syllable, are then based on pronunciation by portion mapping Prediction be combined with the possibility score of the text database based on mark, remove most unlikely as a result, leaving most realistic Result.

Compared with prior art, beneficial effects of the present invention：

1st, as a result of big noise squelch circuit, which can ensure that sound source is known by the big noise eliminating of burst Other accuracy.

Description of the drawings

Fig. 1 is the block diagram of Speech Signal system of the present invention；

Fig. 2 is the block diagram of Speech Signal system embodiment of the present invention.

Specific embodiment

With reference to embodiment and specific embodiment, the present invention is described in further detail, but should not understand this Range for aforementioned body of the present invention is only limitted to following embodiment, all to belong to this based on the technology that the content of present invention is realized The range of invention.

As shown in Fig. 2, a kind of novel voice removes system of making an uproar, the micro-wave oven control being electrically connected including micro-wave oven, with micro-wave oven The sound identification module and voice playing module that circuit and controlling circuit of microwave oven processed are electrically connected, with sound identification module The voice acquisition module of electrical connection；

Sound identification module is made of speech processing circuit and sound source lock-in circuit, speech processing circuit processing voice acquisition module The voice data that acquisition comes, sound source lock-in circuit locks the position of sound source according to the handling result of speech analysis circuit, at voice Reason circuit is made of big noise squelch circuit, conventional noise squelch circuit and phonetic decision circuit, phonetic decision circuit and equipment Control circuit connects, and phonetic decision circuit is connect respectively with big noise squelch circuit and conventional noise squelch circuit, and big noise is gone Noise cancellation circuit and conventional squelch circuit are connect respectively with sound source lock-in circuit, and sound source lock-in circuit is connect with voice playing module.

Speech recognition engine is embedded in sound identification module, sound identification module is integrated on the panel of micro-wave oven, Speech recognition engine carries out speech recognition using DNN algorithms to voice data.

The voice capture device of voice acquisition module is two microphones being set up in parallel in the present embodiment, speech play mould The voice playing equipment of block is a loudspeaker, and when the operation of micro-wave oven terminates or has accident generation, loudspeaker can be to behaviour It is reminded as personnel, in the present embodiment, the module that system is included is integrated on the panel of micro-wave oven.

Claims

1. a kind of novel voice is except system of making an uproar, it is characterised in that：The equipment control electricity being electrically connected including equipment, with the equipment Road, the sound identification module and voice playing module being electrically connected with the equipment control circuit, with the speech recognition mould The voice acquisition module of block electrical connection；

The sound identification module is made of speech processing circuit and sound source lock-in circuit, described in the speech processing circuit processing The voice data that voice acquisition module acquisition comes, the sound source lock-in circuit is according to the processing knot of the speech analysis circuit Fruit locks the position of sound source, and the speech processing circuit is by big noise squelch circuit, conventional noise squelch circuit and phonetic decision Circuit is formed, and the phonetic decision circuit connect with the equipment control circuit, the phonetic decision circuit respectively with it is described greatly Noise squelch circuit connect with conventional squelch circuit, the big noise squelch circuit and routine squelch circuit respectively with the sound source Lock-in circuit connects, and the sound source lock-in circuit is connect with the voice playing module.

2. a kind of sound source locking system according to claim 1, it is characterised in that：It is embedded in the sound identification module Speech recognition engine, the speech recognition engine carry out speech recognition using DNN algorithms.

3. a kind of sound source locking system according to claim 1, it is characterised in that：The voice acquisition module includes N A voice capture device, the N are the positive integer more than or equal to 2, and the voice playing module includes M speech play and sets Standby, the M is the positive integer more than or equal to 1.