CN108198567A - A kind of novel voice is except system of making an uproar - Google Patents
A kind of novel voice is except system of making an uproar Download PDFInfo
- Publication number
- CN108198567A CN108198567A CN201810153081.7A CN201810153081A CN108198567A CN 108198567 A CN108198567 A CN 108198567A CN 201810153081 A CN201810153081 A CN 201810153081A CN 108198567 A CN108198567 A CN 108198567A
- Authority
- CN
- China
- Prior art keywords
- circuit
- voice
- sound source
- speech
- sound
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000012545 processing Methods 0.000 claims description 14
- 235000013399 edible fruits Nutrition 0.000 claims description 2
- 230000005611 electricity Effects 0.000 claims 1
- 238000000034 method Methods 0.000 abstract description 4
- 238000013528 artificial neural network Methods 0.000 description 4
- 238000005516 engineering process Methods 0.000 description 4
- 150000001875 compounds Chemical class 0.000 description 2
- 238000010586 diagram Methods 0.000 description 2
- 238000000605 extraction Methods 0.000 description 2
- 239000000203 mixture Substances 0.000 description 2
- 238000001228 spectrum Methods 0.000 description 2
- 241000251468 Actinopterygii Species 0.000 description 1
- 240000007594 Oryza sativa Species 0.000 description 1
- 235000007164 Oryza sativa Nutrition 0.000 description 1
- 235000021168 barbecue Nutrition 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 238000009835 boiling Methods 0.000 description 1
- 239000004568 cement Substances 0.000 description 1
- 238000004891 communication Methods 0.000 description 1
- 238000010411 cooking Methods 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 238000001914 filtration Methods 0.000 description 1
- 238000010438 heat treatment Methods 0.000 description 1
- 230000002452 interceptive effect Effects 0.000 description 1
- 239000010985 leather Substances 0.000 description 1
- 238000013507 mapping Methods 0.000 description 1
- 239000008267 milk Substances 0.000 description 1
- 210000004080 milk Anatomy 0.000 description 1
- 235000013336 milk Nutrition 0.000 description 1
- 238000007781 pre-processing Methods 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 235000009566 rice Nutrition 0.000 description 1
- 238000005070 sampling Methods 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
- G10L25/18—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being spectral information of each sub-band
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/27—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the analysis technique
- G10L25/30—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the analysis technique using neural networks
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Multimedia (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Quality & Reliability (AREA)
- Artificial Intelligence (AREA)
- Evolutionary Computation (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Electric Ovens (AREA)
Abstract
The present invention relates to field of speech recognition, a kind of novel voice is especially related to except system of making an uproar, it is acquired external voice data by sound acquisition module and is transferred to sound identification module and be identified, sound identification module employs the big noise that big noise denoising pattern rejects burst, the big noise eliminating of burst can be ensure that the accuracy of identification of sound source using this method.
Description
Technical field
The present invention relates to intelligent sound identification technology fields, and big noise squelch circuit can be utilized by especially relating to one kind
To reject the novel voice of burst noise except system of making an uproar.
Background technology
Constantly increase with the improvement of people ' s living standards and to the demand of electric appliance, household electrical appliance are by constantly changing
Leather and innovation, have the function of more using, such as micro-wave oven, and in the past only simple is used to heat, and by now, micro-wave oven increases
Added the functions such as boiling, barbecue, hot milk and become more intelligent, the intelligent microwave oven for the various brands that market occurs, mainly by
This four most of composition of control panel, observation window, fire door safety lock system, power cord and plug, control panel are mainly functional
The functions such as setting, time setting, weight set, function setting mainly by function menu realize, such as directly press steamed fish,
The buttons such as steamed spareribs, cooking rice, realize different mode of heatings automatically, and the intelligent microwave oven of all kinds of different brands uses step all
It is similar.
Interactive voice can help user that the various terminal equipment in family is seamless as most effective communication control mode
It connects, intelligent sound micro-wave oven is exactly one of them, and user is carried out by the i.e. controllable micro-wave oven of simple voice command
Different work, in terms of speech recognition, in order to enhance the accuracy rate of the experience sense of user and speech recognition, research staff passes through
Technology cross-correlation time delay scheduling algorithm obtains the position that people speaks, and then locks this position, inhibits the sound source of other positions, improves
Signal-to-noise ratio is ensured for high phonetic recognization rate, although sound source locking can improve signal-to-noise ratio, works as in environment and occurs big noise suddenly
When, sound source focus can be shifted, phonetic order can not be recognized by electronic equipment after leading to big noise, and this reduces user's
Intelligent experience sense and the accuracy rate of speech recognition.
Invention content
In order to solve the speech recognition problem of above-mentioned emergent big noise, burst can effectively be rejected by having invented one kind
The novel voice of big noise is except system of making an uproar.
A kind of novel voice except making an uproar system, the equipment control circuit being electrically connected including equipment, with the equipment, with it is described
The sound identification module and voice playing module that equipment control circuit is electrically connected are electrically connected with the sound identification module
Voice acquisition module;
The sound identification module is made of speech processing circuit and sound source lock-in circuit, described in the speech processing circuit processing
The voice data that voice acquisition module acquisition comes, the sound source lock-in circuit is according to the processing knot of the speech analysis circuit
Fruit locks the position of sound source, and the speech processing circuit is by big noise squelch circuit, conventional squelch circuit and phonetic decision circuit
Form, the phonetic decision circuit connect with the equipment control circuit, the phonetic decision circuit respectively with the big noise
Squelch circuit is connected with conventional squelch circuit, and the big noise squelch circuit and conventional squelch circuit lock respectively with the sound source
Circuit connects, and the sound source lock-in circuit is connect with the voice playing module.
As the preferred embodiment of the present invention, speech recognition engine is embedded in the sound identification module, the voice is known
Other engine carries out speech recognition using DNN algorithms.
As the preferred embodiment of the present invention, the voice acquisition module includes N number of voice capture device, and the N is big
In the positive integer equal to 2, the voice playing module includes M voice playing equipment, and the M is just whole more than or equal to 1
Number.
The DNN algorithms include voice pretreatment, feature extraction, form Pronounceable dictionary and establish speech model etc. four
Process, wherein voice preprocessing process are included to the sampling of voice signal or voice data, anti-confusion filtering, speech enhan-cement and end
Point detection, the effect of characteristic extraction procedure be one group is extracted from the waveform of voice signal or voice data being capable of description message
Number or voice data feature parameter, to train and to identify, it is then phoneme according to pronunciation to form Pronounceable dictionary, is obtained corresponding
Text collection be Pronounceable dictionary, it is then to utilize knowledge of grammar adjustment not conforming to of being identified of acoustic model to establish speech model
The word of logic.
In order to which audio data is made easily by Processing with Neural Network, complicated sound wave to be needed to resolve into composition portion one by one
Point, to realize that sound wave decomposes, need to use Fourier transformation, complicated sound wave is decomposed into simple sound by Fourier transform
Then the energy that every a frequency range is included is added together by wave, obtained result is a frequency spectrum from bass to high pitch,
The frequency spectrum is inputted into deep neural network again, each small audio is sliced, neural network will all be attempted to find out currently
The initial consonant or simple or compound vowel of a Chinese syllable corresponding to sound said, after our entire audio clips are run through by neural network, finally obtain
These, wherein designating each audio block and its most possible corresponding initial consonant or simple or compound vowel of a Chinese syllable, are then based on pronunciation by portion mapping
Prediction be combined with the possibility score of the text database based on mark, remove most unlikely as a result, leaving most realistic
Result.
Compared with prior art, beneficial effects of the present invention:
1st, as a result of big noise squelch circuit, which can ensure that sound source is known by the big noise eliminating of burst
Other accuracy.
Description of the drawings
Fig. 1 is the block diagram of Speech Signal system of the present invention;
Fig. 2 is the block diagram of Speech Signal system embodiment of the present invention.
Specific embodiment
With reference to embodiment and specific embodiment, the present invention is described in further detail, but should not understand this
Range for aforementioned body of the present invention is only limitted to following embodiment, all to belong to this based on the technology that the content of present invention is realized
The range of invention.
As shown in Fig. 2, a kind of novel voice removes system of making an uproar, the micro-wave oven control being electrically connected including micro-wave oven, with micro-wave oven
The sound identification module and voice playing module that circuit and controlling circuit of microwave oven processed are electrically connected, with sound identification module
The voice acquisition module of electrical connection;
Sound identification module is made of speech processing circuit and sound source lock-in circuit, speech processing circuit processing voice acquisition module
The voice data that acquisition comes, sound source lock-in circuit locks the position of sound source according to the handling result of speech analysis circuit, at voice
Reason circuit is made of big noise squelch circuit, conventional noise squelch circuit and phonetic decision circuit, phonetic decision circuit and equipment
Control circuit connects, and phonetic decision circuit is connect respectively with big noise squelch circuit and conventional noise squelch circuit, and big noise is gone
Noise cancellation circuit and conventional squelch circuit are connect respectively with sound source lock-in circuit, and sound source lock-in circuit is connect with voice playing module.
Speech recognition engine is embedded in sound identification module, sound identification module is integrated on the panel of micro-wave oven,
Speech recognition engine carries out speech recognition using DNN algorithms to voice data.
The voice capture device of voice acquisition module is two microphones being set up in parallel in the present embodiment, speech play mould
The voice playing equipment of block is a loudspeaker, and when the operation of micro-wave oven terminates or has accident generation, loudspeaker can be to behaviour
It is reminded as personnel, in the present embodiment, the module that system is included is integrated on the panel of micro-wave oven.
Claims (3)
1. a kind of novel voice is except system of making an uproar, it is characterised in that:The equipment control electricity being electrically connected including equipment, with the equipment
Road, the sound identification module and voice playing module being electrically connected with the equipment control circuit, with the speech recognition mould
The voice acquisition module of block electrical connection;
The sound identification module is made of speech processing circuit and sound source lock-in circuit, described in the speech processing circuit processing
The voice data that voice acquisition module acquisition comes, the sound source lock-in circuit is according to the processing knot of the speech analysis circuit
Fruit locks the position of sound source, and the speech processing circuit is by big noise squelch circuit, conventional noise squelch circuit and phonetic decision
Circuit is formed, and the phonetic decision circuit connect with the equipment control circuit, the phonetic decision circuit respectively with it is described greatly
Noise squelch circuit connect with conventional squelch circuit, the big noise squelch circuit and routine squelch circuit respectively with the sound source
Lock-in circuit connects, and the sound source lock-in circuit is connect with the voice playing module.
2. a kind of sound source locking system according to claim 1, it is characterised in that:It is embedded in the sound identification module
Speech recognition engine, the speech recognition engine carry out speech recognition using DNN algorithms.
3. a kind of sound source locking system according to claim 1, it is characterised in that:The voice acquisition module includes N
A voice capture device, the N are the positive integer more than or equal to 2, and the voice playing module includes M speech play and sets
Standby, the M is the positive integer more than or equal to 1.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810153081.7A CN108198567A (en) | 2018-02-22 | 2018-02-22 | A kind of novel voice is except system of making an uproar |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810153081.7A CN108198567A (en) | 2018-02-22 | 2018-02-22 | A kind of novel voice is except system of making an uproar |
Publications (1)
Publication Number | Publication Date |
---|---|
CN108198567A true CN108198567A (en) | 2018-06-22 |
Family
ID=62594021
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201810153081.7A Pending CN108198567A (en) | 2018-02-22 | 2018-02-22 | A kind of novel voice is except system of making an uproar |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN108198567A (en) |
Citations (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1707613A (en) * | 2005-05-27 | 2005-12-14 | 曲哲 | Collecting apparatus and method for noise insulation audio frequency |
CN101740028A (en) * | 2009-11-20 | 2010-06-16 | 四川长虹电器股份有限公司 | Voice control system of household appliance |
CN204390479U (en) * | 2015-03-04 | 2015-06-10 | 冠捷显示科技(厦门)有限公司 | A kind of controlling intelligent household appliances telechiric device |
CN104754183A (en) * | 2015-04-10 | 2015-07-01 | 四川理工学院 | Real-time monitoring video adaptive filtering method and real-time monitoring video adaptive filtering system |
CN104834922A (en) * | 2015-05-27 | 2015-08-12 | 电子科技大学 | Hybrid neural network-based gesture recognition method |
CN104936091A (en) * | 2015-05-14 | 2015-09-23 | 科大讯飞股份有限公司 | Intelligent interaction method and system based on circle microphone array |
CN106548772A (en) * | 2017-01-16 | 2017-03-29 | 上海智臻智能网络科技股份有限公司 | Speech recognition test system and method |
CN106548771A (en) * | 2015-09-21 | 2017-03-29 | 上海日趋信息技术有限公司 | For the method that speech recognition system eliminates burst noise |
CN106772244A (en) * | 2016-11-25 | 2017-05-31 | 北京明泰朗繁精密设备有限公司 | A kind of sonic location system and method |
CN107392864A (en) * | 2017-07-01 | 2017-11-24 | 南京理工大学 | A kind of mixed noise filtering method for removing Gaussian noise and impulsive noise |
CN107403619A (en) * | 2017-06-30 | 2017-11-28 | 武汉泰迪智慧科技有限公司 | A kind of sound control method and system applied to bicycle environment |
CN107705260A (en) * | 2017-10-03 | 2018-02-16 | 陈值英 | The denoising system of medical X-ray image |
-
2018
- 2018-02-22 CN CN201810153081.7A patent/CN108198567A/en active Pending
Patent Citations (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1707613A (en) * | 2005-05-27 | 2005-12-14 | 曲哲 | Collecting apparatus and method for noise insulation audio frequency |
CN101740028A (en) * | 2009-11-20 | 2010-06-16 | 四川长虹电器股份有限公司 | Voice control system of household appliance |
CN204390479U (en) * | 2015-03-04 | 2015-06-10 | 冠捷显示科技(厦门)有限公司 | A kind of controlling intelligent household appliances telechiric device |
CN104754183A (en) * | 2015-04-10 | 2015-07-01 | 四川理工学院 | Real-time monitoring video adaptive filtering method and real-time monitoring video adaptive filtering system |
CN104936091A (en) * | 2015-05-14 | 2015-09-23 | 科大讯飞股份有限公司 | Intelligent interaction method and system based on circle microphone array |
CN104834922A (en) * | 2015-05-27 | 2015-08-12 | 电子科技大学 | Hybrid neural network-based gesture recognition method |
CN106548771A (en) * | 2015-09-21 | 2017-03-29 | 上海日趋信息技术有限公司 | For the method that speech recognition system eliminates burst noise |
CN106772244A (en) * | 2016-11-25 | 2017-05-31 | 北京明泰朗繁精密设备有限公司 | A kind of sonic location system and method |
CN106548772A (en) * | 2017-01-16 | 2017-03-29 | 上海智臻智能网络科技股份有限公司 | Speech recognition test system and method |
CN107403619A (en) * | 2017-06-30 | 2017-11-28 | 武汉泰迪智慧科技有限公司 | A kind of sound control method and system applied to bicycle environment |
CN107392864A (en) * | 2017-07-01 | 2017-11-24 | 南京理工大学 | A kind of mixed noise filtering method for removing Gaussian noise and impulsive noise |
CN107705260A (en) * | 2017-10-03 | 2018-02-16 | 陈值英 | The denoising system of medical X-ray image |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN109800700B (en) | Underwater acoustic signal target classification and identification method based on deep learning | |
CN110120227A (en) | A kind of depth stacks the speech separating method of residual error network | |
CN108172220A (en) | A kind of novel voice denoising method | |
CN106847281A (en) | Intelligent household voice control system and method based on voice fuzzy identification technology | |
US20170154640A1 (en) | Method and electronic device for voice recognition based on dynamic voice model selection | |
CN102005070A (en) | Voice identification gate control system | |
CN105206271A (en) | Intelligent equipment voice wake-up method and system for realizing method | |
CN105096946B (en) | Awakening device and method based on voice activation detection | |
CN103886236A (en) | Acoustic control screen unlocking method and mobile terminal | |
CN110956965A (en) | Personalized intelligent home safety control system and method based on voiceprint recognition | |
CN110189746A (en) | A kind of method for recognizing speech applied to earth-space communication | |
CN109561003A (en) | A kind of IR remote controller and electrical control system based on acoustic control | |
CN108461081A (en) | Method, apparatus, equipment and the storage medium of voice control | |
CN108091327A (en) | A kind of intelligent sound apparatus control method | |
CN104952446A (en) | Digital building presentation system based on voice interaction | |
CN105405447B (en) | One kind sending words respiratory noise screen method | |
Wang et al. | Application of speech recognition technology in IoT smart home | |
CN109767767A (en) | A kind of voice interactive method, system, electronic equipment and storage medium | |
CN109544745A (en) | A kind of intelligent door lock control method, apparatus and system | |
CN108198567A (en) | A kind of novel voice is except system of making an uproar | |
CN107093430A (en) | A kind of vocal print feature extraction algorithm based on wavelet package transforms | |
CN113077798B (en) | Old man calls for help equipment at home | |
CN106971712A (en) | A kind of adaptive rapid voiceprint recognition methods and system | |
CN110580900A (en) | Vehicle-mounted sound voice control system | |
CN104240705A (en) | Intelligent voice-recognition locking system for safe box |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20180622 |
|
RJ01 | Rejection of invention patent application after publication |