CN108198567A - A kind of novel voice is except system of making an uproar - Google Patents

A kind of novel voice is except system of making an uproar Download PDF

Info

Publication number
CN108198567A
CN108198567A CN201810153081.7A CN201810153081A CN108198567A CN 108198567 A CN108198567 A CN 108198567A CN 201810153081 A CN201810153081 A CN 201810153081A CN 108198567 A CN108198567 A CN 108198567A
Authority
CN
China
Prior art keywords
circuit
voice
sound source
speech
sound
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201810153081.7A
Other languages
Chinese (zh)
Inventor
陈思应
高君效
何云鹏
孙振奎
陈跃华
余杰
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Chengdu Leader Technology Co Ltd
Chipintelli Technology Co Ltd
Original Assignee
Chengdu Leader Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Chengdu Leader Technology Co Ltd filed Critical Chengdu Leader Technology Co Ltd
Priority to CN201810153081.7A priority Critical patent/CN108198567A/en
Publication of CN108198567A publication Critical patent/CN108198567A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/18Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being spectral information of each sub-band
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/27Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the analysis technique
    • G10L25/30Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the analysis technique using neural networks

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Quality & Reliability (AREA)
  • Artificial Intelligence (AREA)
  • Evolutionary Computation (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Electric Ovens (AREA)

Abstract

The present invention relates to field of speech recognition, a kind of novel voice is especially related to except system of making an uproar, it is acquired external voice data by sound acquisition module and is transferred to sound identification module and be identified, sound identification module employs the big noise that big noise denoising pattern rejects burst, the big noise eliminating of burst can be ensure that the accuracy of identification of sound source using this method.

Description

A kind of novel voice is except system of making an uproar
Technical field
The present invention relates to intelligent sound identification technology fields, and big noise squelch circuit can be utilized by especially relating to one kind To reject the novel voice of burst noise except system of making an uproar.
Background technology
Constantly increase with the improvement of people ' s living standards and to the demand of electric appliance, household electrical appliance are by constantly changing Leather and innovation, have the function of more using, such as micro-wave oven, and in the past only simple is used to heat, and by now, micro-wave oven increases Added the functions such as boiling, barbecue, hot milk and become more intelligent, the intelligent microwave oven for the various brands that market occurs, mainly by This four most of composition of control panel, observation window, fire door safety lock system, power cord and plug, control panel are mainly functional The functions such as setting, time setting, weight set, function setting mainly by function menu realize, such as directly press steamed fish, The buttons such as steamed spareribs, cooking rice, realize different mode of heatings automatically, and the intelligent microwave oven of all kinds of different brands uses step all It is similar.
Interactive voice can help user that the various terminal equipment in family is seamless as most effective communication control mode It connects, intelligent sound micro-wave oven is exactly one of them, and user is carried out by the i.e. controllable micro-wave oven of simple voice command Different work, in terms of speech recognition, in order to enhance the accuracy rate of the experience sense of user and speech recognition, research staff passes through Technology cross-correlation time delay scheduling algorithm obtains the position that people speaks, and then locks this position, inhibits the sound source of other positions, improves Signal-to-noise ratio is ensured for high phonetic recognization rate, although sound source locking can improve signal-to-noise ratio, works as in environment and occurs big noise suddenly When, sound source focus can be shifted, phonetic order can not be recognized by electronic equipment after leading to big noise, and this reduces user's Intelligent experience sense and the accuracy rate of speech recognition.
Invention content
In order to solve the speech recognition problem of above-mentioned emergent big noise, burst can effectively be rejected by having invented one kind The novel voice of big noise is except system of making an uproar.
A kind of novel voice except making an uproar system, the equipment control circuit being electrically connected including equipment, with the equipment, with it is described The sound identification module and voice playing module that equipment control circuit is electrically connected are electrically connected with the sound identification module Voice acquisition module;
The sound identification module is made of speech processing circuit and sound source lock-in circuit, described in the speech processing circuit processing The voice data that voice acquisition module acquisition comes, the sound source lock-in circuit is according to the processing knot of the speech analysis circuit Fruit locks the position of sound source, and the speech processing circuit is by big noise squelch circuit, conventional squelch circuit and phonetic decision circuit Form, the phonetic decision circuit connect with the equipment control circuit, the phonetic decision circuit respectively with the big noise Squelch circuit is connected with conventional squelch circuit, and the big noise squelch circuit and conventional squelch circuit lock respectively with the sound source Circuit connects, and the sound source lock-in circuit is connect with the voice playing module.
As the preferred embodiment of the present invention, speech recognition engine is embedded in the sound identification module, the voice is known Other engine carries out speech recognition using DNN algorithms.
As the preferred embodiment of the present invention, the voice acquisition module includes N number of voice capture device, and the N is big In the positive integer equal to 2, the voice playing module includes M voice playing equipment, and the M is just whole more than or equal to 1 Number.
The DNN algorithms include voice pretreatment, feature extraction, form Pronounceable dictionary and establish speech model etc. four Process, wherein voice preprocessing process are included to the sampling of voice signal or voice data, anti-confusion filtering, speech enhan-cement and end Point detection, the effect of characteristic extraction procedure be one group is extracted from the waveform of voice signal or voice data being capable of description message Number or voice data feature parameter, to train and to identify, it is then phoneme according to pronunciation to form Pronounceable dictionary, is obtained corresponding Text collection be Pronounceable dictionary, it is then to utilize knowledge of grammar adjustment not conforming to of being identified of acoustic model to establish speech model The word of logic.
In order to which audio data is made easily by Processing with Neural Network, complicated sound wave to be needed to resolve into composition portion one by one Point, to realize that sound wave decomposes, need to use Fourier transformation, complicated sound wave is decomposed into simple sound by Fourier transform Then the energy that every a frequency range is included is added together by wave, obtained result is a frequency spectrum from bass to high pitch, The frequency spectrum is inputted into deep neural network again, each small audio is sliced, neural network will all be attempted to find out currently The initial consonant or simple or compound vowel of a Chinese syllable corresponding to sound said, after our entire audio clips are run through by neural network, finally obtain These, wherein designating each audio block and its most possible corresponding initial consonant or simple or compound vowel of a Chinese syllable, are then based on pronunciation by portion mapping Prediction be combined with the possibility score of the text database based on mark, remove most unlikely as a result, leaving most realistic Result.
Compared with prior art, beneficial effects of the present invention:
1st, as a result of big noise squelch circuit, which can ensure that sound source is known by the big noise eliminating of burst Other accuracy.
Description of the drawings
Fig. 1 is the block diagram of Speech Signal system of the present invention;
Fig. 2 is the block diagram of Speech Signal system embodiment of the present invention.
Specific embodiment
With reference to embodiment and specific embodiment, the present invention is described in further detail, but should not understand this Range for aforementioned body of the present invention is only limitted to following embodiment, all to belong to this based on the technology that the content of present invention is realized The range of invention.
As shown in Fig. 2, a kind of novel voice removes system of making an uproar, the micro-wave oven control being electrically connected including micro-wave oven, with micro-wave oven The sound identification module and voice playing module that circuit and controlling circuit of microwave oven processed are electrically connected, with sound identification module The voice acquisition module of electrical connection;
Sound identification module is made of speech processing circuit and sound source lock-in circuit, speech processing circuit processing voice acquisition module The voice data that acquisition comes, sound source lock-in circuit locks the position of sound source according to the handling result of speech analysis circuit, at voice Reason circuit is made of big noise squelch circuit, conventional noise squelch circuit and phonetic decision circuit, phonetic decision circuit and equipment Control circuit connects, and phonetic decision circuit is connect respectively with big noise squelch circuit and conventional noise squelch circuit, and big noise is gone Noise cancellation circuit and conventional squelch circuit are connect respectively with sound source lock-in circuit, and sound source lock-in circuit is connect with voice playing module.
Speech recognition engine is embedded in sound identification module, sound identification module is integrated on the panel of micro-wave oven, Speech recognition engine carries out speech recognition using DNN algorithms to voice data.
The voice capture device of voice acquisition module is two microphones being set up in parallel in the present embodiment, speech play mould The voice playing equipment of block is a loudspeaker, and when the operation of micro-wave oven terminates or has accident generation, loudspeaker can be to behaviour It is reminded as personnel, in the present embodiment, the module that system is included is integrated on the panel of micro-wave oven.

Claims (3)

1. a kind of novel voice is except system of making an uproar, it is characterised in that:The equipment control electricity being electrically connected including equipment, with the equipment Road, the sound identification module and voice playing module being electrically connected with the equipment control circuit, with the speech recognition mould The voice acquisition module of block electrical connection;
The sound identification module is made of speech processing circuit and sound source lock-in circuit, described in the speech processing circuit processing The voice data that voice acquisition module acquisition comes, the sound source lock-in circuit is according to the processing knot of the speech analysis circuit Fruit locks the position of sound source, and the speech processing circuit is by big noise squelch circuit, conventional noise squelch circuit and phonetic decision Circuit is formed, and the phonetic decision circuit connect with the equipment control circuit, the phonetic decision circuit respectively with it is described greatly Noise squelch circuit connect with conventional squelch circuit, the big noise squelch circuit and routine squelch circuit respectively with the sound source Lock-in circuit connects, and the sound source lock-in circuit is connect with the voice playing module.
2. a kind of sound source locking system according to claim 1, it is characterised in that:It is embedded in the sound identification module Speech recognition engine, the speech recognition engine carry out speech recognition using DNN algorithms.
3. a kind of sound source locking system according to claim 1, it is characterised in that:The voice acquisition module includes N A voice capture device, the N are the positive integer more than or equal to 2, and the voice playing module includes M speech play and sets Standby, the M is the positive integer more than or equal to 1.
CN201810153081.7A 2018-02-22 2018-02-22 A kind of novel voice is except system of making an uproar Pending CN108198567A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810153081.7A CN108198567A (en) 2018-02-22 2018-02-22 A kind of novel voice is except system of making an uproar

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810153081.7A CN108198567A (en) 2018-02-22 2018-02-22 A kind of novel voice is except system of making an uproar

Publications (1)

Publication Number Publication Date
CN108198567A true CN108198567A (en) 2018-06-22

Family

ID=62594021

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810153081.7A Pending CN108198567A (en) 2018-02-22 2018-02-22 A kind of novel voice is except system of making an uproar

Country Status (1)

Country Link
CN (1) CN108198567A (en)

Citations (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1707613A (en) * 2005-05-27 2005-12-14 曲哲 Collecting apparatus and method for noise insulation audio frequency
CN101740028A (en) * 2009-11-20 2010-06-16 四川长虹电器股份有限公司 Voice control system of household appliance
CN204390479U (en) * 2015-03-04 2015-06-10 冠捷显示科技(厦门)有限公司 A kind of controlling intelligent household appliances telechiric device
CN104754183A (en) * 2015-04-10 2015-07-01 四川理工学院 Real-time monitoring video adaptive filtering method and real-time monitoring video adaptive filtering system
CN104834922A (en) * 2015-05-27 2015-08-12 电子科技大学 Hybrid neural network-based gesture recognition method
CN104936091A (en) * 2015-05-14 2015-09-23 科大讯飞股份有限公司 Intelligent interaction method and system based on circle microphone array
CN106548772A (en) * 2017-01-16 2017-03-29 上海智臻智能网络科技股份有限公司 Speech recognition test system and method
CN106548771A (en) * 2015-09-21 2017-03-29 上海日趋信息技术有限公司 For the method that speech recognition system eliminates burst noise
CN106772244A (en) * 2016-11-25 2017-05-31 北京明泰朗繁精密设备有限公司 A kind of sonic location system and method
CN107392864A (en) * 2017-07-01 2017-11-24 南京理工大学 A kind of mixed noise filtering method for removing Gaussian noise and impulsive noise
CN107403619A (en) * 2017-06-30 2017-11-28 武汉泰迪智慧科技有限公司 A kind of sound control method and system applied to bicycle environment
CN107705260A (en) * 2017-10-03 2018-02-16 陈值英 The denoising system of medical X-ray image

Patent Citations (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1707613A (en) * 2005-05-27 2005-12-14 曲哲 Collecting apparatus and method for noise insulation audio frequency
CN101740028A (en) * 2009-11-20 2010-06-16 四川长虹电器股份有限公司 Voice control system of household appliance
CN204390479U (en) * 2015-03-04 2015-06-10 冠捷显示科技(厦门)有限公司 A kind of controlling intelligent household appliances telechiric device
CN104754183A (en) * 2015-04-10 2015-07-01 四川理工学院 Real-time monitoring video adaptive filtering method and real-time monitoring video adaptive filtering system
CN104936091A (en) * 2015-05-14 2015-09-23 科大讯飞股份有限公司 Intelligent interaction method and system based on circle microphone array
CN104834922A (en) * 2015-05-27 2015-08-12 电子科技大学 Hybrid neural network-based gesture recognition method
CN106548771A (en) * 2015-09-21 2017-03-29 上海日趋信息技术有限公司 For the method that speech recognition system eliminates burst noise
CN106772244A (en) * 2016-11-25 2017-05-31 北京明泰朗繁精密设备有限公司 A kind of sonic location system and method
CN106548772A (en) * 2017-01-16 2017-03-29 上海智臻智能网络科技股份有限公司 Speech recognition test system and method
CN107403619A (en) * 2017-06-30 2017-11-28 武汉泰迪智慧科技有限公司 A kind of sound control method and system applied to bicycle environment
CN107392864A (en) * 2017-07-01 2017-11-24 南京理工大学 A kind of mixed noise filtering method for removing Gaussian noise and impulsive noise
CN107705260A (en) * 2017-10-03 2018-02-16 陈值英 The denoising system of medical X-ray image

Similar Documents

Publication Publication Date Title
CN109800700B (en) Underwater acoustic signal target classification and identification method based on deep learning
CN110120227A (en) A kind of depth stacks the speech separating method of residual error network
CN108172220A (en) A kind of novel voice denoising method
CN106847281A (en) Intelligent household voice control system and method based on voice fuzzy identification technology
US20170154640A1 (en) Method and electronic device for voice recognition based on dynamic voice model selection
CN102005070A (en) Voice identification gate control system
CN105206271A (en) Intelligent equipment voice wake-up method and system for realizing method
CN105096946B (en) Awakening device and method based on voice activation detection
CN103886236A (en) Acoustic control screen unlocking method and mobile terminal
CN110956965A (en) Personalized intelligent home safety control system and method based on voiceprint recognition
CN110189746A (en) A kind of method for recognizing speech applied to earth-space communication
CN109561003A (en) A kind of IR remote controller and electrical control system based on acoustic control
CN108461081A (en) Method, apparatus, equipment and the storage medium of voice control
CN108091327A (en) A kind of intelligent sound apparatus control method
CN104952446A (en) Digital building presentation system based on voice interaction
CN105405447B (en) One kind sending words respiratory noise screen method
Wang et al. Application of speech recognition technology in IoT smart home
CN109767767A (en) A kind of voice interactive method, system, electronic equipment and storage medium
CN109544745A (en) A kind of intelligent door lock control method, apparatus and system
CN108198567A (en) A kind of novel voice is except system of making an uproar
CN107093430A (en) A kind of vocal print feature extraction algorithm based on wavelet package transforms
CN113077798B (en) Old man calls for help equipment at home
CN106971712A (en) A kind of adaptive rapid voiceprint recognition methods and system
CN110580900A (en) Vehicle-mounted sound voice control system
CN104240705A (en) Intelligent voice-recognition locking system for safe box

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20180622

RJ01 Rejection of invention patent application after publication