CN101882370A - Voice recognition remote controller - Google Patents

Voice recognition remote controller Download PDF

Info

Publication number
CN101882370A
CN101882370A CN2010102149949A CN201010214994A CN101882370A CN 101882370 A CN101882370 A CN 101882370A CN 2010102149949 A CN2010102149949 A CN 2010102149949A CN 201010214994 A CN201010214994 A CN 201010214994A CN 101882370 A CN101882370 A CN 101882370A
Authority
CN
China
Prior art keywords
module
signal
remote controller
voice
blind source
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN2010102149949A
Other languages
Chinese (zh)
Inventor
罗笑南
吴其泽
刘广发
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Sun Yat Sen University
National Sun Yat Sen University
Original Assignee
National Sun Yat Sen University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by National Sun Yat Sen University filed Critical National Sun Yat Sen University
Priority to CN2010102149949A priority Critical patent/CN101882370A/en
Publication of CN101882370A publication Critical patent/CN101882370A/en
Pending legal-status Critical Current

Links

Images

Landscapes

  • Selective Calling Equipment (AREA)

Abstract

The embodiment of the invention discloses a voice recognition remote controller, which comprises buttons and a chip of a common remote controller, a sensor group, an analog-to-digital conversion module, a blind source separation module, a voice recognition module and a control and response module, wherein the sensor group is arranged on the remote controller and is a group of porous microphones used for receiving voice signals; the analog-to-digital conversion module is used for receiving voice signals received and input by the sensor group, and converting the voice signals to digital acquisition signals which can be processed by the digital chip; the blind source separation module receives the digital acquisition signals from the analog-to-digital conversion module, and separates mixed signals through a blind source separation algorithm; and the voice recognition module receives signals separated by the blind source separation module, recognizes useful signals, and sends response voice instruction codes to the control and response module according to the recognized signals. Through the voice recognition remote controller and by utilizing the blind source separation technology, the mixed voice signals are separated, and the subsequent recognition rate is improved.

Description

A kind of voice recognition remote controller
Technical field
The present invention relates to digital home technical field, be specifically related to a kind of voice recognition remote controller.
Background technology
Present stage on the market the TV remote controller of main flow all be based on simple electronic circuit and on button realize control function.Its biggest advantage is exactly with low cost, reliable in quality; But shortcoming also is conspicuous, and that is exactly that button is various, and is directly perceived inadequately, is not easy to the user and remembers use.The telepilot of a complexity can allow the user that a kind of forbidding sensation is arranged.
Along with the continuous progress of science and technology, among the speech recognition technology life that appears at us gradually, as mobile phone, PC.An importance of household electrical appliance development is to allow user interface hommization more, and convenient nature accomplishes that the elderly and the disabled can use without barrier.Utilize speech recognition technology to realize that voice control is an important channel of improving household appliances user interface quality.
The telepilot that has speech identifying function can greatly improve the availability of household appliances.With the TV remote controller is example, if the user wants to watch " central authorities one cover " program, he otherwise channel browsing is wanted the program seen up to him occurring one by one, or the platform numeral of memory " central authorities' one cover ", this is not easy to use.The TV remote controller that has added speech identifying function only need be said " central authorities' one cover ", and the control signal of turntable just can be discerned and send to televisor to telepilot automatically.
The telepilot that has speech identifying function also has a difficult problem, runs into the identification problem of multi-source input signal exactly.With the TV remote controller is example.The user sends instruction by voice to telepilot when televiewing, at this moment, it is not the phonetic order that simple user says that telepilot receives voice signal, but the mixed signal of TV loudspeaker and user's voice instruction.Though user's voice instruction intensity may be greater than the sound of TV loudspeaker, the signal that mixes is very big for the influence of speech recognition, influences its discrimination greatly.
Summary of the invention
The invention provides a kind of voice recognition remote controller that separates based on blind source, make that the preceding mixed signal of speech recognition is separated, improve discrimination.
In order to realize goal of the invention, the embodiment of the invention discloses a kind of voice recognition remote controller, comprise conventional remote controller buttons and chip, sensor groups, D/A converter module, blind source separation module, sound identification module, control and respond module, wherein:
Conventional remote controller buttons and chip make telepilot have the function that general telepilot has, and comprise Menu key, the volume adjusting key ,+/-key, signal emission module;
Sensor groups is one group of poroid microphone that is used for received speech signal on telepilot;
D/A converter module is used to receive the voice signal that biography comes from the input of sensor group of received, and changes into the accessible digital collection signal of digit chip;
Blind source separation module receives the digital collection signal from D/A converter module, by blind source separation algorithm, the Signal Separation of mixing;
Sound identification module receives from the signal after the separation of blind source separation module, identifies useful signal, and sends the voice responsive coded instructions according to the voice that identify to control and respond module;
Control and respond module, preestablish the rule of man-machine interaction, be used for receiving information, send information, and confirmed to send steering order to the telepilot chip after the instruction by loudspeaker by sound identification module.
Microphone number in the described sensor groups is no less than two.
After described blind source separation module is used to receive the multichannel mixed signal, at first carries out centralization and albefaction and handle, the iteration optimization separation matrix is tried to achieve separation signal by separation matrix after the convergence then, the signal after output separates at last.
The present invention has the following advantages: utilize speech recognition technology, and can be so that man-machine interaction be more humane.Utilize blind source separate technology, the voice signal that mixes is separated, improve follow-up discrimination.
Description of drawings
In order to be illustrated more clearly in the embodiment of the invention or technical scheme of the prior art, to do to introduce simply to the accompanying drawing of required use in embodiment or the description of the Prior Art below, apparently, accompanying drawing in describing below only is some embodiments of the present invention, for those of ordinary skills, under the prerequisite of not paying creative work, can also obtain other accompanying drawing according to these accompanying drawings.
Fig. 1 is the voice recognition remote controller structural representation in the embodiment of the invention;
Fig. 2 is the blind source separation module workflow diagram among Fig. 1;
Fig. 3 uses the flow process of telepilot of the present invention for the user in the embodiment of the invention.
Embodiment
Below in conjunction with the accompanying drawing in the embodiment of the invention, the technical scheme in the embodiment of the invention is clearly and completely described, obviously, described embodiment only is the present invention's part embodiment, rather than whole embodiment.Based on the embodiment among the present invention, those of ordinary skills belong to the scope of protection of the invention not making all other embodiment that obtained under the creative work prerequisite.
The remote-controller function structural drawing as shown in Figure 1, telepilot is by sensor groups, D/A converter module, blind source separation module, sound identification module, control and respond module, the conventional func module is formed.The sensor received speech signal is after become digital signal to analog signal conversion by D/A converter module, by blind source separation module mixed signal is separated then, sound identification module is discerned useful command signal then, and send to control and respond module, control and rule and the user interactions of respond module according to setting, last transmitting control commands is finished the process of term sound control system telepilot to the conventional func module.
Concrete, here conventional remote controller buttons and chip make telepilot have the function that general telepilot has, and comprise Menu key, the volume adjusting key ,+/-key, signal emission module or the like.
Sensor groups is one group of poroid microphone that is used for received speech signal on telepilot; According to the theory that separate in blind source, to separate for the blind source of realizing signal smoothly, the number of signals of reception must not be less than the number of sound source, so the microphone number is no less than two, could differentiate the sound of TV and user's sound.
D/A converter module is used to receive the voice signal that biography comes from the input of sensor group of received, and changes into the accessible digital collection signal of digit chip.
Blind source separation module receives the digital collection signal from D/A converter module, by blind source separation algorithm, the Signal Separation of mixing.
Sound identification module receives from the signal after the separation of blind source separation module, identifies useful signal, and sends the voice responsive coded instructions according to the voice that identify to control and respond module.
Control and respond module, the inside is set with the rule of man-machine interaction, receives information by sound identification module, sends information by loudspeaker, carries out alternately with the user in this way.And confirmed to send steering order to the telepilot chip after the instruction.
Blind source separation module workflow diagram after the multichannel mixed signal is imported this module, at first carries out centralization and albefaction and handles as shown in Figure 2, and the iteration optimization separation matrix is tried to achieve separation signal by separation matrix after the convergence then, at last output.
Here represent the source signal matrix with S, A represents hybrid matrix, X represents the observation signal matrix, W represents separation matrix, Y ecbatic signal matrix, then, X=AS is exactly the signal that sensor groups receives, we will obtain separation matrix W exactly, make Y=WS approach S, have just realized the separation of mixed signal like that.
Specifically describe each step principle below.
Signal centerization is exactly to make that the average of signal is zero.If x is the non-vanishing stochastic variable of average, only need use x 0=x-E (x) replaces x to get final product.Then replace its mathematical expectation to realize zero-meanization in practice with arithmetic mean.
The albefaction of signal is exactly to make the correlation matrix of the stochastic variable x ' after the conversion satisfy R by certain linear transformation T:x '=Tx X '=E[x ' x ' H]=I.
If the correlation matrix of mixed signal vector x is R x, by the character of correlation matrix as can be known, R xExist characteristic value decomposition to be:
R x=Q∑ 2Q T
∑ in the formula 2Be diagonal matrix.
Make the T=∑ -1Q T, establish x '=Tx, then can be so that the correlation matrix of the x ' after the conversion is I, thus realized the albefaction of signal.
The process of iteration optimization separation matrix is described with maximum entropy method (MEM) below.
Entropy is a notion of information theory the inside.The entropy H (A) of definition A is the mean value of incident self-information, and the mathematic(al) representation of the entropy of discrete random variable is:
H ( A ) = E ( I ) = - Σ k = 1 n p i · log ( p k )
The combination entropy of two stochastic variable x and y is defined as:
H ( x , y ) = - Σ i p ( x = a i , y = b i ) lgp ( x = a i , y = b i )
Mutual information between stochastic variable x and the y is defined as:
I(x,y)=H(x)+H(y)-H(x,y)
Be edge entropy sum and deduct combination entropy.
Maximum entropy method (MEM) is characterized in after output u replacing estimation to high-order statistic by a nonlinear function yi=gi of component ground introducing (ui).The criterion of this method is: behind given suitable gi (ui), make output y=[y1, y2 ..., yn] total entropy amount H (y) very big.Here gi (ui) is a reversible dull nonlinear function, and u=Wx.The combination entropy of output signal is
H(y1,...yN)=H(y1)+...+H(yN)-I(y1,...yN)
In the formula: H (yi) respectively exports the destination edge entropy,, and I (y1 ... yN) be their mutual information.The maximization of combination entropy means the maximization with the edge entropy of minimizing of mutual information.To the stochastic variable y1 of bounded ... yN, when mutual information is zero, H (y1 ... yN) reach maximal value, marginal distribution is uniform.
There are two parameters to be used for determining maximum combined entropy, just nonlinear function yi=gi (ui) and weight coefficient W.Behind selected nonlinear function, remaining parameter is exactly W.Differentiate gets to W:
∂ H ( y ) ∂ W = ∂ ∂ W ( - D ( p ( s ) | p ( u ) ) )
Wherein D (.) represents the KL distance.
Define non-linear or evaluation function is
φ ( u ) = - ∂ p ( u ) ∂ u p ( u )
The formula of final iteration is:
W(k+1)=W(k)+μ k[W-T(k)-φ(u(k))x T(k)]
By iteration repeatedly, just obtain separation matrix W after the convergence.
After trying to achieve separation matrix, just can realize the separation of mixed signal by Y=WX, the signal after the separation passes to sound identification module.
The user uses process flow diagram as shown in Figure 3, and what this figure described is the flow process that the user uses telepilot of the present invention.At first, the user says instruction, and such as " adjustment brightness ", what the sensor groups of telepilot received will be the mixed signal of the sound of user's voice instruction and televisor.Mixed signal sends to blind source separation module and carries out the separation of signal through after the digital-to-analog conversion.Signal after the separation passes to sound identification module, and the information after the identification passes to control and respond module, and this is control and the rule of respond module according to setting, the information that response pass is come.If control and the instruction that the respond module affirmation need send then directly send instruction to the conventional func module, if uncertain, then write down current interaction mode, continuation and user carry out alternately.When receiving that " after " adjustment brightness " message, telepilot sends " please adjust brightness ", and the user says instruction once more and " brightens ", and this moment, control then can clearly be instructed with respond module, then sent instruction to the conventional func module.The conventional func module is then finished control to household electrical appliances according to instruction.
To sum up,, utilize speech recognition technology by implementing the embodiment of the invention, can be so that man-machine interaction be more humane.Utilize blind source separate technology, the voice signal that mixes is separated, improve follow-up discrimination
More than the voice recognition remote controller that the embodiment of the invention provided separates based on blind source is described in detail, used specific case herein principle of the present invention and embodiment are set forth, the explanation of above embodiment just is used for helping to understand method of the present invention and core concept thereof; Simultaneously, for one of ordinary skill in the art, according to thought of the present invention, the part that all can change in specific embodiments and applications, in sum, this description should not be construed as limitation of the present invention.

Claims (3)

1. a voice recognition remote controller is characterized in that, comprises conventional remote controller buttons and chip, sensor groups, and D/A converter module, blind source separation module, sound identification module, control and respond module, wherein:
Conventional remote controller buttons and chip make telepilot have the function that general telepilot has, and comprise Menu key, the volume adjusting key ,+/-key, signal emission module;
Sensor groups is one group of poroid microphone that is used for received speech signal on telepilot;
D/A converter module is used to receive the voice signal that biography comes from the input of sensor group of received, and changes into the accessible digital collection signal of digit chip;
Blind source separation module receives the digital collection signal from D/A converter module, by blind source separation algorithm, the Signal Separation of mixing;
Sound identification module receives from the signal after the separation of blind source separation module, identifies useful signal, and sends the voice responsive coded instructions according to the voice that identify to control and respond module;
Control and respond module, preestablish the rule of man-machine interaction, be used for receiving information, send information, and confirmed to send steering order to the telepilot chip after the instruction by loudspeaker by sound identification module.
2. voice recognition remote controller as claimed in claim 1 is characterized in that, the microphone number in the described sensor groups is no less than two.
3. voice recognition remote controller as claimed in claim 2, it is characterized in that, after described blind source separation module is used to receive the multichannel mixed signal, at first carrying out centralization and albefaction handles, iteration optimization separation matrix then, separation signal is tried to achieve by separation matrix in the convergence back, the signal after output separates at last.
CN2010102149949A 2010-06-30 2010-06-30 Voice recognition remote controller Pending CN101882370A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN2010102149949A CN101882370A (en) 2010-06-30 2010-06-30 Voice recognition remote controller

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN2010102149949A CN101882370A (en) 2010-06-30 2010-06-30 Voice recognition remote controller

Publications (1)

Publication Number Publication Date
CN101882370A true CN101882370A (en) 2010-11-10

Family

ID=43054377

Family Applications (1)

Application Number Title Priority Date Filing Date
CN2010102149949A Pending CN101882370A (en) 2010-06-30 2010-06-30 Voice recognition remote controller

Country Status (1)

Country Link
CN (1) CN101882370A (en)

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102957732A (en) * 2011-08-31 2013-03-06 德信互动科技(北京)有限公司 Man-machine interaction system and method
CN103209370A (en) * 2012-01-16 2013-07-17 联想(北京)有限公司 Electronic equipment and method for adjusting file sound parameters output by sound playing device
WO2016187910A1 (en) * 2015-05-22 2016-12-01 西安中兴新软件有限责任公司 Voice-to-text conversion method and device, and storage medium
CN107718992A (en) * 2017-11-14 2018-02-23 上海电机学院 A kind of interactive drawing method and device based on sound intensity value
CN108534297A (en) * 2018-04-16 2018-09-14 奥克斯空调股份有限公司 A kind of intelligent air-conditioning system and control method based on speech recognition
CN108833327A (en) * 2018-03-28 2018-11-16 哈尔滨工程大学 A kind of digital signal modulated and demodulation method and device
CN110021307A (en) * 2019-04-04 2019-07-16 Oppo广东移动通信有限公司 Audio method of calibration, device, storage medium and electronic equipment
CN111383636A (en) * 2019-06-28 2020-07-07 深圳国威电子有限公司 Wireless communication device controlled by voice operation

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1345029A (en) * 2000-09-19 2002-04-17 汤姆森许可贸易公司 Voice-operated method and device for electronic equipment for consumption
CN101426171A (en) * 2007-10-31 2009-05-06 株式会社东芝 Sound field control method and system

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1345029A (en) * 2000-09-19 2002-04-17 汤姆森许可贸易公司 Voice-operated method and device for electronic equipment for consumption
CN101426171A (en) * 2007-10-31 2009-05-06 株式会社东芝 Sound field control method and system

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102957732A (en) * 2011-08-31 2013-03-06 德信互动科技(北京)有限公司 Man-machine interaction system and method
CN103209370A (en) * 2012-01-16 2013-07-17 联想(北京)有限公司 Electronic equipment and method for adjusting file sound parameters output by sound playing device
WO2016187910A1 (en) * 2015-05-22 2016-12-01 西安中兴新软件有限责任公司 Voice-to-text conversion method and device, and storage medium
CN107718992A (en) * 2017-11-14 2018-02-23 上海电机学院 A kind of interactive drawing method and device based on sound intensity value
CN108833327A (en) * 2018-03-28 2018-11-16 哈尔滨工程大学 A kind of digital signal modulated and demodulation method and device
CN108833327B (en) * 2018-03-28 2019-08-16 哈尔滨工程大学 A kind of digital signal modulated and demodulation method and device
CN108534297A (en) * 2018-04-16 2018-09-14 奥克斯空调股份有限公司 A kind of intelligent air-conditioning system and control method based on speech recognition
CN110021307A (en) * 2019-04-04 2019-07-16 Oppo广东移动通信有限公司 Audio method of calibration, device, storage medium and electronic equipment
CN111383636A (en) * 2019-06-28 2020-07-07 深圳国威电子有限公司 Wireless communication device controlled by voice operation

Similar Documents

Publication Publication Date Title
CN101882370A (en) Voice recognition remote controller
CN111223497B (en) Nearby wake-up method and device for terminal, computing equipment and storage medium
CN109087669B (en) Audio similarity detection method and device, storage medium and computer equipment
CN105511608A (en) Intelligent robot based interaction method and device, and intelligent robot
JP2014089437A (en) Voice recognition device, and voice recognition method
CN105304081A (en) Smart household voice broadcasting system and voice broadcasting method
EP2680548A1 (en) Method and apparatus for reducing noise in voices in mobile terminals
EP2661095A1 (en) Method and apparatus for controlling automatic interworking of multiple devices
CN104123932A (en) Voice conversion system and method
CN111429897B (en) Intelligent household system control implementation method
CN106448654A (en) Robot speech recognition system and working method thereof
CN104796177A (en) Bluetooth transceiver, line control earphone module and mobile device module
CN105245993A (en) Automatic earphone volume adjusting method and system, and smart earphone
CN101826324A (en) Intelligent terminal
CN106681160A (en) Method and device for controlling intelligent equipment
CN101436404A (en) Conversational biology-liked apparatus and conversational method thereof
CN102404522B (en) Speech remote control method for television and television
CN1300175A (en) Radio remote control system with microphone/loud speaker for Internet apparatus and method for controlling its telecontroller
CN109561003A (en) A kind of IR remote controller and electrical control system based on acoustic control
CN106653020A (en) Multi-business control method and system for smart sound and video equipment based on deep learning
CN103269445B (en) Intelligent television system control method
CN103297896B (en) A kind of audio-frequency inputting method and electronic equipment
KR100691976B1 (en) Mobile Communication Terminal and Method for Morse Signal and Analysis and conversion
CN104766462A (en) Sound wave remote control system and sound wave remote control method
CN204350220U (en) A kind of remote controller and control system

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C12 Rejection of a patent application after its publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20101110