CN102013254A - Man-machine interactive system and method for digital television voice recognition - Google Patents
Man-machine interactive system and method for digital television voice recognition
- Publication number
- CN102013254A
- Authority
- CN
- China
- Prior art keywords
- module
- voice
- digital television
- order
- speech recognition
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Landscapes
- Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
- Machine Translation (AREA)
Abstract
The invention discloses a man-machine interactive system and method for digital television voice recognition. The system comprises a target voice acquisition module, a voice analysis module, a semantic computation module and an intelligent control module, wherein the target voice acquisition module comprises a signal amplification module, a forward filtering module, a signal sampling module and a data compression and coding module; and the voice analysis module comprises a noise removal module, a feature extraction module and a decoding device. The method comprises the processes of target voice acquisition, voice noise removal, voice recognition processing, command recognition conversion and intelligent control processing. In the invention, through the cooperative work of the modules, a digital TV man-machine interactive technology for anti-interference voice intelligent identification and voice analysis and interaction under the digital TV reverberation acoustic environment of digital home life is achieved, and an advanced digital TV voice language interaction mode is provided.
Description
Technical field
The present invention relates to the field of speech processing and semantic recognition, and to technology for computer-based intelligent analysis, processing, and acquisition of speech. Specifically, it relates to a man-machine interactive system and method for digital television voice recognition.
Background technology
Speech recognition is the technology by which a machine converts a speech signal into the corresponding text or command through recognition and understanding. A speech recognizer captures the input speech, extracts its features, and pattern-matches those features against the speech features stored in a model database, thereby converting the information carried by the speech into text or commands.
Depending on what is to be recognized, speech recognition tasks fall broadly into three classes: isolated word recognition, keyword recognition, and continuous speech recognition. Isolated word recognition identifies words from a vocabulary known in advance. Keyword recognition operates on continuous speech but does not transcribe all of it; it only detects occurrences of a set of known keywords. Continuous speech recognition transcribes a whole sentence or passage.
In the reverberant acoustic environment around a digital television in a real home, noise has a large effect on speech recognition. In this setting, recognition is limited mainly by two factors: noise, and the non-standard, arbitrary nature of spoken interaction. Because noise corrupts the capture and input of the user's voice, the recognizer may misinterpret the speech or miss it altogether. The non-standard and arbitrary phrasing of users also introduces random uncertainty into matching, so that matching errors can cause the meaning of an utterance to be interpreted incorrectly.
The solution is as follows. In the reverberant acoustic environment around a home digital television, where users' spoken interaction is non-standard and arbitrary, keyword recognition is better suited than full continuous speech recognition. Within a user's continuous spoken command, keyword recognition locates the positions of the known keywords, and the command to be executed is interpreted from the positions and combination of those keywords.
Therefore, the present invention proposes a man-machine interactive system and method for digital television voice recognition, whose purpose is to provide an advanced spoken-language interaction mode for digital television.
Summary of the invention
The object of the invention is to address the non-standard and arbitrary nature of spoken interaction in the reverberant acoustic environment around a digital television in a real home, by providing a man-machine interactive system and method for digital television voice recognition.
The man-machine interactive system for digital television voice recognition of the invention consists of a target voice acquisition module, a voice analysis module, a semantic computation module, and an intelligent control module.
The target voice acquisition module comprises one or more microphones, or other input systems for capturing speech. It collects speech information automatically and converts the analog speech information into digital speech information. It comprises a signal amplification module, a forward filtering module, a signal sampling module, and a data compression and coding module;
The voice analysis module processes the speech information. In the reverberant acoustic environment around a digital television in a real home, it extracts the useful speech information, removes the noise, and converts the resulting speech data into text information. It comprises a noise removal module, a feature extraction module, and a decoding module;
The semantic computation module interprets the meaning of the text information produced by the voice analysis module. Through fuzzy information search and spoken Chinese understanding, it extracts features and interprets the speech information as an executable command. It first searches the text information for all words related to commands, using the command information library, and then performs semantic computation on the positions of the command words and their surrounding context to determine which command is to be executed. For the reverberant acoustic environment around a digital television in a real home, the semantic computation module defines a mapping between speech and commands, so that the recognized keyword text is interpreted and converted into commands.
The intelligent control module receives commands from the semantic computation module. When a command can be executed correctly, it executes the command, gives the user feedback through sound, images, and video, and then returns to the target voice acquisition module to continue interacting with the user. When a command is invalid, it notifies the user that the command is invalid and then returns to the target voice acquisition module to wait for the user's next spoken input.
In the above technical scheme, the target voice acquisition module also includes a data compression and coding module, which speeds up transmission and reduces system latency.
In the above technical scheme, the signal sampling module in the target voice acquisition module uses a microcontroller for both control and data processing: the CPU controls the reading of the sampled data and then performs the data compression itself, which meets the speed requirement at relatively low cost.
The voice analysis module of the invention includes a database module that stores spoken Chinese information. Keyword models are built using syllable-based modeling with a hidden Markov model (HMM) topology on top of the acoustic model and language model; the speech is segmented first, and each segment is then decoded.
The semantic computation module includes a database module that stores executable commands and information extraction strategies. This database module has an artificial intelligence self-learning mechanism and a manual control interface. During semantic analysis, ambiguous information can be resolved by manual selection, and the information extraction strategies in the database are refined through artificial intelligence learning, which improves the accuracy of semantic recognition.
In the above scheme, the semantic computation module combines Chinese fuzzy information retrieval with spoken Chinese understanding: fuzzy information retrieval finds the keywords that contain commands, and spoken Chinese understanding then interprets those keywords to obtain the command to be executed.
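The fuzzy retrieval step described above can be illustrated with a minimal sketch. The patent does not specify the retrieval algorithm; the example below assumes approximate string matching with Python's standard `difflib` module, and the command library contents, the `fuzzy_find_command` name, and the English keywords are all hypothetical stand-ins for the Chinese command vocabulary.

```python
import difflib

# Hypothetical command library (keyword -> command identifier); not from the patent.
COMMAND_LIBRARY = {
    "volume up": "VOLUME_UP",
    "volume down": "VOLUME_DOWN",
    "next channel": "CHANNEL_NEXT",
    "power off": "POWER_OFF",
}


def fuzzy_find_command(text, cutoff=0.75):
    """Return the command whose keyword most closely matches `text`, or None.

    `cutoff` is the minimum similarity ratio (0..1) accepted as a match;
    the value 0.75 is an arbitrary assumption for illustration.
    """
    matches = difflib.get_close_matches(
        text.lower(), list(COMMAND_LIBRARY), n=1, cutoff=cutoff
    )
    return COMMAND_LIBRARY[matches[0]] if matches else None
```

With this sketch, a slightly misrecognized phrase such as "volum up" still resolves to the `VOLUME_UP` entry, while text that resembles no keyword yields `None` and would be treated as an invalid command.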
The intelligent control module can control the digital television directly according to the command, or can operate a set-top box according to the command, thereby achieving interaction between the user and the digital television.
In addition, a man-machine interaction method for digital television voice recognition comprises the following steps:
1) Initialization: start the voice recognition man-machine interactive system;
2) Acquire speech information. In the reverberant acoustic environment around a digital television in a real home, if the user wants to interact with the digital television by voice, the target voice acquisition module captures the user's speech. The speech signal is first amplified by an instrumentation amplifier, then forward-filtered by a cascade of a 5th-order Butterworth low-pass filter and a 5th-order Butterworth high-pass filter, and then sampled by an A/D sampling chip at 4 kHz and 8 kHz sampling rates in accordance with the Nyquist criterion. Finally, the data is compressed and encoded to yield digital speech information;
3) Convert the speech information. The speech captured by the target voice acquisition module contains noise; the voice analysis module processes it to extract the user's speech and interpret it as text information. Keywords related to commands are defined with reference to the full set of executable digital television commands; the voice analysis module locates these keywords in the user's continuous speech input and maps them to text information;
4) Semantic understanding. From the resulting text information, the semantic computation module derives the command to be executed. It searches the text information for all command-related words using the command information library, then performs semantic computation on the positions of the command words and their surrounding context to determine the command to be executed;
5) Intelligent control. Given the command derived by the semantic computation module: when the command can be executed correctly, the intelligent control module executes it, interacts with the user through sound, images, and video, and returns to the target voice acquisition module for the next interaction; when the command is invalid, the intelligent control module notifies the user that the command is invalid and then returns to the target voice acquisition module to wait for the user's next spoken input.
The beneficial effects of the invention are as follows:
1. The proposed man-machine interactive system and method for digital television voice recognition realizes spoken-language interaction with a digital television. In the reverberant acoustic environment around a digital television in a real home, the invention provides advanced spoken-language interaction between the user and the digital television, for application in the digital home.
2. In the proposed system and method, keyword models are built using syllable-based modeling with a hidden Markov model (HMM) topology on top of the acoustic model and language model; the speech is segmented first and each segment is then decoded, which makes speech recognition more accurate.
3. In the proposed system and method, semantic understanding uses interactive operation and artificial intelligence learning: all command-related words are searched for in the text information using the command information library, and semantic computation is then performed on the positions of the command words and their surrounding context, which makes semantic judgment more accurate and faster.
4. In the proposed system and method, a mapping between speech and commands is defined for the reverberant acoustic environment around a digital television in a real home, so that the system adapts better to non-standard and arbitrary speech.
Description of drawings
Fig. 1 is a block diagram of the overall system modules of the invention;
Fig. 2 is a flowchart of the operation of the method of the invention;
Fig. 3 is a flowchart of voice acquisition in the invention;
Fig. 4 is a flowchart of voice analysis in the invention.
Embodiment
The invention is described below with reference to the accompanying drawings.
As shown in Fig. 1, a man-machine interactive system for digital television voice recognition comprises a target voice acquisition module, a voice analysis module, a semantic computation module, and an intelligent control module. The target voice acquisition module comprises a signal amplification module, a forward filtering module, a signal sampling module, and a data compression and coding module. The voice analysis module comprises a noise removal module, a feature extraction module, and a decoding module.
The function of each module is described as follows:
1. Target voice acquisition module: one or more microphones, or other input systems for capturing speech, which collect speech information automatically and convert the analog speech information into digital speech information. After conversion, the data is transferred to the voice analysis module for recognition.
1) Signal amplification module: in the reverberant acoustic environment around a digital television in a real home, the speech signal picked up by the microphone is weak, so the small signal must be isolated and amplified to strengthen the speech signal.
2) Forward filtering module: filters the sound to remove noise and emphasize the speech signal.
3) Signal sampling module: samples and converts the analog speech signal, using a microcontroller for the computation, so that the analog speech information becomes digital speech information.
4) Data compression and coding module: compresses and encodes the sampled digital speech information, which eases storage and transmission and increases transmission speed.
2. Voice analysis module: in the reverberant acoustic environment around a digital television in a real home, extracts the useful speech information, removes the noise, and converts the resulting speech data into text information. Keywords related to commands are defined with reference to the full set of executable digital television commands; the voice analysis module locates these keywords in the user's continuous speech input, maps them to text information, and passes the text to the semantic computation module.
1) Noise removal module: applies Wiener filtering to the digital speech information to remove noise, so that the digital speech information is less affected by noise and represents the speech more accurately.
2) Feature extraction module: extracts speech features from the digital speech information and segments the speech into meaningful units according to the different combinations of speech sounds.
3) Decoding module: performs speech recognition decoding on the segmented speech information and, once decoding is complete, converts the speech information into text information.
3. Semantic computation module: interprets the meaning of the text information produced by the voice analysis module. Through fuzzy information search and spoken Chinese understanding, it extracts features and interprets the speech information as an executable command. The interpreted command is passed to the intelligent control module for execution.
4. Intelligent control module: receives commands from the semantic computation module. When a command can be executed correctly, it executes the command, gives the user feedback through sound, images, and video, and then returns to the target voice acquisition module to continue interacting with the user. When a command is invalid, it notifies the user that the command is invalid and then returns to the target voice acquisition module to wait for the user's next spoken input.
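The noise removal module above applies Wiener filtering, but the text does not say which form. As a minimal sketch, assuming a per-frequency-bin power-spectrum formulation with a separately estimated noise power (both assumptions, as are the function and parameter names), the gain applied to each bin could be computed as follows:

```python
def wiener_gains(noisy_power, noise_power, floor=0.0):
    """Per-bin Wiener gain G = max(P_y - P_n, 0) / P_y.

    noisy_power: power of the noisy speech in each frequency bin (P_y).
    noise_power: estimated noise power in each bin (P_n), e.g. measured
                 during a speech pause (an assumption, not from the patent).
    floor:       minimum gain, to avoid zeroing bins completely.
    """
    gains = []
    for p_y, p_n in zip(noisy_power, noise_power):
        if p_y <= 0.0:
            gains.append(floor)
            continue
        p_s = max(p_y - p_n, 0.0)  # estimated clean-speech power
        gains.append(max(p_s / p_y, floor))
    return gains
```

Bins dominated by speech keep a gain close to 1, while bins where the estimated noise power equals or exceeds the observed power are attenuated to the floor. Multiplying each bin of the noisy spectrum by its gain gives the denoised spectrum.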
Fig. 2 shows the operational flowchart of the man-machine interactive system for digital television voice recognition.
The operating process consists of the following steps:
1) Initialization: start the voice recognition man-machine interactive system;
2) Acquire speech information. In the reverberant acoustic environment around a digital television in a real home, if the user wants to interact with the digital television by voice, the target voice acquisition module captures the user's speech. First, the signal amplification module amplifies the speech signal with an instrumentation amplifier; next, the forward filtering module filters it with a cascade of a 5th-order Butterworth low-pass filter and a 5th-order Butterworth high-pass filter; then the A/D sampling chip in the signal sampling module samples the signal at 4 kHz and 8 kHz sampling rates in accordance with the Nyquist criterion. Finally, the data compression and coding module compresses and encodes the data to yield digital speech information;
3) Convert the speech information. The speech captured by the target voice acquisition module contains noise; the voice analysis module processes it to extract the user's speech and interpret it as text information. First, the noise removal module removes noise from the digital speech. Keywords related to commands are defined with reference to the full set of executable digital television commands; the feature extraction module locates these keywords in the user's continuous speech input, and the decoding module maps them to text information;
4) Semantic understanding. From the resulting text information, the semantic computation module derives the command to be executed. It searches the text information for all command-related words using the command information library, then performs semantic computation on the positions of the command words and their surrounding context to determine the command to be executed;
5) Intelligent control. Given the command derived by the semantic computation module: when the command can be executed correctly, the intelligent control module executes it, interacts with the user through sound, images, and video, and returns to the target voice acquisition module for the next interaction; when the command is invalid, the intelligent control module notifies the user that the command is invalid and then returns to the target voice acquisition module to wait for the user's next spoken input.
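Steps 4) and 5) above can be sketched in a few lines. The patent gives no concrete command library or semantic rule, so everything named here is hypothetical: the `TARGETS` and `ACTIONS` tables, the English keywords, and the rule of pairing the first target keyword with the action keyword closest to it by word position.

```python
# Hypothetical command information library, split into what to control
# and what to do with it; not taken from the patent.
TARGETS = {"volume": "VOLUME", "channel": "CHANNEL"}
ACTIONS = {"up": "UP", "down": "DOWN", "louder": "UP"}


def interpret(text):
    """Derive a command from keyword positions, or None if no command is found."""
    words = text.lower().split()
    # Record (position, meaning) for every known keyword in the utterance.
    targets = [(i, TARGETS[w]) for i, w in enumerate(words) if w in TARGETS]
    actions = [(i, ACTIONS[w]) for i, w in enumerate(words) if w in ACTIONS]
    if not targets or not actions:
        return None
    t_pos, target = targets[0]
    # Pair the target with the action keyword nearest to it in the sentence.
    _, action = min(actions, key=lambda a: abs(a[0] - t_pos))
    return f"{target}_{action}"


def handle(text):
    """Execute a valid command or report an invalid one (step 5)."""
    command = interpret(text)
    if command is None:
        return "invalid command, please repeat"
    return f"executing {command}"
```

For example, `handle("please turn the volume up a bit")` reports that `VOLUME_UP` is being executed, while an utterance with no command keywords falls through to the invalid-command prompt, after which a real system would return to voice acquisition.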
Fig. 3 shows the voice acquisition flowchart of the man-machine interactive system for digital television voice recognition. During voice acquisition, the input analog speech signal is first amplified by an instrumentation amplifier, then forward-filtered by a cascade of a 5th-order Butterworth low-pass filter and a 5th-order Butterworth high-pass filter, and then sampled by an A/D sampling chip at 4 kHz and 8 kHz sampling rates in accordance with the Nyquist criterion. Finally, the data is compressed and encoded to yield digital speech information.
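The effect of the cascaded filter and the sampling-rate constraint can be checked numerically. The sketch below uses the standard analog Butterworth magnitude response, |H(f)| = 1 / sqrt(1 + (f/fc)^(2n)) for a low-pass filter of order n and the mirrored form for a high-pass filter. The cutoff frequencies of 300 Hz and 3400 Hz are assumptions (a common telephone speech band); the patent does not state them, and the function names are illustrative.

```python
import math


def butter_lowpass_mag(f, fc, order=5):
    """Magnitude response of an analog Butterworth low-pass filter at frequency f."""
    return 1.0 / math.sqrt(1.0 + (f / fc) ** (2 * order))


def butter_highpass_mag(f, fc, order=5):
    """Magnitude response of an analog Butterworth high-pass filter at frequency f."""
    if f == 0:
        return 0.0  # a high-pass filter blocks DC completely
    return 1.0 / math.sqrt(1.0 + (fc / f) ** (2 * order))


def bandpass_mag(f, f_low=300.0, f_high=3400.0, order=5):
    """Cascade of high-pass and low-pass stages: the magnitudes multiply."""
    return butter_highpass_mag(f, f_low, order) * butter_lowpass_mag(f, f_high, order)


def nyquist_ok(sample_rate, max_signal_freq):
    """Nyquist criterion: the sampling rate must be at least twice the highest frequency."""
    return sample_rate >= 2.0 * max_signal_freq
```

Under these assumed cutoffs, a 1 kHz speech component passes almost unchanged while 50 Hz mains hum is strongly attenuated. The Nyquist check also shows that an 8 kHz sampling rate covers a 3400 Hz upper cutoff, whereas a 4 kHz rate would only be adequate if the signal were band-limited to 2 kHz or below.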
Fig. 4 shows the voice analysis flowchart of the man-machine interactive system for digital television voice recognition. During voice analysis, the input speech data is first passed through a Wiener filter to remove noise and obtain accurate user speech information. Acoustic features are then extracted based on the characteristics of spoken Chinese, and the features are decoded with the Viterbi algorithm against a set of pre-trained speech models. Finally, the decoded information is matched to words to generate text information.
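The Viterbi decoding step can be illustrated on a toy hidden Markov model. This is a generic textbook implementation, not the patent's decoder: the two states, the three observation symbols, and all probabilities in the usage note are invented for illustration, whereas a real system would use trained syllable models and acoustic feature vectors.

```python
def viterbi(observations, states, start_p, trans_p, emit_p):
    """Return the most probable state sequence for the observations."""
    # best[t][s] = (probability of the best path ending in s at time t,
    #               predecessor state on that path)
    best = [{s: (start_p[s] * emit_p[s][observations[0]], None) for s in states}]
    for obs in observations[1:]:
        prev = best[-1]
        column = {}
        for s in states:
            # Choose the predecessor that maximizes the path probability into s.
            prob, back = max((prev[p][0] * trans_p[p][s], p) for p in states)
            column[s] = (prob * emit_p[s][obs], back)
        best.append(column)
    # Backtrack from the most probable final state.
    state = max(states, key=lambda s: best[-1][s][0])
    path = [state]
    for column in reversed(best[1:]):
        state = column[state][1]
        path.append(state)
    path.reverse()
    return path
```

As a usage example with made-up numbers, take states `("sil", "voiced")`, start probabilities 0.6 and 0.4, transitions sil→sil 0.7, sil→voiced 0.3, voiced→sil 0.4, voiced→voiced 0.6, and emission probabilities over the symbols `low`, `mid`, `high` of 0.5, 0.4, 0.1 for `sil` and 0.1, 0.3, 0.6 for `voiced`. Decoding the sequence `low, mid, high` then yields `sil, sil, voiced`. For long utterances, log probabilities are normally used instead of products to avoid numerical underflow.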
Claims (9)
1. A man-machine interactive system for digital television voice recognition, characterized in that it comprises:
a target voice acquisition module, which collects speech information automatically and converts analog speech information into digital speech information;
a voice analysis module, which processes the speech information: in the reverberant acoustic environment around a digital television in a real home, it extracts the useful speech information, removes the noise, and converts the resulting speech data into text information;
a semantic computation module, which interprets the meaning of the text information produced by the voice analysis module and interprets the speech information as an executable command;
an intelligent control module, which receives commands from the semantic computation module and executes them.
2. The man-machine interactive system for digital television voice recognition according to claim 1, characterized in that the target voice acquisition module further comprises a signal amplification module, a forward filtering module, a signal sampling module, and a data compression and coding module.
3. The man-machine interactive system for digital television voice recognition according to claim 2, characterized in that the signal sampling module uses a microcontroller for both control and data processing.
4. The man-machine interactive system for digital television voice recognition according to claim 1, characterized in that the voice analysis module further comprises a noise removal module, a feature extraction module, and a decoding module.
5. The man-machine interactive system for digital television voice recognition according to claim 1, characterized in that the voice analysis module includes a database module that stores spoken Chinese information.
6. The man-machine interactive system for digital television voice recognition according to claim 1, characterized in that the semantic computation module includes a database module that stores executable commands and information extraction strategies, and that this database module has an artificial intelligence self-learning mechanism and a manual control interface.
7. The man-machine interactive system for digital television voice recognition according to claim 1 or 5, characterized in that the semantic computation module combines Chinese fuzzy information retrieval with spoken Chinese understanding.
8. The man-machine interactive system for digital television voice recognition according to claim 1, characterized in that the intelligent control module can control the digital television directly according to the command.
9. A man-machine interaction method for digital television voice recognition, characterized in that it comprises the following steps:
1) initialization: start the voice recognition man-machine interactive system;
2) acquire speech information: in the reverberant acoustic environment around a digital television in a real home, if the user wants to interact with the digital television by voice, the target voice acquisition module captures the user's speech;
3) convert the speech information: the speech captured by the target voice acquisition module contains noise, and the voice analysis module processes it to extract the user's speech and interpret it as text information;
4) semantic understanding: from the resulting text information, the semantic computation module derives the command to be executed;
5) intelligent control: given the command derived by the semantic computation module, when the command can be executed correctly, the intelligent control module executes it, interacts with the user through sound, images, and video, and returns to the target voice acquisition module for the next interaction; when the command is invalid, the intelligent control module notifies the user that the command is invalid and then returns to the target voice acquisition module to wait for the user's next spoken input.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN 201010549953 CN102013254A (en) | 2010-11-17 | 2010-11-17 | Man-machine interactive system and method for digital television voice recognition |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN 201010549953 CN102013254A (en) | 2010-11-17 | 2010-11-17 | Man-machine interactive system and method for digital television voice recognition |
Publications (1)
Publication Number | Publication Date |
---|---|
CN102013254A (en) | 2011-04-13 |
Family
ID=43843399
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN 201010549953 Pending CN102013254A (en) | 2010-11-17 | 2010-11-17 | Man-machine interactive system and method for digital television voice recognition |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN102013254A (en) |
Cited By (24)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102833634A (en) * | 2012-09-12 | 2012-12-19 | 康佳集团股份有限公司 | Implementation method for television speech recognition function and television |
CN103020102A (en) * | 2011-09-22 | 2013-04-03 | 歌乐株式会社 | Information terminal, server device, searching system and corresponding searching method |
CN103108234A (en) * | 2013-02-27 | 2013-05-15 | 康佳集团股份有限公司 | Method and system for controlling television through handwritten contents |
CN103702231A (en) * | 2013-11-29 | 2014-04-02 | 康佳集团股份有限公司 | Method and system for inputting characters by external equipment |
CN104240700A (en) * | 2014-08-26 | 2014-12-24 | 智歌科技(北京)有限公司 | Global voice interaction method and system for vehicle-mounted terminal device |
CN105137789A (en) * | 2015-08-28 | 2015-12-09 | 青岛海尔科技有限公司 | Control method and device of intelligent IoT electrical appliances, and related devices |
WO2016112634A1 (en) * | 2015-01-12 | 2016-07-21 | 芋头科技(杭州)有限公司 | Voice recognition system and method of robot system |
CN106297782A (en) * | 2016-07-28 | 2017-01-04 | 北京智能管家科技有限公司 | A kind of man-machine interaction method and system |
CN106328166A (en) * | 2016-08-31 | 2017-01-11 | 上海交通大学 | Man-machine dialogue anomaly detection system and method |
CN106369773A (en) * | 2016-11-15 | 2017-02-01 | 北京小米移动软件有限公司 | Method and device for controlling air supply of air conditioner |
CN106558309A (en) * | 2015-09-28 | 2017-04-05 | 中国科学院声学研究所 | A kind of spoken dialog strategy-generating method and spoken dialog method |
CN107481716A (en) * | 2017-07-31 | 2017-12-15 | 合肥上量机械科技有限公司 | Auxiliary input system for computer voice |
CN107645677A (en) * | 2017-09-26 | 2018-01-30 | 深圳市九洲电器有限公司 | Will of the people collection method and system |
CN107910002A (en) * | 2017-12-20 | 2018-04-13 | 北京工业大学 | A kind of man machine language's graphical interaction system and method |
CN108198552A (en) * | 2018-01-18 | 2018-06-22 | 深圳市大疆创新科技有限公司 | A kind of sound control method and video glass |
CN108257593A (en) * | 2017-12-29 | 2018-07-06 | 深圳和而泰数据资源与云技术有限公司 | A kind of audio recognition method, device, electronic equipment and storage medium |
CN109164414A (en) * | 2018-09-07 | 2019-01-08 | 深圳市天博智科技有限公司 | Localization method, device and storage medium based on microphone array |
CN112350908A (en) * | 2020-11-10 | 2021-02-09 | 珠海格力电器股份有限公司 | Control method and device of intelligent household equipment |
CN112420052A (en) * | 2020-11-18 | 2021-02-26 | 青岛海尔科技有限公司 | Device control method, device, storage medium, and electronic apparatus |
CN112435658A (en) * | 2020-12-18 | 2021-03-02 | 中国南方电网有限责任公司 | Human-computer interaction system for natural language processing dialogue exchange based on corpus |
CN112735410A (en) * | 2020-12-25 | 2021-04-30 | 中国人民解放军63892部队 | Automatic voice interactive force model control method and system |
CN113223518A (en) * | 2021-04-16 | 2021-08-06 | 讯飞智联科技(江苏)有限公司 | Human-computer interaction method of edge computing gateway based on AI (Artificial Intelligence) voice analysis |
WO2023216414A1 (en) * | 2022-05-13 | 2023-11-16 | 深圳创维-Rgb电子有限公司 | Speech interaction system and speech interaction method |
CN117672221A (en) * | 2023-12-14 | 2024-03-08 | 深圳市燊元软件科技有限公司 | Information transmission communication control method and system through voice control |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101807395A (en) * | 2010-02-26 | 2010-08-18 | 中山大学 | Method for controlling intelligent terminal via voice |
CN101826324A (en) * | 2010-02-26 | 2010-09-08 | 中山大学 | Intelligent terminal |
CN101867742A (en) * | 2010-05-21 | 2010-10-20 | 中山大学 | Television system based on sound control |
-
2010
- 2010-11-17 CN CN 201010549953 patent/CN102013254A/en active Pending
Cited By (32)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103020102A (en) * | 2011-09-22 | 2013-04-03 | 歌乐株式会社 | Information terminal, server device, searching system and corresponding searching method |
CN103020102B (en) * | 2011-09-22 | 2019-07-09 | 歌乐株式会社 | Server unit, searching system and its search method |
CN102833634A (en) * | 2012-09-12 | 2012-12-19 | 康佳集团股份有限公司 | Implementation method for television speech recognition function and television |
CN103108234B (en) * | 2013-02-27 | 2018-06-26 | 康佳集团股份有限公司 | A kind of method and system controlled by handwritten content TV |
CN103108234A (en) * | 2013-02-27 | 2013-05-15 | 康佳集团股份有限公司 | Method and system for controlling television through handwritten contents |
CN103702231A (en) * | 2013-11-29 | 2014-04-02 | 康佳集团股份有限公司 | Method and system for inputting characters by external equipment |
CN104240700A (en) * | 2014-08-26 | 2014-12-24 | 智歌科技(北京)有限公司 | Global voice interaction method and system for vehicle-mounted terminal device |
WO2016112634A1 (en) * | 2015-01-12 | 2016-07-21 | 芋头科技(杭州)有限公司 | Voice recognition system and method of robot system |
CN105137789A (en) * | 2015-08-28 | 2015-12-09 | 青岛海尔科技有限公司 | Control method and device of intelligent IoT electrical appliances, and related devices |
CN106558309B (en) * | 2015-09-28 | 2019-07-09 | 中国科学院声学研究所 | Spoken dialog strategy generation method and spoken dialog method |
CN106558309A (en) * | 2015-09-28 | 2017-04-05 | 中国科学院声学研究所 | Spoken dialog strategy generation method and spoken dialog method |
CN106297782A (en) * | 2016-07-28 | 2017-01-04 | 北京智能管家科技有限公司 | Man-machine interaction method and system |
CN106328166A (en) * | 2016-08-31 | 2017-01-11 | 上海交通大学 | Man-machine dialogue anomaly detection system and method |
CN106369773A (en) * | 2016-11-15 | 2017-02-01 | 北京小米移动软件有限公司 | Method and device for controlling air supply of air conditioner |
CN107481716A (en) * | 2017-07-31 | 2017-12-15 | 合肥上量机械科技有限公司 | Auxiliary input system for computer voice |
CN107645677B (en) * | 2017-09-26 | 2020-07-07 | 深圳市九洲电器有限公司 | Method and system for collecting public opinion |
CN107645677A (en) * | 2017-09-26 | 2018-01-30 | 深圳市九洲电器有限公司 | Method and system for collecting public opinion |
CN107910002A (en) * | 2017-12-20 | 2018-04-13 | 北京工业大学 | Man-machine voice and graphic interaction system and method |
CN108257593A (en) * | 2017-12-29 | 2018-07-06 | 深圳和而泰数据资源与云技术有限公司 | Voice recognition method and device, electronic equipment and storage medium |
CN108257593B (en) * | 2017-12-29 | 2020-11-13 | 深圳和而泰数据资源与云技术有限公司 | Voice recognition method and device, electronic equipment and storage medium |
CN108198552A (en) * | 2018-01-18 | 2018-06-22 | 深圳市大疆创新科技有限公司 | Sound control method and video glasses |
CN109164414A (en) * | 2018-09-07 | 2019-01-08 | 深圳市天博智科技有限公司 | Localization method, device and storage medium based on microphone array |
CN112350908B (en) * | 2020-11-10 | 2021-11-23 | 珠海格力电器股份有限公司 | Control method and device of intelligent household equipment |
CN112350908A (en) * | 2020-11-10 | 2021-02-09 | 珠海格力电器股份有限公司 | Control method and device of intelligent household equipment |
CN112420052A (en) * | 2020-11-18 | 2021-02-26 | 青岛海尔科技有限公司 | Device control method, device, storage medium, and electronic apparatus |
CN112435658A (en) * | 2020-12-18 | 2021-03-02 | 中国南方电网有限责任公司 | Human-computer interaction system for natural language processing dialogue exchange based on corpus |
CN112735410A (en) * | 2020-12-25 | 2021-04-30 | 中国人民解放军63892部队 | Automatic voice interactive force model control method and system |
CN112735410B (en) * | 2020-12-25 | 2024-06-07 | 中国人民解放军63892部队 | Automatic voice interactive force model control method and system |
CN113223518A (en) * | 2021-04-16 | 2021-08-06 | 讯飞智联科技(江苏)有限公司 | Human-computer interaction method of edge computing gateway based on AI (Artificial Intelligence) voice analysis |
CN113223518B (en) * | 2021-04-16 | 2024-03-22 | 讯飞智联科技(江苏)有限公司 | Human-computer interaction method of edge computing gateway based on AI voice analysis |
WO2023216414A1 (en) * | 2022-05-13 | 2023-11-16 | 深圳创维-Rgb电子有限公司 | Speech interaction system and speech interaction method |
CN117672221A (en) * | 2023-12-14 | 2024-03-08 | 深圳市燊元软件科技有限公司 | Information transmission communication control method and system through voice control |
Similar Documents
Publication | Title | Publication Date |
---|---|---|
CN102013254A (en) | Man-machine interactive system and method for digital television voice recognition | |
CN105632486B (en) | Voice awakening method and device of intelligent hardware | |
CN111048082B (en) | Improved end-to-end speech recognition method | |
CN108829894B (en) | Spoken word recognition and semantic recognition method and device | |
CN111223483A (en) | Lip language identification method based on multi-granularity knowledge distillation | |
CN111090727B (en) | Language conversion processing method and device and dialect voice interaction system | |
CN104036774A (en) | Method and system for recognizing Tibetan dialects | |
CN110070065A (en) | Sign language system and communication method based on vision and speech intelligence | |
CN101604522B (en) | Embedded Chinese-English mixed voice recognition method and system for non-specific people | |
CN103730115A (en) | Method and device for detecting keywords in voice | |
CN104078044A (en) | Mobile terminal and sound recording search method and device of mobile terminal | |
CN101604520A (en) | Spoken language voice recognition method based on statistical model and syntax rule | |
CN110210416B (en) | Sign language recognition system optimization method and device based on dynamic pseudo tag decoding | |
CN103810998A (en) | Method for off-line speech recognition based on mobile terminal device and achieving method | |
CN105654947B (en) | Method and system for acquiring road condition information in traffic broadcast voice | |
CN111653270B (en) | Voice processing method and device, computer readable storage medium and electronic equipment | |
CN111046148A (en) | Intelligent interaction system and intelligent customer service robot | |
CN111192572A (en) | Semantic recognition method, device and system | |
CN101645270A (en) | Bidirectional speech recognition processing system and method | |
CN109686365A (en) | Speech recognition method and speech recognition system | |
CN102141812A (en) | Robot | |
CN110232918B (en) | Unmanned aerial vehicle ground control station voice control system and control method | |
CN104424942A (en) | Method for improving character speed input accuracy | |
CN116534700A (en) | Control system and method for stair climbing machine | |
CN110619877A (en) | Voice recognition man-machine interaction method, device and system applied to laser pen and storage medium |
Legal Events
Code | Title | Description | Date |
---|---|---|---|
C06 | Publication | | |
PB01 | Publication | | |
C10 | Entry into substantive examination | | |
SE01 | Entry into force of request for substantive examination | | |
DD01 | Delivery of document by public notice | Addressee: Guangdong ZSU Telecommunication Information Co., Ltd. Xue Kaijun; Document name: Notification of Publication and of Entering the Substantive Examination Stage of the Application for Invention | |
DD01 | Delivery of document by public notice | Addressee: Guangdong ZSU Telecommunication Information Co., Ltd. Xue Kaijun; Document name: Decision of Rejection | |
C12 | Rejection of a patent application after its publication | | |
RJ01 | Rejection of invention patent application after publication | Application publication date: 20110413 | |