CN103024530A - Intelligent television voice response system and method - Google Patents

Intelligent television voice response system and method

Info

Publication number
CN103024530A
Authority
CN
China
Prior art keywords
user
voice
module
sent
intelligent television
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN2012105532157A
Other languages
Chinese (zh)
Inventor
常连城
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Tianjin Samsung Electronics Co Ltd
Samsung Electronics Co Ltd
Original Assignee
Tianjin Samsung Electronics Co Ltd
Samsung Electronics Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Tianjin Samsung Electronics Co Ltd and Samsung Electronics Co Ltd
Priority to CN2012105532157A
Publication of CN103024530A

Landscapes

  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)

Abstract

The invention discloses an intelligent television voice response system comprising a user identity feature identification module, a voice command recognition module, and an intelligent response module. It further discloses an intelligent television voice response method comprising the following steps: first, obtaining the user's identity features; second, performing speech recognition and deriving, according to the identity features, a command that matches them; and third, matching the command against a knowledge base to obtain response information for the user and feeding the response information back. The system and method identify the user's identity features and, at the same time, interpret the user's intent from context, so that the user's voice input receives an accurate response. This improves the accuracy of intelligent television voice responses, spares the user tedious repeated selection steps, saves time, raises user satisfaction, and makes the television more user-friendly to operate.

Description

Intelligent television voice response system and method
Technical field
The present invention relates to the field of intelligent television, and in particular to an intelligent television voice response method and system.
Background art
As televisions become intelligent and intelligent televisions become widespread, broadband cable television networks integrate technologies such as the Internet, multimedia, and communications, and provide home users with a variety of interactive services, including digital television. Intelligent televisions are well suited to the rapid development of networks and make full, effective use of network resources. However, as television sets become more intelligent, operating them with buttons becomes more difficult and more tedious. Introducing speech recognition into the operating system of an intelligent television, so that control commands can be issued by voice, frees the user's hands and lets operations be completed simply by speaking. This is a direction in which the intelligent television field is developing.
Speech recognition has been one of the ten major areas of scientific and technological development in information technology since 2000. It is an interdisciplinary field and is gradually becoming a key technology for human-machine interfaces. Speech recognition has now matured to the point that small- and medium-vocabulary speaker-independent recognition systems in particular achieve recognition accuracy above 98%, which meets the requirements of common applications. Thanks to advances in large-scale integrated circuits, these complex systems can also be implemented as dedicated chips and mass-produced. In economically developed Western countries, many speech recognition products have entered the market and the service sector. Some private branch exchanges, telephones, and mobile phones include voice dialing, and products such as voice notebooks and intelligent talking toys include speech recognition and speech synthesis. People can query flight, travel, and banking information over the telephone network through spoken dialogue systems, with good results. Surveys show that more than 85% of people are satisfied with the performance of speech-recognition information query services. It can be expected that speech recognition systems will be applied more widely over the next five to ten years and that a variety of speech recognition products will come to market.
As a further example, invention patent No. 201010255337.9 discloses a voice-command-based audio/video playback method and system in the field of media playback. After pressing a single start key, the user can perform all operations on the terminal device by voice command. When the user presses the start key of a one-key control device mounted on a fixed part of a vehicle, the terminal device establishes a voice session with a VSP server and the system enters an automatic answering state. The VSP server uses speaker-independent speech recognition to parse the user's voice command and sends the parsing result to the terminal device, which starts its audio/video playback module according to that result and obtains the audio/video stream from the playback address. That solution is mainly intended for entertainment equipment, especially in-vehicle entertainment devices.
When speech recognition is applied to intelligent television, existing voice response systems and methods take one of two forms. The first uses a server mode: the speech recognition module resides on a server, so every voice control command issued by the user must be sent through the set-top box to the server for recognition, and the server then returns the recognized operating command to the set-top box for execution. This inevitably consumes transmission bandwidth, lengthens operation time, and makes voice operation less responsive. The second uses an embedded speech recognition mode: the speech recognition software and models are written into the memory of the intelligent television, and recognition is completed on the terminal. Because the operating commands of an intelligent television are relatively fixed, the vocabulary to be recognized is not large and does not occupy much memory. Embedded mode therefore needs less recognition time than server mode and lets the user complete operations faster.
As society develops, the amount of information keeps growing. People are surrounded by information every day and cannot keep track of all of it, so information queries are increasingly necessary. For example, a user may not want to wait for a scheduled television broadcast to see the weather and would rather query it at any time; searching on a PC or mobile phone strikes some people as cumbersome, since it still requires typing even at home. Users therefore want querying to be as simple as possible, and an intelligent response system is a natural choice: the user only has to say what information is wanted, and the system replies with the answer. However, the meaning that language can express is open-ended. The same command means different things in different contexts, and users differ in age and gender; these factors strongly affect what a given command is meant to express. For example, when a user issues the command "select film", the system presents a film list for further selection, but it cannot accurately predict which type of film the user is likely to want. Films of that type are therefore not listed first, and the user has to keep searching through a long list.
Summary of the invention
The object of the present invention is to overcome the defects of the prior art by providing an intelligent television voice response method and system that respond intelligently to the user's voice commands.
To solve the above problems, an intelligent television voice response system of the present invention comprises:
A user identity feature identification module, connected to the voice command recognition module, for obtaining the user's identity features and sending the obtained identity feature information to the voice command recognition module;
A voice command recognition module, connected to the intelligent response module, for receiving the user's speech, recognizing it, performing semantic recognition according to the identity feature information sent by the user identity feature identification module to derive a command that matches the user's identity features, and sending that command to the intelligent response module;
An intelligent response module, for receiving the command matching the user's identity features sent by the voice command recognition module, matching it against a knowledge base to obtain response information for the user, and feeding the response information back.
The user identity feature identification module comprises:
An image acquisition unit, connected to the image analyzing unit, for capturing images of the user and sending the captured images to the image analyzing unit;
An image analyzing unit, connected to the voice command recognition module, for receiving the user images sent by the image acquisition unit, analyzing them to derive the user's identity feature information, and sending that information to the voice command recognition module.
The image analyzing unit comprises:
A face-recognition-based age estimation unit, for analyzing the user's facial image to derive the user's age.
The image analyzing unit comprises:
A face-recognition-based gender judging unit, for analyzing the user's facial image to derive the user's gender.
The voice command recognition module comprises:
A voice collecting unit, connected to the speech analysis unit, for collecting the user's speech and sending it to the speech analysis unit;
A speech analysis unit, connected to the semantic analysis unit, for receiving the speech sent by the voice collecting unit, analyzing it to obtain the corresponding text, and sending the text to the semantic analysis unit;
A semantic analysis unit, connected to both the user identity feature identification module and the intelligent response module, for receiving the text sent by the speech analysis unit and the identity feature information sent by the user identity feature identification module, retrieving from the command library corresponding to the text the command that matches the identity feature information, and sending it to the intelligent response module.
The intelligent television voice response system further comprises:
A TTS module, connected to the intelligent response module, for converting the response information derived by the intelligent response module from text format to audio format and outputting it.
An intelligent television voice response method comprises the following steps:
1) the user identity feature identification module obtains the user's identity features and sends the obtained identity feature information to the voice command recognition module;
2) the voice command recognition module receives the user's speech, recognizes it, performs semantic recognition according to the identity feature information sent by the user identity feature identification module to derive a command that matches the user's identity features, and sends that command to the intelligent response module;
3) the intelligent response module receives the command matching the user's identity features sent by the voice command recognition module, matches it against a knowledge base to obtain response information for the user, and feeds the response information back.
Step 1) comprises:
11) the image acquisition unit captures images of the user and sends the captured images to the image analyzing unit;
12) the image analyzing unit receives the user images sent by the image acquisition unit, analyzes them to derive the user's identity feature information, and sends that information to the voice command recognition module.
Step 12) comprises the following steps:
121) image preprocessing: normalizing the position and size of the face, and the image itself, in the user image;
122) image region division and training: dividing the face in the user image into a plurality of recognition regions, each of which outputs its own gray-scale image and binarized image;
123) regional facial feature value template matching: subdividing each recognition region into different template matching units, computing the feature value of each template matching unit against its matching template, feeding these feature values as input neurons into the input layer of a neural network, and obtaining through the neural network the image result with the best degree of match;
124) performing a face weighted calculation on the image results;
125) feeding the image recognition result back to the knowledge base.
Step 2) comprises:
21) the voice collecting unit collects the user's speech and sends it to the speech analysis unit;
22) the speech analysis unit receives the speech sent by the voice collecting unit, analyzes it to obtain the corresponding text, and sends the text to the semantic analysis unit;
23) the semantic analysis unit receives the text sent by the speech analysis unit and the identity feature information sent by the user identity feature identification module, retrieves from the command library corresponding to the text the command that matches the identity feature information, and sends it to the intelligent response module.
The user identity features comprise the user's age and/or gender.
With the intelligent television voice response system and method of the present invention, the user identity feature identification module identifies basic information about the user, such as age and gender. This information gives the voice command recognition module a basis for its decisions. For the spoken command "select film", for instance, the films returned differ according to the result of identity feature identification: if the user is a child, the system returns Disney films; if the user is a young adult, it returns romance films; and if the user is middle-aged, it returns feature films. This greatly simplifies program selection. The invention identifies the user's identity features and also interprets the user's intent from context, so that it responds accurately to the user's voice input. This improves the accuracy of intelligent television voice responses, spares the user tedious repeated selection steps, saves time, raises user satisfaction, and makes the television more user-friendly to operate.
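The "select film" example above can be sketched as a lookup in a command library keyed by the recognized text and a coarse age group. This is only an illustrative sketch: the `Identity` class, the age thresholds, the `COMMAND_LIBRARY` contents, and the command strings are all assumptions made for the example and are not specified in the patent.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Identity:
    """Identity features as estimated from the user's image (hypothetical type)."""
    age: Optional[int] = None      # estimated age in years, if available
    gender: Optional[str] = None   # estimated gender, if available


def age_group(age: Optional[int]) -> str:
    """Map an estimated age to a coarse group. The thresholds are assumed."""
    if age is None:
        return "unknown"
    if age < 13:
        return "child"
    if age < 36:
        return "young"
    return "middle_aged"


# Hypothetical command library: recognized text -> age group -> command.
COMMAND_LIBRARY = {
    "select film": {
        "child": "list_films:disney",
        "young": "list_films:romance",
        "middle_aged": "list_films:feature",
        "unknown": "list_films:all",
    },
}


def resolve_command(text: str, identity: Identity) -> Optional[str]:
    """Pick the command variant that matches the user's identity features."""
    variants = COMMAND_LIBRARY.get(text.strip().lower())
    if variants is None:
        return None  # the text is not a known command
    return variants.get(age_group(identity.age), variants["unknown"])
```

Keeping an "unknown" variant for every command lets the lookup fall back to an unfiltered result when no identity estimate is available.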
Brief description of the drawings
Fig. 1 is a structural block diagram of the intelligent television voice response system of the present invention.
Fig. 2 is a flow diagram of face recognition in the intelligent television voice response system of the present invention.
Fig. 3 is a schematic diagram of the division of the main recognition regions of the human face.
Detailed description of the embodiments
To help those skilled in the art better understand the technical solution of the present invention, the invention is described in further detail below with reference to the drawings and embodiments.
As shown in Fig. 1, an intelligent television voice response system of the present invention comprises a user identity feature identification module, a voice command recognition module, and an intelligent response module.
The user identity feature identification module is connected to the voice command recognition module. It obtains the user's identity features and sends the obtained identity feature information to the voice command recognition module.
The user identity feature identification module comprises an image acquisition unit and an image analyzing unit.
The image acquisition unit is connected to the image analyzing unit. It captures images of the user and sends the captured images to the image analyzing unit. The image acquisition unit comprises three cameras at the top edge of the television, located at the upper-left corner, the upper-right corner, and the top center, so that frontal and side views of the user's head can be captured from three positions. Compared with a single camera, three cameras capture a more complete image.
The image analyzing unit is connected to the voice command recognition module. It receives the user images sent by the image acquisition unit, analyzes them to derive the user's identity feature information, and sends that information to the voice command recognition module.
The image analyzing unit comprises a face-recognition-based age estimation unit, gender judging unit, or expression judging unit. It analyzes the user's facial image using image processing and fuzzy matching algorithms to derive basic information about the user, such as age, gender, or expression.
As shown in Fig. 2, the analysis of the user's facial image by the image analyzing unit proceeds through the following stages:
1) Image preprocessing stage.
In practice, the image acquisition unit is constrained and disturbed by the external environment, so captured images may contain considerable noise and interference. Face pattern recognition places high demands on image quality, so this has a large effect and reduces classification performance. Preprocessing before feature extraction is therefore essential: it normalizes the position and size of the face, and the image itself, and counteracts factors such as occlusion, pose, illumination, and accessories.
2) Image region division and training stage.
Fig. 3 is a schematic diagram of the division of the main recognition regions of the human face. As shown in Fig. 3, before face template matching the face must be divided into regions, and template matching is then performed on the image within each region. The facial regions are the hair and hairstyle region, forehead region, eyebrow and eye region, nose region, cheekbone and cheek region, mouth region, and chin region. Each region outputs its own gray-scale image and binarized image.
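As a rough illustration of the per-region outputs described in this stage, the sketch below crops a rectangular region out of a gray-level image and derives a binarized image from it. The source does not say how regions are located or which threshold is used, so the rectangular `crop` and the mean-intensity threshold are assumptions; images are plain lists of rows of integer gray levels.

```python
# The seven facial regions named in the description.
REGIONS = (
    "hair", "forehead", "eyebrows_eyes", "nose",
    "cheekbones_cheeks", "mouth", "chin",
)


def crop(gray, top, left, height, width):
    """Return the rectangular sub-image starting at (top, left)."""
    return [row[left:left + width] for row in gray[top:top + height]]


def binarize(gray, threshold=None):
    """Return a 0/1 image; pixels strictly above the threshold become 1.

    If no threshold is given, the mean gray level of the region is used
    (an assumed choice, not one taken from the patent).
    """
    if threshold is None:
        pixels = [p for row in gray for p in row]
        threshold = sum(pixels) / len(pixels)
    return [[1 if p > threshold else 0 for p in row] for row in gray]
```

In such a sketch each region name would map to its own bounding box, and both the cropped gray-scale image and its binarized form would be passed on to template matching.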
3) Regional facial feature value template matching stage.
The facial feature matching algorithm combines a neural network method with template matching. Each recognition region is further subdivided into different template matching units. In addition, inspection items that apply to the face as a whole are included, and different weights are assigned to different inspection items. The current breakdown of inspection items for whole-face recognition is shown in Table 1.
The division of facial regions and template matching units will be adjusted as research deepens and recognition technology develops.
For each template matching item, the feature values are computed against the matching template. Suppose there is a training set {I_i}, where each I_i is a facial image of size m × n (i = 1, 2, ..., N). First, the columns of each image I_i are concatenated to form a column vector of dimension d = m × n. This gives {X_i} (i = 1, 2, ..., N), where X_i is the face vector formed from the i-th facial image. The feature values of the matching template are then computed from the following matrix, in which $\bar{X}$ denotes the mean of the N face vectors:
$$S = \sum_{i=1}^{N} (X_i - \bar{X})(X_i - \bar{X})^{T}$$
The feature values obtained for the matching templates in the target image region serve as the input neurons of the input layer of the neural network. The neural network then runs, and repeated competition and clustering among the neurons yield the result with the best degree of match.
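The matrix S defined above can be computed directly from the flattened face vectors. The sketch below uses only the Python standard library and assumes that the overbar denotes the mean of the face vectors, which is the usual reading but is not spelled out in the source. The function names are illustrative, and the subsequent eigenvalue computation and neural network stage are not shown.

```python
def flatten_columns(image):
    """Stack the columns of an m x n image into one vector of length m * n."""
    rows, cols = len(image), len(image[0])
    return [image[r][c] for c in range(cols) for r in range(rows)]


def scatter_matrix(vectors):
    """Compute S = sum_i (X_i - mean)(X_i - mean)^T as a d x d list of lists."""
    n, d = len(vectors), len(vectors[0])
    mean = [sum(v[k] for v in vectors) / n for k in range(d)]
    s = [[0.0] * d for _ in range(d)]
    for v in vectors:
        diff = [v[k] - mean[k] for k in range(d)]
        for a in range(d):
            for b in range(d):
                s[a][b] += diff[a] * diff[b]  # outer product, accumulated
    return s
```

For realistic image sizes d = m × n is large, so a practical implementation would likely use a numerical library rather than nested Python loops; the loops are kept here only to mirror the formula term by term.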
Table 1: Distribution of face recognition template matching units
[Table 1 appears as an image (BDA00002609865400081) in the original publication and is not reproduced here.]
4) Face weighted calculation stage based on image background and image quality.
After the image result for each region is obtained, a weighted calculation is performed. Let x be the matching result of each region and p its weight; the result c is then the sum of the products of the weights and the matching results.
The formula is as follows:
$$c(i) = \sum_{j=0}^{i} p_j \, x_j$$
The division of facial regions and template matching units will be adjusted as research deepens and recognition technology develops. The weights of the different inspection items will be revised as the sample set and the training data grow.
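The weighted calculation in this stage, a sum of products of weights and regional matching results, can be sketched as follows. The function name and the example values are assumptions for illustration; the patent does not give concrete weights.

```python
def weighted_score(results, weights):
    """Return the sum of weight * matching result over all regions."""
    if len(results) != len(weights):
        raise ValueError("results and weights must have the same length")
    return sum(p * x for p, x in zip(weights, results))


# Illustrative values only: three regional matching results and their weights.
example = weighted_score([1.0, 0.0, 0.5], [0.5, 0.25, 0.25])
```

Because the weights are expected to be revised as the training data grows, it seems natural to pass them in as data rather than fix them in code.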
5) Image recognition result knowledge base feedback stage.
The image recognition result is fed back to the facial feature value template library. This enriches the training samples for the template values and improves the accuracy of face recognition.
Of course, other methods may be used to estimate age and gender from face recognition, all of which fall within the scope of protection of the present invention. Examples include the age estimation method based on face recognition disclosed in invention patent No. 200910032756.3 and the face gender recognition method based on a fuzzy support vector machine disclosed in invention patent No. 200810226414.0.
The voice command recognition module of the intelligent television voice response system is connected to the intelligent response module. It receives the user's speech, recognizes it, performs semantic recognition according to the identity feature information sent by the user identity feature identification module to derive a command that matches the user's identity features, and sends that command to the intelligent response module.
The voice command recognition module comprises a voice collecting unit, a speech analysis unit, and a semantic analysis unit.
The voice collecting unit, which may be a microphone external to or built into the television set, is connected to the speech analysis unit; it collects the user's speech and sends it to the speech analysis unit. The speech analysis unit is connected to the semantic analysis unit; it receives the speech sent by the voice collecting unit, analyzes it to obtain the corresponding text, and sends the text to the semantic analysis unit. The semantic analysis unit is connected to both the user identity feature identification module and the intelligent response module; it receives the text sent by the speech analysis unit and the identity feature information sent by the user identity feature identification module, retrieves from the command library corresponding to the text the command that matches the identity feature information, and sends it to the intelligent response module.
The intelligent response module receives the command matching the user's identity features sent by the voice command recognition module, matches it against a knowledge base to obtain response information for the user, and feeds the response information back.
The intelligent television voice response system further comprises a TTS (Text To Speech) module connected to the intelligent response module. It converts the response information derived by the intelligent response module from text format to audio format and outputs it through the loudspeaker of the television set.
An intelligent television voice response method of the present invention is carried out by the intelligent television voice response system described above and comprises the following steps:
1) the image acquisition unit captures images of the user and sends the captured images to the image analyzing unit;
2) the image analyzing unit receives the user images sent by the image acquisition unit, analyzes them to derive the user's identity feature information, including the user's age, gender, or expression, and sends that information to the voice command recognition module;
3) the voice collecting unit collects the user's speech and sends it to the speech analysis unit;
4) the speech analysis unit receives the speech sent by the voice collecting unit, analyzes it to obtain the corresponding text, and sends the text to the semantic analysis unit;
5) the semantic analysis unit receives the text sent by the speech analysis unit and the identity feature information sent by the user identity feature identification module, retrieves from the command library corresponding to the text the command that matches the identity feature information, and sends it to the intelligent response module;
6) the intelligent response module receives the command matching the user's identity features sent by the voice command recognition module, matches it against a knowledge base to obtain response information for the user, and sends the response information to the TTS module;
7) the TTS module receives the response information sent by the intelligent response module, converts it from text format to audio format, and outputs it through the loudspeaker of the television set.
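The seven steps above form a linear pipeline, which can be sketched as a single function that wires the stages together. Every stage is passed in as a callable, because the patent does not specify the underlying models; the function name `answer`, the parameter names, and the stand-in lambdas in the usage example are all hypothetical.

```python
from typing import Any, Callable


def answer(
    image: Any,
    audio: Any,
    *,
    analyze_image: Callable[[Any], Any],
    recognize_speech: Callable[[Any], str],
    resolve_command: Callable[[str, Any], str],
    match_knowledge: Callable[[str], str],
    synthesize: Callable[[str], bytes],
) -> bytes:
    """Run one voice-response turn through the stages described above."""
    identity = analyze_image(image)            # steps 1-2: identity features
    text = recognize_speech(audio)             # steps 3-4: speech to text
    command = resolve_command(text, identity)  # step 5: identity-aware command
    reply = match_knowledge(command)           # step 6: knowledge base match
    return synthesize(reply)                   # step 7: text to audio


# Stand-in stages for illustration only; real stages would wrap actual models.
demo_audio = answer(
    image=b"",
    audio=b"",
    analyze_image=lambda img: {"age_group": "child"},
    recognize_speech=lambda aud: "select film",
    resolve_command=lambda text, ident: f"{text}:{ident['age_group']}",
    match_knowledge=lambda cmd: f"Here are results for {cmd}",
    synthesize=lambda reply: reply.encode("utf-8"),
)
```

Injecting the stages as parameters keeps the sketch neutral on whether recognition runs on a server or on the television itself.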
The semantic analysis unit may operate in server mode or in embedded speech recognition mode. When it operates in server mode, information must be sent to the cloud, so the data transmitted to the server is encrypted to protect the user's privacy.
The above is only a preferred embodiment of the present invention. It should be noted that those of ordinary skill in the art may make improvements and modifications without departing from the principle of the invention, and such improvements and modifications shall also be regarded as falling within the scope of protection of the present invention.

Claims (10)

1. an intelligent television voice response system is characterized in that, comprising:
User identity feature identification module links to each other with voice command recognition module, is used for obtaining the user identity feature, and the identity characteristic information of obtaining is sent to voice command recognition module;
Voice command recognition module, link to each other with the intelligent response module, be used for receiving user speech, and voice are identified, and carry out semanteme according to the identity characteristic information that user identity feature identification module sends and identify, draw the order that meets the user identity feature, and this order is sent to the intelligent response module;
The intelligent response module be used for to receive the order that meets the user identity feature that voice command recognition module sends, and carries out the coupling of knowledge base according to this order, draws the response message to the user, and response message is fed back.
2. intelligent television voice response system as claimed in claim 1 is characterized in that described user identity feature identification module comprises:
Image acquisition units links to each other with image analyzing unit, is used for gathering user images, and the user images that collects is sent to image analyzing unit;
Image analyzing unit links to each other with voice command recognition module, is used for receiving the user images that image acquisition units sends, and user images is carried out discriminance analysis, draws the user identity characteristic information and is sent to voice command recognition module.
3. intelligent television voice response system as claimed in claim 2 is characterized in that described image analyzing unit comprises:
Based on the Age estimation unit of recognition of face, be used for user's facial image is carried out discriminance analysis, draw user's age information.
4. intelligent television voice response system as claimed in claim 2 is characterized in that described image analyzing unit comprises:
Based on the sex judging unit of recognition of face, be used for user's facial image is carried out discriminance analysis, draw user's sex information.
5. The intelligent television voice response system as claimed in claim 1, characterized in that the voice command recognition module comprises:
A voice collecting unit, connected to the speech analysis unit, configured to collect user voice information and send the collected voice information to the speech analysis unit;
A speech analysis unit, connected to the semantic analysis unit, configured to receive the voice information sent by the voice collecting unit, analyze the voice information to derive the text information corresponding to the speech, and send the text information to the semantic analysis unit;
A semantic analysis unit, connected to the user identity feature identification module and to the intelligent response module respectively, configured to receive the text information sent by the speech analysis unit and the identity characteristic information sent by the user identity feature identification module, retrieve from the command library corresponding to the text information the command that matches the identity characteristic information, and send that command to the intelligent response module.
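As one possible reading of the semantic analysis unit, the command library can be modeled as a table keyed first by the recognized text and then by identity. The sketch below is an assumption for illustration only: the library contents, the `"any"` wildcard, and the function name `select_command` do not come from the claims.

```python
from typing import Dict, Optional, Tuple

IdentityKey = Tuple[str, str]  # (age_group, gender); "any" acts as a wildcard

# Hypothetical command library: recognized text -> {identity key -> command}.
COMMAND_LIBRARY: Dict[str, Dict[IdentityKey, str]] = {
    "i want to watch a movie": {
        ("child", "any"): "SHOW_KIDS_MOVIES",
        ("adult", "any"): "SHOW_ALL_MOVIES",
    },
}


def select_command(text: str, age_group: str, gender: str) -> Optional[str]:
    """Return the command matching both the text and the identity, if any."""
    candidates = COMMAND_LIBRARY.get(text.strip().lower())
    if candidates is None:
        return None  # the text has no entry in the command library
    # Try the most specific identity key first, then progressively looser ones.
    for key in ((age_group, gender), (age_group, "any"), ("any", "any")):
        if key in candidates:
            return candidates[key]
    return None
```

The same utterance therefore resolves to different commands for different users, which is the behavior the claim describes; how the real command library is organized is not specified in the source.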
6. The intelligent television voice response system as claimed in any one of claims 1 to 5, characterized in that the intelligent television voice response system further comprises:
A TTS module, connected to the intelligent response module, configured to convert the response information derived by the intelligent response module from text format into audio format and output it.
7. An intelligent television voice response method, comprising the following steps:
1) the user identity feature is obtained by user identity feature identification, and the obtained identity characteristic information is sent to the voice command recognition module;
2) the voice command recognition module receives user speech, recognizes the speech, performs semantic recognition according to the identity characteristic information sent by the user identity feature identification module so as to derive a command that matches the user identity feature, and sends the command to the intelligent response module;
3) the intelligent response module receives the command matching the user identity feature sent by the voice command recognition module, matches the command against a knowledge base to derive response information for the user, and feeds back the response information.
8. The intelligent television voice response method as claimed in claim 7, characterized in that step 1) comprises:
11) the image acquisition unit captures user images and sends the captured user images to the image analyzing unit;
12) the image analyzing unit receives the user images sent by the image acquisition unit, performs recognition analysis on the user images, derives the user identity characteristic information, and sends it to the voice command recognition module.
9. The intelligent television voice response method as claimed in claim 8, characterized in that step 12) comprises the following steps:
121) image preprocessing: normalizing the position and size of the face in the user image and normalizing the image;
122) image region division and training: dividing the face in the user image into a plurality of recognition regions, each recognition region outputting different gray values and a binary image;
123) regional facial feature value template matching: separating each recognition region into different template matching units, calculating the feature value of each template matching unit against the matching template, feeding the feature values as input neurons into the input layer of a neural network algorithm, and deriving, through the neural network processing algorithm, the image result with the best matching degree;
124) performing facial weighted calculation on the image result to derive the image recognition result;
125) knowledge base feedback of the image recognition result.
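The steps of claim 9 can be illustrated with a heavily simplified sketch. It makes several assumptions that are not in the source: the image is a small grayscale grid, each region's feature value is its mean gray value, binarization is omitted, and the neural network of step 123 is replaced by a plain similarity score against hypothetical per-label templates. It shows only the shape of the pipeline (normalize, divide into regions, score regions, weight the scores).

```python
from typing import Dict, List, Sequence

Image = List[List[float]]  # grayscale pixel rows


def normalize(image: Image) -> Image:
    """Step 121 (simplified): min-max scale gray values to [0, 1]."""
    flat = [p for row in image for p in row]
    lo, hi = min(flat), max(flat)
    span = (hi - lo) or 1.0  # avoid division by zero on a uniform image
    return [[(p - lo) / span for p in row] for row in image]


def split_regions(image: Image, rows: int, cols: int) -> List[Image]:
    """Step 122 (simplified): divide the face image into a rows x cols grid."""
    h, w = len(image) // rows, len(image[0]) // cols
    return [
        [line[c * w:(c + 1) * w] for line in image[r * h:(r + 1) * h]]
        for r in range(rows)
        for c in range(cols)
    ]


def region_feature(region: Image) -> float:
    """Feature value of one region: its mean gray value (an assumption)."""
    flat = [p for row in region for p in row]
    return sum(flat) / len(flat)


def classify(
    image: Image,
    templates: Dict[str, Sequence[float]],
    weights: Sequence[float],
    rows: int = 2,
    cols: int = 2,
) -> str:
    """Steps 123-124 (simplified): score each label per region, then weight."""
    regions = split_regions(normalize(image), rows, cols)
    features = [region_feature(region) for region in regions]
    scores = {
        label: sum(w * (1.0 - abs(f - t)) for w, f, t in zip(weights, features, template))
        for label, template in templates.items()
    }
    # The label with the highest weighted similarity is the recognition result.
    return max(scores, key=lambda label: scores[label])
```

The per-region weights stand in for the facial weighted calculation of step 124, where some regions of the face would count more than others; the actual weighting scheme and network architecture are not given in the source.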
10. The intelligent television voice response method as claimed in claim 7, characterized in that step 2) comprises:
21) the voice collecting unit collects user voice information and sends the collected voice information to the speech analysis unit;
22) the speech analysis unit receives the voice information sent by the voice collecting unit, analyzes the voice information to derive the text information corresponding to the speech, and sends the text information to the semantic analysis unit;
23) the semantic analysis unit receives the text information sent by the speech analysis unit and the identity characteristic information sent by the user identity feature identification module, retrieves from the command library corresponding to the text information the command that matches the identity characteristic information, and sends that command to the intelligent response module.
CN2012105532157A 2012-12-18 2012-12-18 Intelligent television voice response system and method Pending CN103024530A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN2012105532157A CN103024530A (en) 2012-12-18 2012-12-18 Intelligent television voice response system and method

Publications (1)

Publication Number Publication Date
CN103024530A true CN103024530A (en) 2013-04-03

Family

ID=47972582

Family Applications (1)

Application Number Title Priority Date Filing Date
CN2012105532157A Pending CN103024530A (en) 2012-12-18 2012-12-18 Intelligent television voice response system and method

Country Status (1)

Country Link
CN (1) CN103024530A (en)

Cited By (25)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104123938A (en) * 2013-04-29 2014-10-29 富泰华工业(深圳)有限公司 Voice control system, electronic device and voice control method
WO2014173286A1 (en) * 2013-04-26 2014-10-30 Tencent Technology (Shenzhen) Company Limited Method and apparatus for implementing a network transaction
CN104681023A (en) * 2015-02-15 2015-06-03 联想(北京)有限公司 Information processing method and electronic equipment
CN104795067A (en) * 2014-01-20 2015-07-22 华为技术有限公司 Voice interaction method and device
CN104933336A (en) * 2015-05-06 2015-09-23 丰唐物联技术(深圳)有限公司 Method and system for controlling smart home device
CN105070288A (en) * 2015-07-02 2015-11-18 百度在线网络技术(北京)有限公司 Vehicle-mounted voice instruction recognition method and device
CN105225662A (en) * 2015-08-24 2016-01-06 深圳市冠旭电子有限公司 Smart bluetooth earphone plays method and the smart bluetooth earphone of external voice automatically
CN105489218A (en) * 2015-11-24 2016-04-13 江苏惠通集团有限责任公司 Speech control system, remote control and server
CN105703978A (en) * 2014-11-24 2016-06-22 武汉物联远科技有限公司 Smart home control system and method
CN105723448A (en) * 2014-01-21 2016-06-29 三星电子株式会社 Electronic device and voice recognition method thereof
CN105895105A (en) * 2016-06-06 2016-08-24 北京云知声信息技术有限公司 Speech processing method and device
CN106789595A (en) * 2017-01-17 2017-05-31 北京诸葛找房信息技术有限公司 Information-pushing method and device
CN107170456A (en) * 2017-06-28 2017-09-15 北京云知声信息技术有限公司 Method of speech processing and device
CN107660303A (en) * 2015-06-26 2018-02-02 英特尔公司 The language model of local speech recognition system is changed using remote source
CN107909871A (en) * 2017-12-26 2018-04-13 安徽声讯信息技术有限公司 A kind of tablet computer of intelligent sound teaching
CN108366302A (en) * 2018-02-06 2018-08-03 南京创维信息技术研究院有限公司 TTS broadcast commands optimization method, smart television, system and storage device
CN108492823A (en) * 2018-03-07 2018-09-04 广东思派康电子科技有限公司 A kind of ordering song by voice interactive system and ordering song by voice exchange method
CN110415688A (en) * 2018-04-26 2019-11-05 杭州萤石软件有限公司 Information interaction method and robot
CN110428807A (en) * 2019-08-15 2019-11-08 三星电子(中国)研发中心 A kind of audio recognition method based on deep learning, system and device
CN110909610A (en) * 2019-10-26 2020-03-24 湖北讯獒信息工程有限公司 Accurate age identification method based on artificial intelligence
CN110970021A (en) * 2018-09-30 2020-04-07 航天信息股份有限公司 Question-answering control method, device and system
CN111128194A (en) * 2019-12-31 2020-05-08 云知声智能科技股份有限公司 System and method for improving online voice recognition effect
CN112418060A (en) * 2020-11-19 2021-02-26 西南大学 Facial recognition system based on neural network
CN113096654A (en) * 2021-03-26 2021-07-09 山西三友和智慧信息技术股份有限公司 Computer voice recognition system based on big data
CN114121014A (en) * 2021-10-26 2022-03-01 云知声智能科技股份有限公司 Control method and equipment of multimedia data

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1079615A2 (en) * 1999-08-26 2001-02-28 Matsushita Electric Industrial Co., Ltd. System for identifying and adapting a TV-user profile by means of speech technology
CA2717992A1 (en) * 2008-03-12 2009-09-17 E-Lane Systems Inc. Speech understanding method and system
CN101588443A (en) * 2009-06-22 2009-11-25 费炜 Statistical device and detection method for television audience ratings based on human face
CN101620715A (en) * 2009-08-06 2010-01-06 余洋 Method and system for publishing intelligent advertisement
CN102262644A (en) * 2010-05-25 2011-11-30 索尼公司 Search Apparatus, Search Method, And Program
CN102298694A (en) * 2011-06-21 2011-12-28 广东爱科数字科技有限公司 Man-machine interaction identification system applied to remote information service

Cited By (41)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2014173286A1 (en) * 2013-04-26 2014-10-30 Tencent Technology (Shenzhen) Company Limited Method and apparatus for implementing a network transaction
CN104123938A (en) * 2013-04-29 2014-10-29 富泰华工业(深圳)有限公司 Voice control system, electronic device and voice control method
CN104795067B (en) * 2014-01-20 2019-08-06 华为技术有限公司 Voice interactive method and device
CN104795067A (en) * 2014-01-20 2015-07-22 华为技术有限公司 Voice interaction method and device
US9990924B2 (en) 2014-01-20 2018-06-05 Huawei Technologies Co., Ltd. Speech interaction method and apparatus
CN110459214A (en) * 2014-01-20 2019-11-15 华为技术有限公司 Voice interactive method and device
CN110459214B (en) * 2014-01-20 2022-05-13 华为技术有限公司 Voice interaction method and device
US10468025B2 (en) 2014-01-20 2019-11-05 Huawei Technologies Co., Ltd. Speech interaction method and apparatus
US11380316B2 (en) 2014-01-20 2022-07-05 Huawei Technologies Co., Ltd. Speech interaction method and apparatus
US10304443B2 (en) 2014-01-21 2019-05-28 Samsung Electronics Co., Ltd. Device and method for performing voice recognition using trigger voice
CN105723448A (en) * 2014-01-21 2016-06-29 三星电子株式会社 Electronic device and voice recognition method thereof
US11011172B2 (en) 2014-01-21 2021-05-18 Samsung Electronics Co., Ltd. Electronic device and voice recognition method thereof
CN105723448B (en) * 2014-01-21 2021-01-12 三星电子株式会社 Electronic equipment and voice recognition method thereof
US11984119B2 (en) 2014-01-21 2024-05-14 Samsung Electronics Co., Ltd. Electronic device and voice recognition method thereof
CN105703978A (en) * 2014-11-24 2016-06-22 武汉物联远科技有限公司 Smart home control system and method
CN104681023A (en) * 2015-02-15 2015-06-03 联想(北京)有限公司 Information processing method and electronic equipment
CN104933336A (en) * 2015-05-06 2015-09-23 丰唐物联技术(深圳)有限公司 Method and system for controlling smart home device
CN107660303A (en) * 2015-06-26 2018-02-02 英特尔公司 The language model of local speech recognition system is changed using remote source
CN105070288B (en) * 2015-07-02 2018-08-07 百度在线网络技术(北京)有限公司 Vehicle-mounted voice instruction identification method and device
WO2017000489A1 (en) * 2015-07-02 2017-01-05 百度在线网络技术(北京)有限公司 On-board voice command identification method and apparatus, and storage medium
US10446150B2 (en) 2015-07-02 2019-10-15 Baidu Online Network Technology (Beijing) Co. Ltd. In-vehicle voice command recognition method and apparatus, and storage medium
CN105070288A (en) * 2015-07-02 2015-11-18 百度在线网络技术(北京)有限公司 Vehicle-mounted voice instruction recognition method and device
CN105225662A (en) * 2015-08-24 2016-01-06 深圳市冠旭电子有限公司 Smart bluetooth earphone plays method and the smart bluetooth earphone of external voice automatically
CN105489218A (en) * 2015-11-24 2016-04-13 江苏惠通集团有限责任公司 Speech control system, remote control and server
CN105895105A (en) * 2016-06-06 2016-08-24 北京云知声信息技术有限公司 Speech processing method and device
CN105895105B (en) * 2016-06-06 2020-05-05 北京云知声信息技术有限公司 Voice processing method and device
CN106789595A (en) * 2017-01-17 2017-05-31 北京诸葛找房信息技术有限公司 Information-pushing method and device
CN107170456A (en) * 2017-06-28 2017-09-15 北京云知声信息技术有限公司 Method of speech processing and device
CN107909871A (en) * 2017-12-26 2018-04-13 安徽声讯信息技术有限公司 A kind of tablet computer of intelligent sound teaching
CN108366302A (en) * 2018-02-06 2018-08-03 南京创维信息技术研究院有限公司 TTS broadcast commands optimization method, smart television, system and storage device
CN108366302B (en) * 2018-02-06 2020-06-30 南京创维信息技术研究院有限公司 TTS (text to speech) broadcast instruction optimization method, smart television, system and storage device
CN108492823A (en) * 2018-03-07 2018-09-04 广东思派康电子科技有限公司 A kind of ordering song by voice interactive system and ordering song by voice exchange method
CN110415688A (en) * 2018-04-26 2019-11-05 杭州萤石软件有限公司 Information interaction method and robot
CN110970021B (en) * 2018-09-30 2022-03-08 航天信息股份有限公司 Question-answering control method, device and system
CN110970021A (en) * 2018-09-30 2020-04-07 航天信息股份有限公司 Question-answering control method, device and system
CN110428807A (en) * 2019-08-15 2019-11-08 三星电子(中国)研发中心 A kind of audio recognition method based on deep learning, system and device
CN110909610A (en) * 2019-10-26 2020-03-24 湖北讯獒信息工程有限公司 Accurate age identification method based on artificial intelligence
CN111128194A (en) * 2019-12-31 2020-05-08 云知声智能科技股份有限公司 System and method for improving online voice recognition effect
CN112418060A (en) * 2020-11-19 2021-02-26 西南大学 Facial recognition system based on neural network
CN113096654A (en) * 2021-03-26 2021-07-09 山西三友和智慧信息技术股份有限公司 Computer voice recognition system based on big data
CN114121014A (en) * 2021-10-26 2022-03-01 云知声智能科技股份有限公司 Control method and equipment of multimedia data

Similar Documents

Publication Publication Date Title
CN103024530A (en) Intelligent television voice response system and method
CN113408385B (en) Audio and video multi-mode emotion classification method and system
Czyzewski et al. An audio-visual corpus for multimodal automatic speech recognition
CN108737872A (en) Method and apparatus for output information
US20060173859A1 (en) Apparatus and method for extracting context and providing information based on context in multimedia communication system
JP2019212288A (en) Method and device for outputting information
CN110519636A (en) Voice messaging playback method, device, computer equipment and storage medium
US20220392224A1 (en) Data processing method and apparatus, device, and readable storage medium
CN111326143B (en) Voice processing method, device, equipment and storage medium
CN112052333B (en) Text classification method and device, storage medium and electronic equipment
CN109410911A (en) Artificial intelligence learning method based on speech recognition
CN114187547A (en) Target video output method and device, storage medium and electronic device
CN112233698A (en) Character emotion recognition method and device, terminal device and storage medium
CN107507620A (en) Voice broadcast sound setting method and device, mobile terminal and storage medium
CN111583919B (en) Information processing method, device and storage medium
CN105872792A (en) Voice-based service recommending method and device
CN112785669B (en) Virtual image synthesis method, device, equipment and storage medium
CN115602165B (en) Digital employee intelligent system based on financial system
WO2024140434A1 (en) Text classification method based on multi-modal knowledge graph, and device and storage medium
WO2024140430A1 (en) Text classification method based on multimodal deep learning, device, and storage medium
CN107291704A (en) Treating method and apparatus, the device for processing
Huang et al. Audio-visual speech recognition using an infrared headset
CN111833907B (en) Man-machine interaction method, terminal and computer readable storage medium
CN111354362A (en) Method and device for assisting hearing-impaired communication
CN113823303A (en) Audio noise reduction method and device and computer readable storage medium

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C12 Rejection of a patent application after its publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20130403