CN112312181A - Smart television voice recognition method, system and readable storage medium - Google Patents

Smart television voice recognition method, system and readable storage medium Download PDF

Info

Publication number
CN112312181A
CN112312181A CN201910682661.XA CN201910682661A CN112312181A CN 112312181 A CN112312181 A CN 112312181A CN 201910682661 A CN201910682661 A CN 201910682661A CN 112312181 A CN112312181 A CN 112312181A
Authority
CN
China
Prior art keywords
user
voice
voiceprint
recognition
dialect
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201910682661.XA
Other languages
Chinese (zh)
Inventor
鲍舰
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shenzhen TCL New Technology Co Ltd
Original Assignee
Shenzhen TCL New Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shenzhen TCL New Technology Co Ltd filed Critical Shenzhen TCL New Technology Co Ltd
Priority to CN201910682661.XA priority Critical patent/CN112312181A/en
Priority to PCT/CN2020/103545 priority patent/WO2021017978A1/en
Publication of CN112312181A publication Critical patent/CN112312181A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/41Structure of client; Structure of client peripherals
    • H04N21/422Input-only peripherals, i.e. input devices connected to specially adapted client devices, e.g. global positioning system [GPS]
    • H04N21/42203Input-only peripherals, i.e. input devices connected to specially adapted client devices, e.g. global positioning system [GPS] sound input device, e.g. microphone
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/005Language recognition
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/26Speech to text systems
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00Speaker identification or verification techniques
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/41Structure of client; Structure of client peripherals
    • H04N21/422Input-only peripherals, i.e. input devices connected to specially adapted client devices, e.g. global positioning system [GPS]
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/441Acquiring end-user identification, e.g. using personal code sent by the remote control or by inserting a card
    • H04N21/4415Acquiring end-user identification, e.g. using personal code sent by the remote control or by inserting a card using biometric characteristics of the user, e.g. by voice recognition or fingerprint scanning
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/223Execution procedure of a spoken command
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/225Feedback of the input speech

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Health & Medical Sciences (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Biomedical Technology (AREA)
  • General Health & Medical Sciences (AREA)
  • Theoretical Computer Science (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
  • User Interface Of Digital Computer (AREA)

Abstract

The invention provides a voice recognition method, a voice recognition system and a storage medium for a smart television, which are used for recognizing dialects of users by the smart television, and receiving voice instructions of user interactive operation by the smart television; the voice print recognition module determines the dialect type used by the user according to the voice print characteristics of the voice command of the user interactive operation; the voice recognition module directly converts the voice instruction of the user interactive operation into characters according to the dialect type used by the user so as to recognize the voice instruction of the user. In the invention, the user operates the intelligent television through voice, the intelligent television identifies the voice of the user and identifies and feeds back the voice, the user does not need to select dialect types, and for families using the intelligent television and having various dialects, the dialects spoken by the user can be automatically identified and voice instructions of the user interactive operation can be directly identified according to the voice identification technology of the dialects, so that the selection times of the dialects by the user are greatly reduced, and the experience of the user in voice operation is improved.

Description

Smart television voice recognition method, system and readable storage medium
Technical Field
The present invention relates to the field of speech recognition technologies, and in particular, to a method, a system, and a readable storage medium.
Background
At present, the application of voice recognition technology on smart televisions is widespread, and a user can select a movie, play music and even control various household appliances by speaking. For some countries with broad breadth, such as China, pronunciations of various local dialects are greatly different, although the voice recognition technology on the smart television can recognize the local dialects, the precondition is that the dialects used by the user are preset on the television, and voice recognition cannot be randomly performed according to the dialects spoken by the user, in other words, the dialects of the user need to be preset in the smart television firstly, the smart television can recognize the dialects spoken by the user, otherwise, the voice AI technology of the smart television cannot automatically recognize the local dialects spoken by the user.
For a family, a television is a public electrical appliance in the whole family, an old person may speak a hometown word, a child only speaks a mandarin for school education, and in the family, the possibility of multiple dialects may exist, and it is not practical to preset the corresponding dialects for each member in the family in the television.
The prior art also has some voice recognition technologies for solving the problem that dialects need to be preset, for example, the judgment is carried out according to the geographic position of the smart television, that is, the geographic position of a user is judged according to the IP address of the smart television network, and then the preferred dialect type of the smart television is determined according to the geographic position.
Accordingly, the prior art is yet to be improved and developed.
Disclosure of Invention
In view of the defects of the prior art, the invention provides an automatic dialect matching technology for a smart television, so that the dialect spoken by a user is automatically matched under the condition that the dialect is not preset by the smart television, and the dialect is automatically identified.
The technical scheme adopted by the invention for solving the technical problem is as follows:
a speech recognition method for a smart television is used for the smart television to recognize dialects of users and comprises the following steps:
the intelligent television receives a voice instruction of user interactive operation;
the voice print recognition module determines the dialect type used by the user according to the voice print characteristics of the voice command operated by the user;
the voice recognition module directly converts the voice instruction of the user interactive operation into characters according to the dialect type used by the user so as to recognize the voice instruction of the user.
As a further improved technical scheme, the method also comprises the following steps:
the method comprises the steps that the smart television creates a corresponding voiceprint feature file for each user in advance;
and the user selects and confirms the dialect type in the corresponding voiceprint feature file.
As a further improved technical solution, when the voiceprint recognition module determines that the voiceprint feature of the voice instruction is not in the corresponding voiceprint feature file previously created for each user by the smart television, the smart television newly creates the corresponding voiceprint feature file for the user with the voiceprint feature, and the user selects and confirms the dialect type in the corresponding voiceprint feature file.
As a further improved technical solution, the voiceprint recognition module can be implemented by using a voiceprint recognition server connected to the smart television network.
As a further improved technical solution, the voice recognition module may be implemented by using a voice recognition server connected to an intelligent television network.
The invention also provides an intelligent television voice recognition system which is used for recognizing the dialect of the user by the intelligent television and comprises a voice receiving module, a voiceprint recognition module and a voice recognition module;
the voice receiving module is used for receiving a voice instruction of user interactive operation by the intelligent television;
the voice print recognition module is used for judging the voice print characteristics of the voice command of the user interactive operation received by the voice receiving module and determining the dialect type used by the user;
the voice recognition module is used for directly converting the voice of the user into characters according to the dialect type corresponding to the voice voiceprint characteristics of the user interaction operation recognized by the voiceprint recognition module so as to recognize the voice instruction of the user.
As a further improved technical solution, the system further includes a user voiceprint feature module, which is used for creating a corresponding voiceprint feature file for each smart television user in advance, and includes a dialect category corresponding to the user voiceprint feature.
As a further improved technical solution, when the voiceprint recognition module determines that the voiceprint feature of the voice instruction of the user interactive operation is not the user voiceprint feature in the user voiceprint feature module, the user voiceprint feature module creates a corresponding voiceprint feature file for the user of the voiceprint feature, and determines the dialect type used correspondingly.
As a further improved technical scheme, the voiceprint recognition module can be realized by adopting a voiceprint recognition server connected with an intelligent television network; the voice recognition module can be realized by adopting a voice recognition server connected with an intelligent television network.
The invention also provides a readable storage medium, wherein the readable storage medium stores a program for intelligent television voice recognition, and the steps of the intelligent television voice recognition method are realized when the program for intelligent television voice recognition is executed by a processor.
Compared with the prior art, the invention adopts the voiceprint feature recognition module to pre-document the voiceprint features of the user of the intelligent television and the dialect types correspondingly used, when the user operates the intelligent television through the voice operation function of the intelligent television, the voiceprint feature recognition module recognizes the voiceprint feature of the user in advance to determine the voiceprint feature of the user and the dialect type preset by the voiceprint feature recognition module, then directly calling a voice recognition module to directly convert the voice instruction of the dialect-like user interaction operation into a text, in the whole operation process that the user operates the intelligent television through voice and the intelligent television identifies the voice of the user and carries out identification feedback, the user does not need to select the dialect type, for a household using the intelligent television and with a plurality of dialects, the intelligent television can automatically recognize the dialect spoken by the user and directly recognize the voice instruction of the user interaction operation according to the voice recognition technology of the dialect. The invention greatly reduces the dialect selection times of the intelligent television user and improves the experience of the user in voice operation.
Drawings
The embodiments of the invention will be further described with reference to the accompanying drawings, in which:
fig. 1 is a flowchart of a speech recognition method for a smart television according to a preferred embodiment of the present invention.
Fig. 2 is a schematic structure diagram of a speech recognition system of a smart television according to a preferred embodiment of the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the present invention clearer and clearer, the present invention is further described in detail below with reference to the accompanying drawings and examples. It should be understood that the specific embodiments described herein are merely illustrative of the invention and are not intended to limit the invention.
The flow of the speech recognition method for the smart television provided by the invention is as shown in fig. 1, the flow of the preferred embodiment of the speech recognition method for the smart television of the invention is as shown in fig. 1, and the speech recognition method for the smart television of the invention comprises the following implementation steps:
and step S100, the intelligent television receives a voice instruction of user interactive operation.
In a family using the smart television, accents of users who are members of the family are different from each other, even different dialects may be used, although the dialects can be recognized by the existing smart television in the voice recognition function, in the operation process, when the users use the dialects to interact with the smart television, the voice recognition technology of the smart television cannot directly determine the dialects of the users, and the dialects which are selected by the users need to be selected by the users, that is, the smart television cannot directly recognize the dialects of the users, so that voice recognition is performed. The method of the invention can directly receive dialect interactive voice instruction of the user in the process of using voice recognition of the intelligent television to carry out man-machine interactive operation, of course, as another preferred embodiment, the intelligent television can create a corresponding voiceprint characteristic file for the user in advance to automatically select the dialect of the user and directly carry out recognition, and before the intelligent television receives the voice instruction of the user interactive operation, the method can also comprise the following steps:
the method comprises the steps that the smart television creates a corresponding voiceprint feature file for each user in advance; and the user selects and confirms the dialect type in the corresponding voiceprint feature file.
The smart television establishes voice print characteristic files for family member users in advance according to respective dialects so as to ensure that a corresponding dialect voice recognition scheme can be directly selected for recognition according to the dialects in the subsequent smart television voice recognition process, and therefore, the user voice print characteristic files also need to be correspondingly selected for the dialects.
Step S200, the voiceprint recognition module determines the dialect type used by the user according to the voiceprint characteristics of the voice command of the user interactive operation.
Specifically, the voiceprint recognition module performs voiceprint recognition on a voice instruction of user interactive operation, and confirms the user according to a voiceprint feature file of the user established in the intelligent television in the process, so that the dialect used by the user can be directly determined. For example, a dialect speaking user (Cantonese) says "I want to watch XX programs" in front of the TV for the first time, and then the TV interface pops up various dialects according to the prior art: the dialect recognition results of the Cantonese, the Sichuan and the Hunan languages are given to the user, and the user needs to further judge the dialect to be the type of the Cantonese, and then the television can carry out the subsequent voice recognition operation. When the method is adopted, a dialect-speaking user (Guangdong dialect) says that the user wants to watch XX programs before a television for the first time, at the moment, various dialects cannot be popped up on a television interface for the user to select and confirm the dialect types before carrying out next voice recognition, and the user is confirmed through a voiceprint recognition module, directly selects the dialect types of the user to be matched, and then adopts a voice recognition scheme of the Guangdong dialect for recognition.
Certainly, as another preferred embodiment, the voiceprint recognition module can also be implemented by using a voiceprint recognition server connected to the smart television network, and the smart television can store more user voiceprint feature information by using the voiceprint recognition server connected to the smart television network.
Step S300, the voice recognition module directly converts the voice command of the user interactive operation into characters according to the dialect type used by the user so as to recognize the voice command of the user.
In the same way as the voiceprint recognition module, the voice recognition module can also be realized by adopting a voice recognition server connected with the intelligent television network, and in the same way, the intelligent television can store more voice recognition schemes by adopting the voice recognition server connected with the intelligent television network, and can also be continuously expanded and updated as required.
The method of the invention adopts the voiceprint feature recognition technology to distinguish the users in the family using the intelligent television, and directly carries out voice recognition according to the preset dialect of the user, thereby realizing the automatic dialect voice matching in the voice recognition process of the intelligent television.
Fig. 2 shows a schematic structure diagram of a preferred embodiment of the speech recognition system of the smart television, where the speech recognition system 60 of the smart television includes a speech receiving module 61, a voiceprint recognition module 62, and a speech recognition module 63.
The voice receiving module 61 is used for the smart television to receive a voice instruction of user interactive operation. In a family using the smart television, accents of users who are members of the family are different from each other, even different dialects may be used, although the dialects can be recognized by the existing smart television in the voice recognition function, in the operation process, when the users use the dialects to interact with the smart television, the voice recognition technology of the smart television cannot directly determine the dialects of the users, and the dialects which are selected by the users need to be selected by the users, that is, the smart television cannot directly recognize the dialects of the users, so that voice recognition is performed. The system of the present invention can directly receive dialect interactive voice instructions of the user during the process of performing human-computer interaction operation by using voice recognition of the smart television, and certainly, as another preferred embodiment, the smart television can create corresponding voiceprint feature files for the user in advance to automatically select the dialect of the user and directly perform recognition, that is, the system 60 further includes a user voiceprint feature module 64 for creating corresponding voiceprint feature files for each smart television user in advance and including dialect types corresponding to the voiceprint features of the user.
The smart television establishes voice print characteristic files for family member users in advance according to respective dialects so as to ensure that a corresponding dialect voice recognition scheme can be directly selected for recognition according to the dialects in the subsequent smart television voice recognition process, and therefore, the user voice print characteristic files also need to be correspondingly selected for the dialects.
The voiceprint recognition module 62 is configured to determine a voiceprint feature of the voice instruction of the user interaction operation received by the voice receiving module 61 and determine a dialect category used by the user.
Specifically, the voiceprint recognition module 62 performs voiceprint recognition on a voice instruction of user interactive operation, and confirms the user according to the voiceprint feature file of the user established in the smart television in the above process, so as to directly determine what dialect the user uses, unlike the prior art that when the smart television receives the interactive operation voice of the dialect of the user, the user needs to select the dialect again to perform voice recognition of the next step, the system of the present invention can directly confirm the voice recognition scheme according to the dialect of the user, thereby skipping the dialect selection process and improving the experience of the user in using the voice recognition technology. For example, a dialect speaking user (Cantonese) says "I want to watch XX programs" in front of the TV for the first time, and then the TV interface pops up various dialects according to the prior art: the dialect recognition results of the Cantonese, the Sichuan and the Hunan languages are given to the user, and the user needs to further judge the dialect to be the type of the Cantonese, and then the television can carry out the subsequent voice recognition operation. When the system is adopted, a dialect-speaking user (Guangdong dialect) says that the user wants to watch XX programs before a television for the first time, at the moment, various dialects cannot be popped up on a television interface for the user to select and confirm the dialect types before carrying out next voice recognition, and the user is confirmed through a voiceprint recognition module, directly selects the dialect types of the user to be matched, and then adopts a voice recognition scheme of the Guangdong dialect for recognition.
Certainly, as another preferred embodiment, the voiceprint recognition module can also be implemented by using a voiceprint recognition server connected to the smart television network, and the smart television can store more user voiceprint feature information by using the voiceprint recognition server connected to the smart television network.
The voice recognition module 63 is configured to directly convert the voice of the user into characters according to the dialect type corresponding to the voice command voiceprint feature of the user interaction operation recognized by the voiceprint recognition module 62, so as to recognize the voice command of the user.
In the same way as the voiceprint recognition module, the voice recognition module 63 can also be implemented by using a voice recognition server connected with the smart television network, and in the same way, the smart television can store more voice recognition schemes by using the voice recognition server connected with the smart television network, and can also be continuously expanded and updated as required.
The invention also provides a readable storage medium, wherein the readable storage medium stores a program for intelligent television voice recognition, and the steps of the intelligent television voice recognition method are realized when the program for intelligent television voice recognition is executed by a processor. The specific execution process of the program is the same as the preferred implementation of the above-mentioned speech recognition method for the smart television, and is not described herein again.
It should be understood that the above-mentioned embodiments are merely preferred examples of the present invention, and not restrictive, but rather, all the changes, substitutions, alterations and modifications that come within the spirit and scope of the invention as described above may be made by those skilled in the art, and all the changes, substitutions, alterations and modifications that fall within the scope of the appended claims should be construed as being included in the present invention.

Claims (10)

1. A speech recognition method for an intelligent television is used for the intelligent television to recognize dialects of users, and is characterized by comprising the following steps:
the intelligent television receives a voice instruction of user interactive operation;
the voice print recognition module determines the dialect type used by the user according to the voice print characteristics of the voice command of the user interactive operation;
the voice recognition module directly converts the voice instruction of the user interactive operation into characters according to the dialect type used by the user so as to recognize the voice instruction of the user.
2. The speech recognition method for the smart television set according to claim 1, wherein before the smart television set receives the speech command of the user interaction operation, the method further comprises the following steps:
the method comprises the steps that the smart television creates a corresponding voiceprint feature file for each user in advance;
and the user selects and confirms the dialect type in the corresponding voiceprint feature file.
3. The voice recognition method for the smart television as claimed in claim 2, wherein when the voiceprint recognition module determines that the voiceprint feature of the voice command of the user interactive operation is not in the corresponding voiceprint feature file previously created for each user by the smart television, the smart television newly creates the corresponding voiceprint feature file for the user with the voiceprint feature, and the user selects and confirms the dialect category in the corresponding voiceprint feature file.
4. The voice recognition method for the smart television as claimed in any one of claims 1 to 3, wherein the voiceprint recognition module can be implemented by a voiceprint recognition server connected to a smart television network.
5. The intelligent television voice recognition method according to any one of claims 1 to 3, wherein the voice recognition module is implemented by a voice recognition server connected to an intelligent television network.
6. A speech recognition system of an intelligent television is used for the intelligent television to recognize dialect of a user and is characterized by comprising a speech receiving module, a voiceprint recognition module and a speech recognition module;
the voice receiving module is used for receiving a voice instruction of user interactive operation;
the voice print recognition module is used for judging the voice print characteristics of the voice command of the user interactive operation received by the voice receiving module and determining the dialect type used by the user;
the voice recognition module is used for directly converting the voice of the user into characters according to the dialect type corresponding to the voice command voiceprint characteristics of the user interactive operation recognized by the voiceprint recognition module so as to recognize the voice command of the user.
7. The system according to claim 6, further comprising a user voiceprint feature module, configured to create a corresponding voiceprint feature file for each smart tv user in advance, and include a dialect category corresponding to the user voiceprint feature.
8. The system according to claim 7, wherein when the voiceprint recognition module determines that the voiceprint feature of the voice command of the user interactive operation is not the user voiceprint feature in the user voiceprint feature module, the user voiceprint feature module creates a corresponding voiceprint feature file for the user with the voiceprint feature, and determines the dialect type to be used correspondingly.
9. The intelligent television voice recognition system according to any one of claims 6 to 8, wherein the voiceprint recognition module can be implemented by a voiceprint recognition server connected to an intelligent television network; the voice recognition module can be realized by adopting a voice recognition server connected with an intelligent television network.
10. A readable storage medium, characterized in that the readable storage medium stores a program for smart tv voice recognition, and the program for smart tv voice recognition realizes the steps of the smart tv voice recognition method according to any one of claims 1 to 5 when being executed by a processor.
CN201910682661.XA 2019-07-26 2019-07-26 Smart television voice recognition method, system and readable storage medium Pending CN112312181A (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN201910682661.XA CN112312181A (en) 2019-07-26 2019-07-26 Smart television voice recognition method, system and readable storage medium
PCT/CN2020/103545 WO2021017978A1 (en) 2019-07-26 2020-07-22 Smart television speech recognition method, system and readable storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910682661.XA CN112312181A (en) 2019-07-26 2019-07-26 Smart television voice recognition method, system and readable storage medium

Publications (1)

Publication Number Publication Date
CN112312181A true CN112312181A (en) 2021-02-02

Family

ID=74229363

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910682661.XA Pending CN112312181A (en) 2019-07-26 2019-07-26 Smart television voice recognition method, system and readable storage medium

Country Status (2)

Country Link
CN (1) CN112312181A (en)
WO (1) WO2021017978A1 (en)

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20100032140A (en) * 2008-09-17 2010-03-25 주식회사 현대오토넷 Method of interactive voice recognition and apparatus for interactive voice recognition
CN102638605A (en) * 2011-02-14 2012-08-15 苏州巴米特信息科技有限公司 Speech system for recognizing dialect background mandarin
US20140191949A1 (en) * 2013-01-07 2014-07-10 Samsung Electronics Co., Ltd. Display apparatus and method of controlling a display apparatus in a voice recognition system
CN105872687A (en) * 2016-03-31 2016-08-17 乐视控股(北京)有限公司 Method and device for controlling intelligent equipment through voice
CN206117701U (en) * 2016-09-30 2017-04-19 无锡小天鹅股份有限公司 Domestic appliance and control system thereof
CN107170454A (en) * 2017-05-31 2017-09-15 广东欧珀移动通信有限公司 Audio recognition method and Related product
CN107580237A (en) * 2017-09-05 2018-01-12 深圳Tcl新技术有限公司 Operating method, device, system and the storage medium of TV
CN108172223A (en) * 2017-12-14 2018-06-15 深圳市欧瑞博科技有限公司 Voice instruction recognition method, device and server and computer readable storage medium
CN109785832A (en) * 2018-12-20 2019-05-21 安徽声讯信息技术有限公司 A kind of old man's set-top box Intelligent voice recognition method suitable for accent again

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103593340B (en) * 2013-10-28 2017-08-29 余自立 Natural expressing information processing method, processing and response method, equipment and system
CN104575504A (en) * 2014-12-24 2015-04-29 上海师范大学 Method for personalized television voice wake-up by voiceprint and voice identification
CN106504754B (en) * 2016-09-29 2019-10-18 浙江大学 A kind of real-time method for generating captions according to audio output
CN106847281A (en) * 2017-02-26 2017-06-13 上海新柏石智能科技股份有限公司 Intelligent household voice control system and method based on voice fuzzy identification technology
CN107809667A (en) * 2017-10-26 2018-03-16 深圳创维-Rgb电子有限公司 Television voice exchange method, interactive voice control device and storage medium

Patent Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20100032140A (en) * 2008-09-17 2010-03-25 주식회사 현대오토넷 Method of interactive voice recognition and apparatus for interactive voice recognition
CN102638605A (en) * 2011-02-14 2012-08-15 苏州巴米特信息科技有限公司 Speech system for recognizing dialect background mandarin
US20140191949A1 (en) * 2013-01-07 2014-07-10 Samsung Electronics Co., Ltd. Display apparatus and method of controlling a display apparatus in a voice recognition system
CN105872687A (en) * 2016-03-31 2016-08-17 乐视控股(北京)有限公司 Method and device for controlling intelligent equipment through voice
CN206117701U (en) * 2016-09-30 2017-04-19 无锡小天鹅股份有限公司 Domestic appliance and control system thereof
CN107170454A (en) * 2017-05-31 2017-09-15 广东欧珀移动通信有限公司 Audio recognition method and Related product
CN107580237A (en) * 2017-09-05 2018-01-12 深圳Tcl新技术有限公司 Operating method, device, system and the storage medium of TV
CN108172223A (en) * 2017-12-14 2018-06-15 深圳市欧瑞博科技有限公司 Voice instruction recognition method, device and server and computer readable storage medium
CN109785832A (en) * 2018-12-20 2019-05-21 安徽声讯信息技术有限公司 A kind of old man's set-top box Intelligent voice recognition method suitable for accent again

Also Published As

Publication number Publication date
WO2021017978A1 (en) 2021-02-04

Similar Documents

Publication Publication Date Title
US8064573B2 (en) Computer generated prompting
US7783475B2 (en) Menu-based, speech actuated system with speak-ahead capability
EP2521121B1 (en) Method and device for voice controlling
US6963836B2 (en) Speechdriven setting of a language of interaction
US20060159240A1 (en) System and method of utilizing a hybrid semantic model for speech recognition
US7881938B2 (en) Speech bookmarks in a voice user interface using a speech recognition engine and acoustically generated baseforms
US9807243B2 (en) Method and system for voice transmission control
CA2785081A1 (en) Method and system for processing multiple speech recognition results from a single utterance
US10535337B2 (en) Method for correcting false recognition contained in recognition result of speech of user
JP2008506156A (en) Multi-slot interaction system and method
KR20110127180A (en) Systems and methods for interactively accessing hosted services using voice communications
CN109036406A (en) A kind of processing method of voice messaging, device, equipment and storage medium
US20150030141A1 (en) Automated response system
EP3157236A1 (en) Method and device for quickly accessing ivr menu
KR20060014369A (en) Speaker-dependent voice recognition method and voice recognition system
US7451086B2 (en) Method and apparatus for voice recognition
CN112312181A (en) Smart television voice recognition method, system and readable storage medium
US20060077967A1 (en) Method to manage media resources providing services to be used by an application requesting a particular set of services
JP2005520194A (en) Generating text messages
US6141661A (en) Method and apparatus for performing a grammar-pruning operation
CN111292749B (en) Session control method and device of intelligent voice platform
CN105118507A (en) Sound control system and control method thereof
US8954325B1 (en) Speech recognition in automated information services systems
CA2256781A1 (en) Method and apparatus for automatically dialling a desired telephone number using speech commands
CN117831526A (en) Control system and method for vehicle-mounted voice streaming dialogue

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20210202