CN112312181A - Smart television voice recognition method, system and readable storage medium - Google Patents
Smart television voice recognition method, system and readable storage medium Download PDFInfo
- Publication number
- CN112312181A CN112312181A CN201910682661.XA CN201910682661A CN112312181A CN 112312181 A CN112312181 A CN 112312181A CN 201910682661 A CN201910682661 A CN 201910682661A CN 112312181 A CN112312181 A CN 112312181A
- Authority
- CN
- China
- Prior art keywords
- user
- voice
- voiceprint
- recognition
- dialect
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000000034 method Methods 0.000 title claims abstract description 39
- 230000002452 interceptive effect Effects 0.000 claims abstract description 28
- 230000003993 interaction Effects 0.000 claims description 7
- 238000005516 engineering process Methods 0.000 abstract description 12
- 230000008569 process Effects 0.000 description 12
- 230000006870 function Effects 0.000 description 3
- 230000004075 alteration Effects 0.000 description 2
- 238000010586 diagram Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 238000006467 substitution reaction Methods 0.000 description 2
- 241001672694 Citrus reticulata Species 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/41—Structure of client; Structure of client peripherals
- H04N21/422—Input-only peripherals, i.e. input devices connected to specially adapted client devices, e.g. global positioning system [GPS]
- H04N21/42203—Input-only peripherals, i.e. input devices connected to specially adapted client devices, e.g. global positioning system [GPS] sound input device, e.g. microphone
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/005—Language recognition
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/26—Speech to text systems
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification techniques
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/41—Structure of client; Structure of client peripherals
- H04N21/422—Input-only peripherals, i.e. input devices connected to specially adapted client devices, e.g. global positioning system [GPS]
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
- H04N21/441—Acquiring end-user identification, e.g. using personal code sent by the remote control or by inserting a card
- H04N21/4415—Acquiring end-user identification, e.g. using personal code sent by the remote control or by inserting a card using biometric characteristics of the user, e.g. by voice recognition or fingerprint scanning
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G10L2015/223—Execution procedure of a spoken command
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G10L2015/225—Feedback of the input speech
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Health & Medical Sciences (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Biomedical Technology (AREA)
- General Health & Medical Sciences (AREA)
- Theoretical Computer Science (AREA)
- Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
- User Interface Of Digital Computer (AREA)
Abstract
The invention provides a voice recognition method, a voice recognition system and a storage medium for a smart television, which are used for recognizing dialects of users by the smart television, and receiving voice instructions of user interactive operation by the smart television; the voice print recognition module determines the dialect type used by the user according to the voice print characteristics of the voice command of the user interactive operation; the voice recognition module directly converts the voice instruction of the user interactive operation into characters according to the dialect type used by the user so as to recognize the voice instruction of the user. In the invention, the user operates the intelligent television through voice, the intelligent television identifies the voice of the user and identifies and feeds back the voice, the user does not need to select dialect types, and for families using the intelligent television and having various dialects, the dialects spoken by the user can be automatically identified and voice instructions of the user interactive operation can be directly identified according to the voice identification technology of the dialects, so that the selection times of the dialects by the user are greatly reduced, and the experience of the user in voice operation is improved.
Description
Technical Field
The present invention relates to the field of speech recognition technologies, and in particular, to a method, a system, and a readable storage medium.
Background
At present, the application of voice recognition technology on smart televisions is widespread, and a user can select a movie, play music and even control various household appliances by speaking. For some countries with broad breadth, such as China, pronunciations of various local dialects are greatly different, although the voice recognition technology on the smart television can recognize the local dialects, the precondition is that the dialects used by the user are preset on the television, and voice recognition cannot be randomly performed according to the dialects spoken by the user, in other words, the dialects of the user need to be preset in the smart television firstly, the smart television can recognize the dialects spoken by the user, otherwise, the voice AI technology of the smart television cannot automatically recognize the local dialects spoken by the user.
For a family, a television is a public electrical appliance in the whole family, an old person may speak a hometown word, a child only speaks a mandarin for school education, and in the family, the possibility of multiple dialects may exist, and it is not practical to preset the corresponding dialects for each member in the family in the television.
The prior art also has some voice recognition technologies for solving the problem that dialects need to be preset, for example, the judgment is carried out according to the geographic position of the smart television, that is, the geographic position of a user is judged according to the IP address of the smart television network, and then the preferred dialect type of the smart television is determined according to the geographic position.
Accordingly, the prior art is yet to be improved and developed.
Disclosure of Invention
In view of the defects of the prior art, the invention provides an automatic dialect matching technology for a smart television, so that the dialect spoken by a user is automatically matched under the condition that the dialect is not preset by the smart television, and the dialect is automatically identified.
The technical scheme adopted by the invention for solving the technical problem is as follows:
a speech recognition method for a smart television is used for the smart television to recognize dialects of users and comprises the following steps:
the intelligent television receives a voice instruction of user interactive operation;
the voice print recognition module determines the dialect type used by the user according to the voice print characteristics of the voice command operated by the user;
the voice recognition module directly converts the voice instruction of the user interactive operation into characters according to the dialect type used by the user so as to recognize the voice instruction of the user.
As a further improved technical scheme, the method also comprises the following steps:
the method comprises the steps that the smart television creates a corresponding voiceprint feature file for each user in advance;
and the user selects and confirms the dialect type in the corresponding voiceprint feature file.
As a further improved technical solution, when the voiceprint recognition module determines that the voiceprint feature of the voice instruction is not in the corresponding voiceprint feature file previously created for each user by the smart television, the smart television newly creates the corresponding voiceprint feature file for the user with the voiceprint feature, and the user selects and confirms the dialect type in the corresponding voiceprint feature file.
As a further improved technical solution, the voiceprint recognition module can be implemented by using a voiceprint recognition server connected to the smart television network.
As a further improved technical solution, the voice recognition module may be implemented by using a voice recognition server connected to an intelligent television network.
The invention also provides an intelligent television voice recognition system which is used for recognizing the dialect of the user by the intelligent television and comprises a voice receiving module, a voiceprint recognition module and a voice recognition module;
the voice receiving module is used for receiving a voice instruction of user interactive operation by the intelligent television;
the voice print recognition module is used for judging the voice print characteristics of the voice command of the user interactive operation received by the voice receiving module and determining the dialect type used by the user;
the voice recognition module is used for directly converting the voice of the user into characters according to the dialect type corresponding to the voice voiceprint characteristics of the user interaction operation recognized by the voiceprint recognition module so as to recognize the voice instruction of the user.
As a further improved technical solution, the system further includes a user voiceprint feature module, which is used for creating a corresponding voiceprint feature file for each smart television user in advance, and includes a dialect category corresponding to the user voiceprint feature.
As a further improved technical solution, when the voiceprint recognition module determines that the voiceprint feature of the voice instruction of the user interactive operation is not the user voiceprint feature in the user voiceprint feature module, the user voiceprint feature module creates a corresponding voiceprint feature file for the user of the voiceprint feature, and determines the dialect type used correspondingly.
As a further improved technical scheme, the voiceprint recognition module can be realized by adopting a voiceprint recognition server connected with an intelligent television network; the voice recognition module can be realized by adopting a voice recognition server connected with an intelligent television network.
The invention also provides a readable storage medium, wherein the readable storage medium stores a program for intelligent television voice recognition, and the steps of the intelligent television voice recognition method are realized when the program for intelligent television voice recognition is executed by a processor.
Compared with the prior art, the invention adopts the voiceprint feature recognition module to pre-document the voiceprint features of the user of the intelligent television and the dialect types correspondingly used, when the user operates the intelligent television through the voice operation function of the intelligent television, the voiceprint feature recognition module recognizes the voiceprint feature of the user in advance to determine the voiceprint feature of the user and the dialect type preset by the voiceprint feature recognition module, then directly calling a voice recognition module to directly convert the voice instruction of the dialect-like user interaction operation into a text, in the whole operation process that the user operates the intelligent television through voice and the intelligent television identifies the voice of the user and carries out identification feedback, the user does not need to select the dialect type, for a household using the intelligent television and with a plurality of dialects, the intelligent television can automatically recognize the dialect spoken by the user and directly recognize the voice instruction of the user interaction operation according to the voice recognition technology of the dialect. The invention greatly reduces the dialect selection times of the intelligent television user and improves the experience of the user in voice operation.
Drawings
The embodiments of the invention will be further described with reference to the accompanying drawings, in which:
fig. 1 is a flowchart of a speech recognition method for a smart television according to a preferred embodiment of the present invention.
Fig. 2 is a schematic structure diagram of a speech recognition system of a smart television according to a preferred embodiment of the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the present invention clearer and clearer, the present invention is further described in detail below with reference to the accompanying drawings and examples. It should be understood that the specific embodiments described herein are merely illustrative of the invention and are not intended to limit the invention.
The flow of the speech recognition method for the smart television provided by the invention is as shown in fig. 1, the flow of the preferred embodiment of the speech recognition method for the smart television of the invention is as shown in fig. 1, and the speech recognition method for the smart television of the invention comprises the following implementation steps:
and step S100, the intelligent television receives a voice instruction of user interactive operation.
In a family using the smart television, accents of users who are members of the family are different from each other, even different dialects may be used, although the dialects can be recognized by the existing smart television in the voice recognition function, in the operation process, when the users use the dialects to interact with the smart television, the voice recognition technology of the smart television cannot directly determine the dialects of the users, and the dialects which are selected by the users need to be selected by the users, that is, the smart television cannot directly recognize the dialects of the users, so that voice recognition is performed. The method of the invention can directly receive dialect interactive voice instruction of the user in the process of using voice recognition of the intelligent television to carry out man-machine interactive operation, of course, as another preferred embodiment, the intelligent television can create a corresponding voiceprint characteristic file for the user in advance to automatically select the dialect of the user and directly carry out recognition, and before the intelligent television receives the voice instruction of the user interactive operation, the method can also comprise the following steps:
the method comprises the steps that the smart television creates a corresponding voiceprint feature file for each user in advance; and the user selects and confirms the dialect type in the corresponding voiceprint feature file.
The smart television establishes voice print characteristic files for family member users in advance according to respective dialects so as to ensure that a corresponding dialect voice recognition scheme can be directly selected for recognition according to the dialects in the subsequent smart television voice recognition process, and therefore, the user voice print characteristic files also need to be correspondingly selected for the dialects.
Step S200, the voiceprint recognition module determines the dialect type used by the user according to the voiceprint characteristics of the voice command of the user interactive operation.
Specifically, the voiceprint recognition module performs voiceprint recognition on a voice instruction of user interactive operation, and confirms the user according to a voiceprint feature file of the user established in the intelligent television in the process, so that the dialect used by the user can be directly determined. For example, a dialect speaking user (Cantonese) says "I want to watch XX programs" in front of the TV for the first time, and then the TV interface pops up various dialects according to the prior art: the dialect recognition results of the Cantonese, the Sichuan and the Hunan languages are given to the user, and the user needs to further judge the dialect to be the type of the Cantonese, and then the television can carry out the subsequent voice recognition operation. When the method is adopted, a dialect-speaking user (Guangdong dialect) says that the user wants to watch XX programs before a television for the first time, at the moment, various dialects cannot be popped up on a television interface for the user to select and confirm the dialect types before carrying out next voice recognition, and the user is confirmed through a voiceprint recognition module, directly selects the dialect types of the user to be matched, and then adopts a voice recognition scheme of the Guangdong dialect for recognition.
Certainly, as another preferred embodiment, the voiceprint recognition module can also be implemented by using a voiceprint recognition server connected to the smart television network, and the smart television can store more user voiceprint feature information by using the voiceprint recognition server connected to the smart television network.
Step S300, the voice recognition module directly converts the voice command of the user interactive operation into characters according to the dialect type used by the user so as to recognize the voice command of the user.
In the same way as the voiceprint recognition module, the voice recognition module can also be realized by adopting a voice recognition server connected with the intelligent television network, and in the same way, the intelligent television can store more voice recognition schemes by adopting the voice recognition server connected with the intelligent television network, and can also be continuously expanded and updated as required.
The method of the invention adopts the voiceprint feature recognition technology to distinguish the users in the family using the intelligent television, and directly carries out voice recognition according to the preset dialect of the user, thereby realizing the automatic dialect voice matching in the voice recognition process of the intelligent television.
Fig. 2 shows a schematic structure diagram of a preferred embodiment of the speech recognition system of the smart television, where the speech recognition system 60 of the smart television includes a speech receiving module 61, a voiceprint recognition module 62, and a speech recognition module 63.
The voice receiving module 61 is used for the smart television to receive a voice instruction of user interactive operation. In a family using the smart television, accents of users who are members of the family are different from each other, even different dialects may be used, although the dialects can be recognized by the existing smart television in the voice recognition function, in the operation process, when the users use the dialects to interact with the smart television, the voice recognition technology of the smart television cannot directly determine the dialects of the users, and the dialects which are selected by the users need to be selected by the users, that is, the smart television cannot directly recognize the dialects of the users, so that voice recognition is performed. The system of the present invention can directly receive dialect interactive voice instructions of the user during the process of performing human-computer interaction operation by using voice recognition of the smart television, and certainly, as another preferred embodiment, the smart television can create corresponding voiceprint feature files for the user in advance to automatically select the dialect of the user and directly perform recognition, that is, the system 60 further includes a user voiceprint feature module 64 for creating corresponding voiceprint feature files for each smart television user in advance and including dialect types corresponding to the voiceprint features of the user.
The smart television establishes voice print characteristic files for family member users in advance according to respective dialects so as to ensure that a corresponding dialect voice recognition scheme can be directly selected for recognition according to the dialects in the subsequent smart television voice recognition process, and therefore, the user voice print characteristic files also need to be correspondingly selected for the dialects.
The voiceprint recognition module 62 is configured to determine a voiceprint feature of the voice instruction of the user interaction operation received by the voice receiving module 61 and determine a dialect category used by the user.
Specifically, the voiceprint recognition module 62 performs voiceprint recognition on a voice instruction of user interactive operation, and confirms the user according to the voiceprint feature file of the user established in the smart television in the above process, so as to directly determine what dialect the user uses, unlike the prior art that when the smart television receives the interactive operation voice of the dialect of the user, the user needs to select the dialect again to perform voice recognition of the next step, the system of the present invention can directly confirm the voice recognition scheme according to the dialect of the user, thereby skipping the dialect selection process and improving the experience of the user in using the voice recognition technology. For example, a dialect speaking user (Cantonese) says "I want to watch XX programs" in front of the TV for the first time, and then the TV interface pops up various dialects according to the prior art: the dialect recognition results of the Cantonese, the Sichuan and the Hunan languages are given to the user, and the user needs to further judge the dialect to be the type of the Cantonese, and then the television can carry out the subsequent voice recognition operation. When the system is adopted, a dialect-speaking user (Guangdong dialect) says that the user wants to watch XX programs before a television for the first time, at the moment, various dialects cannot be popped up on a television interface for the user to select and confirm the dialect types before carrying out next voice recognition, and the user is confirmed through a voiceprint recognition module, directly selects the dialect types of the user to be matched, and then adopts a voice recognition scheme of the Guangdong dialect for recognition.
Certainly, as another preferred embodiment, the voiceprint recognition module can also be implemented by using a voiceprint recognition server connected to the smart television network, and the smart television can store more user voiceprint feature information by using the voiceprint recognition server connected to the smart television network.
The voice recognition module 63 is configured to directly convert the voice of the user into characters according to the dialect type corresponding to the voice command voiceprint feature of the user interaction operation recognized by the voiceprint recognition module 62, so as to recognize the voice command of the user.
In the same way as the voiceprint recognition module, the voice recognition module 63 can also be implemented by using a voice recognition server connected with the smart television network, and in the same way, the smart television can store more voice recognition schemes by using the voice recognition server connected with the smart television network, and can also be continuously expanded and updated as required.
The invention also provides a readable storage medium, wherein the readable storage medium stores a program for intelligent television voice recognition, and the steps of the intelligent television voice recognition method are realized when the program for intelligent television voice recognition is executed by a processor. The specific execution process of the program is the same as the preferred implementation of the above-mentioned speech recognition method for the smart television, and is not described herein again.
It should be understood that the above-mentioned embodiments are merely preferred examples of the present invention, and not restrictive, but rather, all the changes, substitutions, alterations and modifications that come within the spirit and scope of the invention as described above may be made by those skilled in the art, and all the changes, substitutions, alterations and modifications that fall within the scope of the appended claims should be construed as being included in the present invention.
Claims (10)
1. A speech recognition method for an intelligent television is used for the intelligent television to recognize dialects of users, and is characterized by comprising the following steps:
the intelligent television receives a voice instruction of user interactive operation;
the voice print recognition module determines the dialect type used by the user according to the voice print characteristics of the voice command of the user interactive operation;
the voice recognition module directly converts the voice instruction of the user interactive operation into characters according to the dialect type used by the user so as to recognize the voice instruction of the user.
2. The speech recognition method for the smart television set according to claim 1, wherein before the smart television set receives the speech command of the user interaction operation, the method further comprises the following steps:
the method comprises the steps that the smart television creates a corresponding voiceprint feature file for each user in advance;
and the user selects and confirms the dialect type in the corresponding voiceprint feature file.
3. The voice recognition method for the smart television as claimed in claim 2, wherein when the voiceprint recognition module determines that the voiceprint feature of the voice command of the user interactive operation is not in the corresponding voiceprint feature file previously created for each user by the smart television, the smart television newly creates the corresponding voiceprint feature file for the user with the voiceprint feature, and the user selects and confirms the dialect category in the corresponding voiceprint feature file.
4. The voice recognition method for the smart television as claimed in any one of claims 1 to 3, wherein the voiceprint recognition module can be implemented by a voiceprint recognition server connected to a smart television network.
5. The intelligent television voice recognition method according to any one of claims 1 to 3, wherein the voice recognition module is implemented by a voice recognition server connected to an intelligent television network.
6. A speech recognition system of an intelligent television is used for the intelligent television to recognize dialect of a user and is characterized by comprising a speech receiving module, a voiceprint recognition module and a speech recognition module;
the voice receiving module is used for receiving a voice instruction of user interactive operation;
the voice print recognition module is used for judging the voice print characteristics of the voice command of the user interactive operation received by the voice receiving module and determining the dialect type used by the user;
the voice recognition module is used for directly converting the voice of the user into characters according to the dialect type corresponding to the voice command voiceprint characteristics of the user interactive operation recognized by the voiceprint recognition module so as to recognize the voice command of the user.
7. The system according to claim 6, further comprising a user voiceprint feature module, configured to create a corresponding voiceprint feature file for each smart tv user in advance, and include a dialect category corresponding to the user voiceprint feature.
8. The system according to claim 7, wherein when the voiceprint recognition module determines that the voiceprint feature of the voice command of the user interactive operation is not the user voiceprint feature in the user voiceprint feature module, the user voiceprint feature module creates a corresponding voiceprint feature file for the user with the voiceprint feature, and determines the dialect type to be used correspondingly.
9. The intelligent television voice recognition system according to any one of claims 6 to 8, wherein the voiceprint recognition module can be implemented by a voiceprint recognition server connected to an intelligent television network; the voice recognition module can be realized by adopting a voice recognition server connected with an intelligent television network.
10. A readable storage medium, characterized in that the readable storage medium stores a program for smart tv voice recognition, and the program for smart tv voice recognition realizes the steps of the smart tv voice recognition method according to any one of claims 1 to 5 when being executed by a processor.
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910682661.XA CN112312181A (en) | 2019-07-26 | 2019-07-26 | Smart television voice recognition method, system and readable storage medium |
PCT/CN2020/103545 WO2021017978A1 (en) | 2019-07-26 | 2020-07-22 | Smart television speech recognition method, system and readable storage medium |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910682661.XA CN112312181A (en) | 2019-07-26 | 2019-07-26 | Smart television voice recognition method, system and readable storage medium |
Publications (1)
Publication Number | Publication Date |
---|---|
CN112312181A true CN112312181A (en) | 2021-02-02 |
Family
ID=74229363
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201910682661.XA Pending CN112312181A (en) | 2019-07-26 | 2019-07-26 | Smart television voice recognition method, system and readable storage medium |
Country Status (2)
Country | Link |
---|---|
CN (1) | CN112312181A (en) |
WO (1) | WO2021017978A1 (en) |
Citations (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR20100032140A (en) * | 2008-09-17 | 2010-03-25 | 주식회사 현대오토넷 | Method of interactive voice recognition and apparatus for interactive voice recognition |
CN102638605A (en) * | 2011-02-14 | 2012-08-15 | 苏州巴米特信息科技有限公司 | Speech system for recognizing dialect background mandarin |
US20140191949A1 (en) * | 2013-01-07 | 2014-07-10 | Samsung Electronics Co., Ltd. | Display apparatus and method of controlling a display apparatus in a voice recognition system |
CN105872687A (en) * | 2016-03-31 | 2016-08-17 | 乐视控股(北京)有限公司 | Method and device for controlling intelligent equipment through voice |
CN206117701U (en) * | 2016-09-30 | 2017-04-19 | 无锡小天鹅股份有限公司 | Domestic appliance and control system thereof |
CN107170454A (en) * | 2017-05-31 | 2017-09-15 | 广东欧珀移动通信有限公司 | Audio recognition method and Related product |
CN107580237A (en) * | 2017-09-05 | 2018-01-12 | 深圳Tcl新技术有限公司 | Operating method, device, system and the storage medium of TV |
CN108172223A (en) * | 2017-12-14 | 2018-06-15 | 深圳市欧瑞博科技有限公司 | Voice instruction recognition method, device and server and computer readable storage medium |
CN109785832A (en) * | 2018-12-20 | 2019-05-21 | 安徽声讯信息技术有限公司 | A kind of old man's set-top box Intelligent voice recognition method suitable for accent again |
Family Cites Families (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103593340B (en) * | 2013-10-28 | 2017-08-29 | 余自立 | Natural expressing information processing method, processing and response method, equipment and system |
CN104575504A (en) * | 2014-12-24 | 2015-04-29 | 上海师范大学 | Method for personalized television voice wake-up by voiceprint and voice identification |
CN106504754B (en) * | 2016-09-29 | 2019-10-18 | 浙江大学 | A kind of real-time method for generating captions according to audio output |
CN106847281A (en) * | 2017-02-26 | 2017-06-13 | 上海新柏石智能科技股份有限公司 | Intelligent household voice control system and method based on voice fuzzy identification technology |
CN107809667A (en) * | 2017-10-26 | 2018-03-16 | 深圳创维-Rgb电子有限公司 | Television voice exchange method, interactive voice control device and storage medium |
-
2019
- 2019-07-26 CN CN201910682661.XA patent/CN112312181A/en active Pending
-
2020
- 2020-07-22 WO PCT/CN2020/103545 patent/WO2021017978A1/en active Application Filing
Patent Citations (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR20100032140A (en) * | 2008-09-17 | 2010-03-25 | 주식회사 현대오토넷 | Method of interactive voice recognition and apparatus for interactive voice recognition |
CN102638605A (en) * | 2011-02-14 | 2012-08-15 | 苏州巴米特信息科技有限公司 | Speech system for recognizing dialect background mandarin |
US20140191949A1 (en) * | 2013-01-07 | 2014-07-10 | Samsung Electronics Co., Ltd. | Display apparatus and method of controlling a display apparatus in a voice recognition system |
CN105872687A (en) * | 2016-03-31 | 2016-08-17 | 乐视控股(北京)有限公司 | Method and device for controlling intelligent equipment through voice |
CN206117701U (en) * | 2016-09-30 | 2017-04-19 | 无锡小天鹅股份有限公司 | Domestic appliance and control system thereof |
CN107170454A (en) * | 2017-05-31 | 2017-09-15 | 广东欧珀移动通信有限公司 | Audio recognition method and Related product |
CN107580237A (en) * | 2017-09-05 | 2018-01-12 | 深圳Tcl新技术有限公司 | Operating method, device, system and the storage medium of TV |
CN108172223A (en) * | 2017-12-14 | 2018-06-15 | 深圳市欧瑞博科技有限公司 | Voice instruction recognition method, device and server and computer readable storage medium |
CN109785832A (en) * | 2018-12-20 | 2019-05-21 | 安徽声讯信息技术有限公司 | A kind of old man's set-top box Intelligent voice recognition method suitable for accent again |
Also Published As
Publication number | Publication date |
---|---|
WO2021017978A1 (en) | 2021-02-04 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US8064573B2 (en) | Computer generated prompting | |
US7783475B2 (en) | Menu-based, speech actuated system with speak-ahead capability | |
EP2521121B1 (en) | Method and device for voice controlling | |
US6963836B2 (en) | Speechdriven setting of a language of interaction | |
US20060159240A1 (en) | System and method of utilizing a hybrid semantic model for speech recognition | |
US7881938B2 (en) | Speech bookmarks in a voice user interface using a speech recognition engine and acoustically generated baseforms | |
US9807243B2 (en) | Method and system for voice transmission control | |
CA2785081A1 (en) | Method and system for processing multiple speech recognition results from a single utterance | |
US10535337B2 (en) | Method for correcting false recognition contained in recognition result of speech of user | |
JP2008506156A (en) | Multi-slot interaction system and method | |
KR20110127180A (en) | Systems and methods for interactively accessing hosted services using voice communications | |
CN109036406A (en) | A kind of processing method of voice messaging, device, equipment and storage medium | |
US20150030141A1 (en) | Automated response system | |
EP3157236A1 (en) | Method and device for quickly accessing ivr menu | |
KR20060014369A (en) | Speaker-dependent voice recognition method and voice recognition system | |
US7451086B2 (en) | Method and apparatus for voice recognition | |
CN112312181A (en) | Smart television voice recognition method, system and readable storage medium | |
US20060077967A1 (en) | Method to manage media resources providing services to be used by an application requesting a particular set of services | |
JP2005520194A (en) | Generating text messages | |
US6141661A (en) | Method and apparatus for performing a grammar-pruning operation | |
CN111292749B (en) | Session control method and device of intelligent voice platform | |
CN105118507A (en) | Sound control system and control method thereof | |
US8954325B1 (en) | Speech recognition in automated information services systems | |
CA2256781A1 (en) | Method and apparatus for automatically dialling a desired telephone number using speech commands | |
CN117831526A (en) | Control system and method for vehicle-mounted voice streaming dialogue |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20210202 |