WO2015135300A1 - 语音控制电视机的方法及其电视机 - Google Patents

语音控制电视机的方法及其电视机 Download PDF

Info

Publication number
WO2015135300A1
WO2015135300A1 PCT/CN2014/085329 CN2014085329W WO2015135300A1 WO 2015135300 A1 WO2015135300 A1 WO 2015135300A1 CN 2014085329 W CN2014085329 W CN 2014085329W WO 2015135300 A1 WO2015135300 A1 WO 2015135300A1
Authority
WO
WIPO (PCT)
Prior art keywords
voice
instruction
voice signal
user
instructions
Prior art date
Application number
PCT/CN2014/085329
Other languages
English (en)
French (fr)
Inventor
吴海龙
喻娟
陈维涛
Original Assignee
京东方科技集团股份有限公司
北京京东方显示技术有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 京东方科技集团股份有限公司, 北京京东方显示技术有限公司 filed Critical 京东方科技集团股份有限公司
Priority to US14/436,304 priority Critical patent/US20160277698A1/en
Publication of WO2015135300A1 publication Critical patent/WO2015135300A1/zh

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/41Structure of client; Structure of client peripherals
    • H04N21/422Input-only peripherals, i.e. input devices connected to specially adapted client devices, e.g. global positioning system [GPS]
    • H04N21/42204User interfaces specially adapted for controlling a client device through a remote control device; Remote control devices therefor
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N5/00Details of television systems
    • H04N5/44Receiver circuitry for the reception of television signals according to analogue transmission standards
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/06Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • G10L15/065Adaptation
    • G10L15/07Adaptation to the speaker
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/41Structure of client; Structure of client peripherals
    • H04N21/422Input-only peripherals, i.e. input devices connected to specially adapted client devices, e.g. global positioning system [GPS]
    • H04N21/42203Input-only peripherals, i.e. input devices connected to specially adapted client devices, e.g. global positioning system [GPS] sound input device, e.g. microphone
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47End-user applications
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N5/00Details of television systems
    • H04N5/44Receiver circuitry for the reception of television signals according to analogue transmission standards
    • H04N5/60Receiver circuitry for the reception of television signals according to analogue transmission standards for the sound signals
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/06Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • G10L15/063Training
    • G10L2015/0638Interactive procedures
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/223Execution procedure of a spoken command
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/41Structure of client; Structure of client peripherals
    • H04N21/422Input-only peripherals, i.e. input devices connected to specially adapted client devices, e.g. global positioning system [GPS]
    • H04N21/42204User interfaces specially adapted for controlling a client device through a remote control device; Remote control devices therefor
    • H04N21/42206User interfaces specially adapted for controlling a client device through a remote control device; Remote control devices therefor characterized by hardware details

Definitions

  • the present disclosure relates to a method of voice control a television set and a television set therefor. Background technique
  • Speech is the most direct way human can express naturally. Speech recognition is considered as the main development direction of human-computer interaction. With the development of speech recognition technology and the widespread use of TV sets, more and more TV sets use speech recognition.
  • the technology performs voice control.
  • the known voice recognition of a television set is to encode the collected user voice signal, and then extract the voice features in the voice signal after the encoding process, for example, audio, sound pressure, etc., and finally The extracted voice feature is compared with a pre-stored voice template, and a corresponding instruction is determined according to the comparison result.
  • the known speech recognition technology can only recognize the same speech signal as the pre-stored speech template language, or blur the speech signal with similar query language, but in practical applications, the language of the user and the language of the pre-stored speech template are often encountered. In the case of not similar or even different, for example, China is a multi-ethnic country. There are many local dialects. If the voice template is Mandarin, when the user uses dialect for voice control, it will lead to unrecognizable voice, some live in China. Foreigners are also unable to effectively use the TV voice control function. Summary of the invention
  • Embodiments of the present disclosure provide a method of voice-controlled television set and a television set thereof, which can improve a voice control function of a television set.
  • Embodiments of the present disclosure employ the following technical solutions.
  • a method for controlling a television set for a television comprising: collecting a first voice signal of a user; when the television cannot recognize the first voice signal, displaying an instruction interface, The instruction interface includes N instructions, so that the user selects a first instruction corresponding to the first voice signal, the first instruction is any one of the N instructions; according to a pre-established instruction-voice And correspondingly storing the first voice signal in the first voice group corresponding to the first instruction, where the first voice group includes all voice signals that trigger the first instruction.
  • the method before the collecting the first voice signal of the user, the method further includes: establishing the command-voice group correspondence, where the command-voice group correspondence is used to indicate the N fingers Corresponding to the N voice groups, each of the N commands corresponds to one voice group.
  • a standard voice signal is included in each of the voice groups, and the standard voice signal is generated by standard Mandarin recording.
  • the method before the collecting the first voice signal of the user, the method further includes: numbering the N instructions, so that each of the instructions corresponds to a number, so that the user inputs a number , select the instruction corresponding to the number.
  • a television set includes: a collecting unit, configured to collect a first voice signal of a user, and a display unit, configured to: when the television does not recognize the When the first voice signal is collected by the unit, the instruction interface is displayed, and the instruction interface includes
  • the storage unit is configured to: according to the pre-established instruction-voice group correspondence, The first voice signal collected by the set unit is stored in a first voice group corresponding to the first instruction, and the first voice group includes all voice signals that trigger the first instruction.
  • the television further includes: an establishing unit, configured to establish the command-to-speech group correspondence, where the command-to-speech group correspondence is used to indicate a correspondence between the N commands and N voice groups , causing each of the N instructions to correspond to one voice group.
  • an establishing unit configured to establish the command-to-speech group correspondence, where the command-to-speech group correspondence is used to indicate a correspondence between the N commands and N voice groups , causing each of the N instructions to correspond to one voice group.
  • a standard voice signal is included in each of the voice groups, and the standard voice signal is generated by standard Mandarin recording.
  • the television set further includes: a numbering unit, configured to number the N instructions, such that each of the instructions corresponds to a number, so that the user selects the number corresponding to the number by inputting a number instruction.
  • a numbering unit configured to number the N instructions, such that each of the instructions corresponds to a number, so that the user selects the number corresponding to the number by inputting a number instruction.
  • a method for voice-controlled television set and a television set thereof first collects a first voice signal of a user, and then determines whether the first voice signal can be identified, and when the television cannot identify the first
  • an instruction interface is displayed, the instruction interface includes N instructions, so that the user selects the first instruction, the first instruction is any one of the N instructions, after the user selects the first instruction And executing the first instruction and saving the first voice signal in the first voice group corresponding to the first instruction according to the pre-established command-voice group correspondence, when the voice command of the next time is the first voice signal, the television
  • the machine can recognize the operation of the first instruction by the user, and execute the first instruction after the recognition, complete the voice control process of the user, and perfect the voice control function of the television set compared with the known technology.
  • FIG. 1 is a flowchart of a method for voice-controlled television set according to an embodiment of the present disclosure
  • FIG. 2 is a flowchart of another method for voice-controlled television set according to an embodiment of the present disclosure
  • FIG. 4 is a schematic structural diagram of another television set according to an embodiment of the present disclosure.
  • FIG. 5 is a schematic structural diagram of still another television provided by an embodiment of the present disclosure. detailed description
  • An embodiment of the present disclosure provides a method for voice control of a television set, as shown in FIG. 1, for a television, comprising:
  • Step 101 Collect the first voice signal of the user.
  • the television When the television receives the user's voice control, it first needs to receive the user's voice command, which is the first voice signal that the television needs to collect. Since the voice command issued by the user of the television set can be any language or any dialect, the first voice signal collected by the television set can be any language or any dialect.
  • Step 102 When the television does not recognize the first voice signal, display an instruction interface, where the instruction interface includes N instructions, so that the user selects a first instruction corresponding to the first voice signal.
  • the first instruction is any one of the N instructions.
  • the television set is collected into the first voice signal, it is first determined whether the television can recognize the first voice signal, and the voice recognition of the first voice signal is the same as the voice recognition process of the known technology.
  • the embodiment does not mention this.
  • the television cannot recognize the first voice signal, the television cannot perform the voice control process of the user, and the television displays An instruction interface, wherein the instruction interface can display N instructions, wherein the N instructions are all executable instructions of the television, and the instruction interface in the actual application can also display the television according to the first voice signal
  • the resulting M users may need an instruction that is less than or equal to N.
  • the user selects a required first instruction among the N instructions displayed by the instruction interface, and the first instruction is any one of the N instructions.
  • the user can use the remote controller to move the to-be-identified identifier. Go to the first instruction, and then select the first instruction by the confirmation key, or all the executable instructions of the television set may be initialized at the time of initialization, and then the user selects the number by using the number key of the remote controller to select the number corresponding to the first instruction. An instruction.
  • Step 103 Save the first voice signal in a first voice group corresponding to the first instruction according to a pre-established command-to-speech group correspondence, where the first voice group includes all triggering the first instruction voice signal.
  • the command-to-speech group correspondence is pre-established, and is used to indicate a correspondence between the N commands and the N voice groups, so that each of the N commands corresponds to one voice group, and each voice group All voice signals capable of triggering an instruction corresponding to the voice group are included.
  • the instruction selected by the user is the first instruction
  • the instruction corresponding to the first voice signal collected by the television set is the first instruction
  • the television executes the first instruction, and according to the pre-established instruction-voice group correspondence relationship, Saving the first voice signal in a first voice group corresponding to the first instruction, where the first voice group includes all voice signals capable of triggering the first instruction, and when the voice control is performed next time, if the voice command of the user is The first voice signal
  • the television can recognize that the user needs to perform the operation of the first instruction, and after executing the first instruction, complete the voice control process of the user.
  • the command interface can be displayed, and the command interface includes N commands, and the user can select the TV as needed. Executing the first instruction, then the television executes the first instruction, and saves the first voice signal in the first voice group corresponding to the first instruction according to the pre-established instruction-voice group correspondence, so that the user passes the first A voice signal triggers the first command, which improves the voice control function of the television set compared to known techniques.
  • the television set needs to establish the command-voice group correspondence, where the command-voice group correspondence is used to indicate the N commands and N voices.
  • the correspondence of the groups is such that each of the N instructions corresponds to one voice group. For example, suppose N is 4, and the four instructions are "play”, "pause”, “fast forward” and “rewind” respectively. If "play" is the first command, the corresponding voice group is the first voice group.
  • the first voice group includes M
  • the voice signal collected by the television set is any one of the M voice signals, and the television may be triggered to perform the playing action.
  • a standard voice signal may be recorded for the N executable instructions of the television set, where each voice group corresponding to the instruction of the television set includes a standard voice signal, that is, any one of the command units.
  • the corresponding voice group includes a standard voice signal capable of triggering the command.
  • the standard voice signal is generated by standard Mandarin recording.
  • the N instructions may be numbered, such that each of the instructions corresponds to a number, so that the user selects a corresponding according to the number. Instructions.
  • the method for voice control of a television set first collects a first voice signal of a user, and then determines whether the first voice signal can be recognized. When the television does not recognize the first voice signal, Displaying an instruction interface, where the instruction interface includes N instructions, so that the user selects a first instruction corresponding to the first voice signal, and the first instruction is any one of the N instructions, when After the first instruction is selected by the user, the first instruction is executed, and the first voice signal is saved in the first voice group corresponding to the first instruction according to the pre-established command-voice group correspondence, when the voice command of the next user is the first When a voice signal is used, the television can recognize that the user needs to perform the operation of the first instruction, and executes the first instruction after the recognition, completes the voice control process of the user, and improves the voice control of the television set compared with the known technology.
  • An embodiment of the present disclosure provides a method for voice control of a television set. As shown in FIG. 2, the method includes the following steps: Step 201: Acquire N instructions of a television set, and perform step 202.
  • TV sets can usually execute more and more instructions, so it is first necessary to obtain N commands that the TV can execute.
  • Step 202 Establish a command-to-speech group correspondence, where the command-to-speech group correspondence is used to indicate a correspondence between the N commands and N voice groups, so that each of the N commands corresponds to one voice. Group, go to step 203.
  • the television set After acquiring the N commands of the television set, the television set needs to set N voice groups for the N commands, and establish an instruction-voice group correspondence, where the command-voice group correspondence is used to indicate the N
  • the command is associated with the N voice groups, such that each of the N commands corresponds to one voice group, and each voice group includes all voice signals capable of triggering an instruction corresponding to the voice group. For example, suppose N is 4, and the four instructions are "play”, "pause”, "fast forward”, and “fast reverse” respectively, then the television needs to set 4 voice groups, corresponding to 4 instructions, for example, If For the first instruction, the corresponding voice group is the first voice group, and the first voice group includes M voice signals. When the user performs voice control, the voice signal collected by the television set is the M voice signals. Any one of the voice signals can trigger the TV to perform the playback.
  • Step 203 Record a standard voice signal for each of the N voice groups, and perform step 204.
  • a standard voice signal can be recorded for each of the N voice groups, for example, the first standard voice signal is recorded in Mandarin, and the first standard voice signal is saved in the first voice group, thus
  • the television can recognize the voice command of the user, and can execute the corresponding first command according to the voice command.
  • Step 204 Collect the first voice signal of the user, and perform step 205.
  • the television When the television receives the user's voice control, it first needs to receive the user's voice command, which is the first voice signal that the television needs to collect. Since the voice command issued by the user of the television set can be any language or any dialect, the first voice signal collected by the television set can be any language or any dialect.
  • Step 205 Determine whether the first voice signal can be recognized. When the television cannot recognize the first voice signal, perform step 206. When the television can recognize the first voice signal, perform step 208.
  • a first voice recognition of another speech signal 1 J can chip ASR M08 like the first speech recognition chip for speech recognition by the speech signal chip LD3320,
  • the process of speech recognition is the same as the known technology, which is not described in detail in the embodiments of the present disclosure.
  • Step 206 Display an instruction interface, so that the user selects a first instruction corresponding to the first voice signal, the instruction interface includes N instructions, and step 207 is performed.
  • the instruction interface may be displayed, and the instruction interface may display N instructions, where the N instructions are all executable instructions of the television, in actual application
  • the command interface may also display an instruction that the M user may need to perform screening according to the first voice signal, and the M is less than or equal to N.
  • the user can select a desired first instruction among the N instructions displayed by the instruction interface, and the first instruction is any one of the N instructions.
  • the user can use the remote controller to identify the identifier to be confirmed.
  • step 207 the first voice signal is saved in the first voice group corresponding to the first instruction according to the pre-established command-to-speech group correspondence, and the first voice group includes all triggering the first instruction. For the voice signal, go to step 208.
  • the instruction selected by the user is the first instruction
  • the instruction corresponding to the first voice signal collected by the television set is the first instruction
  • the television saves the first voice signal in the first voice corresponding to the first instruction.
  • the first voice group includes all voice signals capable of triggering the first instruction
  • the television can recognize that the user needs to perform the operation of the first instruction, and recognizes
  • the first instruction is then executed to complete the user's voice control process. For example, when the first instruction selected by the user is “playing”, the instruction corresponding to the first voice signal is “playing”, and the television saves the first voice signal collected by the television in the voice group corresponding to the instruction “play”.
  • the voice control is next performed, if the user's voice command is the first voice signal, the television can recognize and execute the command "play".
  • Step 208 executing the first instruction.
  • the television can recognize the first voice signal collected, the first instruction corresponding to the first voice signal can be executed.
  • the method for voice control of a television set first collects a first voice signal of a user, and then determines whether the first voice signal can be recognized. When the television does not recognize the first voice signal, Displaying an instruction interface, where the instruction interface includes N instructions, so that the user selects a first instruction corresponding to the first voice signal, and the first instruction is any one of the N instructions, when After the first instruction is selected by the user, the first instruction is executed, and the first voice signal is saved in the first voice group corresponding to the first instruction according to the pre-established command-voice group correspondence, when the voice command of the next user is the first When a voice signal is used, the television can recognize that the user needs to perform the operation of the first instruction, and executes the first instruction after the recognition, completes the voice control process of the user, and improves the voice control of the television set compared with the known technology.
  • An embodiment of the present disclosure provides a television set 30.
  • the television set includes: a collecting unit 301, and the collecting unit 301 is configured to collect a first voice signal of a user.
  • the display unit 302 is configured to display an instruction interface when the television set 30 cannot recognize the first voice signal collected by the collection unit 301, where the instruction interface includes N instructions, so as to facilitate the user. And selecting a first instruction corresponding to the first voice signal, the first instruction being any one of the N instructions.
  • the storage unit 303 is configured to save, according to the pre-established command-to-speech group correspondence, the first voice signal that is collected by the collection unit 301 in the first voice group corresponding to the first instruction, where The first voice group includes all voice signals that trigger the first instruction.
  • the instruction interface can be displayed through the display unit, and the instruction interface includes N instructions, and the user can Selecting a first instruction that needs to be executed by the television set according to the need, and then the television set executes the first instruction, and saves the first voice signal in the first voice corresponding to the first instruction by the storage unit according to the preset instruction-voice group correspondence relationship.
  • the voice control function of the television set is improved compared to the known technology.
  • the television set 30 further includes:
  • the establishing unit 304 is configured to establish the command-to-speech group correspondence, where the command-to-speech group correspondence is used to indicate a correspondence between the N commands and N voice groups, so that each of the N commands
  • the instructions correspond to a voice group. For example, suppose N is 4, and the four instructions are "play”, “pause”, “fast forward” and “rewind” respectively. If “play” is the first command, the corresponding voice group is the first voice group.
  • the first voice group includes M voice signals, and when the user performs voice control, the voice signal collected by the television set is any one of the M voice signals, and the television may be triggered to perform the play operation. .
  • a standard voice signal may be recorded for N executable instructions of the television, that is, each of the N voice groups includes a standard voice signal, where the standard voice signal is Generated by standard Mandarin recording.
  • the television set 30 further includes: a numbering unit 305, configured to number the N instructions such that each of the instructions corresponds to a number, so that the user selects according to the number. Corresponding instructions.
  • the television provided by the embodiment of the present disclosure can first collect the first voice signal of the user, and then determine whether the first voice signal can be recognized.
  • the instruction interface is displayed.
  • the instruction interface includes N instructions to facilitate selection and location of the user a first instruction corresponding to the first voice signal, the first instruction is any one of the N instructions, after the user selects the first instruction, executing the first instruction and according to the pre-established instruction-voice group Corresponding relationship, the first voice signal is saved in the first voice group corresponding to the first instruction, and when the voice command of the next time is the first voice signal, the television can recognize that the user needs to perform the operation of the first instruction, and After the recognition, the first instruction is executed to complete the user's voice control process, and the voice control function of the television is improved compared to the known technology.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Human Computer Interaction (AREA)
  • Health & Medical Sciences (AREA)
  • Computational Linguistics (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Artificial Intelligence (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
  • User Interface Of Digital Computer (AREA)
  • Selective Calling Equipment (AREA)

Abstract

一种语音控制电视机的方法及其电视机,所述语音控制电视机的方法包括:采集用户的第一语音信号;当所述电视机无法识别所述第一语音信号时,显示指令界面,所述指令界面包括N个指令,以便于所述用户选择与所述第一语音信号相对应的第一指令,所述第一指令为所述N个指令中任意一个指令;根据预先建立的指令-语音组对应关系,将所述第一语音信号保存在所述第一指令对应的第一语音组中,所述第一语音组中包括触发第一指令的所有语音信号。本公开的语音控制电视机的方法及电视机能够完善电视机的语音控制功能。

Description

语音控制电视机的方法及其电视机 技术领域
本公开涉及一种语音控制电视机的方法及其电视机。 背景技术
语音是人类可以自然表达的最直接方式, 语音识别被认为是人机交互 的主要发展方向, 随着语音识别技术的发展和电视机的广泛使用, 有越来 越多的电视机釆用语音识别技术进行语音控制, 已知的电视机的语音识别 是将釆集到的用户语音信号进行编码处理, 然后提取经过编码处理之后的 语音信号中的语音特征, 例如, 声频、 声压等, 最后将提取的语音特征与 预先存储的语音模板进行比较,根据比较结果决定是否执行相对应的指令。
已知的语音识别技术只能识别与预先存储的语音模板语言相同的语音 信号, 或模糊查询语言相近的语音信号, 但实际应用中, 常常会遇到用户 的语言与预先存储的语音模板的语言不相近甚至不相同的情况, 示例的, 中国是一个多民族的国家, 各地方言很多, 若语音模板是普通话, 当用户 使用方言进行语音控制时, 会导致语音无法识别的情况, 一些在中国生活 的外国人, 也无法有效的使用电视机语音控制的功能。 发明内容
本公开的实施例提供一种语音控制电视机的方法及其电视机, 能够完善 电视机的语音控制功能。
本公开的实施例釆用如下技术方案。
一方面, 提供一种语音控制电视机的方法, 用于电视机, 包括: 釆集用 户的第一语音信号; 当所述电视机无法识别所述第一语音信号时, 显示指令 界面, 所述指令界面包括 N个指令, 以便于所述用户选择与所述第一语音信 号相对应的第一指令, 所述第一指令为所述 N个指令中任意一个指令; 根据 预先建立的指令-语音组对应关系,将所述第一语音信号保存在所述第一指令 对应的第一语音组中,所述第一语音组中包括触发第一指令的所有语音信号。
可选地, 在所述釆集用户的第一语音信号之前, 所述方法还包括: 建立 所述指令 -语音组对应关系, 所述指令 -语音组对应关系用于指示所述 N个指 令与 N个语音组的对应关系,使得所述 N个指令中的每个指令对应一个语音 组。
可选地, 每个所述语音组中包括标准语音信号, 所述标准语音信号是由 标准普通话录制生成的。
可选地, 在所述釆集用户的第一语音信号之前, 所述方法还包括: 为所 述 N个指令进行编号, 使得每个所述指令对应一个数字, 以便于所述用户通 过输入数字, 选择所述数字对应的指令。
一方面, 提供一种电视机, 所述电视机包括: 釆集单元, 所述釆集单元 用于釆集用户的第一语音信号; 显示单元, 用于当所述电视机无法识别所述 釆集单元釆集到的所述第一语音信号时, 显示指令界面, 所述指令界面包括
N个指令, 以便于所述用户选择第一指令, 所述第一指令为所述 N个指令中 任意一个指令; 存储单元, 用于根据预先建立的指令 -语音组对应关系, 将所 述釆集单元釆集到的所述第一语音信号保存在所述第一指令对应的第一语音 组中, 所述第一语音组中包括触发第一指令的所有语音信号。
可选地, 所述电视机还包括: 建立单元, 用于建立所述指令 -语音组对应 关系, 所述指令 -语音组对应关系用于指示所述 N个指令与 N个语音组的对 应关系, 使得所述 N个指令中的每个指令对应一个语音组。
可选地, 每个所述语音组中包括标准语音信号, 所述标准语音信号是由 标准普通话录制生成的。
可选地,所述电视机还包括: 编号单元,用于为所述 N个指令进行编号, 使得每个所述指令对应一个数字, 以便于所述用户通过输入数字, 选择所述 数字对应的指令。
本公开的实施例提供的语音控制电视机的方法及其电视机, 首先釆集用 户的第一语音信号, 然后判断能否识别该第一语音信号, 当所述电视机无法 识别所述第一语音信号时, 显示指令界面, 所述指令界面包括 N个指令, 以 便于所述用户选择第一指令,所述第一指令为所述 N个指令中任意一个指令, 当用户选择第一指令之后,执行该第一指令并根据预先建立的指令-语音组对 应关系, 将第一语音信号保存在第一指令对应的第一语音组中, 当下次用户 的语音指令为第一语音信号时, 电视机能够识别出用户需要进行第一指令的 操作, 并在识别之后执行第一指令, 完成用户的语音控制过程, 相较于已知 的技术, 完善了电视机的语音控制功能。 附图说明
为了更清楚地说明本公开的实施例或已知的技术中的技术方案, 下面将 对实施例或已知的技术描述中所需要使用的附图作简单地介绍,显而易见地, 下面描述中的附图仅仅是本公开的一些实施例, 对于本领域普通技术人员来 讲, 在不付出创造性劳动的前提下, 还可以根据这些附图获得其他的附图。
图 1为本公开的实施例提供的一种语音控制电视机的方法流程图; 图 2为本公开的实施例提供的另一种语音控制电视机的方法流程图; 图 3为本公开的实施例提供的一种电视机的结构示意图;
图 4为本公开的实施例提供的另一种电视机的结构示意图;
图 5为本公开的实施例提供的又一种电视机的结构示意图。 具体实施方式
下面将结合本公开的实施例中的附图, 对本公开的实施例中的技术方案 进行清楚、 完整地描述, 显然, 所描述的实施例仅仅是本公开的一部分实施 例, 而不是全部的实施例。 基于本公开中的实施例, 本领域普通技术人员在 没有做出创造性劳动前提下所获得的所有其他实施例, 都属于本公开保护的 范围。
本公开的实施例提供一种语音控制电视机的方法, 如图 1 所示, 用于电 视机, 包括:
步骤 101, 釆集用户的第一语音信号。
电视机在接受用户语音控制时, 首先需要接收用户的语音指令, 该语音 指令即为电视机需要釆集的第一语音信号。 由于电视机的用户发出的语音命 令可以是任何一种语言或任何一种方言, 所以电视机釆集到的第一语音信号 也可以是任意一种语言或者任意一种方言。
步骤 102, 当所述电视机无法识别所述第一语音信号时, 显示指令界面, 所述指令界面包括 N个指令, 以便于所述用户选择与所述第一语音信号相对 应的第一指令, 所述第一指令为所述 N个指令中任意一个指令。
例如, 电视机釆集到第一语音信号之后, 首先判断所述电视机能否识别 出第一语音信号, 所述对第一语音信号的语音识别与已知的技术的语音识别 过程一样, 本公开的实施例对此不做赞述。 当所述电视机无法识别所述第一 语音信号时, 该电视机就无法进行用户的语音控制过程, 这时该电视机显示 指令界面, 所述指令界面可以显示 N个指令, 所述 N个指令是所述电视机的 所有可执行的指令, 实际应用中指令界面也可以显示所述电视机根据所述第 一语音信号进行筛选得出的 M个用户可能需要的指令,所述 M小于或等于 N。 用户在该指令界面所显示的 N个指令中选择所需的第一指令, 所述第一指令 为所述 N个指令中的任意一个指令, 通常的, 用户可以利用遥控器将待确认 标识移动到所述第一指令, 然后通过确认键选择第一指令, 也可以在初始化 时为电视机的所有可执行指令编号, 然后用户通过利用遥控器的数字按键选 择第一指令对应的编号来选择第一指令。
步骤 103, 根据预先建立的指令 -语音组对应关系, 将所述第一语音信号 保存在所述第一指令对应的第一语音组中, 所述第一语音组中包括触发第一 指令的所有语音信号。
所述指令 -语音组对应关系是预先建立的,用于指示所述 N个指令与 N个 语音组的对应关系, 使得所述 N个指令中的每个指令对应一个语音组, 每个 语音组中包括能够触发该语音组对应的指令的所有语音信号。 当用户选择的 指令为第一指令时, 说明电视机釆集到的第一语音信号所对应的指令为第一 指令, 电视机执行第一指令, 并根据预先建立的指令 -语音组对应关系, 将所 述第一语音信号保存在第一指令对应的第一语音组中, 该第一语音组中包括 能够触发第一指令的所有语音信号, 当下次进行语音控制时, 若用户的语音 指令为第一语音信号, 则电视机能够识别出用户需要进行第一指令的操作, 并在识别之后执行第一指令, 完成用户的语音控制过程。
这样一来, 当电视机无法识别釆集到的第一语音信号, 即电视机无法识 别用户的语音指令时, 能够显示指令界面, 该指令界面包括 N个指令, 用户 可以根据需要选择需要电视机执行的第一指令, 然后电视机执行第一指令, 并根据预先建立的指令-语音组对应关系, 将第一语音信号保存在第一指令对 应的第一语音组中, 以便于用户再次通过第一语音信号触发第一指令, 相较 于已知的技术, 完善了电视机的语音控制功能。
例如, 在所述釆集用户的第一语音信号之前, 该电视机还需要建立所述 指令 -语音组对应关系,所述指令 -语音组对应关系用于指示所述 N个指令与 N 个语音组的对应关系, 使得所述 N个指令中的每个指令对应一个语音组。 示 例的, 假设 N为 4, 该 4个指令分别为 "播放"、 "暂停"、 "快进"和 "快退", 若 "播放" 为第一指令, 对应的语音组为第一语音组, 第一语音组中包括 M 个语音信号, 则当用户进行语音控制时, 电视机釆集到的语音信号为该 M个 语音信号中的任意一个语音信号, 均可触发电视机执行播放的动作。
可选地, 在初始化时, 可以为电视机的 N个可执行指令录制标准语音信 号, 该电视机的指令对应的 N个语音组中每个语音组中包括标准语音信号, 即任意一个指令所对应的语音组中包括一个能够触发该指令的标准语音信 号, 通常情况下, 所述标准语音信号是由标准普通话录制生成的。
可选地, 在所述釆集用户的第一语音信号之前, 还可以为所述 N个指令 进行编号, 使得每个所述指令对应一个数字, 以便于所述用户根据所述数字, 选择对应的指令。
本公开的实施例提供的语音控制电视机的方法, 首先釆集用户的第一语 音信号, 然后判断能否识别该第一语音信号, 当所述电视机无法识别所述第 一语音信号时, 显示指令界面, 所述指令界面包括 N个指令, 以便于所述用 户选择与所述第一语音信号相对应的第一指令, 所述第一指令为所述 N个指 令中任意一个指令, 当用户选择第一指令之后, 执行该第一指令并根据预先 建立的指令 -语音组对应关系, 将第一语音信号保存在第一指令对应的第一语 音组中, 当下次用户的语音指令为第一语音信号时, 电视机能够识别出用户 需要进行第一指令的操作, 并在识别之后执行第一指令, 完成用户的语音控 制过程, 相较于已知的技术, 完善了电视机的语音控制功能。
本公开的实施例提供一种语音控制电视机的方法, 如图 2所示, 包括: 步骤 201, 获取电视机的 N个指令, 执行步骤 202。
随着电视机的发展, 通常情况下, 电视机可以执行的指令也越来越多, 因此首先需要获取电视机可以执行的 N个指令。
步骤 202, 建立指令 -语音组对应关系, 所述指令 -语音组对应关系用于指 示所述 N个指令与 N个语音组的对应关系,使得所述 N个指令中的每个指令 对应一个语音组, 执行步骤 203。
在获取到电视机的 N个指令之后,电视机需要为所述 N个指令设置 N个 语音组, 并建立指令 -语音组对应关系, 所述指令-语音组对应关系用于指示所 述 N个指令与 N个语音组的对应关系,使得所述 N个指令中的每个指令对应 一个语音组, 每个语音组中包括所有能够触发该语音组对应的指令的语音信 号。 示例的, 假设 N为 4, 该 4个指令分别为 "播放"、 "暂停"、 "快进" 和 "快退", 则电视机需要设置 4个语音组, 分别对应 4个指令, 例如, 若 "播 放" 为第一指令, 对应的语音组为第一语音组, 第一语音组中包括 M个语音 信号, 则当用户进行语音控制时, 电视机釆集到的语音信号为该 M个语音信 号中的任意一个语音信号, 均可触发电视机执行播放的动作。
步骤 203, 为 N个语音组中的每个语音组录制标准语音信号, 执行步骤 204。
例如, 可以为所述 N个语音组中的每个语音组录制标准语音信号, 例如, 用普通话录制第一标准语音信号, 并将该第一标准语音信号保存在第一语音 组, 这样一来, 当用户用普通话输入语音指令时, 该电视机能够识别用户的 语音指令, 并能根据该语音指令执行对应的第一指令。
步骤 204, 釆集用户的第一语音信号, 执行步骤 205。
电视机在接受用户语音控制时, 首先需要接收用户的语音指令, 该语音 指令即为电视机需要釆集的第一语音信号。 由于电视机的用户发出的语音命 令可以是任何一种语言或任何一种方言, 所以电视机釆集到的第一语音信号 也可以是任意一种语言或者任意一种方言。
步骤 205,判断是否能够识别所述第一语音信号, 当电视机无法识别该第 一语音信号时, 执行步骤 206; 当电视机能够识别该第一语音信号时, 执行步 骤 208。
通常的, 电视机釆集到第一语音信号之后, 对第一语音信号进行语音识 另1 J, 示例的, 可以通过芯片 LD3320、 芯片 ASR M08等语音识别芯片对第一 语音信号进行语音识别, 所述语音识别的过程与已知的技术相同, 本公开实 施例对此不作详述。
步骤 206,显示指令界面, 以便于所述用户选择与所述第一语音信号相对 应的第一指令, 所述指令界面包括 N个指令, 执行步骤 207。
当电视机无法识别釆集到的第一语音信号时, 可以显示指令界面, 所述 指令界面可以显示 N个指令, 所述 N个指令是所述电视机的所有可执行的指 令, 实际应用中指令界面也可以显示所述电视机根据所述第一语音信号进行 筛选得出的 M个用户可能需要的指令, 所述 M小于或等于 N。用户可以在该 指令界面所显示的 N个指令中选择所需的第一指令, 所述第一指令为所述 N 个指令中的任意一个指令, 通常的, 用户可以利用遥控器将待确认标识移动 到所述第一指令, 然后通过确认键选择第一指令, 也可以在初始化时为电视 机的所有可执行指令编号, 然后用户通过利用遥控器的数字按键选择第一指 令对应的编号来选择第一指令。
例如, 假设 N为 4, 该 4个指令分别为 "播放"、 "暂停"、 "快进"和 "快 退", 则指令界面显示 "播放"、 "暂停"、 "快进" 和 "快退" 这四个指令, 以 便于用户选择与所述第一语音信号相对应的第一指令, 假设第一语音信号对 应的第一指令为 "播放"。
步骤 207, 根据预先建立的指令 -语音组对应关系, 将所述第一语音信号 保存在所述第一指令对应的第一语音组中, 所述第一语音组中包括触发第一 指令的所有语音信号, 执行步骤 208。
当用户选择的指令为第一指令时, 说明电视机釆集到的第一语音信号所 对应的指令为第一指令, 电视机将所述第一语音信号保存在第一指令对应的 第一语音组中, 该第一语音组中包括能够触发第一指令的所有语音信号, 当 下次用户的语音指令为第一语音信号时, 电视机能够识别出用户需要进行第 一指令的操作, 并在识别之后执行第一指令, 完成用户的语音控制过程。 例 如, 当用户选择的第一指令为 "播放" 时, 说明第一语音信号对应的指令为 "播放", 电视机将釆集到的第一语音信号保存在指令 "播放" 对应的语音组 中, 当下次进行语音控制时, 若用户的语音指令为第一语音信号, 电视机能 够识别并执行指令 "播放"。
步骤 208, 执行第一指令。
例如, 当电视机能够识别釆集到的第一语音信号时, 可以执行该第一语 音信号对应的第一指令。
本公开的实施例提供的语音控制电视机的方法, 首先釆集用户的第一语 音信号, 然后判断能否识别该第一语音信号, 当所述电视机无法识别所述第 一语音信号时, 显示指令界面, 所述指令界面包括 N个指令, 以便于所述用 户选择与所述第一语音信号相对应的第一指令, 所述第一指令为所述 N个指 令中任意一个指令, 当用户选择第一指令之后, 执行该第一指令并根据预先 建立的指令-语音组对应关系, 将第一语音信号保存在第一指令对应的第一语 音组中, 当下次用户的语音指令为第一语音信号时, 电视机能够识别出用户 需要进行第一指令的操作, 并在识别之后执行第一指令, 完成用户的语音控 制过程, 相较于已知的技术, 完善了电视机的语音控制功能。
本公开的实施例提供一种电视机 30, 如图 3所示, 所述电视机包括: 釆集单元 301, 所述釆集单元 301用于釆集用户的第一语音信号。 显示单元 302, 用于当所述电视机 30无法识别所述釆集单元 301釆集到 的所述第一语音信号时, 显示指令界面, 所述指令界面包括 N个指令, 以便 于所述用户选择与所述第一语音信号相对应的第一指令, 所述第一指令为所 述 N个指令中任意一个指令。
存储单元 303, 用于根据预先建立的指令 -语音组对应关系, 将所述釆集 单元 301 釆集到的所述第一语音信号保存在所述第一指令对应的第一语音组 中, 所述第一语音组中包括触发第一指令的所有语音信号。
这样一来, 当电视机无法识别釆集单元釆集到的第一语音信号, 即电视 机无法识别用户的语音指令时, 能够通过显示单元显示指令界面, 该指令界 面包括 N个指令, 用户可以根据需要选择需要电视机执行的第一指令, 然后 电视机执行第一指令, 并根据预先建立的指令 -语音组对应关系, 通过存储单 元将第一语音信号保存在第一指令对应的第一语音组中, 以便于用户再次通 过第一语音信号触发第一指令, 相较于已知的技术, 完善了电视机的语音控 制功能。
进一步地, 如图 4所示, 所述电视机 30还包括:
建立单元 304, 用于建立所述指令 -语音组对应关系, 所述指令-语音组对 应关系用于指示所述 N个指令与 N个语音组的对应关系,使得所述 N个指令 中的每个指令对应一个语音组。 示例的,假设 N为 4, 该 4个指令分别为 "播 放"、 "暂停"、 "快进" 和 "快退", 若 "播放" 为第一指令, 对应的语音组为 第一语音组, 第一语音组中包括 M个语音信号, 则当用户进行语音控制时, 电视机釆集到的语音信号为该 M个语音信号中的任意一个语音信号, 均可触 发电视机执行播放的动作。
可选地, 在初始化时, 可以为电视机的 N个可执行指令录制标准语音信 号, 即所述 N个语音组中的每个所述语音组中包括标准语音信号, 所述标准 语音信号是由标准普通话录制生成的。
如图 5所示, 所述电视机 30还包括: 编号单元 305, 用于为所述 N个指 令进行编号, 使得每个所述指令对应一个数字, 以便于所述用户根据所述数 字, 选择对应的指令。
本公开的实施例提供的电视机, 能够首先釆集用户的第一语音信号, 然 后判断能否识别该第一语音信号, 当所述电视机无法识别所述第一语音信号 时, 显示指令界面, 所述指令界面包括 N个指令, 以便于所述用户选择与所 述第一语音信号相对应的第一指令, 所述第一指令为所述 N个指令中任意一 个指令, 当用户选择第一指令之后, 执行该第一指令并根据预先建立的指令- 语音组对应关系, 将第一语音信号保存在第一指令对应的第一语音组中, 当 下次用户的语音指令为第一语音信号时, 电视机能够识别出用户需要进行第 一指令的操作, 并在识别之后执行第一指令, 完成用户的语音控制过程, 相 较于已知的技术, 完善了电视机的语音控制功能。
以上所述, 仅为本公开的示例性实施方式, 但本公开的保护范围并不局 限于此, 任何熟悉本技术领域的技术人员在本公开揭露的技术范围内, 可轻 易想到变化或替换, 都应涵盖在本公开的保护范围之内。 因此, 本公开的保 护范围应以所述权利要求的保护范围为准。
本申请要求于 2014年 3月 14日递交的中国专利申请第 201410095779.X 号的优先权, 在此全文引用上述中国专利申请公开的内容以作为本申请的一 部分。

Claims

权利要求书
1、 一种语音控制电视机的方法, 用于电视机, 包括以下步骤: 釆集用户的第一语音信号;
当所述电视机无法识别所述第一语音信号时, 显示指令界面, 所述指令 界面包括 N个指令, 以便于所述用户选择第一指令, 所述第一指令为所述 N 个指令中任意一个指令;
根据预先建立的指令-语音组对应关系,将所述第一语音信号保存在所述 第一指令对应的第一语音组中, 所述第一语音组中包括触发所述第一指令的 所有语音信号。
2、根据权利要求 1所述的方法, 其中, 在所述釆集用户的第一语音信号 之前, 所述方法还包括以下步骤:
建立所述指令 -语音组对应关系, 所述指令 -语音组对应关系用于指示所 述 N个指令与 N个语音组的对应关系, 使得所述 N个指令中的每个指令对 应一个语音组。
3、根据权利要求 1或 2所述的方法, 其中,每个所述语音组中包括标准 语音信号, 所述标准语音信号是由标准普通话录制生成的。
4、根据权利要求 1-3中任一项所述的方法, 其中指令界面显示所述电视 机根据所述第一语音信号进行筛选得出的 M个用户可能需要的指令,所述 M 小于或等于 N。
5、 根据权利要求 1-4中任一项所述的方法, 其中, 在所述釆集用户的第 一语音信号之前, 所述方法还包括:
为所述 N个指令进行编号, 使得每个所述指令对应一个数字, 以便于所 述用户通过输入数字, 选择所述数字对应的指令。
6、 一种电视机, 所述电视机包括:
釆集单元, 所述釆集单元用于釆集用户的第一语音信号;
显示单元, 用于当所述电视机无法识别所述釆集单元釆集到的所述第一 语音信号时, 显示指令界面, 所述指令界面包括 N个指令, 以便于所述用户 选择第一指令, 所述第一指令为所述 N个指令中任意一个指令;
存储单元, 用于根据预先建立的指令 -语音组对应关系, 将所述釆集单元 釆集到的所述第一语音信号保存在所述第一指令对应的第一语音组中, 所述 第一语音组中包括触发第一指令的所有语音信号。
7、 根据权利要求 6所述的电视机, 其中, 所述电视机还包括: 建立单元, 用于建立所述指令 -语音组对应关系, 所述指令 -语音组对应 关系用于指示所述 N个指令与 N个语音组的对应关系, 使得所述 N个指令 中的每个指令对应一个语音组。
8、根据权利要求 6或 7所述的电视机, 其中,每个所述语音组中包括标 准语音信号, 所述标准语音信号是由标准普通话录制生成的。
9、根据权利要求 6-8中任一项所述的电视机,其中,所述电视机还包括: 编号单元, 用于为所述 N个指令进行编号, 使得每个所述指令对应一个 数字, 以便于所述用户通过输入数字, 选择所述数字对应的指令。
PCT/CN2014/085329 2014-03-14 2014-08-27 语音控制电视机的方法及其电视机 WO2015135300A1 (zh)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US14/436,304 US20160277698A1 (en) 2014-03-14 2014-08-27 Method for vocally controlling a television and television thereof

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201410095779.X 2014-03-14
CN201410095779.XA CN103945152A (zh) 2014-03-14 2014-03-14 一种语音控制电视机的方法及其电视机

Publications (1)

Publication Number Publication Date
WO2015135300A1 true WO2015135300A1 (zh) 2015-09-17

Family

ID=51192605

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2014/085329 WO2015135300A1 (zh) 2014-03-14 2014-08-27 语音控制电视机的方法及其电视机

Country Status (3)

Country Link
US (1) US20160277698A1 (zh)
CN (1) CN103945152A (zh)
WO (1) WO2015135300A1 (zh)

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103945152A (zh) * 2014-03-14 2014-07-23 京东方科技集团股份有限公司 一种语音控制电视机的方法及其电视机
CN104811820A (zh) * 2015-03-23 2015-07-29 四川长虹电器股份有限公司 一种电视设备上使用语音实现参数设置的控制方法
CN105096551A (zh) * 2015-07-29 2015-11-25 努比亚技术有限公司 一种实现虚拟遥控器的装置和方法
CN105653233B (zh) * 2015-12-30 2019-06-04 芜湖美智空调设备有限公司 关联语音信号与控制指令的方法及控制终端
CN109215645A (zh) * 2018-08-03 2019-01-15 北京奔流网络信息技术有限公司 一种语音信息交互方法以及智能电器

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101516005A (zh) * 2008-02-23 2009-08-26 华为技术有限公司 一种语音识别频道选择系统、方法及频道转换装置
CN102833634A (zh) * 2012-09-12 2012-12-19 康佳集团股份有限公司 一种电视机语音识别功能的实现方法及电视机
CN102842306A (zh) * 2012-08-31 2012-12-26 深圳Tcl新技术有限公司 语音控制方法及装置、语音响应方法及装置
CN103945152A (zh) * 2014-03-14 2014-07-23 京东方科技集团股份有限公司 一种语音控制电视机的方法及其电视机

Family Cites Families (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DE69326431T2 (de) * 1992-12-28 2000-02-03 Kabushiki Kaisha Toshiba, Kawasaki Spracherkennungs-Schnittstellensystem, das als Fenstersystem und Sprach-Postsystem verwendbar ist
US5774859A (en) * 1995-01-03 1998-06-30 Scientific-Atlanta, Inc. Information system having a speech interface
US7620549B2 (en) * 2005-08-10 2009-11-17 Voicebox Technologies, Inc. System and method of supporting adaptive misrecognition in conversational speech
JP2007142840A (ja) * 2005-11-18 2007-06-07 Canon Inc 情報処理装置及び情報処理方法
US8106742B2 (en) * 2006-08-04 2012-01-31 Tegic Communications, Inc. Remotely controlling one or more client devices detected over a wireless network using a mobile device
US8099289B2 (en) * 2008-02-13 2012-01-17 Sensory, Inc. Voice interface and search for electronic devices including bluetooth headsets and remote systems
US8958848B2 (en) * 2008-04-08 2015-02-17 Lg Electronics Inc. Mobile terminal and menu control method thereof
US8793136B2 (en) * 2012-02-17 2014-07-29 Lg Electronics Inc. Method and apparatus for smart voice recognition
US9106957B2 (en) * 2012-08-16 2015-08-11 Nuance Communications, Inc. Method and apparatus for searching data sources for entertainment systems
KR102209519B1 (ko) * 2014-01-27 2021-01-29 삼성전자주식회사 음성 제어를 수행하는 디스플레이 장치 및 그 음성 제어 방법
US9338493B2 (en) * 2014-06-30 2016-05-10 Apple Inc. Intelligent automated assistant for TV user interactions

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101516005A (zh) * 2008-02-23 2009-08-26 华为技术有限公司 一种语音识别频道选择系统、方法及频道转换装置
CN102842306A (zh) * 2012-08-31 2012-12-26 深圳Tcl新技术有限公司 语音控制方法及装置、语音响应方法及装置
CN102833634A (zh) * 2012-09-12 2012-12-19 康佳集团股份有限公司 一种电视机语音识别功能的实现方法及电视机
CN103945152A (zh) * 2014-03-14 2014-07-23 京东方科技集团股份有限公司 一种语音控制电视机的方法及其电视机

Also Published As

Publication number Publication date
US20160277698A1 (en) 2016-09-22
CN103945152A (zh) 2014-07-23

Similar Documents

Publication Publication Date Title
WO2015135300A1 (zh) 语音控制电视机的方法及其电视机
WO2017012511A1 (zh) 语音控制方法、装置及投影仪设备
KR101992676B1 (ko) 영상 인식을 이용하여 음성 인식을 하는 방법 및 장치
EP3754997B1 (en) Method for controlling electronic apparatus based on voice recognition and motion recognition, and electronic apparatus applying the same
JP5746111B2 (ja) 電子装置及びその制御方法
JP6123121B2 (ja) 音声制御システム及びプログラム
JP6588673B2 (ja) 仮想現実機器及び仮想現実機器の入力制御方法
CN102568478A (zh) 一种基于语音识别的视频播放控制方法和系统
JPWO2016103988A1 (ja) 情報処理装置、情報処理方法およびプログラム
CN106971723A (zh) 语音处理方法和装置、用于语音处理的装置
JP2019161638A (ja) スマートテレビの制御モード切替方法、設備及びコンピュータプログラム
WO2020079941A1 (ja) 情報処理装置及び情報処理方法、並びにコンピュータプログラム
JP2013041580A (ja) 電子装置及びその制御方法
KR20140089863A (ko) 디스플레이 장치, 및 이의 제어 방법, 그리고 음성 인식 시스템의 디스플레이 장치 제어 방법
JPWO2018100743A1 (ja) 制御装置および機器制御システム
CN101894553A (zh) 电视机语音控制的实现方法
CN105635778A (zh) 一种智能电视的语音交互方法及系统
WO2019218656A1 (zh) 一种智能电视、其截屏应用方法及存储介质
US20080140423A1 (en) Information processing apparatus and information processing method
CN108965968A (zh) 智能电视操作提示的展示方法、装置及计算机存储介质
CN104423992A (zh) 显示器语音辨识的启动方法
CN108289252A (zh) 一种切换系统语言的机顶盒及方法
WO2016103465A1 (ja) 音声認識システム
CN110782886A (zh) 语音处理的系统、方法、电视、设备和介质
CN204619374U (zh) 利用语音控制的遥控多轴飞行器玩具

Legal Events

Date Code Title Description
WWE Wipo information: entry into national phase

Ref document number: 14436304

Country of ref document: US

121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 14885464

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 07.02.2017)

122 Ep: pct application non-entry in european phase

Ref document number: 14885464

Country of ref document: EP

Kind code of ref document: A1