CN111754996A - Control method and device based on voice simulation remote controller and electronic equipment - Google Patents

Control method and device based on voice simulation remote controller and electronic equipment Download PDF

Info

Publication number
CN111754996A
CN111754996A CN201910251168.2A CN201910251168A CN111754996A CN 111754996 A CN111754996 A CN 111754996A CN 201910251168 A CN201910251168 A CN 201910251168A CN 111754996 A CN111754996 A CN 111754996A
Authority
CN
China
Prior art keywords
voice
remote controller
recognition
server
result
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201910251168.2A
Other languages
Chinese (zh)
Inventor
杨俊毅
官永鑫
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Alibaba Group Holding Ltd
Original Assignee
Alibaba Group Holding Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Alibaba Group Holding Ltd filed Critical Alibaba Group Holding Ltd
Priority to CN201910251168.2A priority Critical patent/CN111754996A/en
Publication of CN111754996A publication Critical patent/CN111754996A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/28Constructional details of speech recognition systems
    • G10L15/30Distributed recognition, e.g. in client-server systems, for mobile phones or network applications
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/41Structure of client; Structure of client peripherals
    • H04N21/422Input-only peripherals, i.e. input devices connected to specially adapted client devices, e.g. global positioning system [GPS]
    • H04N21/42203Input-only peripherals, i.e. input devices connected to specially adapted client devices, e.g. global positioning system [GPS] sound input device, e.g. microphone
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/41Structure of client; Structure of client peripherals
    • H04N21/422Input-only peripherals, i.e. input devices connected to specially adapted client devices, e.g. global positioning system [GPS]
    • H04N21/42204User interfaces specially adapted for controlling a client device through a remote control device; Remote control devices therefor
    • H04N21/42206User interfaces specially adapted for controlling a client device through a remote control device; Remote control devices therefor characterized by hardware details
    • H04N21/42212Specific keyboard arrangements
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/223Execution procedure of a spoken command

Abstract

A control method, a device and electronic equipment based on a voice simulation remote controller are disclosed. The control method based on the voice simulation remote controller comprises the following steps: receiving voice input corresponding to a key command of a remote controller; obtaining a voice recognition result for indicating the key command based on the voice input; generating a key event corresponding to the key command based on the voice recognition result; and sending the key event. In this way, global manipulation of speech can be achieved without requiring a particular adaptation of the speech.

Description

Control method and device based on voice simulation remote controller and electronic equipment
Technical Field
The present disclosure relates to the field of control technologies, and more particularly, to a control method based on a voice-simulated remote controller, a control apparatus based on a voice-simulated remote controller, and an electronic device.
Background
The voice input is a very convenient control mode for users, and the voice input is more in line with the daily habits of people and is more natural and efficient. Based on the voice input of the user, the operation of the device can be controlled by voice, which is more rapid and convenient compared with manual control.
Specifically, the control is performed by converting a voice signal into a corresponding text or command through a recognition and understanding process based on the voice of the user collected by a microphone or the like. The voice recognition technology mainly comprises three aspects of a feature extraction technology, a pattern matching criterion and a model training technology.
At present, as terminal devices such as internet televisions are more and more popular, the control requirements of the terminal devices are more and more emphasized, and it is expected that voice search can be performed and the televisions can be controlled globally through voice, so that both hands are thoroughly released.
Accordingly, it is desirable to provide improved voice-based control schemes.
Disclosure of Invention
The present application is proposed to solve the above-mentioned technical problems. Embodiments of the present application provide a control method and apparatus based on a voice simulation remote controller, and an electronic device, which are capable of obtaining a voice recognition result corresponding to a voice input and accordingly generating a key event corresponding to a key command of the remote controller, so that global voice control can be implemented without special adaptation of voice.
According to an aspect of the present application, there is provided a control method for simulating a remote controller based on voice, including: receiving voice input corresponding to a key command of a remote controller; obtaining a voice recognition result for indicating the key command based on the voice input; generating a key event corresponding to the key command based on the voice recognition result; and sending the key event.
In the above control method based on a voice simulation remote controller, the method further includes: and responding to the sent key event, and executing the operation corresponding to the key event.
In the above control method based on a voice simulation remote controller, the method further includes: and displaying the operation result of the operation.
In the above control method based on a voice simulation remote controller, obtaining a voice recognition result indicating the key command based on the voice input includes: sending the voice input to a first server; and receiving a voice recognition result indicating the key command from a first server; the server performs character recognition on the voice input to obtain a text result, and performs intention recognition on the text result to obtain the voice recognition result.
In the above control method based on a voice simulation remote controller, obtaining a voice recognition result indicating the key command based on the voice input includes: sending the voice input to a second server; receiving a text result obtained by performing character recognition on the voice input from the second server; sending the text result to a third server; and receiving the voice recognition result obtained by performing intention recognition on the text result from the third server.
In the above control method based on a voice simulation remote controller, obtaining a voice recognition result indicating the key command based on the voice input includes: performing intent recognition on the speech input to obtain the speech recognition result.
According to another aspect of the present application, there is provided a voice-based analog remote controller, including: an input unit for receiving a voice input corresponding to a key command of a remote controller; a recognition unit configured to obtain a voice recognition result indicating the key command based on the voice input; the generating unit is used for generating a key event corresponding to the key command based on the voice recognition result; and the sending unit is used for sending the key event.
In the above control device based on a voice-simulated remote controller, the control device further includes: and the execution unit is used for responding to the sent key event and executing the operation corresponding to the key event.
In the above control device based on a voice-simulated remote controller, the control device further includes: and the display unit is used for displaying the operation result of the operation.
In the above control device based on a voice-simulated remote controller, the recognition unit includes: a first transmitting subunit, configured to transmit the voice input to a first server; and a first receiving subunit, configured to receive, from the first server, a voice recognition result indicating the key command; the server performs character recognition on the voice input to obtain a text result, and performs intention recognition on the text result to obtain the voice recognition result.
In the above control device based on a voice-simulated remote controller, the recognition unit includes: a second transmitting subunit, configured to transmit the voice input to a second server; a second receiving subunit, configured to receive, from the second server, a text result obtained by performing character recognition on the voice input; a third sending subunit, configured to send the text result to a third server; and a third receiving subunit operable to receive, from the third server, the speech recognition result obtained by performing intent recognition on the text result.
In the above control device based on a voice-simulated remote controller, the recognition unit includes: and the intention recognition subunit is used for performing intention recognition on the voice input to obtain the voice recognition result.
According to still another aspect of the present application, there is provided an electronic apparatus including: a processor; and a memory in which computer program instructions are stored, which, when executed by the processor, cause the processor to perform the control method based on a voice simulation remote control as described above.
According to yet another aspect of the present application, there is provided a computer readable medium having stored thereon computer program instructions which, when executed by a processor, cause the processor to execute the control method of a voice simulation-based remote controller as described above.
According to the control method and device based on the voice simulation remote controller and the electronic equipment, the voice recognition result corresponding to the voice input is obtained, and the key event corresponding to the key command of the remote controller is correspondingly generated, so that the global voice control can be realized without special voice adaptation.
Drawings
The above and other objects, features and advantages of the present application will become more apparent by describing in more detail embodiments of the present application with reference to the attached drawings. The accompanying drawings are included to provide a further understanding of the embodiments of the application and are incorporated in and constitute a part of this specification, illustrate embodiments of the application and together with the description serve to explain the principles of the application. In the drawings, like reference numbers generally represent like parts or steps.
Fig. 1 illustrates a schematic diagram of a prior art voice-based control scheme.
Fig. 2 illustrates a flowchart of a control method of a voice-based analog remote controller according to an embodiment of the present application.
Fig. 3 illustrates a schematic diagram of a first example of a recognition process of a speech input according to an embodiment of the application.
Fig. 4 illustrates a schematic diagram of a second example of a recognition process of a speech input according to an embodiment of the application.
Fig. 5 illustrates a schematic diagram of a third example of a recognition process of a speech input according to an embodiment of the application.
Fig. 6 is a schematic diagram illustrating an application example of a control method of a voice-based analog remote controller according to an embodiment of the present application.
Fig. 7 is a schematic diagram illustrating an implementation procedure of a control method based on a voice simulation remote controller according to an embodiment of the present application.
Fig. 8 illustrates a block diagram of a control apparatus based on a voice simulation remote controller according to an embodiment of the present application.
FIG. 9 illustrates a block diagram of an electronic device in accordance with an embodiment of the present application.
Detailed Description
Hereinafter, example embodiments according to the present application will be described in detail with reference to the accompanying drawings. It should be understood that the described embodiments are only some embodiments of the present application and not all embodiments of the present application, and that the present application is not limited by the example embodiments described herein.
Summary of the application
As described above, in current voice-based control schemes, control commands are generated by recognizing voice. However, this necessitates a contract for speech and application or system.
Fig. 1 illustrates a schematic diagram of a prior art voice-based control scheme. As shown in fig. 1, the disadvantage of such a scheme of voice manipulation with application or system contract is that:
1) specific contract and adaptation must be made with the application and system;
2) applications and systems that are not pre-adapted cannot be manipulated;
3) even though agreement and adaptation are carried out, adaptation of each clickable position in the application is difficult to carry out, and the adaptation cost is high;
4) for the controlled application, some development work which is not related to the original service needs to be additionally performed.
Aiming at the technical problems, the basic concept of the application is that based on the reality that most of the existing equipment can carry out global control through a remote controller, the global control of voice is realized by simulating the key events of the remote controller through voice.
Specifically, the control method, device and electronic equipment based on the voice simulation remote controller provided by the application firstly receive a voice input corresponding to a key command of the remote controller, then obtain a voice recognition result used for indicating the key command based on the voice input, then generate a key event corresponding to the key command based on the voice recognition result, and finally send the key event.
Because the remote controller keys are necessarily adapted in the system or application controlled by the remote controller, and all operable contents in the application are necessarily adapted to the remote controller key events, the remote controller is simulated by voice to control, special agreement and adaptation in the system or application are not needed, and special customization of voice is also not needed, so that the global control which can be achieved by the remote controller can be realized.
It is noted that, in the control method, device and electronic device based on the voice simulation remote controller provided by the present application, the control object of the voice may be various terminal devices that can be controlled by using the remote controller, including an internet television, a set-top box, and the like.
Having described the general principles of the present application, various non-limiting embodiments of the present application will now be described with reference to the accompanying drawings.
Exemplary method
Fig. 2 illustrates a flowchart of a control method of a voice-based analog remote controller according to an embodiment of the present application.
As shown in fig. 2, a control method of a voice-based analog remote control according to an embodiment of the present application includes: s110, receiving voice input corresponding to a key command of the remote controller; s120, obtaining a voice recognition result used for indicating the key command based on the voice input; s130, generating a key event corresponding to the key command based on the voice recognition result; and S140, sending the key event.
In step S110, a voice input corresponding to a key command of the remote controller is received. That is, in the manipulation, the user makes a voice input corresponding to a key command of the remote controller, for example, the user speaks a key name of the remote controller, such as up, down, confirm, or the like. In particular, a microphone pickup may be received by a voice client of the system to capture audio of the voice input.
Here, as will be understood by those skilled in the art, the key command refers to a command corresponding to a button on a device such as a remote controller or a keyboard. For example, the main buttons of the remote controller are: up, down, left, right, acknowledge, return, standby, volume +, volume-, etc.
In step S120, a voice recognition result indicating the key command is obtained based on the voice input. Specifically, the speech recognition result may be obtained by various speech recognition models. Moreover, in the embodiment of the present application, the voice recognition result may be obtained locally or remotely, that is, the voice recognition may be performed locally in the system, or may be sent to a remote server for voice recognition, which will be described in further detail below.
In step S130, a key event corresponding to the key command is generated based on the voice recognition result. That is, if it is recognized that the key command corresponding to the voice input is, for example, "up" through the voice recognition result, a key event corresponding to a key of the remote controller "up" is generated.
Here, the key event refers to an event triggered in a system of the client device, for example, an operating system of an internet television, when a user presses a key on an input device such as a remote controller, and specifically, the key event may include a key-down event, a key-up event, and the like.
In step S140, the key event is transmitted. In particular, the key event may be transmitted to a control portion of a system or application. That is, in the embodiment of the present application, the key operation of the remote controller is not actually triggered, but the remote controller is simulated to trigger the key event through the recognition of the voice, so as to control the system or the application.
Therefore, in the control method based on the voice simulation remote controller according to the embodiment of the application, by obtaining the voice recognition result corresponding to the voice input and accordingly generating and sending the key event corresponding to the key command of the remote controller, the key event can be triggered by the voice simulation remote controller, so that the global operation and control with the same effect as that of using the remote controller are realized.
In addition, in the embodiment of the present application, by sending the key event, a system or an application may be controlled to perform a corresponding operation in response to the key event. For example, in response to a key event "volume +", the system or application may respond to the process when it senses the key event, i.e., the system or application may increase the volume.
That is, in the control method based on the voice simulation remote controller according to the embodiment of the present application, further comprising: and responding to the sent key event, and executing the operation corresponding to the key event.
In addition, for a terminal device such as an internet television, an operation result based on a key event can be displayed to a user through a display unit, for example, for a key event "up", a check box displayed on a screen can be moved up, so that the user can clearly understand a manipulation result of a spoken voice input, enhancing convenience of use for the user.
Of course, if the terminal device does not have a display unit, the control result may be fed back to the user by other methods, for example, a voice prompt, a vibration prompt, and the like.
Therefore, in the control method based on the voice simulation remote controller according to the embodiment of the application, the method further includes: and displaying the operation result of the operation.
As mentioned above, speech recognition of the speech input may be accomplished in a variety of ways, several example ways of which are further described below.
Fig. 3 illustrates a schematic diagram of a first example of a recognition process of a speech input according to an embodiment of the application.
As shown in fig. 3, speech recognition is performed at a server side separate from the local system or application. Specifically, after receiving a voice input of a user, for example, "up", through a voice client, the voice client sends an audio of the voice of the user to a server, and the server recognizes the voice input as a word, that is, "up", through a voice recognition technique, for example, by using a voice recognition model, and then performs intent recognition on the word through a technique, for example, natural voice processing, so as to obtain a voice recognition result indicating that a key command is "up", and returns the voice recognition result to the voice client.
Therefore, in the control method based on the voice simulation remote controller according to the embodiment of the present application, obtaining a voice recognition result indicating the key command based on the voice input includes: sending the voice input to a first server; and receiving a voice recognition result indicating the key command from a first server; the server performs character recognition on the voice input to obtain a text result, and performs intention recognition on the text result to obtain the voice recognition result.
Fig. 4 illustrates a schematic diagram of a second example of a recognition process of a speech input according to an embodiment of the application.
As shown in fig. 4, unlike the example shown in fig. 3, speech-to-text recognition and text-to-intent recognition are performed at two server sides separate from the local system or application, respectively. Specifically, after receiving a user's voice input, e.g., "up", through a voice client, the voice client sends the audio of the user's voice to a server side, which recognizes the voice input as text, i.e., "up", through voice recognition techniques, e.g., by using a voice recognition model, and then sends back to the local voice client. Then, the local voice client sends the recognized words, such as "up", to another server, and the other server performs intent recognition on the words through a technology such as natural voice processing, so as to obtain a voice recognition result indicating that the key command is "up", and returns the voice recognition result to the voice client.
That is, in the control method based on a voice simulation remote controller according to an embodiment of the present application, obtaining a voice recognition result indicating the key command based on the voice input includes: sending the voice input to a second server; receiving a text result obtained by performing character recognition on the voice input from the second server; sending the text result to a third server; and receiving the voice recognition result obtained by performing intention recognition on the text result from the third server.
It is noted that in the embodiment of the present application, the second server and the third server may also be the same server, that is, even when the speech-to-text recognition and text-to-intention recognition processes are performed on the same server, the text as an intermediate product may be sent back to the speech client. For example, the voice client may directly match the recognized text with the key events based on a local template to generate key events, or the voice client may send the text to the server for recognition after a period of time.
Fig. 5 illustrates a schematic diagram of a third example of a recognition process of a speech input according to an embodiment of the application.
As shown in fig. 5, in this example, voice recognition is directly performed at the voice client, so that a user intention corresponding to the voice input is directly recognized without being converted into text, thereby obtaining a voice recognition result. Of course, those skilled in the art will appreciate that although not shown in FIG. 5, direct speech-to-intent recognition may also be performed on the server side.
Therefore, in the control method based on the voice simulation remote controller according to the embodiment of the present application, obtaining a voice recognition result indicating the key command based on the voice input includes: performing intent recognition on the speech input to obtain the speech recognition result.
It can be seen that, in the embodiment of the present application, there may be a plurality of calling manners of voice recognition, and the character recognition and the intention recognition may also be implemented by different calling flows. For example, after a server recognizes a voice as a character, the server may call another server to recognize an intention corresponding to the character. Therefore, the embodiments of the present application are not intended to be limited to any way, whether the server side calls the server side or the client side calls the server side.
Therefore, according to the control method based on the voice simulation remote controller, because the keys of the remote controller are necessarily adapted to the system or the application, and all operable contents in the system or the application are necessarily adapted to the key events of the remote controller, no special agreement and adaptation are needed, no special customization is needed, and the control which can be realized by the remote controller can be realized naturally, efficiently, quickly and conveniently in a voice mode.
Application example
Next, an application example of the control method based on the voice simulation remote controller according to the embodiment of the present application will be described with an example of the control method based on the voice simulation remote controller according to the embodiment of the present application being applied to the internet.
Fig. 6 is a schematic diagram illustrating an application example of a control method of a voice-based analog remote controller according to an embodiment of the present application.
As shown in fig. 6, when the user U wants to control the internet TV, a key name is spoken, such as up, down, confirm, etc. Then, the internet TV receives the microphone pickup and transmits the audio to the server S.
Next, the server S performs character recognition on the received audio, and then performs content analysis on the recognized characters to recognize the user' S intention. Thereafter, the server returns the identified user's intention to the internet television TV.
The internet television TV processes the recognized user's intention to generate a key event, e.g., a "determination" event, and performs an operation, e.g., a "determination" operation of the user, in response to the transmitted key event. Then, the operation result, for example, the content displayed by the "determination" operation by the user is fed back to the user, as shown in fig. 7. Fig. 7 is a schematic diagram illustrating an implementation procedure of a control method based on a voice simulation remote controller according to an embodiment of the present application.
Exemplary devices
Fig. 8 illustrates a block diagram of a control apparatus based on a voice simulation remote controller according to an embodiment of the present application.
As shown in fig. 8, the control apparatus 200 based on the voice simulation remote controller according to the embodiment of the present application includes: an input unit 210 for receiving a voice input corresponding to a key command of a remote controller; a recognition unit 220 for obtaining a voice recognition result indicating the key command based on the voice input received by the input unit 210; a generating unit 230, configured to generate a key event corresponding to the key command based on the voice recognition result obtained by the recognizing unit 220; and a transmitting unit 240 for transmitting the key event generated by the generating unit 230.
In one example, in the above control apparatus 200 based on a voice simulation remote controller, further comprising: an executing unit, configured to execute an operation corresponding to the key event in response to the key event sent by the sending unit 240.
In one example, in the above control apparatus 200 based on a voice simulation remote controller, further comprising: a display unit for displaying an operation result of the operation performed by the execution unit.
In one example, in the above control apparatus 200 based on a voice simulation remote controller, the recognition unit 220 includes: a first transmitting sub-unit, configured to transmit the voice input received by the input unit 210 to a first server; and a first receiving subunit, configured to receive, from the first server, a voice recognition result indicating the key command; the server performs character recognition on the voice input sent by the first sending subunit to obtain a text result, and performs intention recognition on the text result to obtain the voice recognition result.
In one example, in the above control apparatus 200 based on a voice simulation remote controller, the recognition unit 220 includes: a second transmitting subunit, configured to transmit the voice input received by the input unit 210 to a second server; a second receiving subunit, configured to receive, from the second server, a text result obtained by performing character recognition on the voice input sent by the second sending subunit; a third sending subunit, configured to send the text result received by the second receiving subunit to a third server; and a third receiving subunit operable to receive, from the third server, the speech recognition result obtained by performing intention recognition on the text result transmitted by the third transmitting subunit.
In one example, in the above control apparatus 200 based on a voice simulation remote controller, the recognition unit 220 includes: an intention recognition subunit, configured to perform intention recognition on the voice input received by the input unit 210 to obtain the voice recognition result.
Here, it can be understood by those skilled in the art that the detailed functions and operations of the respective units and modules in the above-described voice analog remote controller-based control apparatus 200 have been described in detail in the above description of the voice analog remote controller-based control method with reference to fig. 2 to 5, and thus, a repetitive description thereof will be omitted.
As described above, the control apparatus 200 based on the voice analog remote controller according to the embodiment of the present application may be implemented in various terminal devices, such as an internet tv, a set-top box, and the like. In one example, the control apparatus 200 based on the voice simulation remote controller according to the embodiment of the present application may be integrated into the terminal device as one software module and/or hardware module. For example, the control device 200 based on the voice simulation remote controller may be a software module in an operating system of the terminal device, or may be an application program developed for the terminal device; of course, the control device 200 based on the voice analog remote controller can also be one of a plurality of hardware modules of the terminal equipment.
Alternatively, in another example, the voice analog remote controller-based control apparatus 200 and the terminal device may be separate devices, and the voice analog remote controller-based control apparatus 200 may be connected to the terminal device through a wired and/or wireless network and transmit the interactive information according to an agreed data format.
Exemplary electronic device
Next, an electronic apparatus according to an embodiment of the present application is described with reference to fig. 9.
FIG. 9 illustrates a block diagram of an electronic device in accordance with an embodiment of the present application.
As shown in fig. 9, the electronic device 10 includes one or more processors 11 and memory 12.
The processor 13 may be a Central Processing Unit (CPU) or other form of processing unit having data processing capabilities and/or instruction execution capabilities, and may control other components in the electronic device 10 to perform desired functions.
Memory 12 may include one or more computer program products that may include various forms of computer-readable storage media, such as volatile memory and/or non-volatile memory. The volatile memory may include, for example, Random Access Memory (RAM), cache memory (cache), and/or the like. The non-volatile memory may include, for example, Read Only Memory (ROM), hard disk, flash memory, etc. One or more computer program instructions may be stored on the computer-readable storage medium and executed by the processor 11 to implement the voice-analog-remote-controller-based control methods of the various embodiments of the present application described above and/or other desired functions. Various contents such as voice audio, text contents, intention recognition results, etc. may also be stored in the computer-readable storage medium.
In one example, the electronic device 10 may further include: an input device 13 and an output device 14, which are interconnected by a bus system and/or other form of connection mechanism (not shown).
The input device 13 may include, for example, a keyboard, a mouse, and the like.
The output device 14 may output various information including a manipulation result based on a key command, etc. to the outside. The output devices 14 may include, for example, a display, speakers, a printer, and a communication network and its connected remote output devices, among others.
Of course, for simplicity, only some of the components of the electronic device 10 relevant to the present application are shown in fig. 9, and components such as buses, input/output interfaces, and the like are omitted. In addition, the electronic device 10 may include any other suitable components depending on the particular application.
Exemplary computer program product and computer-readable storage Medium
In addition to the above-described methods and apparatus, embodiments of the present application may also be a computer program product comprising computer program instructions that, when executed by a processor, cause the processor to perform the steps in a voice simulation remote control-based control method according to various embodiments of the present application described in the "exemplary methods" section of this specification, supra.
The computer program product may be written with program code for performing the operations of embodiments of the present application in any combination of one or more programming languages, including an object oriented programming language such as Java, C + + or the like and conventional procedural programming languages, such as the "C" programming language or similar programming languages. The program code may execute entirely on the first user computing device, partly on the first user device, as a stand-alone software package, partly on the first user computing device and partly on a remote computing device, or entirely on the remote computing device or server.
Furthermore, embodiments of the present application may also be a computer-readable storage medium having stored thereon computer program instructions that, when executed by a processor, cause the processor to perform the steps in the voice simulation remote control-based control method according to various embodiments of the present application described in the "exemplary methods" section above in this specification.
The computer-readable storage medium may take any combination of one or more readable media. The readable medium may be a readable signal medium or a readable storage medium. A readable storage medium may include, for example, but not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or a combination of any of the foregoing. More specific examples (a non-exhaustive list) of the readable storage medium include: an electrical connection having one or more wires, a portable disk, a hard disk, a Random Access Memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or flash memory), an optical fiber, a portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the foregoing.
The foregoing describes the general principles of the present application in conjunction with specific embodiments, however, it is noted that the advantages, effects, etc. mentioned in the present application are merely examples and are not limiting, and they should not be considered essential to the various embodiments of the present application. Furthermore, the foregoing disclosure of specific details is for the purpose of illustration and description and is not intended to be limiting, since the foregoing disclosure is not intended to be exhaustive or to limit the disclosure to the precise details disclosed.
The block diagrams of devices, apparatuses, systems referred to in this application are only given as illustrative examples and are not intended to require or imply that the connections, arrangements, configurations, etc. must be made in the manner shown in the block diagrams. These devices, apparatuses, devices, systems may be connected, arranged, configured in any manner, as will be appreciated by those skilled in the art. Words such as "including," "comprising," "having," and the like are open-ended words that mean "including, but not limited to," and are used interchangeably therewith. The words "or" and "as used herein mean, and are used interchangeably with, the word" and/or, "unless the context clearly dictates otherwise. The word "such as" is used herein to mean, and is used interchangeably with, the phrase "such as but not limited to".
It should also be noted that in the devices, apparatuses, and methods of the present application, the components or steps may be decomposed and/or recombined. These decompositions and/or recombinations are to be considered as equivalents of the present application.
The previous description of the disclosed aspects is provided to enable any person skilled in the art to make or use the present application. Various modifications to these aspects will be readily apparent to those skilled in the art, and the generic principles defined herein may be applied to other aspects without departing from the scope of the application. Thus, the present application is not intended to be limited to the aspects shown herein but is to be accorded the widest scope consistent with the principles and novel features disclosed herein.
The foregoing description has been presented for purposes of illustration and description. Furthermore, the description is not intended to limit embodiments of the application to the form disclosed herein. While a number of example aspects and embodiments have been discussed above, those of skill in the art will recognize certain variations, modifications, alterations, additions and sub-combinations thereof.

Claims (13)

1. A control method based on a voice simulation remote controller is characterized by comprising the following steps:
receiving voice input corresponding to a key command of a remote controller;
obtaining a voice recognition result for indicating the key command based on the voice input;
generating a key event corresponding to the key command based on the voice recognition result; and
and sending the key event.
2. The control method based on the voice simulation remote controller according to claim 1, further comprising:
and responding to the sent key event, and executing the operation corresponding to the key event.
3. The control method based on the voice simulation remote controller according to claim 2, further comprising:
and displaying the operation result of the operation.
4. The control method based on the voice simulation remote controller of claim 1, wherein obtaining a voice recognition result indicating the key command based on the voice input comprises:
sending the voice input to a first server; and
receiving a voice recognition result indicating the key command from a first server;
the server performs character recognition on the voice input to obtain a text result, and performs intention recognition on the text result to obtain the voice recognition result.
5. The control method based on the voice simulation remote controller of claim 1, wherein obtaining a voice recognition result indicating the key command based on the voice input comprises:
sending the voice input to a second server;
receiving a text result obtained by performing character recognition on the voice input from the second server;
sending the text result to a third server; and
receiving the speech recognition result obtained by performing intention recognition on the text result from the third server.
6. The control method based on the voice simulation remote controller of claim 1, wherein obtaining a voice recognition result indicating the key command based on the voice input comprises:
performing intent recognition on the speech input to obtain the speech recognition result.
7. A control device based on a voice simulation remote controller is characterized by comprising:
an input unit for receiving a voice input corresponding to a key command of a remote controller;
a recognition unit configured to obtain a voice recognition result indicating the key command based on the voice input;
the generating unit is used for generating a key event corresponding to the key command based on the voice recognition result; and
and the sending unit is used for sending the key event.
8. The voice-based analog remote control of claim 7, further comprising:
and the execution unit is used for responding to the sent key event and executing the operation corresponding to the key event.
9. The voice-based analog remote control of claim 8, further comprising:
and the display unit is used for displaying the operation result of the operation.
10. The control method based on the voice simulation remote controller according to claim 7, wherein the recognition unit comprises:
a first transmitting subunit, configured to transmit the voice input to a first server; and
a first receiving subunit, configured to receive, from a first server, a voice recognition result indicating the key command;
the server performs character recognition on the voice input to obtain a text result, and performs intention recognition on the text result to obtain the voice recognition result.
11. The control apparatus based on voice-simulated remote controller according to claim 7, wherein the recognition unit comprises:
a second transmitting subunit, configured to transmit the voice input to a second server;
a second receiving subunit, configured to receive, from the second server, a text result obtained by performing character recognition on the voice input;
a third sending subunit, configured to send the text result to a third server; and
a third receiving subunit configured to receive, from the third server, the speech recognition result obtained by performing intent recognition on the text result.
12. The control apparatus based on voice-simulated remote controller according to claim 7, wherein the recognition unit comprises:
and the intention recognition subunit is used for performing intention recognition on the voice input to obtain the voice recognition result.
13. An electronic device, comprising:
a processor; and
memory having stored therein computer program instructions which, when executed by the processor, cause the processor to perform the method of controlling a voice-based analog remote control according to any one of claims 1 to 6.
CN201910251168.2A 2019-03-29 2019-03-29 Control method and device based on voice simulation remote controller and electronic equipment Pending CN111754996A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910251168.2A CN111754996A (en) 2019-03-29 2019-03-29 Control method and device based on voice simulation remote controller and electronic equipment

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910251168.2A CN111754996A (en) 2019-03-29 2019-03-29 Control method and device based on voice simulation remote controller and electronic equipment

Publications (1)

Publication Number Publication Date
CN111754996A true CN111754996A (en) 2020-10-09

Family

ID=72671744

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910251168.2A Pending CN111754996A (en) 2019-03-29 2019-03-29 Control method and device based on voice simulation remote controller and electronic equipment

Country Status (1)

Country Link
CN (1) CN111754996A (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6629077B1 (en) * 2000-11-22 2003-09-30 Universal Electronics Inc. Universal remote control adapted to receive voice input
CN103456306A (en) * 2012-05-29 2013-12-18 三星电子株式会社 Method and apparatus for executing voice command in electronic device
CN103714816A (en) * 2012-09-28 2014-04-09 三星电子株式会社 Electronic appratus, server and control method thereof
CN108172223A (en) * 2017-12-14 2018-06-15 深圳市欧瑞博科技有限公司 Voice instruction recognition method, device and server and computer readable storage medium

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6629077B1 (en) * 2000-11-22 2003-09-30 Universal Electronics Inc. Universal remote control adapted to receive voice input
CN103456306A (en) * 2012-05-29 2013-12-18 三星电子株式会社 Method and apparatus for executing voice command in electronic device
CN103714816A (en) * 2012-09-28 2014-04-09 三星电子株式会社 Electronic appratus, server and control method thereof
CN108172223A (en) * 2017-12-14 2018-06-15 深圳市欧瑞博科技有限公司 Voice instruction recognition method, device and server and computer readable storage medium

Similar Documents

Publication Publication Date Title
JP6751122B2 (en) Page control method and apparatus
CN109618202B (en) Method for controlling peripheral equipment, television and readable storage medium
KR20190088945A (en) Electronic device, server and control methods thereof
JP7203865B2 (en) Multimodal interaction between users, automated assistants, and other computing services
JP7111682B2 (en) Speech command matching during testing of a speech-assisted application prototype for languages using non-phonetic writing systems
WO2020029500A1 (en) Voice command customization method, device, apparatus, and computer storage medium
US20180182399A1 (en) Control method for control device, control method for apparatus control system, and control device
CN111627436B (en) Voice control method and device
JP2011065467A (en) Conference relay device and computer program
CN111144138A (en) Simultaneous interpretation method and device and storage medium
JP2008145769A (en) Interaction scenario creation system, its method, and program
JP6624476B2 (en) Translation device and translation system
US20080109227A1 (en) Voice Control System and Method for Controlling Computers
JP6832503B2 (en) Information presentation method, information presentation program and information presentation system
CN111538812A (en) Method, equipment and system for disambiguating natural language content title
US10438582B1 (en) Associating identifiers with audio signals
CN111754996A (en) Control method and device based on voice simulation remote controller and electronic equipment
CN110706704A (en) Method, device and computer equipment for generating voice interaction prototype
WO2003079188A1 (en) Method for operating software object using natural language and program for the same
KR20220140304A (en) Video learning systems for recognize learners' voice commands
CN113852849A (en) Intelligent hotel room management method
US9613311B2 (en) Receiving voice/speech, replacing elements including characters, and determining additional elements by pronouncing a first element
Schnelle-Walka et al. Multimodal dialogmanagement in a smart home context with SCXML
CN106653026A (en) Intelligent robot home theater system based on voice control and control method of intelligent robot home theater system
CN112040326A (en) Bullet screen control method and system, television and storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination