WO2022135259A1 - Speech input method and apparatus, and electronic device - Google Patents

Speech input method and apparatus, and electronic device

Info

Publication number
WO2022135259A1
Authority
WO
WIPO (PCT)
Prior art keywords
voice message
input
voice
content
target
Application number
PCT/CN2021/138688
Other languages
French (fr)
Chinese (zh)
Inventor
张孝东
Original Assignee
维沃移动通信有限公司
Application filed by 维沃移动通信有限公司
Publication of WO2022135259A1

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00 Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01 Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/048 Interaction techniques based on graphical user interfaces [GUI]
    • G06F3/0484 Interaction techniques based on graphical user interfaces [GUI] for the control of specific functions or operations, e.g. selecting or manipulating an object, an image or a displayed text element, setting a parameter value or selecting a range

Definitions

  • the present application belongs to the field of communication technologies, and in particular relates to a voice input method, device and electronic device.
  • the purpose of the embodiments of the present application is to provide a voice input method, device, and electronic device, which can solve the problem of low efficiency in editing a message by the electronic device.
  • an embodiment of the present application provides a voice input method, the method including: receiving a first voice message input by a user and displaying first voice content corresponding to the first voice message; receiving a first input by the user for first content, the first content being content corresponding to target content in the first voice content; and, in response to the first input, replacing or deleting the target voice message corresponding to the target content in the first voice message.
  • an embodiment of the present application provides a voice input device, which includes: a receiving module, a display module, and a processing module.
  • the receiving module is used for receiving the first voice message input by the user.
  • the display module is used for displaying the first voice content corresponding to the first voice message.
  • the receiving module is further configured to receive a user's first input of the first content, where the first content is content corresponding to the target content in the first voice content.
  • the processing module is configured to replace or delete the target voice message corresponding to the target content in the first voice message in response to the first input received by the receiving module.
  • an embodiment of the present application provides a readable storage medium, where a program or an instruction is stored on the readable storage medium, and when the program or instruction is executed by a processor, the steps of the method according to the first aspect are implemented.
  • an embodiment of the present application provides a chip, the chip includes a processor and a communication interface, the communication interface is coupled to the processor, and the processor is configured to run a program or an instruction to implement the method according to the first aspect.
  • the user can input the first voice message to trigger the electronic device to display the first voice content corresponding to the first voice message, so that the user can perform an input on the first content corresponding to the target content in the first voice content, and the electronic device can then replace or delete the target voice message corresponding to the target content in the first voice message.
  • because the electronic device can display the first voice content corresponding to the first voice message in real time while the user inputs the first voice message to be sent, when the target content in the first voice content is incorrect the user can perform an input on the first content corresponding to the target content, so that the electronic device can replace or delete the erroneous target voice message in the first voice message.
  • FIG. 2 is the first schematic diagram of an example of an interface of a mobile phone provided by an embodiment of the present application.
  • FIG. 3 is the second schematic diagram of a voice input method provided by an embodiment of the present application.
  • FIG. 4 is the second schematic diagram of an example of an interface of a mobile phone provided by an embodiment of the present application.
  • FIG. 5 is the third schematic diagram of a voice input method provided by an embodiment of the present application.
  • FIG. 6 is a schematic structural diagram of a voice input device provided by an embodiment of the present application.
  • FIG. 7 is a schematic diagram of a hardware structure of an electronic device provided by an embodiment of the present application.
  • the terms "first", "second" and the like in the description and claims of the present application are used to distinguish similar objects, and are not used to describe a specific order or sequence. It is to be understood that the data so used are interchangeable under appropriate circumstances, so that the embodiments of the present application can be practiced in sequences other than those illustrated or described herein. The objects distinguished by "first", "second", etc. are usually of one type, and the number of objects is not limited; for example, the first object may be one or more than one.
  • "and/or" in the description and claims indicates at least one of the connected objects, and the character "/" generally indicates that the associated objects are in an "or" relationship.
  • when the user sends a voice message (for example: "I'm on Renmin East Road now, you can come and find me, let's go to eat together") to the target contact through a chat application in the electronic device, the user can input the voice message to be sent on the conversation interface corresponding to the target contact. If the user realizes that the voice message to be sent contains an error (for example, "Renmin East Road"), the user can trigger the electronic device, after it obtains the voice message input by the user, to place that voice message in a to-be-edited state.
  • FIG. 1 shows a flowchart of a voice input method provided by an embodiment of the present application, and the method can be applied to an electronic device.
  • the voice input method provided by this embodiment of the present application may include the following steps 201 to 203.
  • Step 201: the electronic device receives the first voice message input by the user, and displays the first voice content corresponding to the first voice message.
  • when the user sends a voice message to a contact, the user can input the first voice message to be sent, so that the electronic device can acquire and display the first voice content (i.e., text content) corresponding to the first voice message input by the user.
  • the user can perform an input on the first content, so that the electronic device can replace or delete the target voice message corresponding to the target content in the first voice message.
  • when the electronic device displays the conversation page corresponding to the target contact in the chat application, the user can perform an input on the control for sending voice displayed in the conversation page, and can then speak to input the first voice message.
  • the user can perform a long-press input or a click input on the control for sending voice to trigger the electronic device to enter a voice recording state (that is, to perform a recording function); the above-mentioned first voice message is the voice input made while the user long-presses the control for sending voice, or the voice input made after the user taps that control.
  • the electronic device may perform a recording function to record the user's voice input (i.e., the first voice message), obtain the first voice content corresponding to the first voice message, and display the first voice content.
  • the long-press input and the user's voice input are simultaneous inputs; that is, when the user performs voice input while performing the long-press input, the electronic device can acquire the first voice content corresponding to the user's first voice message; otherwise, the electronic device cannot obtain the first voice content corresponding to the user's first voice message.
  • the electronic device may convert the acquired voice content corresponding to the first voice message into text content, and display the first voice content (i.e., text content) corresponding to the first voice message at a preset position on the screen.
  • the electronic device may directly display the complete first voice content corresponding to the first voice message at a preset position on the screen; or the electronic device may display the first voice content gradually at a preset position on the screen while the user is inputting the first voice message (that is, display the corresponding text according to the progress of the user's voice input), i.e., convert the voice content of the first voice message into text content in real time and display it.
  • the electronic device may acquire the first voice content corresponding to the first voice message, and control the first voice message to be in a to-be-edited state.
  • Step 203: in response to the first input, the electronic device replaces or deletes the target voice message corresponding to the target content in the first voice message.
  • the electronic device may delete the voice content corresponding to the target content from the first voice message, and add the voice content corresponding to the first content at the position where the voice content corresponding to the target content was located in the first voice message, so as to combine them into a new voice message.
  • the voice input method provided by the embodiment of the present application may further include the following steps 301 and 302, and the above step 202 can be specifically implemented by the following step 202a, and the above step 203 can be specifically implemented by the following step 203a.
  • Step 301: the electronic device receives a second input from the user.
  • the above-mentioned second input is the user's selection input of the target content.
  • the above-mentioned second input may be any one of the following: a user's click input on the target content, a user's long-press input on the target content, a user's double-click input on the target content, and the like.
  • Step 302: in response to the second input, the electronic device determines the target voice message according to the target content.
  • the electronic device may mark and display the target content, and determine the target voice message corresponding to the target content from the first voice message according to the position of the target content in the first voice content.
  • the user can perform an input on the text "Renmin East Road" in the text content displayed in the text display area 11 ("I'm on Renmin East Road now, you can come and find me, let's go to eat together") to trigger the mobile phone to highlight the text "Renmin East Road", so that, according to the position of "Renmin East Road" in that text content, the voice part corresponding to "Renmin East Road" is determined in the voice message corresponding to the user's voice input.
  • Step 202a: the electronic device receives the second voice message input by the user.
  • the above-mentioned second voice message is a voice message corresponding to the first content.
  • the voice content corresponding to the user's voice input may further include other content.
  • the electronic device may obtain the voice content of the first content by performing semantic analysis on the voice content corresponding to the user's voice input.
  • the electronic device processes the first voice message according to the first content obtained by semantic analysis, so as to replace or delete the target voice content.
  • the user can perform a selection input on the target content in the first voice content, so that the electronic device can determine the target voice message in the first voice message according to that input and, according to the second voice message corresponding to the first content input by the user, replace the target voice message in the first voice message with the second voice message, or delete the target voice message in the first voice message, to obtain the third voice message; in this way the user can accurately determine the content that needs to be replaced or deleted in the first voice message.
  • Step 401: the electronic device displays the target control.
  • when the user starts an input on the voice input control (that is, when the electronic device displays the voice input interface), the electronic device may display the target control at a preset position on the screen, so that the user can perform an input on the target control to trigger the electronic device to enter a voice recording state.
  • when the electronic device displays the target control, the electronic device does not need to display the first voice content corresponding to the first voice message on the screen.
  • the above-mentioned third input is the user's input on the target control.
  • the user may long-press the voice input control and then slide to the position of the target control, thereby triggering the electronic device to enter a voice recording state.
  • when the electronic device is in a voice recording state, the user may input the first voice message so that the electronic device records the voice content input by the user.
  • the voice input method provided in the embodiment of the present application may further include step 404, described below.
  • Step 404: in response to the first input, the electronic device performs semantic analysis processing on the second voice message to determine the first content and the target content.
  • the electronic device can display a target control for editing the first voice message, so that the user can perform an input on the target control to put the electronic device in a voice recording state; the electronic device then performs semantic analysis processing on the user's voice input to accurately determine the first content and the target content, so that the user can flexibly control the electronic device to perform the corresponding operations through voice input.
  • the specific step of the above step 203 may be replaced by "the electronic device replaces or deletes the target voice message corresponding to the target content in the first voice message".
  • the voice input method provided by the embodiment of the present application may further include the following steps 501 to 503.
  • Step 501: the electronic device receives a fourth input from the user.
  • after the electronic device replaces the target content in the first voice message with the first content to obtain the third voice message, the user can perform voice input again to input the fourth voice message.
  • the above-mentioned first voice message and fourth voice message can be understood as one complete voice message (that is, the fifth voice message described below). While inputting the complete voice message, when the input of the first part (i.e., the first voice message) is completed, the user may pause the voice input to replace the wrong content (i.e., the target content) in the first voice message, and continue to input the second part (i.e., the fourth voice message) after the replacement is completed.
  • Step 502: in response to the fourth input, the electronic device performs combined processing on the fourth voice message and the third voice message to obtain a fifth voice message.
  • the electronic device may add the fourth voice message to the third voice message to obtain a complete voice message (ie, the fifth voice message).
  • the electronic device may perform voice splicing processing on the fourth voice message and the third voice message, so as to combine the two voice messages to obtain one voice message.
  • the voice message sent by the electronic device is the fifth voice message, which includes the third voice message.
  • if the user does not perform the fourth input, the voice message sent by the electronic device is the third voice message.
  • FIG. 6 shows a possible schematic structural diagram of the voice input device involved in the embodiment of the present application.
  • the voice input device 70 may include: a receiving module 71, a display module 72 and a processing module 73.
  • the receiving module 71 is configured to receive the first voice message input by the user.
  • the display module 72 is configured to display the first voice content corresponding to the first voice message.
  • the receiving module 71 is further configured to receive a user's first input of the first content, where the first content is content corresponding to the target content in the first voice content.
  • the processing module 73 is configured to replace or delete the target voice message corresponding to the target content in the first voice message in response to the first input received by the receiving module 71.
  • the voice input apparatus 70 may further include: a determination module.
  • the receiving module 71 is further configured to receive the user's second input after the display module 72 displays the first voice content corresponding to the first voice message, where the second input is the user's selection input on the target content.
  • the determining module is configured to determine the target voice message according to the target content in response to the second input received by the receiving module 71.
  • the receiving module 71 is specifically configured to receive a second voice message input by a user, where the second voice message is a voice message corresponding to the first content.
  • the processing module 73 is specifically configured to replace the target voice message in the first voice message with the second voice message according to the second voice message, or delete the target voice message in the first voice message; and obtain a third voice message.
  • the non-mobile electronic device may be a server, a network attached storage (NAS) device, a personal computer (PC), a television (TV), a teller machine, a self-service machine, or the like, which is not specifically limited in the embodiments of the present application.
  • the processor 110 is configured to, in response to the first input, replace or delete the target voice message corresponding to the target content in the first voice message.
  • the user input unit 107 is further configured to receive a second input from the user, where the second input is the user's selection input on the target content in the first text content.
  • the processor 110 is further configured to perform semantic analysis processing on the second voice message to determine the first content and the target content.
  • the processor 110 is further configured to, in response to the fourth input, perform combined processing on the fourth voice message and the third voice message to obtain a fifth voice message.
  • the network module 102 is configured to send a fifth voice message including a third voice message.
  • the processor is the processor in the electronic device described in the foregoing embodiments.
  • the readable storage medium includes a computer-readable storage medium, such as a computer read-only memory (Read-Only Memory, ROM), a random access memory (Random Access Memory, RAM), a magnetic disk or an optical disk, and the like.
  • the chip mentioned in the embodiments of the present application may also be referred to as a system-level chip, a system chip, a chip system, or a system-on-chip.
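
The replace-or-delete step (step 203), the lookup of the target voice message from the selected text (step 302) and the later splicing of a follow-up message (step 502) can be pictured with a small sketch. It is illustrative only and is not taken from the application: it assumes the recognizer reports, for each recognized word, the sample range that word occupies in the recording, and the names `Word`, `locate_span`, `edit_voice` and `append_voice` are hypothetical. Audio is modelled as a plain list of integers standing in for PCM samples.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass
class Word:
    """One recognized word and the sample range it occupies (assumed recognizer output)."""
    text: str
    start: int  # index of the first sample, inclusive
    end: int    # index one past the last sample, exclusive


def locate_span(words: List[Word], target: List[str]) -> Optional[Tuple[int, int]]:
    """Return the sample range of the first run of words equal to `target`, or None."""
    n = len(target)
    if n == 0:
        return None
    for i in range(len(words) - n + 1):
        if [w.text for w in words[i:i + n]] == target:
            return words[i].start, words[i + n - 1].end
    return None


def edit_voice(samples: List[int], span: Tuple[int, int],
               replacement: Optional[List[int]] = None) -> List[int]:
    """Replace the samples inside `span` with `replacement`; delete them if it is None."""
    start, end = span
    middle = replacement if replacement is not None else []
    return samples[:start] + middle + samples[end:]


def append_voice(edited: List[int], follow_up: List[int]) -> List[int]:
    """Splice a follow-up recording onto the end of an already edited one."""
    return edited + follow_up


# Hypothetical data: 12 samples, two samples per recognized word.
first_voice = list(range(12))
words = [Word("I'm", 0, 2), Word("on", 2, 4), Word("Renmin", 4, 6),
         Word("East", 6, 8), Word("Road", 8, 10), Word("now", 10, 12)]

span = locate_span(words, ["Renmin", "East", "Road"])   # target voice message
second_voice = [100, 101, 102]                           # re-recorded correction
third_voice = edit_voice(first_voice, span, second_voice)
deleted_voice = edit_voice(first_voice, span)            # deletion variant
fifth_voice = append_voice(third_voice, [200, 201])      # fourth message appended
```

A real implementation would work on encoded audio and would need word-level timestamps from the speech recognizer; the list slicing here only shows where the cut points fall.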

Landscapes

  • Engineering & Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • User Interface Of Digital Computer (AREA)
  • Telephone Function (AREA)

Abstract

The present application relates to the technical field of communications. Disclosed are a speech input method and apparatus, and an electronic device. The method comprises: receiving a first speech message input by a user and displaying first speech content corresponding to the first speech message; receiving a first input for first content by a user, the first content being content corresponding to target content in the first speech content; and in response to the first input, replacing or deleting a target speech message corresponding to the target content in the first speech message. Embodiments of the present application are applied in a process of sending a message by an electronic device.

Description

Voice input method, device and electronic device
Cross-Reference to Related Applications
This application claims priority to Chinese Patent Application No. 202011529379.7, filed in China on December 22, 2020, the entire contents of which are incorporated herein by reference.
Technical Field
The present application belongs to the field of communication technologies, and in particular relates to a voice input method, a voice input device, and an electronic device.
Background
Typically, when a user communicates with a contact through a chat application on an electronic device, the user can communicate by sending voice messages. Specifically, the user can speak while long-pressing the voice recording control on the chat conversation interface, so that the electronic device records the user's voice content in real time and, once recording is complete, sends the voice message to the contact.
However, with this method, if the content of the voice input is wrong, the user has to perform an input that triggers the electronic device to cancel the ongoing recording, record again, and only then send a correct voice message to the contact. The user's operation is therefore cumbersome and time-consuming, especially when the user inputs a lot of voice content, so the electronic device edits messages inefficiently.
Summary of the Invention
The purpose of the embodiments of the present application is to provide a voice input method, a voice input device, and an electronic device that can solve the problem that the electronic device edits messages inefficiently.
To solve the above technical problem, the present application is implemented as follows:
In a first aspect, an embodiment of the present application provides a voice input method. The method includes: receiving a first voice message input by a user, and displaying first voice content corresponding to the first voice message; receiving a first input by the user for first content, the first content being content corresponding to target content in the first voice content; and, in response to the first input, replacing or deleting the target voice message corresponding to the target content in the first voice message.
In a second aspect, an embodiment of the present application provides a voice input device. The device includes a receiving module, a display module, and a processing module. The receiving module is configured to receive a first voice message input by a user. The display module is configured to display first voice content corresponding to the first voice message. The receiving module is further configured to receive a first input by the user for first content, the first content being content corresponding to target content in the first voice content. The processing module is configured to, in response to the first input received by the receiving module, replace or delete the target voice message corresponding to the target content in the first voice message.
In a third aspect, an embodiment of the present application provides an electronic device. The electronic device includes a processor, a memory, and a program or instructions stored in the memory and executable on the processor; when executed by the processor, the program or instructions implement the steps of the method according to the first aspect.
In a fourth aspect, an embodiment of the present application provides a readable storage medium on which a program or instructions are stored; when executed by a processor, the program or instructions implement the steps of the method according to the first aspect.
In a fifth aspect, an embodiment of the present application provides a chip. The chip includes a processor and a communication interface, the communication interface is coupled to the processor, and the processor is configured to run a program or instructions to implement the method according to the first aspect.
In the embodiments of the present application, the user can input the first voice message to trigger the electronic device to display the first voice content corresponding to the first voice message, and can then perform an input on the first content corresponding to the target content in the first voice content, so that the electronic device can replace or delete the target voice message corresponding to the target content in the first voice message. Because the electronic device can display the first voice content corresponding to the first voice message in real time while the user inputs the first voice message to be sent, when the target content in the first voice content is wrong the user can perform an input on the first content corresponding to the target content, and the electronic device can replace or delete the erroneous target voice message in the first voice message without the user having to input the first voice message again. This simplifies the user's operation: the displayed first voice content lets the user see at a glance which content is wrong and correct it promptly, which improves the efficiency with which the electronic device edits messages.
Description of Drawings
FIG. 1 is the first schematic diagram of a voice input method provided by an embodiment of the present application;
FIG. 2 is the first schematic diagram of an example of an interface of a mobile phone provided by an embodiment of the present application;
FIG. 3 is the second schematic diagram of a voice input method provided by an embodiment of the present application;
FIG. 4 is the second schematic diagram of an example of an interface of a mobile phone provided by an embodiment of the present application;
FIG. 5 is the third schematic diagram of a voice input method provided by an embodiment of the present application;
FIG. 6 is a schematic structural diagram of a voice input device provided by an embodiment of the present application;
FIG. 7 is a schematic diagram of a hardware structure of an electronic device provided by an embodiment of the present application.
具体实施方式Detailed ways
下面将结合本申请实施例中的附图,对本申请实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例是本申请一部分实施例,而不是全部的实施例。基于本申请中的实施例,本领域普通技术人员在没有作出创造性劳动前提下所获得的所有其他实施例,都属于本申请保护的范围。The technical solutions in the embodiments of the present application will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present application. Obviously, the described embodiments are part of the embodiments of the present application, not all of the embodiments. Based on the embodiments in the present application, all other embodiments obtained by those of ordinary skill in the art without creative work fall within the protection scope of the present application.
本申请的说明书和权利要求书中的术语“第一”、“第二”等是用于区别类似的对象,而不用于描述特定的顺序或先后次序。应该理解这样使用的数据在适当情况下可以互换,以便本申请的实施例能够以除了在这里图示或描述的那些以外的顺序实施,且“第一”、“第二”等所区分的对象通常为一类,并不限定对象的个数,例如第一对象可以是一个,也可以是多个。此外,说明书以及权利要求中“和/或”表示所连接对象的至少其中之一,字符“/”,一般表示前后关联对象是一种“或”的关系。The terms "first", "second" and the like in the description and claims of the present application are used to distinguish similar objects, and are not used to describe a specific order or sequence. It is to be understood that the data so used are interchangeable under appropriate circumstances so that the embodiments of the present application can be practiced in sequences other than those illustrated or described herein, and distinguish between "first", "second", etc. The objects are usually of one type, and the number of objects is not limited. For example, the first object may be one or more than one. In addition, "and/or" in the description and claims indicates at least one of the connected objects, and the character "/" generally indicates that the associated objects are in an "or" relationship.
The voice input method provided by the embodiments of the present application is described in detail below, with reference to the accompanying drawings, through specific embodiments and their application scenarios.
In the embodiments of the present application, when a user sends a voice message (for example: "I am on Renmin East Road now, you can come and find me, and we will go to eat together") to a target contact through a chat application in an electronic device, the user can input the voice message to be sent on the conversation interface corresponding to the target contact. If the user realizes that the voice message to be sent contains erroneous content (for example, "Renmin East Road"), the user can trigger the electronic device, after it has acquired the voice message input by the user, to place the voice message in a to-be-edited state. The user can then make an input directed at the erroneous content, that is, input the correct content corresponding to the erroneous content (for example, speak "Renmin West Road"). According to the user's input, the electronic device replaces the erroneous content in the voice message with the correct content to obtain a new voice message (for example: "I am on Renmin West Road now, you can come and find me, and we will go to eat together"), and sends the new voice message to the target contact. This can improve the efficiency with which the electronic device edits messages.
An embodiment of the present application provides a voice input method. FIG. 1 shows a flowchart of the voice input method provided by this embodiment, and the method can be applied to an electronic device. As shown in FIG. 1, the voice input method provided by this embodiment may include the following steps 201 to 203.
Step 201: The electronic device receives a first voice message input by a user, and displays first voice content corresponding to the first voice message.
In the embodiments of the present application, when the user sends a voice message to a contact, the user can input the first voice message to be sent, so that the electronic device can acquire and display the first voice content (that is, text content) corresponding to the first voice message input by the user. The user can then make an input of first content, so that the electronic device can replace or delete the target voice message that corresponds to the target content in the first voice message.
Optionally, in the embodiments of the present application, when the electronic device displays the conversation page corresponding to the target contact in a chat application, the user can make an input on the control for sending voice that is displayed in the conversation page, and then speak the voice content so as to input the first voice message.
Optionally, in the embodiments of the present application, the user can make a long-press input or a tap input on the control for sending voice to trigger the electronic device to enter a voice recording state (that is, to perform a recording function). The first voice message is the voice input made while the user is long-pressing the control for sending voice, or the voice input made after the user taps the control for sending voice.
Optionally, in the embodiments of the present application, while the user is inputting the first voice message, the electronic device can perform the recording function to record the user's voice input (that is, the first voice message), acquire the first voice content corresponding to the first voice message, and display the first voice content.
It should be noted that, where the user's input on the control for sending voice is a long-press input, the long-press input and the user's voice input (that is, the first voice message) are made simultaneously. In other words, when the user makes the voice input while long-pressing, the electronic device can acquire the first voice content corresponding to the user's first voice message. If the user makes a voice input without making the long-press input, the electronic device cannot acquire the first voice content corresponding to the user's first voice message.
Optionally, in the embodiments of the present application, when the user starts to input the first voice message, the electronic device can convert the acquired voice content of the first voice message into text content, and display the first voice content (that is, the text content) corresponding to the first voice message at a preset position on the screen.
Optionally, in the embodiments of the present application, the electronic device can display the complete first voice content corresponding to the first voice message at the preset position on the screen after the user finishes inputting the first voice message (that is, after the voice input ends). Alternatively, the electronic device can display the first voice content progressively at the preset position while the user is inputting the first voice message (that is, display the corresponding text as the user's voice input progresses), converting the voice content of the first voice message into text content and displaying it in real time.
Optionally, in the embodiments of the present application, the displayed first voice content corresponding to the first voice message is in an editable state. The user can make a selection input on part or all of this text content, so that the electronic device edits (for example, modifies or replaces) the selected content.
Optionally, in the embodiments of the present application, the electronic device can acquire the first voice content corresponding to the first voice message, and place the first voice message in a to-be-edited state.
Optionally, in the embodiments of the present application, after the electronic device acquires the first voice content corresponding to the first voice message, the user can, while long-pressing the voice input control, make a sliding input that slides to a preset area (for example, the display area in which the first voice content is displayed), so as to trigger the electronic device to place the first voice content in the to-be-edited state.
It should be noted that the long-press input and the sliding input on the voice input control can form one complete input. That is, there is no time interval between the long-press input and the sliding input, and together they constitute a single continuous input.
Optionally, in the embodiments of the present application, when the electronic device has placed the first voice message in the to-be-edited state, the user can, through an input, trigger the electronic device to replace content in the first voice message with other content (that is, the user can trigger the electronic device to correct erroneous content in the first voice message).
Exemplarily, the description below takes a mobile phone as the electronic device. As shown in FIG. 2, when the user makes an input on the voice input control, the mobile phone displays a recording interface 10. The recording interface 10 includes a text display area 11, in which the text content corresponding to the user's voice input (for example: "I am on Renmin East Road now, you can come and find me, and we will go to eat together") is displayed synchronously.
Step 202: The electronic device receives a first input of first content from the user.
In the embodiments of the present application, the first content is content corresponding to the target content in the first voice content.
Optionally, in the embodiments of the present application, the user can make a voice input again, so that the electronic device can record the voice content corresponding to the first content; alternatively, the user can make a text input to input the first content.
Step 203: In response to the first input, the electronic device replaces or deletes the target voice message that corresponds to the target content in the first voice message.
Optionally, in the embodiments of the present application, the electronic device can replace the target content in the first voice message with the first content, or delete the target content from the first voice message.
Optionally, in the embodiments of the present application, the electronic device can perform semantic analysis on the first voice message to determine the target content in the first voice message.
Optionally, in the embodiments of the present application, the electronic device can cut the target content out of the first voice message, and combine the voice content corresponding to the first content with the remainder of the first voice message to obtain a new voice message (that is, the third voice message described below).
It should be noted that the electronic device can delete the voice content corresponding to the target content from the first voice message, and insert the voice content corresponding to the first content at the position that the deleted voice content occupied in the first voice message, thereby assembling the new voice message.
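By way of a non-limiting illustration, the delete-and-insert operation described above can be sketched as follows. The sketch assumes that a voice message is held as a sequence of audio samples and that the sample range of the target voice message is already known; the function name `splice_audio` and the sample values are hypothetical and do not appear in the application.

```python
from typing import List, Optional, Sequence


def splice_audio(
    samples: Sequence[int],
    start: int,
    end: int,
    replacement: Optional[Sequence[int]] = None,
) -> List[int]:
    """Return a new sample list in which samples[start:end] is removed and,
    if a replacement is given, the replacement is inserted at that position."""
    if not 0 <= start <= end <= len(samples):
        raise ValueError("target segment is out of range")
    middle = list(replacement) if replacement is not None else []
    return list(samples[:start]) + middle + list(samples[end:])


# Hypothetical data: the target voice message occupies samples 2..3.
first_message = [1, 2, 3, 4, 5, 6]
replaced = splice_audio(first_message, 2, 4, [9, 9, 9])  # replace the target segment
deleted = splice_audio(first_message, 2, 4)              # delete the target segment
```

The original message is left unmodified, so the electronic device could still discard the edit; a practical implementation would likely also smooth the joins between segments, which this sketch does not attempt.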
Optionally, in the embodiments of the present application, when the electronic device displays the first voice content corresponding to the first voice message, the electronic device can, according to the user's input, replace the target text content in the first voice content with the text content corresponding to the first content to obtain the replaced first voice content, and update the first voice content displayed on the screen.
Optionally, in the embodiments of the present application, the electronic device can, according to the replaced first voice content, replace the target content in the first voice message with the first content to generate a new voice message.
Optionally, in the embodiments of the present application, the electronic device can send the new voice message obtained after the replacement to the contact.
An embodiment of the present application provides a voice input method. A user can input a first voice message to be sent to trigger the electronic device to display first voice content corresponding to the first voice message, and can then input first content corresponding to target content in the first voice content, so that the electronic device replaces or deletes the target voice message that corresponds to the target content in the first voice message. Because the electronic device can display the first voice content in real time while the user inputs the first voice message to be sent, when the target content in the first voice content is wrong, the user can input the first content corresponding to the target content so that the electronic device replaces or deletes the erroneous target voice message in the first voice message, without the user having to input the first voice message again. This simplifies the user's operation: the displayed first voice content lets the user identify the erroneous content intuitively and correct it promptly, thereby improving the efficiency with which the electronic device edits messages.
Optionally, in the embodiments of the present application, with reference to FIG. 1 and as shown in FIG. 3, after step 201 the voice input method provided by the embodiments of the present application may further include the following steps 301 and 302. In addition, step 202 may be implemented by the following step 202a, and step 203 may be implemented by the following step 203a.
Step 301: The electronic device receives a second input from the user.
In the embodiments of the present application, the second input is the user's selection input on the target content.
In the embodiments of the present application, the second input is the user's input on the target text content (that is, the target content) in the first voice content.
Optionally, in the embodiments of the present application, the user can make an input on the target content to trigger the electronic device to select the target content; further, the electronic device can display the target content in a marked manner.
Optionally, in the embodiments of the present application, the second input may be any one of the following: the user's tap input on the target content, the user's long-press input on the target content, the user's double-tap input on the target content, and the like.
Step 302: In response to the second input, the electronic device determines the target voice message according to the target content.
Optionally, in the embodiments of the present application, the electronic device can display the target content in a marked manner and, according to the position of the target content in the first voice content, determine the target voice message corresponding to the target content from the first voice message.
Exemplarily, with reference to FIG. 2, the user can make an input on the text "Renmin East Road" within the text content displayed in the text display area 11 ("I am on Renmin East Road now, you can come and find me, and we will go to eat together") to trigger the mobile phone to highlight the text "Renmin East Road". The mobile phone then determines, according to the position of "Renmin East Road" within that text content, the voice portion corresponding to "Renmin East Road" from the voice message corresponding to the user's voice input.
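As a hedged sketch of how a position in the displayed text might be mapped back to a portion of the recorded voice message, the following example assumes that the speech recognizer supplies a start and end time for each recognized text fragment. The application does not specify such a mechanism; the `TimedToken` type, the function `locate_audio_span`, and all timing values are hypothetical.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass
class TimedToken:
    """A recognized text fragment with its (assumed) position in the audio."""
    text: str
    start_ms: int
    end_ms: int


def locate_audio_span(
    tokens: List[TimedToken], sel_start: int, sel_end: int
) -> Tuple[int, int]:
    """Map the selected character range [sel_start, sel_end) of the transcript
    to the (start_ms, end_ms) range covered by the overlapping tokens."""
    offset = 0
    span_start: Optional[int] = None
    span_end: Optional[int] = None
    for tok in tokens:
        tok_start, tok_end = offset, offset + len(tok.text)
        if tok_end > sel_start and tok_start < sel_end:  # token overlaps selection
            if span_start is None:
                span_start = tok.start_ms
            span_end = tok.end_ms
        offset = tok_end
    if span_start is None or span_end is None:
        raise ValueError("selection does not overlap any token")
    return span_start, span_end


# Hypothetical recognizer output for the example sentence.
tokens = [
    TimedToken("I am on ", 0, 900),
    TimedToken("Renmin East Road", 900, 2100),
    TimedToken(", come find me", 2100, 3400),
]
transcript = "".join(t.text for t in tokens)
sel_start = transcript.index("Renmin East Road")
span = locate_audio_span(tokens, sel_start, sel_start + len("Renmin East Road"))
```

The returned time range identifies the target voice message within the first voice message; a selection that covers only part of a fragment is widened to the whole fragment in this sketch.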
Step 202a: The electronic device receives a second voice message input by the user.
In the embodiments of the present application, the second voice message is the voice message corresponding to the first content.
Optionally, in the embodiments of the present application, after the electronic device displays the first voice content, the user can make a voice input of the first content, so that the electronic device can acquire the second voice message input by the user (that is, the voice message corresponding to the first content).
Step 203a: According to the second voice message, the electronic device replaces the target voice message in the first voice message with the second voice message, or deletes the target voice message from the first voice message, to obtain a third voice message.
Optionally, in the embodiments of the present application, the voice content corresponding to the user's voice input includes at least the voice content of the first content.
Optionally, in the embodiments of the present application, the voice content corresponding to the user's voice input may also include other content, and the electronic device can obtain the voice content of the first content by performing semantic analysis on the voice content corresponding to the user's voice input.
Optionally, in the embodiments of the present application, the electronic device processes the first voice message according to the first content obtained by the semantic analysis, so as to replace or delete the target voice content.
In the embodiments of the present application, after the electronic device displays the first voice content corresponding to the first voice message, the user can make a selection input on the target content in the first voice content, so that the electronic device can determine the target voice message in the first voice message according to the user's input on the target content. According to the second voice message, which corresponds to the first content input by the user, the electronic device replaces the target voice message in the first voice message with the second voice message, or deletes the target voice message from the first voice message, to obtain the third voice message. The user can thus accurately specify the content in the first voice message that needs to be replaced or deleted.
Optionally, in the embodiments of the present application, after step 201 the voice input method provided by the embodiments of the present application may further include the following steps 401 to 403.
Step 401: The electronic device displays a target control.
In the embodiments of the present application, the target control is used to edit the first voice message.
Optionally, in the embodiments of the present application, when the user starts to make an input on the voice input control (that is, when the electronic device displays the voice input interface), the electronic device can display the target control at a preset position on the screen, so that the user can make an input on the target control to trigger the electronic device to enter the voice recording state.
Optionally, in the embodiments of the present application, when the electronic device displays the target control, the electronic device does not need to display the first voice content corresponding to the first voice message on the screen.
Step 402: The electronic device receives a third input from the user.
In the embodiments of the present application, the third input is the user's input on the target control.
Optionally, in the embodiments of the present application, after long-pressing the voice input control, the user can make a sliding input that slides to the position of the target control, thereby triggering the electronic device to enter the voice recording state.
Step 403: In response to the third input, the electronic device enters the voice recording state.
Optionally, in the embodiments of the present application, when the electronic device is in the voice recording state, the user can make a voice input, and the electronic device records the voice content input by the user.
Exemplarily, with reference to FIG. 2 and as shown in FIG. 4, when the user makes an input on the voice input control, the mobile phone displays the recording interface 10, which includes a target control 12. After finishing the input of the first voice message (for example: "I am on Renmin East Road now, you can come and find me, and we will go to eat together"), the user can make an input on the target control 12 to trigger the mobile phone to enter the voice recording state. The user can then make a further voice input (for example: "replace Renmin East Road with Renmin West Road"), so that the mobile phone can, according to that voice input, replace the voice content corresponding to "Renmin East Road" in the first voice message with the voice content corresponding to "Renmin West Road".
Optionally, in the embodiments of the present application, before "replacing or deleting the target voice message that corresponds to the target content in the first voice message" in step 203, the voice input method provided by the embodiments of the present application may further include the following step 404.
Step 404: In response to the first input, the electronic device performs semantic analysis on the second voice message to determine the first content and the target content.
Optionally, in the embodiments of the present application, the electronic device can perform semantic analysis on the second voice message by means of intelligent semantic analysis, so as to determine the content to be replaced in the first voice message (that is, the target content) and the first content that replaces it.
Exemplarily, when the user needs to modify "Renmin East Road" in the first voice message "I am on Renmin East Road now, you can come and find me, and we will go to eat together", the content of the user's voice input can be "replace Renmin East Road with Renmin West Road", "Renmin East Road is wrong, it should be Renmin West Road", or the like. The electronic device can then replace "Renmin East Road" in the first voice message with "Renmin West Road" to obtain the replaced first voice message "I am on Renmin West Road now, you can come and find me, and we will go to eat together".
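The application does not specify how the semantic analysis is performed. Purely as an illustrative stand-in, the following sketch extracts the target content and the first content from the recognized text of a correction command using two fixed phrase patterns modeled on the examples above. The patterns and the function names `parse_correction` and `apply_correction` are hypothetical, and a real implementation would presumably use a far more capable language-understanding component.

```python
import re
from typing import Optional, Tuple

# Hypothetical phrase patterns modeled on the two example commands.
_PATTERNS = [
    re.compile(r"^replace (?P<target>.+?) with (?P<new>.+)$", re.IGNORECASE),
    re.compile(r"^(?P<target>.+?) is wrong, it should be (?P<new>.+)$", re.IGNORECASE),
]


def parse_correction(command: str) -> Optional[Tuple[str, str]]:
    """Return (target content, first content) if the command matches a known
    pattern, otherwise None."""
    text = command.strip().rstrip(".")
    for pattern in _PATTERNS:
        match = pattern.match(text)
        if match:
            return match.group("target").strip(), match.group("new").strip()
    return None


def apply_correction(transcript: str, command: str) -> str:
    """Replace the first occurrence of the target content in the transcript;
    return the transcript unchanged if the command cannot be applied."""
    parsed = parse_correction(command)
    if parsed is None or parsed[0] not in transcript:
        return transcript
    target, new = parsed
    return transcript.replace(target, new, 1)


first_text = "I am on Renmin East Road now, you can come and find me"
corrected = apply_correction(
    first_text, "Renmin East Road is wrong, it should be Renmin West Road"
)
```

This sketch operates on the text only; the corresponding edit of the audio would additionally require locating the target voice message and splicing in the voice content of the first content.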
In the embodiments of the present application, after the electronic device places the first voice message in the to-be-edited state, the electronic device can display the target control for editing the first voice message. By making an input on the target control, the user causes the electronic device to enter the voice recording state, and the electronic device performs semantic analysis on the user's voice input to accurately determine the first content and the target content. The user can thus flexibly control the electronic device to perform the corresponding operation through voice input.
It should be noted that, where step 404 is performed, step 203 may be replaced with "the electronic device replaces or deletes the target voice message that corresponds to the target content in the first voice message".
Optionally, in the embodiments of the present application, with reference to FIG. 1 and as shown in FIG. 5, after step 203 the voice input method provided by the embodiments of the present application may further include the following steps 501 to 503.
Step 501: The electronic device receives a fourth input from the user.
In the embodiments of the present application, the fourth input is the user's input of a fourth voice message.
Optionally, in the embodiments of the present application, after the electronic device replaces the target content in the first voice message with the first content to obtain the third voice message, the user can make a voice input again to input the fourth voice message.
It should be noted that, where the user inputs the fourth voice message, the first voice message and the fourth voice message can be understood as parts of one complete voice message (that is, the fifth voice message described below). When inputting this complete voice message, the user can pause the voice input after finishing the first part (that is, the first voice message) in order to replace the erroneous content (that is, the target content) in the first voice message, and then, once the erroneous content has been replaced, continue with the input of the second part (that is, the fourth voice message).
Step 502: In response to the fourth input, the electronic device combines the fourth voice message with the third voice message to obtain a fifth voice message.
Optionally, in the embodiments of the present application, the electronic device can append the fourth voice message to the third voice message to obtain one complete voice message (that is, the fifth voice message).
Optionally, in the embodiments of the present application, the electronic device can perform voice splicing on the fourth voice message and the third voice message, so as to combine the two voice messages into one voice message.
Step 503: The electronic device sends the fifth voice message, which includes the third voice message.
Optionally, in the embodiments of the present application, where the user makes the fourth input, that is, when the electronic device receives the user's input of the fourth voice message, the voice message sent by the electronic device is the fifth voice message, which includes the third voice message; where the user does not make the fourth input, the voice message sent by the electronic device is the third voice message.
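The choice between sending the third voice message and sending the fifth voice message can be sketched as below, again assuming, for illustration only, that each voice message is a sequence of audio samples and that the fourth voice message follows the third. The function name `message_to_send` is hypothetical.

```python
from typing import List, Optional, Sequence


def message_to_send(
    third: Sequence[int], fourth: Optional[Sequence[int]] = None
) -> List[int]:
    """Return the samples to send: the third voice message alone when no
    fourth voice message was input, otherwise the fifth voice message formed
    by appending the fourth voice message to the third."""
    if fourth is None:
        return list(third)
    return list(third) + list(fourth)


third_message = [1, 2, 9, 9, 9, 5, 6]   # hypothetical edited message
fourth_message = [7, 8]                 # hypothetical continuation
only_third = message_to_send(third_message)
fifth_message = message_to_send(third_message, fourth_message)
```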
In the embodiments of the present application, before the electronic device sends the third voice message, the user can input the fourth voice message, so that the electronic device combines the fourth voice message with the third voice message to obtain the fifth voice message and sends the fifth voice message, which includes the third voice message. This can improve the flexibility with which the electronic device sends voice messages.
It should be noted that the voice input method provided by the embodiments of the present application may be executed by a voice input apparatus, or by a control module in the voice input apparatus for executing the voice input method. In the embodiments of the present application, the voice input apparatus provided by the embodiments is described by taking the case in which the voice input apparatus executes the voice input method as an example.
FIG. 6 shows a possible schematic structural diagram of the voice input apparatus involved in the embodiments of the present application. As shown in FIG. 6, the voice input apparatus 70 may include a receiving module 71, a display module 72 and a processing module 73.
其中,接收模块71,用于接收用户输入的第一语音消息。显示模块72,用于显示与第一语音消息对应的第一语音内容。接收模块71,还用于接收用户对第一内容的第一输入,该第一内容为与第一语音内容中的目标内容对应的内容。处理模块73,用于响应于接收模块71接收的第一输入,对第一语音消息中与目标内容对应的目标语音消息进行替换或删除。The receiving module 71 is configured to receive the first voice message input by the user. The display module 72 is configured to display the first voice content corresponding to the first voice message. The receiving module 71 is further configured to receive a user's first input of the first content, where the first content is content corresponding to the target content in the first voice content. The processing module 73 is configured to replace or delete the target voice message corresponding to the target content in the first voice message in response to the first input received by the receiving module 71 .
In a possible implementation, the voice input device 70 provided in the embodiments of the present application may further include a determining module. The receiving module 71 is further configured to receive a second input from the user after the display module 72 displays the first voice content corresponding to the first voice message, where the second input is the user's selection input on the target content. The determining module is configured to, in response to the second input received by the receiving module 71, determine the target voice message according to the target content. The receiving module 71 is specifically configured to receive a second voice message input by the user, where the second voice message is a voice message corresponding to the first content. The processing module 73 is specifically configured to, according to the second voice message, replace the target voice message in the first voice message with the second voice message, or delete the target voice message in the first voice message, so as to obtain a third voice message.
In a possible implementation, the voice input device 70 provided in the embodiments of the present application may further include a control module. The display module 72 is further configured to display a target control after displaying the first voice content corresponding to the first voice message, where the target control is used to edit the first voice message. The receiving module 71 is further configured to receive a third input from the user, where the third input is the user's input on the target control. The control module is configured to, in response to the third input received by the receiving module 71, control the voice input device to be in a voice recording state.
In a possible implementation, the processing module 73 is further configured to, before replacing or deleting the target voice message corresponding to the target content in the first voice message, perform semantic analysis on the second voice message to determine the first content and the target content.
In a possible implementation, the voice input device 70 provided in the embodiments of the present application may further include a sending module. The receiving module 71 is further configured to receive a fourth input from the user after the processing module 73 replaces or deletes the target voice message corresponding to the target content in the first voice message, where the fourth input is the user's input of a fourth voice message. The processing module 73 is further configured to, in response to the fourth input received by the receiving module 71, combine the fourth voice message with the third voice message to obtain a fifth voice message. The sending module is configured to send the fifth voice message, which includes the third voice message.
The voice input device provided in the embodiments of the present application can implement each process implemented by the voice input device in the foregoing method embodiments. To avoid repetition, details are not described here again.
An embodiment of the present application provides a voice input device. While the user is inputting a first voice message to be sent, the voice input device can display, in real time, the first voice content corresponding to the first voice message. When the target content in the first voice content is wrong, the user can therefore make an input for the first content corresponding to the target content, so that the voice input device replaces or deletes the erroneous target voice message in the first voice message. The user does not need to re-input the first voice message, which simplifies the user's operation. The displayed first voice content also makes it easy for the user to identify the erroneous content intuitively and correct it in time, thereby improving the efficiency with which the voice input device edits messages.
The voice input device in the embodiments of the present application may be a device, or may be a component, an integrated circuit, or a chip in a terminal. The device may be a mobile electronic device or a non-mobile electronic device. For example, the mobile electronic device may be a mobile phone, a tablet computer, a notebook computer, a palmtop computer, an in-vehicle electronic device, a wearable device, an ultra-mobile personal computer (UMPC), a netbook, or a personal digital assistant (PDA); the non-mobile electronic device may be a server, a network attached storage (NAS), a personal computer (PC), a television (TV), a teller machine, or a self-service machine. The embodiments of the present application impose no specific limitation.
The voice input device in the embodiments of the present application may be a device having an operating system. The operating system may be an Android operating system, an iOS operating system, or another possible operating system, which is not specifically limited in the embodiments of the present application.
Optionally, an embodiment of the present application further provides an electronic device, including a processor 110, a memory 109, and a program or instructions stored in the memory 109 and executable on the processor 110. When executed by the processor 110, the program or instructions implement each process of the foregoing voice input method embodiments and can achieve the same technical effects. To avoid repetition, details are not described here again.
It should be noted that the electronic devices in the embodiments of the present application include the mobile electronic devices and non-mobile electronic devices described above.
FIG. 7 is a schematic diagram of the hardware structure of an electronic device implementing an embodiment of the present application.
The electronic device 100 includes, but is not limited to, components such as a radio frequency unit 101, a network module 102, an audio output unit 103, an input unit 104, a sensor 105, a display unit 106, a user input unit 107, an interface unit 108, a memory 109, and a processor 110.
Those skilled in the art will understand that the electronic device 100 may further include a power supply (such as a battery) that supplies power to the components. The power supply may be logically connected to the processor 110 through a power management system, so that functions such as charging management, discharging management, and power consumption management are implemented through the power management system. The structure of the electronic device shown in FIG. 7 does not constitute a limitation on the electronic device. The electronic device may include more or fewer components than shown, combine certain components, or use a different arrangement of components; details are not described here again.
The user input unit 107 is configured to receive a first voice message input by a user.
The display unit 106 is configured to display first voice content corresponding to the first voice message.
The processor 110 is configured to, in response to a first input, replace or delete a target voice message corresponding to target content in the first voice message.
An embodiment of the present application provides an electronic device. While the user is inputting a first voice message to be sent, the electronic device can display, in real time, the first voice content corresponding to the first voice message. When the target content in the first voice content is wrong, the user can therefore make an input for the first content corresponding to the target content, so that the electronic device replaces or deletes the erroneous target voice message in the first voice message. The user does not need to re-input the first voice message, which simplifies the user's operation. The displayed first voice content also makes it easy for the user to identify the erroneous content intuitively and correct it in time, thereby improving the efficiency with which the electronic device edits messages.
Optionally, the user input unit 107 is further configured to receive a second input from the user, where the second input is the user's selection input on the target content in the first text content.
The processor 110 is further configured to, in response to the second input, determine the target voice message according to the target content.
The user input unit 107 is specifically configured to receive a second voice message input by the user, where the second voice message is a voice message corresponding to the first content.
The processor 110 is specifically configured to, according to the second voice message, replace the target voice message in the first voice message with the second voice message, or delete the target voice message in the first voice message, so as to obtain a third voice message.
In this embodiment of the present application, after the electronic device displays the first voice content corresponding to the first voice message, the user can make a selection input on the target content in the first voice content. The electronic device can then determine the target voice message in the first voice message according to the user's input on the target content and, according to the second voice message that the user inputs for the first content, either replace the target voice message in the first voice message with the second voice message or delete the target voice message in the first voice message, so as to obtain a third voice message. In this way, the user can accurately specify the content in the first voice message that needs to be replaced or deleted.
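One way to picture how a selection on the displayed content could be mapped to the target voice message is shown below. The sketch assumes that the recognizer supplies, for each displayed word, the range of audio samples it was recognized from; that assumption, and the names `Word`, `char_spans`, `locate_target`, and `splice`, are hypothetical and are not described in the application.

```python
from dataclasses import dataclass
from typing import List, Sequence, Tuple


@dataclass(frozen=True)
class Word:
    """Hypothetical recognizer output: a word and its audio sample range."""
    text: str
    start: int  # index of the first audio sample of this word
    end: int    # index one past the last audio sample of this word


def char_spans(words: Sequence[Word]) -> List[Tuple[int, int]]:
    """Character span of each word in the displayed text, assuming the
    words are joined by single spaces."""
    spans, pos = [], 0
    for w in words:
        spans.append((pos, pos + len(w.text)))
        pos += len(w.text) + 1
    return spans


def locate_target(words: Sequence[Word], sel_start: int, sel_end: int) -> Tuple[int, int]:
    """Map the selected character range [sel_start, sel_end) to the sample
    range covering every word that the selection overlaps."""
    hits = [w for w, (a, b) in zip(words, char_spans(words))
            if a < sel_end and sel_start < b]
    if not hits:
        raise ValueError("selection does not cover any displayed word")
    return hits[0].start, hits[-1].end


def splice(audio: Sequence[int], target: Tuple[int, int],
           second: Sequence[int] = ()) -> List[int]:
    """Build the third voice message: replace the target sample range with
    the second voice message, or delete it when `second` is empty."""
    start, end = target
    return list(audio[:start]) + list(second) + list(audio[end:])


words = [Word("meet", 0, 4), Word("at", 4, 6), Word("nine", 6, 10)]
first_audio = list(range(10))               # stand-in for real samples
target = locate_target(words, 8, 12)        # user selects "nine"
third_replaced = splice(first_audio, target, [100, 101])
third_deleted = splice(first_audio, target)
```

A partial selection (for example, only one character of a word) is widened to the whole word here, so that the audio is never cut in the middle of a word.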
The display unit 106 is further configured to display a target control, where the target control is used to edit the first voice message.
The user input unit 107 is further configured to receive a third input from the user, where the third input is the user's input on the target control.
The processor 110 is further configured to, in response to the third input, control the electronic device to be in a voice recording state.
The processor 110 is further configured to perform semantic analysis on the second voice message to determine the first content and the target content.
In this embodiment of the present application, after the electronic device controls the first voice message to be in a to-be-edited state, the electronic device can display a target control for editing the first voice message. The user can make an input on the target control to put the electronic device into a voice recording state, and the electronic device performs semantic analysis on the user's voice input to accurately determine the first content and the target content. In this way, the user can flexibly control the electronic device to perform the corresponding operations through voice input.
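The semantic analysis step can be illustrated with a deliberately simple sketch that works on the recognized transcript of the second voice message. The command phrasings ("change X to Y", "delete X"), the `EditCommand` type, and the `parse_edit_command` function are assumptions made for this example; the application does not specify how the analysis is performed, and a real system would likely use a more capable language-understanding component.

```python
import re
from typing import NamedTuple, Optional


class EditCommand(NamedTuple):
    """Hypothetical result of analysing the second voice message."""
    action: str                    # "replace" or "delete"
    target_content: str            # content to be replaced or deleted
    first_content: Optional[str]   # new content; None for a deletion


# Assumed command phrasings, matched case-insensitively.
_REPLACE = re.compile(
    r"^(?:change|replace)\s+(?P<target>.+?)\s+(?:to|with)\s+(?P<new>.+)$",
    re.IGNORECASE,
)
_DELETE = re.compile(r"^(?:delete|remove)\s+(?P<target>.+)$", re.IGNORECASE)


def parse_edit_command(transcript: str) -> Optional[EditCommand]:
    """Extract the target content and the first content from a transcript,
    or return None when the transcript is not an edit command."""
    text = transcript.strip().rstrip(".")
    match = _REPLACE.match(text)
    if match:
        return EditCommand("replace", match["target"], match["new"])
    match = _DELETE.match(text)
    if match:
        return EditCommand("delete", match["target"], None)
    return None


replace_cmd = parse_edit_command("Change nine to ten.")
delete_cmd = parse_edit_command("delete at nine")
not_a_cmd = parse_edit_command("see you tomorrow")
```

Returning `None` for unrecognized input lets the caller fall back to treating the recording as ordinary speech rather than as an edit instruction.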
The user input unit 107 is further configured to receive a fourth input from the user, where the fourth input is the user's input of a fourth voice message.
The processor 110 is further configured to, in response to the fourth input, combine the fourth voice message with the third voice message to obtain a fifth voice message.
The network module 102 is configured to send the fifth voice message, which includes the third voice message.
In this embodiment of the present application, before the electronic device sends the third voice message, the user may input a fourth voice message, so that the electronic device combines the fourth voice message with the third voice message to obtain a fifth voice message. The electronic device can then send the fifth voice message, which includes the third voice message, thereby improving the flexibility with which the electronic device sends voice messages.
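The combination of the third and fourth voice messages can be sketched as a simple concatenation. The `VoiceMessage` type, the `combine` function, the choice to append the fourth message after the third, and the sample-rate check are all assumptions made for illustration; the application only states that the two messages are combined into a fifth voice message that includes the third.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass(frozen=True)
class VoiceMessage:
    """Hypothetical in-memory voice message: raw samples at a fixed rate."""
    sample_rate: int
    samples: Tuple[int, ...]


def combine(third: VoiceMessage, fourth: VoiceMessage) -> VoiceMessage:
    """Append the fourth voice message to the third to form the fifth.
    Both messages must share a sample rate in this sketch."""
    if third.sample_rate != fourth.sample_rate:
        raise ValueError("voice messages must share a sample rate")
    return VoiceMessage(third.sample_rate, third.samples + fourth.samples)


third = VoiceMessage(16000, (1, 2, 3))
fourth = VoiceMessage(16000, (4, 5))
fifth = combine(third, fourth)
```

Because the fifth message begins with the unmodified samples of the third, the edited content is carried into what is finally sent.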
It should be understood that, in the embodiments of the present application, the input unit 104 may include a graphics processing unit (GPU) 1041 and a microphone 1042. The graphics processing unit 1041 processes image data of still pictures or video obtained by an image capture device (such as a camera) in a video capture mode or an image capture mode. The display unit 106 may include a display panel 1061, which may be configured in the form of a liquid crystal display, an organic light-emitting diode, or the like. The user input unit 107 includes a touch panel 1071 and other input devices 1072. The touch panel 1071 is also called a touch screen and may include two parts: a touch detection device and a touch controller. The other input devices 1072 may include, but are not limited to, a physical keyboard, function keys (such as volume control keys and switch keys), a trackball, a mouse, and a joystick; details are not described here again. The memory 109 may be used to store software programs and various data, including but not limited to application programs and an operating system. The processor 110 may integrate an application processor and a modem processor, where the application processor mainly handles the operating system, user interface, application programs, and the like, and the modem processor mainly handles wireless communication. It can be understood that the modem processor may alternatively not be integrated into the processor 110.
An embodiment of the present application further provides a readable storage medium storing a program or instructions. When executed by a processor, the program or instructions implement each process of the foregoing voice input method embodiments and can achieve the same technical effects. To avoid repetition, details are not described here again.
The processor is the processor in the electronic device described in the foregoing embodiments. The readable storage medium includes a computer-readable storage medium, such as a computer read-only memory (ROM), a random access memory (RAM), a magnetic disk, or an optical disc.
An embodiment of the present application further provides a chip. The chip includes a processor and a communication interface, the communication interface is coupled to the processor, and the processor is configured to run a program or instructions to implement each process of the foregoing voice input method embodiments, achieving the same technical effects. To avoid repetition, details are not described here again.
It should be understood that the chip mentioned in the embodiments of the present application may also be referred to as a system-level chip, a system chip, a chip system, or a system-on-chip.
It should be noted that, herein, the terms "include", "comprise", and any other variants thereof are intended to cover a non-exclusive inclusion, so that a process, method, article, or device that includes a series of elements includes not only those elements but also other elements not expressly listed, or elements inherent to such a process, method, article, or device. Without further limitation, an element qualified by the phrase "including a..." does not preclude the presence of additional identical elements in the process, method, article, or device that includes the element. In addition, it should be noted that the scope of the methods and devices in the embodiments of the present application is not limited to performing functions in the order shown or discussed; depending on the functions involved, the functions may also be performed in a substantially simultaneous manner or in the reverse order. For example, the described methods may be performed in an order different from that described, and various steps may be added, omitted, or combined. Furthermore, features described with reference to certain examples may be combined in other examples.
From the description of the foregoing embodiments, those skilled in the art can clearly understand that the methods of the foregoing embodiments can be implemented by software plus a necessary general-purpose hardware platform, and can of course also be implemented by hardware, although in many cases the former is the better implementation. Based on this understanding, the technical solution of the present application, in essence, or the part of it that contributes over the prior art, can be embodied in the form of a software product. The computer software product is stored in a storage medium (such as a ROM/RAM, a magnetic disk, or an optical disc) and includes several instructions for causing a terminal (which may be a mobile phone, a computer, a server, an air conditioner, a network device, or the like) to execute the methods described in the embodiments of the present application.
The embodiments of the present application have been described above with reference to the accompanying drawings, but the present application is not limited to the specific embodiments described above, which are merely illustrative rather than restrictive. Inspired by the present application, those of ordinary skill in the art may devise many other forms without departing from the purpose of the present application and the scope protected by the claims, all of which fall within the protection of the present application.

Claims (15)

  1. A voice input method, the method comprising:
    receiving a first voice message input by a user, and displaying first voice content corresponding to the first voice message;
    receiving a first input by the user for first content, wherein the first content is content corresponding to target content in the first voice content;
    in response to the first input, replacing or deleting a target voice message corresponding to the target content in the first voice message.
  2. The method according to claim 1, wherein after the displaying first voice content corresponding to the first voice message, the method further comprises:
    receiving a second input from the user, wherein the second input is the user's selection input on the target content;
    in response to the second input, determining the target voice message according to the target content;
    the receiving a first input by the user for first content comprises:
    receiving a second voice message input by the user, wherein the second voice message is a voice message corresponding to the first content;
    the replacing or deleting a target voice message corresponding to the target content in the first voice message comprises:
    according to the second voice message, replacing the target voice message in the first voice message with the second voice message, or deleting the target voice message in the first voice message, to obtain a third voice message.
  3. The method according to claim 1, wherein after the displaying first voice content corresponding to the first voice message, the method further comprises:
    displaying a target control, wherein the target control is used to edit the first voice message;
    receiving a third input from the user, wherein the third input is the user's input on the target control;
    in response to the third input, controlling an electronic device to be in a voice recording state.
  4. The method according to claim 2, wherein before the replacing or deleting a target voice message corresponding to the target content in the first voice message, the method further comprises:
    performing semantic analysis on the second voice message to determine the first content and the target content.
  5. The method according to any one of claims 1 to 4, wherein after the replacing or deleting a target voice message corresponding to the target content in the first voice message, the method further comprises:
    receiving a fourth input from the user, wherein the fourth input is the user's input of a fourth voice message;
    in response to the fourth input, combining the fourth voice message with a third voice message to obtain a fifth voice message;
    sending the fifth voice message, which includes the third voice message.
  6. A voice input device, the voice input device comprising: a receiving module, a display module, and a processing module;
    the receiving module is configured to receive a first voice message input by a user;
    the display module is configured to display first voice content corresponding to the first voice message;
    the receiving module is further configured to receive a first input by the user for first content, wherein the first content is content corresponding to target content in the first voice content;
    the processing module is configured to, in response to the first input received by the receiving module, replace or delete a target voice message corresponding to the target content in the first voice message.
  7. The voice input device according to claim 6, wherein the voice input device further comprises: a determining module;
    the receiving module is further configured to receive a second input from the user after the display module displays the first voice content corresponding to the first voice message, wherein the second input is the user's selection input on the target content;
    the determining module is configured to, in response to the second input received by the receiving module, determine the target voice message according to the target content;
    the receiving module is specifically configured to receive a second voice message input by the user, wherein the second voice message is a voice message corresponding to the first content;
    the processing module is specifically configured to, according to the second voice message, replace the target voice message in the first voice message with the second voice message, or delete the target voice message in the first voice message, to obtain a third voice message.
  8. The voice input device according to claim 6, wherein the voice input device further comprises: a control module;
    the display module is further configured to display a target control after displaying the first voice content corresponding to the first voice message, wherein the target control is used to edit the first voice message;
    the receiving module is further configured to receive a third input from the user, wherein the third input is the user's input on the target control;
    the control module is configured to, in response to the third input received by the receiving module, control the voice input device to be in a voice recording state.
  9. The voice input device according to claim 7, wherein the processing module is further configured to, before replacing or deleting the target voice message corresponding to the target content in the first voice message, perform semantic analysis on the second voice message to determine the first content and the target content.
  10. The voice input device according to any one of claims 6 to 9, wherein the message sending device further comprises: a sending module;
    the receiving module is further configured to receive a fourth input from the user after the processing module replaces or deletes the target voice message corresponding to the target content in the first voice message, wherein the fourth input is the user's input of a fourth voice message;
    the processing module is further configured to, in response to the fourth input received by the receiving module, combine the fourth voice message with a third voice message to obtain a fifth voice message;
    the sending module is configured to send the fifth voice message, which includes the third voice message.
  11. An electronic device, comprising a processor, a memory, and a program or instructions stored in the memory and executable on the processor, wherein the program or instructions, when executed by the processor, implement the steps of the voice input method according to any one of claims 1 to 5.
  12. A readable storage medium, storing a program or instructions, wherein the program or instructions, when executed by a processor, implement the steps of the voice input method according to any one of claims 1 to 5.
  13. A computer software product, wherein the computer software product is executed by at least one processor to implement the voice input method according to any one of claims 1 to 5.
  14. An electronic device, wherein the electronic device is configured to perform the voice input method according to any one of claims 1 to 5.
  15. A chip, comprising a processor and a communication interface, wherein the communication interface is coupled to the processor, and the processor is configured to run a program or instructions to implement the voice input method according to any one of claims 1 to 5.
PCT/CN2021/138688 2020-12-22 2021-12-16 Speech input method and apparatus, and electronic device WO2022135259A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202011529379.7 2020-12-22
CN202011529379.7A CN112637407A (en) 2020-12-22 2020-12-22 Voice input method and device and electronic equipment

Publications (1)

Publication Number Publication Date
WO2022135259A1 true WO2022135259A1 (en) 2022-06-30

Family

ID=75320973

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2021/138688 WO2022135259A1 (en) 2020-12-22 2021-12-16 Speech input method and apparatus, and electronic device

Country Status (2)

Country Link
CN (1) CN112637407A (en)
WO (1) WO2022135259A1 (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112637407A (en) * 2020-12-22 2021-04-09 维沃移动通信有限公司 Voice input method and device and electronic equipment

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106933561A (en) * 2015-12-31 2017-07-07 北京搜狗科技发展有限公司 Pronunciation inputting method and terminal device
CN106952655A (en) * 2017-02-23 2017-07-14 深圳市金立通信设备有限公司 A kind of input method and terminal
US20180166080A1 (en) * 2016-12-08 2018-06-14 Guangzhou Shenma Mobile Information Technology Co. Ltd. Information input method, apparatus and computing device
CN108632465A (en) * 2018-04-27 2018-10-09 维沃移动通信有限公司 A kind of method and mobile terminal of voice input
CN108737634A (en) * 2018-02-26 2018-11-02 珠海市魅族科技有限公司 Pronunciation inputting method and device, computer installation and computer readable storage medium
CN112637407A (en) * 2020-12-22 2021-04-09 维沃移动通信有限公司 Voice input method and device and electronic equipment

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR102546510B1 (en) * 2018-03-21 2023-06-23 삼성전자주식회사 Method for providing information mapped between plurality inputs and electronic device supporting the same
CN110392158A (en) * 2018-04-19 2019-10-29 成都野望数码科技有限公司 A kind of message treatment method, device and terminal device

Also Published As

Publication number Publication date
CN112637407A (en) 2021-04-09

Legal Events

Date Code Title Description
121 EP: The EPO has been informed by WIPO that EP was designated in this application
Ref document number: 21909255; Country of ref document: EP; Kind code of ref document: A1

NENP Non-entry into the national phase
Ref country code: DE

122 EP: PCT application non-entry in European phase
Ref document number: 21909255; Country of ref document: EP; Kind code of ref document: A1