WO2022135259A1 - Procédé et appareil d'entrée vocale, et dispositif électronique - Google Patents

Procédé et appareil d'entrée vocale, et dispositif électronique Download PDF

Info

Publication number
WO2022135259A1
WO2022135259A1 PCT/CN2021/138688 CN2021138688W WO2022135259A1 WO 2022135259 A1 WO2022135259 A1 WO 2022135259A1 CN 2021138688 W CN2021138688 W CN 2021138688W WO 2022135259 A1 WO2022135259 A1 WO 2022135259A1
Authority
WO
WIPO (PCT)
Prior art keywords
voice message
input
voice
content
target
Prior art date
Application number
PCT/CN2021/138688
Other languages
English (en)
Chinese (zh)
Inventor
张孝东
Original Assignee
维沃移动通信有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 维沃移动通信有限公司 filed Critical 维沃移动通信有限公司
Publication of WO2022135259A1 publication Critical patent/WO2022135259A1/fr

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/048Interaction techniques based on graphical user interfaces [GUI]
    • G06F3/0484Interaction techniques based on graphical user interfaces [GUI] for the control of specific functions or operations, e.g. selecting or manipulating an object, an image or a displayed text element, setting a parameter value or selecting a range

Definitions

  • the present application belongs to the field of communication technologies, and in particular relates to a voice input method, device and electronic device.
  • the purpose of the embodiments of the present application is to provide a voice input method, device, and electronic device, which can solve the problem of low efficiency in editing a message by the electronic device.
  • an embodiment of the present application provides a voice input method, the method includes: receiving a first voice message input by a user, displaying first voice content corresponding to the first voice message; An input, the first content is content corresponding to the target content in the first voice content; in response to the first input, the target voice message corresponding to the target content in the first voice message is replaced or deleted.
  • an embodiment of the present application provides a voice input device, which includes: a receiving module, a display module, and a processing module.
  • the receiving module is used for receiving the first voice message input by the user.
  • the display module is used for displaying the first voice content corresponding to the first voice message.
  • the receiving module is further configured to receive a user's first input of the first content, where the first content is content corresponding to the target content in the first voice content.
  • the processing module is configured to replace or delete the target voice message corresponding to the target content in the first voice message in response to the first input received by the receiving module.
  • an embodiment of the present application provides a readable storage medium, where a program or an instruction is stored on the readable storage medium, and when the program or instruction is executed by a processor, the steps of the method according to the first aspect are implemented .
  • an embodiment of the present application provides a chip, the chip includes a processor and a communication interface, the communication interface is coupled to the processor, and the processor is configured to run a program or an instruction, and implement the first aspect the method described.
  • the user can input the first voice message to trigger the electronic device to display the first voice content corresponding to the first voice message, so that the user can enter the first voice content corresponding to the target content in the first voice content.
  • a content is input, so that the electronic device can replace or delete the target voice message corresponding to the target content in the first voice message.
  • the electronic device can display the first voice content corresponding to the first voice message in real time, so when the target content in the first voice content is incorrect, the user can The first content corresponding to the target content is input, so that the electronic device can replace or delete the erroneous target voice message in the first voice message.
  • Fig. 2 is one of the example schematic diagrams of an interface of a mobile phone provided by an embodiment of the present application
  • FIG. 3 is the second schematic diagram of a voice input method provided by an embodiment of the present application.
  • FIG. 4 is the second schematic diagram of an example of an interface of a mobile phone provided by an embodiment of the present application.
  • FIG. 5 is a third schematic diagram of a voice input method provided by an embodiment of the present application.
  • FIG. 6 is a schematic structural diagram of a voice input device provided by an embodiment of the present application.
  • FIG. 7 is a schematic diagram of a hardware structure of an electronic device provided by an embodiment of the present application.
  • first, second and the like in the description and claims of the present application are used to distinguish similar objects, and are not used to describe a specific order or sequence. It is to be understood that the data so used are interchangeable under appropriate circumstances so that the embodiments of the present application can be practiced in sequences other than those illustrated or described herein, and distinguish between “first”, “second”, etc.
  • the objects are usually of one type, and the number of objects is not limited.
  • the first object may be one or more than one.
  • “and/or” in the description and claims indicates at least one of the connected objects, and the character “/" generally indicates that the associated objects are in an "or” relationship.
  • the electronic device when the user sends a voice message to the target contact through a chat application program in the electronic device (for example: I am on Renmin East Road
  • the voice message to be sent can be input on the dialogue interface corresponding to the target contact. If the user realizes that there is an error in the voice message to be sent (for example, Renmin East Road), the electronic device can be triggered to obtain the user input.
  • FIG. 1 shows a flowchart of a voice input method provided by an embodiment of the present application, and the method can be applied to an electronic device.
  • the voice input method provided by this embodiment of the present application may include the following steps 201 to 203 .
  • Step 201 The electronic device receives the first voice message input by the user, and displays the first voice content corresponding to the first voice message.
  • the user when the user sends a voice message to a contact, the user can input the first voice message to be sent, so that the electronic device can acquire and display the first voice corresponding to the first voice message input by the user.
  • content ie, text content
  • the user can input the first content, so that the electronic device can replace or delete the target voice message corresponding to the target content in the first voice message.
  • the electronic device when the electronic device displays the conversation page corresponding to the target contact in the chat application, the user can input the control displayed in the conversation page for sending voice, and then the user can Voice content may be input to input the first voice message.
  • the user can perform a long-press input or a click input on the control used for sending voice to trigger the electronic device to be in a voice recording state (that is, perform a recording function), and the above-mentioned first voice message
  • the voice input is performed during the process of long-pressing the control for sending voice, or it is the voice input after the user taps the control for sending voice.
  • the electronic device may perform a recording function to record the user's voice input (ie, the first voice message), and obtain the first voice message.
  • the corresponding first voice content is displayed, and the first voice content is displayed.
  • the long-press input and the user's voice input are simultaneous inputs, that is, when the user performs By performing voice input while long-pressing the input, the electronic device can acquire the first voice content corresponding to the user's first voice message.
  • the electronic device cannot obtain the first voice content corresponding to the user's first voice message.
  • the electronic device may convert the voice content corresponding to the first voice message into text according to the acquired voice content corresponding to the first voice message. content, and display the first voice content (ie, text content) corresponding to the first voice message at a preset position on the screen.
  • the electronic device may directly display the complete first voice message corresponding to the first voice message in a preset position on the screen. a voice content; or, the electronic device may gradually display the first voice content corresponding to the first voice message at a preset position on the screen while the user is inputting the first voice message (that is, according to the progress of the user's voice input , display the corresponding text), that is, convert the voice content of the first voice message into text content in real time and display it.
  • the electronic device may acquire the first voice content corresponding to the first voice message, and control the first voice message to be in a to-be-edited state.
  • Step 203 in response to the first input, the electronic device replaces or deletes the target voice message corresponding to the target content in the first voice message.
  • the electronic device may delete the voice content corresponding to the target content from the first voice message, and add the voice content corresponding to the first content to the position where the voice content corresponding to the target content is located in the first voice message. , so as to combine to get a new voice message.
  • the voice input method provided by the embodiment of the present application may further include the following steps 301 and 302 , and the above steps 202 can be specifically implemented by the following step 202a, and the above-mentioned step 203 can be specifically implemented by the following step 203a.
  • Step 301 the electronic device receives a second input from the user.
  • the above-mentioned second input is the user's selection input of the target content.
  • the above-mentioned second input may be any one of the following: a user's click input on the target content, a user's long-press input on the target content, a user's double-click input on the target content, and the like.
  • Step 302 the electronic device determines the target voice message according to the target content in response to the second input.
  • the electronic device may mark and display the target content, and determine the target voice message corresponding to the target content from the first voice message according to the position of the target content in the first voice content.
  • the user can display the text content in the text display area 11: "I am on Renmin East Road, you can come and find me, let's go to eat together” in the text "Renmin East Road” Enter to trigger the mobile phone to highlight the text corresponding to "Renmin East Road", so that according to the location of "Renmin East Road” in “I'm on Renmin East Road now, you can come and find me, let's go to eat together", from the user
  • the voice part corresponding to "Renmin East Road” is determined in the voice message corresponding to the voice input.
  • Step 202a the electronic device receives the second voice message input by the user.
  • the above-mentioned second voice message is a voice message corresponding to the first content.
  • the voice content corresponding to the user's voice input may further include other content
  • the electronic device may obtain the voice content of the first content by performing semantic analysis on the voice content corresponding to the user's voice input.
  • the electronic device processes the first voice message according to the first content obtained by semantic analysis, so as to replace or delete the target voice content.
  • the user can select and input the target content in the first voice content, so that the electronic device can select and input the target content according to the user's input on the target content. , determine the target voice message in the first voice message, and replace the target voice message in the first voice message with the second voice message according to the second voice message corresponding to the first content input by the user, or delete the first voice message
  • the target voice message in the message; the third voice message is obtained, so that the user can accurately determine the content that needs to be replaced or deleted in the first voice message.
  • Step 401 the electronic device displays the target control.
  • the electronic device when the user starts to input the voice input control (that is, when the electronic device displays the voice input interface), the electronic device may display the target control at a preset position on the screen, so that the user can The target control makes an input to trigger the electronic device to be in a voice recording state.
  • the electronic device when the electronic device displays the target control, the electronic device does not need to display the first voice content corresponding to the first voice message on the screen.
  • the above-mentioned third input is the input of the user to the target control.
  • the user may perform sliding input after long-pressing the voice input control to slide to the position of the target control, thereby triggering the electronic device to be in a voice recording state.
  • the user when the electronic device is in a voice recording state, the user may input the first voice message to record the voice content input by the user.
  • the voice input method provided in the embodiment of the present application may further include: Step 404 is described below.
  • Step 404 In response to the first input, the electronic device performs semantic analysis processing on the second voice message to determine the first content and the target content.
  • the electronic device can display a target control for editing the first voice message, so that the user can input the target control to make the electronic device It is in a voice recording state, and performs semantic analysis and processing on the user's voice input to accurately determine the first content and the target content, so that the user can flexibly control the electronic device to perform corresponding operations through the voice input.
  • the specific step of the above step 203 may be replaced by "the electronic device replaces or deletes the target voice message corresponding to the target content in the first voice message".
  • the voice input method provided by the embodiment of the present application may further include the following steps 501 to 503 .
  • Step 501 the electronic device receives a fourth input from the user.
  • the user after the electronic device replaces the target content in the first voice message with the first content to obtain the third voice message, the user can perform voice input again to input the fourth voice message.
  • the above-mentioned first voice message and the fourth voice message can be understood as a complete voice message (that is, the fifth voice message described below), and the user is
  • the complete voice message when the input of the first part of the voice message (ie the first voice message) is completed, the voice input may be paused first to replace the wrong content (ie the target content) in the first voice message, Therefore, after the replacement of the wrong content is completed, the input of the second part of the voice message (ie, the fourth voice message) is continued.
  • Step 502 In response to the fourth input, the electronic device performs combined processing on the fourth voice message and the third voice message to obtain a fifth voice message.
  • the electronic device may add the fourth voice message to the third voice message to obtain a complete voice message (ie, the fifth voice message).
  • the electronic device may perform voice splicing processing on the fourth voice message and the third voice message, so as to combine the two voice messages to obtain one voice message.
  • the voice message sent by the electronic device is the fifth voice message including the third voice message.
  • Voice message if the user does not perform the fourth input, the voice message sent by the electronic device is the third voice message.
  • FIG. 6 shows a possible schematic structural diagram of the voice input device involved in the embodiment of the present application.
  • the voice input device 70 may include: a receiving module 71 , a display module 72 and a processing module 73 .
  • the receiving module 71 is configured to receive the first voice message input by the user.
  • the display module 72 is configured to display the first voice content corresponding to the first voice message.
  • the receiving module 71 is further configured to receive a user's first input of the first content, where the first content is content corresponding to the target content in the first voice content.
  • the processing module 73 is configured to replace or delete the target voice message corresponding to the target content in the first voice message in response to the first input received by the receiving module 71 .
  • the voice input apparatus 70 may further include: a determination module.
  • the receiving module 71 is further configured to receive the user's second input after the display module 72 displays the first voice content corresponding to the first voice message, where the second input is the user's selection input on the target content.
  • the determining module is configured to determine the target voice message according to the target content in response to the second input received by the receiving module 71 .
  • the receiving module 71 is specifically configured to receive a second voice message input by a user, where the second voice message is a voice message corresponding to the first content.
  • the processing module 73 is specifically configured to replace the target voice message in the first voice message with the second voice message according to the second voice message, or delete the target voice message in the first voice message; and obtain a third voice message.
  • non-mobile electronic devices can be servers, network attached storage (Network Attached Storage, NAS), personal computer (personal computer, PC), television (television, TV), teller machine or self-service machine, etc., this application Examples are not specifically limited.
  • Network Attached Storage NAS
  • personal computer personal computer, PC
  • television television
  • teller machine or self-service machine etc.
  • the processor 110 is configured to, in response to the first input, replace or delete the target voice message corresponding to the target content in the first voice message.
  • the user input unit 107 is further configured to receive a second input from the user, where the second input is the user's selection input on the target content in the first text content.
  • the user can select and input the target content in the first voice content, so that the electronic device can select and input the target content according to the user's input on the target content. , determine the target voice message in the first voice message, and replace the target voice message in the first voice message with the second voice message according to the second voice message corresponding to the first content input by the user, or delete the first voice message
  • the target voice message in the message; the third voice message is obtained, so that the user can accurately determine the content that needs to be replaced or deleted in the first voice message.
  • the processor 110 is further configured to perform semantic analysis processing on the second voice message to determine the first content and the target content.
  • the processor 110 is further configured to, in response to the fourth input, perform combined processing on the fourth voice message and the third voice message to obtain a fifth voice message.
  • the network module 102 is configured to send a fifth voice message including a third voice message.
  • the processor is the processor in the electronic device described in the foregoing embodiments.
  • the readable storage medium includes a computer-readable storage medium, such as a computer read-only memory (Read-Only Memory, ROM), a random access memory (Random Access Memory, RAM), a magnetic disk or an optical disk, and the like.
  • the chip mentioned in the embodiments of the present application may also be referred to as a system-on-chip, a system-on-chip, a system-on-a-chip, or a system-on-a-chip, or the like.

Landscapes

  • Engineering & Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • User Interface Of Digital Computer (AREA)
  • Telephone Function (AREA)

Abstract

La présente demande se rapporte au domaine technique des communications. Elle concerne un procédé et un appareil d'entrée vocale, ainsi qu'un dispositif électronique. Le procédé consiste à : recevoir un premier message vocal entré par un utilisateur et afficher un premier contenu vocal correspondant au premier message vocal ; recevoir une première entrée d'un utilisateur pour un premier contenu, le premier contenu étant un contenu correspondant à un contenu cible dans le premier contenu vocal ; et en réponse à la première entrée, remplacer ou supprimer un message vocal cible correspondant au contenu cible dans le premier message vocal. Les modes de réalisation de la présente demande sont appliqués à un processus d'envoi d'un message par un dispositif électronique.
PCT/CN2021/138688 2020-12-22 2021-12-16 Procédé et appareil d'entrée vocale, et dispositif électronique WO2022135259A1 (fr)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202011529379.7A CN112637407A (zh) 2020-12-22 2020-12-22 语音输入方法、装置及电子设备
CN202011529379.7 2020-12-22

Publications (1)

Publication Number Publication Date
WO2022135259A1 true WO2022135259A1 (fr) 2022-06-30

Family

ID=75320973

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2021/138688 WO2022135259A1 (fr) 2020-12-22 2021-12-16 Procédé et appareil d'entrée vocale, et dispositif électronique

Country Status (2)

Country Link
CN (1) CN112637407A (fr)
WO (1) WO2022135259A1 (fr)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112637407A (zh) * 2020-12-22 2021-04-09 维沃移动通信有限公司 语音输入方法、装置及电子设备

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106933561A (zh) * 2015-12-31 2017-07-07 北京搜狗科技发展有限公司 语音输入方法和终端设备
CN106952655A (zh) * 2017-02-23 2017-07-14 深圳市金立通信设备有限公司 一种输入方法和终端
US20180166080A1 (en) * 2016-12-08 2018-06-14 Guangzhou Shenma Mobile Information Technology Co. Ltd. Information input method, apparatus and computing device
CN108632465A (zh) * 2018-04-27 2018-10-09 维沃移动通信有限公司 一种语音输入的方法及移动终端
CN108737634A (zh) * 2018-02-26 2018-11-02 珠海市魅族科技有限公司 语音输入方法及装置、计算机装置和计算机可读存储介质
CN112637407A (zh) * 2020-12-22 2021-04-09 维沃移动通信有限公司 语音输入方法、装置及电子设备

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR102546510B1 (ko) * 2018-03-21 2023-06-23 삼성전자주식회사 복수의 입력 간에 매핑된 정보 제공 방법 및 이를 지원하는 전자 장치
CN110392158A (zh) * 2018-04-19 2019-10-29 成都野望数码科技有限公司 一种消息处理方法、装置以及终端设备

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106933561A (zh) * 2015-12-31 2017-07-07 北京搜狗科技发展有限公司 语音输入方法和终端设备
US20180166080A1 (en) * 2016-12-08 2018-06-14 Guangzhou Shenma Mobile Information Technology Co. Ltd. Information input method, apparatus and computing device
CN106952655A (zh) * 2017-02-23 2017-07-14 深圳市金立通信设备有限公司 一种输入方法和终端
CN108737634A (zh) * 2018-02-26 2018-11-02 珠海市魅族科技有限公司 语音输入方法及装置、计算机装置和计算机可读存储介质
CN108632465A (zh) * 2018-04-27 2018-10-09 维沃移动通信有限公司 一种语音输入的方法及移动终端
CN112637407A (zh) * 2020-12-22 2021-04-09 维沃移动通信有限公司 语音输入方法、装置及电子设备

Also Published As

Publication number Publication date
CN112637407A (zh) 2021-04-09

Similar Documents

Publication Publication Date Title
WO2021036594A1 (fr) Procédé de commande appliqué à un scénario de projection d'écran et dispositif associé
CN109219824B (zh) 利用用户访问权限来自动共享文档
WO2022001900A1 (fr) Procédé et appareil d'envoi d'informations et dispositif électronique
WO2022156709A1 (fr) Procédé et appareil de traitement de signal audio, dispositif électronique et support de stockage lisible
WO2022121790A1 (fr) Procédé et appareil d'affichage à écran partagé, dispositif électronique et support de stockage lisible
WO2022121877A1 (fr) Procédé de traitement de message, appareil et dispositif électronique
JP2016506564A (ja) スワイプストローク入力及び連続的な手書き
KR20160042902A (ko) 올가미 선택을 위한 피드백 제공 기법
WO2022089409A1 (fr) Procédé et appareil d'envoi de fichier, et dispositif électronique
WO2016179124A1 (fr) Partage en temps réel d'éditions d'un document
WO2022143521A1 (fr) Procédé et appareil de traitement de messages et dispositif électronique
WO2022156668A1 (fr) Procédé de traitement d'informations et dispositif électronique
JP2024518775A (ja) メッセージ処理方法、第1メッセージ処理装置、第2メッセージ処理装置及び電子機器
CN113518026A (zh) 消息处理方法、装置和电子设备
WO2023061343A1 (fr) Procédé et appareil de création de session et dispositif électronique
WO2023185817A1 (fr) Procédé et appareil de coopération multi-dispositif, et dispositif électronique et support
WO2022262721A1 (fr) Procédé et appareil d'interaction d'informations, et dispositif électronique
WO2022218192A1 (fr) Procédé et appareil de traitement de fichier
WO2022143660A1 (fr) Procédé et appareil d'affichage d'icônes, et dispositif électronique
WO2022089481A1 (fr) Procédé et appareil de traitement d'informations et dispositif électronique
US20170315703A1 (en) Projector playing control method, device, and computer storage medium
WO2022135259A1 (fr) Procédé et appareil d'entrée vocale, et dispositif électronique
CN114415847A (zh) 文本信息删除方法、装置及电子设备
WO2024051522A1 (fr) Procédé et appareil d'envoi de message, dispositif électronique et support de stockage
KR20220154825A (ko) 노트 생성 방법 및 전자기기

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 21909255

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 21909255

Country of ref document: EP

Kind code of ref document: A1