WO2013185606A1 - 一种终端语音辅助编辑的方法及装置 - Google Patents
一种终端语音辅助编辑的方法及装置 Download PDFInfo
- Publication number
- WO2013185606A1 WO2013185606A1 PCT/CN2013/077128 CN2013077128W WO2013185606A1 WO 2013185606 A1 WO2013185606 A1 WO 2013185606A1 CN 2013077128 W CN2013077128 W CN 2013077128W WO 2013185606 A1 WO2013185606 A1 WO 2013185606A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- command
- label
- voice
- editing
- text
- Prior art date
Links
- 238000000034 method Methods 0.000 title claims abstract description 24
- 230000006978 adaptation Effects 0.000 claims description 14
- 230000004913 activation Effects 0.000 claims description 6
- 230000006870 function Effects 0.000 description 6
- 238000004458 analytical method Methods 0.000 description 3
- 238000005516 engineering process Methods 0.000 description 2
- 238000004891 communication Methods 0.000 description 1
- 238000012217 deletion Methods 0.000 description 1
- 230000037430 deletion Effects 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 230000000977 initiatory effect Effects 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 238000010295 mobile communication Methods 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 230000035945 sensitivity Effects 0.000 description 1
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/16—Sound input; Sound output
- G06F3/167—Audio in a user interface, e.g. using voice commands for navigating, audio feedback
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/10—Text processing
- G06F40/166—Editing, e.g. inserting or deleting
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/26—Speech to text systems
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G10L2015/223—Execution procedure of a spoken command
Definitions
- the present invention relates to the field of communications, and in particular, to a method and apparatus for voice assisted editing of a terminal.
- the embodiment of the invention provides a method and a device for voice-assisted editing of a terminal, which can reduce the cumbersome steps for the user to manually input the edited text, thereby greatly improving the user experience.
- An embodiment of the present invention provides a method for voice-assisted editing of a terminal, including:
- the terminal configures a correspondence between the user command and the label, and saves the correspondence in the configuration file, where the user command is saved in the form of a text command;
- the terminal After receiving the voice command of the user, the terminal converts the voice command into a text command, and queries the configuration file for whether the text command is available, and when the text command is queried in the configuration file Then, the label corresponding to the text command is read, and the editing operation indicated by the label is performed.
- the tag indicates one or more editing operations
- the step of configuring the correspondence between the user command and the label by the terminal includes: the terminal inputting a command through the touch screen, then configuring a label for the command, and selecting an editing operation indicated by the label, or inputting a command by voice, and then passing the The touch screen configures a label for the command and selects an editing operation indicated by the label.
- the step of configuring, by the terminal, the correspondence between the user command and the label includes: the format of the terminal configuration command and the label;
- the format includes: Command + Label + Label indicates the editing operation.
- the method further includes:
- the terminal After receiving the voice command of the user, when the terminal does not query the text command in the configuration file, the terminal does not perform an editing operation and waits for the next command of the user.
- the embodiment of the invention further provides a device for voice-assisted editing of a terminal, comprising: a configuration module, a voice adaptation module and an analysis editing module, wherein:
- the configuration module is configured to: configure a correspondence between the user command and the label, and save the corresponding relationship in the configuration file, where the user command is saved in the form of a text command;
- the voice adaptation module is configured to: receive a voice command of the user, convert the voice command into a text command, and send the text command to the analysis editing module;
- the parsing and editing module is configured to: after receiving the text command sent by the voice adapting module, querying, in the configuration file, whether the text command is available, and querying in the configuration file When the text command is executed, the label corresponding to the text command is read, and the editing operation indicated by the label is executed.
- the tag indicates one or more editing operations
- the configuration module is configured to configure a correspondence between the user command and the label in the following manner: inputting a command through the touch screen, then configuring a label for the command, and selecting an editing operation indicated by the label, or inputting a command by voice, and then passing The touch screen configures a label for the command and selects an editing operation indicated by the label.
- the configuration module is configured to: configure a format corresponding to the label;
- the format includes: an edit operation indicated by a command + tag + tag.
- the parsing and editing module is further configured to: after receiving the text command sent by the voice adaptation module, when the text command is not queried in the configuration file, the editing is not performed. Operation, waiting for the voice adaptation module to send the next command of the user.
- the device further includes a voice initiation module, where:
- the voice activation module is configured to: set a startup mode of voice assisted editing
- the startup mode includes: setting a button for the user to initiate a voice assisted editing function, or setting to automatically initiate a voice assisted editing function with a popup of the input method panel.
- the method and device for voice-assisted editing of the terminal provided by the embodiment of the invention can reduce the cumbersome steps for the user to manually input the edited text, thereby greatly improving the user experience.
- FIG. 1 is a structural diagram of an apparatus for voice-assisted editing of a terminal in an embodiment.
- FIG. 2 is a flow chart of a method for voice-assisted editing of a terminal in an embodiment.
- the embodiment provides a device for voice-assisted editing of a terminal, including: a configuration module 11, a voice adaptation module 12, and an analysis editing module 13.
- the configuration module 11 is configured to: configure a correspondence between the user command and the label, and save the corresponding relationship in the configuration file, where the user command is saved in the form of a text command;
- the system presets a series of standard labels, each label indicating one or more editing operations; the configuration module inputs a command through the touch screen, then configures a corresponding label for the command, and selects an edit indicated by the label. Operation, or input a command by voice, then configure the corresponding label for the command through the touch screen, and select the editing operation indicated by the label.
- the configuration module 11 is further configured to: configure a format corresponding to the command, and the format includes: an edit operation indicated by the command + label + label.
- the voice adaptation module 12 is configured to: receive a voice command of the user, and convert the voice command into a text command, and send the text command to the parsing and editing module 13;
- the voice adaptation module 12 is adapted to different voice engines at the same time, so that it is convenient to replace different voice recognition engines at any time.
- the parsing and editing module 13 is configured to: after receiving the text command sent by the voice adapting module, querying, in the configuration file, whether the text command is available, and querying the text in the configuration file When the command is executed, the label corresponding to the text command is read, and the editing operation indicated by the label is executed.
- the parsing and editing module 13 is further configured to: after receiving the text command sent by the voice adaptation module, when the text command is not queried in the configuration file, the editing operation is not performed, waiting The voice adaptation module 12 sends the user's next command.
- the device further includes a voice activation module 14, and the voice activation module 14 is configured to: set a startup mode of the voice assisted editing;
- the startup mode includes: setting a button for the user to initiate the voice assisted editing function, or setting the voice assisted editing function to be automatically started with the pop-up of the input method panel. This allows the user to easily turn on the voice editing function when editing the relationship between the command and the label in the configuration file and performing voice editing.
- this embodiment provides a method for voice-assisted editing of a terminal, which includes the following steps.
- Step S101 The terminal configures a correspondence between the user command and the label, and saves the corresponding relationship in the configuration file, where the user command is saved in the form of a text command.
- Step S102 After receiving the voice command of the user, the terminal converts the voice command into a text command.
- the terminal is adapted to various types of voice engines, and can convert voice into text.
- Step S103 querying, in the configuration file, whether there is the text command, when in the configuration When the text command is queried in the file, the label corresponding to the text command is read, and the editing operation indicated by the label is performed.
- step S101 the system presets a series of standard tags for user configuration, each tag represents one or more editing operations, and each tag and an API provided by the terminal platform (Application Programming Interface, application programming) Interfaces are mapped to implement platform operations.
- each tag represents one or more editing operations
- each tag and an API provided by the terminal platform (Application Programming Interface, application programming) Interfaces are mapped to implement platform operations.
- the label UP represents moving the cursor up one or several lines
- the label LEFT represents moving the cursor one or more digits to the left
- the label DEL for deletion represents moving the cursor one or more digits to the left
- the label DEL for deletion represents moving the cursor one or more digits to the left
- the label DEL for deletion represents moving the cursor one or more digits to the left
- the label DEL for deletion represents moving the cursor one or more digits to the left
- the label DEL for deletion represents moving the cursor one or more digits to the left
- the label DEL for deletion
- the label INPUT for input, and so on.
- the step of configuring the corresponding relationship between the user command and the label includes: the terminal inputs a command through the touch screen, then configures a corresponding label for the command, and selects an editing operation indicated by the label, or inputs a command by voice, and then passes the touch screen. Configure the corresponding label for the command and select the editing operation indicated by the label.
- the embodiment further provides a format corresponding to the command, and the format includes: a command + a label + an edit operation indicated by the label.
- the display interface pops up a series of tags, and the input tag INPUT is configured for the command, and the format corresponding to the tag is generated by the "comma" command: [comma] ⁇ INPUT(, ) ⁇ , indicates that the system automatically enters the symbol when receiving the voice command "comma" spoken by the user? .
- the display interface pops up a series of tags, configures the input tag LEFT for the command, and selects the editing operation indicated by the tag, for example, shifting 3 bits to the left, the command is generated.
- "Left” The format corresponding to the label: [Left] ⁇ LEFT(3) ⁇ , indicates that the system moves the cursor to the left by 3 digits when the voice command "Left" spoken by the user is received.
- the command can also be Chinese characters, words and numbers that the user can define at his own discretion, giving the user great freedom.
- the format corresponding to the command and the label can even be configured as: [etc] ⁇ LEFT(2) ⁇ , which means that when the user says "equal", it means that the cursor is shifted to the left by 2 digits.
- step S103 after receiving the voice command of the user, when in the configuration file When the text command is not queried, the editing operation is not performed, and the user's next command is awaited.
- the method and apparatus for voice-assisted editing of the terminal provided in the above embodiments can reduce the cumbersome steps for the user to manually input the edited text, thereby greatly improving the user experience.
- the method and device for voice-assisted editing of the terminal provided by the embodiment of the invention can reduce the cumbersome steps for the user to manually input the edited text, thereby greatly improving the user experience.
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Theoretical Computer Science (AREA)
- Human Computer Interaction (AREA)
- Multimedia (AREA)
- Computational Linguistics (AREA)
- General Health & Medical Sciences (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Artificial Intelligence (AREA)
- User Interface Of Digital Computer (AREA)
Abstract
Description
Claims
Priority Applications (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US14/417,318 US20150193200A1 (en) | 2012-07-26 | 2013-06-13 | Voice-assisted editing method and device for terminal |
EP13804943.2A EP2879046A4 (en) | 2012-07-26 | 2013-06-13 | METHOD AND DEVICE FOR VOICEALLY ASSISTED EDITING FOR A TERMINAL |
JP2015523383A JP2015525933A (ja) | 2012-07-26 | 2013-06-13 | 端末音声補助編集方法及び装置 |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201210260840.2 | 2012-07-26 | ||
CN201210260840.2A CN103577072A (zh) | 2012-07-26 | 2012-07-26 | 一种终端语音辅助编辑的方法及装置 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2013185606A1 true WO2013185606A1 (zh) | 2013-12-19 |
Family
ID=49757535
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2013/077128 WO2013185606A1 (zh) | 2012-07-26 | 2013-06-13 | 一种终端语音辅助编辑的方法及装置 |
Country Status (5)
Country | Link |
---|---|
US (1) | US20150193200A1 (zh) |
EP (1) | EP2879046A4 (zh) |
JP (1) | JP2015525933A (zh) |
CN (1) | CN103577072A (zh) |
WO (1) | WO2013185606A1 (zh) |
Families Citing this family (13)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN105589642B (zh) * | 2014-10-29 | 2021-02-05 | 深圳富泰宏精密工业有限公司 | 掌上型电子装置的输入法自动切换系统及方法 |
CN106648385B (zh) * | 2015-11-03 | 2020-02-21 | 东莞酷派软件技术有限公司 | 一种消息输入的处理方法、装置及终端 |
CN105446627B (zh) * | 2015-12-09 | 2019-02-15 | Oppo广东移动通信有限公司 | 文本信息的编辑方法、装置和终端设备 |
CN106297782A (zh) * | 2016-07-28 | 2017-01-04 | 北京智能管家科技有限公司 | 一种人机交互方法及系统 |
CN106484134A (zh) * | 2016-09-20 | 2017-03-08 | 深圳Tcl数字技术有限公司 | 基于安卓系统的语音输入标点符号的方法及装置 |
CN106527729A (zh) * | 2016-11-17 | 2017-03-22 | 科大讯飞股份有限公司 | 非接触式输入方法和装置 |
CN106775214A (zh) * | 2016-11-29 | 2017-05-31 | 珠海市魅族科技有限公司 | 一种文字编辑方法及装置 |
CN106775349A (zh) * | 2016-11-29 | 2017-05-31 | 珠海市魅族科技有限公司 | 一种文字内容的语音修改方法及装置 |
CN106921524B (zh) * | 2017-03-31 | 2021-12-17 | 新华三技术有限公司 | 一种配置命令行标签的方法及装置 |
CN109358848B (zh) * | 2018-10-17 | 2022-02-15 | 杨易超 | 自定义指令编辑方法、装置和计算机设备 |
US11289092B2 (en) | 2019-09-25 | 2022-03-29 | International Business Machines Corporation | Text editing using speech recognition |
CN111161735A (zh) * | 2019-12-31 | 2020-05-15 | 安信通科技(澳门)有限公司 | 一种语音编辑方法及装置 |
US11995394B1 (en) * | 2023-02-07 | 2024-05-28 | Adobe Inc. | Language-guided document editing |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1229216A (zh) * | 1998-03-16 | 1999-09-22 | 致伸实业股份有限公司 | 可接受语音指令的视窗显示系统 |
CN1410298A (zh) * | 2001-09-25 | 2003-04-16 | 公信电子股份有限公司 | 单键控制语音指令的声控方法及其装置 |
CN101013571A (zh) * | 2007-01-30 | 2007-08-08 | 无敌科技(西安)有限公司 | 一种使用语音命令的互动方法及其系统 |
CN102467362A (zh) * | 2010-11-18 | 2012-05-23 | 联想(北京)有限公司 | 一种语音输入方法及输入装置 |
Family Cites Families (23)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5231670A (en) * | 1987-06-01 | 1993-07-27 | Kurzweil Applied Intelligence, Inc. | Voice controlled system and method for generating text from a voice controlled input |
JPH03163623A (ja) * | 1989-06-23 | 1991-07-15 | Articulate Syst Inc | 音声制御コンピュータ・インターフェース |
US5671328A (en) * | 1992-12-30 | 1997-09-23 | International Business Machines Corporation | Method and apparatus for automatic creation of a voice recognition template entry |
US6125347A (en) * | 1993-09-29 | 2000-09-26 | L&H Applications Usa, Inc. | System for controlling multiple user application programs by spoken input |
US6064959A (en) * | 1997-03-28 | 2000-05-16 | Dragon Systems, Inc. | Error correction in speech recognition |
US5873064A (en) * | 1996-11-08 | 1999-02-16 | International Business Machines Corporation | Multi-action voice macro method |
JPH10222337A (ja) * | 1997-02-13 | 1998-08-21 | Meidensha Corp | コンピュータシステム |
US6212498B1 (en) * | 1997-03-28 | 2001-04-03 | Dragon Systems, Inc. | Enrollment in speech recognition |
US6263375B1 (en) * | 1998-08-31 | 2001-07-17 | International Business Machines Corp. | Method for creating dictation macros |
US6839669B1 (en) * | 1998-11-05 | 2005-01-04 | Scansoft, Inc. | Performing actions identified in recognized speech |
US6192343B1 (en) * | 1998-12-17 | 2001-02-20 | International Business Machines Corporation | Speech command input recognition system for interactive computer display with term weighting means used in interpreting potential commands from relevant speech terms |
US6834264B2 (en) * | 2001-03-29 | 2004-12-21 | Provox Technologies Corporation | Method and apparatus for voice dictation and document production |
JP4675514B2 (ja) * | 2001-07-16 | 2011-04-27 | シャープ株式会社 | 音声処理装置、音声処理方法、及びその方法を実施するためのプログラムを記録したコンピュータにより読取り可能な記録媒体 |
JP2003122389A (ja) * | 2001-10-11 | 2003-04-25 | Casio Comput Co Ltd | データ処理装置及びプログラム |
WO2003077070A2 (en) * | 2002-03-06 | 2003-09-18 | Professional Pharmaceutical Index | Creating records of patients using a browser based hand-held assistant |
TWI298844B (en) * | 2005-11-30 | 2008-07-11 | Delta Electronics Inc | User-defines speech-controlled shortcut module and method |
US7966182B2 (en) * | 2006-06-20 | 2011-06-21 | Lunis Orcutt | Voiced programming system and method |
JP2008146158A (ja) * | 2006-12-06 | 2008-06-26 | Canon Inc | 情報処理装置及び情報処理方法 |
JP2008243146A (ja) * | 2007-03-29 | 2008-10-09 | Clarion Co Ltd | 音声認識処理装置及びその制御方法 |
US8538757B2 (en) * | 2007-05-17 | 2013-09-17 | Redstart Systems, Inc. | System and method of a list commands utility for a speech recognition command system |
WO2008144638A2 (en) * | 2007-05-17 | 2008-11-27 | Redstart Systems Inc. | Systems and methods of a structured grammar for a speech recognition command system |
US8886521B2 (en) * | 2007-05-17 | 2014-11-11 | Redstart Systems, Inc. | System and method of dictation for a speech recognition command system |
US8620652B2 (en) * | 2007-05-17 | 2013-12-31 | Microsoft Corporation | Speech recognition macro runtime |
-
2012
- 2012-07-26 CN CN201210260840.2A patent/CN103577072A/zh active Pending
-
2013
- 2013-06-13 WO PCT/CN2013/077128 patent/WO2013185606A1/zh active Application Filing
- 2013-06-13 EP EP13804943.2A patent/EP2879046A4/en not_active Withdrawn
- 2013-06-13 JP JP2015523383A patent/JP2015525933A/ja active Pending
- 2013-06-13 US US14/417,318 patent/US20150193200A1/en not_active Abandoned
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1229216A (zh) * | 1998-03-16 | 1999-09-22 | 致伸实业股份有限公司 | 可接受语音指令的视窗显示系统 |
CN1410298A (zh) * | 2001-09-25 | 2003-04-16 | 公信电子股份有限公司 | 单键控制语音指令的声控方法及其装置 |
CN101013571A (zh) * | 2007-01-30 | 2007-08-08 | 无敌科技(西安)有限公司 | 一种使用语音命令的互动方法及其系统 |
CN102467362A (zh) * | 2010-11-18 | 2012-05-23 | 联想(北京)有限公司 | 一种语音输入方法及输入装置 |
Also Published As
Publication number | Publication date |
---|---|
JP2015525933A (ja) | 2015-09-07 |
CN103577072A (zh) | 2014-02-12 |
EP2879046A1 (en) | 2015-06-03 |
US20150193200A1 (en) | 2015-07-09 |
EP2879046A4 (en) | 2015-08-12 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2013185606A1 (zh) | 一种终端语音辅助编辑的方法及装置 | |
US10074365B2 (en) | Voice control method, mobile terminal device, and voice control system | |
US9218052B2 (en) | Framework for voice controlling applications | |
TWI525532B (zh) | Set the name of the person to wake up the name for voice manipulation | |
EP3142107A1 (en) | Voice recognition apparatus and controlling method thereof | |
US9516115B2 (en) | Softphone user interface system and method | |
CN101835279A (zh) | 一种移动终端连接蓝牙设备的简化方法 | |
TW201035827A (en) | System and method for touch-based text entry | |
US10165097B2 (en) | Call processing method and device | |
CN105843681B (zh) | 一种移动终端及其操作系统切换的方法 | |
WO2011160356A1 (zh) | 一种功能键的显示模式切换方法及终端 | |
CN103903613A (zh) | 一种信息处理方法及电子设备 | |
KR20100111351A (ko) | 휴대 단말기의 입력 장치 및 방법 | |
CN102722395A (zh) | 移动终端及其应用程序的启动方法 | |
WO2022022566A1 (zh) | 图形码识别方法、装置和电子设备 | |
WO2012116647A1 (zh) | 终端设备控制方法及终端设备和电子设备 | |
WO2011017873A1 (zh) | 一种移动终端输入法切换方法及装置 | |
CN106484134A (zh) | 基于安卓系统的语音输入标点符号的方法及装置 | |
WO2012152115A1 (zh) | 输入方法及装置 | |
CN105487799A (zh) | 内容转换方法及装置 | |
CN111326158A (zh) | 一种基于智能终端的语音操控方法 | |
WO2014180362A1 (zh) | 一种终端及其管理多媒体记事本的方法 | |
CN110597480A (zh) | 一种自定义语音指令实现方法和终端 | |
KR100664241B1 (ko) | 멀티 편집기능을 구비한 휴대용 단말기 및 그의 운용방법 | |
US20150181646A1 (en) | Method and system for bridging an input signal from a human interface device between a computer and a mobile device |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 13804943 Country of ref document: EP Kind code of ref document: A1 |
|
ENP | Entry into the national phase |
Ref document number: 2015523383 Country of ref document: JP Kind code of ref document: A |
|
REEP | Request for entry into the european phase |
Ref document number: 2013804943 Country of ref document: EP |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2013804943 Country of ref document: EP |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
WWE | Wipo information: entry into national phase |
Ref document number: 14417318 Country of ref document: US |