CN109215661A - Speech-to-text method, apparatus equipment and storage medium - Google Patents

Info

Publication number
CN109215661A
Authority
CN
China
Prior art keywords
phrase
target
replacement
text
user
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201811006413.5A
Other languages
Chinese (zh)
Inventor
王文斌
周围
李封翔
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Kunshan Pinyuan Intellectual Property Operating Technology Co., Ltd.
Original Assignee
Shanghai Wind Communication Technologies Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shanghai Wind Communication Technologies Co Ltd
Priority to CN201811006413.5A
Publication of CN109215661A
Legal status: Pending

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/26 Speech to text systems
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/10 Text processing
    • G06F40/166 Editing, e.g. inserting or deleting

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Artificial Intelligence (AREA)
  • General Health & Medical Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Human Computer Interaction (AREA)
  • Telephonic Communication Services (AREA)

Abstract

An embodiment of the invention discloses a speech-to-text method, apparatus, device, and storage medium. The method comprises: obtaining voice information and converting the voice information into a text paragraph; segmenting the text paragraph to form multiple phrases; receiving a target phrase selected by a user; receiving a replacement phrase, input by the user, for replacing the target phrase; and replacing the target phrase with the replacement phrase to update the text paragraph. This solves the technical problem in the prior art that a text paragraph converted directly from speech is inefficient to modify, and thereby improves the efficiency of modifying text paragraphs converted from voice information.

Description

Speech-to-text method, apparatus equipment and storage medium
Technical field
Embodiments of the present invention relate to the field of data processing, and in particular to a speech-to-text method, apparatus, device, and storage medium.
Background art
Text converted directly from speech usually contains many errors. For the recipient or reader to determine the content of the speech from the converted text, the converted text paragraph has to be modified. In the existing approach, the user moves the cursor to the position of the transcription error, deletes the erroneous content, and then manually enters the correct content, so both the speed and the efficiency of modification are low.
Summary of the invention
Embodiments of the present invention provide a speech-to-text method, apparatus, device, and storage medium, which solve the technical problem in the prior art that a text paragraph converted directly from speech is inefficient to modify.
In a first aspect, an embodiment of the present invention provides a speech-to-text method, comprising:
obtaining voice information, and converting the voice information into a text paragraph;
segmenting the text paragraph to form multiple phrases;
receiving a target phrase selected by a user;
receiving a replacement phrase, input by the user, for replacing the target phrase;
replacing the target phrase with the replacement phrase, to update the text paragraph.
Further, after receiving the target phrase selected by the user, the method further comprises:
popping up an input box in which the user inputs the replacement phrase;
wherein receiving the replacement phrase, input by the user, for replacing the target phrase comprises:
receiving the replacement phrase, for replacing the target phrase, that the user inputs in the input box.
Further, receiving the target phrase selected by the user comprises:
receiving one or more phrases that the user selects as needing modification, and taking the phrases selected by the user as the target phrase.
Further, after receiving the replacement phrase, input by the user, for replacing the target phrase, the method further comprises:
determining the target voice phrase in the voice information that corresponds to the target phrase;
establishing, according to the replacement phrase and the target voice phrase corresponding to the target phrase, a conversion correspondence between the target voice phrase and the replacement phrase, to replace the conversion correspondence between the target voice phrase and the target phrase.
Further, before or after receiving the target phrase selected by the user, the method further comprises:
obtaining a preset text modification range for synchronous modification, and taking both the target phrase and every phrase within the preset text modification range that is identical to the target phrase as target phrases.
Further, after receiving the target phrase selected by the user, the method further comprises:
marking all target phrases within the preset text modification range.
Further, replacing the target phrase with the replacement phrase, to update the text paragraph, comprises:
determining whether the replacement phrase, once it has replaced the target phrase, is semantically correct in its context;
if it is correct, replacing the target phrase with the replacement phrase;
if it is incorrect, outputting a suggestion prompt, and, if the user confirms the modification again, replacing the target phrase with the replacement phrase.
In a second aspect, an embodiment of the present invention further provides a speech-to-text apparatus, comprising:
a conversion module, configured to obtain voice information and convert the voice information into a text paragraph;
a segmentation module, configured to segment the text paragraph to form multiple phrases;
a target phrase receiving module, configured to receive a target phrase selected by a user;
a replacement phrase receiving module, configured to receive a replacement phrase, input by the user, for replacing the target phrase;
a replacement module, configured to replace the target phrase with the replacement phrase, to update the text paragraph.
In a third aspect, an embodiment of the present invention further provides a device, comprising:
one or more processors;
a storage apparatus, configured to store one or more programs;
wherein, when the one or more programs are executed by the one or more processors, the one or more processors implement the speech-to-text method described in the first aspect.
In a fourth aspect, an embodiment of the present invention further provides a storage medium containing computer-executable instructions which, when executed by a computer processor, are used to perform the speech-to-text method described in the first aspect.
The technical solution of the speech-to-text method provided by the embodiments of the present invention comprises: obtaining voice information and converting it into a text paragraph; segmenting the text paragraph to form multiple phrases; receiving a target phrase selected by the user; receiving a replacement phrase, input by the user, for replacing the target phrase; and replacing the target phrase with the replacement phrase to update the text paragraph. Because the replacement phrase replaces the target phrase directly, the original target phrase does not have to be deleted first. This removes the deletion step and reduces the number of cursor movements, which can greatly improve the efficiency of modifying the text paragraph and shorten the modification time.
Brief description of the drawings
To describe the technical solutions in the embodiments of the present invention more clearly, the drawings needed for describing the embodiments are briefly introduced below. The drawings described below show some embodiments of the present invention; a person of ordinary skill in the art can derive other drawings from them without creative effort.
Fig. 1 is a flowchart of the speech-to-text method provided by Embodiment one of the present invention;
Fig. 2 is a structural block diagram of the speech-to-text apparatus provided by Embodiment two of the present invention;
Fig. 3 is a structural block diagram of the device provided by Embodiment three of the present invention.
Detailed description of the embodiments
To make the objectives, technical solutions, and advantages of the present invention clearer, the technical solutions are described clearly and completely below through embodiments, with reference to the drawings in the embodiments of the present invention. The described embodiments are some, not all, of the embodiments of the present invention. All other embodiments obtained by a person of ordinary skill in the art based on these embodiments without creative effort fall within the protection scope of the present invention.
Embodiment one
Fig. 1 is a flowchart of the speech-to-text method provided by Embodiment one of the present invention. The technical solution of this embodiment is suitable for modifying a text paragraph converted from voice information. The method can be performed by the speech-to-text apparatus provided by the embodiments of the present invention; the apparatus can be implemented in software and/or hardware and is configured to run in a processor. The method comprises the following steps:
S101: obtain voice information, and convert the voice information into a text paragraph.
On occasions where it is inconvenient to listen to speech, or when a text backup of the voice information is needed, the voice information usually has to be converted into a correct text paragraph. To obtain an accurate text paragraph from the voice information, the voice information is first converted into a text paragraph.
S102: segment the text paragraph to form multiple phrases.
After the voice information has been converted into a text paragraph, the text paragraph is modified. To improve modification efficiency, this embodiment first segments the text paragraph into multiple phrases. For example, segmenting "Autumn has arrived, a flock of wild geese fly south" gives: autumn / has arrived / , / a flock of / wild geese / fly south /. That is, this embodiment modifies the text paragraph with the phrase as the basic unit.
Note that the text paragraph is segmented into phrases using existing techniques; this embodiment does not limit the specific segmentation method.
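The embodiment leaves the segmentation method open. Purely as an illustration, the sketch below uses forward maximum matching over a small phrase vocabulary. The function name `segment`, the `vocabulary` set, and the `max_words` limit are assumptions made for this example and do not come from the patent; the English sentence stands in for the original example.

```python
def segment(text, vocabulary, max_words=3):
    """Split text into phrases by forward maximum matching (toy segmenter).

    `vocabulary` is a hypothetical set of known multi-word phrases.
    """
    words = text.split()
    phrases = []
    i = 0
    while i < len(words):
        # Try the longest candidate first; a single word always matches.
        for size in range(min(max_words, len(words) - i), 0, -1):
            candidate = " ".join(words[i:i + size])
            if size == 1 or candidate in vocabulary:
                phrases.append(candidate)
                i += size
                break
    return phrases


vocabulary = {"has arrived", "a flock of", "wild geese", "fly south"}
print(segment("autumn has arrived a flock of wild geese fly south", vocabulary))
# ['autumn', 'has arrived', 'a flock of', 'wild geese', 'fly south']
```

Any segmenter that returns an ordered list of phrases could be substituted here, since the later steps only need phrase boundaries.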
S103: receive the target phrase selected by the user.
After the text paragraph has been segmented, the user selects the phrase that needs modification as the target phrase. For example, in "autumn / has arrived / , / a flock of / wild geese / fly toward blue", "fly toward blue" should be "fly south" and therefore needs to be modified. To modify it, the user first selects "fly toward blue" as the target phrase.
The target phrase can be determined in more than one way: the user can select a single phrase as the target phrase or, if several phrases are adjacent, select them together as the target phrase.
S104: receive the replacement phrase, input by the user, for replacing the target phrase.
Once the target phrase has been determined, it needs to be modified, which requires obtaining its replacement phrase. This embodiment receives the replacement phrase through an input box, so in the example above the user enters "fly south" in the input box.
The input box can be displayed at all times, in which case the user enters the replacement phrase directly after the target phrase has been determined. The input box can also switch between shown and hidden states: while it is hidden, the page can provide a trigger icon, and when the user clicks or touches that icon the input box pops up for the user to enter the replacement phrase. Alternatively, a hidden input box can have no trigger icon on the page at all: when the user presses the target phrase for longer than a preset time, the input box pops up automatically, the user enters the replacement phrase, and the input box hides automatically once the target phrase has been replaced.
If the voice information contains a repeated phrase and that phrase is transcribed incorrectly, the converted text paragraph contains multiple identical target phrases. To improve modification efficiency in this case, this embodiment modifies all phrases within a preset text modification range that are identical to the target phrase at the same time. Specifically, before or after this step, a preset text modification range for synchronous modification is obtained, and the target phrase together with every identical phrase within that range is taken as a target phrase. The preset text modification range can be a preset number of characters, a preset number of lines, a preset number of natural paragraphs, and so on, and can be set or selected as needed in actual use.
For example, assume the preset text modification range is one natural paragraph. If that paragraph contains several occurrences of "fly toward blue" and the user selects one of them, all occurrences of "fly toward blue" in the paragraph become target phrases at the same time.
To make it easier for the user to confirm or check all target phrases, once they have been determined this embodiment marks all of them with a color. This presents the target phrases more intuitively, so the user can locate all of them at a glance.
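Synchronous modification and marking can be sketched as follows, assuming the preset range is one natural paragraph as in the example. The function names are hypothetical, and the `[[...]]` text markers are only a stand-in for the color marking the embodiment describes.

```python
def find_sync_targets(paragraphs, para_index, phrase_index):
    """Return the indices of every phrase in the selected paragraph that is
    identical to the phrase the user picked (range = one natural paragraph)."""
    scope = paragraphs[para_index]
    target = scope[phrase_index]
    return [i for i, phrase in enumerate(scope) if phrase == target]


def mark_targets(phrases, target_indices, left="[[", right="]]"):
    """Render the paragraph with every target phrase visibly marked."""
    marked = set(target_indices)
    return " / ".join(
        f"{left}{p}{right}" if i in marked else p for i, p in enumerate(phrases)
    )


paragraphs = [
    ["wild geese", "fly toward blue", "then", "fly toward blue", "again"],
    ["fly toward blue"],  # outside the range, so it is left alone
]
targets = find_sync_targets(paragraphs, 0, 1)
print(targets)  # [1, 3]
print(mark_targets(paragraphs[0], targets))
# wild geese / [[fly toward blue]] / then / [[fly toward blue]] / again
```

A range measured in characters or lines would only change how `scope` is computed; the matching step stays the same.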
S105: replace the target phrase with the replacement phrase, to update the text paragraph.
Replacing the target phrase with the replacement phrase completes the modification. Speech conversion is usually based on standard Mandarin or a standard dialect, such as Southern Min (Minnan). In real life, however, many people's pronunciation is not standard, and adults in particular find their pronunciation hard to change, so the same errors tend to recur in speech conversion. To improve the accuracy of converting voice information into text paragraphs and reduce the amount of modification, this embodiment first determines the target voice phrase in the voice information that corresponds to the target phrase. Then, according to that target voice phrase and the replacement phrase, it establishes a conversion correspondence between the target voice phrase and the replacement phrase, which replaces the conversion correspondence between the target voice phrase and the target phrase. The next time voice information is converted into a text paragraph, the target voice phrase can be converted directly into the replacement phrase.
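The updated conversion correspondence can be pictured as a lookup table that the user's correction overwrites. The sketch below is only an assumption about one way to hold that mapping: the class `ConversionTable`, its methods, and the opaque keys `"v1"` and `"v2"` (placeholders for however a recognizer represents a voice phrase) are all hypothetical.

```python
class ConversionTable:
    """Maps a voice-phrase key to the text phrase it is converted into."""

    def __init__(self, initial):
        self._table = dict(initial)

    def convert(self, voice_phrases):
        # Unknown voice phrases are rendered as "?" in this toy example.
        return [self._table.get(v, "?") for v in voice_phrases]

    def learn(self, voice_phrase, replacement):
        # The new correspondence replaces the old (incorrect) one.
        self._table[voice_phrase] = replacement


table = ConversionTable({"v1": "wild geese", "v2": "fly toward blue"})
print(table.convert(["v1", "v2"]))  # ['wild geese', 'fly toward blue']
table.learn("v2", "fly south")      # the user corrected this phrase once
print(table.convert(["v1", "v2"]))  # ['wild geese', 'fly south']
```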
To improve the accuracy of the modification, before replacing the target phrase this embodiment first checks the semantics of the replacement phrase against its context. If the semantics are correct, the replacement phrase replaces the target phrase directly. If they are incorrect, a prompt is output to alert the user; if the user insists on the modification, the replacement phrase replaces the target phrase.
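The check-then-confirm flow can be sketched with two injected callbacks. Both are assumptions: `is_semantically_ok` stands for whatever semantic check an implementation uses (the patent does not specify one), and `confirm` stands for showing the prompt and returning the user's decision.

```python
def replace_with_check(phrases, index, replacement, is_semantically_ok, confirm):
    """Replace phrases[index] only if the result passes the semantic check
    or the user explicitly confirms the modification after the prompt."""
    candidate = phrases[:index] + [replacement] + phrases[index + 1:]
    if is_semantically_ok(candidate):
        return candidate
    if confirm(candidate):  # prompt shown; the user insists on the change
        return candidate
    return list(phrases)    # modification abandoned, paragraph unchanged


phrases = ["wild geese", "fly toward blue"]
ok = lambda candidate: "fly south" in candidate  # toy semantic check
print(replace_with_check(phrases, 1, "fly south", ok, lambda c: False))
# ['wild geese', 'fly south']
```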
The technical solution of the speech-to-text method provided by this embodiment comprises: obtaining voice information and converting it into a text paragraph; segmenting the text paragraph to form multiple phrases, and determining the phrase that needs modification as the target phrase; obtaining the replacement phrase for the target phrase through an input box; and replacing the target phrase with the replacement phrase to update the text paragraph. Because the replacement phrase replaces the target phrase directly, the original target phrase does not have to be deleted first. This removes the deletion step and reduces the number of cursor movements, which can greatly improve the efficiency of modifying the text paragraph and shorten the modification time.
Embodiment two
Fig. 2 is a structural block diagram of the speech-to-text apparatus provided by Embodiment two of the present invention. The apparatus is used to perform the speech-to-text method provided by any of the above embodiments and can be implemented in software or hardware. The apparatus comprises:
a conversion module 11, configured to obtain voice information and convert the voice information into a text paragraph;
a segmentation module 12, configured to segment the text paragraph to form multiple phrases;
a target phrase receiving module 13, configured to receive a target phrase selected by a user;
a replacement phrase receiving module 14, configured to receive a replacement phrase, input by the user, for replacing the target phrase;
a replacement module 15, configured to replace the target phrase with the replacement phrase, to update the text paragraph.
In the technical solution of the speech-to-text apparatus provided by this embodiment, the conversion module obtains voice information and converts it into a text paragraph; the segmentation module segments the text paragraph to form multiple phrases; the target phrase receiving module receives the target phrase selected by the user; the replacement phrase receiving module receives the replacement phrase, input by the user, for replacing the target phrase; and the replacement module replaces the target phrase with the replacement phrase to update the text paragraph. Because the replacement phrase replaces the target phrase directly, the original target phrase does not have to be deleted first. This removes the deletion step and reduces the number of cursor movements, which can greatly improve the efficiency of modifying the text paragraph and shorten the modification time.
The speech-to-text apparatus provided by this embodiment can perform the speech-to-text method provided by any embodiment of the present invention, and has the functional modules and beneficial effects corresponding to that method.
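As a rough picture of how the five modules fit together in software, the sketch below wires them into one class. Everything here is an assumption for illustration: the class name, the method names, and the injected `recognize` and `segment` callables (stand-ins for a real speech recognizer and segmenter) are hypothetical.

```python
from dataclasses import dataclass, field
from typing import Callable, List


@dataclass
class SpeechToTextApparatus:
    recognize: Callable[[bytes], str]    # conversion module 11
    segment: Callable[[str], List[str]]  # segmentation module 12
    phrases: List[str] = field(default_factory=list)
    target: int = -1

    def load(self, audio: bytes) -> List[str]:
        """Convert audio to text, then segment the text into phrases."""
        self.phrases = self.segment(self.recognize(audio))
        return self.phrases

    def receive_target(self, index: int) -> None:
        """Target phrase receiving module 13."""
        if not 0 <= index < len(self.phrases):
            raise IndexError("no such phrase")
        self.target = index

    def receive_replacement(self, replacement: str) -> str:
        """Replacement phrase receiving module 14 plus replacement module 15."""
        if self.target < 0:
            raise RuntimeError("select a target phrase first")
        self.phrases[self.target] = replacement  # overwrite, no delete step
        return " ".join(self.phrases)


apparatus = SpeechToTextApparatus(
    recognize=lambda audio: "wild geese fly blue",  # fake recognizer output
    segment=str.split,                              # fake segmenter
)
apparatus.load(b"")
apparatus.receive_target(3)
print(apparatus.receive_replacement("south"))  # wild geese fly south
```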
Embodiment three
Fig. 3 is a structural schematic diagram of the device provided by Embodiment three of the present invention. As shown in Fig. 3, the device comprises a processor 201, a memory 202, an input apparatus 203, and an output apparatus 204. The device may have one or more processors 201; Fig. 3 takes one processor 201 as an example. The processor 201, memory 202, input apparatus 203, and output apparatus 204 in the device can be connected by a bus or in other ways; Fig. 3 takes a bus connection as an example.
Memory 202, as a computer-readable storage medium, can be used to store software programs, computer-executable programs, and modules, such as the program instructions/modules corresponding to the speech-to-text method in the embodiments of the present invention (for example, conversion module 11, segmentation module 12, target phrase receiving module 13, replacement phrase receiving module 14, and replacement module 15). Processor 201 runs the software programs, instructions, and modules stored in memory 202 to execute the device's functional applications and data processing, that is, to implement the speech-to-text method described above.
Memory 202 can mainly comprise a program storage area and a data storage area. The program storage area can store the operating system and the application programs required by at least one function; the data storage area can store data created through use of the terminal, and so on. In addition, memory 202 may include high-speed random access memory and may also include non-volatile memory, for example at least one magnetic disk storage device, flash memory device, or other non-volatile solid-state storage device. In some examples, memory 202 may further include memory located remotely from processor 201, and such remote memory can be connected to the device through a network. Examples of such networks include, but are not limited to, the Internet, intranets, local area networks, mobile communication networks, and combinations thereof.
Input apparatus 203 can be used to receive input numeric or character information and to generate key signal inputs related to the user settings and function control of the device.
Output apparatus 204 may include a display device such as a display screen, for example the display screen of a user terminal.
Embodiment four
Embodiment four of the present invention further provides a storage medium containing computer-executable instructions which, when executed by a computer processor, are used to perform a speech-to-text method, the method comprising:
obtaining voice information, and converting the voice information into a text paragraph;
segmenting the text paragraph to form multiple phrases;
receiving a target phrase selected by a user;
receiving a replacement phrase, input by the user, for replacing the target phrase;
replacing the target phrase with the replacement phrase, to update the text paragraph.
Of course, in the storage medium containing computer-executable instructions provided by the embodiments of the present invention, the computer-executable instructions are not limited to the method operations described above and can also perform the relevant operations of the speech-to-text method provided by any embodiment of the present invention.
From the above description of the embodiments, a person skilled in the art can clearly understand that the present invention can be implemented by software together with the necessary general-purpose hardware, and of course also by hardware alone, although in many cases the former is the better implementation. Based on this understanding, the part of the technical solution of the present invention that is essential, or that contributes to the prior art, can be embodied in the form of a software product. The computer software product can be stored in a computer-readable storage medium, such as a computer floppy disk, read-only memory (ROM), random access memory (RAM), flash memory (FLASH), hard disk, or optical disc, and includes instructions that cause a computer device (which can be a personal computer, a server, a network device, etc.) to perform the speech-to-text method described in the embodiments of the present invention.
Note that, in the above embodiment of the speech-to-text apparatus, the units and modules are divided according to functional logic only; the division is not limited to the one described, as long as the corresponding functions can be realized. In addition, the specific names of the functional units are only for ease of distinguishing them from one another and do not limit the protection scope of the present invention.
Note that the above are only preferred embodiments of the present invention and the technical principles applied. A person skilled in the art will understand that the present invention is not limited to the specific embodiments described here, and that various obvious changes, readjustments, and substitutions can be made without departing from the protection scope of the present invention. Therefore, although the present invention has been described in some detail through the above embodiments, it is not limited to them; it may include further equivalent embodiments without departing from the inventive concept, and its scope is determined by the scope of the appended claims.

Claims (10)

1. A speech-to-text method, characterized by comprising:
obtaining voice information, and converting the voice information into a text paragraph;
segmenting the text paragraph to form multiple phrases;
receiving a target phrase selected by a user;
receiving a replacement phrase, input by the user, for replacing the target phrase;
replacing the target phrase with the replacement phrase, to update the text paragraph.
2. The method according to claim 1, characterized in that, after receiving the target phrase selected by the user, the method further comprises:
popping up an input box in which the user inputs the replacement phrase;
wherein receiving the replacement phrase, input by the user, for replacing the target phrase comprises:
receiving the replacement phrase, for replacing the target phrase, that the user inputs in the input box.
3. The method according to claim 2, characterized in that receiving the target phrase selected by the user comprises:
receiving one or more phrases that the user selects as needing modification, and taking the phrases selected by the user as the target phrase.
4. The method according to claim 3, characterized in that, after receiving the replacement phrase, input by the user, for replacing the target phrase, the method further comprises:
determining the target voice phrase in the voice information that corresponds to the target phrase;
establishing, according to the replacement phrase and the target voice phrase corresponding to the target phrase, a conversion correspondence between the target voice phrase and the replacement phrase, to replace the conversion correspondence between the target voice phrase and the target phrase.
5. The method according to claim 1, characterized in that, before or after receiving the target phrase selected by the user, the method further comprises:
obtaining a preset text modification range for synchronous modification, and taking both the target phrase and every phrase within the preset text modification range that is identical to the target phrase as target phrases.
6. The method according to claim 5, characterized in that, after receiving the target phrase selected by the user, the method further comprises:
marking all target phrases within the preset text modification range.
7. The method according to claim 1, characterized in that replacing the target phrase with the replacement phrase, to update the text paragraph, comprises:
determining whether the replacement phrase, once it has replaced the target phrase, is semantically correct in its context;
if it is correct, replacing the target phrase with the replacement phrase;
if it is incorrect, outputting a suggestion prompt, and, if the user confirms the modification again, replacing the target phrase with the replacement phrase.
8. A speech-to-text apparatus, characterized by comprising:
a conversion module, configured to obtain voice information and convert the voice information into a text paragraph;
a segmentation module, configured to segment the text paragraph to form multiple phrases;
a target phrase receiving module, configured to receive a target phrase selected by a user;
a replacement phrase receiving module, configured to receive a replacement phrase, input by the user, for replacing the target phrase;
a replacement module, configured to replace the target phrase with the replacement phrase, to update the text paragraph.
9. A device, characterized in that the device comprises:
one or more processors;
a storage apparatus, configured to store one or more programs;
wherein, when the one or more programs are executed by the one or more processors, the one or more processors implement the speech-to-text method according to any one of claims 1-7.
10. A storage medium containing computer-executable instructions, characterized in that the computer-executable instructions, when executed by a computer processor, are used to perform the speech-to-text method according to any one of claims 1-7.
CN201811006413.5A 2018-08-30 2018-08-30 Speech-to-text method, apparatus equipment and storage medium Pending CN109215661A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201811006413.5A CN109215661A (en) 2018-08-30 2018-08-30 Speech-to-text method, apparatus equipment and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201811006413.5A CN109215661A (en) 2018-08-30 2018-08-30 Speech-to-text method, apparatus equipment and storage medium

Publications (1)

Publication Number Publication Date
CN109215661A true CN109215661A (en) 2019-01-15

Family

ID=64986511

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201811006413.5A Pending CN109215661A (en) 2018-08-30 2018-08-30 Speech-to-text method, apparatus equipment and storage medium

Country Status (1)

Country Link
CN (1) CN109215661A (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110347996A (en) * 2019-07-15 2019-10-18 北京百度网讯科技有限公司 Amending method, device, electronic equipment and the storage medium of text
CN111540356A (en) * 2020-04-20 2020-08-14 苏州思必驰信息科技有限公司 Correction method and system for voice conversation
CN113378530A (en) * 2021-06-28 2021-09-10 北京七维视觉传媒科技有限公司 Voice editing method and device, equipment and medium

Citations (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20020062210A1 (en) * 2000-11-20 2002-05-23 Teac Corporation Voice input system for indexed storage of speech
CN103150294A (en) * 2011-12-06 2013-06-12 盛乐信息技术(上海)有限公司 Method and system for correcting based on voice identification results
CN103369122A (en) * 2012-03-31 2013-10-23 盛乐信息技术(上海)有限公司 Voice input method and system
US20140249815A1 (en) * 2007-10-04 2014-09-04 Core Wireless Licensing, S.a.r.l. Method, apparatus and computer program product for providing text independent voice conversion
CN105810197A (en) * 2014-12-30 2016-07-27 联想(北京)有限公司 Voice processing method, voice processing device and electronic device
CN106328145A (en) * 2016-08-19 2017-01-11 北京云知声信息技术有限公司 Voice correction method and voice correction device
CN106782543A (en) * 2017-03-24 2017-05-31 联想(北京)有限公司 A kind of information processing method and electronic equipment
CN107093423A (en) * 2017-05-27 2017-08-25 努比亚技术有限公司 A kind of phonetic entry modification method, device and computer-readable recording medium
CN107316639A (en) * 2017-05-19 2017-11-03 北京新美互通科技有限公司 A kind of data inputting method and device based on speech recognition, electronic equipment
CN107608957A (en) * 2017-09-06 2018-01-19 百度在线网络技术(北京)有限公司 Text modification method, apparatus and its equipment based on voice messaging
US20180190294A1 (en) * 2017-01-03 2018-07-05 Beijing Baidu Netcom Science And Technology Co., Ltd. Input method and apparatus

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110347996A (en) * 2019-07-15 2019-10-18 北京百度网讯科技有限公司 Amending method, device, electronic equipment and the storage medium of text
CN110347996B (en) * 2019-07-15 2023-06-20 北京百度网讯科技有限公司 Text modification method and device, electronic equipment and storage medium
CN111540356A (en) * 2020-04-20 2020-08-14 苏州思必驰信息科技有限公司 Correction method and system for voice conversation
WO2021212817A1 (en) * 2020-04-20 2021-10-28 思必驰科技股份有限公司 Method and apparatus for correcting voice dialogue
CN111540356B (en) * 2020-04-20 2022-05-17 思必驰科技股份有限公司 Correction method and system for voice conversation
US11804217B2 (en) 2020-04-20 2023-10-31 Ai Speech Co., Ltd. Method and apparatus for correcting voice dialogue
CN113378530A (en) * 2021-06-28 2021-09-10 北京七维视觉传媒科技有限公司 Voice editing method and device, equipment and medium

Similar Documents

Publication Publication Date Title
US11594211B2 (en) Methods and systems for correcting transcribed audio files
CN111090728B (en) Dialogue state tracking method and device and computing equipment
US20160062604A1 (en) Virtual assistant development system
EP3179473A1 (en) Training method and apparatus for language model, and device
CN101778160B (en) Method and device for dynamically presetting message
CN109215661A (en) Speech-to-text method, apparatus equipment and storage medium
CN107239547B (en) Voice error correction method, terminal and storage medium for ordering song by voice
CN112199477B (en) Dialogue management scheme and dialogue management corpus construction method
CN109240670A (en) Modular software development methodology, system, equipment and medium
CN109545205A (en) Virtual assistant based on context is realized
US20190147029A1 (en) Method and system for generating conversational user interface
CN103150294A (en) Method and system for correcting based on voice identification results
CN109271503A (en) Intelligent answer method, apparatus, equipment and storage medium
CN104202458A (en) Method and intelligent terminal for automatically storing contact information
US11615788B2 (en) Method for executing function based on voice and electronic device supporting the same
CN110502645A (en) Information query method and device
CN112233669A (en) Speech content prompting method and system
CN112559718B (en) Method, device, electronic equipment and storage medium for dialogue processing
CN109062891A (en) Media processing method, device, terminal and medium
CN110599126B (en) Schedule creating method and device, electronic equipment and storage medium
WO2022213943A1 (en) Message sending method, message sending apparatus, electronic device, and storage medium
CN104468926A (en) Method and device for controlling contact persons in mobile terminal
CN113792558A (en) Self-learning translation method and device based on machine translation and post-translation editing
CN113299290A (en) Method and device for speech recognition, electronic equipment and readable storage medium
CN109032497A (en) Speech-to-text method, apparatus equipment and storage medium

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
TA01 Transfer of patent application right

Effective date of registration: 20200420

Address after: Room 706, Room 1699, South Zuchong Road, Yushan Town, Kunshan City, Suzhou City, Jiangsu Province

Applicant after: Kunshan Pinyuan Intellectual Property Operating Technology Co., Ltd.

Address before: 200233 Shanghai city industry Jinshan District Jinshan Industrial Zone Road No. 218 building 3 layer 2

Applicant before: SHANGHAI YUDE COMMUNICATION TECHNOLOGY Co., Ltd.

RJ01 Rejection of invention patent application after publication

Application publication date: 20190115