WO2022143454A1 - Information correction method and apparatus, and electronic device - Google Patents

Information correction method and apparatus, and electronic device Download PDF

Info

Publication number
WO2022143454A1
WO2022143454A1 PCT/CN2021/141161 CN2021141161W WO2022143454A1 WO 2022143454 A1 WO2022143454 A1 WO 2022143454A1 CN 2021141161 W CN2021141161 W CN 2021141161W WO 2022143454 A1 WO2022143454 A1 WO 2022143454A1
Authority
WO
WIPO (PCT)
Prior art keywords
information
candidate
words
target
word
Prior art date
Application number
PCT/CN2021/141161
Other languages
French (fr)
Chinese (zh)
Inventor
王林林
Original Assignee
维沃移动通信有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 维沃移动通信有限公司 filed Critical 维沃移动通信有限公司
Publication of WO2022143454A1 publication Critical patent/WO2022143454A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/02Input arrangements using manually operated switches, e.g. using keyboards or dials
    • G06F3/023Arrangements for converting discrete items of information into a coded form, e.g. arrangements for interpreting keyboard generated codes as alphanumeric codes, operand codes or instruction codes
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/048Interaction techniques based on graphical user interfaces [GUI]
    • G06F3/0487Interaction techniques based on graphical user interfaces [GUI] using specific features provided by the input device, e.g. functions controlled by the rotation of a mouse with dual sensing arrangements, or of the nature of the input device, e.g. tap gestures based on pressure sensed by a digitiser
    • G06F3/0488Interaction techniques based on graphical user interfaces [GUI] using specific features provided by the input device, e.g. functions controlled by the rotation of a mouse with dual sensing arrangements, or of the nature of the input device, e.g. tap gestures based on pressure sensed by a digitiser using a touch-screen or digitiser, e.g. input of commands through traced gestures
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/16Sound input; Sound output
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/279Recognition of textual entities
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/26Speech to text systems

Definitions

  • the present application belongs to the field of communication technologies, and specifically relates to an information correction method, device and electronic device.
  • the pinyin input method and the voice input method are often used.
  • the voice input method is favored by more users.
  • the voice input method has the advantages of being fast and easy to use, it also has a serious disadvantage: when the user speaks a long sentence, there are often misrecognized words in the sentence after speech recognition, resulting in the recognition of wrong words.
  • the accuracy of text information is low, requiring users to modify and identify wrong words.
  • the purpose of the embodiments of the present application is to provide an information correction method, device and electronic device, which can solve the problem that the input efficiency of the voice input method in the prior art needs to be improved.
  • an embodiment of the present application provides an information correction method, the method comprising:
  • the first text information that is converted and generated according to the first voice information is displayed in the target area in the information editing interface;
  • the corresponding candidate word is displayed in the candidate information area in the information editing interface
  • the first text information in the target area is modified.
  • an information correction device including:
  • a display module used for displaying the first text information converted and generated according to the first voice information in the target area in the information editing interface
  • a processing module configured to display corresponding candidate words in the candidate information area in the information editing interface according to the received first input
  • a target determination module configured to obtain first indication information according to a user's operation in the candidate information area, where the first indication information is used to indicate a target candidate word in the candidate word;
  • An information modification module configured to modify the first text information in the target area according to the target candidate word.
  • embodiments of the present application provide an electronic device, the electronic device includes a processor, a memory, and a program or instruction stored on the memory and executable on the processor, the program or instruction being The processor implements the steps of the method according to the first aspect when executed.
  • an embodiment of the present application provides a readable storage medium, where a program or an instruction is stored on the readable storage medium, and when the program or instruction is executed by a processor, the steps of the method according to the first aspect are implemented .
  • an embodiment of the present application provides a chip, the chip includes a processor and a communication interface, the communication interface is coupled to the processor, and the processor is configured to run a program or an instruction, and implement the first aspect the method described.
  • an embodiment of the present application provides a computer program product, the computer program product is stored in a non-volatile storage medium, and the computer program product is executed by at least one processor to implement the first aspect. method described.
  • an embodiment of the present application provides an electronic device, where the electronic device is configured to execute the method described in the first aspect.
  • the user by converting the user's input into the corresponding pinyin information, and developing candidate words for the pinyin, the user can easily correct the recognition errors that occur during voice input, and quickly and accurately obtain the desired input. content, improve the efficiency of voice input, and improve the user experience.
  • Fig. 1 is one of the flow charts of the information correction method of the embodiment of the present application.
  • Fig. 2 is one of the schematic diagrams of the voice input interface of the embodiment of the present application.
  • FIG 3 is one of the schematic diagrams of the information editing interface under the correction mode of the embodiment of the present application.
  • FIG. 6 is the second schematic diagram of the process of determining target candidate words in the correction mode according to the embodiment of the present application.
  • FIG. 8 is the second flow chart of the information correction method according to the embodiment of the present application.
  • FIG 10 is the second schematic diagram of the information editing interface in the correction mode of the embodiment of the present application.
  • 11 is the second schematic diagram of the deletion operation of the information editing interface in the correction mode according to the embodiment of the present application.
  • FIG. 12 is the third schematic diagram of the process of determining the target candidate word in the correction mode according to the embodiment of the present application.
  • FIG. 13 is the second schematic diagram of the completion of the correction in the correction mode of the embodiment of the present application.
  • FIG. 14 is the third flowchart of the information correction method according to the embodiment of the present application.
  • FIG. 15 is a block diagram of a display device according to an embodiment of the present application.
  • 16 is a block diagram of an electronic device according to an embodiment of the present application.
  • FIG. 17 is a schematic diagram of a hardware structure of an electronic device according to an embodiment of the present application.
  • first, second and the like in the description and claims of the present application are used to distinguish similar objects, and are not used to describe a specific order or sequence. It is to be understood that the data so used are interchangeable under appropriate circumstances so that the embodiments of the present application can be practiced in sequences other than those illustrated or described herein, and distinguish between “first”, “second”, etc.
  • the objects are usually of one type, and the number of objects is not limited.
  • the first object may be one or more than one.
  • “and/or” in the description and claims indicates at least one of the connected objects, and the character “/" generally indicates that the associated objects are in an "or” relationship.
  • Multi-level candidate The user inputs a series of correct pinyin, but the desired sentence is not in the candidate list, and needs to select lower-level words one by one to form the desired sentence. This step-by-step selection process is called multi-level selection. level candidate.
  • Multi-order words (n-gram): We call a word with a more complete meaning a unigram or a first-order word. Such as “beauty”, “Hangzhou”, “Gongchen Bridge” and so on. A word composed of N single-order words is called a multi-order word. For example, “Hangzhou Gongchen Bridge” consists of two single-order words, which is the second-order word (bigram) in the multi-order word.
  • Higher-order words and second-order words This is a related concept. If a multi-order word A is composed of other multi-order words or single-order words, then A is the higher-order word of these other words, other These words are the second-order words of A, also called the lower-order words of A.
  • “Hangzhou Gongchen Bridge” is a higher-order word of "Hangzhou”
  • “Hangzhou” and “Gongchen Bridge” are both secondary words of "Hangzhou Gongchen Bridge”.
  • Weighted Finite State Transducer It is an efficient white-box model.
  • the built model can automatically select a path that meets each input condition according to a series of inputs, from the initial state to the end state, and output the sequences on these paths in turn. It can be used to realize the functions of pinyin characters to vocabulary, vocabulary to sentences and so on.
  • multiple models can also be combined into one to efficiently achieve the effect of cascading multiple models, such as the direct translation of Pinyin characters into sentences.
  • an embodiment of the present application provides an information correction method, including:
  • Step 11 Display the first text information converted and generated according to the first voice information in the target area in the information editing interface.
  • the text recognized according to the voice is directly displayed on the screen and displayed in the target area, which can give full play to the advantage of fewer voice input buttons.
  • Step 12 Display the corresponding candidate words in the candidate information area in the information editing interface according to the received first input.
  • the correction mode can be triggered by the user's operation, such as triggering by the user clicking or long-pressing the text in the target area.
  • the recognized characters can be corrected according to the received first input.
  • the user can perform a correction operation according to the voice.
  • the voice input by the user at this time ie, the second voice information
  • candidate words are converted and generated for the user to select.
  • the voice recognition of the desired word input by the user by using the voice can be converted into pinyin, and then multi-level candidate words can be expanded according to the pinyin for the user to select the correct target candidate word in multiple levels.
  • pinyin recognition may be performed according to the second text information, and candidate words corresponding to the second target pinyin are displayed for the user to select.
  • Step 13 Obtain first indication information according to the user's operation in the candidate information area, where the first indication information is used to indicate a target candidate word in the candidate words.
  • the first indication information may be obtained according to operations such as a single click or a long press of the user.
  • the first indication information is obtained when the user clicks a candidate word, and the candidate word selected by the user's click is used as the target candidate word.
  • Step 14 Modify the first text information in the target area according to the target candidate word.
  • the embodiments of the present application take advantage of the above features, when the user inputs the desired text using the voice input method, the voice input by the user is converted into text and directly displayed on the screen, which can give full play to the advantage of fewer voice input buttons.
  • the correction mode can be turned on. In this mode, the user can still input the pronunciation of the desired word or sentence by voice.
  • the present embodiment will not convert the speech recognition into Chinese characters and directly display it on the screen, but first adopts the speech recognition mode to identify the corresponding pinyin of the speech, and then expands multi-level candidate words for the user to select according to these pinyin.
  • the user can select the target candidate word through a multi-level candidate method, without having to press many keys to correct the text through pinyin input, avoid interrupting the rhythm of the speech input, and improve the speech input efficiency.
  • the information modification method further includes: determining that the first target position is located between the text information displayed in the target area, or the first target position in the target area, according to the user's operation in the information editing interface.
  • the correction mode is enabled; wherein, the first text information in the target area includes the second text information.
  • the correction mode can be turned on.
  • the displaying the corresponding candidate words in the candidate information area in the information editing interface according to the received first input includes: obtaining the first target pinyin according to the received second voice information; The first target pinyin is described, and the candidate words corresponding to the second phonetic information are displayed in the candidate information area.
  • this embodiment of the present application can use WFST to convert pinyin to vocabulary and then convert to phrases to provide candidate words for users, and according to the weight (for example, rank order) order, probability, etc.) to rank candidate words from high to low.
  • these candidate words not only include high-order words that match all pinyin, but also high-frequency and low-order words that match part of pinyin. In this way, the user can avoid the tediousness of inputting the pinyin of each low-level word by pressing the keyboard step by step, and also avoid the problem that the selection cost increases due to the large ambiguity of short words. Therefore, the words desired by the user can be obtained more quickly to correct the erroneous text.
  • fewer user operations are required for input, and it is suitable for correcting relatively long input phrases (such as between 2nd and 4th order), and relatively short words with high frequency (usually 1, 2 order, a few words of order 3).
  • the user can select the target candidate word that he wants to input from the candidate words of each order, and this process may require multi-level consumption of pinyin to complete.
  • the user can confirm whether the pinyin converted by speech recognition is correct.
  • the received second voice information is converted into corresponding pinyin information, and displayed in the pinyin information area in the information editing interface. That is to say, the pinyin recognized and converted according to the second voice information can be displayed in a certain area (such as the pinyin information area) in the information editing interface, and the user can edit and modify the pinyin in this area.
  • the pinyin displayed in this area can be used as the first target pinyin, and the candidate words corresponding to the first target pinyin are displayed in the candidate information area.
  • the confirmation operation for the target candidate word may be that when the user clicks on the candidate word, the candidate word replaces the pinyin information corresponding to the candidate word and displays it in the pinyin information area until the pinyin information is displayed in the pinyin information area.
  • the candidate word in the pinyin information area is used as the target candidate word.
  • the string of pinyin fang'yi'wu'zi is displayed in the "pinyin information area”
  • the “Put a room” is displayed in the candidate information area.
  • “epidemic prevention” and “overflow prevention” and other candidate words after the user clicks “epidemic prevention”, the pinyin “fang'yi” is consumed, and the "pinyin information area” is displayed as epidemic prevention wu'zi, and enters the secondary selection.
  • the candidate information area displays the candidate words corresponding to "wu'zi” such as "house” and "materials”.
  • the displaying the corresponding candidate words in the candidate information area in the information editing interface according to the received first input includes: according to the received second voice information and preset rules, in the The candidate information area displays candidate words corresponding to the second voice information;
  • the preset rule includes: when the candidate word is a phrase, displaying the higher-order word in the candidate word before the lower-order word, and displaying the second-order word of the higher-order word Before the same-order words of the second-order words; the number of words contained in the higher-order words is greater than the number of words contained in the lower-order words, the number of words contained in the two same-order words is the same, and the second-order words of the higher-order words contain the same number of words.
  • the first-order word is the word contained in the higher-order word;
  • the second voice information is the phonetic information corresponding to the higher-order word including the target candidate word;
  • the first indication information is obtained according to the long-press operation Instructions.
  • the first indication information is obtained according to the user's long-pressing on the candidate word, and the candidate word selected by the user's long-pressing is used as the target candidate word.
  • a first-order one-character word or two-character word if the word is not commonly used, you can use voice to input a longer common word containing this word, and the common word is a high-level word containing the word you want to input. Then the user can select the second-order word corresponding to the higher-order word (that is, the word that the user wants) from the multi-order candidate words and directly display it on the screen. This second-order word can be obtained accurately and quickly with the help of the input longer word.
  • the language model in the correction mode will give a greater weight to the more common combination, so that it ranks at the top of the candidate words.
  • the same-order words are not completely sorted by probability, but the second-order words contained in the higher-order words can appear in front of the words of the same order as this second-order word.
  • the single-order word "epidemic prevention” is part of the higher-order word “epidemic prevention measures", so among many first-order words, "epidemic prevention" is guaranteed to come before words such as "put one" or "overflow prevention”.
  • the information correction method further includes: converting the received second voice information into corresponding pinyin information, and displaying the information in the pinyin information area in the information editing interface.
  • the user can identify and convert the pinyin "xian'xing'ji'yin” according to the "dominant gene”, and display the pinyin in the pinyin information area, and the pinyin corresponds to The candidate words of are displayed in the candidate information area.
  • the displaying the corresponding candidate words in the candidate information area in the information editing interface according to the received first input includes: when the second text information in the target area is selected, According to the second text information, determine the second target pinyin; according to the second target pinyin, display the candidate words corresponding to the second target pinyin in the candidate information area; wherein, the first target in the target area A text message includes the second text message.
  • the modifying the first text information in the target area according to the target candidate word includes: deleting the text information corresponding to the second indication information in the target area through the deletion control; For the second indication information, a second target position is determined, and target candidate words are displayed at the second target position; wherein, the second indication information is obtained according to the user's operation in the target area.
  • the second indication information can be obtained according to the user's operation of clicking "delete control k2 forward" or “delete control k3 backward” on the candidate information area, and delete the text in the target area with a recognition error.
  • the character in at least one adjacent position of the target position in the information editing interface can be deleted through the delete control.
  • the information correction method further includes: according to the third indication information, locating to the end of the sentence of the text information in the target area, and exiting the correction mode.
  • the third indication information may be obtained according to the operation of the user clicking the "correction completion confirmation control k1" on the candidate information area.
  • the user can conveniently move the cursor to the end of the paragraph by using the "correction completion confirmation control k1" to exit the correction mode.
  • the information correction method before correcting the first text information in the target area according to the target candidate word, the information correction method further includes: displaying a deletion control and a correction completion confirmation control k1 on the candidate information area,
  • the deletion control includes at least one of forward deletion control k2 and backward deletion control k3;
  • the forward deletion control k2 is used to receive an instruction to delete the character at the left adjacent position of the first target position in the information editing interface
  • the backward deletion control k3 is used to receive an instruction to delete the character at the adjacent position on the right side of the first target position in the information editing interface;
  • the correction completion confirmation control k1 is used to receive an instruction to move the cursor to the end of the sentence of the text information in the target area;
  • the first target position is determined according to the user's operation in the information editing interface.
  • the characters at the left adjacent position of the target position in the information editing interface can be deleted by forwardly deleting the control k2; the characters at the right adjacent position of the target position in the information editing interface can be deleted by backward deleting the control k3 ;
  • the cursor can be moved to the end of the text message in the target area by modifying the completion confirmation control k1.
  • a backward deletion control k3 is also provided.
  • the user needs to delete the wrong word segment on the upper screen (that is, the characters adjacent to the target position in the information editing interface), in addition to forward deletion, the text adjacent to the target position can also be deleted backward.
  • the correction mode also provides a "correction completion confirmation control k1", which can exit the correction mode and return to the normal mode with one key. For example, after the user has corrected all the wrong words, press the "correction completion confirmation control k1" to move the cursor to the end of the sentence in the target area, ready to start a new input.
  • This type of shortcut keys can effectively improve input efficiency.
  • the user uses the voice input method to read a whole sentence of speech, and then the speech is mistakenly recognized as "the house is full now", and the recognized text is directly displayed on the screen. At this time, the user finds that "put in a room” is different from the text he wants to enter, and needs to be corrected.
  • the cursor can be moved to the error position, and the correction mode is triggered at this time.
  • the target area in the information editing interface is displayed as "now the house is full”.
  • users can click the voice recognition button below and still use voice to input "epidemic prevention materials”.
  • the speech recognition engine at this time may share most of the modules with the speech recognition engine in the normal mode, or may be two independent engines.
  • the engine of the correction mode is specially optimized for the phonetic-to-pinyin conversion, which can ensure a higher accuracy rate when the engine converts the phonetic to pinyin.
  • this application can use the WFST model to convert the string of pinyin into many multi-level words, for example, "put a room” is a bigram, which consists of two words "put one" and "room” composed of single-order words.
  • a multi-level candidate function is added to the voice input method, and when correcting a small inaccurate segment of a sentence (or a sentence input through the Pinyin input method) recognized by the voice, the voice input method is used After inputting the voice of the desired word, the voice recognition is converted into pinyin, and then multi-level candidate words are expanded according to the pinyin, so that the user can select the correct target candidate word in multiple levels.
  • Step 801 the voice input by the user (usually a sentence) is recognized by voice, and the recognition is converted into text and then directly displayed on the screen, which can give full play to the advantage of fewer keys for voice input.
  • Step 802 Check whether there is any wrong word in the text on the upper screen; if there is no error, go to Step 803; if there is an error, go to Step 804.
  • Step 803 Continue to input the next sentence by voice.
  • Step 804 Locate (by moving the cursor) the target position, that is, one of the wrong words.
  • the correction mode is activated.
  • a button that can delete the text backward ie “backward delete control k3" is provided, This makes it easier to remove erroneous text.
  • Step 805 Input the voice corresponding to the text to be input by voice.
  • Step 806 The optimized speech recognition engine in the embodiment of the present application mainly converts phrases rather than long sentences, and instead of directly converting the speech into Chinese characters, it first converts the speech into Pinyin.
  • the reason for adopting this conversion method is that there are only about 407 different Chinese pinyin, even if the tones are counted, it is much less than the more than 30,000 Chinese characters. Therefore, the accuracy rate of converting the phonetic into a pinyin string is relatively high. It is easier than converting to a Chinese character string.
  • Step 807 The user can confirm whether the pinyin converted by speech recognition is correct; if it is correct, go to Step 809, otherwise, go to Step 808;
  • Step 808 If the pinyin recognition is wrong, the user can still modify it through the keyboard. Of course, this should be avoided as much as possible, as it will increase the number of keystrokes.
  • Step 809 For the correct pinyin (ie, the first target pinyin), the trained WFST can convert the pinyin into multi-order candidate words, and arrange them according to the weight (such as order, probability, etc.) from high to low.
  • This WFST can be a composite of two models: the pinyin-to-first-order vocabulary, and the WFST language model of the vocabulary.
  • Step 810 According to the candidate words, the user can select a word as long as possible that is consistent with the target word by clicking the candidate word, and consume the corresponding pinyin until all the pinyin is consumed, and the desired word is displayed on the screen.
  • Step 811 Determine whether all the pinyin has been consumed; if yes, go to step 812; if not, go to step 810, that is, continue to select candidate words and consume the corresponding pinyin.
  • Step 812 When all the pinyin is exhausted, complete the screen of the word or phrase you want to input.
  • Step 813 The user can continue to confirm whether the text in the target area in the information editing interface is correct or not, and check whether there are other incorrect segments in the sentence. If yes, go to step 804; if not, go to step 814. That is to say, when there are multiple errors in the sentence, steps 804 to 813 can be executed repeatedly until all the error segments of the sentence are corrected.
  • Step 814 When the user confirms that the sentence has been corrected, he can press the "correction completion confirmation control k1", at this time, the cursor will automatically move to the end of the text in the target area, ready to start the input of the next sentence, so as to avoid moving the cursor. Positioning operation.
  • the 7 key operations include 4 deletions of the wrong word (respectively delete "put", "one", “house”, “zi"), 1 voice button and 2 two-level word selection.
  • the Pinyin input method to input a string of text input by the above voice, even using Jianpin will require a lot of keystrokes; if you use the traditional phonetic input method, you need to move the cursor to the wrong word, and then use the keyboard to input Pinyin to Correction, also requires a lot of keystrokes and interrupts the rhythm of voice input.
  • most of the operations are used in voice and word selection, and the number of keystrokes is reduced to a lower number, so that a correct sentence input can be obtained with less positioning and keystrokes, thereby making the voice input more efficient. promote.
  • the user can quickly correct the wrong words in the text through the information correction method provided in this application, so as to achieve the purpose of quickly and accurately inputting large pieces of text.
  • Embodiment 2 is a diagrammatic representation of Embodiment 1:
  • the user uses the voice input method to input a piece of speech. Suppose it is mistakenly recognized as "now the house is full of translation materials", and the recognized text is directly displayed on the screen. At this time, the user finds that the "translation" is different from the desired text and needs to be corrected.
  • the cursor can be moved to the error position, and the correction mode is triggered at this time.
  • the user can delete the word “translation” through the "delete control k2 forward".
  • the interface is displayed as shown in Figure 11, and the target area in the information editing interface is displayed as "Now the house is full of supplies”.
  • users can click the voice recognition button below to re-enter the voice.
  • users can choose to input "epidemic prevention measures” by voice instead of directly inputting the voice of "epidemic prevention”.
  • the WFST model can be used to convert the string of pinyin into many multi-level words, such as "epidemic prevention measures”.
  • "Measures” is a bigram consisting of “epidemic prevention” and “measures”, and at the same time, it will also be transformed into some low-level words, such as “epidemic prevention” and "put one” are single-level words.
  • high-order words are always arranged in front of low-order words; and in words of the same order, the sorting is mainly based on the probability of the WFST model.
  • the same-order words are not completely sorted by probability, but the second-order words contained in the higher-order words can appear in front of the words of the same order as this second-order word.
  • the single-order word "epidemic prevention” is part of the higher-order word “epidemic prevention measures”, so among many first-order words, "epidemic prevention” will be guaranteed to be ranked in words such as "put one" or "overflow prevention” in front of.
  • the user long presses the target candidate word "epidemic prevention", the purpose is to make the word “epidemic prevention” directly on the screen to complete the input of the correct word. During this process, the unconsumed pinyin "cuo'shi” will be automatically ignored. It should be noted that the more common the phrase entered by the voice, the higher the probability that the language model will ensure that the word and its subordinate words appear. If the user thinks that all the words have been corrected at this time, he can press the "correction completion confirmation control k1" shown in Figure 13, and the cursor can be conveniently moved to the end of the paragraph to automatically exit the correction mode.
  • Step 1401 Voice recognition of the voice input by the user (usually a sentence), and the recognition is converted into text and directly displayed on the screen.
  • Step 1402 Check whether there is any wrong word in the text on the upper screen; if there is no error, go to Step 1403; if there is an error, go to Step 1404.
  • Step 1403 Continue to input the next sentence by voice.
  • Step 1404 Move the cursor to locate the target position, that is, one of the wrong words.
  • the correction mode is activated, and in this mode, the wrong word can be deleted forward or backward at the target position (ie, adjacent to the target position).
  • Step 1405 When inputting an uncommon low-order word (usually 1st-order or 2nd-order), in order to improve the word probability of the word, the user can choose to input a high-order word (the more Commonly used as possible) speech instead of directly entering the low-order word.
  • the language model will give more weight to the more common combinations, ranking them higher in the candidate words.
  • Step 1406 The optimized speech engine converts the speech input by the user into corresponding pinyin.
  • Step 1407 For the identified correct pinyin, the WFST model will convert it into many multi-level words.
  • “epidemic prevention measures” is a second-order word consisting of “epidemic prevention” and “measures”, "put one” and “epidemic prevention”.
  • " is a single-order word.
  • high-order words always appear in front of low-order words.
  • the sorting is mainly based on the probability of the WFST model.
  • the second-order words included in the higher-order words in the candidate words are arranged in front of other candidate words of the same order as this second-order word.
  • the second-order words of the high-order word have the opportunity to appear in a higher position in the candidate words with the help of the correct probability of the high-order word.
  • the second-order word is only ranked at the top of its same-order words, and it is still ranked at the back of other high-order words.
  • Step 1408 Determine whether there is a word that the user wants in the second-order words in the candidate words; if yes, go to Step 1410; if not, the user can choose to modify the pinyin, that is, go to Step 1409, or can use voice to re-input other words.
  • Step 1405 is executed to include the speech corresponding to the commonly used higher-order words of the words to be input.
  • Step 1409 The user can modify the pinyin through the keyboard.
  • Step 1410 Since the user has found the desired word from the candidate words, although there are still redundant pinyin not consumed at this time, the mission of these redundant pinyin (ie, helping to find the word that the user needs) has been completed. Therefore, the user can press and hold the target candidate word to make it appear on the screen directly, and the pinyin that is not consumed will be ignored, so that the input of the correct word can be completed.
  • Step 1411 The user can continue to confirm whether the text in the target area in the information editing interface is correct or not, and check whether there are other incorrect segments in the sentence. If yes, go to step 1404; if not, go to step 1412.
  • Step 1412 When the user confirms that the sentence has been corrected, he can press the "correction completion confirmation control k1", at this time, the cursor will automatically move to the end of the text in the target area, ready to start the input of the next sentence, so as to avoid moving the cursor. Positioning operation.
  • the information correction method provided in this embodiment is more suitable for correcting the input of less common shorter words (usually 1st-order or 2nd-order words, and may also be a few 3rd-order words).
  • less common shorter words usually 1st-order or 2nd-order words, and may also be a few 3rd-order words.
  • the present application can prevent the user from searching for the desired word at a very late position in the candidate word list, thereby improving the input efficiency.
  • the speech recognition engine of the correction mode is optimized.
  • the engine can be simpler than the normal mode, but it can be optimized for the conversion of phonetic pinyin, so that the accuracy rate of pinyin conversion is higher. Level. Therefore, it can also be considered that the corresponding voice information input by the user is attached beside the text after voice recognition and correction.
  • the information receiver can confirm the accuracy of the text, and on the other hand, it can also provide a very good parallel corpus, and the parallel corpus can be used to improve the accuracy of the user's speech engine.
  • the pinyin of the word or short sentence to be input is obtained through voice input, and the Chinese character corresponding to the pinyin is selected by using multi-level candidates and high-level word auxiliary candidates, which is easy to operate.
  • the execution body may be an information correction device, or a control module in the information correction device for executing the information correction method.
  • the information correction device provided by the embodiment of the present application is described by taking the information correction device executing the information correction method as an example.
  • an embodiment of the present application further provides an information correction apparatus 1500, including:
  • the display module 1501 is used to display the first text information converted and generated according to the first voice information in the target area in the information editing interface;
  • the processing module 1502 is configured to display the corresponding candidate words in the candidate information area in the information editing interface according to the received first input;
  • a target determination module 1503, configured to obtain first indication information according to the user's operation in the candidate information area, where the first indication information is used to indicate the target candidate word in the candidate word;
  • the information modification module 1504 is configured to modify the first text information in the target area according to the target candidate word.
  • processing module 1502 includes:
  • a first processing unit configured to obtain the first target pinyin according to the received second voice information
  • the second processing unit is configured to display the candidate words corresponding to the second voice information in the candidate information area according to the first target pinyin.
  • processing module 1502 includes:
  • a third processing unit configured to display the candidate words corresponding to the second voice information in the candidate information area according to the received second voice information and a preset rule
  • the preset rule includes: when the candidate word is a phrase, displaying the higher-order word in the candidate word before the lower-order word, and displaying the second-order word of the higher-order word Before the same-order words of the second-order words; the number of words contained in the higher-order words is greater than the number of words contained in the lower-order words, the number of words contained in the two same-order words is the same, and the second-order words of the higher-order words contain the same number of words.
  • An order word is a word contained in the higher order word;
  • the second voice information is the voice information corresponding to the higher-order words including the target candidate words
  • the first indication information is indication information obtained according to a long-press operation.
  • processing module 1502 includes:
  • a fourth processing unit configured to determine a second target pinyin according to the second text information when the second text information in the target area is selected
  • a fifth processing unit configured to display candidate words corresponding to the second target pinyin in the candidate information area according to the second target pinyin;
  • the first text information in the target area includes the second text information.
  • the information correction apparatus 1500 further includes:
  • a control display module configured to display a deletion control and a correction completion confirmation control on the candidate information area, the deletion control includes at least one of a forward deletion control and a backward deletion control;
  • the forward deletion control is used to receive an instruction to delete the character at the left adjacent position of the first target position in the information editing interface
  • the backward deletion control is used to receive an instruction to delete the character at the right adjacent position of the first target position in the information editing interface
  • the correction completion confirmation control is used to receive an instruction to move the cursor to the end of the sentence of the text information in the target area;
  • the first target position is determined according to the user's operation in the information editing interface.
  • the information correction module 1504 includes:
  • a first correction unit configured to delete the text information corresponding to the second indication information in the target area through the deletion control
  • a second correction unit configured to determine a second target position according to the second indication information, and display the target candidate word at the second target position
  • the second indication information is obtained according to the user's operation in the target area.
  • the user by converting the user's input into the corresponding pinyin information, and developing candidate words for the pinyin, the user can easily correct the recognition errors that occur during voice input, and quickly and accurately obtain the desired input. content, improve the efficiency of voice input, and improve the user experience.
  • the information correction device in this embodiment of the present application may be a device, or may be a component, an integrated circuit, or a chip in a terminal.
  • the apparatus may be a mobile electronic device or a non-mobile electronic device.
  • the mobile electronic device may be a mobile phone, a tablet computer, a notebook computer, a palmtop computer, an in-vehicle electronic device, a wearable device, an ultra-mobile personal computer (UMPC), a netbook, or a personal digital assistant (personal digital assistant).
  • UMPC ultra-mobile personal computer
  • netbook or a personal digital assistant
  • non-mobile electronic devices can be servers, network attached storage (Network Attached Storage, NAS), personal computer (personal computer, PC), television (television, TV), teller machine or self-service machine, etc., this application Examples are not specifically limited.
  • Network Attached Storage NAS
  • personal computer personal computer, PC
  • television television
  • teller machine or self-service machine etc.
  • the information correction device in the embodiment of the present application may be a device with an operating system.
  • the operating system may be an Android (Android) operating system, an ios operating system, or other possible operating systems, which are not specifically limited in the embodiments of the present application.
  • the information correction apparatus provided in this embodiment of the present application can implement each process implemented by the method embodiments in FIG. 1 to FIG. 14 , and to avoid repetition, details are not repeated here.
  • an embodiment of the present application further provides an electronic device 1600, including a processor 1602, a memory 1601, a program or instruction stored in the memory 1601 and executable on the processor 1602,
  • an electronic device 1600 including a processor 1602, a memory 1601, a program or instruction stored in the memory 1601 and executable on the processor 1602,
  • the program or instruction is executed by the processor 1602
  • each process of the above-mentioned information correction method embodiment is implemented, and the same technical effect can be achieved. To avoid repetition, details are not described here.
  • the electronic devices in the embodiments of the present application include the aforementioned mobile electronic devices and non-mobile electronic devices.
  • FIG. 17 is a schematic diagram of a hardware structure of an electronic device implementing an embodiment of the present application.
  • the electronic device 1700 includes but is not limited to: a radio frequency unit 1701, a network module 1702, an audio output unit 1703, an input unit 1704, a sensor 1705, a display unit 1706, a user input unit 1707, an interface unit 1708, a memory 1709, and a processor 1710, etc. part.
  • the electronic device 1700 may also include a power source (such as a battery) for supplying power to various components, and the power source may be logically connected to the processor 1710 through a power management system, so as to manage charging, discharging, and power consumption through the power management system. consumption management and other functions.
  • a power source such as a battery
  • the structure of the electronic device shown in FIG. 17 does not constitute a limitation on the electronic device.
  • the electronic device may include more or less components than those shown in the figure, or combine some components, or arrange different components, which will not be repeated here. .
  • the processor 1710 is configured to display the first text information converted and generated according to the first voice information in the target area in the information editing interface; according to the received first input, display in the candidate information area in the information editing interface corresponding candidate words; first indication information is obtained according to the user's operation in the candidate information area, and the first indication information is used to indicate the target candidate words in the candidate words; The first text information in the target area is modified.
  • the user by converting the user's input into the corresponding pinyin information, and developing candidate words for the pinyin, the user can easily correct the recognition errors that occur during voice input, and quickly and accurately obtain the desired input. content, improve the efficiency of voice input, and improve the user experience.
  • the processor 1710 when displaying the corresponding candidate words in the candidate information area in the information editing interface according to the received first input, is further configured to: obtain the first voice information according to the received second voice information target pinyin; according to the first target pinyin, display the candidate words corresponding to the second voice information in the candidate information area.
  • the processor 1710 when displaying the corresponding candidate words in the candidate information area in the information editing interface according to the received first input, is further configured to: according to the received second voice information and preset rules , displaying the candidate words corresponding to the second voice information in the candidate information area;
  • the preset rule includes: when the candidate word is a phrase, displaying the higher-order word in the candidate word before the lower-order word, and displaying the second-order word of the higher-order word Before the same-order words of the second-order words; the number of words contained in the higher-order words is greater than the number of words contained in the lower-order words, the number of words contained in the two same-order words is the same, and the second-order words of the higher-order words contain the same number of words.
  • the first-order word is the word contained in the higher-order word;
  • the second voice information is the phonetic information corresponding to the higher-order word including the target candidate word;
  • the first indication information is obtained according to the long-press operation Instructions.
  • the processor 1710 when displaying the corresponding candidate words in the candidate information area in the information editing interface according to the received first input, is further configured to: the second text information in the target area is In the case of selection, according to the second text information, determine the second target pinyin;
  • the candidate words corresponding to the second target pinyin are displayed in the candidate information area
  • the first text information in the target area includes the second text information.
  • the processor 1710 is further configured to: display a delete control and a correction completion confirmation control on the candidate information area, so that The deletion control includes at least one of forward deletion control and backward deletion control;
  • the forward deletion control is used to receive an instruction to delete the character at the left adjacent position of the first target position in the information editing interface
  • the backward deletion control is used to receive an instruction to delete the character at the right adjacent position of the first target position in the information editing interface
  • the correction completion confirmation control is used to receive an instruction to move the cursor to the end of the sentence of the text information in the target area;
  • the first target position is determined according to the user's operation in the information editing interface.
  • the processor 1710 when the processor 1710 corrects the first text information in the target area according to the target candidate words, the processor 1710 is further configured to: use the delete control to modify the text corresponding to the second indication information in the target area information is deleted; according to the second indication information, a second target position is determined, and target candidate words are displayed in the second target position; wherein, the second indication information is obtained according to the user's operation in the target area.
  • the user by converting the user's input into the corresponding pinyin information, and developing candidate words for the pinyin, the user can easily correct the recognition errors that occur during voice input, and quickly and accurately obtain the desired input. content, improve the efficiency of voice input, and improve the user experience.
  • the input unit 1704 may include a graphics processor (Graphics Processing Unit, GPU) 17041 and a microphone 17042. Such as camera) to obtain still pictures or video image data for processing.
  • the display unit 1706 may include a display panel 17061, which may be configured in the form of a liquid crystal display, an organic light emitting diode, or the like.
  • the user input unit 1707 includes a touch panel 17071 and other input devices 17072 .
  • the touch panel 17071 is also called a touch screen.
  • the touch panel 17071 may include two parts, a touch detection device and a touch controller.
  • Other input devices 17072 may include, but are not limited to, physical keyboards, function keys (such as volume control keys, switch keys, etc.), trackballs, mice, and joysticks, which will not be repeated here.
  • Memory 1709 may be used to store software programs as well as various data including, but not limited to, application programs and operating systems.
  • the processor 1710 may integrate an application processor and a modem processor, wherein the application processor mainly handles the operating system, user interface, and application programs, and the like, and the modem processor mainly handles wireless communication. It can be understood that, the above-mentioned modulation and demodulation processor may not be integrated into the processor 1710.
  • Embodiments of the present application further provide a readable storage medium, where a program or an instruction is stored on the readable storage medium, and when the program or instruction is executed by a processor, each process of the above information modification method embodiment can be achieved, and the same can be achieved. In order to avoid repetition, the technical effect will not be repeated here.
  • the processor is the processor in the electronic device described in the foregoing embodiments.
  • the readable storage medium includes a computer-readable storage medium, such as a computer read-only memory (Read-Only Memory, ROM), a random access memory (Random Access Memory, RAM), a magnetic disk or an optical disk, and the like.
  • An embodiment of the present application further provides a chip, where the chip includes a processor and a communication interface, the communication interface is coupled to the processor, and the processor is configured to run a program or an instruction to implement the information modification method embodiments described above.
  • the chip includes a processor and a communication interface
  • the communication interface is coupled to the processor
  • the processor is configured to run a program or an instruction to implement the information modification method embodiments described above.
  • the chip mentioned in the embodiments of the present application may also be referred to as a system-on-chip, a system-on-chip, a system-on-a-chip, or a system-on-a-chip, or the like.
  • An embodiment of the present application further provides a computer program product, where the computer program product is stored in a non-volatile storage medium, and the computer program product is executed by at least one processor to implement each of the foregoing information correction method embodiments process, and can achieve the same technical effect, in order to avoid repetition, it will not be repeated here.
  • the embodiment of the present application further provides an electronic device, which is configured to execute each process of the above-mentioned information correction method embodiment, and can achieve the same technical effect. To avoid repetition, details are not described here.
  • the method of the above embodiment can be implemented by means of software plus a necessary general hardware platform, and of course can also be implemented by hardware, but in many cases the former is better implementation.
  • the technical solution of the present application can be embodied in the form of a software product in essence or in a part that contributes to the prior art, and the computer software product is stored in a storage medium (such as ROM/RAM, magnetic disk, CD-ROM), including several instructions to make a terminal (which may be a mobile phone, a computer, a server, an air conditioner, or a network device, etc.) execute the methods described in the various embodiments of this application.
  • a storage medium such as ROM/RAM, magnetic disk, CD-ROM

Abstract

The present application relates to the technical field of communications, and discloses an information correction method and apparatus, and an electronic device. The information correction method comprises: displaying, in a target area of an information editing interface, first text information converted and generated according to first voice information; according to a received first input, displaying corresponding candidate words in a candidate information area of the information editing interface; obtaining first indication information according to an operation by a user in the candidate information area, the first indication information being used for indicating a target candidate word among the candidate words; and correcting the first text information in the target area according to the target candidate word.

Description

信息修正方法、装置及电子设备Information correction method, device and electronic equipment
相关申请的交叉引用CROSS-REFERENCE TO RELATED APPLICATIONS
本申请主张在2020年12月30日在中国提交的中国专利申请No.202011608156.X的优先权,其全部内容通过引用包含于此。This application claims priority to Chinese Patent Application No. 202011608156.X filed in China on December 30, 2020, the entire contents of which are incorporated herein by reference.
技术领域technical field
本申请属于通信技术领域,具体涉及一种信息修正方法、装置及电子设备。The present application belongs to the field of communication technologies, and specifically relates to an information correction method, device and electronic device.
背景技术Background technique
目前,用户在移动终端上使用输入法来表达自己思想的过程中,经常用到的是拼音输入法和语音输入法。At present, in the process of using an input method on a mobile terminal to express one's own thoughts, the pinyin input method and the voice input method are often used.
随着语音识别的准确率持续提升,语音输入法被更多用户所青睐。然而,语音输入法虽然具有快速和易于使用的优点,但也存在一个严重的缺点:当用户说一个较长的句子时,语音识别后的句子里经常会出现识别错误的词语,造成识别得到的文本信息准确率低,需要用户进行修改识别错误的词语。As the accuracy of speech recognition continues to improve, the voice input method is favored by more users. However, although the voice input method has the advantages of being fast and easy to use, it also has a serious disadvantage: when the user speaks a long sentence, there are often misrecognized words in the sentence after speech recognition, resulting in the recognition of wrong words. The accuracy of text information is low, requiring users to modify and identify wrong words.
在现有技术中,在用户修改通过语音识别得到的文本信息过程中,只能通过键盘输入法输入文本来对错误词语进行修正,导致输入效率降低。In the prior art, in the process of modifying the text information obtained by the voice recognition, the user can only correct the wrong words by inputting the text through the keyboard input method, which reduces the input efficiency.
发明内容SUMMARY OF THE INVENTION
本申请实施例的目的是提供一种信息修正方法、装置及电子设备,能够解决现有技术中语音输入法的输入效率有待提高的问题。The purpose of the embodiments of the present application is to provide an information correction method, device and electronic device, which can solve the problem that the input efficiency of the voice input method in the prior art needs to be improved.
为了解决上述技术问题,本申请是这样实现的:In order to solve the above technical problems, this application is implemented as follows:
第一方面,本申请实施例提供了一种信息修正方法,该方法包括:In a first aspect, an embodiment of the present application provides an information correction method, the method comprising:
将根据第一语音信息转化生成的第一文字信息显示在信息编辑界面中的目标区域;The first text information that is converted and generated according to the first voice information is displayed in the target area in the information editing interface;
根据接收到的第一输入,在所述信息编辑界面中的候选信息区域显示对应的候选词;According to the received first input, the corresponding candidate word is displayed in the candidate information area in the information editing interface;
根据用户在所述候选信息区域的操作,获得第一指示信息,所述第一指示信息用于指示所述候选词中的目标候选词;Obtain first indication information according to the user's operation in the candidate information area, where the first indication information is used to indicate the target candidate word in the candidate word;
根据所述目标候选词,对所述目标区域中的第一文字信息进行修正。According to the target candidate word, the first text information in the target area is modified.
第二方面,本申请实施例提供了一种信息修正装置,包括:In a second aspect, an embodiment of the present application provides an information correction device, including:
显示模块,用于将根据第一语音信息转化生成的第一文字信息显示在信息编辑界面中的目标区域;a display module, used for displaying the first text information converted and generated according to the first voice information in the target area in the information editing interface;
处理模块,用于根据接收到的第一输入,在所述信息编辑界面中的候选信息区域显示对应的候选词;a processing module, configured to display corresponding candidate words in the candidate information area in the information editing interface according to the received first input;
目标确定模块,用于根据用户在所述候选信息区域的操作,获得第一指示信息,所述第一指示信息用于指示所述候选词中的目标候选词;a target determination module, configured to obtain first indication information according to a user's operation in the candidate information area, where the first indication information is used to indicate a target candidate word in the candidate word;
信息修正模块,用于根据所述目标候选词,对所述目标区域中的第一文字信息进行修正。An information modification module, configured to modify the first text information in the target area according to the target candidate word.
第三方面,本申请实施例提供了一种电子设备,该电子设备包括处理器、存储器及存储在所述存储器上并可在所述处理器上运行的程序或指令,所述程序或指令被所述处理器执行时实现如第一方面所述的方法的步骤。In a third aspect, embodiments of the present application provide an electronic device, the electronic device includes a processor, a memory, and a program or instruction stored on the memory and executable on the processor, the program or instruction being The processor implements the steps of the method according to the first aspect when executed.
第四方面,本申请实施例提供了一种可读存储介质,所述可读存储介质上存储程序或指令,所述程序或指令被处理器执行时实现如第一方面所述的方法的步骤。In a fourth aspect, an embodiment of the present application provides a readable storage medium, where a program or an instruction is stored on the readable storage medium, and when the program or instruction is executed by a processor, the steps of the method according to the first aspect are implemented .
第五方面,本申请实施例提供了一种芯片,所述芯片包括处理器和通信接口,所述通信接口和所述处理器耦合,所述处理器用于运行程序或指令,实现如第一方面所述的方法。In a fifth aspect, an embodiment of the present application provides a chip, the chip includes a processor and a communication interface, the communication interface is coupled to the processor, and the processor is configured to run a program or an instruction, and implement the first aspect the method described.
第六方面,本申请实施例提供了一种计算机程序产品,所述计算机程序产品被存储在非易失的存储介质中,所述计算机程序产品被至少一个处理器执行以实现如第一方面所述的方法。In a sixth aspect, an embodiment of the present application provides a computer program product, the computer program product is stored in a non-volatile storage medium, and the computer program product is executed by at least one processor to implement the first aspect. method described.
第七方面,本申请实施例提供了一种电子设备,所述电子设备被配置为执行如第一方面所述的方法。In a seventh aspect, an embodiment of the present application provides an electronic device, where the electronic device is configured to execute the method described in the first aspect.
在本申请实施例中,通过将用户的输入转化为对应的拼音信息,并针对该拼音展开候选词,使得用户可以方便地修正语音输入时发生的识别错误情况,快速准确地获得想要输入的内容,提高了语音输入效率,提升了用户体 验。In the embodiment of the present application, by converting the user's input into the corresponding pinyin information, and developing candidate words for the pinyin, the user can easily correct the recognition errors that occur during voice input, and quickly and accurately obtain the desired input. content, improve the efficiency of voice input, and improve the user experience.
附图说明Description of drawings
图1是本申请实施例的信息修正方法的流程图之一;Fig. 1 is one of the flow charts of the information correction method of the embodiment of the present application;
图2是本申请实施例的语音输入界面的示意图之一;Fig. 2 is one of the schematic diagrams of the voice input interface of the embodiment of the present application;
图3是本申请实施例的修正模式下的信息编辑界面的示意图之一;3 is one of the schematic diagrams of the information editing interface under the correction mode of the embodiment of the present application;
图4是本申请实施例的修正模式下的信息编辑界面删除操作示意图之一;4 is one of the schematic diagrams of the deletion operation of the information editing interface under the correction mode of the embodiment of the present application;
图5是本申请实施例的修正模式下的确定目标候选词的过程示意图之一;5 is one of the schematic diagrams of the process of determining target candidate words in the correction mode of the embodiment of the present application;
图6是本申请实施例的修正模式下的确定目标候选词的过程示意图之二;6 is the second schematic diagram of the process of determining target candidate words in the correction mode according to the embodiment of the present application;
图7是本申请实施例的修正模式下的修正完成的示意图之一;7 is one of the schematic diagrams of the completion of the correction in the correction mode of the embodiment of the present application;
图8是本申请实施例信息修正方法的流程图之二;FIG. 8 is the second flow chart of the information correction method according to the embodiment of the present application;
图9是本申请实施例的语音输入界面的示意图之二;9 is the second schematic diagram of the voice input interface according to the embodiment of the present application;
图10是本申请实施例的修正模式下的信息编辑界面的示意图之二;10 is the second schematic diagram of the information editing interface in the correction mode of the embodiment of the present application;
图11是本申请实施例的修正模式下的信息编辑界面删除操作示意图之二;11 is the second schematic diagram of the deletion operation of the information editing interface in the correction mode according to the embodiment of the present application;
图12是本申请实施例的修正模式下的确定目标候选词的过程示意图之三;12 is the third schematic diagram of the process of determining the target candidate word in the correction mode according to the embodiment of the present application;
图13是本申请实施例的修正模式下的修正完成的示意图之二;13 is the second schematic diagram of the completion of the correction in the correction mode of the embodiment of the present application;
图14是本申请实施例信息修正方法的流程图之三;FIG. 14 is the third flowchart of the information correction method according to the embodiment of the present application;
图15是本申请实施例的显示装置的框图;15 is a block diagram of a display device according to an embodiment of the present application;
图16是本申请实施例的电子设备的框图;16 is a block diagram of an electronic device according to an embodiment of the present application;
图17是本申请实施例的电子设备的硬件结构示意图。FIG. 17 is a schematic diagram of a hardware structure of an electronic device according to an embodiment of the present application.
具体实施方式Detailed ways
下面将结合本申请实施例中的附图,对本申请实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例是本申请一部分实施例,而不是全部的实施例。基于本申请中的实施例,本领域普通技术人员在没有作出创造性劳动前提下所获得的所有其他实施例,都属于本申请保护的范围。The technical solutions in the embodiments of the present application will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present application. Obviously, the described embodiments are part of the embodiments of the present application, not all of the embodiments. Based on the embodiments in the present application, all other embodiments obtained by those of ordinary skill in the art without creative work fall within the protection scope of the present application.
本申请的说明书和权利要求书中的术语“第一”、“第二”等是用于区别 类似的对象,而不用于描述特定的顺序或先后次序。应该理解这样使用的数据在适当情况下可以互换,以便本申请的实施例能够以除了在这里图示或描述的那些以外的顺序实施,且“第一”、“第二”等所区分的对象通常为一类,并不限定对象的个数,例如第一对象可以是一个,也可以是多个。此外,说明书以及权利要求中“和/或”表示所连接对象的至少其中之一,字符“/”,一般表示前后关联对象是一种“或”的关系。The terms "first", "second" and the like in the description and claims of the present application are used to distinguish similar objects, and are not used to describe a specific order or sequence. It is to be understood that the data so used are interchangeable under appropriate circumstances so that the embodiments of the present application can be practiced in sequences other than those illustrated or described herein, and distinguish between "first", "second", etc. The objects are usually of one type, and the number of objects is not limited. For example, the first object may be one or more than one. In addition, "and/or" in the description and claims indicates at least one of the connected objects, and the character "/" generally indicates that the associated objects are in an "or" relationship.
为使本领域技术人员能够更好地理解本申请实施例,先进行如下说明。In order to enable those skilled in the art to better understand the embodiments of the present application, the following description is first provided.
多级候选:用户输入了一连串正确拼音,但是想要的句子不在候选列表中,而需要一级一级地选择更低阶的各个词来组成想输入的句子,这个逐级选择的过程叫做多级候选。Multi-level candidate: The user inputs a series of correct pinyin, but the desired sentence is not in the candidate list, and needs to select lower-level words one by one to form the desired sentence. This step-by-step selection process is called multi-level selection. level candidate.
多阶词(n-gram):我们把一个具有较完整意义的词叫做单阶词(unigram)或一阶词。比如“美”,“杭州”,“拱宸桥”等。而由N个单阶词组合起来的词就叫作多阶词,比如“杭州拱宸桥”由两个单阶词组成,它就是多阶词中的二阶词(bigram)。Multi-order words (n-gram): We call a word with a more complete meaning a unigram or a first-order word. Such as "beauty", "Hangzhou", "Gongchen Bridge" and so on. A word composed of N single-order words is called a multi-order word. For example, "Hangzhou Gongchen Bridge" consists of two single-order words, which is the second-order word (bigram) in the multi-order word.
高阶词和次阶词:这是一个有联系的相对的概念,如果一个多阶词A是有另一些多阶词或单阶词组成的,那么A就是其他这些词的高阶词,其他这些词就是A的次阶词,也叫A的低阶词。比如“杭州拱宸桥”是“杭州”的高阶词,“杭州”和“拱宸桥”都是“杭州拱宸桥”的次阶词。Higher-order words and second-order words: This is a related concept. If a multi-order word A is composed of other multi-order words or single-order words, then A is the higher-order word of these other words, other These words are the second-order words of A, also called the lower-order words of A. For example, "Hangzhou Gongchen Bridge" is a higher-order word of "Hangzhou", and "Hangzhou" and "Gongchen Bridge" are both secondary words of "Hangzhou Gongchen Bridge".
带权有限状态转换机(Weighted Finite State Transducer,WFST):是一种高效的白盒图模型。建好的模型,可以根据一系列输入,自动选择符合每个输入条件的路径,从起始状态一直走到终止状态,同时把这些路径上的序列依次输出。可以用来实现拼音字符转词汇,词汇转句子等功能。此外,还可以将多个模型复合成一个,高效地实现多个模型级联的效果,比如实现拼音字符直接转句子。Weighted Finite State Transducer (WFST): It is an efficient white-box model. The built model can automatically select a path that meets each input condition according to a series of inputs, from the initial state to the end state, and output the sequences on these paths in turn. It can be used to realize the functions of pinyin characters to vocabulary, vocabulary to sentences and so on. In addition, multiple models can also be combined into one to efficiently achieve the effect of cascading multiple models, such as the direct translation of Pinyin characters into sentences.
下面结合附图,通过具体的实施例及其应用场景对本申请实施例提供的信息修正方法、装置及电子设备进行详细地说明。The information correction method, apparatus, and electronic device provided by the embodiments of the present application will be described in detail below with reference to the accompanying drawings through specific embodiments and application scenarios thereof.
如图1所示,本申请实施例提供了一种信息修正方法,包括:As shown in FIG. 1 , an embodiment of the present application provides an information correction method, including:
步骤11:将根据第一语音信息转化生成的第一文字信息显示在信息编辑界面中的目标区域。Step 11: Display the first text information converted and generated according to the first voice information in the target area in the information editing interface.
这里,用户使用语音输入法整句输入一段语音后,根据该语音识别出的文字直接上屏,显示在目标区域,可以充分发挥语音输入按键少的优势。Here, after the user uses the voice input method to input a whole sentence of voice, the text recognized according to the voice is directly displayed on the screen and displayed in the target area, which can give full play to the advantage of fewer voice input buttons.
步骤12:根据接收到的第一输入,在所述信息编辑界面中的候选信息区域显示对应的候选词。Step 12: Display the corresponding candidate words in the candidate information area in the information editing interface according to the received first input.
这里,在用户发现语音识别出的文字跟想要输入的文字不同而需要纠正时,可以由用户的操作触发启动修正模式,比如通过用户点击或长按目标区域内的文字触发。修正模式下,可以根据接收到的第一输入对识别出的文字进行修正。Here, when the user finds that the text recognized by the voice is different from the text that he wants to input and needs to be corrected, the correction mode can be triggered by the user's operation, such as triggering by the user clicking or long-pressing the text in the target area. In the correction mode, the recognized characters can be corrected according to the received first input.
例如,第一输入为语音时,用户可以根据该语音进行修正操作。根据用户此时输入的语音(即第二语音信息),转化生成候选词供用户进行选择。具体的,在修正模式下,可以将用户采用语音输入想要的词语的语音识别转成拼音,然后再根据该拼音来展开多阶候选词,供用户多级选择正确的目标候选词。For example, when the first input is a voice, the user can perform a correction operation according to the voice. According to the voice input by the user at this time (ie, the second voice information), candidate words are converted and generated for the user to select. Specifically, in the correction mode, the voice recognition of the desired word input by the user by using the voice can be converted into pinyin, and then multi-level candidate words can be expanded according to the pinyin for the user to select the correct target candidate word in multiple levels.
又例如,第一输入为用户在目标区域内选中的第二文字信息,则可以根据该第二文字信息进行拼音识别,并显示第二目标拼音对应的候选词,供用户选择。For another example, if the first input is second text information selected by the user in the target area, pinyin recognition may be performed according to the second text information, and candidate words corresponding to the second target pinyin are displayed for the user to select.
步骤13:根据用户在所述候选信息区域的操作,获得第一指示信息,所述第一指示信息用于指示所述候选词中的目标候选词。Step 13: Obtain first indication information according to the user's operation in the candidate information area, where the first indication information is used to indicate a target candidate word in the candidate words.
本步骤中,所述第一指示信息可以根据用户的单击或长按等操作获得。例如,在用户单击候选词时得到第一指示信息,将用户单击选择的候选词作为目标候选词。In this step, the first indication information may be obtained according to operations such as a single click or a long press of the user. For example, the first indication information is obtained when the user clicks a candidate word, and the candidate word selected by the user's click is used as the target candidate word.
步骤14:根据所述目标候选词,对所述目标区域中的第一文字信息进行修正。Step 14: Modify the first text information in the target area according to the target candidate word.
需要说明的是,由于汉语拼音大约有407个,即使算上声调,也比汉字30000多个要少的多,因此,语音转拼音的准确率比较高。It should be noted that since there are about 407 Chinese pinyin, even if the tones are counted, it is much less than the more than 30,000 Chinese characters. Therefore, the accuracy of phonetic conversion is relatively high.
本申请实施例利用上述特点,在用户采用语音输入法输入想要的文字时,将用户输入的语音识别转化成文本后直接上屏,可以充分发挥语音输入按键少的优势。而在用户需要对上屏后的文本修改时,可以开启修正模式。在该模式下,用户依然可以通过语音输入想要输入的词或句的读音。这时,本实 施例不会将该语音识别转化成汉字直接上屏,而是先采用语音识别方式识别出该语音对应的拼音,然后根据这些拼音展开多阶的候选词供用户选择。这样,通过将语音信息识别转化成候选词,使得用户可以通过多级候选方式选择目标候选词,而不必通过拼音输入去按很多键去对文本修正,避免打断语音输入的节奏,提升了语音输入的效率。The embodiments of the present application take advantage of the above features, when the user inputs the desired text using the voice input method, the voice input by the user is converted into text and directly displayed on the screen, which can give full play to the advantage of fewer voice input buttons. When the user needs to modify the text after the screen is displayed, the correction mode can be turned on. In this mode, the user can still input the pronunciation of the desired word or sentence by voice. At this time, the present embodiment will not convert the speech recognition into Chinese characters and directly display it on the screen, but first adopts the speech recognition mode to identify the corresponding pinyin of the speech, and then expands multi-level candidate words for the user to select according to these pinyin. In this way, by converting the speech information into candidate words, the user can select the target candidate word through a multi-level candidate method, without having to press many keys to correct the text through pinyin input, avoid interrupting the rhythm of the speech input, and improve the speech input efficiency.
可选地,所述信息修正方法还包括:在根据用户在所述信息编辑界面中的操作确定第一目标位置位于所述目标区域内显示的文字信息之间,或所述目标区域内的第二文字信息被选中时,开启所述修正模式;其中,所述目标区域中的第一文字信息包括所述第二文字信息。Optionally, the information modification method further includes: determining that the first target position is located between the text information displayed in the target area, or the first target position in the target area, according to the user's operation in the information editing interface. When the second text information is selected, the correction mode is enabled; wherein, the first text information in the target area includes the second text information.
例如:在用户将光标移到目标区域内需要修正的位置时,或者用户长按所述目标区域内的文字时,即可开启修正模式。For example, when the user moves the cursor to a position that needs to be corrected in the target area, or when the user presses the text in the target area for a long time, the correction mode can be turned on.
可选地,所述根据接收到的第一输入,在所述信息编辑界面中的候选信息区域显示对应的候选词,包括:根据接收到的第二语音信息,获得第一目标拼音;根据所述第一目标拼音,在所述在候选信息区域显示所述第二语音信息对应的候选词。Optionally, the displaying the corresponding candidate words in the candidate information area in the information editing interface according to the received first input includes: obtaining the first target pinyin according to the received second voice information; The first target pinyin is described, and the candidate words corresponding to the second phonetic information are displayed in the candidate information area.
具体而言,由于WFST里面包含的语言模型模块可以使拼音转短语达到较好的水平,本申请实施例可以使用WFST将拼音转词汇后再转短语为用户提供候选词,并按照权重(比如阶次、概率等)将候选词从高到低排列。需要说明的是,这些候选词中不仅有匹配所有拼音的高阶词,还有匹配部分拼音的高频低阶词。这样,可以用户避免一级一级按键盘输入各个低阶词的拼音的繁琐,也避免了由于短词的歧义性大而导致选择成本增大的问题。因而,能较快捷地获取到用户想要的词来对错误文本进行修正。Specifically, since the language model module included in WFST can make pinyin-to-phrase conversion to a better level, this embodiment of the present application can use WFST to convert pinyin to vocabulary and then convert to phrases to provide candidate words for users, and according to the weight (for example, rank order) order, probability, etc.) to rank candidate words from high to low. It should be noted that these candidate words not only include high-order words that match all pinyin, but also high-frequency and low-order words that match part of pinyin. In this way, the user can avoid the tediousness of inputting the pinyin of each low-level word by pressing the keyboard step by step, and also avoid the problem that the selection cost increases due to the large ambiguity of short words. Therefore, the words desired by the user can be obtained more quickly to correct the erroneous text.
本实施例中输入时所需要的用户操作比较少,适合用来修正输入相对较长(比如2阶到4阶之间)的词组,以及使用频率较高的相对较短(通常是1,2阶,少数3阶)的词。用户可以在这些各阶的候选词中选择想输入的目标候选词,这个过程有可能需要多级消耗拼音来完成。In this embodiment, fewer user operations are required for input, and it is suitable for correcting relatively long input phrases (such as between 2nd and 4th order), and relatively short words with high frequency (usually 1, 2 order, a few words of order 3). The user can select the target candidate word that he wants to input from the candidate words of each order, and this process may require multi-level consumption of pinyin to complete.
需要说明的是,在根据第二语音信息,确定第一目标拼音过程中,用户可以对语音识别转化来的拼音进行是否正确的确认。可选地,将接收到的第二语音信息转化为对应的拼音信息,并显示在所述信息编辑界面中的拼音信 息区域。也就是说,可以将根据所述第二语音信息识别转化来的拼音在信息编辑界面中的某一个区域(比如拼音信息区域)显示,而用户可以在此区域编辑修改该拼音。此时,可以将该区域显示的拼音作为第一目标拼音,而在候选信息区域显示第一目标拼音对应的候选词。这样,用户在对拼音进行确认的过程中,如果发现拼音识别错误,可以通过键盘对其进行修改;如果发现拼音识别正确,就可以直接在候选信息区域进行目标候选词的确认操作。It should be noted that, in the process of determining the first target pinyin according to the second speech information, the user can confirm whether the pinyin converted by speech recognition is correct. Optionally, the received second voice information is converted into corresponding pinyin information, and displayed in the pinyin information area in the information editing interface. That is to say, the pinyin recognized and converted according to the second voice information can be displayed in a certain area (such as the pinyin information area) in the information editing interface, and the user can edit and modify the pinyin in this area. At this time, the pinyin displayed in this area can be used as the first target pinyin, and the candidate words corresponding to the first target pinyin are displayed in the candidate information area. In this way, in the process of confirming pinyin, if the user finds that the pinyin recognition is wrong, he can modify it through the keyboard; if he finds that the pinyin recognition is correct, he can directly confirm the target candidate word in the candidate information area.
该实施例中,针对目标候选词的确认操作,可以是在用户点击所述候选词的情况下,将所述候选词替换所述候选词对应的拼音信息显示在拼音信息区域,直至所述拼音信息区域内的拼音信息均被替换时,将所述拼音信息区域内的所述候选词作为目标候选词。In this embodiment, the confirmation operation for the target candidate word may be that when the user clicks on the candidate word, the candidate word replaces the pinyin information corresponding to the candidate word and displays it in the pinyin information area until the pinyin information is displayed in the pinyin information area. When all the pinyin information in the information area is replaced, the candidate word in the pinyin information area is used as the target candidate word.
比如,在修正模式下,通过对用户语音输入“防疫物资”的读音进行识别转化后,在“拼音信息区域”显示fang’yi’wu’zi这串拼音,在候选信息区域显示“放一屋子”、“防疫”和“防溢”等候选词,用户点选“防疫”后,消耗掉拼音“fang’yi”,“拼音信息区域”显示为防疫wu’zi,并进入二级选择,在候选信息区域显示“屋子”、“物资”等“wu’zi”对应的候选词,用户点选“物资”后,fang’yi’wu’zi这串拼音全部被消耗掉,“防疫”和“物资”组成的词被确认为目标候选词,直接上屏显示在目标位置。For example, in the correction mode, after identifying and transforming the pronunciation of the user's voice input of "epidemic prevention materials", the string of pinyin fang'yi'wu'zi is displayed in the "pinyin information area", and the "Put a room" is displayed in the candidate information area. ", "epidemic prevention" and "overflow prevention" and other candidate words, after the user clicks "epidemic prevention", the pinyin "fang'yi" is consumed, and the "pinyin information area" is displayed as epidemic prevention wu'zi, and enters the secondary selection. The candidate information area displays the candidate words corresponding to "wu'zi" such as "house" and "materials". After the user clicks on "materials", all the pinyin strings of fang'yi'wu'zi are consumed, and "epidemic prevention" and "" The word composed of "materials" is confirmed as the target candidate word, and it is directly displayed on the screen at the target position.
可选地,所述根据接收到的第一输入,在所述信息编辑界面中的候选信息区域显示对应的候选词,包括:根据接收到的第二语音信息以及预设规则,在所述在候选信息区域显示所述第二语音信息对应的候选词;Optionally, the displaying the corresponding candidate words in the candidate information area in the information editing interface according to the received first input includes: according to the received second voice information and preset rules, in the The candidate information area displays candidate words corresponding to the second voice information;
其中,所述预设规则包括:在所述候选词为词组的情况下,将所述候选词中的高阶词显示在低阶词之前,以及,将所述高阶词的次阶词显示在所述次阶词的同阶词之前;高阶词所包含的词的数量大于低阶词所包含的词的数量,两个同阶词所包含的词的数量相同,高阶词的次阶词为包含在所述高阶词中的词;所述第二语音信息为包含所述目标候选词的高阶词所对应的语音信息;所述第一指示信息为根据长按操作得到的指示信息。Wherein, the preset rule includes: when the candidate word is a phrase, displaying the higher-order word in the candidate word before the lower-order word, and displaying the second-order word of the higher-order word Before the same-order words of the second-order words; the number of words contained in the higher-order words is greater than the number of words contained in the lower-order words, the number of words contained in the two same-order words is the same, and the second-order words of the higher-order words contain the same number of words. The first-order word is the word contained in the higher-order word; the second voice information is the phonetic information corresponding to the higher-order word including the target candidate word; the first indication information is obtained according to the long-press operation Instructions.
比如,用户需要输入的是“显性”这个词时,与其直接输入“显性”的语音,不如输入“显性基因”的语音,这样更容易得到“显性”这个候选词。该实施例中,根据用户长按候选词得到第一指示信息,将用户长按选择的候 选词作为目标候选词。For example, when the user needs to input the word "dominant", instead of directly inputting the "dominant" voice, it is better to input the voice of "dominant gene", which makes it easier to obtain the candidate word "dominant". In this embodiment, the first indication information is obtained according to the user's long-pressing on the candidate word, and the candidate word selected by the user's long-pressing is used as the target candidate word.
需要说明的是,在用户需要输入的词不太常见且较短的情况下,根据语言模型的理论,构成句子的词语越多,其歧义性就越低,需要选择的高阶候选词数目就会越少。也就是说,输入的词组越长就越常用,其选择操作的成本就越低。It should be noted that, in the case where the words that the user needs to input are less common and shorter, according to the theory of the language model, the more words that constitute a sentence, the lower the ambiguity, and the number of high-level candidate words that need to be selected is will be less. That is, the longer and more commonly used phrases are entered, the lower the cost of the selection operation.
因此,对于1阶的单字词或二字词,如果这个词不太常用,可以用语音输入包含这个词语的更长的常用词,该常用词为包含想要输入的词语的高阶词,然后用户可以在多阶候选词中选择高阶词对应的次阶词(也正是用户要的词)直接上屏,这个次阶词借助了输入的更长的词来准确快速地获取。Therefore, for a first-order one-character word or two-character word, if the word is not commonly used, you can use voice to input a longer common word containing this word, and the common word is a high-level word containing the word you want to input. Then the user can select the second-order word corresponding to the higher-order word (that is, the word that the user wants) from the multi-order candidate words and directly display it on the screen. This second-order word can be obtained accurately and quickly with the help of the input longer word.
具体来说,在本实施例里,修正模式下的语言模型会给越常见的组合以越大的权重,让其排在候选词中的靠前位置。根据预设规则,同阶词不完全按照概率来排序,而是让高阶词包含的次阶词,能出现在与这个次阶词同阶的词的前面。举例来说,单阶词“防疫”是高阶词“防疫措施”的一部分,因此在众多一阶词中,会保证“防疫”排在“放一”或“防溢”等词的前面。Specifically, in this embodiment, the language model in the correction mode will give a greater weight to the more common combination, so that it ranks at the top of the candidate words. According to the preset rules, the same-order words are not completely sorted by probability, but the second-order words contained in the higher-order words can appear in front of the words of the same order as this second-order word. For example, the single-order word "epidemic prevention" is part of the higher-order word "epidemic prevention measures", so among many first-order words, "epidemic prevention" is guaranteed to come before words such as "put one" or "overflow prevention".
可选地,所述信息修正方法还包括:将接收到的第二语音信息转化为对应的拼音信息,并显示在所述信息编辑界面中的拼音信息区域。Optionally, the information correction method further includes: converting the received second voice information into corresponding pinyin information, and displaying the information in the pinyin information area in the information editing interface.
比如,用户在输入“显性基因”的语音后,可以在根据“显性基因”识别转化出拼音“xian’xing’ji’yin”,并将该拼音显示在拼音信息区域,将该拼音对应的候选词显示在候选信息区域。For example, after inputting the voice of "dominant gene", the user can identify and convert the pinyin "xian'xing'ji'yin" according to the "dominant gene", and display the pinyin in the pinyin information area, and the pinyin corresponds to The candidate words of are displayed in the candidate information area.
比如,用户在输入“显性基因”的语音后,在候选词中,找到低阶词“显性”长按后,“显性”直接上屏,显示在目标位置,而多余的拼音“ji’yin”则可以被丢弃掉。通过这种方式,可以避免多次按键盘输入拼音,能够比较快捷地获取到想要词语,从而可以提升输入效率。For example, after the user enters the voice of "dominant gene", in the candidate words, after finding the low-order word "dominant" and long-pressing, "dominant" is directly displayed on the screen and displayed at the target position, and the redundant pinyin "ji" 'yin' can be discarded. In this way, multiple pressing of the keyboard to input pinyin can be avoided, and the desired word can be obtained more quickly, thereby improving the input efficiency.
可选地,所述根据接收到的第一输入,在所述信息编辑界面中的候选信息区域显示对应的候选词,包括:在所述目标区域内的第二文字信息被选中的情况下,根据所述第二文字信息,确定第二目标拼音;根据所述第二目标拼音,在所述在候选信息区域显示所述第二目标拼音对应的候选词;其中,所述目标区域中的第一文字信息包括所述第二文字信息。Optionally, the displaying the corresponding candidate words in the candidate information area in the information editing interface according to the received first input includes: when the second text information in the target area is selected, According to the second text information, determine the second target pinyin; according to the second target pinyin, display the candidate words corresponding to the second target pinyin in the candidate information area; wherein, the first target in the target area A text message includes the second text message.
比如,用户在用语音输入一句话后,发现语音识别转化的文本“现在家 里屋子都摆满了”中的“屋子”识别错误,正确文本应该是“物资”,用户可以在目标区域长按“屋子”二字。根据用户长按操作选中的“屋子”二字进行识别转化为其拼音“wu’zi”,并在候选信息区域显示“屋子”、“物资”等“wu’zi”对应的候选词,用户点选“物资”后,“物资”二字会替代目标区域中用户选择的“屋子”一词。For example, after a user enters a sentence with voice, he finds that the "house" in the text "now the house is full" converted by voice recognition is incorrectly recognized. The correct text should be "materials", and the user can long press " house". According to the user's long-press operation, the selected word "house" is identified and converted into its pinyin "wu'zi", and the candidate words corresponding to "wu'zi" such as "house" and "materials" are displayed in the candidate information area. After selecting "Material", the word "Material" will replace the word "House" selected by the user in the target area.
可选地,所述根据所述目标候选词,对所述目标区域中的第一文字信息进行修正,包括:通过所述删除控件对目标区域中与第二指示信息对应的文字信息进行删除;根据所述第二指示信息,确定第二目标位置,将目标候选词显示在所述第二目标位置;其中,所述第二指示信息根据用户在所述目标区域的操作获得。Optionally, the modifying the first text information in the target area according to the target candidate word includes: deleting the text information corresponding to the second indication information in the target area through the deletion control; For the second indication information, a second target position is determined, and target candidate words are displayed at the second target position; wherein, the second indication information is obtained according to the user's operation in the target area.
例如,第二指示信息可以根据用户点击候选信息区域上的“向前删除控件k2”或“向后删除控件k3”的操作获得,对目标区域中发生识别错误的文字进行删除。具体的,可以通过所述删除控件删除所述信息编辑界面中目标位置的至少一个相邻位置的字符。For example, the second indication information can be obtained according to the user's operation of clicking "delete control k2 forward" or "delete control k3 backward" on the candidate information area, and delete the text in the target area with a recognition error. Specifically, the character in at least one adjacent position of the target position in the information editing interface can be deleted through the delete control.
可选地,所述信息修正方法还包括:根据第三指示信息,定位至所述目标区域中的文字信息的句尾,退出修正模式。Optionally, the information correction method further includes: according to the third indication information, locating to the end of the sentence of the text information in the target area, and exiting the correction mode.
这里,第三指示信息可以根据用户点击候选信息区域上的“修正完成确认控件k1”的操作获得。用户利用“修正完成确认控件k1”,可以方便地把光标移到段落最后,退出修正模式。Here, the third indication information may be obtained according to the operation of the user clicking the "correction completion confirmation control k1" on the candidate information area. The user can conveniently move the cursor to the end of the paragraph by using the "correction completion confirmation control k1" to exit the correction mode.
可选地,根据所述目标候选词,对所述目标区域中的第一文字信息进行修正之前,所述信息修正方法还包括:在所述候选信息区域上显示删除控件和修正完成确认控件k1,所述删除控件包括向前删除控件k2和向后删除控件k3中的至少一种;Optionally, before correcting the first text information in the target area according to the target candidate word, the information correction method further includes: displaying a deletion control and a correction completion confirmation control k1 on the candidate information area, The deletion control includes at least one of forward deletion control k2 and backward deletion control k3;
其中,所述向前删除控件k2用于接收删除所述信息编辑界面中第一目标位置的左侧相邻位置的字符的指令;Wherein, the forward deletion control k2 is used to receive an instruction to delete the character at the left adjacent position of the first target position in the information editing interface;
所述向后删除控件k3用于接收删除所述信息编辑界面中第一目标位置的右侧相邻位置的字符的指令;The backward deletion control k3 is used to receive an instruction to delete the character at the adjacent position on the right side of the first target position in the information editing interface;
所述修正完成确认控件k1用于接收将光标移动至所述目标区域中的文字信息的句尾的指令;The correction completion confirmation control k1 is used to receive an instruction to move the cursor to the end of the sentence of the text information in the target area;
所述第一目标位置根据用户在所述信息编辑界面中的操作确定。The first target position is determined according to the user's operation in the information editing interface.
该实施例中,通过向前删除控件k2可以删除信息编辑界面中目标位置的左侧相邻位置的字符;通过向后删除控件k3可以删除信息编辑界面中目标位置的右侧相邻位置的字符;通过修正完成确认控件k1可以将光标移动至目标区域中的文字信息的句尾。In this embodiment, the characters at the left adjacent position of the target position in the information editing interface can be deleted by forwardly deleting the control k2; the characters at the right adjacent position of the target position in the information editing interface can be deleted by backward deleting the control k3 ;The cursor can be moved to the end of the text message in the target area by modifying the completion confirmation control k1.
也就是说,在所述修正模式下,除了有拼音键盘和语音按键外,还会提供向后删除控件k3。在用户需要删除上屏的错误词语片段(即信息编辑界面中目标位置的相邻位置的字符)时,除了向前删除,也可以向后删除与目标位置相邻接的文字。That is to say, in the correction mode, in addition to the pinyin keyboard and the voice keys, a backward deletion control k3 is also provided. When the user needs to delete the wrong word segment on the upper screen (that is, the characters adjacent to the target position in the information editing interface), in addition to forward deletion, the text adjacent to the target position can also be deleted backward.
该修正模式还提供一个“修正完成确认控件k1”,可以退出修正模式,一键回到正常模式。比如,当用户修正完所有的错误词语后,按下“修正完成确认控件k1”就可以把光标移到目标区域内的句尾,准备开始新的输入。通过这类快捷键,可以有效提高输入效率。The correction mode also provides a "correction completion confirmation control k1", which can exit the correction mode and return to the normal mode with one key. For example, after the user has corrected all the wrong words, press the "correction completion confirmation control k1" to move the cursor to the end of the sentence in the target area, ready to start a new input. This type of shortcut keys can effectively improve input efficiency.
为了便于直观地理解本申请实施例,下面通过图形化方式解释本申请实施例。In order to facilitate intuitive understanding of the embodiments of the present application, the embodiments of the present application are explained below in a graphical manner.
实施例一:Example 1:
如图2至7所示,用户想输入一句话:“现在家里防疫物资都摆满了”。As shown in Figures 2 to 7, the user wants to input a sentence: "Now the home is full of anti-epidemic materials".
如图2所示,用户使用语音输入法整句读了一段语音,然后该语音被错误识别为“现在家里放一屋子都摆满了”,识别出的文字直接上屏。此时,用户发现“放一屋子”跟想要输入的文字不同,需要纠正。As shown in Figure 2, the user uses the voice input method to read a whole sentence of speech, and then the speech is mistakenly recognized as "the house is full now", and the recognized text is directly displayed on the screen. At this time, the user finds that "put in a room" is different from the text he wants to enter, and needs to be corrected.
如图3所示,可以将光标移至出错位置,此时触发开启修正模式。在该模式下,可以按下如图3所示的“向前删除控件k2”或“向后删除控件k3”,对发生识别错误的文字进行删除。这样,可以方便地删除目标位置的至少一个相邻字符。As shown in Figure 3, the cursor can be moved to the error position, and the correction mode is triggered at this time. In this mode, you can press "delete control k2 forward" or "delete control k3 backward" as shown in Figure 3 to delete the text that has a recognition error. In this way, at least one adjacent character at the target position can be easily deleted.
如图4所示,删除错误文字后,信息编辑界面中的目标区域显示为“现在家里都摆满了”。此时,为了正确输入“防疫物资”几个字,用户可以点击下方语音识别按钮,仍然使用语音来对“防疫物资”进行输入。As shown in Figure 4, after deleting the wrong text, the target area in the information editing interface is displayed as "now the house is full". At this time, in order to correctly input the words "epidemic prevention materials", users can click the voice recognition button below and still use voice to input "epidemic prevention materials".
如图5所示,此时的语音识别引擎可以和正常模式的语音识别引擎共用大部分模块,也可以是独立的两个引擎。比如说,修正模式的引擎是针对语 音转拼音来特别优化了的,这样可以保证该引擎将语音转成拼音时的较高准确率。针对fang’yi’wu’zi这串拼音,本申请可以采用WFST模型,将该串拼音转化成很多多阶词语,比如“放一屋子”是一个bigram,由“放一”和“屋子”两个单阶词组成。同时,也会转化成一些低阶词,比如单阶词“防疫”和“防溢”等。此时,由于图5所示界面中没有“防疫物资”而只有“防疫”,因此,用户在这一级可以先选择“防疫”这个词(比如通过单击该词选择),消耗掉对应拼音“fang’yi”后,还剩下拼音“wu’zi”。As shown in FIG. 5 , the speech recognition engine at this time may share most of the modules with the speech recognition engine in the normal mode, or may be two independent engines. For example, the engine of the correction mode is specially optimized for the phonetic-to-pinyin conversion, which can ensure a higher accuracy rate when the engine converts the phonetic to pinyin. For the string of pinyin fang'yi'wu'zi, this application can use the WFST model to convert the string of pinyin into many multi-level words, for example, "put a room" is a bigram, which consists of two words "put one" and "room" composed of single-order words. At the same time, it will also be transformed into some low-level words, such as the single-level words "epidemic prevention" and "overflow prevention". At this time, since there is no "epidemic prevention material" but only "epidemic prevention" in the interface shown in Figure 5, the user can first select the word "epidemic prevention" at this level (for example, by clicking on the word to select), and consume the corresponding pinyin After "fang'yi", the pinyin "wu'zi" remains.
如图6所示,在选择“防疫”后,进入第二级选择界面,此时可以选择“物资”这个词,消耗掉对应拼音“wu’zi”,由于“fang’yi’wu’zi”这串拼音全部已被消耗完,此时可将用户选择的“防疫”和“物资”完成上屏。需要说明的是,通过语音输入的短语越常见,语言模型将这个短语对应的多阶词或单阶词排在前面的概率也就越高。如果用户认为此时所有的词语已经修正完毕,可以按下图6中所示的“修正完成确认控件k1”,就可以方便地把光标移到段落最后,自动退出修正模式。As shown in Figure 6, after selecting "epidemic prevention", enter the second-level selection interface. At this time, you can select the word "materials" to consume the corresponding pinyin "wu'zi", because "fang'yi'wu'zi" This string of pinyin has been used up, and the "epidemic prevention" and "materials" selected by the user can be displayed on the screen. It should be noted that the more common the phrase entered through the voice, the higher the probability that the language model will rank the multi-order word or single-order word corresponding to the phrase in the front. If the user thinks that all the words have been corrected at this time, he can press the "correction completion confirmation control k1" shown in Figure 6, and the cursor can be conveniently moved to the end of the paragraph to automatically exit the correction mode.
如图7所示,退出修正模式后,再按语音识别键,就可以开始新的语音输入了。As shown in Figure 7, after exiting the correction mode, press the voice recognition key again to start a new voice input.
本实施例中,在语音输入法中增加了多级候选功能,对于语音识别出的一段话(也可以是通过拼音输入法输入的一段话)中不准确的小部分片段进行修正时,采用语音输入想要的词语的语音后,将该语音识别转成拼音,然后再根据该拼音来展开多阶候选词,供用户多级选择正确的目标候选词。In this embodiment, a multi-level candidate function is added to the voice input method, and when correcting a small inaccurate segment of a sentence (or a sentence input through the Pinyin input method) recognized by the voice, the voice input method is used After inputting the voice of the desired word, the voice recognition is converted into pinyin, and then multi-level candidate words are expanded according to the pinyin, so that the user can select the correct target candidate word in multiple levels.
上述实施例的流程图如图8所示,具体包括以下步骤:The flow chart of the above embodiment is shown in Figure 8, which specifically includes the following steps:
步骤801:语音识别用户输入的语音(通常是一句话),识别转化成文本后直接上屏,可以充分发挥语音输入按键少的优势。Step 801 : the voice input by the user (usually a sentence) is recognized by voice, and the recognition is converted into text and then directly displayed on the screen, which can give full play to the advantage of fewer keys for voice input.
步骤802:上屏的文本是否有错误词语;没有错误,则执行步骤803;如果有错误,则执行步骤804。Step 802: Check whether there is any wrong word in the text on the upper screen; if there is no error, go to Step 803; if there is an error, go to Step 804.
步骤803:继续用语音输入下一句。Step 803: Continue to input the next sentence by voice.
步骤804:定位至(可以通过移动光标定位)目标位置,即其中一处错误词语处。当把光标移动到错误词语处时,启动修正模式,在该模式下,除了提供“向前删除控件k2”,还提供一个可以向后删除文字的按键(即“向 后删除控件k3”),这样可以更方便删除错误文字。Step 804: Locate (by moving the cursor) the target position, that is, one of the wrong words. When the cursor is moved to the wrong word, the correction mode is activated. In this mode, in addition to the "forward delete control k2", a button that can delete the text backward (ie "backward delete control k3") is provided, This makes it easier to remove erroneous text.
步骤805:通过语音输入所要想要输入的文字所对应的语音。Step 805: Input the voice corresponding to the text to be input by voice.
步骤806:本申请实施例优化后的语音识别引擎主要针对短语而非长句进行转化,并且,不是直接将语音转为汉字,而是先转为拼音。采用这样的转化方式的原因是:不同的汉语拼音大概只有407个,即使算上声调,也比汉字30000多个要少的多,因此将语音转为拼音串的准确率比较高,这样转化相较于比转为汉字串来说要容易。Step 806: The optimized speech recognition engine in the embodiment of the present application mainly converts phrases rather than long sentences, and instead of directly converting the speech into Chinese characters, it first converts the speech into Pinyin. The reason for adopting this conversion method is that there are only about 407 different Chinese pinyin, even if the tones are counted, it is much less than the more than 30,000 Chinese characters. Therefore, the accuracy rate of converting the phonetic into a pinyin string is relatively high. It is easier than converting to a Chinese character string.
步骤807:用户可以对语音识别转化来的拼音进行是否正确的确认;正确,则执行步骤809,否则,执行步骤808;Step 807: The user can confirm whether the pinyin converted by speech recognition is correct; if it is correct, go to Step 809, otherwise, go to Step 808;
步骤808:如果拼音识别错误,用户仍然可以通过键盘对其进行修改。当然,应该尽量避免这种情况,因为会增加按键操作次数。Step 808: If the pinyin recognition is wrong, the user can still modify it through the keyboard. Of course, this should be avoided as much as possible, as it will increase the number of keystrokes.
步骤809:针对正确的拼音(即第一目标拼音),训练好的WFST可以将拼音转为多阶候选词,并按照权重(比如阶次、概率等)从高到低排列。这个WFST可以是两个模型的复合:即拼音转一阶词汇,及词汇的WFST语言模型。Step 809: For the correct pinyin (ie, the first target pinyin), the trained WFST can convert the pinyin into multi-order candidate words, and arrange them according to the weight (such as order, probability, etc.) from high to low. This WFST can be a composite of two models: the pinyin-to-first-order vocabulary, and the WFST language model of the vocabulary.
步骤810:用户根据候选词,可以通过单击的候选词的方式,选择和目标词语一致的尽可能长的词,消耗掉对应拼音,直到所有拼音消耗完,完成想要的词的上屏。Step 810: According to the candidate words, the user can select a word as long as possible that is consistent with the target word by clicking the candidate word, and consume the corresponding pinyin until all the pinyin is consumed, and the desired word is displayed on the screen.
步骤811:判断拼音是否全部消耗完;是,则执行步骤812;否,则执行步骤810,即继续选择候选词,消耗对应拼音。Step 811: Determine whether all the pinyin has been consumed; if yes, go to step 812; if not, go to step 810, that is, continue to select candidate words and consume the corresponding pinyin.
步骤812:所有拼音消耗完时,完成想要输入的词或短语的上屏。Step 812: When all the pinyin is exhausted, complete the screen of the word or phrase you want to input.
步骤813:用户可以继续对信息编辑界面中的目标区域的文字进行正确与否的确认,查看句中是否还有其他错误片段。是,则执行步骤804;否,则执行步骤814。也就是说,句中有多处错误时,可以通过反复执行步骤804到步骤813,直至句子的错误片段全部被修正。Step 813: The user can continue to confirm whether the text in the target area in the information editing interface is correct or not, and check whether there are other incorrect segments in the sentence. If yes, go to step 804; if not, go to step 814. That is to say, when there are multiple errors in the sentence, steps 804 to 813 can be executed repeatedly until all the error segments of the sentence are corrected.
步骤814:用户确认句子已经修正完毕时,可以按下“修正完成确认控件k1”,此时,光标会自动移到目标区域里文本的尾部,准备开始下一句的输入,这样可以避免移动光标的定位操作。Step 814: When the user confirms that the sentence has been corrected, he can press the "correction completion confirmation control k1", at this time, the cursor will automatically move to the end of the text in the target area, ready to start the input of the next sentence, so as to avoid moving the cursor. Positioning operation.
上述实施例中,如果一共出现1处错误,则需要移动光标1次、录入语 音1次以及按键7次。其中,7次按键操作包含删除错误词4次(分别删除“放”、“一”、“屋”、“子”)、语音按键1次和两级选词2次。相对来说,如果用拼音输入法输入上述语音输入的一串文本,即使用简拼也需要很多按键次数;如果用传统语音输入法,需要再移动光标到错误词语处,再用键盘输入拼音去修正,也需要很多按键次数,并且会打断语音输入的节奏。而实施例中,大部分操作是用在语音和选词上,把按键输入降到较低次数,能够实现用较少的定位和按键次数来获得一个正确的句子输入,从而使得语音输入效率显著提升。用户可以通过本申请提供的信息修正方法快捷修正文本中错误词语,达到了快速和准确地输入大段文本的目的。In the above embodiment, if there is one error in total, it is necessary to move the cursor once, enter the voice once, and press the key seven times. Among them, the 7 key operations include 4 deletions of the wrong word (respectively delete "put", "one", "house", "zi"), 1 voice button and 2 two-level word selection. Relatively speaking, if you use the Pinyin input method to input a string of text input by the above voice, even using Jianpin will require a lot of keystrokes; if you use the traditional phonetic input method, you need to move the cursor to the wrong word, and then use the keyboard to input Pinyin to Correction, also requires a lot of keystrokes and interrupts the rhythm of voice input. In the embodiment, most of the operations are used in voice and word selection, and the number of keystrokes is reduced to a lower number, so that a correct sentence input can be obtained with less positioning and keystrokes, thereby making the voice input more efficient. promote. The user can quickly correct the wrong words in the text through the information correction method provided in this application, so as to achieve the purpose of quickly and accurately inputting large pieces of text.
实施例二:Embodiment 2:
如图7、图9至13所示,假设用户仍是想输入:“现在家里防疫物资都摆满了”。As shown in Figures 7 and 9 to 13, it is assumed that the user still wants to input: "Now the home is full of anti-epidemic materials".
如图9所示,用户使用语音输入法输入了一段语音,假设被错误识别为“现在家里翻译物资都摆满了”,识别出的文字直接上屏。此时,用户发现“翻译”跟想要的文字不同,需要纠正。As shown in Figure 9, the user uses the voice input method to input a piece of speech. Suppose it is mistakenly recognized as "now the house is full of translation materials", and the recognized text is directly displayed on the screen. At this time, the user finds that the "translation" is different from the desired text and needs to be corrected.
如图10所示,可以将光标移至出错位置,此时触发开启修正模式。用户可以通过“向前删除控件k2”删除“翻译”二字。As shown in Figure 10, the cursor can be moved to the error position, and the correction mode is triggered at this time. The user can delete the word "translation" through the "delete control k2 forward".
如图11所示,删除错误文字后,界面显示如图11所示,信息编辑界面中的目标区域显示为“现在家里物资都摆满了”。为了正确输入“防疫”二字,用户可以点击下方语音识别键,重新输入语音。此时,为了提高“防疫”的出词概率,用户可以选择语音输入“防疫措施”,而不是直接输入“防疫”的语音。As shown in Figure 11, after deleting the wrong text, the interface is displayed as shown in Figure 11, and the target area in the information editing interface is displayed as "Now the house is full of supplies". In order to correctly enter the word "epidemic prevention", users can click the voice recognition button below to re-enter the voice. At this time, in order to improve the probability of the word "epidemic prevention", users can choose to input "epidemic prevention measures" by voice instead of directly inputting the voice of "epidemic prevention".
如图12所示,用户输入“防疫措施”的语音后,针对识别出的fang’yi’cuo’shi这串拼音,可以采用WFST模型,将该串拼音转化成很多多阶词语,比如“防疫措施”是一个bigram,由“防疫”和“措施”组成,同时,也会转化成一些低阶词,比如“防疫”和“放一”是单阶词。通常,高阶词总是排列在低阶词的前面;而在同阶的词里面,排序主要是根据WFST模型概率来排序。但是,在本实施例里,根据预设规则,同阶词不完全按照概率来排序,而是让高阶词包含的次阶词,能出现在与这个次阶词同阶的词 的前面。举例来说,这里面的单阶词“防疫”是高阶词“防疫措施”的一部分,因此在众多一阶词中,会保证“防疫”排在“放一”或“防溢”等词的前面。As shown in Figure 12, after the user enters the voice of "epidemic prevention measures", for the identified string of pinyin fang'yi'cuo'shi, the WFST model can be used to convert the string of pinyin into many multi-level words, such as "epidemic prevention measures". "Measures" is a bigram consisting of "epidemic prevention" and "measures", and at the same time, it will also be transformed into some low-level words, such as "epidemic prevention" and "put one" are single-level words. Usually, high-order words are always arranged in front of low-order words; and in words of the same order, the sorting is mainly based on the probability of the WFST model. However, in this embodiment, according to the preset rules, the same-order words are not completely sorted by probability, but the second-order words contained in the higher-order words can appear in front of the words of the same order as this second-order word. For example, the single-order word "epidemic prevention" is part of the higher-order word "epidemic prevention measures", so among many first-order words, "epidemic prevention" will be guaranteed to be ranked in words such as "put one" or "overflow prevention" in front of.
如图13所示,用户长按目标候选词“防疫”二字,目的是让“防疫”二字直接上屏,完成正确词语的输入。这个过程中,会自动忽略掉未消耗掉的拼音“cuo’shi”。需要说明的是,语音输入的短语越常见,语言模型会保证这个词语及其次阶词出现的概率也就越高。如果用户认为此时所有的词语已经修正完毕,可以按下图13中所示的“修正完成确认控件k1”,就可以方便地把光标移到段落最后,自动退出修正模式。As shown in Figure 13, the user long presses the target candidate word "epidemic prevention", the purpose is to make the word "epidemic prevention" directly on the screen to complete the input of the correct word. During this process, the unconsumed pinyin "cuo'shi" will be automatically ignored. It should be noted that the more common the phrase entered by the voice, the higher the probability that the language model will ensure that the word and its subordinate words appear. If the user thinks that all the words have been corrected at this time, he can press the "correction completion confirmation control k1" shown in Figure 13, and the cursor can be conveniently moved to the end of the paragraph to automatically exit the correction mode.
如图7所示,退出修正模式后,再按语音识别键,就可以开始新的语音输入了。As shown in Figure 7, after exiting the correction mode, press the voice recognition key again to start a new voice input.
上述实施例的流程图如图14所示,具体包括以下步骤:The flow chart of the above embodiment is shown in Figure 14, which specifically includes the following steps:
步骤1401:语音识别用户输入的语音(通常是一句话),识别转化成文本后直接上屏。Step 1401: Voice recognition of the voice input by the user (usually a sentence), and the recognition is converted into text and directly displayed on the screen.
步骤1402:上屏的文本是否有错误词语;没有错误,则执行步骤1403;如果有错误,则执行步骤1404。Step 1402: Check whether there is any wrong word in the text on the upper screen; if there is no error, go to Step 1403; if there is an error, go to Step 1404.
步骤1403:继续用语音输入下一句。Step 1403: Continue to input the next sentence by voice.
步骤1404:可以通过移动光标定位至目标位置,即其中一处错误词语处。当把光标移动到错误词语处时,启动修正模式,在该模式下,可以在目标位置向前或向后(即目标位置的相邻位置)删除错误文字。Step 1404: Move the cursor to locate the target position, that is, one of the wrong words. When the cursor is moved to the wrong word, the correction mode is activated, and in this mode, the wrong word can be deleted forward or backward at the target position (ie, adjacent to the target position).
步骤1405:在输入某个不常用的低阶词(通常是1阶或2阶)时,为了提高该词的出词概率,用户可以选择通过语音输入包含该低阶词的高阶词(越常用越好)的语音,而不是直接输入该低阶词。语言模型会给越常见的组合以越大的权重,让其排在候选词中的靠前位置。Step 1405: When inputting an uncommon low-order word (usually 1st-order or 2nd-order), in order to improve the word probability of the word, the user can choose to input a high-order word (the more Commonly used as possible) speech instead of directly entering the low-order word. The language model will give more weight to the more common combinations, ranking them higher in the candidate words.
步骤1406:优化后的语音引擎针对用户输入的语音转化成对应的拼音。Step 1406: The optimized speech engine converts the speech input by the user into corresponding pinyin.
步骤1407:针对识别出的正确拼音,WFST模型会把它转化成很多多阶词语,比如“防疫措施”是一个二阶词,由“防疫”和“措施”组成,“放一”和“防疫”是单阶词。通常,高阶词总是出现在低阶词的前面,在同阶的词里面,排序主要是根据WFST模型概率来排序。本申请实施例中,根据预设 规则,将候选词中的高阶词包含的次阶词排列在与这个次阶词同阶的其他候选词前面。这里,正是借助了高阶词有很大概率排在前面,使得该高阶词的次阶词借助高阶词的正确概率有机会出现在候选词中较靠前的位置。当然可以理解的是,该次阶词只是在其同阶词中排位靠前,在其他的高阶词还是排位靠后的。Step 1407: For the identified correct pinyin, the WFST model will convert it into many multi-level words. For example, "epidemic prevention measures" is a second-order word consisting of "epidemic prevention" and "measures", "put one" and "epidemic prevention". " is a single-order word. Usually, high-order words always appear in front of low-order words. In words of the same order, the sorting is mainly based on the probability of the WFST model. In the embodiment of the present application, according to a preset rule, the second-order words included in the higher-order words in the candidate words are arranged in front of other candidate words of the same order as this second-order word. Here, it is precisely with the help of high-order words that there is a high probability of ranking in the front, so that the second-order words of the high-order word have the opportunity to appear in a higher position in the candidate words with the help of the correct probability of the high-order word. Of course, it is understandable that the second-order word is only ranked at the top of its same-order words, and it is still ranked at the back of other high-order words.
步骤1408:判断候选词中的次阶词中有没有用户想要的词;有,则执行步骤1410;没有,用户则可以选择对拼音进行修改,即执行步骤1409,或者可以使用语音重新输入其他包含所要输入词语的常用高阶词所对应的语音,即执行步骤1405。Step 1408: Determine whether there is a word that the user wants in the second-order words in the candidate words; if yes, go to Step 1410; if not, the user can choose to modify the pinyin, that is, go to Step 1409, or can use voice to re-input other words. Step 1405 is executed to include the speech corresponding to the commonly used higher-order words of the words to be input.
步骤1409:用户可以通过键盘对拼音进行修改。Step 1409: The user can modify the pinyin through the keyboard.
步骤1410:由于用户已经从候选词中发现了所想要的词,虽然此时还有多余的拼音未消耗掉,这些多余拼音的使命(即帮助找到用户需要的词)也已经完成了。于是,用户可以通过长按目标候选词,使其直接上屏,而未消耗掉拼音会被忽略掉,这样,即可完成正确词语的输入。Step 1410: Since the user has found the desired word from the candidate words, although there are still redundant pinyin not consumed at this time, the mission of these redundant pinyin (ie, helping to find the word that the user needs) has been completed. Therefore, the user can press and hold the target candidate word to make it appear on the screen directly, and the pinyin that is not consumed will be ignored, so that the input of the correct word can be completed.
步骤1411:用户可以继续对信息编辑界面中的目标区域的文字进行正确与否的确认,查看句中是否还有其他错误片段。是,则执行步骤1404;否,则执行步骤1412。Step 1411: The user can continue to confirm whether the text in the target area in the information editing interface is correct or not, and check whether there are other incorrect segments in the sentence. If yes, go to step 1404; if not, go to step 1412.
步骤1412:用户确认句子已经修正完毕时,可以按下“修正完成确认控件k1”,此时,光标会自动移到目标区域里文本的尾部,准备开始下一句的输入,这样可以避免移动光标的定位操作。Step 1412: When the user confirms that the sentence has been corrected, he can press the "correction completion confirmation control k1", at this time, the cursor will automatically move to the end of the text in the target area, ready to start the input of the next sentence, so as to avoid moving the cursor. Positioning operation.
本实施例提供的信息修正方法比较适合修正输入不太常见的较短的词(通常是1阶或2阶词,也可以是少数3阶词)。对于不常见的词,极端情况下,用户甚至需要依靠单字的拼音去一个个、一级级地找需要的词,非常费劲。本申请可以避免用户在候选词列表很靠后的位置去找想要的词,提高了输入效率。The information correction method provided in this embodiment is more suitable for correcting the input of less common shorter words (usually 1st-order or 2nd-order words, and may also be a few 3rd-order words). For uncommon words, in extreme cases, users even need to rely on the pinyin of each word to find the words they need one by one, which is very laborious. The present application can prevent the user from searching for the desired word at a very late position in the candidate word list, thereby improving the input efficiency.
通过大量的带声调的拼音和对应的语音数据中对修正模式的语音识别引擎进行优化,该引擎可以比正常模式的简单,却可以针对语音转拼音优化,使得转拼音的准确率提到较高水平。因此,也可以考虑在语音识别后且修正后的文本旁,附上对应的用户输入的语音信息。这样,一方面可以让信息接 收方更加确认文字的准确性,另一方面还可以提供非常好的平行语料,该平行语料可用于个性化提高用户语音引擎的准确率。Through a large number of tonal pinyin and the corresponding speech data, the speech recognition engine of the correction mode is optimized. The engine can be simpler than the normal mode, but it can be optimized for the conversion of phonetic pinyin, so that the accuracy rate of pinyin conversion is higher. Level. Therefore, it can also be considered that the corresponding voice information input by the user is attached beside the text after voice recognition and correction. In this way, on the one hand, the information receiver can confirm the accuracy of the text, and on the other hand, it can also provide a very good parallel corpus, and the parallel corpus can be used to improve the accuracy of the user's speech engine.
本申请实施例中,通过语音输入来获得想要输入的词语或短句的拼音,采用多级候选和高阶词辅助候选方式,选择该拼音对应的汉字,操作简便。In the embodiment of the present application, the pinyin of the word or short sentence to be input is obtained through voice input, and the Chinese character corresponding to the pinyin is selected by using multi-level candidates and high-level word auxiliary candidates, which is easy to operate.
需要说明的是,本申请实施例提供的信息修正方法,执行主体可以为信息修正装置,或者该信息修正装置中的用于执行信息修正方法的控制模块。本申请实施例中以信息修正装置执行信息修正方法为例,说明本申请实施例提供的信息修正装置。It should be noted that, in the information correction method provided by the embodiments of the present application, the execution body may be an information correction device, or a control module in the information correction device for executing the information correction method. In the embodiment of the present application, the information correction device provided by the embodiment of the present application is described by taking the information correction device executing the information correction method as an example.
如图15所示,本申请实施例还提供了一种信息修正装置1500,包括:As shown in FIG. 15 , an embodiment of the present application further provides an information correction apparatus 1500, including:
显示模块1501,用于将根据第一语音信息转化生成的第一文字信息显示在信息编辑界面中的目标区域;The display module 1501 is used to display the first text information converted and generated according to the first voice information in the target area in the information editing interface;
处理模块1502,用于根据接收到的第一输入,在所述信息编辑界面中的候选信息区域显示对应的候选词;The processing module 1502 is configured to display the corresponding candidate words in the candidate information area in the information editing interface according to the received first input;
目标确定模块1503,用于根据用户在所述候选信息区域的操作,获得第一指示信息,所述第一指示信息用于指示所述候选词中的目标候选词;A target determination module 1503, configured to obtain first indication information according to the user's operation in the candidate information area, where the first indication information is used to indicate the target candidate word in the candidate word;
信息修正模块1504,用于根据所述目标候选词,对所述目标区域中的第一文字信息进行修正。The information modification module 1504 is configured to modify the first text information in the target area according to the target candidate word.
可选地,所述处理模块1502包括:Optionally, the processing module 1502 includes:
第一处理单元,用于根据接收到的第二语音信息,获得第一目标拼音;a first processing unit, configured to obtain the first target pinyin according to the received second voice information;
第二处理单元,用于根据所述第一目标拼音,在所述在候选信息区域显示所述第二语音信息对应的候选词。The second processing unit is configured to display the candidate words corresponding to the second voice information in the candidate information area according to the first target pinyin.
可选地,所述处理模块1502包括:Optionally, the processing module 1502 includes:
第三处理单元,用于根据接收到的第二语音信息以及预设规则,在所述在候选信息区域显示所述第二语音信息对应的候选词;a third processing unit, configured to display the candidate words corresponding to the second voice information in the candidate information area according to the received second voice information and a preset rule;
其中,所述预设规则包括:在所述候选词为词组的情况下,将所述候选词中的高阶词显示在低阶词之前,以及,将所述高阶词的次阶词显示在所述次阶词的同阶词之前;高阶词所包含的词的数量大于低阶词所包含的词的数量,两个同阶词所包含的词的数量相同,高阶词的次阶词为包含在所述高阶词中的词;Wherein, the preset rule includes: when the candidate word is a phrase, displaying the higher-order word in the candidate word before the lower-order word, and displaying the second-order word of the higher-order word Before the same-order words of the second-order words; the number of words contained in the higher-order words is greater than the number of words contained in the lower-order words, the number of words contained in the two same-order words is the same, and the second-order words of the higher-order words contain the same number of words. An order word is a word contained in the higher order word;
所述第二语音信息为包含所述目标候选词的高阶词所对应的语音信息;The second voice information is the voice information corresponding to the higher-order words including the target candidate words;
所述第一指示信息为根据长按操作得到的指示信息。The first indication information is indication information obtained according to a long-press operation.
可选地,所述处理模块1502包括:Optionally, the processing module 1502 includes:
第四处理单元,用于在所述目标区域内的第二文字信息被选中的情况下,根据所述第二文字信息,确定第二目标拼音;a fourth processing unit, configured to determine a second target pinyin according to the second text information when the second text information in the target area is selected;
第五处理单元,用于根据所述第二目标拼音,在所述在候选信息区域显示所述第二目标拼音对应的候选词;a fifth processing unit, configured to display candidate words corresponding to the second target pinyin in the candidate information area according to the second target pinyin;
其中,所述目标区域中的第一文字信息包括所述第二文字信息。Wherein, the first text information in the target area includes the second text information.
可选地,所述信息修正装置1500还包括:Optionally, the information correction apparatus 1500 further includes:
控件显示模块,用于在所述候选信息区域上显示删除控件和修正完成确认控件,所述删除控件包括向前删除控件和向后删除控件中的至少一种;a control display module, configured to display a deletion control and a correction completion confirmation control on the candidate information area, the deletion control includes at least one of a forward deletion control and a backward deletion control;
其中,所述向前删除控件用于接收删除所述信息编辑界面中第一目标位置的左侧相邻位置的字符的指令;Wherein, the forward deletion control is used to receive an instruction to delete the character at the left adjacent position of the first target position in the information editing interface;
所述向后删除控件用于接收删除所述信息编辑界面中第一目标位置的右侧相邻位置的字符的指令;The backward deletion control is used to receive an instruction to delete the character at the right adjacent position of the first target position in the information editing interface;
所述修正完成确认控件用于接收将光标移动至所述目标区域中的文字信息的句尾的指令;The correction completion confirmation control is used to receive an instruction to move the cursor to the end of the sentence of the text information in the target area;
所述第一目标位置根据用户在所述信息编辑界面中的操作确定。The first target position is determined according to the user's operation in the information editing interface.
可选地,所述信息修正模块1504包括:Optionally, the information correction module 1504 includes:
第一修正单元,用于通过所述删除控件对目标区域中与第二指示信息对应的文字信息进行删除;a first correction unit, configured to delete the text information corresponding to the second indication information in the target area through the deletion control;
第二修正单元,用于根据所述第二指示信息,确定第二目标位置,将目标候选词显示在所述第二目标位置;a second correction unit, configured to determine a second target position according to the second indication information, and display the target candidate word at the second target position;
其中,所述第二指示信息根据用户在所述目标区域的操作获得。Wherein, the second indication information is obtained according to the user's operation in the target area.
在本申请实施例中,通过将用户的输入转化为对应的拼音信息,并针对该拼音展开候选词,使得用户可以方便地修正语音输入时发生的识别错误情况,快速准确地获得想要输入的内容,提高了语音输入效率,提升了用户体验。In the embodiment of the present application, by converting the user's input into the corresponding pinyin information, and developing candidate words for the pinyin, the user can easily correct the recognition errors that occur during voice input, and quickly and accurately obtain the desired input. content, improve the efficiency of voice input, and improve the user experience.
本申请实施例中的信息修正装置可以是装置,也可以是终端中的部件、 集成电路、或芯片。该装置可以是移动电子设备,也可以为非移动电子设备。示例性的,移动电子设备可以为手机、平板电脑、笔记本电脑、掌上电脑、车载电子设备、可穿戴设备、超级移动个人计算机(ultra-mobile personal computer,UMPC)、上网本或者个人数字助理(personal digital assistant,PDA)等,非移动电子设备可以为服务器、网络附属存储器(Network Attached Storage,NAS)、个人计算机(personal computer,PC)、电视机(television,TV)、柜员机或者自助机等,本申请实施例不作具体限定。The information correction device in this embodiment of the present application may be a device, or may be a component, an integrated circuit, or a chip in a terminal. The apparatus may be a mobile electronic device or a non-mobile electronic device. Exemplarily, the mobile electronic device may be a mobile phone, a tablet computer, a notebook computer, a palmtop computer, an in-vehicle electronic device, a wearable device, an ultra-mobile personal computer (UMPC), a netbook, or a personal digital assistant (personal digital assistant). assistant, PDA), etc., non-mobile electronic devices can be servers, network attached storage (Network Attached Storage, NAS), personal computer (personal computer, PC), television (television, TV), teller machine or self-service machine, etc., this application Examples are not specifically limited.
本申请实施例中的信息修正装置可以为具有操作系统的装置。该操作系统可以为安卓(Android)操作系统,可以为ios操作系统,还可以为其他可能的操作系统,本申请实施例不作具体限定。The information correction device in the embodiment of the present application may be a device with an operating system. The operating system may be an Android (Android) operating system, an ios operating system, or other possible operating systems, which are not specifically limited in the embodiments of the present application.
本申请实施例提供的信息修正装置能够实现图1至图14的方法实施例实现的各个过程,为避免重复,这里不再赘述。The information correction apparatus provided in this embodiment of the present application can implement each process implemented by the method embodiments in FIG. 1 to FIG. 14 , and to avoid repetition, details are not repeated here.
可选的,如图16所示,本申请实施例还提供一种电子设备1600,包括处理器1602,存储器1601,存储在存储器1601上并可在所述处理器1602上运行的程序或指令,该程序或指令被处理器1602执行时实现上述信息修正方法实施例的各个过程,且能达到相同的技术效果,为避免重复,这里不再赘述。Optionally, as shown in FIG. 16, an embodiment of the present application further provides an electronic device 1600, including a processor 1602, a memory 1601, a program or instruction stored in the memory 1601 and executable on the processor 1602, When the program or instruction is executed by the processor 1602, each process of the above-mentioned information correction method embodiment is implemented, and the same technical effect can be achieved. To avoid repetition, details are not described here.
需要说明的是,本申请实施例中的电子设备包括上述所述的移动电子设备和非移动电子设备。It should be noted that the electronic devices in the embodiments of the present application include the aforementioned mobile electronic devices and non-mobile electronic devices.
图17为实现本申请实施例的一种电子设备的硬件结构示意图。FIG. 17 is a schematic diagram of a hardware structure of an electronic device implementing an embodiment of the present application.
该电子设备1700包括但不限于:射频单元1701、网络模块1702、音频输出单元1703、输入单元1704、传感器1705、显示单元1706、用户输入单元1707、接口单元1708、存储器1709、以及处理器1710等部件。The electronic device 1700 includes but is not limited to: a radio frequency unit 1701, a network module 1702, an audio output unit 1703, an input unit 1704, a sensor 1705, a display unit 1706, a user input unit 1707, an interface unit 1708, a memory 1709, and a processor 1710, etc. part.
本领域技术人员可以理解,电子设备1700还可以包括给各个部件供电的电源(比如电池),电源可以通过电源管理系统与处理器1710逻辑相连,从而通过电源管理系统实现管理充电、放电、以及功耗管理等功能。图17中示出的电子设备结构并不构成对电子设备的限定,电子设备可以包括比图示更多或更少的部件,或者组合某些部件,或者不同的部件布置,在此不再赘述。Those skilled in the art can understand that the electronic device 1700 may also include a power source (such as a battery) for supplying power to various components, and the power source may be logically connected to the processor 1710 through a power management system, so as to manage charging, discharging, and power consumption through the power management system. consumption management and other functions. The structure of the electronic device shown in FIG. 17 does not constitute a limitation on the electronic device. The electronic device may include more or less components than those shown in the figure, or combine some components, or arrange different components, which will not be repeated here. .
其中,处理器1710,用于将根据第一语音信息转化生成的第一文字信息 显示在信息编辑界面中的目标区域;根据接收到的第一输入,在所述信息编辑界面中的候选信息区域显示对应的候选词;根据用户在所述候选信息区域的操作,获得第一指示信息,所述第一指示信息用于指示所述候选词中的目标候选词;根据所述目标候选词,对所述目标区域中的第一文字信息进行修正。The processor 1710 is configured to display the first text information converted and generated according to the first voice information in the target area in the information editing interface; according to the received first input, display in the candidate information area in the information editing interface corresponding candidate words; first indication information is obtained according to the user's operation in the candidate information area, and the first indication information is used to indicate the target candidate words in the candidate words; The first text information in the target area is modified.
在本申请实施例中,通过将用户的输入转化为对应的拼音信息,并针对该拼音展开候选词,使得用户可以方便地修正语音输入时发生的识别错误情况,快速准确地获得想要输入的内容,提高了语音输入效率,提升了用户体验。In the embodiment of the present application, by converting the user's input into the corresponding pinyin information, and developing candidate words for the pinyin, the user can easily correct the recognition errors that occur during voice input, and quickly and accurately obtain the desired input. content, improve the efficiency of voice input, and improve the user experience.
可选地,处理器1710在根据接收到的第一输入,在所述信息编辑界面中的候选信息区域显示对应的候选词时,还用于:根据接收到的第二语音信息,获得第一目标拼音;根据所述第一目标拼音,在所述在候选信息区域显示所述第二语音信息对应的候选词。Optionally, when displaying the corresponding candidate words in the candidate information area in the information editing interface according to the received first input, the processor 1710 is further configured to: obtain the first voice information according to the received second voice information target pinyin; according to the first target pinyin, display the candidate words corresponding to the second voice information in the candidate information area.
可选地,处理器1710在根据接收到的第一输入,在所述信息编辑界面中的候选信息区域显示对应的候选词时,还用于:根据接收到的第二语音信息以及预设规则,在所述在候选信息区域显示所述第二语音信息对应的候选词;Optionally, when displaying the corresponding candidate words in the candidate information area in the information editing interface according to the received first input, the processor 1710 is further configured to: according to the received second voice information and preset rules , displaying the candidate words corresponding to the second voice information in the candidate information area;
其中,所述预设规则包括:在所述候选词为词组的情况下,将所述候选词中的高阶词显示在低阶词之前,以及,将所述高阶词的次阶词显示在所述次阶词的同阶词之前;高阶词所包含的词的数量大于低阶词所包含的词的数量,两个同阶词所包含的词的数量相同,高阶词的次阶词为包含在所述高阶词中的词;所述第二语音信息为包含所述目标候选词的高阶词所对应的语音信息;所述第一指示信息为根据长按操作得到的指示信息。Wherein, the preset rule includes: when the candidate word is a phrase, displaying the higher-order word in the candidate word before the lower-order word, and displaying the second-order word of the higher-order word Before the same-order words of the second-order words; the number of words contained in the higher-order words is greater than the number of words contained in the lower-order words, the number of words contained in the two same-order words is the same, and the second-order words of the higher-order words contain the same number of words. The first-order word is the word contained in the higher-order word; the second voice information is the phonetic information corresponding to the higher-order word including the target candidate word; the first indication information is obtained according to the long-press operation Instructions.
可选地,处理器1710在根据接收到的第一输入,在所述信息编辑界面中的候选信息区域显示对应的候选词时,还用于:在所述目标区域内的第二文字信息被选中的情况下,根据所述第二文字信息,确定第二目标拼音;Optionally, when displaying the corresponding candidate words in the candidate information area in the information editing interface according to the received first input, the processor 1710 is further configured to: the second text information in the target area is In the case of selection, according to the second text information, determine the second target pinyin;
根据所述第二目标拼音,在所述在候选信息区域显示所述第二目标拼音对应的候选词;According to the second target pinyin, the candidate words corresponding to the second target pinyin are displayed in the candidate information area;
其中,所述目标区域中的第一文字信息包括所述第二文字信息。Wherein, the first text information in the target area includes the second text information.
可选地,处理器1710在根据所述目标候选词,对所述目标区域中的第一 文字信息进行修正之前,还用于:在所述候选信息区域上显示删除控件和修正完成确认控件,所述删除控件包括向前删除控件和向后删除控件中的至少一种;Optionally, before correcting the first text information in the target area according to the target candidate word, the processor 1710 is further configured to: display a delete control and a correction completion confirmation control on the candidate information area, so that The deletion control includes at least one of forward deletion control and backward deletion control;
其中,所述向前删除控件用于接收删除所述信息编辑界面中第一目标位置的左侧相邻位置的字符的指令;Wherein, the forward deletion control is used to receive an instruction to delete the character at the left adjacent position of the first target position in the information editing interface;
所述向后删除控件用于接收删除所述信息编辑界面中第一目标位置的右侧相邻位置的字符的指令;The backward deletion control is used to receive an instruction to delete the character at the right adjacent position of the first target position in the information editing interface;
所述修正完成确认控件用于接收将光标移动至所述目标区域中的文字信息的句尾的指令;The correction completion confirmation control is used to receive an instruction to move the cursor to the end of the sentence of the text information in the target area;
所述第一目标位置根据用户在所述信息编辑界面中的操作确定。The first target position is determined according to the user's operation in the information editing interface.
可选地,处理器1710在根据所述目标候选词,对所述目标区域中的第一文字信息进行修正时,还用于:通过所述删除控件对目标区域中与第二指示信息对应的文字信息进行删除;根据所述第二指示信息,确定第二目标位置,将目标候选词显示在所述第二目标位置;其中,所述第二指示信息根据用户在所述目标区域的操作获得。Optionally, when the processor 1710 corrects the first text information in the target area according to the target candidate words, the processor 1710 is further configured to: use the delete control to modify the text corresponding to the second indication information in the target area information is deleted; according to the second indication information, a second target position is determined, and target candidate words are displayed in the second target position; wherein, the second indication information is obtained according to the user's operation in the target area.
在本申请实施例中,通过将用户的输入转化为对应的拼音信息,并针对该拼音展开候选词,使得用户可以方便地修正语音输入时发生的识别错误情况,快速准确地获得想要输入的内容,提高了语音输入效率,提升了用户体验。In the embodiment of the present application, by converting the user's input into the corresponding pinyin information, and developing candidate words for the pinyin, the user can easily correct the recognition errors that occur during voice input, and quickly and accurately obtain the desired input. content, improve the efficiency of voice input, and improve the user experience.
应理解的是,本申请实施例中,输入单元1704可以包括图形处理器(Graphics Processing Unit,GPU)17041和麦克风17042,图形处理器17041对在视频捕获模式或图像捕获模式中由图像捕获装置(如摄像头)获得的静态图片或视频的图像数据进行处理。显示单元1706可包括显示面板17061,可以采用液晶显示器、有机发光二极管等形式来配置显示面板17061。用户输入单元1707包括触控面板17071以及其他输入设备17072。触控面板17071,也称为触摸屏。触控面板17071可包括触摸检测装置和触摸控制器两个部分。其他输入设备17072可以包括但不限于物理键盘、功能键(比如音量控制按键、开关按键等)、轨迹球、鼠标、操作杆,在此不再赘述。存储器1709可用于存储软件程序以及各种数据,包括但不限于应用程序和操作系统。处理 器1710可集成应用处理器和调制解调处理器,其中,应用处理器主要处理操作系统、用户界面和应用程序等,调制解调处理器主要处理无线通信。可以理解的是,上述调制解调处理器也可以不集成到处理器1710中。It should be understood that, in this embodiment of the present application, the input unit 1704 may include a graphics processor (Graphics Processing Unit, GPU) 17041 and a microphone 17042. Such as camera) to obtain still pictures or video image data for processing. The display unit 1706 may include a display panel 17061, which may be configured in the form of a liquid crystal display, an organic light emitting diode, or the like. The user input unit 1707 includes a touch panel 17071 and other input devices 17072 . The touch panel 17071 is also called a touch screen. The touch panel 17071 may include two parts, a touch detection device and a touch controller. Other input devices 17072 may include, but are not limited to, physical keyboards, function keys (such as volume control keys, switch keys, etc.), trackballs, mice, and joysticks, which will not be repeated here. Memory 1709 may be used to store software programs as well as various data including, but not limited to, application programs and operating systems. The processor 1710 may integrate an application processor and a modem processor, wherein the application processor mainly handles the operating system, user interface, and application programs, and the like, and the modem processor mainly handles wireless communication. It can be understood that, the above-mentioned modulation and demodulation processor may not be integrated into the processor 1710.
本申请实施例还提供一种可读存储介质,所述可读存储介质上存储有程序或指令,该程序或指令被处理器执行时实现上述信息修正方法实施例的各个过程,且能达到相同的技术效果,为避免重复,这里不再赘述。Embodiments of the present application further provide a readable storage medium, where a program or an instruction is stored on the readable storage medium, and when the program or instruction is executed by a processor, each process of the above information modification method embodiment can be achieved, and the same can be achieved. In order to avoid repetition, the technical effect will not be repeated here.
其中,所述处理器为上述实施例中所述的电子设备中的处理器。所述可读存储介质,包括计算机可读存储介质,如计算机只读存储器(Read-Only Memory,ROM)、随机存取存储器(Random Access Memory,RAM)、磁碟或者光盘等。Wherein, the processor is the processor in the electronic device described in the foregoing embodiments. The readable storage medium includes a computer-readable storage medium, such as a computer read-only memory (Read-Only Memory, ROM), a random access memory (Random Access Memory, RAM), a magnetic disk or an optical disk, and the like.
本申请实施例另提供了一种芯片,所述芯片包括处理器和通信接口,所述通信接口和所述处理器耦合,所述处理器用于运行程序或指令,实现上述信息修正方法实施例的各个过程,且能达到相同的技术效果,为避免重复,这里不再赘述。An embodiment of the present application further provides a chip, where the chip includes a processor and a communication interface, the communication interface is coupled to the processor, and the processor is configured to run a program or an instruction to implement the information modification method embodiments described above. Each process can achieve the same technical effect. In order to avoid repetition, it will not be repeated here.
应理解,本申请实施例提到的芯片还可以称为系统级芯片、系统芯片、芯片系统或片上系统芯片等。It should be understood that the chip mentioned in the embodiments of the present application may also be referred to as a system-on-chip, a system-on-chip, a system-on-a-chip, or a system-on-a-chip, or the like.
本申请实施例另提供了一种计算机程序产品,所述计算机程序产品被存储在非易失的存储介质中,所述计算机程序产品被至少一个处理器执行以实现上述信息修正方法实施例的各个过程,且能达到相同的技术效果,为避免重复,这里不再赘述。An embodiment of the present application further provides a computer program product, where the computer program product is stored in a non-volatile storage medium, and the computer program product is executed by at least one processor to implement each of the foregoing information correction method embodiments process, and can achieve the same technical effect, in order to avoid repetition, it will not be repeated here.
本申请实施例另提供了一种电子设备,被配置为执行上述信息修正方法实施例的各个过程,且能达到相同的技术效果,为避免重复,这里不再赘述。The embodiment of the present application further provides an electronic device, which is configured to execute each process of the above-mentioned information correction method embodiment, and can achieve the same technical effect. To avoid repetition, details are not described here.
需要说明的是,在本文中,术语“包括”、“包含”或者其任何其他变体意在涵盖非排他性的包含,从而使得包括一系列要素的过程、方法、物品或者装置不仅包括那些要素,而且还包括没有明确列出的其他要素,或者是还包括为这种过程、方法、物品或者装置所固有的要素。在没有更多限制的情况下,由语句“包括一个……”限定的要素,并不排除在包括该要素的过程、方法、物品或者装置中还存在另外的相同要素。此外,需要指出的是,本申请实施方式中的方法和装置的范围不限按示出或讨论的顺序来执行功能,还 可包括根据所涉及的功能按基本同时的方式或按相反的顺序来执行功能,例如,可以按不同于所描述的次序来执行所描述的方法,并且还可以添加、省去、或组合各种步骤。另外,参照某些示例所描述的特征可在其他示例中被组合。It should be noted that, herein, the terms "comprising", "comprising" or any other variation thereof are intended to encompass non-exclusive inclusion, such that a process, method, article or device comprising a series of elements includes not only those elements, It also includes other elements not expressly listed or inherent to such a process, method, article or apparatus. Without further limitation, an element qualified by the phrase "comprising a..." does not preclude the presence of additional identical elements in a process, method, article or apparatus that includes the element. Furthermore, it should be noted that the scope of the methods and apparatus in the embodiments of the present application is not limited to performing the functions in the order shown or discussed, but may also include performing the functions in a substantially simultaneous manner or in the reverse order depending on the functions involved. To perform functions, for example, the described methods may be performed in an order different from that described, and various steps may also be added, omitted, or combined. Additionally, features described with reference to some examples may be combined in other examples.
通过以上的实施方式的描述,本领域的技术人员可以清楚地了解到上述实施例方法可借助软件加必需的通用硬件平台的方式来实现,当然也可以通过硬件,但很多情况下前者是更佳的实施方式。基于这样的理解,本申请的技术方案本质上或者说对现有技术做出贡献的部分可以以软件产品的形式体现出来,该计算机软件产品存储在一个存储介质(如ROM/RAM、磁碟、光盘)中,包括若干指令用以使得一台终端(可以是手机,计算机,服务器,空调器,或者网络设备等)执行本申请各个实施例所述的方法。From the description of the above embodiments, those skilled in the art can clearly understand that the method of the above embodiment can be implemented by means of software plus a necessary general hardware platform, and of course can also be implemented by hardware, but in many cases the former is better implementation. Based on this understanding, the technical solution of the present application can be embodied in the form of a software product in essence or in a part that contributes to the prior art, and the computer software product is stored in a storage medium (such as ROM/RAM, magnetic disk, CD-ROM), including several instructions to make a terminal (which may be a mobile phone, a computer, a server, an air conditioner, or a network device, etc.) execute the methods described in the various embodiments of this application.
上面结合附图对本申请的实施例进行了描述,但是本申请并不局限于上述的具体实施方式,上述的具体实施方式仅仅是示意性的,而不是限制性的,本领域的普通技术人员在本申请的启示下,在不脱离本申请宗旨和权利要求所保护的范围情况下,还可做出很多形式,均属于本申请的保护之内。The embodiments of the present application have been described above in conjunction with the accompanying drawings, but the present application is not limited to the above-mentioned specific embodiments, which are merely illustrative rather than restrictive. Under the inspiration of this application, without departing from the scope of protection of the purpose of this application and the claims, many forms can be made, which all fall within the protection of this application.

Claims (17)

  1. 一种信息修正方法,包括:An information correction method comprising:
    将根据第一语音信息转化生成的第一文字信息显示在信息编辑界面中的目标区域;The first text information that is converted and generated according to the first voice information is displayed in the target area in the information editing interface;
    根据接收到的第一输入,在所述信息编辑界面中的候选信息区域显示对应的候选词;According to the received first input, the corresponding candidate word is displayed in the candidate information area in the information editing interface;
    根据用户在所述候选信息区域的操作,获得第一指示信息,所述第一指示信息用于指示所述候选词中的目标候选词;Obtain first indication information according to the user's operation in the candidate information area, where the first indication information is used to indicate the target candidate word in the candidate word;
    根据所述目标候选词,对所述目标区域中的第一文字信息进行修正。According to the target candidate word, the first text information in the target area is modified.
  2. 根据权利要求1所述的信息修正方法,其中,所述根据接收到的第一输入,在所述信息编辑界面中的候选信息区域显示对应的候选词,包括:The information correction method according to claim 1, wherein, according to the received first input, the corresponding candidate words are displayed in the candidate information area in the information editing interface, comprising:
    根据接收到的第二语音信息,获得第一目标拼音;Obtain the first target pinyin according to the received second voice information;
    根据所述第一目标拼音,在所述在候选信息区域显示所述第二语音信息对应的候选词。According to the first target pinyin, the candidate words corresponding to the second voice information are displayed in the candidate information area.
  3. 根据权利要求1所述的信息修正方法,其中,所述根据接收到的第一输入,在所述信息编辑界面中的候选信息区域显示对应的候选词,包括:The information correction method according to claim 1, wherein, according to the received first input, the corresponding candidate words are displayed in the candidate information area in the information editing interface, comprising:
    根据接收到的第二语音信息以及预设规则,在所述在候选信息区域显示所述第二语音信息对应的候选词;Display the candidate words corresponding to the second voice information in the candidate information area according to the received second voice information and preset rules;
    其中,所述预设规则包括:在所述候选词为词组的情况下,将所述候选词中的高阶词显示在低阶词之前,以及,将所述高阶词的次阶词显示在所述次阶词的同阶词之前;高阶词所包含的词的数量大于低阶词所包含的词的数量,两个同阶词所包含的词的数量相同,高阶词的次阶词为包含在所述高阶词中的词;Wherein, the preset rule includes: when the candidate word is a phrase, displaying the higher-order word in the candidate word before the lower-order word, and displaying the second-order word of the higher-order word Before the same-order words of the second-order words; the number of words contained in the higher-order words is greater than the number of words contained in the lower-order words, the number of words contained in the two same-order words is the same, and the second-order words of the higher-order words contain the same number of words. An order word is a word contained in the higher order word;
    所述第二语音信息为包含所述目标候选词的高阶词所对应的语音信息;The second voice information is the voice information corresponding to the higher-order words including the target candidate words;
    所述第一指示信息为根据长按操作得到的指示信息。The first indication information is indication information obtained according to a long-press operation.
  4. 根据权利要求1所述的信息修正方法,其中,所述根据接收到的第一输入,在所述信息编辑界面中的候选信息区域显示对应的候选词,包括:The information correction method according to claim 1, wherein, according to the received first input, the corresponding candidate words are displayed in the candidate information area in the information editing interface, comprising:
    在所述目标区域内的第二文字信息被选中的情况下,根据所述第二文字信息,确定第二目标拼音;In the case that the second text information in the target area is selected, determine the second target pinyin according to the second text information;
    根据所述第二目标拼音,在所述在候选信息区域显示所述第二目标拼音对应的候选词;According to the second target pinyin, the candidate words corresponding to the second target pinyin are displayed in the candidate information area;
    其中,所述目标区域中的第一文字信息包括所述第二文字信息。Wherein, the first text information in the target area includes the second text information.
  5. 根据权利要求1所述的信息修正方法,其中,根据所述目标候选词,对所述目标区域中的第一文字信息进行修正之前,所述信息修正方法还包括:The information modification method according to claim 1, wherein, before modifying the first text information in the target area according to the target candidate word, the information modification method further comprises:
    在所述候选信息区域上显示删除控件和修正完成确认控件,所述删除控件包括向前删除控件和向后删除控件中的至少一种;Displaying a deletion control and a correction completion confirmation control on the candidate information area, the deletion control includes at least one of a forward deletion control and a backward deletion control;
    其中,所述向前删除控件用于接收删除所述信息编辑界面中第一目标位置的左侧相邻位置的字符的指令;Wherein, the forward delete control is used to receive an instruction to delete the character at the left adjacent position of the first target position in the information editing interface;
    所述向后删除控件用于接收删除所述信息编辑界面中第一目标位置的右侧相邻位置的字符的指令;The backward deletion control is used to receive an instruction to delete the character at the right adjacent position of the first target position in the information editing interface;
    所述修正完成确认控件用于接收将光标移动至所述目标区域中的文字信息的句尾的指令;The correction completion confirmation control is used to receive an instruction to move the cursor to the end of the sentence of the text information in the target area;
    所述第一目标位置根据用户在所述信息编辑界面中的操作确定。The first target position is determined according to the user's operation in the information editing interface.
  6. 根据权利要求5所述的信息修正方法,其中,所述根据所述目标候选词,对所述目标区域中的第一文字信息进行修正,包括:The information modification method according to claim 5, wherein the modifying the first text information in the target area according to the target candidate word comprises:
    通过所述删除控件对目标区域中与第二指示信息对应的文字信息进行删除;Delete the text information corresponding to the second indication information in the target area through the delete control;
    根据所述第二指示信息,确定第二目标位置,将目标候选词显示在所述第二目标位置;determining a second target position according to the second indication information, and displaying the target candidate word at the second target position;
    其中,所述第二指示信息根据用户在所述目标区域的操作获得。Wherein, the second indication information is obtained according to the user's operation in the target area.
  7. 一种信息修正装置,包括:An information correction device, comprising:
    显示模块,用于将根据第一语音信息转化生成的第一文字信息显示在信 息编辑界面中的目标区域;A display module, for displaying the first text information generated according to the first voice information conversion in the target area in the information editing interface;
    处理模块,用于根据接收到的第一输入,在所述信息编辑界面中的候选信息区域显示对应的候选词;a processing module, configured to display corresponding candidate words in the candidate information area in the information editing interface according to the received first input;
    目标确定模块,用于根据用户在所述候选信息区域的操作,获得第一指示信息,所述第一指示信息用于指示所述候选词中的目标候选词;a target determination module, configured to obtain first indication information according to a user's operation in the candidate information area, where the first indication information is used to indicate a target candidate word in the candidate word;
    信息修正模块,用于根据所述目标候选词,对所述目标区域中的第一文字信息进行修正。An information modification module, configured to modify the first text information in the target area according to the target candidate word.
  8. 根据权利要求7所述的信息修正装置,其中,所述处理模块包括:The information correction device according to claim 7, wherein the processing module comprises:
    第一处理单元,用于根据接收到的第二语音信息,获得第一目标拼音;a first processing unit, configured to obtain the first target pinyin according to the received second voice information;
    第二处理单元,用于根据所述第一目标拼音,在所述在候选信息区域显示所述第二语音信息对应的候选词。The second processing unit is configured to display the candidate words corresponding to the second voice information in the candidate information area according to the first target pinyin.
  9. 根据权利要求7所述的信息修正装置,其中,所述处理模块包括:The information correction device according to claim 7, wherein the processing module comprises:
    第三处理单元,用于根据接收到的第二语音信息以及预设规则,在所述在候选信息区域显示所述第二语音信息对应的候选词;a third processing unit, configured to display the candidate words corresponding to the second voice information in the candidate information area according to the received second voice information and a preset rule;
    其中,所述预设规则包括:在所述候选词为词组的情况下,将所述候选词中的高阶词显示在低阶词之前,以及,将所述高阶词的次阶词显示在所述次阶词的同阶词之前;高阶词所包含的词的数量大于低阶词所包含的词的数量,两个同阶词所包含的词的数量相同,高阶词的次阶词为包含在所述高阶词中的词;Wherein, the preset rule includes: when the candidate word is a phrase, displaying the higher-order word in the candidate word before the lower-order word, and displaying the second-order word of the higher-order word Before the same-order words of the second-order words; the number of words contained in the higher-order words is greater than the number of words contained in the lower-order words, the number of words contained in the two same-order words is the same, and the second-order words of the higher-order words contain the same number of words. An order word is a word contained in the higher order word;
    所述第二语音信息为包含所述目标候选词的高阶词所对应的语音信息;The second voice information is the voice information corresponding to the higher-order words including the target candidate words;
    所述第一指示信息为根据长按操作得到的指示信息。The first indication information is indication information obtained according to a long-press operation.
  10. 根据权利要求7所述的信息修正装置,其中,所述处理模块包括:The information correction device according to claim 7, wherein the processing module comprises:
    第四处理单元,用于在所述目标区域内的第二文字信息被选中的情况下,根据所述第二文字信息,确定第二目标拼音;a fourth processing unit, configured to determine a second target pinyin according to the second text information when the second text information in the target area is selected;
    第五处理单元,用于根据所述第二目标拼音,在所述在候选信息区域显示所述第二目标拼音对应的候选词;a fifth processing unit, configured to display the candidate words corresponding to the second target pinyin in the candidate information area according to the second target pinyin;
    其中,所述目标区域中的第一文字信息包括所述第二文字信息。Wherein, the first text information in the target area includes the second text information.
  11. 根据权利要求7所述的信息修正装置,其中,所述信息修正装置还包括:The information correction device according to claim 7, wherein the information correction device further comprises:
    控件显示模块,用于在所述候选信息区域上显示删除控件和修正完成确认控件,所述删除控件包括向前删除控件和向后删除控件中的至少一种;a control display module, configured to display a deletion control and a correction completion confirmation control on the candidate information area, the deletion control includes at least one of a forward deletion control and a backward deletion control;
    其中,所述向前删除控件用于接收删除所述信息编辑界面中第一目标位置的左侧相邻位置的字符的指令;Wherein, the forward delete control is used to receive an instruction to delete the character at the left adjacent position of the first target position in the information editing interface;
    所述向后删除控件用于接收删除所述信息编辑界面中第一目标位置的右侧相邻位置的字符的指令;The backward deletion control is used to receive an instruction to delete a character adjacent to the right side of the first target position in the information editing interface;
    所述修正完成确认控件用于接收将光标移动至所述目标区域中的文字信息的句尾的指令;The correction completion confirmation control is used to receive an instruction to move the cursor to the end of the sentence of the text information in the target area;
    所述第一目标位置根据用户在所述信息编辑界面中的操作确定。The first target position is determined according to the user's operation in the information editing interface.
  12. 根据权利要求11所述的信息修正装置,其中,所述信息修正模块包括:The information correction device according to claim 11, wherein the information correction module comprises:
    第一修正单元,用于通过所述删除控件对目标区域中与第二指示信息对应的文字信息进行删除;a first correction unit, configured to delete the text information corresponding to the second indication information in the target area through the deletion control;
    第二修正单元,用于根据所述第二指示信息,确定第二目标位置,将目标候选词显示在所述第二目标位置;a second correction unit, configured to determine a second target position according to the second indication information, and display the target candidate word at the second target position;
    其中,所述第二指示信息根据用户在所述目标区域的操作获得。Wherein, the second indication information is obtained according to the user's operation in the target area.
  13. 一种电子设备,包括处理器,存储器及存储在所述存储器上并可在所述处理器上运行的程序或指令,其中,所述程序或指令被所述处理器执行时实现如权利要求1至6任一项所述的信息修正方法的步骤。An electronic device, comprising a processor, a memory, and a program or instruction stored on the memory and executable on the processor, wherein the program or instruction is executed by the processor to achieve as claimed in claim 1 Steps of the information correction method described in any one of to 6.
  14. 一种可读存储介质,所述可读存储介质上存储程序或指令,其中,所述程序或指令被处理器执行时实现如权利要求1至6任一项所述的信息修正方法的步骤。A readable storage medium storing programs or instructions on the readable storage medium, wherein when the programs or instructions are executed by a processor, the steps of the information modification method according to any one of claims 1 to 6 are implemented.
  15. 一种芯片,包括处理器和通信接口,其中,所述通信接口和所述处理器耦合,所述处理器用于运行程序或指令,实现如权利要求1至6任一项所述的信息修正方法。A chip, comprising a processor and a communication interface, wherein the communication interface is coupled to the processor, and the processor is used to run a program or an instruction to implement the information modification method according to any one of claims 1 to 6 .
  16. 一种计算机程序产品,其中,所述程序产品被存储在非易失的存储介质中,所述程序产品被至少一个处理器执行以实现如权利要求1至6任一项所述的信息修正方法。A computer program product, wherein the program product is stored in a non-volatile storage medium, and the program product is executed by at least one processor to implement the information correction method according to any one of claims 1 to 6 .
  17. 一种电子设备,其中,所述电子设备被配置为执行如权利要求1至6任一项所述的信息修正方法。An electronic device, wherein the electronic device is configured to perform the information correction method according to any one of claims 1 to 6.
PCT/CN2021/141161 2020-12-30 2021-12-24 Information correction method and apparatus, and electronic device WO2022143454A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202011608156.X 2020-12-30
CN202011608156.XA CN112684913B (en) 2020-12-30 2020-12-30 Information correction method and device and electronic equipment

Publications (1)

Publication Number Publication Date
WO2022143454A1 true WO2022143454A1 (en) 2022-07-07

Family

ID=75454977

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2021/141161 WO2022143454A1 (en) 2020-12-30 2021-12-24 Information correction method and apparatus, and electronic device

Country Status (2)

Country Link
CN (1) CN112684913B (en)
WO (1) WO2022143454A1 (en)

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112684913B (en) * 2020-12-30 2023-07-14 维沃移动通信有限公司 Information correction method and device and electronic equipment
CN113359999B (en) * 2021-05-24 2023-06-27 维沃移动通信有限公司 Candidate word updating method and device and electronic equipment
CN113378530A (en) * 2021-06-28 2021-09-10 北京七维视觉传媒科技有限公司 Voice editing method and device, equipment and medium
CN114286169B (en) * 2021-08-31 2023-06-20 腾讯科技(深圳)有限公司 Video generation method, device, terminal, server and storage medium
CN114267352B (en) * 2021-12-24 2023-04-14 北京信息科技大学 Voice information processing method, electronic equipment and computer storage medium

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107678561A (en) * 2017-09-29 2018-02-09 百度在线网络技术(北京)有限公司 Phonetic entry error correction method and device based on artificial intelligence
CN109243468A (en) * 2018-11-14 2019-01-18 北京羽扇智信息科技有限公司 Audio recognition method, device, electronic equipment and storage medium
US10388272B1 (en) * 2018-12-04 2019-08-20 Sorenson Ip Holdings, Llc Training speech recognition systems using word sequences
CN110176237A (en) * 2019-07-09 2019-08-27 北京金山数字娱乐科技有限公司 A kind of audio recognition method and device
CN110347996A (en) * 2019-07-15 2019-10-18 北京百度网讯科技有限公司 Amending method, device, electronic equipment and the storage medium of text
CN110717021A (en) * 2019-09-17 2020-01-21 平安科技(深圳)有限公司 Input text and related device for obtaining artificial intelligence interview
CN111540356A (en) * 2020-04-20 2020-08-14 苏州思必驰信息科技有限公司 Correction method and system for voice conversation
US20210074277A1 (en) * 2019-09-06 2021-03-11 Microsoft Technology Licensing, Llc Transcription revision interface for speech recognition system
CN112684913A (en) * 2020-12-30 2021-04-20 维沃移动通信有限公司 Information correction method and device and electronic equipment

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101577115A (en) * 2008-05-09 2009-11-11 台达电子工业股份有限公司 Voice input system and voice input method
US20150100537A1 (en) * 2013-10-03 2015-04-09 Microsoft Corporation Emoji for Text Predictions
CN110970026A (en) * 2019-12-17 2020-04-07 用友网络科技股份有限公司 Voice interaction matching method, computer device and computer-readable storage medium
CN112148135A (en) * 2020-09-30 2020-12-29 维沃移动通信有限公司 Input method processing method and device and electronic equipment

Patent Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107678561A (en) * 2017-09-29 2018-02-09 百度在线网络技术(北京)有限公司 Phonetic entry error correction method and device based on artificial intelligence
CN109243468A (en) * 2018-11-14 2019-01-18 北京羽扇智信息科技有限公司 Audio recognition method, device, electronic equipment and storage medium
US10388272B1 (en) * 2018-12-04 2019-08-20 Sorenson Ip Holdings, Llc Training speech recognition systems using word sequences
CN110176237A (en) * 2019-07-09 2019-08-27 北京金山数字娱乐科技有限公司 A kind of audio recognition method and device
CN110347996A (en) * 2019-07-15 2019-10-18 北京百度网讯科技有限公司 Amending method, device, electronic equipment and the storage medium of text
US20210074277A1 (en) * 2019-09-06 2021-03-11 Microsoft Technology Licensing, Llc Transcription revision interface for speech recognition system
CN110717021A (en) * 2019-09-17 2020-01-21 平安科技(深圳)有限公司 Input text and related device for obtaining artificial intelligence interview
CN111540356A (en) * 2020-04-20 2020-08-14 苏州思必驰信息科技有限公司 Correction method and system for voice conversation
CN112684913A (en) * 2020-12-30 2021-04-20 维沃移动通信有限公司 Information correction method and device and electronic equipment

Also Published As

Publication number Publication date
CN112684913A (en) 2021-04-20
CN112684913B (en) 2023-07-14

Similar Documents

Publication Publication Date Title
WO2022143454A1 (en) Information correction method and apparatus, and electronic device
JP6151381B2 (en) Virtual keyboard input for international languages
US20210073467A1 (en) Method, System and Apparatus for Entering Text on a Computing Device
US9798718B2 (en) Incremental multi-word recognition
US9606634B2 (en) Device incorporating improved text input mechanism
RU2377664C2 (en) Text input method
CN110083254B (en) Multi-gesture text input prediction
EP1950669B1 (en) Device incorporating improved text input mechanism using the context of the input
US7117144B2 (en) Spell checking for text input via reduced keypad keys
AU2014212844B2 (en) Character and word level language models for out-of-vocabulary text input
EP2133772B1 (en) Device and method incorporating an improved text input mechanism
TWI394065B (en) Multiple predictions in a reduced keyboard disambiguating system
US8756499B1 (en) Gesture keyboard input of non-dictionary character strings using substitute scoring
CN107797676B (en) Single character input method and device
US11886801B1 (en) System, method and device for multimodal text editing
CN115145406A (en) Text input method, text input device, computer equipment and storage medium
TWI423048B (en) A system for inputting in the form of a complement code and a method thereof

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 21914191

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 21914191

Country of ref document: EP

Kind code of ref document: A1