CN104102625B - Method and apparatus for improving spell checking by applying keyboard layout information - Google Patents

Info

Publication number
CN104102625B
Authority
CN
China
Prior art keywords
character
word
weight
candidate word
input
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201310127321.3A
Other languages
Chinese (zh)
Other versions
CN104102625A (en
Inventor
路光明
张萌
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Canon Inc
Original Assignee
Canon Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Canon Inc
Priority to CN201310127321.3A
Publication of CN104102625A
Application granted
Publication of CN104102625B

Abstract

The present invention relates to a method and apparatus for improving spell checking by applying keyboard layout information. The method according to the invention includes: obtaining keyboard layout information, the keyboard layout information including layout information of the left-hand and right-hand input areas; providing a spell-check suggestion for an input word, the spell-check suggestion including a candidate word list; and adjusting the weights of the candidate words in the candidate word list based on the layout information of the left-hand and right-hand input areas, wherein the position of a candidate word in the candidate word list is determined at least in part by the weight of that candidate word. The invention can improve the efficiency of spell checking.

Description

Method and apparatus for improving spell checking by applying keyboard layout information
Technical field
The present invention relates to a method and apparatus for improving spell checking. More particularly, it relates to a method and apparatus for improving spell checking by applying keyboard layout information related to left-hand and right-hand input.
Background art
Spell checkers are widely used in various text processing applications (such as a Microsoft application program) and in applications that include word processing functions (mail clients, electronic dictionaries, search engines, etc.). A spell checker marks words that may be misspelled and preferably provides a candidate word list or corrects the word automatically.
In the field of spell checking, there are various ways to rank candidate words. One way is to calculate the edit distance (also called the Levenshtein distance) between each candidate word and the input word, and to rank the candidate words by this edit distance before presenting them to the user. The edit distance is the minimum number of spelling changes required to turn one word into another. A spelling change can be substituting one character for another, inserting a character, deleting a character, and so on. For example, the word "kitten" becomes "sitting" as follows:
1. sitten (k → s)
2. sittin (e → i)
3. sitting (insert g at the end)
Therefore, the edit distance from "kitten" to "sitting" is 3.
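The edit distance described above can be computed with the standard dynamic-programming recurrence. The following is a minimal sketch in Python; the function name `edit_distance` is chosen here for illustration and does not come from the patent.

```python
def edit_distance(source: str, target: str) -> int:
    """Levenshtein distance: minimum number of single-character
    substitutions, insertions and deletions turning source into target."""
    # previous[j] is the distance between the already processed prefix
    # of source and target[:j]; for the empty prefix that is j insertions.
    previous = list(range(len(target) + 1))
    for i, s_char in enumerate(source, start=1):
        current = [i]  # deleting i characters yields the empty target prefix
        for j, t_char in enumerate(target, start=1):
            substitution = previous[j - 1] + (s_char != t_char)
            insertion = current[j - 1] + 1
            deletion = previous[j] + 1
            current.append(min(substitution, insertion, deletion))
        previous = current
    return previous[-1]


print(edit_distance("kitten", "sitting"))  # 3
```

Note that under this plain definition a swap of two adjacent characters, such as "amin" for "main", costs 2, the same as two unrelated substitutions, which is one reason a ranking by edit distance alone can place the intended word low.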
Edit distance is based entirely on a comparison between two words and does not take the characteristics of the input device into account. As a result, the order of the candidate words in the candidate word list may not be the best order for the user.
In addition, U.S. Patent No. US7957955 describes a method of ranking candidate words that takes the characteristics of the input device into account. When input is made through a keyboard, the keyboard layout is considered when ranking the candidate words. Specifically, possible candidate word suggestions are identified based on the input characters and the characters adjacent to them in the keyboard layout, and are then scored and ranked. Fig. 1 shows an example of this method. In Fig. 1, reference numeral "602" indicates the character string "rheatre" input by the user, reference numeral "604" indicates the characters on the keyboard that are adjacent to each character in that string, and reference numeral "608" indicates the candidate word list produced from the character permutations.
However, the inventors of the present application have recognized that a user who is familiar with the keyboard layout hardly ever presses a wrong key. Instead, such a user may type quickly, and when typing on a keyboard with two hands, misspellings readily occur because the two hands press two correct keys in reversed order or at the same time. None of the prior-art methods considers the influence of two-handed input on spell checking, so the order of the candidate words is not entirely reasonable. This may cause the candidate word the user wants to be ranked relatively low, so that the misspelled word cannot be corrected efficiently.
Summary of the invention
Accordingly, there is a need for a method and apparatus that improve spell checking by taking both left-hand and right-hand input characteristics and the keyboard layout into consideration.
According to one aspect of the present invention, in order to solve the above technical problem, the present invention provides a method for improving spell checking by applying keyboard layout information, which includes the following steps: obtaining keyboard layout information, the keyboard layout information including layout information of the left-hand and right-hand input areas; providing a spell-check suggestion for an input word, the spell-check suggestion including a candidate word list; and adjusting the weights of the candidate words in the candidate word list based on the layout information of the left-hand and right-hand input areas, wherein the position of a candidate word in the candidate word list is determined at least in part by the weight of that candidate word.
According to another aspect of the present invention, in order to solve the above technical problem, the present invention provides an apparatus for improving spell checking by applying keyboard layout information, which includes: a keyboard layout information acquisition module configured to obtain keyboard layout information, the keyboard layout information including layout information of the left-hand and right-hand input areas; a spell check module configured to provide a spell-check suggestion for an input word, the spell-check suggestion including a candidate word list; and a weight determination module configured to adjust the weights of the candidate words in the candidate word list based on the layout information of the left-hand and right-hand input areas, wherein the position of a candidate word in the candidate word list is determined at least in part by the weight of that candidate word.
The method and apparatus according to the invention take left-hand and right-hand input characteristics and the keyboard layout into consideration, and can therefore provide a more reasonable ranking of candidate words, which improves the efficiency of spell checking.
Other features and advantages of the present invention will become apparent from the following description with reference to the accompanying drawings.
Brief description of the drawings
The accompanying drawings, which are incorporated in and constitute a part of the specification, illustrate embodiments of the invention and, together with the description, serve to explain the principle of the invention.
Fig. 1 shows an example of a prior-art spell checking method.
Fig. 2 is a block diagram showing the hardware configuration of a computer system capable of implementing embodiments of the present invention.
Fig. 3 shows a QWERTY keyboard layout.
Fig. 4 shows the spell-check result given by a prior-art Microsoft application program for the input word "amin".
Fig. 5 shows the spell-check result given according to the present invention for the input word "amin".
Fig. 6 shows the spell-check result given by a prior-art Microsoft application program for the input word "domin".
Fig. 7 shows the spell-check result given according to the present invention for the input word "domin".
Fig. 8 is a flowchart showing a method for improving spell checking by applying keyboard layout information according to an embodiment of the present invention.
Fig. 9 is a block diagram showing an apparatus for improving spell checking by applying keyboard layout information according to an embodiment of the present invention.
Detailed description of the embodiments
Preferred embodiments of the present invention are described in detail below with reference to the accompanying drawings. Details and functions that are not necessary for the invention are omitted so as not to obscure the understanding of the invention.
Note that similar reference numerals and letters refer to similar items in the figures, so once an item is defined in one figure, it need not be discussed again for subsequent figures.
In the present disclosure, the terms "first", "second", and so on are used only to distinguish between elements or steps and are not intended to indicate chronological order, priority, or importance.
(Hardware configuration of the computer system)
Fig. 2 is a block diagram showing the hardware configuration of a computer system 1000 capable of implementing embodiments of the present invention.
As shown in Fig. 2, the computer system includes a computer 1110. The computer 1110 includes a processing unit 1120, a system memory 1130, a fixed non-volatile memory interface 1140, a removable non-volatile memory interface 1150, a user input interface 1160, a network interface 1170, a video interface 1190, and a peripheral interface 1195, which are connected via a system bus 1121.
The system memory 1130 includes a ROM (read-only memory) 1131 and a RAM (random access memory) 1132. A BIOS (basic input/output system) 1133 resides in the ROM 1131. An operating system 1134, application programs 1135, other program modules 1136, and some program data 1137 reside in the RAM 1132.
A fixed non-volatile memory 1141, such as a hard disk, is connected to the fixed non-volatile memory interface 1140. The fixed non-volatile memory 1141 can store, for example, an operating system 1144, application programs 1145, other program modules 1146, and some program data 1147.
Removable non-volatile memories, such as a floppy disk drive 1151 and a CD-ROM drive 1155, are connected to the removable non-volatile memory interface 1150. For example, a floppy disk 1152 can be inserted into the floppy disk drive 1151, and a CD (compact disc) 1156 can be inserted into the CD-ROM drive 1155.
Input devices, such as a microphone 1161 and a keyboard 1162, are connected to the user input interface 1160.
The computer 1110 can be connected to a remote computer 1180 through the network interface 1170. For example, the network interface 1170 can be connected to the remote computer 1180 via a local area network 1171. Alternatively, the network interface 1170 can be connected to a modem (modulator-demodulator) 1172, and the modem 1172 is connected to the remote computer 1180 via a wide area network 1173.
The remote computer 1180 can include a memory 1181, such as a hard disk, which stores remote application programs 1185.
The video interface 1190 is connected to a monitor 1191.
The peripheral interface 1195 is connected to a printer 1196 and a loudspeaker 1197.
The computer system shown in Fig. 2 is merely illustrative and is in no way intended to limit the invention, its application, or its uses.
The computer system shown in Fig. 2 can be incorporated in any embodiment, either as a stand-alone computer or as a processing system in a device; one or more unnecessary components can be removed from it, and one or more additional components can be added to it.
(Principle of the invention)
As mentioned above, the inventors of the present application have recognized that a user who is familiar with the keyboard layout hardly ever presses a wrong key. Instead, the user is prone to errors in the coordination of the two hands (for example, the two hands press two correct keys in reversed order or at the same time), which causes misspellings. The principle of the invention is described in detail below, taking as examples the adjacent-character swap and the missing character that result when the two hands press two correct keys in reversed order or at the same time.
When a user types characters on a keyboard with two hands and two consecutive characters lie in the left-hand input area and the right-hand input area of the keyboard respectively, the two hands may easily press the keys at the same time or in the opposite order. Consequently, one of the two characters may be missing, or the input order of the two characters may be reversed. That is, many misspellings are caused by a missing character or an adjacent-character swap that occurs when the left and right hands each type one of two consecutive characters. By contrast, the probability of a missing character or an adjacent-character swap is low when consecutive characters are typed by one hand, and because users are generally familiar with the keyboard layout, they seldom press a wrong key.
Therefore, spell checking can be improved by considering the layout of the left-hand and right-hand input areas of the keyboard. That is, the layout information of the left-hand and right-hand input areas can be used to judge whether the difference between the input word and a candidate word is a missing character, an adjacent-character swap, or a similar error caused by two-handed input. If so, the candidate word is more likely to be the correct word, so its weight can be raised, that is, its rank in the candidate word list can be promoted.
The principle of the invention and its advantages over the prior art are described in detail below with reference to specific examples and statistics.
The keyboard described in this specification can be a virtual keyboard displayed on a touch screen (also called a soft keyboard) or a physical keyboard (also called a hard keyboard). The keyboard layout can follow the conventional QWERTY layout or a variant of it, or can be another layout. The principle, embodiments, and specific examples of the invention are explained below using the QWERTY layout shown in Fig. 3 as the keyboard layout, but those skilled in the art will understand that the invention is not limited to the QWERTY layout. In addition, as is known in the art, in the QWERTY keyboard layout shown in Fig. 3, the black line in the middle of the figure is the boundary: the keys to its left belong to the left-hand input area, and the keys to its right belong to the right-hand input area.
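One simple way to represent the layout information of the two input areas is as two sets of characters. The sketch below assumes the conventional touch-typing split of the QWERTY letter keys; the exact boundary is the one drawn in Fig. 3, which is not reproduced here, so the sets and the helper names `hand_of` and `typed_by_different_hands` are illustrative assumptions.

```python
from typing import Optional

# Assumed split of the QWERTY letter keys (conventional touch typing).
# The authoritative boundary is the black line in Fig. 3.
LEFT_HAND = frozenset("qwertasdfgzxcvb")
RIGHT_HAND = frozenset("yuiophjklnm")


def hand_of(char: str) -> Optional[str]:
    """Return "L" or "R" for a letter key, or None for an unknown key."""
    key = char.lower()
    if key in LEFT_HAND:
        return "L"
    if key in RIGHT_HAND:
        return "R"
    return None


def typed_by_different_hands(first: str, second: str) -> bool:
    """True only if both keys are known and lie in different input areas."""
    hands = (hand_of(first), hand_of(second))
    return None not in hands and hands[0] != hands[1]


print(hand_of("a"), hand_of("m"))          # L R
print(typed_by_different_hands("a", "m"))  # True
```

For another layout, or a soft keyboard with a different split, only the two sets would need to change.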
1. Example of an adjacent-character swap
The user wants to type the word "main". However, when the two hands type quickly, the "m" key pressed by the right hand lags slightly behind the "a" key pressed by the left hand, so the input becomes "amin". Comparing the input word "amin" with the candidate word "main" shows that "a" and "m" have swapped positions.
Fig. 4 shows the spell-check result given by a prior-art Microsoft application program for the input word "amin". As Fig. 4 shows, "amen" is the top candidate in the candidate word list and "main" is the third candidate. In this case, the top candidate is not the word the user wants to select.
According to the present invention, based on the information about the left-hand and right-hand input areas of the QWERTY keyboard, it can be determined that "m" and "a" are typed by different hands, and the weight of the candidate word "main" is therefore raised. Fig. 5 shows the spell-check result given according to the present invention for the input word "amin". As Fig. 5 shows, "main" is at the top of the candidate word list, and the candidate word "main" is the word the user intended to type.
As can be seen from the above, the present invention improves spell checking compared with the prior art. The inventors explain this example as follows: when a user types on a keyboard with two hands, the input order of two consecutive characters that should be typed by the same hand is seldom swapped; in some cases, however, when the user types quickly with two hands, the two hands press two keys very close together in time, which may reverse the order of the two characters typed by the two hands. That is, an adjacent-character swap caused by two-handed input should be a relatively common input error, so raising the weight of the corresponding candidate word makes the ranking of the candidate words more reasonable.
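The comparison in this example can be sketched as follows: find the single pair of adjacent characters whose swap turns the input word into the candidate, then look up which hand types each of them. This is an illustrative sketch, not the patent's implementation; the helper names and the assumed QWERTY split are hypothetical.

```python
from typing import Optional, Tuple

# Assumed QWERTY split; the authoritative one is shown in Fig. 3.
HAND = {c: "L" for c in "qwertasdfgzxcvb"}
HAND.update({c: "R" for c in "yuiophjklnm"})


def find_adjacent_swap(typed: str, candidate: str) -> Optional[Tuple[str, str]]:
    """If typed differs from candidate by exactly one swap of two adjacent
    characters, return those characters in candidate order, else None."""
    if len(typed) != len(candidate):
        return None
    diffs = [i for i, (t, c) in enumerate(zip(typed, candidate)) if t != c]
    if len(diffs) == 2 and diffs[1] == diffs[0] + 1:
        i, j = diffs
        if typed[i] == candidate[j] and typed[j] == candidate[i]:
            return candidate[i], candidate[j]
    return None


def is_cross_hand(first: str, second: str) -> bool:
    """True if both characters are known and typed by different hands."""
    a, b = HAND.get(first), HAND.get(second)
    return a is not None and b is not None and a != b


swap = find_adjacent_swap("amin", "main")
print(swap)                  # ('m', 'a')
print(is_cross_hand(*swap))  # True
```

The same check flags "mian" against "main" (swap of "a" and "i"), while a swap such as "wsa" for "was" involves two left-hand keys and would not be treated as a two-handed error.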
2. Example of a missing character
The user wants to type the word "domain". However, after typing "dom", because the right hand is more agile than the left for most people, the user may press the "a" key with the left hand and the "i" key with the right hand at the same time. As a result, "a" is not entered, and the input word becomes "domin".
Fig. 6 shows the spell-check result given by a prior-art Microsoft application program for the input word "domin". As Fig. 6 shows, "doming" is the top candidate in the candidate word list and "domain" is the third candidate. In this case, the top candidate is not the word the user wants to select.
According to the present invention, based on the information about the left-hand and right-hand input areas of the QWERTY keyboard, it can be determined that the "a" that is missing from the candidate word "domain" is typed by a different hand from the preceding character "m" and the following character "i", and the weight of the candidate word "domain" is therefore raised. Fig. 7 shows the spell-check result given according to the present invention for the input word "domin". As Fig. 7 shows, "domain" is at the top of the candidate word list, and the candidate word "domain" is the word the user intended to type.
As can be seen from the above, the present invention improves spell checking compared with the prior art. The inventors explain this example as follows: when a user types on a keyboard with two hands, one hand alone seldom presses two keys at the same time; in some cases, however, when the user types quickly with two hands, the two hands press two keys at the same time, which may cause one of the two characters typed by the two hands not to be entered. That is, a missing character caused by two-handed input should also be a relatively common input error, so raising the weight of the corresponding candidate word makes the ranking of the candidate words more reasonable.
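The missing-character case can be sketched in a similar way: locate the one character whose deletion from the candidate yields the input word, then compare its hand with the hands of its neighbours. The code below is a hedged illustration; the helper names and the QWERTY split are assumptions, and for doubled letters it simply reports the first matching position.

```python
from typing import Optional, Tuple

# Assumed QWERTY split; the authoritative one is shown in Fig. 3.
HAND = {c: "L" for c in "qwertasdfgzxcvb"}
HAND.update({c: "R" for c in "yuiophjklnm"})

Missing = Tuple[Optional[str], str, Optional[str]]


def find_missing_char(typed: str, candidate: str) -> Optional[Missing]:
    """If typed equals candidate with one character deleted, return
    (previous, missing, next); previous/next are None at word edges."""
    if len(candidate) != len(typed) + 1:
        return None
    for i in range(len(candidate)):
        if candidate[:i] + candidate[i + 1:] == typed:
            previous = candidate[i - 1] if i > 0 else None
            following = candidate[i + 1] if i + 1 < len(candidate) else None
            return previous, candidate[i], following
    return None


def missing_is_cross_hand(previous, missing, following) -> bool:
    """True if the missing key and at least one neighbour are typed by
    different hands (unknown or absent neighbours are ignored)."""
    own = HAND.get(missing)
    neighbours = [HAND.get(c) for c in (previous, following) if c is not None]
    return own is not None and any(h is not None and h != own for h in neighbours)


found = find_missing_char("domin", "domain")
print(found)                          # ('m', 'a', 'i')
print(missing_is_cross_hand(*found))  # True
```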
3. Statistics
The website www.spellcheck.net provides spelling statistics for individual words. From January 2010 to June 2012, the website collected more than 15,411,110 pieces of spelling information, in which:
- The most common misspellings of "main" are "mian" (31%) and "amin" (11%). Both are caused by an adjacent-character swap resulting from two-handed input, and together they account for about 42%.
- For many other words, the misspellings are likewise caused mainly by adjacent-character swaps and missing characters resulting from two-handed input.
Table 1 below lists all the misspellings that occurred in an actual form.
Table 1
As can be seen from the table above, 5 of the 18 errors are missing-character cases, and 3 of these 5 missing-character errors can be corrected better by the principle of the invention.
In addition, 4 of the 18 errors are adjacent-character swaps, and all 4 of these adjacent-character swap errors can be corrected better by the principle of the invention.
In sum, applying the present invention can improve 38.8% (7 of 18) of the spell-check results in the above-mentioned actual form.
Based on the above principle of the invention, a method and apparatus for improving spell checking are proposed, which are described in detail below.
(Method and apparatus for improving spell checking)
Fig. 8 is a flowchart showing a method for improving spell checking by applying keyboard layout information according to an embodiment of the present invention.
As shown in Fig. 8, in step 610, keyboard layout information is obtained; the keyboard layout information includes the layout information of the left-hand and right-hand input areas.
As described above, the keyboard layout can be the QWERTY layout or a variant of it, or can be another layout. The invention is applicable to various keyboards (including soft keyboards and hard keyboards), as long as the keyboard can be divided into a left-hand input area and a right-hand input area. The layout information of the left-hand and right-hand input areas can include only the characters contained in the left-hand input area and the characters contained in the right-hand input area of the keyboard, or in some cases can also include the positional relationships between these characters, and so on.
In step 620, a spell-check suggestion is provided for the input word; the spell-check suggestion includes a candidate word list.
The candidate word list can be obtained by various spell checking methods known in the art. For example, producing the candidate word list usually requires comparing the input word with each word in a dictionary and calculating the edit distance between the input word and each candidate word. The candidate words are then ranked by edit distance to obtain the candidate word list.
In step 630, the weights of the candidate words in the candidate word list are adjusted based on the layout information of the left-hand and right-hand input areas; the position of a candidate word in the candidate word list is determined at least in part by the weight of that candidate word. If the spell checker also corrects misspellings automatically, the method according to the invention further performs the following step after step 630: replacing the input word with the first candidate word in the candidate word list, thereby correcting it automatically.
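Steps 610 to 630 can be strung together as in the following sketch. It is only an illustration under stated assumptions: the layout sets, the `bonus` value, and the candidate list with its base weights are all hypothetical stand-ins (a real spell checker would supply the candidates in step 620), and only the adjacent-swap case is handled to keep the example short.

```python
from typing import Optional

# Step 610: layout information as two character sets (assumed QWERTY split).
LAYOUT = {
    "left": frozenset("qwertasdfgzxcvb"),
    "right": frozenset("yuiophjklnm"),
}


def hand_of(char: str, layout) -> Optional[str]:
    """Name of the input area containing char, or None if unknown."""
    for hand, keys in layout.items():
        if char.lower() in keys:
            return hand
    return None


def is_cross_hand_swap(typed: str, candidate: str, layout) -> bool:
    """True if swapping one adjacent pair of typed yields candidate and
    the two swapped characters belong to different input areas."""
    if len(typed) != len(candidate):
        return False
    for i in range(len(typed) - 1):
        a, b = typed[i], typed[i + 1]
        if a != b and typed[:i] + b + a + typed[i + 2:] == candidate:
            hands = (hand_of(a, layout), hand_of(b, layout))
            return None not in hands and hands[0] != hands[1]
    return False


def rerank(typed, candidates, layout, bonus=0.5):
    """Step 630: raise the weight of cross-hand swaps, then sort so that
    the list position follows the adjusted weight."""
    adjusted = [
        (word, weight + (bonus if is_cross_hand_swap(typed, word, layout) else 0.0))
        for word, weight in candidates
    ]
    return sorted(adjusted, key=lambda pair: pair[1], reverse=True)


# Step 620 stand-in: hypothetical candidates with hypothetical base weights.
suggestions = [("amen", 1.0), ("main", 0.8)]
ranked = rerank("amin", suggestions, LAYOUT)
autocorrected = ranked[0][0]  # optional automatic correction after step 630
print(autocorrected)  # main
```

Keeping the base weight from the existing checker and adding an adjustment on top matches the statement that the position is determined "at least in part" by the weight: other ranking signals remain in play.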
In one implementation of step 630, the candidate word can first be compared with the input word to determine whether the input word has an adjacent-character swap or a missing character relative to the candidate word. Then, if it is determined that the input word has an adjacent-character swap or a missing character relative to the candidate word, the weight of the candidate word is adjusted based on the layout information of the left-hand and right-hand input areas.
In this implementation, the weight of the candidate word can preferably be adjusted based on the layout information of the left-hand and right-hand input areas separately for the adjacent-character swap case and the missing-character case.
As mentioned above, adjacent-character swaps and missing characters caused by two-handed input should be relatively common input errors, whereas the probability of an adjacent-character swap or a missing character is low when consecutive characters are typed by one hand.
Therefore, for an adjacent-character swap, the weight of the candidate word can be raised if the swapped adjacent characters belong to the left-hand input area and the right-hand input area of the keyboard respectively; or the weight of the candidate word can be lowered if the swapped adjacent characters both belong to the left-hand input area or both belong to the right-hand input area of the keyboard.
Furthermore, for a missing character, the weight of the candidate word can be raised if the missing character and the preceding or following character in the input word belong to the left-hand input area and the right-hand input area of the keyboard respectively; or the weight of the candidate word can be lowered if the missing character and both the preceding and following characters in the input word all belong to the left-hand input area or all belong to the right-hand input area of the keyboard.
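The raise-or-lower rule for both error types can be condensed into one function that returns a weight adjustment. This is a sketch under assumptions: the error is taken as already classified, the `step` size is a hypothetical value (the concrete numbers of Table 2 are not available here), and the function applies both the raising and the lowering branch, although the text allows using either one alone.

```python
from typing import Optional, Sequence

# Assumed QWERTY split; the authoritative one is shown in Fig. 3.
HAND = {c: "L" for c in "qwertasdfgzxcvb"}
HAND.update({c: "R" for c in "yuiophjklnm"})


def layout_adjustment(kind: str, chars: Sequence[Optional[str]],
                      step: float = 1.0) -> float:
    """Weight change for one candidate word.

    kind == "swap":    chars = (first, second) swapped characters
    kind == "missing": chars = (previous, missing, next); previous or
                       next may be None at a word boundary
    Returns +step for a two-handed error, -step for a one-handed error,
    and 0.0 when the keys are unknown or the kind is not handled.
    """
    if kind == "swap":
        first, second = chars
        pairs = [(first, second)]
    elif kind == "missing":
        previous, missing, following = chars
        pairs = [(missing, n) for n in (previous, following) if n is not None]
    else:
        return 0.0
    hands = [(HAND.get(a), HAND.get(b)) for a, b in pairs]
    if not hands or any(h is None for pair in hands for h in pair):
        return 0.0  # leave the weight unchanged when a key is unknown
    if any(a != b for a, b in hands):
        return step   # characters split across the two input areas
    return -step      # all characters typed by the same hand


print(layout_adjustment("swap", ("m", "a")))          # 1.0
print(layout_adjustment("missing", ("m", "a", "i")))  # 1.0
print(layout_adjustment("swap", ("a", "s")))          # -1.0
```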
In addition, as described above, misspellings caused by adjacent-character swaps and missing characters resulting from two-handed input occur in some cases when the user types quickly. Therefore, in some cases the weight of the candidate word can be adjusted by combining the layout information of the left-hand and right-hand input areas with the time interval between character inputs. In some cases, determining the time interval between two character inputs makes it possible to determine more accurately whether such an adjacent-character swap or missing character is present, so that spell checking is performed more effectively.
Therefore, for the adjacent-character swap and missing-character cases, after the weight of the candidate word has been raised or lowered according to the layout information of the left-hand and right-hand input areas as described above, it can preferably be further determined whether the time interval between the inputs of the swapped characters, or the time interval between the inputs of the two characters immediately adjacent to the missing character, is less than a predetermined threshold. The weight of the candidate word is then further raised or lowered according to the result of this determination. In some cases, the time intervals between the inputs of every two consecutive characters in the input word can also be averaged, and it can be determined whether this average time interval is less than the predetermined threshold.
Specifically, for an adjacent-character swap, preferably, after the weight of the candidate word has been raised as described above because the swapped adjacent characters belong to the left-hand input area and the right-hand input area of the keyboard respectively, it is determined whether the time interval between the inputs of the swapped adjacent characters is less than a predetermined threshold, and if the time interval is less than the predetermined threshold, the weight of the candidate word is raised further. Alternatively, preferably, if the swapped adjacent characters belong to the left-hand input area and the right-hand input area of the keyboard respectively, it is determined whether the time interval between the inputs of the swapped adjacent characters is less than the predetermined threshold, and the weight of the candidate word is raised if the time interval is less than the predetermined threshold.
In addition, after the weight of the candidate word has been lowered as described above because the swapped adjacent characters both belong to the left-hand input area or both belong to the right-hand input area of the keyboard, it is determined whether the time interval between the inputs of the swapped adjacent characters is less than the predetermined threshold, and if the time interval is not less than the predetermined threshold, the weight of the candidate word is lowered further. Alternatively, preferably, if the swapped adjacent characters both belong to the left-hand input area or both belong to the right-hand input area of the keyboard, it is determined whether the time interval between the inputs of the swapped adjacent characters is less than the predetermined threshold, and the weight of the candidate word is lowered if the time interval is not less than the predetermined threshold.
For a missing character, preferably, after the weight of the candidate word has been raised as described above because the missing character and the preceding or following character in the input word belong to the left-hand input area and the right-hand input area of the keyboard respectively, it is determined whether the time interval between the inputs of the characters preceding and following the missing character is less than the predetermined threshold, and if the time interval is less than the predetermined threshold, the weight of the candidate word is raised further. Alternatively, preferably, if the missing character and the preceding or following character in the input word belong to the left-hand input area and the right-hand input area of the keyboard respectively, it is determined whether the time interval between the inputs of the characters preceding and following the missing character is less than the predetermined threshold, and the weight of the candidate word is raised if the time interval is less than the predetermined threshold.
In addition, after the weight of the candidate word has been lowered as described above because the missing character and both the preceding and following characters in the input word all belong to the left-hand input area or all belong to the right-hand input area of the keyboard, it is determined whether the time interval between the inputs of the characters preceding and following the missing character is less than the predetermined threshold, and if the time interval is not less than the predetermined threshold, the weight of the candidate word is lowered further. Alternatively, preferably, if the missing character and both the preceding and following characters in the input word all belong to the left-hand input area or all belong to the right-hand input area of the keyboard, it is determined whether the time interval between the inputs of the characters preceding and following the missing character is less than the predetermined threshold, and the weight of the candidate word is lowered if the time interval is not less than the predetermined threshold.
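The timing rule in the preceding paragraphs reduces to a small decision on two inputs: whether the error spans both hands, and whether the relevant keystroke interval is below the threshold. The sketch below assumes key-press timestamps in milliseconds are available; the threshold of 50 ms and the step of 0.5 are purely hypothetical placeholders, since the source does not give concrete values here.

```python
from typing import Sequence


def timing_adjustment(cross_hand: bool, interval_ms: float,
                      threshold_ms: float = 50.0, step: float = 0.5) -> float:
    """Further weight change based on keystroke timing.

    A two-handed error typed quickly raises the weight further; a
    one-handed error typed slowly lowers it further; otherwise the
    weight is left unchanged.
    """
    fast = interval_ms < threshold_ms
    if cross_hand and fast:
        return step
    if not cross_hand and not fast:
        return -step
    return 0.0


def mean_interval(press_times_ms: Sequence[float]) -> float:
    """Average gap between consecutive key presses of the input word,
    for the variant that compares an average interval to the threshold."""
    gaps = [later - earlier
            for earlier, later in zip(press_times_ms, press_times_ms[1:])]
    if not gaps:
        raise ValueError("need at least two key presses")
    return sum(gaps) / len(gaps)


print(timing_adjustment(True, 20.0))        # 0.5
print(timing_adjustment(False, 80.0))       # -0.5
print(mean_interval([0, 100, 200, 300]))    # 100.0
```

Because the function returns only the additional change, it can be added to a layout-based adjustment or used by itself, in line with the alternatives described above.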
Note that the above steps of raising or lowering the weight of the candidate word can be performed in combination or individually.
Table 2 below gives a specific example of weight adjustment:
Table 2
Obviously, those skilled in the art will understand that the manner and values of weight adjustment are not limited to the examples shown in the table above.
Fig. 9 is a block diagram showing an apparatus for improving spell checking by applying keyboard layout information according to an embodiment of the present invention.
As shown in Fig. 9, the apparatus 700 for improving spell checking by applying keyboard layout information according to an exemplary embodiment of the present invention includes a keyboard layout information acquisition module 710, a spell check module 720, and a weight determination module 730.
More specifically, the keyboard layout information acquisition module 710 is configured to obtain keyboard layout information, the keyboard layout information including layout information of the left-hand and right-hand input regions.
The spell check module 720 is configured to provide a spell check suggestion for an input word, the spell check suggestion including a candidate word list.
The weight determination module 730 is configured to adjust the weight of a candidate word in the candidate word list based on the layout information of the left-hand and right-hand input regions, the position of the candidate word in the candidate word list being determined based at least in part on the weight of the candidate word.
The units in the apparatus 700 may be configured to perform the steps shown in the flowchart of Fig. 8.
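As a rough illustration of how the three modules could cooperate, the following Python sketch wires a layout acquisition step, a spell check step and a weight determination step into one ranking function. All names (`KeyboardLayout`, `acquire_layout`, `suggest`, `compare`, `adjust`, `rank`), the QWERTY split, the toy dictionary with base weights and the step size of 1.0 are assumptions made for this sketch. The candidate generator in particular is only a crude stand-in for a real spell checker, and the timing-based adjustments are left out.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional, Tuple


@dataclass(frozen=True)
class KeyboardLayout:
    left: frozenset
    right: frozenset

    def hand(self, ch: str) -> Optional[str]:
        ch = ch.lower()
        if ch in self.left:
            return "L"
        if ch in self.right:
            return "R"
        return None


def acquire_layout() -> KeyboardLayout:
    """Stand-in for module 710; a QWERTY left/right split is assumed."""
    return KeyboardLayout(frozenset("qwertasdfgzxcvb"), frozenset("yuiophjklnm"))


def suggest(typed: str, dictionary: Dict[str, float]) -> List[Tuple[str, float]]:
    """Stand-in for module 720: dictionary words of similar length, with base weights."""
    return [(w, wt) for w, wt in dictionary.items() if abs(len(w) - len(typed)) <= 1]


def compare(typed: str, candidate: str) -> Optional[Tuple[str, int]]:
    """Return ("swap", i) if typed has candidate[i] and candidate[i+1] exchanged,
    ("missing", i) if typed lacks candidate[i], otherwise None."""
    if len(typed) == len(candidate):
        diff = [i for i in range(len(typed)) if typed[i] != candidate[i]]
        if (len(diff) == 2 and diff[1] == diff[0] + 1
                and typed[diff[0]] == candidate[diff[1]]
                and typed[diff[1]] == candidate[diff[0]]):
            return ("swap", diff[0])
    elif len(typed) + 1 == len(candidate):
        for i in range(len(candidate)):
            if candidate[:i] + candidate[i + 1:] == typed:
                return ("missing", i)
    return None


def adjust(layout: KeyboardLayout, typed: str, candidate: str,
           weight: float, step: float = 1.0) -> float:
    """Stand-in for the weight adjustment inside module 730."""
    found = compare(typed, candidate)
    if found is None:
        return weight
    kind, i = found
    if kind == "swap":
        a, b = layout.hand(candidate[i]), layout.hand(candidate[i + 1])
        if a is None or b is None:
            return weight
        # Exchanged characters on different hands: raise; on the same hand: lower.
        return weight + step if a != b else weight - step
    m = layout.hand(candidate[i])
    neighbours = [layout.hand(candidate[j]) for j in (i - 1, i + 1)
                  if 0 <= j < len(candidate)]
    if m is None or None in neighbours:
        return weight
    if any(h != m for h in neighbours):
        return weight + step       # missing character and a neighbour on different hands
    if len(neighbours) == 2:
        return weight - step       # missing character and both neighbours on one hand
    return weight


def rank(typed: str, dictionary: Dict[str, float]) -> List[Tuple[str, float]]:
    """Order the candidate list by adjusted weight, highest first."""
    layout = acquire_layout()
    scored = [(w, adjust(layout, typed, w, wt)) for w, wt in suggest(typed, dictionary)]
    return sorted(scored, key=lambda pair: -pair[1])
```

With the toy dictionary `{"the": 5.0, "ten": 5.2}`, the input "teh" is an exchange of "h" and "e", which sit on different hands under the assumed split, so "the" is raised from 5.0 to 6.0 and moves ahead of "ten". Automatic correction would then simply take the first entry of the ranked list.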
The units described above are exemplary and/or preferred modules for implementing the processing described in this disclosure. These units may be hardware units (such as a field-programmable gate array (FPGA), a digital signal processor, or an application-specific integrated circuit) and/or software modules (such as a computer-readable program). The units for implementing each step are not described exhaustively below. However, wherever there is a step that performs certain processing, there may be a corresponding functional module or unit (implemented by hardware and/or software) for implementing the same processing. Technical solutions defined by all combinations of the described steps and the units corresponding to these steps are included in the present disclosure, as long as the technical solutions they constitute are complete and applicable.
In addition, the above apparatus 700 composed of various units may be incorporated as a functional module into an electronic device such as a computer, a mobile phone, or a handheld device, as long as the electronic device has a need for spell checking. Besides the apparatus 700, the electronic device may of course have other hardware or software components.
As described above, the method and apparatus according to the present invention are applicable to various spell checkers and to various applications or devices that include a spell check function.
The method and apparatus of the present invention may be implemented in many ways, for example by software, hardware, firmware, or any combination thereof. The order of the method steps described above is merely illustrative; the method steps of the present invention are not limited to the order specifically described above unless explicitly stated otherwise. Furthermore, in some embodiments, the present invention may also be implemented as a program recorded on a recording medium, the program including machine-readable instructions for implementing the method according to the present invention. Thus, the present invention also covers a recording medium storing a program for implementing the method according to the present invention.
Although some specific embodiments of the present invention have been described in detail by way of example, those skilled in the art should understand that the above examples are intended to be illustrative only and do not limit the scope of the present invention. Those skilled in the art should understand that the above embodiments may be modified without departing from the scope and spirit of the present invention. The scope of the present invention is defined by the appended claims.

Claims (28)

1. A method for improving spell checking by applying keyboard layout information, comprising the following steps:
obtaining keyboard layout information, the keyboard layout information including layout information of left-hand and right-hand input regions;
providing a spell check suggestion for an input word, the spell check suggestion including a candidate word list;
comparing a candidate word with the input word to determine whether the input word has an adjacent-character exchange or a missing character relative to the candidate word; and, in the case where it is determined that the input word has an adjacent-character exchange or a missing character relative to the candidate word, adjusting the weight of the candidate word based on the layout information of the left-hand and right-hand input regions, wherein the position of the candidate word in the candidate word list is determined based at least in part on the weight of the candidate word.
2. The method according to claim 1, further comprising replacing the input word with the first candidate word in the candidate word list, thereby performing automatic correction.
3. The method according to claim 1, wherein the step of adjusting the weight of the candidate word comprises the following step:
raising the weight of the candidate word in the case where the input word has an adjacent-character exchange relative to the candidate word and the exchanged adjacent characters belong respectively to the left-hand input region and the right-hand input region of the keyboard.
4. The method according to claim 3, wherein the step of adjusting the weight of the candidate word further comprises the following steps:
determining whether the time interval between the inputs of the exchanged adjacent characters is less than a predetermined threshold;
further raising the weight of the candidate word in the case where the time interval is less than the predetermined threshold.
5. The method according to claim 1, wherein the step of adjusting the weight of the candidate word comprises the following steps:
in the case where the input word has an adjacent-character exchange relative to the candidate word and the exchanged adjacent characters belong respectively to the left-hand input region and the right-hand input region of the keyboard, determining whether the time interval between the inputs of the exchanged adjacent characters is less than a predetermined threshold;
raising the weight of the candidate word in the case where the time interval is less than the predetermined threshold.
6. The method according to claim 1, wherein the step of adjusting the weight of the candidate word comprises the following step:
lowering the weight of the candidate word in the case where the input word has an adjacent-character exchange relative to the candidate word and the exchanged adjacent characters both belong to the left-hand input region, or both belong to the right-hand input region, of the keyboard.
7. The method according to claim 6, wherein the step of adjusting the weight of the candidate word further comprises the following steps:
determining whether the time interval between the inputs of the exchanged adjacent characters is less than a predetermined threshold;
further lowering the weight of the candidate word in the case where the time interval is not less than the predetermined threshold.
8. The method according to claim 1, wherein the step of adjusting the weight of the candidate word comprises the following steps:
in the case where the input word has an adjacent-character exchange relative to the candidate word and the exchanged adjacent characters both belong to the left-hand input region, or both belong to the right-hand input region, of the keyboard, determining whether the time interval between the inputs of the exchanged adjacent characters is less than a predetermined threshold;
lowering the weight of the candidate word in the case where the time interval is not less than the predetermined threshold.
9. The method according to claim 1, wherein the step of adjusting the weight of the candidate word comprises the following step:
raising the weight of the candidate word in the case where the input word has a missing character relative to the candidate word and the missing character and the preceding or following character in the input word belong respectively to the left-hand input region and the right-hand input region of the keyboard.
10. The method according to claim 9, wherein the step of adjusting the weight of the candidate word further comprises the following steps:
determining whether the time interval between the inputs of the character preceding and the character following the missing character is less than a predetermined threshold;
further raising the weight of the candidate word in the case where the time interval is less than the predetermined threshold.
11. The method according to claim 1, wherein the step of adjusting the weight of the candidate word comprises the following steps:
in the case where the input word has a missing character relative to the candidate word and the missing character and the preceding or following character in the input word belong respectively to the left-hand input region and the right-hand input region of the keyboard, determining whether the time interval between the inputs of the character preceding and the character following the missing character is less than a predetermined threshold;
raising the weight of the candidate word in the case where the time interval is less than the predetermined threshold.
12. The method according to claim 1, wherein the step of adjusting the weight of the candidate word comprises the following step:
lowering the weight of the candidate word in the case where the input word has a missing character relative to the candidate word and the missing character and the preceding and following characters in the input word all belong to the left-hand input region, or all belong to the right-hand input region, of the keyboard.
13. The method according to claim 12, wherein the step of adjusting the weight of the candidate word further comprises the following steps:
determining whether the time interval between the inputs of the character preceding and the character following the missing character is less than a predetermined threshold;
further lowering the weight of the candidate word in the case where the time interval is not less than the predetermined threshold.
14. The method according to claim 1, wherein the step of adjusting the weight of the candidate word comprises the following steps:
in the case where the input word has a missing character relative to the candidate word and the missing character and the preceding and following characters in the input word all belong to the left-hand input region, or all belong to the right-hand input region, of the keyboard, determining whether the time interval between the inputs of the character preceding and the character following the missing character is less than a predetermined threshold;
lowering the weight of the candidate word in the case where the time interval is not less than the predetermined threshold.
15. An apparatus for improving spell checking by applying keyboard layout information, comprising:
a keyboard layout information acquisition module configured to obtain keyboard layout information, the keyboard layout information including layout information of left-hand and right-hand input regions;
a spell check module configured to provide a spell check suggestion for an input word, the spell check suggestion including a candidate word list;
a weight determination module, wherein the weight determination module includes:
a comparison module configured to compare a candidate word with the input word to determine whether the input word has an adjacent-character exchange or a missing character relative to the candidate word;
a weight adjustment module configured to adjust, in the case where it is determined that the input word has an adjacent-character exchange or a missing character relative to the candidate word, the weight of the candidate word based on the layout information of the left-hand and right-hand input regions, wherein the position of the candidate word in the candidate word list is determined based at least in part on the weight of the candidate word.
16. The apparatus according to claim 15, further comprising a replacement module configured to replace the input word with the first candidate word in the candidate word list, thereby performing automatic correction.
17. The apparatus according to claim 15, wherein the weight adjustment module includes:
a weight raising module configured to raise the weight of the candidate word in the case where the input word has an adjacent-character exchange relative to the candidate word and the exchanged adjacent characters belong respectively to the left-hand input region and the right-hand input region of the keyboard.
18. The apparatus according to claim 17, wherein the weight adjustment module further includes:
a time interval determination module configured to determine whether the time interval between the inputs of the exchanged adjacent characters is less than a predetermined threshold;
a further weight raising module configured to further raise the weight of the candidate word in the case where the time interval is less than the predetermined threshold.
19. The apparatus according to claim 15, wherein the weight adjustment module includes:
a time interval determination module configured to determine, in the case where the input word has an adjacent-character exchange relative to the candidate word and the exchanged adjacent characters belong respectively to the left-hand input region and the right-hand input region of the keyboard, whether the time interval between the inputs of the exchanged adjacent characters is less than a predetermined threshold;
a weight raising module configured to raise the weight of the candidate word in the case where the time interval is less than the predetermined threshold.
20. The apparatus according to claim 15, wherein the weight adjustment module includes:
a weight lowering module configured to lower the weight of the candidate word in the case where the input word has an adjacent-character exchange relative to the candidate word and the exchanged adjacent characters both belong to the left-hand input region, or both belong to the right-hand input region, of the keyboard.
21. The apparatus according to claim 20, wherein the weight adjustment module further includes:
a time interval determination module configured to determine whether the time interval between the inputs of the exchanged adjacent characters is less than a predetermined threshold;
a further weight lowering module configured to further lower the weight of the candidate word in the case where the time interval is not less than the predetermined threshold.
22. The apparatus according to claim 15, wherein the weight adjustment module includes:
a time interval determination module configured to determine, in the case where the input word has an adjacent-character exchange relative to the candidate word and the exchanged adjacent characters both belong to the left-hand input region, or both belong to the right-hand input region, of the keyboard, whether the time interval between the inputs of the exchanged adjacent characters is less than a predetermined threshold;
a weight lowering module configured to lower the weight of the candidate word in the case where the time interval is not less than the predetermined threshold.
23. The apparatus according to claim 15, wherein the weight adjustment module includes:
a weight raising module configured to raise the weight of the candidate word in the case where the input word has a missing character relative to the candidate word and the missing character and the preceding or following character in the input word belong respectively to the left-hand input region and the right-hand input region of the keyboard.
24. The apparatus according to claim 23, wherein the weight adjustment module further includes:
a time interval determination module configured to determine whether the time interval between the inputs of the character preceding and the character following the missing character is less than a predetermined threshold;
a further weight raising module configured to further raise the weight of the candidate word in the case where the time interval is less than the predetermined threshold.
25. The apparatus according to claim 15, wherein the weight adjustment module includes:
a time interval determination module configured to determine, in the case where the input word has a missing character relative to the candidate word and the missing character and the preceding or following character in the input word belong respectively to the left-hand input region and the right-hand input region of the keyboard, whether the time interval between the inputs of the character preceding and the character following the missing character is less than a predetermined threshold;
a weight raising module configured to raise the weight of the candidate word in the case where the time interval is less than the predetermined threshold.
26. The apparatus according to claim 15, wherein the weight adjustment module includes:
a weight lowering module configured to lower the weight of the candidate word in the case where the input word has a missing character relative to the candidate word and the missing character and the preceding and following characters in the input word all belong to the left-hand input region, or all belong to the right-hand input region, of the keyboard.
27. The apparatus according to claim 26, wherein the weight adjustment module further includes:
a time interval determination module configured to determine whether the time interval between the inputs of the character preceding and the character following the missing character is less than a predetermined threshold;
a further weight lowering module configured to further lower the weight of the candidate word in the case where the time interval is not less than the predetermined threshold.
28. The apparatus according to claim 15, wherein the weight adjustment module includes:
a time interval determination module configured to determine, in the case where the input word has a missing character relative to the candidate word and the missing character and the preceding and following characters in the input word all belong to the left-hand input region, or all belong to the right-hand input region, of the keyboard, whether the time interval between the inputs of the character preceding and the character following the missing character is less than a predetermined threshold;
a weight lowering module configured to lower the weight of the candidate word in the case where the time interval is not less than the predetermined threshold.
CN201310127321.3A 2013-04-15 2013-04-15 The method and apparatus that spell check is improved by application keyboard layout information Active CN104102625B (en)

Publications (2)

Publication Number Publication Date
CN104102625A CN104102625A (en) 2014-10-15
CN104102625B (en) 2017-07-04

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant