CN104102625B - The method and apparatus that spell check is improved by application keyboard layout information - Google Patents
The method and apparatus that spell check is improved by application keyboard layout information Download PDFInfo
- Publication number
- CN104102625B CN104102625B CN201310127321.3A CN201310127321A CN104102625B CN 104102625 B CN104102625 B CN 104102625B CN 201310127321 A CN201310127321 A CN 201310127321A CN 104102625 B CN104102625 B CN 104102625B
- Authority
- CN
- China
- Prior art keywords
- character
- word
- weight
- candidate word
- input
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Abstract
Method and apparatus the present invention relates to improve spell check by application keyboard layout information.The method according to the invention includes:Keyboard layout information is obtained, the keyboard layout information includes the layout information of left hand and right hand input area;Word for input provides spell check suggestion, and the spell check suggestion includes candidate word list;The weight of the candidate word in the candidate word list is adjusted based on the layout information of left hand and right hand input area, position of the candidate word in the candidate word list is to be at least partially based on the weight of the candidate word and determine.Have benefited from the present invention, the efficiency of spell check can be improved.
Description
Technical field
Method and apparatus the present invention relates to be used to improve spell check.More particularly, it relates to pass through apply with
The method and apparatus that left hand and right hand is input into related keyboard layout information to improve spell check.
Background technology
In various text processing applications(Such as MicrosoftApplication program)And comprising word processing work(
The application of energy(Mailbox client, electronic dictionary, search engine etc.)In, spelling checker is widely used, spelling inspection
Device is looked into for marking the word of possible misspelling, and is preferably provided candidate word list or the word is corrected automatically.
In spell check technical field, there are various modes to be ranked up candidate word.A kind of mode is to calculate
Editing distance between candidate word and the word of input(edit distance)(It is also called Levenshtein distances), and be based on
Be ranked up for candidate word and user is presented to by the editing distance.Editing distance refers to that a word is become into another word institute
The minimum spelling for needing changes number of times.Spelling change can include for a character being substituted for another character, one word of insertion
Symbol, one character of deletion etc..For example, " kitten " one word is become " sitting " as follows:
1.sitten(k→s)
2.sittin(e→i)
3.sitting(→g)
Therefore, it is 3 from " kitten " to the editing distance of " sitting ".
Editing distance is based entirely on the comparing between two words, the characteristic without considering input equipment.As a result, candidate word
The order of the candidate word in list may not be order optimal for a user.
In addition, giving a kind of characteristic for considering input equipment in United States Patent (USP) No.US7957955 to enter candidate word
The method of row sequence.When by keyboard to be input into, it is considered to which keyboard layout is ranked up to candidate word.Specifically,
Possible candidate word suggestion is recognized based on the character of input and the corresponding adjacent character on keyboard layout, and it is entered
Row is scored and is sorted.Fig. 1 shows and uses an example of the method.In Fig. 1, reference " 602 " indicates that user is defeated
The character string " rheatre " for entering, reference " 604 " indicate in keyboard respectively with the character string in each character phase
Adjacent character, reference " 608 " indicates the candidate word list produced according to character permutations.
However, inventors herein have recognized that, when user is familiar to keyboard layout, user hardly pushes the wrong
Key.Conversely, user may quickly be input into, and when being input into by keyboard using two hands, it is susceptible to
Because two hands are with the order that overturns or press two correct keys etc. simultaneously and cause the situation of misspelling.And existing skill
The method of art does not all account for influence of two characteristics of hand input to spell check, therefore the order of candidate word is not completely
Rationally.This may cause the precedence of the desired candidate word of user to compare rearward so that effectively wrong word can not be corrected.
The content of the invention
Accordingly, it would be desirable to a kind of method taken left hand and right hand input characteristics and keyboard layout into consideration to improve spell check
And equipment.
According to an aspect of the present invention, in order to solve the above-mentioned technical problem, the present invention provides a kind of by application keyboard
Come the method for improving spell check, it includes the steps to layout information:Obtain keyboard layout information, the keyboard layout letter
Breath includes the layout information of left hand and right hand input area;Word for input provides spell check suggestion, the spell check suggestion
Including candidate word list;Candidate word in the candidate word list is adjusted based on the layout information of left hand and right hand input area
Weight, position of the candidate word in the candidate word list is to be at least partially based on the weight of the candidate word and determine.
According to another aspect of the present invention, in order to solve the above-mentioned technical problem, the present invention provides a kind of by application keyboard
Come the equipment of improving spell check, it includes layout information:Keyboard layout information acquisition module, is configured as obtaining keyboard layout
Information, the keyboard layout information includes the layout information of left hand and right hand input area;Spell check module, is configured as being directed to
The word of input provides spell check suggestion, and the spell check suggestion includes candidate word list;Weight determination module, is configured as
The weight of the candidate word in the candidate word list is adjusted based on the layout information of left hand and right hand input area, candidate word is in institute
It is to be at least partially based on the weight of the candidate word and determine to state the position in candidate word list.
Method and apparatus according to the invention has taken left hand and right hand input characteristics and keyboard layout into consideration such that it is able to carry
For the more reasonably sequence for candidate word, therefore improve the efficiency of spell check.
According to following description referring to the drawings, other property features of the invention and advantage will become apparent.
Brief description of the drawings
The accompanying drawing for being incorporated in specification and constituting a part for specification shows embodiments of the invention, and with retouch
To state be used for together and illustrate principle of the invention.
Fig. 1 shows an example using the spell checking methods of prior art.
Fig. 2 is the block diagram of the hardware configuration of the computer system for illustrating the ability to implementation embodiments of the invention.
Fig. 3 shows a kind of qwerty keyboard layout.
Fig. 4 shows Microsoft of the prior artApplication program is given for the word " amin " being input into
The result of the spell check for going out.
Fig. 5 shows the result of the spell check provided for the word " amin " being input into of the invention.
Fig. 6 shows Microsoft of the prior artApplication program is given for the word " domin " being input into
The result of the spell check for going out.
Fig. 7 shows the result of the spell check provided for the word " domin " being input into of the invention.
Fig. 8 is showed and according to an embodiment of the invention is improved spell check by application keyboard layout information
The flow chart of method.
Fig. 9 is showed and according to an embodiment of the invention is improved spell check by application keyboard layout information
The block diagram of equipment.
Specific embodiment
Describe a preferred embodiment of the present invention in detail below with reference to the accompanying drawings.It is not details and work(required in this invention
Can be omitted, so as not to understanding of the invention can be obscured.
Note that similar reference numeral refers to the similar project in figure with letter, thus once in a width figure
A project is defined, avoids the need for being discussed in the figure after.
In the disclosure, term " first ", " second " etc. are only used only for being made a distinction between element or step, and simultaneously
It is not intended to represent time sequencing, priority or importance.
(The hardware configuration of computer system)
Fig. 2 is the block diagram of the hardware configuration of the computer system 1000 for illustrating the ability to implementation embodiments of the invention.
As shown in Figure 2, computer system includes computer 1110.Computer 1110 includes connecting via system bus 1121
Processing unit 1120, system storage 1130, fixed non-volatile memory interface 1140, the removable non-volatile memories for connecing
Device interface 1150, user input interface 1160, network interface 1170, video interface 1190 and peripheral interface 1195.
System storage 1130 includes ROM(Read-only storage)1131 and RAM(Random access memory)1132.BIOS
(Basic input output system)During 1133 reside in ROM1131.Operating system 1134, application program 1135, other program modules
1136 and during some routine datas 1137 reside in RAM1132.
The fixed non-volatile memory 1141 of such as hard disk etc is connected to fixed non-volatile memory interface 1140.
Fixed non-volatile memory 1141 for example can be with storage program area 1144, application program 1145, other program modules 1146
With some routine datas 1147.
The removable non-volatile memory of such as floppy disk 1151 and CD-ROM drive 1155 etc is connected to
Removable non-volatile memory interface 1150.For example, diskette 1 152 can be inserted into floppy disk 1151, and CD
(CD)1156 can be inserted into CD-ROM drive 1155.
The input equipment of such as microphone 1161 and keyboard 1162 etc is connected to user input interface 1160.
Computer 1110 can be connected to remote computer 1180 by network interface 1170.For example, network interface 1170
Remote computer 1180 can be connected to via LAN 1171.Or, network interface 1170 may be coupled to modem
(Modulator-demodulator)1172, and modem 1172 is connected to remote computer 1180 via wide area network 1173.
Remote computer 1180 can include the memory 1181 of such as hard disk etc, its storage remote application
1185。
Video interface 1190 is connected to monitor 1191.
Peripheral interface 1195 is connected to printer 1196 and loudspeaker 1197.
Computer system shown in Fig. 2 is merely illustrative and is never intended to enter invention, its application, or uses
Any limitation of row.
Computer system shown in Fig. 2 can be incorporated in any embodiment, as stand-alone computer, or be able to can also make
It is the processing system in equipment, one or more unnecessary components can be removed, it is also possible to be added to one or more
Individual additional component.
(Principle of the invention)
As it was previously stated, inventors herein have recognized that, when user is familiar to keyboard layout, user hardly presses
Wrong key.Conversely, user may be susceptible to the error in terms of two hand cooperations(Such as, two hands with overturn order or
Press two correct keys, etc. simultaneously), so as to cause misspelling.Order or simultaneously with two hands to overturn below
Adjacent character caused by two correct keys is pressed to exchange or describe original of the invention in detail in case of character is lacked
Reason.
When user passes through input through keyboard character using two hands, in former and later two continuous characters respectively positioned at keyboard
In the case of in left hand input area and right hand input area, easily pushed button simultaneously or with opposite suitable due to two hands
Sequence is pushed button, therefore, a character in the two characters may be lacked, or the input sequence of the two characters is probably
Reverse.That is, many misspellings are the character missings due to occurring when right-hand man is input into continuous character respectively
Caused by being exchanged with adjacent character.In addition, there is character missing or adjacent character when continuous character is input into by a hand
The probability of exchange is relatively low, and because user is generally familiar to keyboard layout, therefore user can seldom press the wrong button.
Therefore, spell check can be improved by considering the layout of the left hand and right hand input area of keyboard.That is, can
Judged with the layout information of the left hand and right hand input area based on keyboard input word and candidate word between whether there is due to
Left hand and right hand input cause character missing or adjacent character exchange etc. situation.If it is present the candidate word is correct word
Probability it is larger, therefore the weight of the candidate word can be improved, that is, lift precedence of the candidate word in candidate word list.
Principle of the invention and the present invention are described in detail below in conjunction with specific example and statistics relative to existing
There is the advantage of technology.
Keyboard described in this specification can be the dummy keyboard for touching screen display(Also referred to as soft keyboard), or
Physical keyboard(Also referred to as hard manual).Keyboard layout can be in accordance with conventional QWERTY layout or its modification, or can also
It is other layouts.Principle of the invention, implementation are illustrated as keyboard layout using QWERTY layout as shown in Figure 3 below
Example, specific example etc., but those skilled in the art understand, and the present invention is not limited to QWERTY layout.In addition, such as this area
In it is known, shown in Fig. 3 qwerty keyboard layout in, with scheme centre black line be boundary, it is defeated that left side button belongs to left hand
Enter region, and the right button belongs to right hand input area.
1st, the example that adjacent character is exchanged
User wants input " main " one word, however, when two hands rapidly input, " m " key that the right hand is pressed somewhat falls behind
In " a " key that left hand is pressed.Therefore, input becomes " amin ".Now, the word " amin " being input into is carried out with candidate word " main "
Compare, it is found that " a " have exchanged position with " m ".
Fig. 4 shows Microsoft of the prior artApplication program is given for the word " amin " being input into
The result of the spell check for going out.From fig. 4, it can be seen that in candidate word list, " amen " is the candidate word of the top, and
" main " is the 3rd candidate word.In this case, the candidate word of the top and that word of the desired selection of non-user.
According to the present invention, the information of the right-hand man's input area based on qwerty keyboard, it can be determined that " m " and " a " be by
Different hand inputs, thus improve the weight of candidate word " main ".Fig. 5 shows the word for being input into of the invention
The result of the spell check that " amin " is given.From fig. 5, it can be seen that in candidate word list, " main " is located at the top, and
Candidate word " main " is the word of user view input.
As seen from the above, compared with prior art, present invention improves over spell check.Tool of the inventor for the example
Body is explained as follows:When user passes through input through keyboard character using two hands, it should continuous by two of the same hand input
The situation that the input sequence of character is exchanged can seldom occur;And in some cases, when user passes through keyboard using two hands
When being rapidly input into character, two hands can very closely press two buttons in time sometimes, and this may cause two hands
Two reversed orders of character of input.That is, this exchange by adjacent character caused by two hand inputs should be more normal
The input error seen, therefore the weight of the corresponding candidate word of raising can cause that putting in order for candidate word is more reasonable.
2nd, the example of character missing
User wants input " domain " one word.However, after " dom " is input into, due to right for most people
Hand is quicker than left hand, therefore, user may press " a " key and press " i " key by the right hand by left hand simultaneously.As a result, " a " does not have
It is transfused to, so that the word of input becomes " domin ".
Fig. 6 shows Microsoft of the prior artApplication program is given for the word " domin " being input into
The result of the spell check for going out.From fig. 6, it can be seen that in candidate word list, " doming " is the candidate word of the top, and
" domain " is the 3rd candidate word.In this case, the candidate word of the top and that word of the desired selection of non-user.
According to the present invention, the information of the right-hand man's input area based on qwerty keyboard, it can be determined that candidate word
" a " of missing from previous character " m " and latter character " i " is input into by different hands in " domain ", thus improves candidate
The weight of word " domain ".Fig. 7 shows the knot of the spell check provided for the word " domin " being input into of the invention
Really.From figure 7 it can be seen that in candidate word list, " domain " is located at the top, and candidate word " domain " is user's meaning
Scheme the word of input.
As seen from the above, compared with prior art, present invention improves over spell check.Tool of the inventor for the example
Body is explained as follows:When user passes through input through keyboard character using two hands, two feelings of button are only pressed by a hand simultaneously
Shape can seldom occur;And in some cases, when user is rapidly input into character using two hands by keyboard, two sometimes
Hand can simultaneously press two buttons, and this may cause one in two two characters of hand input not to be transfused to.Namely
Say, this also should be more typical input error by character missing caused by two hand inputs, therefore improve corresponding candidate word
Weight can cause that putting in order for candidate word is more reasonable.
3rd, statistics
On the website that network address is www.spellcheck.net, there is the statistics of the spelling for each word.From
In January, 2010 in June, 2012, the website have collected more than 15,411,110 spelling information, wherein:
- for " main " most common misspellings it is " mian " (31%) and " amin " (11%).Both situations are all
Caused by being exchanged due to the adjacent character that two hand inputs cause, about 42% is accounted for altogether.
- for many other words, the adjacent character that its misspellings also causes mainly due to two hand inputs is exchanged
Caused by character missing.
Table 1 below lists realityThe all misspellings occurred in form.
Table 1
As can be seen from the above table, 5 in 18 mistakes belong to the situation of character missing, and this 5 character missing errors
In 3 can preferably be corrected by principle of the invention.
In addition, 4 in the 18 mistakes situations for belonging to adjacent character exchange, and all this 4 adjacent characters exchanges
Mistake can preferably be corrected by principle of the invention.
Sum it up, can improve above-mentioned actual by using the present inventionSpell check knot in form
In fruit 38.8%.
Based on above-mentioned principle of the invention, it is proposed that a kind of method and apparatus of improvement spell check, its specific descriptions
It is as follows.
(The method and apparatus for improving spell check)
Fig. 8 is showed and according to an embodiment of the invention is improved spell check by application keyboard layout information
The flow chart of method.
As shown in figure 8, in step 610, obtaining keyboard layout information, the keyboard layout information is input into including left hand and right hand
The layout information in region.
As described above, keyboard layout can be QWERTY layout or its modification, or can also be other layouts.The present invention
Go for various keyboards(Including soft keyboard and hard manual)As long as the keyboard can be divided into left hand input area and the right side
Hand input area.The layout information of right-hand man's input area can be included only in the left hand input area including keyboard
The character included in character and right hand input area, or can also include that the position between these characters is closed in some cases
System etc..
In step 620, the word for input provides spell check suggestion, and spell check suggestion includes that candidate word is arranged
Table.
The candidate word list can be obtained by various spell checking methods as known in the art.For example, generally,
, it is necessary to the word of input is compared with each word in dictionary when producing candidate word list, and calculate the word of input and wait
Select the editing distance between word.Then, the editing distance according to candidate word is ranked up to it, so as to obtain candidate word list.
In act 630, the candidate word in the candidate word list is adjusted based on the layout information of left hand and right hand input area
Weight, position of the candidate word in the candidate word list is to be at least partially based on the weight of the candidate word and determine.Spelling
In the case that the device that writes a self-criticism also is corrected automatically to misspelling, the method according to the invention is also performed after step 630
Following steps:The word of input is replaced using the candidate word of foremost one in the candidate word list, so as to be corrected automatically.
In a kind of implementation of step 630, candidate word can be compared with the word of input first, it is defeated to determine
The word for entering relative to candidate word whether have adjacent character exchange or missing character, then it is determined that input word relative to
Candidate word have adjacent character exchange or missing character in the case of, the layout information based on left hand and right hand input area come
Adjust the weight of candidate word.
In the implementation, it is preferable that can be exchanged and character deletion condition for adjacent character respectively, based on it is left,
The layout information of right hand input area adjusts the weight of candidate word.
As it was previously stated, it should be wrong more typical input to be exchanged with character missing by adjacent character caused by two hand inputs
By mistake, there is the probability that adjacent character is exchanged and character is lacked when being input into continuous character by a hand relatively low.
When therefore, being exchanged for adjacent character, the left hand that can be belonging respectively to keyboard in the adjacent character for exchanging is defeated
In the case of entering region and right hand input area, the weight of candidate word is improved;Or, belong to keyboard in the adjacent character for exchanging
Left hand input area or right hand input area in the case of, reduce candidate word weight.
Additionally, when being lacked for character, can be in the previous character or latter in word of the character of missing with input
In the case that character is belonging respectively to the left hand input area and right hand input area of keyboard, the weight of candidate word is improved;Or,
Previous character and latter character in the character of missing and the word of input belong to left hand input area or the right hand input of keyboard
In the case of region, the weight of candidate word is reduced.
In addition, as described above, exchanging the misspelling lacked with character one by adjacent character caused by two hand inputs
It is occur in the case where user rapidly inputs in the case of a little.Therefore, in some cases it may with reference to right-hand man input area
Time interval between the layout information and character input in domain adjusts the weight of candidate word.In some cases, by determining
Time interval between two character inputs, can more accurately determine whether that there is this adjacent character exchanges scarce with character
Lose, so as to be more effectively carried out spell check.
Therefore, exchanged and character deletion condition for adjacent character, as described above according to right-hand man's input area
Layout information is improved or reduced after the weight of candidate word, it is preferable that may further determine that exchange character input it
Between time interval or with the character of missing close to two inputs of character between time interval whether less than predetermined
Threshold value.Then, the weight of candidate word is further improved/reduced according to the determination result.In some cases, can also be to defeated
Time interval between the input of each two character in the word for entering is averaging, and determines whether the average time interval is less than
Predetermined threshold value.
When specifically, it is preferable that being exchanged for adjacent character, as described above in the adjacent character point for exchanging
After not belonging to the weight that candidate word is improved in the case of the left hand input area and right hand input area of keyboard, it is determined that exchange
Whether the time interval between the input of adjacent character is less than predetermined threshold value, and is less than predetermined threshold in the time interval
In the case of value, the weight of candidate word is further improved.Alternately, it is preferable that be belonging respectively to key in the adjacent character for exchanging
In the case of the left hand input area and right hand input area of disk, it is determined that the time interval between the input of the adjacent character for exchanging
Whether predetermined threshold value is less than, and in the case where the time interval is less than predetermined threshold value, improves the weight of candidate word.
In addition, belonging to left hand input area or the right hand input area of keyboard in the adjacent character for exchanging as described above
In the case of domain after the weight of reduction candidate word, it is determined that whether the time interval between the input of the adjacent character for exchanging is less than
Predetermined threshold value, and in the case where the time interval is not less than predetermined threshold value, further reduce the weight of candidate word.
Alternately, it is preferable that belong to the left hand input area of keyboard or the situation of right hand input area in the adjacent character for exchanging
Under, it is determined that whether the time interval between the input of the adjacent character for exchanging is less than predetermined threshold value, and between the time
In the case of not less than predetermined threshold value, the weight of candidate word is reduced.
It is previous in the character of missing with the word of input as described above preferably for the situation of character missing
Character or latter character improve the power of candidate word in the case of being belonging respectively to the left hand input area and right hand input area of keyboard
After weight, determine the character of the missing previous character and latter character input between time interval whether less than predetermined
Threshold value, and the time interval be less than predetermined threshold value in the case of, further improve candidate word weight.It is alternative
Ground, it is preferable that previous character or latter character in the character of missing with the word of input are belonging respectively to the left hand input of keyboard
In the case of region and right hand input area, determine between the input of previous character and latter character of the character of the missing
Whether time interval is less than predetermined threshold value, and in the case where the time interval is less than predetermined threshold value, improves candidate
The weight of word.
In addition, belonging to key in the previous character and latter character as described above in the character of missing with the word of input
Reduced in the case of the left hand input area or right hand input area of disk after the weight of candidate word, determine the character of the missing
Previous character and the input of latter character between time interval whether be less than predetermined threshold value, and in the time interval
In the case of not less than predetermined threshold value, the weight of candidate word is further reduced.Alternately, it is preferable that in the character of missing
The left hand input area of keyboard or the situation of right hand input area are belonged to the previous character and latter character in the word of input
Under, determine the character of the missing previous character and latter character input between time interval whether less than predetermined threshold
Value, and in the case where the time interval is not less than predetermined threshold value, reduce the weight of candidate word.
Note that above-mentioned each can in conjunction or individually the step of improve/reduce the weight of candidate word
Perform.
Following table 2 gives a specific example of weight adjustment:
Table 2
Obviously, those skilled artisans will appreciate that the mode and numerical value of weight adjustment are not limited to be shown in above table
Example.
Fig. 9 is showed and according to an embodiment of the invention is improved spell check by application keyboard layout information
The block diagram of equipment.
As shown in figure 9, the spelling that improved by application keyboard layout information of exemplary embodiment of the invention is examined
The equipment 700 looked into includes:Keyboard layout information acquisition module 710, spell check module 720 and weight determination module 730.
More specifically, keyboard layout information acquisition module 710 is configured as obtaining keyboard layout information, the keyboard layout
Information includes the layout information of left hand and right hand input area.
Spell check module 720 is configured as providing spell check suggestion for the word of input, spell check suggestion bag
Include candidate word list.
Weight determination module 730 is configured as the layout information based on left hand and right hand input area to adjust candidate word row
The weight of the candidate word in table, position of the candidate word in the candidate word list be at least partially based on the candidate word weight and
Determine.
Unit in the equipment 700 can be configured as performing each step shown by the flow chart in Fig. 8.
Unit described above is the exemplary and/or preferred module for implementing the treatment described in the disclosure.This
A little units can be hardware cell(Such as field programmable gate array(FPGA), digital signal processor or application specific integrated circuit
Deng)And/or software module(Such as computer-readable program).List for implementing each step is not described at large below
Unit.As long as however, the step of having certain treatment of execution, it is possible to have the corresponding functional module or list for implementing same treatment
Unit(By hardware and/or software implementation).Limited by all combinations of described step and unit corresponding with these steps
Fixed technical scheme is all included in present disclosure, if they constitute these technical schemes be it is complete and
It is applicable.
Additionally, the said equipment 700 being made up of various units can be incorporated into such as computer, move as functional module
In the electronic installation of mobile phone, hand-held device etc., as long as the need for existing for spelling-checker in the electronic installation i.e.
Can.In addition to the equipment 700, the electronic installation is it is of course possible to have other hardware or software part.
As described above, method and apparatus according to the invention is applied to various spelling checkers and comprising spell check work(
The various applications of energy or various devices.
The method of the present invention and equipment can in many ways be implemented.For example, can by software, hardware, firmware,
Or its any combinations implements the method for the present invention and equipment.The order of above-mentioned method and step is merely illustrative, the present invention
Method and step be not limited to order described in detail above, clearly state unless otherwise.Additionally, in some embodiments
In, the present invention can also be implemented as recording program in the recording medium, and it is included for realizing the method according to the invention
Machine readable instructions.Thus, the present invention also covering storage is used for the recording medium of the program for realizing the method according to the invention.
Although passed through example illustrates some specific embodiments of the invention in detail, those skilled in the art should
Understand, above-mentioned example is intended merely to be illustrative and do not limit the scope of the invention.It should be appreciated by those skilled in the art that above-mentioned
Embodiment can be changed in the case where the scope of the present invention and essence is not departed from.The scope of the present invention is by appended power
Profit requires what is limited.
Claims (28)
1. a kind of method that spell check is improved by application keyboard layout information, including the steps:
Keyboard layout information is obtained, the keyboard layout information includes the layout information of left hand and right hand input area;
Word for input provides spell check suggestion, and the spell check suggestion includes candidate word list;
Candidate word is compared with the word of input, whether is exchanged with adjacent character relative to candidate word with the word for determining input
Or the character of missing;It is determined that the word of input has a case that the character that adjacent character is exchanged or lacked relative to candidate word
Under, the weight of candidate word is adjusted based on the layout information of left hand and right hand input area, candidate word is in the candidate word list
Position is to be at least partially based on the weight of the candidate word and determine.
2. method according to claim 1, also including using the candidate word of foremost one in the candidate word list
The word of input is replaced, so as to be corrected automatically.
3. method according to claim 1, wherein the step of weight of adjustment candidate word includes the steps:
Relative to candidate word there is the adjacent character that adjacent character is exchanged and exchanged to be belonging respectively to a left side for keyboard in the word of input
In the case of hand input area and right hand input area, the weight of candidate word is improved.
4. method according to claim 3, wherein the step of weight of adjustment candidate word also includes the steps:
It is determined that whether the time interval between the input of the adjacent character for exchanging is less than predetermined threshold value;
In the case where the time interval is less than predetermined threshold value, the weight of candidate word is further improved.
5. method according to claim 1, wherein the step of weight of adjustment candidate word includes the steps:
Relative to candidate word there is the adjacent character that adjacent character is exchanged and exchanged to be belonging respectively to a left side for keyboard in the word of input
In the case of hand input area and right hand input area, it is determined that whether the time interval between the input of the adjacent character for exchanging is small
In predetermined threshold value;
In the case where the time interval is less than predetermined threshold value, the weight of candidate word is improved.
6. method according to claim 1, wherein the step of weight of adjustment candidate word includes the steps:
Relative to candidate word there is the adjacent character that adjacent character is exchanged and exchanged to belong to the left hand of keyboard in the word of input
In the case of input area or right hand input area, the weight of candidate word is reduced.
7. method according to claim 6, wherein the step of weight of adjustment candidate word also includes the steps:
It is determined that whether the time interval between the input of the adjacent character for exchanging is less than predetermined threshold value;
In the case where the time interval is not less than predetermined threshold value, the weight of candidate word is further reduced.
8. method according to claim 1, wherein the step of weight of adjustment candidate word includes the steps:
Relative to candidate word there is the adjacent character that adjacent character is exchanged and exchanged to belong to the left hand of keyboard in the word of input
In the case of input area or right hand input area, it is determined that whether the time interval between the input of the adjacent character for exchanging is less than
Predetermined threshold value;
In the case where the time interval is not less than predetermined threshold value, the weight of candidate word is reduced.
9. method according to claim 1, wherein the step of weight of adjustment candidate word includes the steps:
Word in input has the character of missing relative to candidate word and previous in word of the character of the missing with input
In the case that character or latter character are belonging respectively to the left hand input area and right hand input area of keyboard, the power of candidate word is improved
Weight.
10. method according to claim 9, wherein the step of weight of adjustment candidate word also includes the steps:
Determine the character of the missing previous character and latter character input between time interval whether less than predetermined
Threshold value;
In the case where the time interval is less than predetermined threshold value, the weight of candidate word is further improved.
11. methods according to claim 1, wherein the step of weight of adjustment candidate word includes the steps:
Word in input has the character of missing relative to candidate word and previous in word of the character of the missing with input
In the case that character or latter character are belonging respectively to the left hand input area and right hand input area of keyboard, the missing is determined
Whether the time interval between the input of the previous character and latter character of character is less than predetermined threshold value;
In the case where the time interval is less than predetermined threshold value, the weight of candidate word is improved.
12. methods according to claim 1, wherein the step of weight of adjustment candidate word includes the steps:
Word in input has the character of missing relative to candidate word and previous in word of the character of the missing with input
In the case that character and latter character belong to the left hand input area or right hand input area of keyboard, the power of candidate word is reduced
Weight.
13. methods according to claim 12, wherein the step of weight of adjustment candidate word also includes the steps:
Determine the character of the missing previous character and latter character input between time interval whether less than predetermined
Threshold value;
In the case where the time interval is not less than predetermined threshold value, the weight of candidate word is further reduced.
14. methods according to claim 1, wherein the step of weight of adjustment candidate word includes the steps:
Word in input has the character of missing relative to candidate word and previous in word of the character of the missing with input
In the case that character and latter character belong to the left hand input area or right hand input area of keyboard, the word of the missing is determined
Whether the time interval between the previous character of symbol and the input of latter character is less than predetermined threshold value;
In the case where the time interval is not less than predetermined threshold value, the weight of candidate word is reduced.
A kind of 15. equipment that spell check is improved by application keyboard layout information, including:
Keyboard layout information acquisition module, is configured as obtaining keyboard layout information, and the keyboard layout information includes left hand and right hand
The layout information of input area;
Spell check module, is configured as providing spell check suggestion for the word of input, and the spell check suggestion includes waiting
Select word list;
Weight determination module, wherein the weight determination module includes:
Comparison module, is configured as being compared candidate word with the word of input, is relative to candidate word with the word for determining input
The no character for being exchanged with adjacent character or being lacked;
Weight adjusting module, be configured as it is determined that input word relative to candidate word have adjacent character exchange or missing
In the case of character, the weight of candidate word is adjusted based on the layout information of left hand and right hand input area, candidate word is in the candidate
Position in word list is to be at least partially based on the weight of the candidate word and determine.
16. equipment according to claim 15, also including replacement module, the replacement module is configured with the time
Select the candidate word of foremost one in word list to replace the word of input, so as to be corrected automatically.
17. equipment according to claim 15, wherein the weight adjusting module includes:
Weight improves module, is configured as word in input adjacent relative to what candidate word had that adjacent character exchanges and exchange
In the case that character is belonging respectively to the left hand input area and right hand input area of keyboard, the weight of candidate word is improved.
18. equipment according to claim 17, wherein the weight adjusting module also includes:
Whether time interval determining module, be configured to determine that the time interval between the input of the adjacent character of exchange less than pre-
Fixed threshold value;
Weight further improves module, is configured as, in the case where the time interval is less than predetermined threshold value, further carrying
The weight of candidate word high.
19. equipment according to claim 15, wherein the weight adjusting module includes:
Time interval determining module, is configured as having what adjacent character was exchanged and exchanged relative to candidate word in the word of input
In the case that adjacent character is belonging respectively to the left hand input area and right hand input area of keyboard, it is determined that the adjacent character for exchanging
Whether the time interval between input is less than predetermined threshold value;
Weight improves module, is configured as, in the case where the time interval is less than predetermined threshold value, improving the power of candidate word
Weight.
20. equipment according to claim 15, wherein the weight adjusting module includes:
Weight reduction module, is configured as word in input adjacent relative to what candidate word had that adjacent character exchanges and exchange
In the case that character belongs to the left hand input area or right hand input area of keyboard, the weight of candidate word is reduced.
21. equipment according to claim 20, wherein the weight adjusting module also includes:
Whether time interval determining module, be configured to determine that the time interval between the input of the adjacent character of exchange less than pre-
Fixed threshold value;
Weight further reduces module, is configured as in the case where the time interval is not less than predetermined threshold value, further
Reduce the weight of candidate word.
22. equipment according to claim 15, wherein the weight adjusting module includes:
Time interval determining module, is configured as having what adjacent character was exchanged and exchanged relative to candidate word in the word of input
In the case that adjacent character belongs to the left hand input area or right hand input area of keyboard, it is determined that the adjacent character for exchanging is defeated
Whether the time interval between entering is less than predetermined threshold value;
Weight reduction module, is configured as, in the case where the time interval is not less than predetermined threshold value, reducing candidate word
Weight.
23. equipment according to claim 15, wherein the weight adjusting module includes:
Weight improves module, and the word being configured as in input has the character of missing and the word of the missing relative to candidate word
Previous character or latter character in symbol and the word of input are belonging respectively to the left hand input area and right hand input area of keyboard
In the case of, improve the weight of candidate word.
24. equipment according to claim 23, wherein the weight adjusting module also includes:
Time interval determining module, is configured to determine that between the previous character of the character of the missing and the input of latter character
Time interval whether be less than predetermined threshold value;
Weight further improves module, is configured as, in the case where the time interval is less than predetermined threshold value, further carrying
The weight of candidate word high.
25. equipment according to claim 15, wherein the weight adjusting module includes:
Time interval determining module, the word being configured as in input has the character and the missing of missing relative to candidate word
Character and the word of input in previous character or latter character be belonging respectively to left hand input area and the right hand input area of keyboard
In the case of domain, determine the character of the missing previous character and latter character input between time interval whether be less than
Predetermined threshold value;
Weight improves module, is configured as, in the case where the time interval is less than predetermined threshold value, improving the power of candidate word
Weight.
26. equipment according to claim 15, wherein the weight adjusting module includes:
Weight reduction module, the word being configured as in input has the character of missing and the word of the missing relative to candidate word
Previous character and latter character in symbol and the word of input belong to the left hand input area of keyboard or the feelings of right hand input area
Under condition, the weight of candidate word is reduced.
27. equipment according to claim 26, wherein the weight adjusting module also includes:
Time interval determining module, is configured to determine that between the previous character of the character of the missing and the input of latter character
Time interval whether be less than predetermined threshold value;
Weight further reduces module, is configured as in the case where the time interval is not less than predetermined threshold value, further
Reduce the weight of candidate word.
28. equipment according to claim 15, wherein the weight adjusting module includes:
Time interval determining module, the word being configured as in input has the character and the missing of missing relative to candidate word
Character and the word of input in previous character and latter character belong to the left hand input area or right hand input area of keyboard
In the case of, determine the character of the missing previous character and latter character input between time interval whether less than pre-
Fixed threshold value;
Weight reduction module, is configured as, in the case where the time interval is not less than predetermined threshold value, reducing candidate word
Weight.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201310127321.3A CN104102625B (en) | 2013-04-15 | 2013-04-15 | The method and apparatus that spell check is improved by application keyboard layout information |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201310127321.3A CN104102625B (en) | 2013-04-15 | 2013-04-15 | The method and apparatus that spell check is improved by application keyboard layout information |
Publications (2)
Publication Number | Publication Date |
---|---|
CN104102625A CN104102625A (en) | 2014-10-15 |
CN104102625B true CN104102625B (en) | 2017-07-04 |
Family
ID=51670790
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201310127321.3A Active CN104102625B (en) | 2013-04-15 | 2013-04-15 | The method and apparatus that spell check is improved by application keyboard layout information |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN104102625B (en) |
Families Citing this family (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN105718427B (en) * | 2016-01-15 | 2019-12-24 | 联想(北京)有限公司 | Information processing method and electronic equipment |
CN111078028B (en) * | 2019-12-09 | 2023-11-21 | 科大讯飞股份有限公司 | Input method, related device and readable storage medium |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101371253A (en) * | 2005-04-25 | 2009-02-18 | 微软公司 | Method and system for generating spelling suggestions |
CN101625678A (en) * | 2008-07-11 | 2010-01-13 | 英业达股份有限公司 | System and method for checking spelling |
CN101641661A (en) * | 2007-01-05 | 2010-02-03 | 苹果公司 | Method and system for providing word recommendations for text input |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8797266B2 (en) * | 2011-05-16 | 2014-08-05 | John Zachary Dennis | Typing input systems, methods, and devices |
-
2013
- 2013-04-15 CN CN201310127321.3A patent/CN104102625B/en active Active
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101371253A (en) * | 2005-04-25 | 2009-02-18 | 微软公司 | Method and system for generating spelling suggestions |
CN101641661A (en) * | 2007-01-05 | 2010-02-03 | 苹果公司 | Method and system for providing word recommendations for text input |
CN101625678A (en) * | 2008-07-11 | 2010-01-13 | 英业达股份有限公司 | System and method for checking spelling |
Also Published As
Publication number | Publication date |
---|---|
CN104102625A (en) | 2014-10-15 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN1259632C (en) | Method and system for filtering & selecting from a candidate listing generated by random inputting method | |
US10156981B2 (en) | User-centric soft keyboard predictive technologies | |
JP4998219B2 (en) | Form recognition program, form recognition apparatus, and form recognition method | |
US8024280B2 (en) | Academic filter | |
CN103365573B (en) | A kind of method and apparatus that many key input characters are identified | |
US10755043B2 (en) | Method for revising errors by means of correlation decisions between character strings | |
CN106233375A (en) | User version based on mass-rent input starts anew to learn language model | |
US10242296B2 (en) | Method and device for realizing chinese character input based on uncertainty information | |
TW200842613A (en) | Spell-check for a keyboard system with automatic correction | |
CA2503636A1 (en) | A method of formatting documents | |
DE202008000265U1 (en) | Portable communication device | |
KR101228865B1 (en) | Document display apparatus and method for extracting key word in document | |
JP6219935B2 (en) | Method, controller and apparatus for composing words | |
JP2001325562A (en) | Image recognizing device, image forming device, image recognizing method, and computer-readable recording medium with image reocgnizing program stored therein | |
CN104102625B (en) | The method and apparatus that spell check is improved by application keyboard layout information | |
JP2001297077A (en) | Line-spacing controllable dtp system, line-spacing control method, line-spacing program, and recording medium where the same program is recorded | |
Dunlop et al. | Qwerth: an optimized semi-ambiguous keyboard design | |
JP2009059159A (en) | Information processor, information processing method and program | |
JP2015057707A (en) | Japanese input method for automatically correcting input error on the basis of backspace key, input system, computer program, and recording medium | |
CN107229953A (en) | A kind of broken document joining method based on DFS with improvement central cluster method | |
US7715631B2 (en) | Method and apparatus for extracting feature information, and computer product | |
JP2006099423A (en) | Text mining server and program | |
CN106293368A (en) | A kind of data processing method and electronic equipment | |
CN110489933B (en) | Method and system for generating planar design framework | |
Rodrigues et al. | Improving text entry performance on tablet devices |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |