CN102402298A - Pinyin input method and user word adding method and system of same - Google Patents

Pinyin input method and user word adding method and system of same Download PDF

Info

Publication number
CN102402298A
CN102402298A CN2010102871333A CN201010287133A CN102402298A CN 102402298 A CN102402298 A CN 102402298A CN 2010102871333 A CN2010102871333 A CN 2010102871333A CN 201010287133 A CN201010287133 A CN 201010287133A CN 102402298 A CN102402298 A CN 102402298A
Authority
CN
China
Prior art keywords
user
character string
input
words
dictionary
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN2010102871333A
Other languages
Chinese (zh)
Inventor
林吓洪
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shenzhen Shiji Guangsu Information Technology Co Ltd
Original Assignee
Tencent Technology Shenzhen Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Tencent Technology Shenzhen Co Ltd filed Critical Tencent Technology Shenzhen Co Ltd
Priority to CN2010102871333A priority Critical patent/CN102402298A/en
Publication of CN102402298A publication Critical patent/CN102402298A/en
Pending legal-status Critical Current

Links

Images

Abstract

The invention provides a Pinyin input method and a user word adding method and a system of same, which belong to the field of input methods. The method comprises the steps that: a wrong string input by a user through the pinyin input method is obtained; a word selected by the user when the user inputs a correct string is obtained; and the input wrong string and the word are correspondingly stored in a word bank. According to the embodiment of the invention, the wrong string input by the user is stored, and the wrong string and the correct word input by the user are correspondingly stored in the word bank, so that the pinyin input method does not need to preset a plurality of vague words and error-prone words, the space occupied by the word bank of the pinyin input method is reduced, and the search efficiency when the user inputs is improved. Simultaneously, personal word banks can be generated aiming at the user habits of different users, which well accords with the personalized use habits of the users.

Description

The user's speech adding method and the system of a kind of spelling input method and spelling input method
Technical field
The present invention relates to input method field, the user's speech adding method and the system of particularly a kind of spelling input method and spelling input method.
Background technology
In order to realize the Chinese input, all need pass through the character string (pinyin string that English character string form) of Input Software, and obtain the Chinese written language that is complementary with it in the prior art with the match user input.And in the input method, spelling input method have input mode flexibly, need not to write down root, the advantage such as fast of crossing the threshold, be widely used now.But spelling input method has its shortcoming equally, that is: Chinese region is extensive, and dialect is numerous, therefore when input, when regular meeting causes input owing to phonetic is inaccurate, repeatedly attempts just finding own required words.
For example: " template " corresponding correct phonetic should be mu ban, and a lot of user can read the ban into mo with it, can cause the user when input, to waste time and energy like this.For this reason; A lot of spelling input methods are provided with fuzzy this input; That is: the phonetic inputed by mistake of pre-estimation user easier; And the phonetic of obscuring easily in the dialect (for example a lot of regional z and zh, c and ch, s and sh are regardless of), and will these a plurality of correct and wrong character strings all with the words corresponding stored in preset character word stock.So no matter the character string of user input is correct or wrong, can this words be shown to the user.
In realizing process of the present invention, the inventor finds that there is following problem at least in prior art:
Though to solving the problem of the inaccurate user of a part of phonetic when importing, this mode can only be directed against ubiquitous few cases, and can't customize corresponding character word stock to the use habit of different user in the prior art.Can't be applicable to the user of different regions, different use habit, different pronunciation customs like this, therefore poor to the help property of user's input.
Summary of the invention
In order to solve the use habit that character word stock of the prior art can only can't be directed against different user to common situation, the embodiment of the invention has proposed the user's speech adding method and the system of a kind of spelling input method and spelling input method.Said technical scheme is following:
The embodiment of the invention has proposed a kind of user's speech adding method of spelling input method, comprising:
The character string of input error when obtaining the user through the spelling input method input;
Obtain the words of selecting when the user imports correct character string;
With the character string of said input error and said words corresponding stored in dictionary.
Preferred as technique scheme, the character string of input error comprised when the said user of obtaining imported through spelling input method:
Read the daily record of presetting, obtain all character strings of user's input in the daily record;
Obtain all character strings of importing between two words that the user selects in the daily record, and obtain all deletion actions wherein; And with this deletion action as separation, obtain each section character string wherein.
Preferred as technique scheme, said method also comprises:
Judge whether said each character string has identical record in said dictionary, if then ignore this character string.
Preferred as technique scheme, said method also comprises:
Judge the access times of the character string in the said dictionary, when access times are lower than predetermined threshold value, with said character string deletion.
The embodiment of the invention has also proposed a kind of user's speech add-on system of spelling input method, comprising:
Acquisition module, the character string of input error when being used to obtain the user through the spelling input method input;
Logging modle is used to obtain the words of selecting when the user imports correct character string;
The Word library updating module is used for character string and said words corresponding stored with said input error.
Preferred as technique scheme, said acquisition module comprises:
Log unit is used to read preset daily record, obtains all character strings of user's input in the daily record;
Split cells is used for obtaining all character strings of importing between two words of daily record user selection, to obtain all deletion actions wherein; And with this deletion action as separation, obtain each section character string wherein.
Preferred as technique scheme, said acquisition module also comprises:
Matching unit is used for judging whether said each character string has identical record at said dictionary, if then ignore this character string.
Preferred as technique scheme, said system also comprises:
Removing module is used for obtaining the number of times that character string that said dictionary adds and corresponding words use; When said number of times is lower than predetermined threshold value, with said character string deletion.
The embodiment of the invention has also proposed a kind of spelling input method, comprising: load module, display module, dictionary; Also comprise:
Acquisition module, the character string of input error when being used to obtain the user through the spelling input method input;
Logging modle is used to obtain the words of selecting when the user imports correct character string;
The Word library updating module is used for character string and said words corresponding stored with said input error.
Preferred as technique scheme, said acquisition module comprises:
Log unit is used to read preset daily record, obtains all character strings of user's input in the daily record;
Split cells is used for obtaining all character strings of importing between two words of daily record user selection, to obtain all deletion actions wherein; And with this deletion action as separation, obtain each section character string wherein.
Preferred as technique scheme, said acquisition module also comprises:
Matching unit is used for judging whether said each character string has identical record at said dictionary, if then ignore this character string.
Preferred as technique scheme, said spelling input method also comprises:
Removing module is used for obtaining the number of times that character string that said dictionary adds and corresponding words use; When said number of times is lower than predetermined threshold value, with said character string deletion.
The beneficial effect of the technical scheme that the embodiment of the invention provides is: the embodiment of the invention stores through the character string with user's input error, and itself and user are imported correct words corresponding stored in dictionary.Can make spelling input method need not to be provided with in advance a plurality of fuzzy words and fallibility speech like this, to reduce the shared space of dictionary of spelling input method, effectiveness of retrieval when improving input.Simultaneously, can generate individual's dictionary again to the use habit of different user, with the individual character use habit of more being close to the users.
Description of drawings
In order to be illustrated more clearly in the technical scheme of the embodiment of the invention; To do one to the accompanying drawing that uses among the embodiment below introduces simply; Obviously, below listed accompanying drawing only be some embodiments of the present invention, for those of ordinary skills; Under the prerequisite of not paying creative work, can also obtain other accompanying drawing according to these accompanying drawings.
Fig. 1 is the schematic flow sheet of first embodiment of the invention;
Fig. 2 is the schematic flow sheet of the content obtaining the user in the second embodiment of the invention and duplicate;
Fig. 3 is the structural representation of third embodiment of the invention;
Fig. 4 is the structural representation of fourth embodiment of the invention.
Fig. 5 is the structural representation of fifth embodiment of the invention
Fig. 6 is the structural representation of sixth embodiment of the invention.
Embodiment
For making the object of the invention, technical scheme and advantage clearer, will combine accompanying drawing that embodiment of the present invention is done to describe in detail further below.
The mentality of designing of the embodiment of the invention is: the user if do not have required words in the prepare word that the character string mistake of input causes showing, can delete the character string of input, and re-enter other character strings, and select required words when input.For example: " Mongolian oak " word (hu) much human in Cai Zhiheng " mistletoe " at the beginning all mistake be combined into " jie ".The user generally can do following deletion and rewrite behavior: input jie->can not find " Mongolian oak "-deletion jie-in the candidate>input hu->select target word " Mongolian oak ".Be exactly to have utilized this process to upgrade user's dictionary in the embodiment of the invention, that is: character string of recording user deletion, and the character string that will delete and correct target word are mapped and store in the dictionary.The dictionary that generates like this is more near user's use habit,
Below through embodiment the present invention is further explained.
Embodiment 1
First embodiment of the invention has proposed a kind of user's speech adding method of spelling input method, and its flow process is as shown in Figure 1, comprising:
The character string of step 101, input error when obtaining the user through the spelling input method input;
Step 102, obtain the words of selecting when the user imports correct character string;
Step 103, with the character string of said input error and said words corresponding stored in dictionary.
The embodiment of the invention stores through the character string with user's input error, and itself and user are imported correct words corresponding stored in dictionary.Can make spelling input method need not to be provided with in advance a plurality of fuzzy words and fallibility speech like this, to reduce the shared space of dictionary of spelling input method, effectiveness of retrieval when improving input.Simultaneously, can generate individual's dictionary again to the use habit of different user, with the individual character use habit of more being close to the users.
Embodiment 2
Second embodiment of the invention has proposed a kind of user's speech adding method of spelling input method, is on the first embodiment basis, to improve, and its flow process is as shown in Figure 2, comprising:
The character string of step 201, input error when obtaining the user through the spelling input method input.Wherein, the character string of input error can obtain in the following manner:
Character string when obtaining the user, and the words of selecting through the spelling input method input.Can obtain all character strings between the words that two users select like this.If comprise user's deletion action in this character string, then can think the character string that has comprised user's input error in this character string.
In one embodiment of the invention, a daily record can be set in spelling input method, with the character string of recording user input.For example: when this spelling input method starts, start the process of a supervisory user keyboard input simultaneously.This process is stored user's all character strings through the keyboard input in daily record.
This be since the user through spelling input method when input, if because when not having the required words of user in the inaccurate words that causes demonstrating of character string of input, the character string that the delete key deletion has been imported.The user delete this character string method can for:
When the user imports one section character string, for example user input " ze ", demonstration be " thief ", and the words of the actual required selection of user is " this ", then the user can import " ze " through the delete key deletion, or deletion " e " wherein only.Because each button of keyboard is all to there being the key assignments of standard; System translates into electric signal that system can set with the user for the operation of keyboard through this key assignments, so can adopt mode of the prior art to obtain the button of user's input in the embodiment of the invention.
Wherein, this delete key can be the esc key on the keyboard or delete key or backspace key.That is: after the user imports one section character string, find that this section character string of input is wrong, can be by all character strings of esc key cancellation input; Can delete one section character string wherein through the delete key, or through backspace key deletion one section character string wherein.
Wherein, if the user has only inputed by mistake once, then the character string of deletion has only one section.For example, the user imports " ze ", when finding input error then, has deleted " e " and has re-entered " he " through the backspace key.The result who then in daily record, writes down is:
ze
←backspace
he
This
Just can confirm that the ze of user's input is the character string of input error this moment, its pairing words should be " this ".
Because possibly there is the situation of repeatedly input error in the user, so just need all select words to be mapped with the user respectively the character string that the user repeatedly deletes, and be stored in the dictionary.For example, continue and go up example, after the user has imported " this " word, import words " Mongolian oak " again.Input error is found in input of character string " jie " back, has deleted " jie " through the backspace key; Re-enter " xie " back then and find still to be wrong, deleted " xie " through the backspace key; Re-enter " hu " and select required words " Mongolian oak ".The result who then in daily record, writes down is:
ze
←backspace
he
This
jie
←backspace
xie
←backspace
hu
Mongolian oak
Then step 201 can be specially:
Step 2011, read the daily record of presetting, obtain all character strings of user's input in the daily record;
Step 2012, obtain all character strings of importing between two words that user in the daily record selects, to obtain all deletion actions wherein; And with this deletion action as separation, obtain each section character string wherein;
Example with the front is an example, has comprised between two words " this " and " Mongolian oak " two deletion actions all character strings being divided into three sections, is respectively " jie ", " xie ", " hu ".So just can be with all corresponding respectively " Mongolian oak " word of these three sections character strings.
Further, the final stage character string of in step 2032, obtaining " hu " is the correct character string of last input, therefore can this section character string be ignored.
Step 2013, judge whether said each character string has identical record in said dictionary, if then ignore this character string.
Can prevent like this to be stored many identical records in the dictionary, cause dictionary too too fat to move, to improve the dictionary effectiveness of retrieval.
Certainly, above-mentioned steps just illustrates, and can also adopt other modes to obtain the multistage character string of user's deletion in the embodiment of the invention, for example: when the user deletes, insert a mark, so also can very easily the multistage character string be distinguished.
Step 202, obtain the words of selecting when the user imports correct character string.
Same above example is the example explanation, and " Mongolian oak " wherein is the correct words of input.
Step 203, with the character string of said user deletion, and the words corresponding stored that the user selects is in dictionary.This dictionary can be the user thesaurus of spelling input method, or an independent fuzzy word dictionary of spelling input method, also can be other dictionaries in the operating system, and the embodiment of the invention is not made qualification to this.
When being a plurality of to the character string of the user described in the step 201 deletion, then step 203 can be specially:
Step 203 ', said each section character string is mapped with the words that said user selects respectively, and store in the dictionary.
That is: can " hu " be ignored after, " jie ", " xie " are mapped with " Mongolian oak " respectively, and are stored in the dictionary.
The foregoing description can be through daily record mode with all character strings storages of user's deletion, and through obtaining the correct words that the user selects, with the character string of these deletions and this words corresponding stored in dictionary.The mode that presets in the user thesaurus storage fuzzy word of the prior art of comparing so both can reduce the size and the user thesaurus of spelling input method, can generate corresponding dictionary according to the use habit of different user again, to improve user's input efficiency.
Further, the foregoing description also comprises:
Step 204, obtain in the said dictionary number of times that the character string of adding according to abovementioned steps 201-203 and corresponding words use; When said number of times is lower than predetermined threshold value, with said character string deletion.
This is because the user character string occurs inputing by mistake to cause this character string to be added in the dictionary accidentally, can cause dictionary too too fat to move like this.Therefore through the mode that long-time obsolete character string is deleted dictionary is upgraded in the embodiment of the invention, renewal can be every interval one Preset Time.
Owing to increased deleting mechanism, the mode of a large amount of fuzzy words of indication can greatly reduce the storage space that input method takies in the prior art of comparing in the embodiment of the invention, and the system overhead when reducing the retrieval user dictionary simultaneously also improves recall precision.
Embodiment 3
Third embodiment of the invention has proposed a kind of user's speech add-on system of spelling input method, and its structure is as shown in Figure 2, comprising:
Acquisition module 1, the character string of input error when being used to obtain the user through the spelling input method input;
Logging modle 2 is used to obtain the words of selecting when the user imports correct character string;
Word library updating module 3 is used for character string and said words corresponding stored with said input error.
The embodiment of the invention stores through the character string with user's input error, and itself and user are imported correct words corresponding stored in dictionary.Can make spelling input method need not to be provided with in advance a plurality of fuzzy words and fallibility speech like this, to reduce the shared space of dictionary of spelling input method, effectiveness of retrieval when improving input.Simultaneously, can generate individual's dictionary again to the use habit of different user, with the individual character use habit of more being close to the users.
Embodiment 4
Fourth embodiment of the invention has proposed a kind of user's speech add-on system of spelling input method, is on the 3rd embodiment basis, to improve, and its structure is as shown in Figure 4, comprising:
Acquisition module 1, the character string of input error when being used to obtain the user through the spelling input method input.Wherein, the character string of input error can obtain in the following manner:
Character string when obtaining the user, and the words of selecting through the spelling input method input.Can obtain all character strings between the words that two users select like this.If comprise user's deletion action in this character string, then can think the character string that has comprised user's input error in this character string.
In one embodiment of the invention, a daily record can be set in spelling input method, with the character string of recording user input.For example: when this spelling input method starts, start the process of a supervisory user keyboard input simultaneously.This process is stored user's all character strings through the keyboard input in daily record.
This be since the user through spelling input method when input, if because when not having the required words of user in the inaccurate words that causes demonstrating of character string of input, the character string that the delete key deletion has been imported.The user delete this character string method can for:
When the user imports one section character string, for example user input " ze ", demonstration be " thief ", and the words of the actual required selection of user is " this ", then the user can import " ze " through the delete key deletion, or deletion " e " wherein only.Because each button of keyboard is all to there being the key assignments of standard; System translates into electric signal that system can set with the user for the operation of keyboard through this key assignments, so can adopt mode of the prior art to obtain the button of user's input in the embodiment of the invention.
Wherein, this delete key can be the esc key on the keyboard or delete key or backspace key.That is: after the user imports one section character string, find that this section character string of input is wrong, can be by all character strings of esc key cancellation input; Can delete one section character string wherein through the delete key, or through backspace key deletion one section character string wherein.
Wherein, if the user has only inputed by mistake once, then the character string of deletion has only one section.For example, the user imports " ze ", when finding input error then, has deleted " e " and has re-entered " he " through the backspace key.The result who then in daily record, writes down is:
ze
←backspace
he
This
Just can confirm that the ze of user's input is the character string of input error this moment, its pairing words should be " this ".
Because possibly there is the situation of repeatedly input error in the user, so just need all select words to be mapped with the user respectively the character string that the user repeatedly deletes, and be stored in the dictionary.For example, continue and go up example, after the user has imported " this " word, import words " Mongolian oak " again.Input error is found in input of character string " jie " back, has deleted " jie " through the backspace key; Re-enter " xie " back then and find still to be wrong, deleted " xie " through the backspace key; Re-enter " hu " and select required words " Mongolian oak ".The result who then in daily record, writes down is:
ze
←backspace
he
This
jie
←backspace
xie
←backspace
hu
Mongolian oak
Then acquisition module 1 can be specially:
Log unit 11 is used to read preset daily record, obtains all character strings of user's input in the daily record;
Split cells 12 is used for obtaining all character strings of importing between two words of daily record user selection, to obtain all deletion actions wherein; And with this deletion action as separation, obtain each section character string wherein;
Example with the front is an example, has comprised between two words " this " and " Mongolian oak " two deletion actions all character strings being divided into three sections, is respectively " jie ", " xie ", " hu ".So just can be with all corresponding respectively " Mongolian oak " word of these three sections character strings.
Further, the final stage character string " hu " that split cells 12 obtains is the correct character string of last input, therefore can this section character string be ignored.
Matching unit 13, judge whether said each character string has identical record in said dictionary, if then ignore this character string.
Can prevent like this to be stored many identical records in the dictionary, cause dictionary too too fat to move, to improve the dictionary effectiveness of retrieval.
Certainly, said system just illustrates, and can also adopt other modes to obtain the multistage character string of user's deletion in the embodiment of the invention, for example: when the user deletes, insert a mark, so also can very easily the multistage character string be distinguished.
Logging modle 2 is used to obtain the words of selecting when the user imports correct character string.
Same above example is the example explanation, and " Mongolian oak " wherein is the correct words of input.
Word library updating module 3 be used for the character string with said user's deletion, and the words corresponding stored that the user selects is in dictionary.This dictionary can be the user thesaurus of spelling input method, or an independent fuzzy word dictionary of spelling input method, also can be other dictionaries in the operating system, and the embodiment of the invention is not made qualification to this.
When being a plurality of to the character string of described user deletion, then Word library updating module 3 also is used for said each section character string is mapped with the words that said user selects respectively, and stores in the dictionary.
That is: can " hu " be ignored after, " jie ", " xie " are mapped with " Mongolian oak " respectively, and are stored in the dictionary.
The foregoing description can be through daily record mode with all character strings storages of user's deletion, and through obtaining the correct words that the user selects, with the character string of these deletions and this words corresponding stored in dictionary.The mode that presets in the user thesaurus storage fuzzy word of the prior art of comparing so both can reduce the size and the user thesaurus of spelling input method, can generate corresponding dictionary according to the use habit of different user again, to improve user's input efficiency.
Further, the foregoing description also comprises:
Removing module 4 is used for obtaining the number of times that character string that said dictionary adds and corresponding words use; When said number of times is lower than predetermined threshold value, with said character string deletion.
This is because the user character string occurs inputing by mistake to cause this character string to be added in the dictionary accidentally, can cause dictionary too too fat to move like this.Therefore through the mode that long-time obsolete character string is deleted dictionary is upgraded in the embodiment of the invention, renewal can be every interval one Preset Time.
Owing to increased deleting mechanism, the mode of a large amount of fuzzy words of indication can greatly reduce the storage space that input method takies in the prior art of comparing in the embodiment of the invention, and the system overhead when reducing the retrieval user dictionary simultaneously also improves recall precision.
The system of the embodiment of the invention third and fourth embodiment, identical with the design of the method for aforesaid first and second embodiment with principle, so in third and fourth embodiment to first and second embodiment in identical part repeat no more.
Embodiment 5
Fifth embodiment of the invention has proposed a kind of spelling input method, and its structure is as shown in Figure 5, comprising: load module 5, display module 6, dictionary 7 also comprise:
Acquisition module 1, the character string of input error when being used to obtain the user through the spelling input method input;
Logging modle 2 is used to obtain the words of selecting when the user imports correct character string;
Word library updating module 3 is used for character string and said words corresponding stored with said input error.
The embodiment of the invention stores through the character string with user's input error, and itself and user are imported correct words corresponding stored in dictionary.Can make spelling input method need not to be provided with in advance a plurality of fuzzy words and fallibility speech like this, to reduce the shared space of dictionary of spelling input method, effectiveness of retrieval when improving input.Simultaneously, can generate individual's dictionary again to the use habit of different user, with the individual character use habit of more being close to the users.
Embodiment 6
Sixth embodiment of the invention has proposed a kind of user's speech add-on system of spelling input method, is on the 5th embodiment basis, to improve, and its structure is as shown in Figure 6, comprising: load module 5, display module 6, dictionary 7 also comprise:
Acquisition module 1, the character string of input error when being used to obtain the user through the spelling input method input.Wherein, the character string of input error can obtain in the following manner:
Character string when obtaining the user, and the words of selecting through the spelling input method input.Can obtain all character strings between the words that two users select like this.If comprise user's deletion action in this character string, then can think the character string that has comprised user's input error in this character string.
In one embodiment of the invention, a daily record can be set in spelling input method, with the character string of recording user input.For example: when this spelling input method starts, start the process of a supervisory user keyboard input simultaneously.This process is stored user's all character strings through the keyboard input in daily record.
This be since the user through spelling input method when input, if because when not having the required words of user in the inaccurate words that causes demonstrating of character string of input, the character string that the delete key deletion has been imported.The user delete this character string method can for:
When the user imports one section character string, for example user input " ze ", demonstration be " thief ", and the words of the actual required selection of user is " this ", then the user can import " ze " through the delete key deletion, or deletion " e " wherein only.Because each button of keyboard is all to there being the key assignments of standard; System translates into electric signal that system can set with the user for the operation of keyboard through this key assignments, so can adopt mode of the prior art to obtain the button of user's input in the embodiment of the invention.
Wherein, this delete key can be the esc key on the keyboard or delete key or backspace key.That is: after the user imports one section character string, find that this section character string of input is wrong, can be by all character strings of esc key cancellation input; Can delete one section character string wherein through the delete key, or through backspace key deletion one section character string wherein.
Wherein, if the user has only inputed by mistake once, then the character string of deletion has only one section.For example, the user imports " ze ", when finding input error then, has deleted " e " and has re-entered " he " through the backspace key.The result who then in daily record, writes down is:
ze
←backspace
he
This
Just can confirm that the ze of user's input is the character string of input error this moment, its pairing words should be " this ".
Because possibly there is the situation of repeatedly input error in the user, so just need all select words to be mapped with the user respectively the character string that the user repeatedly deletes, and be stored in the dictionary.For example, continue and go up example, after the user has imported " this " word, import words " Mongolian oak " again.Input error is found in input of character string " jie " back, has deleted " jie " through the backspace key; Re-enter " xie " back then and find still to be wrong, deleted " xie " through the backspace key; Re-enter " hu " and select required words " Mongolian oak ".The result who then in daily record, writes down is:
ze
←backspace
he
This
jie
←backspace
xie
←backspace
hu
Mongolian oak
Then acquisition module 1 can be specially:
Log unit 11 is used to read preset daily record, obtains all character strings of user's input in the daily record;
Split cells 12 is used for obtaining all character strings of importing between two words of daily record user selection, to obtain all deletion actions wherein; And with this deletion action as separation, obtain each section character string wherein;
Example with the front is an example, has comprised between two words " this " and " Mongolian oak " two deletion actions all character strings being divided into three sections, is respectively " jie ", " xie ", " hu ".So just can be with all corresponding respectively " Mongolian oak " word of these three sections character strings.
Further, the final stage character string " hu " that split cells 12 obtains is the correct character string of last input, therefore can this section character string be ignored.
Matching unit 13, judge whether said each character string has identical record in said dictionary, if then ignore this character string.
Can prevent like this to be stored many identical records in the dictionary, cause dictionary too too fat to move, to improve the dictionary effectiveness of retrieval.
Certainly, said system just illustrates, and can also adopt other modes to obtain the multistage character string of user's deletion in the embodiment of the invention, for example: when the user deletes, insert a mark, so also can very easily the multistage character string be distinguished.
Logging modle 2 is used to obtain the words of selecting when the user imports correct character string.
Same above example is the example explanation, and " Mongolian oak " wherein is the correct words of input.
Word library updating module 3 be used for the character string with said user's deletion, and the words corresponding stored that the user selects is in dictionary.This dictionary can be the user thesaurus of spelling input method, or an independent fuzzy word dictionary of spelling input method, also can be other dictionaries in the operating system, and the embodiment of the invention is not made qualification to this.
When being a plurality of to the character string of described user deletion, then Word library updating module 3 also is used for said each section character string is mapped with the words that said user selects respectively, and stores in the dictionary.
That is: can " hu " be ignored after, " jie ", " xie " are mapped with " Mongolian oak " respectively, and are stored in the dictionary.
The foregoing description can be through daily record mode with all character strings storages of user's deletion, and through obtaining the correct words that the user selects, with the character string of these deletions and this words corresponding stored in dictionary.The mode that presets in the user thesaurus storage fuzzy word of the prior art of comparing so both can reduce the size and the user thesaurus of spelling input method, can generate corresponding dictionary according to the use habit of different user again, to improve user's input efficiency.
Further, the foregoing description also comprises:
Removing module 4 is used for obtaining the number of times that character string that said dictionary adds and corresponding words use; When said number of times is lower than predetermined threshold value, with said character string deletion.
This is because the user character string occurs inputing by mistake to cause this character string to be added in the dictionary accidentally, can cause dictionary too too fat to move like this.Therefore through the mode that long-time obsolete character string is deleted dictionary is upgraded in the embodiment of the invention, renewal can be every interval one Preset Time.
Owing to increased deleting mechanism, the mode of a large amount of fuzzy words of indication can greatly reduce the storage space that input method takies in the prior art of comparing in the embodiment of the invention, and the system overhead when reducing the retrieval user dictionary simultaneously also improves recall precision.
The system of the embodiment of the invention the 5th and the 6th embodiment; Be to use the spelling input method of system of method and third and fourth embodiment of aforesaid first and second embodiment; Its design is identical with first, second, third, fourth embodiment with principle, and therefore wherein identical part repeats no more.
If the said integrated unit of the embodiment of the invention is realized with the form of SFU software functional unit and during as independently production marketing or use, also can be stored in the computer read/write memory medium.Based on such understanding; The part that technical scheme of the present invention contributes to prior art in essence in other words can be come out with the embodied of software product; This computer software product is stored in the storage medium; Comprise some instructions with so that computer equipment (can be personal computer, server, the perhaps network equipment etc.) carry out all or part of of the said method of each embodiment of the present invention.And aforesaid storage medium comprises: various media that can be program code stored such as USB flash disk, portable hard drive, ROM (read-only memory) (ROM, Read-Only Memory), RAS (RAM, Random Access Memory), magnetic disc or CD.
More than be merely preferred embodiment of the present invention, or not all within spirit of the present invention and principle in order to restriction the present invention, any modification of being done, be equal to replacement, improvement etc., all should be included within protection scope of the present invention.

Claims (12)

1. user's speech adding method of a spelling input method is characterized in that, comprising:
The character string of input error when obtaining the user through the spelling input method input;
Obtain the words of selecting when the user imports correct character string;
With the character string of said input error and said words corresponding stored in dictionary.
2. user's speech adding method of spelling input method according to claim 1 is characterized in that, the character string of input error comprised when the said user of obtaining imported through spelling input method:
Read the daily record of presetting, obtain all character strings of user's input in the daily record;
Obtain all character strings of importing between two words that the user selects in the daily record, and obtain all deletion actions wherein; And with this deletion action as separation, obtain each section character string wherein.
3. user's speech adding method of spelling input method according to claim 2 is characterized in that said method also comprises:
Judge whether said each character string has identical record in said dictionary, if then ignore this character string.
4. according to user's speech adding method of claim 1 or 2 or 3 described spelling input methods, it is characterized in that said method also comprises:
Judge the access times of the character string in the said dictionary, when access times are lower than predetermined threshold value, with said character string deletion.
5. user's speech add-on system of a spelling input method is characterized in that, comprising:
Acquisition module, the character string of input error when being used to obtain the user through the spelling input method input;
Logging modle is used to obtain the words of selecting when the user imports correct character string;
The Word library updating module is used for character string and said words corresponding stored with said input error.
6. user's speech add-on system of spelling input method according to claim 5 is characterized in that said acquisition module comprises:
Log unit is used to read preset daily record, obtains all character strings of user's input in the daily record;
Split cells is used for obtaining all character strings of importing between two words of daily record user selection, to obtain all deletion actions wherein; And with this deletion action as separation, obtain each section character string wherein.
7. user's speech add-on system of spelling input method according to claim 6 is characterized in that said acquisition module also comprises:
Matching unit is used for judging whether said each character string has identical record at said dictionary, if then ignore this character string.
8. according to user's speech add-on system of claim 5 or 6 or 7 described spelling input methods, it is characterized in that said system also comprises:
Removing module is used for obtaining the number of times that character string that said dictionary adds and corresponding words use; When said number of times is lower than predetermined threshold value, with said character string deletion.
9. spelling input method comprises: load module, display module, dictionary, it is characterized in that, and also comprise:
Acquisition module, the character string of input error when being used to obtain the user through the spelling input method input;
Logging modle is used to obtain the words of selecting when the user imports correct character string;
The Word library updating module is used for character string and said words corresponding stored with said input error.
10. spelling input method according to claim 9 is characterized in that, said acquisition module comprises:
Log unit is used to read preset daily record, obtains all character strings of user's input in the daily record;
Split cells is used for obtaining all character strings of importing between two words of daily record user selection, to obtain all deletion actions wherein; And with this deletion action as separation, obtain each section character string wherein.
11. spelling input method according to claim 10 is characterized in that, said acquisition module also comprises:
Matching unit is used for judging whether said each character string has identical record at said dictionary, if then ignore this character string.
12., it is characterized in that said spelling input method also comprises according to claim 9 or 10 or 11 described spelling input methods:
Removing module is used for obtaining the number of times that character string that said dictionary adds and corresponding words use; When said number of times is lower than predetermined threshold value, with said character string deletion.
CN2010102871333A 2010-09-16 2010-09-16 Pinyin input method and user word adding method and system of same Pending CN102402298A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN2010102871333A CN102402298A (en) 2010-09-16 2010-09-16 Pinyin input method and user word adding method and system of same

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN2010102871333A CN102402298A (en) 2010-09-16 2010-09-16 Pinyin input method and user word adding method and system of same

Publications (1)

Publication Number Publication Date
CN102402298A true CN102402298A (en) 2012-04-04

Family

ID=45884582

Family Applications (1)

Application Number Title Priority Date Filing Date
CN2010102871333A Pending CN102402298A (en) 2010-09-16 2010-09-16 Pinyin input method and user word adding method and system of same

Country Status (1)

Country Link
CN (1) CN102402298A (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103616962A (en) * 2013-12-13 2014-03-05 联想(北京)有限公司 Information processing method and device
CN103903615A (en) * 2014-03-10 2014-07-02 联想(北京)有限公司 Information processing method and electronic device
CN107688400A (en) * 2016-08-05 2018-02-13 北京搜狗科技发展有限公司 It is a kind of to input error correction method and device, a kind of device for being used to input error correction
CN109308126A (en) * 2017-07-27 2019-02-05 北京搜狗科技发展有限公司 A kind of candidate word methods of exhibiting and device
CN110874146A (en) * 2018-08-30 2020-03-10 北京搜狗科技发展有限公司 Input method and device and electronic equipment

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101013443A (en) * 2007-02-13 2007-08-08 北京搜狗科技发展有限公司 Intelligent word input method and input method system and updating method thereof
CN101030157A (en) * 2007-04-20 2007-09-05 北京搜狗科技发展有限公司 Method and system for updating user vocabulary synchronouslly
CN101241514A (en) * 2008-03-21 2008-08-13 北京搜狗科技发展有限公司 Method for creating error-correcting database, automatic error correcting method and system
CN101276245A (en) * 2008-04-16 2008-10-01 北京搜狗科技发展有限公司 Reminding method and system for coding to correct error in input process
CN101452461A (en) * 2007-12-06 2009-06-10 英业达股份有限公司 Lexical learning system and method based on enquiry frequency

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101013443A (en) * 2007-02-13 2007-08-08 北京搜狗科技发展有限公司 Intelligent word input method and input method system and updating method thereof
CN101030157A (en) * 2007-04-20 2007-09-05 北京搜狗科技发展有限公司 Method and system for updating user vocabulary synchronouslly
CN101452461A (en) * 2007-12-06 2009-06-10 英业达股份有限公司 Lexical learning system and method based on enquiry frequency
CN101241514A (en) * 2008-03-21 2008-08-13 北京搜狗科技发展有限公司 Method for creating error-correcting database, automatic error correcting method and system
CN101276245A (en) * 2008-04-16 2008-10-01 北京搜狗科技发展有限公司 Reminding method and system for coding to correct error in input process

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
王劲松: "《中文之星2.0+ for windows 95操作指南》", 28 February 1997, 北京航空航天大学出版社 *

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103616962A (en) * 2013-12-13 2014-03-05 联想(北京)有限公司 Information processing method and device
CN103903615A (en) * 2014-03-10 2014-07-02 联想(北京)有限公司 Information processing method and electronic device
CN107688400A (en) * 2016-08-05 2018-02-13 北京搜狗科技发展有限公司 It is a kind of to input error correction method and device, a kind of device for being used to input error correction
CN107688400B (en) * 2016-08-05 2021-11-30 北京搜狗科技发展有限公司 Input error correction method and device for input error correction
CN109308126A (en) * 2017-07-27 2019-02-05 北京搜狗科技发展有限公司 A kind of candidate word methods of exhibiting and device
CN110874146A (en) * 2018-08-30 2020-03-10 北京搜狗科技发展有限公司 Input method and device and electronic equipment

Similar Documents

Publication Publication Date Title
CN101622616B (en) Shared language model
US7979268B2 (en) String matching method and system and computer-readable recording medium storing the string matching method
US20070156404A1 (en) String matching method and system using phonetic symbols and computer-readable recording medium storing computer program for executing the string matching method
US10838996B2 (en) Document revision change summarization
CN101641691A (en) Integrated pinyin and stroke input
CN103026318A (en) Input method editor
CN105378606A (en) Alternative hypothesis error correction for gesture typing
CN101556508A (en) Candidate phrase generating method, equipment, system and device in input method
CN101950285A (en) Utilize native language pronunciation string converting system and the method thereof of statistical method to Chinese character
CN102725790A (en) Recognition dictionary creation device and speech recognition device
KR20070087399A (en) Method and apparatus for searching media file through extracting partial search word
JP2010505208A (en) Generation method of typing candidates for improving typing efficiency
KR101797125B1 (en) Multi-lingual business indicia curation and transliteration synthesis
CN104916177B (en) The data output method of electronic equipment and electronic equipment
CN103942223A (en) Method and system for conducting online error correction on language model
CN102402298A (en) Pinyin input method and user word adding method and system of same
CN101546228A (en) Input method and device for realizing English reminding
CN103049458A (en) Method and system for revising user word bank
CN104424180A (en) Text input method and equipment
CN102314412A (en) Method and system for recording contextual information and tracing new word context
KR102639979B1 (en) Keyword extraction apparatus, control method thereof and keyword extraction program
CN104239289A (en) Syllabication method and syllabication device
CN1704882A (en) Asian language input by using keyboard
CA2523992A1 (en) Automatic segmentation of texts comprising chunks without separators
CN101622617A (en) Stroke number input

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
ASS Succession or assignment of patent right

Owner name: SHENZHEN SHIJI LIGHT SPEED INFORMATION TECHNOLOGY

Free format text: FORMER OWNER: TENGXUN SCI-TECH (SHENZHEN) CO., LTD.

Effective date: 20131104

C41 Transfer of patent application or patent right or utility model
COR Change of bibliographic data

Free format text: CORRECT: ADDRESS; FROM: 518000 SHENZHEN, GUANGDONG PROVINCE TO: 518057 SHENZHEN, GUANGDONG PROVINCE

TA01 Transfer of patent application right

Effective date of registration: 20131104

Address after: A Tencent Building in Shenzhen Nanshan District City, Guangdong streets in Guangdong province science and technology 518057 16

Applicant after: Shenzhen Shiji Guangsu Information Technology Co., Ltd.

Address before: 518000 Guangdong city of Shenzhen province Futian District SEG Science Park 2 East Room 403

Applicant before: Tencent Technology (Shenzhen) Co., Ltd.

RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20120404