CN102637166A - Method and device for optimizing word order of input method and system of input method - Google Patents

Method and device for optimizing word order of input method and system of input method Download PDF

Info

Publication number
CN102637166A
CN102637166A CN2012100701089A CN201210070108A CN102637166A CN 102637166 A CN102637166 A CN 102637166A CN 2012100701089 A CN2012100701089 A CN 2012100701089A CN 201210070108 A CN201210070108 A CN 201210070108A CN 102637166 A CN102637166 A CN 102637166A
Authority
CN
China
Prior art keywords
word
input method
vocabulary
input
order
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN2012100701089A
Other languages
Chinese (zh)
Inventor
曾相宗
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Vtron Technologies Ltd
Original Assignee
Vtron Technologies Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Vtron Technologies Ltd filed Critical Vtron Technologies Ltd
Priority to CN2012100701089A priority Critical patent/CN102637166A/en
Publication of CN102637166A publication Critical patent/CN102637166A/en
Pending legal-status Critical Current

Links

Images

Landscapes

  • Machine Translation (AREA)

Abstract

The invention provides a method for optimizing word order of an input method. The method comprises the steps of acquiring text information of input environment of users, dividing the text information into words, computing frequency of each word and optimizing the word order of candidate words of the input method according to the word frequency. The invention also provides the device for optimizing the word order and the system of the input method. By the technology of the invention, close binding of the input method and the current input environment of users is realized to more accurately satisfy the word input need of users. While inputting words, the users can quickly choose the proper words, so that the word input speed is accelerated.

Description

Input method word order optimization method, device and input method system
Technical field
The present invention relates to input method technology, particularly relate to a kind of input method word order optimization method, device and input method system.
Background technology
In the information age now; People are more and more to the dependence of keyboard input; Requirement for input speed also is increasingly high; At present, some input methods can both be carried out record for the frequency of utilization of some vocabulary according to the user, when carrying out the candidate word ordering, will select the higher vocabulary of frequency to come forward position in the past.
But user's use for vocabulary in different input environments all is different; Often appear at when carrying out certain documents editing, required vocabulary is not the higher vocabulary of frequency of utilization in the past, but relevant with the present located input environment; For example; The user is current when handling the document relevant with computing machine, and this moment, user's residing input environment was the Word message relevant with computer realm, if come candidate's vocabulary is sorted by frequency of utilization in the past at this moment; Then the user need do more selection to candidate word and just can choose suitable vocabulary, causes input speed slack-off.
Summary of the invention
Based on this; Be necessary to come candidate's vocabulary is sorted by frequency of utilization in the past to above-mentioned; Then the user need do more selection to candidate word and just can choose suitable vocabulary; Cause the slack-off problem of input speed, a kind of input method word order optimization method, device and input method system are provided.
A kind of input method word order optimization method comprises the steps:
Obtain the Word message of user's input environment;
Said Word message is split into vocabulary, and calculate the word frequency of each vocabulary;
Be optimized according to the word order of said word frequency the candidate word of input method.
A kind of input method word order optimization means comprises:
The input environment acquiring unit is used to obtain the Word message of user's input environment;
The intelligence learning unit is used for said Word message is split into vocabulary, and calculates the word frequency of each vocabulary;
The unit optimized in word order, is used for being optimized according to the word order of said word frequency to the candidate word of input method.
A kind of input method system comprises: like above-mentioned input method word order optimization means.
Above-mentioned input method word order optimization method, device and input method all are condition of different to user's use for vocabulary in different input environments, the Word message of the input environment through obtaining the user; Word message is carried out intelligence learning; Participle also calculates word frequency, is optimized according to the vocabulary of this participle and word frequency thereof the candidate word to input method, has realized combining closely between input method and the current input environment of user; Can satisfy user's literal input demand more exactly; When carrying out the literal input, the user can choose suitable vocabulary apace, has improved the speed of literal input.
Description of drawings
Fig. 1 is the process flow diagram of the embodiment of input method word order optimization method of the present invention;
Fig. 2 is the structural representation of the embodiment of input method word order optimization means of the present invention.
Embodiment
Below in conjunction with accompanying drawing the embodiment of input method word order optimization method of the present invention is described in detail.
As shown in Figure 1, Fig. 1 is the process flow diagram of an embodiment of input method word order optimization method of the present invention, comprises the steps:
S101, obtain the Word message of user's input environment;
In one embodiment, under active user's input environment, obtain the first-class Word message of document, webpage of various forms, for example under the current input environment of user, document D is arranged, then the Word message on the document D is discerned and read.
S102, said Word message is split into vocabulary, and calculate the word frequency of each vocabulary;
In one embodiment, the Word message that is obtained is split as vocabulary commonly used, and calculates each vocabulary and corresponding word frequency thereof; Wherein, Word frequency can be used the number of times direct representation of appearance, like certain vocabulary occurrence number three times, remembers that then word frequency is 3; Further, with vocabulary and word frequency thereof with the stored in form that concerns group in database.
S103, be optimized according to the word order of said word frequency to the candidate word of input method.
In one embodiment, the process of optimization specifically may further comprise the steps:
(1) extraction is gathered with the identical vocabulary and the word frequency compositional optimization thereof of the candidate word of input method from database; For example, the user thinks input characters W, in current input method, keys in coding a, can obtain candidate word sequence q n, from database, take out and sequence q nThe set that is optimized of identical vocabulary and word frequency thereof;
(2) size order according to word frequency sorts to the vocabulary in the above-mentioned optimization set; For example, that will from database, extract and above-mentioned sequence q nIdentical vocabulary, sorting obtains sequence N;
(3) will optimize set merges with the candidate collection that is made up of candidate word; Be about to the sequence q of candidate word nWith sequence N combination, wherein, q nIn word frequency be 0 all, the word frequency among the N is the record data in the database;
(4) be combined the candidate word that the word frequency rearrangement that obtains and concentrate vocabulary obtains to optimize word order; Promptly get q nWith two union of sets collection of N, and recomputate word frequency, to this union by the word frequency size sequence q that is optimized that resequences n', the user promptly can be at sequence of words q n' middle selection vocabulary W, the storehouse that Updates Information then, the word frequency with vocabulary W in database increases by 1.
Below in conjunction with accompanying drawing the embodiment of the corresponding device of input method word order optimization method of the present invention is described in detail.
As shown in Figure 2, Fig. 2 is the structural representation of an embodiment of input method word order optimization means of the present invention, comprising:
The input environment acquiring unit is used to obtain the Word message of user's input environment;
The intelligence learning unit is used for said Word message is split into vocabulary, and calculates the word frequency of each vocabulary;
The unit optimized in word order, is used for being optimized according to the word order of said word frequency to the candidate word of input method.
Preferably, input method word order optimization means of the present invention can also comprise storage unit, is used for said vocabulary and word frequency thereof with the stored in form that concerns group at database.
Preferably, optimize the unit, specifically comprise for said word order:
Set is provided with module, is used for from identical vocabulary and the word frequency compositional optimization set thereof of database extraction with the candidate word of said input method;
Order module is used for sorting according to the vocabulary that the size order of said word frequency is gathered said optimization;
Set merges module, is used for said optimization set is merged with the candidate collection that is made up of said candidate word;
Reordering module, the word frequency rearrangement that is used for said merging is obtained and concentrate vocabulary obtains to optimize the candidate word of word order.
For more clear technology of the present invention, enumerate an application example below and do detailed description.
Like the user at a document relevant of editor with computing machine, under its current input environment, browse other with computing machine document associated and related web page etc.; Suppose the current use spelling input method of user, at this moment, when the user thinks input " computing machine "; Input coding is " jsj " on keyboard, is " construction bureau ", " Technical Board ", " family planning office " " anti-smuggling office " because the user imported more vocabulary in the past ... So, during input this moment " jsj "; Coming top candidate word is above-mentioned vocabulary, and " computing machine " may come the position, back of word order, so the user possibly carry out repeatedly page turning and just can choose vocabulary " computing machine "; And after adopting technology of the present invention, owing to the Word message in the current input environment is carried out intelligence learning, the word frequency of " computing machine " is higher; And through calculating and being stored in the database; At this moment, utilize the data of data-base recording that candidate word is resequenced, then " computing machine " will come the front of word order; The user promptly can choose required vocabulary apace, has improved the input speed of input method.
In addition, input method word order optimisation technique of the present invention support is striden input method and is used, and accomplishes because participle and word frequency are calculated to be based under user's input environment; Therefore, when the user is switched input method, irrelevant with input method coding; Under the input environment same case, just can carry out word order optimization.
Describe in detail in the face of the embodiment of input method system of the present invention down.
A kind of input method system comprises the input method word order optimization means like above-mentioned embodiment; The current input environment of this input method system and user is combined closely, and can satisfy user's literal input demand more exactly.
The above embodiment has only expressed several kinds of embodiments of the present invention, and it describes comparatively concrete and detailed, but can not therefore be interpreted as the restriction to claim of the present invention.Should be pointed out that for the person of ordinary skill of the art under the prerequisite that does not break away from the present invention's design, can also make some distortion and improvement, these all belong to protection scope of the present invention.Therefore, the protection domain of patent of the present invention should be as the criterion with accompanying claims.

Claims (7)

1. an input method word order optimization method is characterized in that, comprises the steps:
Obtain the Word message of user's input environment;
Said Word message is split into vocabulary, and calculate the word frequency of each vocabulary;
Be optimized according to the word order of said word frequency the candidate word of input method.
2. input method word order optimization method according to claim 1 is characterized in that, also comprises: with said vocabulary and word frequency thereof with the stored in form that concerns group in database.
3. input method word order optimization method according to claim 2 is characterized in that, saidly according to said word frequency the step that the word order of the candidate word of input method is optimized is comprised:
Extraction is gathered with the identical vocabulary and the word frequency compositional optimization thereof of the candidate word of said input method from database;
Size order according to said word frequency sorts to the vocabulary in the said optimization set;
Said optimization set is merged with the candidate collection that is made up of said candidate word;
The word frequency rearrangement of vocabulary that said merging is obtained and concentrated obtains to optimize the candidate word of word order.
4. an input method word order optimization means is characterized in that, comprising:
The input environment acquiring unit is used to obtain the Word message of user's input environment;
The intelligence learning unit is used for said Word message is split into vocabulary, and calculates the word frequency of each vocabulary;
The unit optimized in word order, is used for being optimized according to the word order of said word frequency to the candidate word of input method.
5. input method word order optimization means according to claim 4 is characterized in that, also comprises: storage unit is used for said vocabulary and word frequency thereof with the stored in form that concerns group at database.
6. input method word order optimization means according to claim 5 is characterized in that, said word order is optimized the unit and comprised:
Set is provided with module, is used for from identical vocabulary and the word frequency compositional optimization set thereof of database extraction with the candidate word of said input method;
Order module is used for sorting according to the vocabulary that the size order of said word frequency is gathered said optimization;
Set merges module, is used for said optimization set is merged with the candidate collection that is made up of said candidate word;
Reordering module, the word frequency rearrangement that is used for said merging is obtained and concentrate vocabulary obtains to optimize the candidate word of word order.
7. an input method system is characterized in that, comprising: like each described input method word order optimization means of claim 4 to 6.
CN2012100701089A 2012-03-15 2012-03-15 Method and device for optimizing word order of input method and system of input method Pending CN102637166A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN2012100701089A CN102637166A (en) 2012-03-15 2012-03-15 Method and device for optimizing word order of input method and system of input method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN2012100701089A CN102637166A (en) 2012-03-15 2012-03-15 Method and device for optimizing word order of input method and system of input method

Publications (1)

Publication Number Publication Date
CN102637166A true CN102637166A (en) 2012-08-15

Family

ID=46621563

Family Applications (1)

Application Number Title Priority Date Filing Date
CN2012100701089A Pending CN102637166A (en) 2012-03-15 2012-03-15 Method and device for optimizing word order of input method and system of input method

Country Status (1)

Country Link
CN (1) CN102637166A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103500016A (en) * 2013-09-27 2014-01-08 北京邮电大学 Character input optimization method based on interaction
CN107577666A (en) * 2017-09-14 2018-01-12 中国科学院声学研究所 A kind of Chinese preprocess method freely customized and its system

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1936893A (en) * 2006-06-02 2007-03-28 北京搜狗科技发展有限公司 Method and system for generating input-method word frequency base based on internet information
CN101334774A (en) * 2007-06-29 2008-12-31 北京搜狗科技发展有限公司 Character input method and input method system
EP2157497A1 (en) * 2007-07-24 2010-02-24 Research in Motion Limited Handheld electronic device and associated method enabling the output of non-alphabetic characters in a disambiguation environment

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1936893A (en) * 2006-06-02 2007-03-28 北京搜狗科技发展有限公司 Method and system for generating input-method word frequency base based on internet information
CN101334774A (en) * 2007-06-29 2008-12-31 北京搜狗科技发展有限公司 Character input method and input method system
EP2157497A1 (en) * 2007-07-24 2010-02-24 Research in Motion Limited Handheld electronic device and associated method enabling the output of non-alphabetic characters in a disambiguation environment

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103500016A (en) * 2013-09-27 2014-01-08 北京邮电大学 Character input optimization method based on interaction
CN107577666A (en) * 2017-09-14 2018-01-12 中国科学院声学研究所 A kind of Chinese preprocess method freely customized and its system
CN107577666B (en) * 2017-09-14 2019-11-19 中国科学院声学研究所 A kind of Chinese preprocess method freely customized and its system

Similar Documents

Publication Publication Date Title
CN107766371B (en) Text information classification method and device
CN100362525C (en) Method for gathering and recording business card information in mobile phone by using image recognition
CN101593200B (en) Method for classifying Chinese webpages based on keyword frequency analysis
US8977606B2 (en) Method and apparatus for generating extended page snippet of search result
US10366154B2 (en) Information processing device, information processing method, and computer program product
US20140115439A1 (en) Methods and systems for annotating web pages and managing annotations and annotated web pages
CN103777774B (en) The word error correction method of terminal installation and input method
CN101609707B (en) Information processing apparatus and information processing method
CN104933028A (en) Information pushing method and information pushing device
WO2015047920A1 (en) Title and body extraction from web page
CN101464903A (en) OCR picture and text recognition and retrieval method and system through web mode
CN102591475A (en) Content input method and system for online editor
EP3029567B1 (en) Method and device for updating input method system, computer storage medium, and device
CN105094775B (en) Webpage generation method and device
CN110889280B (en) Knowledge base construction method and device based on document splitting
CN104919457A (en) Method and apparatus for enriching social media to improve personalized user experience
CN103729457A (en) Digitalized book auxiliary reading system based on Internet, and method thereof
CN102200968A (en) Method and device for removing duplications of EXCEL form data
CN102141868A (en) Method for quickly operating information interaction page, input method system and browser plug-in
CN101727201A (en) Method and device for automatically adjusting symbol rank and input method system
CN111753514B (en) Automatic generation method and device of patent application text
CN114238689A (en) Video generation method, video generation device, electronic device, storage medium, and program product
CN111414471A (en) Method and apparatus for outputting information
CN103076894A (en) Method and equipment for building input entries for object identity information according to object identity information
CN101470699B (en) Information extraction model training apparatus, information extraction apparatus and information extraction system and method thereof

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C12 Rejection of a patent application after its publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20120815