CN102567365B - A kind of it is directed to input method and the system that key word is labeled - Google Patents

A kind of it is directed to input method and the system that key word is labeled Download PDF

Info

Publication number
CN102567365B
CN102567365B CN201010605285.3A CN201010605285A CN102567365B CN 102567365 B CN102567365 B CN 102567365B CN 201010605285 A CN201010605285 A CN 201010605285A CN 102567365 B CN102567365 B CN 102567365B
Authority
CN
China
Prior art keywords
key word
word
user
labeled
input
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201010605285.3A
Other languages
Chinese (zh)
Other versions
CN102567365A (en
Inventor
马宇尘
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shanghai Liangming Technology Development Co Ltd
Original Assignee
Shanghai Liangming Technology Development Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shanghai Liangming Technology Development Co Ltd filed Critical Shanghai Liangming Technology Development Co Ltd
Priority to CN201010605285.3A priority Critical patent/CN102567365B/en
Publication of CN102567365A publication Critical patent/CN102567365A/en
Application granted granted Critical
Publication of CN102567365B publication Critical patent/CN102567365B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Input From Keyboards Or The Like (AREA)

Abstract

The present invention provides a kind of and is directed to input method and the system that key word is labeled, and belongs to computer, software technology field.The method includes: step 1, gathers user's inputting character information in input interactive interface;Step 2, it is judged that whether the word corresponding to user's input information may make up key word;Step 3, adds key word mark to key word;Step 4, gathers user and inputs key word mark, recall key word from the candidate word list of input interactive interface.The present invention carries out keyword extraction, mark by the character information that user is inputted, with in key word dictionary, individually mate search, can be quickly found out, the final word result that user needs, the first-selected word hit rate overcoming existing input method is not high, thus causing that the input speed of user slows down, input efficiency reduces, the shortcoming of poor user experience.

Description

A kind of it is directed to input method and the system that key word is labeled
Technical field
The invention belongs to computer, software technology field.
Background technology
Current input method system is all inevitably present the corresponding multiple candidate's word problems of identical coding, for spelling input method, as: phonetic adding input method, purple light China space spelling input method etc., the word frequency (usage frequency of words) that this existing input method is all based in its dictionary and dictionary provides the sequence of candidate word for user in Information Inputting Process, the commonly used words that preferential display word frequency is the highest, i.e. first-selected word.The sequence of candidate word is user's important indicator of first-selected word hit rate height in Information Inputting Process.Described first-selected word hit rate refers to, after user inputs certain keypad information, sort preceding word, word or sentence are that user needs most.Such as, input Pinyin " guanxituili " (relation inference), described existing input method can obtain all of candidate word in dictionary according to phonetic " guanxi ", such as " relation ", " washing " and " Northwest " etc., then preferentially show that the everyday words " relation " that word frequency is the highest is first-selected word, meanwhile, obtaining, according to " tuili ", the word " reasoning " that in dictionary, word frequency is the highest is first-selected word, and composition " relation inference " is supplied to user's input.In this instance, the hit rate of first-selected word is 100%, namely complies fully with the needs of user.
Certainly, technically, input method system itself cannot know which words is that user needs most, but in vast as the open sea Chinese words, the use of each words and the frequency of occurrences are different, will appear from the higher words sequence of frequency in the front first-selected word hit rate that just can be greatly improved input method system, namely can improve the preceding words that sorts from probability and meet the probability of user's needs.
But, if the words required for user does not also correspond to the words that word frequency is the highest, such as, user's input " zizhuxuexiao " (subsidizes school), and input method gets the highest word of word frequency accordingly for " autonomous school ", in this case, it is necessary to user selects " subsidy " in all candidate word, to obtain required result.In practice, user adopts existing input method by selecting candidate word to obtain the probability of the result needed, than being directly obtained, the probability of effective first-selected word is much higher, and this indicates that, the first-selected word hit rate of existing input method is not high, thus causing that the input speed of user slows down, input efficiency reduces, and poor user experience, especially when a lot of phonetic of the disposable input of user, user need select candidate word number of times can be more, process is more loaded down with trivial details.
The present invention is to solve the problems referred to above, it is provided that a kind of character information to user's input carries out the input method of keyword extraction, mark and individually coupling search and supporting system thereof.
Summary of the invention
It is an object of the invention to overcome the defect of current input method and system thereof, it is provided that a kind of character information to user's input carries out the input method of keyword extraction, mark and individually coupling search and supporting system thereof.
A kind of being directed to the input method that key word is labeled, the method comprises the steps:
Step 1, gathers user's inputting character information in input interactive interface;
Step 2, it is judged that whether the word corresponding to user's input information may make up key word;
Step 3, adds key word mark to key word;
Step 4, gathers user and inputs key word mark, recall key word from the candidate word list of input interactive interface.
Further, described a kind of it is directed to the input method that key word is labeled, also there is following technical characteristic:
Described step 1 also comprises the steps:
A gathers user and carries out the input information of character information by inputting interactive interface unit, and the string encoding information that user is inputted by client is analyzed, and transfers the word information of correspondence in dictionary;
The word information recalled, according to the use frequency situation of user, is ranked up by b;
Word information composition candidate word sequence after sequence is exported to user by c.
In described step 2, by key word identifying unit, key word being judged, this step also includes following sub-step:
A passes through keyword feature value threshold module, sets the eigenvalue threshold size criteria of key word;
B calculates the eigenvalue size of word by word feature value module;
C passes through key word threshold value comparison module, the eigenvalue threshold size criteria of the eigenvalue size of above-mentioned word Yu key word is compared, thus showing whether this word is key word.
In described step 3, it is mark unit by key word, according to the judged result of key word in step 2, key word is labeled.
Described step 4 also comprises the following steps that
A transfers unit by key word, the key word marked out carries out coupling in key word dictionary and extracts;
The key word information extracted, according to the use frequency situation of user, is ranked up by b;
Key word information after sequence is exported by c;
D user selects target keyword in the candidate word list of output.
In described step 3, if key word dictionary does not include this key word, then carry out further searching for and combination from common dictionary special secondary school door to this key word, subsequently this key word is added module by key word and add in key word dictionary, by automatically updating module, key word dictionary is automatically updated.
A kind of being directed to the input system that key word is labeled, it includes common dictionary, and this system includes following ingredient:
Input interactive interface unit, it is the interfacial structure that user carries out being operated in character entering function;
Key word identifying unit, according to the user's input information that input interactive interface unit obtains, judges the functional structure that can make up multi-character words, or determines whether the functional structure belonging to implication key word;
Key word mark unit, it is the judged result obtained according to key word identifying unit, is directed to the functional structure that key word is labeled;
Unit transferred in key word, and it is the key word mark inputted according to user, recalls the functional structure of corresponding key word from candidate word list.
Further, described a kind of it is directed to the input system that key word is labeled, also there is following technical characteristic:
Described a kind of it is directed to the input system that key word is labeled, also includes the key word dictionary for storing key word information.
Described key word identifying unit, also includes the word feature value module for the eigenvalue of word is judged.
Described key word identifying unit, also includes for setting district participle language the word feature value threshold module of the eigenvalue threshold being whether key word.
Described key word identifying unit, also includes for the eigenvalue of word and set threshold value being compared, thus whether determine is the key word threshold value comparison module of key word.
Described a kind of it is directed to the input system that key word is labeled, is additionally provided with and adds module for the key word that the key word of storage will do not had in keywords database to be added.
Described a kind of it is directed to the input system that key word is labeled, also includes and automatically update module for what keywords database was updated.
Implement the present invention, have the advantages that this input method of the present invention carries out keyword extraction by the character information that user is inputted, mark, with in key word dictionary, individually mate search, it is possible to be quickly found out, user need final word result, the first-selected word hit rate overcoming existing input method is not high, thus causing that the input speed of user slows down, input efficiency reduces, the shortcoming of poor user experience.Especially when a lot of phonetic of the disposable input of user, user has only to the word information that will input, carry out keyword extraction and mark, then carry out individually quickly mating search to the key word of mark at key word dictionary, the highest for word frequency in the keyword candidate word searched out is displayed to user as first-selection word, user selects correct key word, and other correct non-key word completes once to input, it is no longer necessary to user refund and reselect correct words, it is to avoid input loaded down with trivial details.
Accompanying drawing explanation
Fig. 1 is a kind of theory diagram being directed to the input system that key word is labeled of the present invention.
Fig. 2 is a kind of flow chart being directed to the input method that key word is labeled of the present invention.
Fig. 3 is a kind of embodiment schematic diagram in the present invention, key word being labeled.
Fig. 4 is the embodiment schematic diagram that the candidate word list of key word carries out in the present invention Keyword Selection, for one of which embodiment.
Fig. 5 is the embodiment schematic diagram that the candidate word list of key word carries out in the present invention Keyword Selection, for another kind of embodiment.
Detailed description of the invention
Below in conjunction with accompanying drawing, a kind of it is directed to input method and the system that key word is labeled to of the present invention, does more detailed introduction.
Ginseng Fig. 1, shown in 2, respectively show this cardinal principle structure being directed to the input system that key word is labeled of the present invention and the method flow of correspondence.
It is directed to the input system 100 that key word is labeled as can be seen from Figure 1 to include: key word dictionary 110, input interactive interface unit 120, key word identifying unit 130, key word mark unit 140, unit 150 transferred in key word, key word adds module 160, automatically updates the structures such as module 170.
Key word identifying unit 130 therein, also includes word feature value module 131, keyword feature value threshold module 132 and key word threshold value comparison module 133.
Below in conjunction with concrete method, this it is directed to input method and the system that key word is labeled to of the present invention, is described in detail.
Described in the invention is directed to the input method that key word is labeled, and comprises the steps:
Step 1, gathers user's inputting character information in input interactive interface.
A user carries out the input of character information by inputting interactive interface unit 120, and the string encoding information that user is inputted by client is analyzed, and transfers the word information of correspondence in common dictionary.
The word information recalled, according to the use frequency situation of user, is ranked up by b.
Word information composition candidate word sequence after sequence is exported to user by c.
For example and without limitation, the present invention once inputs the situation of relatively more character informations suitable in user.
Step 2, it is judged that whether the word corresponding to user's input information may make up key word.
In described step 2, by key word identifying unit 130, key word being judged, this step also includes following sub-step:
A, by the keyword feature value threshold module 132 in key word identifying unit 130, sets the eigenvalue threshold size criteria of key word;
B, by the word feature value module 131 in key word identifying unit 130, calculates the eigenvalue size of word;
The eigenvalue threshold size criteria of the eigenvalue size of above-mentioned word Yu key word, by the key word threshold value comparison module 133 in key word identifying unit 130, is compared by c,
If the eigenvalue of word is less than keyword feature value threshold value, then this word is judged to it is not key word;If the eigenvalue of word is be more than or equal to keyword feature value threshold value, then this word is judged to it is key word.
Wherein word feature value module 131 is when carrying out eigenvalue calculation to word, a factor that word feature is influential is the frequency that word is used by ordinary populace user, such as, the word inputted as user is very popular common words, it is that everybody is commonly used, " direction " is had as the word that phonetic is " fangxiang ", " fragrance ", corresponding words such as " Fang Xiang ", although the word of correspondence is relatively more, but we are it is seen that the meaning between word falls far short, in this case, the word characteristic of correspondence value that phonetic is " fangxiang " will be smaller.
The word that another influence factor is the user using this terminal uses frequency, when a word, it it is not very popular word, when user inputs first time, this word characteristic of correspondence value will be higher, thus being judged as key word, while user inputs this key word, this word is added in the key word dictionary 110 of access customer correspondence client, when next time, user used, this word characteristic of correspondence value will be relatively low, and especially corresponding nonexpondable key word word can also directly be judged to non-key word.
Also has an influence factor, it is exactly polyphone and the polyphonic word situation of word, such as, the word that phonetic is " gongshi " is just to having " formula ", " common recognition ", " offensive ", " publicity " " working together ", " public affair " etc., relatively many as this polyphonic word, its characteristic of correspondence value also can be higher, thus being more likely judged as key word.
For example and without limitation, the eigenvalue of described word can change, even if same word also can change according to the service condition of user, and the accuracy rate of only in this way guarantee user input and the speed of input.
Step 3, adds key word mark to key word.
In this step, it is mark unit 140 by key word, according to the judged result of key word in step 2, key word is labeled.The mode of concrete mark as shown in Figure 3, marked by underscore, can also be otherwise, such as, by change word brightness, bracket, square frame, etc. form be labeled, simultaneously below the key word being marked or side arrange and fast select key, carry out selecting operation to key word by this button, be utilize shortcut " F1, F2 " to carry out the key word in candidate word list selecting operation in the drawings.
Step 4, gathers user and inputs key word mark, recall key word from the candidate word list of input interactive interface.
A transfers unit 150 by key word, the key word marked out carries out coupling in key word dictionary 110 and extracts.
In this step, if key word dictionary 110 does not include this key word, then carry out further searching for and combination from common dictionary special secondary school door to this key word, subsequently this key word is added module 160 by key word and add in key word dictionary, by automatically updating module 170, key word dictionary is automatically updated, to ensure that user is using up-to-date key word dictionary always.
The key word information extracted, according to the use frequency situation of user, is ranked up by b.
Key word information after sequence is exported by c, as shown in Figure 4,5, for the wherein key word information two of which way of output, the operated corresponding key word of shortcut " F2 " is " chaoxian ", can be seen that the information meaning that user inputs, user needs the word selected to be " cosmic string ", operates so just key word can be carried out selection by the numeral that shortcut " F2 " and key word are corresponding.
D user selects target keyword in the candidate word list of output.
In order to make it easy to understand, we provide a kind of mode that key word is labeled, as shown in Figure 3.
The display mode of Fig. 4 and Fig. 5 respectively two kinds of key word information, in the diagram, candidate keywords list is shown in below the shortcut " F2 " of correspondence, utilizes shortcut " F2 " and corresponding digital keys to carry out the selection of target keyword.In Figure 5, candidate keywords list is shown in the right side of the shortcut " F2 " of correspondence, carries out the selection of target keyword also with shortcut " F2 " and corresponding digital keys.
Certainly, the mode being particularly shown and selecting of key word is not limited by the present invention.
It is above the description of this invention and non-limiting, based on other embodiment of inventive concept, all among protection scope of the present invention.

Claims (13)

1. one kind is directed to the input method that key word is labeled, it is characterised in that the method comprises the steps:
Step 1, gathers user's inputting character information in input interactive interface;
Step 2, it is judged that whether the word corresponding to user's input information may make up key word;
Step 3, adds key word mark to key word;
Step 4, gathers user and inputs key word mark, recall key word from the candidate word list of input interactive interface.
2. according to claim 1 a kind of it is directed to the input method that key word is labeled, it is characterised in that described step 1 also comprises the steps:
A gathers user and carries out the input information of character information by inputting interactive interface unit, and the string encoding information that user is inputted by client is analyzed, and transfers the word information of correspondence in dictionary;
The word information recalled, according to the use frequency situation of user, is ranked up by b;
Word information composition candidate word sequence after sequence is exported to user by c.
3. according to claim 1 a kind of it is directed to the input method that key word is labeled, it is characterised in that in described step 2, by key word identifying unit, key word being judged, this step also includes following sub-step:
A passes through keyword feature value threshold module, sets the eigenvalue threshold size criteria of key word;
B calculates the eigenvalue size of word by word feature value module;
C passes through key word threshold value comparison module, the eigenvalue threshold size criteria of the eigenvalue size of above-mentioned word Yu key word is compared, thus showing whether this word is key word.
4. according to claim 1 a kind of it is directed to the input method that key word is labeled, it is characterised in that in described step 3, be mark unit by key word, according to the judged result of key word in step 2, key word be labeled.
5. according to claim 1 a kind of it is directed to the input method that key word is labeled, it is characterised in that described step 4 also comprises the following steps that
A transfers unit by key word, the key word marked out carries out coupling in key word dictionary and extracts;
The key word information extracted, according to the use frequency situation of user, is ranked up by b;
Key word information after sequence is exported by c;
D user selects target keyword in the candidate word list of output.
6. according to claim 1 a kind of it is directed to the input method that key word is labeled, it is characterized in that, in described step 3, if key word dictionary does not include this key word, then carry out further searching for and combination from common dictionary special secondary school door to this key word, subsequently this key word is added module by key word and add in key word dictionary, by automatically updating module, key word dictionary is automatically updated.
7. being directed to the input system that key word is labeled, it includes common dictionary, it is characterised in that this system includes following ingredient:
Input interactive interface unit, it is the interfacial structure that user carries out being operated in character entering function;
Key word identifying unit, according to the user's input information that input interactive interface unit obtains, judges the functional structure that can make up multi-character words, and determines whether the functional structure belonging to implication key word;
Key word mark unit, it is the judged result obtained according to key word identifying unit, is directed to the functional structure that key word is labeled;
Unit transferred in key word, and it is the key word mark inputted according to user, recalls the functional structure of corresponding key word from candidate word list.
8. according to claim 7 a kind of it is directed to the input system that key word is labeled, it is characterised in that described a kind of be directed to the input system that key word is labeled, also includes the key word dictionary for storing key word information.
9. according to claim 7 a kind of it is directed to the input system that key word is labeled, it is characterised in that described key word identifying unit, also includes the word feature value module for the eigenvalue of word is judged.
10. according to claim 7 a kind of it is directed to the input system that key word is labeled, it is characterised in that described key word identifying unit, also includes for setting district participle language the keyword feature value threshold module of the eigenvalue threshold being whether key word.
A kind of it is directed to the input system that key word is labeled 11. according to claim 7, it is characterized in that, described key word identifying unit, also include for the eigenvalue of word and set threshold value are compared, thus whether determine is the key word threshold value comparison module of key word.
A kind of it is directed to the input system that key word is labeled 12. according to claim 7, it is characterized in that, described a kind of it is directed to the input system that key word is labeled, is additionally provided with and adds module for the key word that the key word of storage will do not had in keywords database to be added.
A kind of it is directed to the input system that key word is labeled 13. according to claim 7, it is characterised in that described a kind of be directed to the input system that key word is labeled, also includes and automatically update module for what keywords database was updated.
CN201010605285.3A 2010-12-26 2010-12-26 A kind of it is directed to input method and the system that key word is labeled Active CN102567365B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201010605285.3A CN102567365B (en) 2010-12-26 2010-12-26 A kind of it is directed to input method and the system that key word is labeled

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201010605285.3A CN102567365B (en) 2010-12-26 2010-12-26 A kind of it is directed to input method and the system that key word is labeled

Publications (2)

Publication Number Publication Date
CN102567365A CN102567365A (en) 2012-07-11
CN102567365B true CN102567365B (en) 2016-07-06

Family

ID=46412805

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201010605285.3A Active CN102567365B (en) 2010-12-26 2010-12-26 A kind of it is directed to input method and the system that key word is labeled

Country Status (1)

Country Link
CN (1) CN102567365B (en)

Families Citing this family (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9116549B2 (en) * 2012-08-16 2015-08-25 Chih-Lung Yang Electronic device having input control application
CN102999625A (en) * 2012-12-05 2013-03-27 北京海量融通软件技术有限公司 Method for realizing semantic extension on retrieval request
CN103076894B (en) * 2012-12-31 2016-05-18 百度在线网络技术(北京)有限公司 A kind of for build the method and apparatus of input entry according to object id information
CN105653157A (en) * 2015-12-30 2016-06-08 广州华多网络科技有限公司 Processing method and device for copied text
CN106527752B (en) * 2016-09-23 2019-03-19 百度在线网络技术(北京)有限公司 It is a kind of for provide input candidate item method and apparatus
CN106484135B (en) * 2016-09-23 2019-03-19 百度在线网络技术(北京)有限公司 It is a kind of for provide input candidate item method and apparatus
WO2021026428A1 (en) * 2019-08-07 2021-02-11 Zinatt Technologies, Inc. Data entry feature for information tracking system

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101013443A (en) * 2007-02-13 2007-08-08 北京搜狗科技发展有限公司 Intelligent word input method and input method system and updating method thereof
CN101520786A (en) * 2008-02-27 2009-09-02 北京搜狗科技发展有限公司 Method for realizing input method dictionary and input method system

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20070022134A1 (en) * 2005-07-22 2007-01-25 Microsoft Corporation Cross-language related keyword suggestion

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101013443A (en) * 2007-02-13 2007-08-08 北京搜狗科技发展有限公司 Intelligent word input method and input method system and updating method thereof
CN101520786A (en) * 2008-02-27 2009-09-02 北京搜狗科技发展有限公司 Method for realizing input method dictionary and input method system

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
一个拼音汉字自动转换系统的设计与实现;成华等;《北京航空航天大学学报》;19960831;第22卷(第4期);第465-469页 *
人工智能在拼音输入法中的应用;袁哲;《软件导刊》;20100630;第9卷(第6期);第10-12页 *

Also Published As

Publication number Publication date
CN102567365A (en) 2012-07-11

Similar Documents

Publication Publication Date Title
CN102567365B (en) A kind of it is directed to input method and the system that key word is labeled
TWI677796B (en) Semantic extraction method and device of natural language and computer storage medium
CN103491205B (en) The method for pushing of a kind of correlated resources address based on video search and device
US9396178B2 (en) Systems and methods for an automated personalized dictionary generator for portable devices
US10558754B2 (en) Method and system for automating training of named entity recognition in natural language processing
CN105094368B (en) A kind of control method and control device that frequency modulation sequence is carried out to candidates of input method
CN103605665A (en) Keyword based evaluation expert intelligent search and recommendation method
CN105917327A (en) System and method for inputting text into electronic devices
US11907671B2 (en) Role labeling method, electronic device and storage medium
CN103365925A (en) Method for acquiring polyphone spelling, method for retrieving based on spelling, and corresponding devices
CN104850241A (en) Mobile terminal and text input method thereof
CN105630884A (en) Geographic position discovery method for microblog hot event
CN104281702A (en) Power keyword segmentation based data retrieval method and device
CN108446316A (en) Recommendation method, apparatus, electronic equipment and the storage medium of associational word
CN110532354A (en) The search method and device of content
EP2875418A1 (en) String predictions from buffer
JP2022530690A (en) Query auto-completion methods, appliances, equipment, and computer storage media
CN104281275B (en) The input method of a kind of English and device
CN103488787A (en) Method and device for pushing online playing entry objects based on video retrieval
CN103617204B (en) Contact fast searching method based on android system
CN114860913B (en) Intelligent question-answering system construction method, question-answering processing method and device
CN104035955A (en) Search method and device
CN107679122B (en) Fuzzy search method and terminal
CN103500214B (en) Word segmentation information pushing method and device based on video searching
CN105260419A (en) Associated keyword recommendation method and apparatus

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant