CN109002184A - Association method and device for input method candidate words - Google Patents

Association method and device for input method candidate words

Info

Publication number
CN109002184A
Authority
CN
China
Prior art keywords
candidate word
proper noun
information
classification information
classification
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201710424511.XA
Other languages
Chinese (zh)
Other versions
CN109002184B (en
Inventor
费腾
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Sogou Technology Development Co Ltd
Original Assignee
Beijing Sogou Technology Development Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Sogou Technology Development Co Ltd
Priority to CN201710424511.XA
Publication of CN109002184A
Application granted
Publication of CN109002184B
Legal status: Active

Classifications

    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06F: ELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00: Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01: Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/02: Input arrangements using manually operated switches, e.g. using keyboards or dials
    • G06F3/023: Arrangements for converting discrete items of information into a coded form, e.g. arrangements for interpreting keyboard generated codes as alphanumeric codes, operand codes or instruction codes
    • G06F3/0233: Character input methods
    • G06F3/0237: Character input methods using prediction or retrieval techniques

Landscapes

  • Engineering & Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Machine Translation (AREA)

Abstract

Embodiments of the invention provide an association method and device for input method candidate words. The method comprises: obtaining the text information that has been committed to the screen; judging whether the text information is a proper noun; if so, identifying the classification information of the proper noun; obtaining at least one candidate word that matches the classification information; and displaying the at least one candidate word. This solves the prior-art problem that, after a user inputs a proper noun such as a personal name or a place name, the association function of the input method cannot continue to display or recommend candidate words. By identifying the classification information of the proper noun and making association recommendations according to that classification information, the embodiments reduce the user's input operations and improve the user's input speed.

Description

Association method and device for input method candidate words
Technical field
The present invention relates to the technical field of input methods, and in particular to an association method for input method candidate words and an association device for input method candidate words.
Background
Input methods generally have an association function: after the user commits a word to the screen, the input method continues to display several candidate words that have a particular relationship with that word for the user to choose from. For example, Fig. 1 is a schematic diagram of the association function of a prior-art input method. After the user commits the word "eat", the input method offers candidate words such as "meal" and "hot pot" for the user to select, which greatly simplifies the user's input.
The association function of an input method is typically implemented on the basis of binary or n-ary relations between words. For example, the words "eat" and "hot pot" can be combined into the phrase "eat hot pot", so the two words can be regarded as having a binary relation. Likewise, the words "eat", "big" and "apple" can be combined into "eat big apple", so the three words can be regarded as having a ternary relation. By extension, when several words are related in this way, they have an n-ary relation.
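As a rough illustration of the prior-art mechanism described above, the following Python sketch builds a table of binary relations from a few segmented phrases and looks up candidates for the word just committed. The phrase list and the function names are assumptions made for illustration; the source does not describe how an input method stores these relations.

```python
from collections import defaultdict

# Hypothetical training phrases, already segmented into words.
PHRASES = [
    ["eat", "hot pot"],
    ["eat", "meal"],
    ["eat", "big", "apple"],
]

def build_bigram_table(phrases):
    """Map each word to the set of words observed directly after it."""
    table = defaultdict(set)
    for words in phrases:
        for prev, nxt in zip(words, words[1:]):
            table[prev].add(nxt)
    return table

def associate(table, committed_word):
    """Return association candidates for the word just committed."""
    return sorted(table.get(committed_word, ()))

table = build_bigram_table(PHRASES)
print(associate(table, "eat"))        # words observed after "eat"
print(associate(table, "Zhang San"))  # a proper noun missing from the table
```

A proper noun that never appears in the phrases has no entry in the table, so the lookup returns nothing. That is the limitation the following paragraph describes.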
However, proper nouns such as personal names and place names usually have few associated n-ary relations, and the size of the input method's lexicon is limited, so such proper nouns are rarely included in the binary or n-ary relations described above. As a result, after the user commits a proper noun, the input method can hardly display candidate words that match the proper noun through the association function, and the user must start a new input to obtain the desired candidate word.
Summary of the invention
In view of the above problems, embodiments of the present invention are proposed to provide an association method for input method candidate words, and a corresponding association device for input method candidate words, that overcome the above problems or at least partly solve them.
To solve the above problems, an embodiment of the invention discloses an association method for input method candidate words, comprising:
Obtaining the text information that has been committed to the screen;
Judging whether the text information is a proper noun;
If so, identifying the classification information of the proper noun;
Obtaining at least one candidate word that matches the classification information;
Displaying the at least one candidate word.
Optionally, the proper noun includes a personal name and/or a place name.
Optionally, the step of judging whether the text information is a proper noun comprises:
Traversing a preset database, the preset database including multiple preset proper nouns, each preset proper noun having corresponding classification information;
Judging whether any preset proper noun is identical to the text information;
If so, judging that the text information is a proper noun, and obtaining the classification information of the proper noun.
Optionally, the preset database includes the address book information of the terminal, and the step of identifying the classification information of the proper noun comprises:
When the text information coincides with any name in the address book, identifying the classification information of the proper noun as the ordinary-name category.
Optionally, each piece of classification information includes at least one candidate word that matches it, and the step of obtaining at least one candidate word that matches the classification information comprises:
Obtaining, from the preset database, at least one candidate word that matches the classification information of the proper noun.
Optionally, the preset database is generated as follows:
Collecting multiple proper nouns;
Classifying the multiple proper nouns to generate multiple pieces of classification information;
Obtaining, for each piece of classification information, the candidate words that have a particular relationship with it;
Generating the preset database from the multiple pieces of classification information and the candidate words that match them.
Optionally, the particular relationship includes a binary relation or an n-ary relation, and the step of obtaining, for each piece of classification information, the candidate words that have a particular relationship with it comprises:
Collecting corpus information, each piece of corpus information including multiple segmented words;
Extracting the proper nouns among the segmented words and the classification information corresponding to each proper noun;
Counting, for each proper noun, the segmented words that have a binary or n-ary relation with it, and taking them as candidate words of the classification information corresponding to that proper noun.
Optionally, the method further comprises:
When an instruction by which the user selects any candidate word is received, committing that candidate word to the screen.
To solve the above problems, an embodiment of the invention discloses an association device for input method candidate words, comprising:
A text information obtaining module, for obtaining the text information that has been committed to the screen;
A proper noun judgment module, for judging whether the text information is a proper noun;
A classification information identification module, for identifying the classification information of the proper noun when the text information is a proper noun;
A candidate word obtaining module, for obtaining at least one candidate word that matches the classification information;
A candidate word display module, for displaying the at least one candidate word.
Optionally, the proper noun includes a personal name and/or a place name.
Optionally, the proper noun judgment module comprises:
A traversal submodule, for traversing a preset database, the preset database including multiple preset proper nouns, each preset proper noun having corresponding classification information;
A judging submodule, for judging whether any preset proper noun is identical to the text information;
An obtaining submodule, for judging that the text information is a proper noun when a preset proper noun identical to the text information exists, and obtaining the classification information of the proper noun.
Optionally, the preset database includes the address book information of the terminal, and the classification information identification module comprises:
An identification submodule, for identifying the classification information of the proper noun as the ordinary-name category when the text information coincides with any name in the address book.
Optionally, each piece of classification information includes at least one candidate word that matches it, and the candidate word obtaining module comprises:
A candidate word obtaining submodule, for obtaining, from the preset database, at least one candidate word that matches the classification information of the proper noun.
Optionally, the preset database is generated by calling the following modules:
A collection module, for collecting multiple proper nouns;
A classification module, for classifying the multiple proper nouns to generate multiple pieces of classification information;
An obtaining module, for obtaining, for each piece of classification information, the candidate words that have a particular relationship with it;
A generation module, for generating the preset database from the multiple pieces of classification information and the candidate words that match them.
Optionally, the particular relationship includes a binary relation or an n-ary relation, and the obtaining module comprises:
A corpus information collection submodule, for collecting corpus information, each piece of corpus information including multiple segmented words;
A proper noun extraction submodule, for extracting the proper nouns among the segmented words and the classification information corresponding to each proper noun;
A candidate word statistics submodule, for counting, for each proper noun, the segmented words that have a binary or n-ary relation with it, and taking them as candidate words of the classification information corresponding to that proper noun.
Optionally, the device further comprises:
A candidate word commit module, for committing a candidate word to the screen when an instruction by which the user selects that candidate word is received.
To solve the above problems, an embodiment of the invention discloses an association device for input method candidate words, comprising a memory and one or more programs, wherein the one or more programs are stored in the memory and are configured to be executed by one or more processors, the one or more programs including instructions for performing the following operations:
Obtaining the text information that has been committed to the screen;
Judging whether the text information is a proper noun;
If so, identifying the classification information of the proper noun;
Obtaining at least one candidate word that matches the classification information;
Displaying the at least one candidate word.
Compared with the background art, the embodiments of the present invention have the following advantages:
The embodiments obtain the text information that has been committed to the screen and judge whether it is a proper noun. If so, they identify the classification information of the proper noun, obtain at least one candidate word that matches the classification information, and display the at least one candidate word. This solves the prior-art problem that, after a user inputs a proper noun such as a personal name or a place name, the association function of the input method cannot continue to display or recommend candidate words. By identifying the classification information of the proper noun and making association recommendations according to that classification information, the embodiments reduce the user's input operations and improve the user's input speed.
Brief description of the drawings
Fig. 1 is a schematic diagram of the association function of a prior-art input method;
Fig. 2 is a flow chart of the steps of embodiment one of an association method for input method candidate words of the present invention;
Fig. 3 is a schematic diagram of a candidate word display of the present invention;
Fig. 4 is a flow chart of the steps of embodiment two of an association method for input method candidate words of the present invention;
Fig. 5 is a structural block diagram of an embodiment of an association device for input method candidate words of the present invention;
Fig. 6 is a block diagram of an association device for input method candidate words according to an exemplary embodiment.
Detailed description
To make the above objects, features and advantages of the present invention clearer and easier to understand, the present invention is described in further detail below with reference to the accompanying drawings and specific embodiments.
Referring to Fig. 2, a flow chart of the steps of embodiment one of an association method for input method candidate words of the present invention is shown. The method may specifically include the following steps:
Step 201, obtain the text information that has been committed to the screen;
In a specific implementation, the embodiments of the present invention may be applied to all kinds of terminals, for example mobile phones, PDAs (Personal Digital Assistants), computers and palmtop computers; the embodiments place no limit on the specific type of terminal.
These terminals may support operating systems including Windows, Android, iOS and Windows Phone. Input may be made through an external input device such as a keyboard, and the terminals may also run applications that accept input through a virtual keyboard, for example an input method program.
Taking a computer as an example, the user can input a character string by tapping physical keys on the keyboard. On a mobile terminal with a touch screen, the user can input a character string by tapping virtual keys on a virtual keyboard. In either case the text information is thereby input.
In languages such as Chinese and Japanese, the characters that serve as basic language units are generally not mapped directly to keys on the keyboard, so input usually requires a conversion between keyed characters and words.
Specifically, the input method system can use an encoding rule to establish a mapping between such characters and character strings that can be typed directly. Common encodings for Chinese include pinyin (for example abbreviated pinyin, double pinyin, full pinyin and fuzzy pinyin), Wubi, and so on.
In the embodiments of the present invention, after the user inputs a character string, for example a pinyin string, the input method can display multiple Chinese characters or phrases that match the pinyin string, and after the user selects one of them, commit that character or phrase to the screen.
For example, for the character string "guoqing" input by the user, the input method can match phrases such as "National Day", "national conditions" or "national youth team". When the user selects the phrase "National Day", that phrase is committed to the screen.
It should be noted that the committed text information in the embodiments of the present invention may be the Chinese character or phrase that the user committed most recently during input.
For example, when the user inputs "I want to go to Sydney" and commits "I", "want to go to" and "Sydney" in turn, the committed text information in the embodiments of the present invention may refer to the phrase committed last, "Sydney".
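The notion of "the phrase committed last" can be sketched in Python as a small history object. The class and method names below are hypothetical, and the source does not say how an input method tracks committed segments; this only illustrates that association is driven by the final segment rather than the whole sentence.

```python
class CommitHistory:
    """Records segments in the order the user commits them (illustrative only)."""

    def __init__(self):
        self._segments = []

    def commit(self, segment):
        # Called each time the user selects a candidate and it goes on screen.
        self._segments.append(segment)

    def last_committed(self):
        # The text information that step 201 would hand to the association logic.
        return self._segments[-1] if self._segments else None


history = CommitHistory()
for segment in ["I", "want to go to", "Sydney"]:
    history.commit(segment)

print(history.last_committed())
```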
Step 202, judge whether the text information is a proper noun;
In the embodiments of the present invention, after the committed text information is obtained, it can first be judged whether the text information is a phrase of a particular type, for example a proper noun such as a personal name or a place name. If so, step 203 can be executed.
Of course, besides personal names and place names, proper nouns may also include other types such as institution names and organization names; the embodiments of the present invention place no limit on the specific type of proper noun.
As one example of the invention, a database may be preset in the terminal or the input method in advance, and preset proper nouns of multiple categories, such as personal names and place names, may be stored in it. After the committed text information is obtained, the preset database is traversed to judge whether any preset proper noun is identical to the text information. If so, the text information can be judged to be a proper noun, and the classification information of the proper noun is further obtained.
In a specific implementation, multiple ordinary names, female celebrity names, male celebrity names, athlete names, place names and so on can be collected and sorted according to their respective classification information. When the text information committed by the user hits a phrase under any classification information, the text information can be regarded as a proper noun. For example, after the user commits "Jin Dong", traversing the preset database finds that the male celebrity name category includes the phrase "Jin Dong", so the committed phrase "Jin Dong" can be regarded as a proper noun whose classification information is male celebrity name.
As another example of the invention, the preset database may also include the address book information of the terminal, so that whether the last committed phrase is an ordinary name can be determined by consulting the terminal's address book information.
In general, address book information includes the contact details of at least one contact, such as the contact's name and mobile phone number. By judging whether the text information coincides with any name in the address book, it can be judged whether the text information is an ordinary name.
Of course, besides address book information, other local files of the terminal can also be obtained and added to the preset database. For example, information such as memos or calendar notes can be obtained. If a calendar note records "birthday of Zhang San" and the last committed phrase is "Zhang San", then "Zhang San" can be regarded as a personal name.
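A minimal Python sketch of gathering ordinary names from local data follows. The contact record layout, the "birthday of NAME" note format and all function names are assumptions made for illustration; the source only states that the address book and notes such as calendar remarks can contribute names.

```python
import re

# Assumed note format; real calendar notes would need more robust parsing.
BIRTHDAY_NOTE = re.compile(r"^birthday of (?P<name>.+)$")

def names_from_notes(notes):
    """Extract names from notes that follow the assumed 'birthday of NAME' form."""
    names = set()
    for note in notes:
        match = BIRTHDAY_NOTE.match(note.strip())
        if match:
            names.add(match.group("name"))
    return names

def local_ordinary_names(contacts, notes):
    """Union of address book names and names found in notes."""
    contact_names = {contact["name"] for contact in contacts}
    return contact_names | names_from_notes(notes)

# Hypothetical local data.
contacts = [{"name": "Li Si", "phone": "000-0000"}]
notes = ["birthday of Zhang San", "team meeting"]

names = local_ordinary_names(contacts, notes)
print("Zhang San" in names)
```

A committed phrase found in the resulting set would then be treated as belonging to the ordinary-name category.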
Step 203, identify the classification information of the proper noun;
In the embodiments of the present invention, after the committed text information is determined to be a proper noun, the classification information of the proper noun can be further identified, for example whether it is an ordinary name, a female celebrity name, a male celebrity name, an athlete name, a place name, and so on.
Of course, those skilled in the art can determine other classification information according to actual needs; the embodiments of the present invention place no limit on this.
Step 204, obtain at least one candidate word that matches the classification information;
In the embodiments of the present invention, multiple candidate words can be set for the phrases of each category according to the classification information. After the classification information of the committed text information is identified, at least one candidate word that matches that classification information is obtained.
For example, candidate words such as "so handsome" and "performance" can be set for the male celebrity name category, and candidate words such as " " and "go" can be set for the ordinary-name category. When the classification information of the committed proper noun "Jin Dong" is identified as male celebrity name, the candidate word "so handsome" can be obtained; when the classification information of the committed proper noun "Zhang San" is identified as ordinary name, the candidate word " " can be obtained. Of course, those skilled in the art can also determine the candidate words of each category according to actual needs; the embodiments of the present invention place no limit on this.
Step 205, display the at least one candidate word.
After a candidate word that matches the classification information of the committed proper noun is obtained, the candidate word can be presented to the user so that the user can select it directly and commit it to the screen.
Fig. 3 is a schematic diagram of a candidate word display of the present invention. After the user commits the phrase "Zhang San", the phrase is judged to be an ordinary name, so multiple candidate words that match the ordinary-name category, such as "again" and " ", can be obtained and presented to the user.
In the embodiments of the present invention, the text information that has been committed to the screen is obtained, and it is judged whether the text information is a proper noun. If so, the classification information of the proper noun is identified, at least one candidate word that matches the classification information is obtained, and the at least one candidate word is displayed. This solves the prior-art problem that, after a user inputs a proper noun such as a personal name or a place name, the association function of the input method cannot continue to display or recommend candidate words. By identifying the classification information of the proper noun and making association recommendations according to that classification information, the embodiments reduce the user's input operations and improve the user's input speed.
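Steps 202 to 205 of embodiment one can be sketched in Python as two lookups. The two dictionaries and the function name are hypothetical, and the candidate lists are illustrative only; the source does not prescribe a data layout for the preset database.

```python
# Hypothetical preset data; the source does not prescribe any storage layout.
NOUN_CATEGORY = {
    "Zhang San": "ordinary name",
    "Jin Dong": "male celebrity name",
}
CATEGORY_CANDIDATES = {
    "ordinary name": ["is"],
    "male celebrity name": ["so handsome", "performance"],
}

def associate_candidates(committed_text):
    """Return the candidates to display, or [] if the text is not a proper noun."""
    # Steps 202 and 203: is the committed text a known proper noun, and of which category?
    category = NOUN_CATEGORY.get(committed_text)
    if category is None:
        return []
    # Step 204: fetch the candidates registered for that category.
    return list(CATEGORY_CANDIDATES.get(category, []))

# Step 205 would hand this list to the candidate bar for display.
print(associate_candidates("Jin Dong"))
print(associate_candidates("guoqing"))
```

Because candidates are attached to the category rather than to each individual name, a name added to a category later receives the same candidates without any new word-level statistics.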
Referring to Fig. 4, a flow chart of the steps of embodiment two of an association method for input method candidate words of the present invention is shown. The method may specifically include the following steps:
Step 401, generate a preset database;
In a specific implementation, the embodiments of the present invention may be applied to all kinds of terminals, for example mobile phones, computers and tablet computers; the embodiments place no limit on the specific type of terminal.
In the embodiments of the present invention, the step of generating the preset database may specifically include the following sub-steps:
Sub-step 4011, collect multiple proper nouns;
In general, when the user inputs a proper noun such as a personal name or a place name, the input method cannot display candidate words that match the proper noun, and the user must start a new input to obtain the desired candidate word. To solve this problem, multiple proper nouns can first be collected.
In the embodiments of the present invention, a web crawler can be used to fetch multiple proper nouns. A web crawler, also known as a web spider, is a program or script that automatically fetches information from the web according to certain rules. It can selectively visit web pages and related links on the World Wide Web according to a set crawl target and obtain the required information.
Specifically, a web crawler can fetch data from specialized websites. For example, the names of male and female celebrities can be fetched from websites such as celebrity databases, common surnames and given names can be fetched from baby-naming websites, and place names can be fetched from place-name information websites. Those skilled in the art can choose the data to be fetched according to actual needs; the embodiments of the present invention place no limit on this.
Of course, those skilled in the art can also collect proper nouns in other ways, for example by obtaining information such as the terminal's address book or memos and treating the contact names recorded there as one kind of proper noun for subsequent consideration; the embodiments of the present invention place no limit on this either.
Sub-step 4012, classify the multiple proper nouns to generate multiple pieces of classification information;
In a specific implementation, the fetched proper nouns can be classified into multiple categories, for example a place-name category, an ordinary-name category and a non-ordinary-name category. The place-name category may in turn include a domestic place-name category and a foreign place-name category, and the non-ordinary-name category may include a celebrity-name category, an athlete-name category and so on. Of course, those skilled in the art can determine other classification information according to actual needs. For example, the celebrity-name category can be further divided into a male celebrity name category and a female celebrity name category, and the male celebrity name category can be further divided into a domestic male celebrity name category and a foreign male celebrity name category; the present invention places no limit on this.
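The nested categories described above can be sketched as a child-to-parent map in Python. The map layout and the `ancestors` helper are assumptions for illustration only; the source names the categories but does not say how the hierarchy is stored or whether a lookup would ever consult a parent category.

```python
# Hypothetical child -> parent map built from the categories named in the text.
CATEGORY_PARENT = {
    "domestic male celebrity name": "male celebrity name",
    "foreign male celebrity name": "male celebrity name",
    "male celebrity name": "celebrity name",
    "female celebrity name": "celebrity name",
    "celebrity name": "non-ordinary name",
    "athlete name": "non-ordinary name",
    "domestic place name": "place name",
    "foreign place name": "place name",
}

def ancestors(category):
    """Return the category followed by each broader category above it."""
    chain = [category]
    while chain[-1] in CATEGORY_PARENT:
        chain.append(CATEGORY_PARENT[chain[-1]])
    return chain

print(ancestors("domestic male celebrity name"))
```

One possible use, which is an assumption rather than something the source states, is to fall back to a broader category's candidate words when a fine-grained category has none of its own.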
It should be noted that acquiring multiple proper nouns, and when classifying to the multiple proper nouns collected, It can also be checked, corrected or be supplemented by artificial means, to guarantee the accuracy of database.
Sub-step 4013 obtains the candidate word for having particular kind of relationship with each classification information respectively;
In embodiments of the present invention, the particular kind of relationship may include the binary crelation or n-tuple relation between word and word.
For example, for " going " and " Sydney " two words, since word can be organized at " going to Sydney ", it may be considered that " going " and " Sydney " two words have binary crelation.And for " I ", " liking " and " Fan Bingbing " three words, since word can be organized into " I Like model ice ice ", it may be considered that " I ", " liking " and " Fan Bingbing " three words have ternary relation, and so on, when multiple It is exactly n-tuple relation when there is above-mentioned relation between word.
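Read as adjacency within a segmented sentence, binary and ternary relations correspond to runs of two or three consecutive words. The short Python helper below enumerates such runs; treating the relation as strict adjacency is an assumption, since the source only says the words can be combined into a phrase.

```python
def ngrams(tokens, n):
    """Return every run of n consecutive tokens as a tuple."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]

sentence = ["I", "like", "Fan Bingbing"]
print(ngrams(sentence, 2))  # binary relations
print(ngrams(sentence, 3))  # ternary relation
```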
To obtain the candidate words that have a binary or n-ary relation with each piece of classification information, corpus information can first be collected.
In statistical natural language processing it is practically impossible to observe language instances at full scale, so text is simply used as a substitute, and the contextual relations in the text stand in for the contextual relations of language in the real world. Text used as such a substitute can be called corpus information, and each piece of corpus information may include multiple segmented words.
For example, "Li Ming is a student" can be regarded as one piece of corpus information. It can be split into three segmented words: "Li Ming", "is" and "student".
In a specific implementation, corpus information can be collected from a particular database on the internet. For example, multiple reports from a given period can be collected from a full-text database of newspapers and periodicals, and multiple pieces of corpus information can be obtained from them; the embodiments of the present invention place no limit on this.
The proper nouns among the segmented words can then be extracted, each proper noun having corresponding classification information. The segmented words that have a binary or n-ary relation with each proper noun are then counted and taken as candidate words of the classification information corresponding to that proper noun.
For example, the name "Li Ming" can be extracted, and the candidate word that has a binary relation with the name is counted as "is". Since the classification information of "Li Ming" is the ordinary-name category, "is" can be taken as a candidate word of the ordinary-name category.
As another example, take the piece of corpus information "Liu Kaiwei is so handsome". After segmentation, it may include the two segmented words "Liu Kaiwei" and "so handsome". The proper noun "Liu Kaiwei" is extracted, and the candidate word that has a binary relation with it is counted as "so handsome". Since "Liu Kaiwei" belongs to the male celebrity name category, "so handsome" can be taken as a candidate word of the male celebrity name category.
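The counting described in the two examples above can be sketched in Python as follows. The corpus, the noun-to-category map and the function names are hypothetical, and only the binary case (the word directly after a proper noun) is covered; the source does not specify how the statistics are computed or ranked.

```python
from collections import Counter, defaultdict

# Hypothetical inputs: classified proper nouns and pre-segmented corpus sentences.
NOUN_CATEGORY = {
    "Li Ming": "ordinary name",
    "Liu Kaiwei": "male celebrity name",
}
CORPUS = [
    ["Li Ming", "is", "student"],
    ["Liu Kaiwei", "so handsome"],
]

def count_category_candidates(corpus, noun_category):
    """Count, per category, the words that directly follow a proper noun."""
    counts = defaultdict(Counter)
    for tokens in corpus:
        for prev, nxt in zip(tokens, tokens[1:]):
            category = noun_category.get(prev)
            if category is not None:
                counts[category][nxt] += 1
    return counts

def top_candidates(counts, category, k=3):
    """Return up to k of the most frequent following words for a category."""
    return [word for word, _ in counts[category].most_common(k)]

counts = count_category_candidates(CORPUS, NOUN_CATEGORY)
print(top_candidates(counts, "ordinary name"))
print(top_candidates(counts, "male celebrity name"))
```

Aggregating by category means the statistics from every name in the category are pooled, which is what lets a rarely seen name inherit candidates from frequently seen ones.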
The above are only examples. Those skilled in the art can generate different candidate words according to the specific corpus information collected; the embodiments of the present invention place no limit on this.
Sub-step 4014, generate the preset database from the multiple pieces of classification information and the candidate words that match them.
After the collected proper nouns have been classified to obtain multiple pieces of classification information, and the candidate words corresponding to each piece of classification information have been obtained by collecting corpus information, the preset database can be generated from the multiple pieces of classification information and the candidate words that match them.
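Sub-step 4014 can be pictured as merging the two intermediate results into one structure. The Python sketch below is an assumption about one possible layout (one record per category, serialized as JSON); the source does not specify the format of the preset database.

```python
import json

def build_preset_database(nouns_by_category, candidates_by_category):
    """Combine classified proper nouns and per-category candidates into one record per category."""
    database = {}
    for category, nouns in nouns_by_category.items():
        database[category] = {
            "proper_nouns": sorted(nouns),
            "candidates": list(candidates_by_category.get(category, [])),
        }
    return database

# Hypothetical outputs of sub-steps 4012 and 4013.
nouns_by_category = {
    "ordinary name": {"Zhang San", "Li Si", "Wang Wu"},
    "male celebrity name": {"Liu Kaiwei"},
}
candidates_by_category = {
    "ordinary name": ["is"],
    "male celebrity name": ["so handsome"],
}

database = build_preset_database(nouns_by_category, candidates_by_category)
serialized = json.dumps(database, ensure_ascii=False, indent=2)
print(serialized)
```

Keeping the database as plain data makes it straightforward to ship with an input method and to add or remove entries by hand.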
Step 402: obtain the text information that has been committed to the screen;
It should be noted that, in the embodiments of the present invention, the committed text information may be the Chinese character or phrase that the user most recently committed while typing. For example, when the user types "I want to go to Sydney", committing "I", "want to go to" and "Sydney" in turn, the committed text information may refer to the last committed phrase, "Sydney".
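As a small illustration of "the most recently committed phrase", the sketch below keeps a history of committed segments and returns the last one. The class `CommitHistory` and its methods are hypothetical names, not part of any input method API.

```python
class CommitHistory:
    """Track the segments a user has committed, in order."""

    def __init__(self):
        self._segments = []

    def commit(self, text):
        # Record one committed character or phrase.
        self._segments.append(text)

    def last_committed(self):
        # Return the most recent segment, or None if nothing was committed.
        return self._segments[-1] if self._segments else None


history = CommitHistory()
for segment in ["I", "want to go to", "Sydney"]:
    history.commit(segment)
latest = history.last_committed()
```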
Step 403: traverse the preset database, where the preset database includes multiple preset proper nouns and each preset proper noun has corresponding classification information;
For example, the preset database may include a place name category, a common name category, a female celebrity name category, a male celebrity name category, an athlete name category, and so on. The multiple proper nouns collected in step 401 then serve as the preset proper nouns of each category; for example, the common name category may include multiple preset common names such as "Zhang San", "Li Si" and "Wang Wu".
It should be noted that the preset proper nouns of each piece of classification information are determined by classifying the proper nouns actually collected when the preset database is generated, while the candidate words of each piece of classification information are determined from the corpus information actually collected at that time. Those skilled in the art may add or delete preset proper nouns and candidate words for each piece of classification information as actually needed; the embodiments of the present invention do not limit this.
Step 404: judge whether any preset proper noun is identical to the text information;
In a specific implementation, it may be judged whether the committed text information hits any preset proper noun in the preset database. For example, when the committed text information is "Zhang San", traversing the preset database can determine whether the preset database includes the proper noun "Zhang San".
Step 405: if so, judge that the text information is a proper noun, and obtain the classification information of the proper noun;
In the embodiments of the present invention, each preset proper noun in the preset database has corresponding classification information. Therefore, when it is determined that the preset database includes a word or phrase identical to the committed text information, the text information may be judged to be a proper noun and the classification information of that proper noun obtained.
For example, the classification information of "Zhang San" can be identified as the common name category.
As an example of the present invention, since the preset database may also include the address book information of the terminal, when the committed text information coincides with any name in the address book, the classification information of the proper noun may be identified as the common name category.
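Steps 403 to 405, together with the address book case, might look like the sketch below. It is only an illustration: the function name `classify_committed_text`, the dictionary layout, the sample place name and the choice to check the preset proper nouns before the address book are assumptions, since the text does not fix an order or a data structure.

```python
def classify_committed_text(text, preset_nouns_by_category, contacts=()):
    """Return the category of a committed proper noun, or None if no match."""
    # Step 403: traverse the preset database category by category.
    for category, nouns in preset_nouns_by_category.items():
        # Steps 404 and 405: an exact match means the text is a proper noun.
        if text in nouns:
            return category
    # Address book case: any contact name counts as a common name.
    if text in contacts:
        return "common name"
    return None


preset_nouns = {
    "common name": {"Zhang San", "Li Si", "Wang Wu"},
    "place name": {"Sydney"},
}
category_of_zhang = classify_committed_text("Zhang San", preset_nouns)
category_of_contact = classify_committed_text(
    "Zhao Liu", preset_nouns, contacts={"Zhao Liu"}
)
```

In practice a hash-based lookup from proper noun to category would avoid the loop; the traversal is kept here only to mirror the wording of step 403.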
Step 406: obtain, from the preset database, at least one candidate word matching the classification information of the proper noun;
Then, at least one candidate word matching that classification information may be obtained. For example, candidate words such as "again" and " " that match the common name category may be obtained.
Step 407: display the at least one candidate word;
After the candidate words matching the classification information of the committed proper noun are obtained, they may be presented to the user, so that the user can directly select a candidate word to commit.
As shown in Fig. 3, after the user commits the phrase "Zhang San", the phrase is judged to be a common name, so that multiple candidate words matching the common name category, such as "again" and " ", can be obtained and presented to the user.
Step 408: when an instruction by which the user selects any candidate word is received, commit that candidate word to the screen.
When the user selects one of the displayed candidate words, the input method may commit the selected candidate word, thereby reducing the user's input operations. For example, when the user selects the candidate word " " shown in Fig. 3, that candidate word may be committed.
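Steps 406 to 408 can be sketched as three small functions: fetch the candidates for a category, render them for display, and commit the one the user picks. Everything here is hypothetical illustration: the table `CANDIDATES_BY_CATEGORY`, its sample words other than "again", the `limit` parameter and the numbered text rendering are assumptions, not details from the embodiment.

```python
# Hypothetical per-category candidate lists taken from a preset database.
CANDIDATES_BY_CATEGORY = {
    "common name": ["again", "said"],
    "place name": ["is located in"],
}


def get_candidates(category, table, limit=5):
    """Step 406: fetch up to `limit` candidates for the category."""
    return table.get(category, [])[:limit]


def show_candidates(candidates):
    """Step 407: render the candidates as a numbered candidate bar."""
    return "  ".join(f"{i}. {word}" for i, word in enumerate(candidates, start=1))


def commit_selection(committed, candidates, index):
    """Step 408: append the selected candidate to the committed text."""
    if 0 <= index < len(candidates):
        return committed + [candidates[index]]
    return committed  # Ignore selections outside the candidate list.


candidates = get_candidates("common name", CANDIDATES_BY_CATEGORY)
candidate_bar = show_candidates(candidates)
screen = commit_selection(["Zhang San"], candidates, 0)
```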
By collecting multiple proper nouns to generate the preset database, the embodiments of the present invention, after obtaining the text information the user has committed, judge whether the text information is a proper noun of some classification information; if so, at least one candidate word matching that classification information is obtained and presented to the user, so that the user can directly select one of the candidate words to commit. This improves the association function of the input method and increases the user's input speed.
It should be noted that, for simplicity of description, the method embodiments are expressed as a series of action combinations, but those skilled in the art should understand that the embodiments of the present invention are not limited by the described order of actions, because according to the embodiments of the present invention some steps may be performed in other orders or simultaneously. Furthermore, those skilled in the art should also understand that the embodiments described in the specification are all preferred embodiments, and the actions involved are not necessarily required by the embodiments of the present invention.
Referring to Fig. 5, a structural block diagram of an embodiment of an association device for input method candidate words according to the present invention is shown, which may specifically include the following modules:
A text information obtaining module 501, configured to obtain the text information that has been committed to the screen;
A proper noun judgment module 502, configured to judge whether the text information is a proper noun;
A classification information identification module 503, configured to identify the classification information of the proper noun when the text information is a proper noun;
A candidate word obtaining module 504, configured to obtain at least one candidate word matching the classification information;
A candidate word display module 505, configured to display the at least one candidate word.
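The five modules can be pictured as pluggable functions wired together by one object, as in the sketch below. The class name `AssociationDevice`, its field names and the lambda stand-ins are hypothetical; the sketch only shows how modules 501 to 505 could hand data to one another, not how the device is actually built.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional


@dataclass
class AssociationDevice:
    get_text: Callable[[], str]                        # module 501
    is_proper_noun: Callable[[str], bool]              # module 502
    identify_category: Callable[[str], Optional[str]]  # module 503
    get_candidates: Callable[[str], List[str]]         # module 504
    display: Callable[[List[str]], None]               # module 505

    def run(self) -> List[str]:
        """Run the modules in order and return the displayed candidates."""
        text = self.get_text()
        if not self.is_proper_noun(text):
            return []
        category = self.identify_category(text)
        if category is None:
            return []
        candidates = self.get_candidates(category)
        self.display(candidates)
        return candidates


# Stand-in modules backed by tiny in-memory tables.
names = {"Zhang San": "common name"}
shown: List[List[str]] = []
device = AssociationDevice(
    get_text=lambda: "Zhang San",
    is_proper_noun=lambda text: text in names,
    identify_category=names.get,
    get_candidates=lambda category: ["again"] if category == "common name" else [],
    display=shown.append,
)
result = device.run()
```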
In the embodiments of the present invention, the proper noun may include a personal name and/or a place name.
In the embodiments of the present invention, the proper noun judgment module 502 may specifically include the following submodules:
A traversal submodule, configured to traverse the preset database, where the preset database may include multiple preset proper nouns and each preset proper noun has corresponding classification information;
A judging submodule, configured to judge whether any preset proper noun is identical to the text information;
An obtaining submodule, configured to, when a preset proper noun identical to the text information exists, judge that the text information is a proper noun and obtain the classification information of the proper noun.
In the embodiments of the present invention, the preset database may include the address book information of the terminal, and the classification information identification module 503 may specifically include the following submodule:
An identification submodule, configured to identify the classification information of the proper noun as the common name category when the text information coincides with any name in the address book.
In the embodiments of the present invention, each piece of classification information may include at least one candidate word matching that classification information, and the candidate word obtaining module 504 may specifically include the following submodule:
A candidate word obtaining submodule, configured to obtain, from the preset database, at least one candidate word matching the classification information of the proper noun.
In the embodiments of the present invention, the preset database may be generated by calling the following modules:
A collection module, configured to collect multiple proper nouns;
A classification module, configured to classify the multiple proper nouns to generate multiple pieces of classification information;
An obtaining module, configured to obtain the candidate words having a specific relation with each piece of classification information respectively;
A generation module, configured to generate the preset database according to the multiple pieces of classification information and the candidate words matching the classification information.
In the embodiments of the present invention, the specific relation may include a binary relation or an n-ary relation, and the obtaining module may specifically include the following submodules:
A corpus information collection submodule, configured to collect corpus information, each piece of corpus information including multiple word segments;
A proper noun extraction submodule, configured to extract the proper nouns among the multiple word segments and the classification information corresponding to each proper noun;
A candidate word counting submodule, configured to count the candidate words that have a binary relation or an n-ary relation with each proper noun among the multiple word segments, as the candidate words of the classification information corresponding to that proper noun.
In the embodiments of the present invention, the device may further include the following module:
A candidate word commit module, configured to commit the candidate word to the screen when an instruction by which the user selects any candidate word is received.
As the device embodiment is basically similar to the method embodiment, it is described relatively briefly; for related details, refer to the description of the method embodiment.
Fig. 6 is a block diagram of an association device 600 for input method candidate words according to an exemplary embodiment. For example, the device 600 may be a mobile phone, a computer, a digital broadcast terminal, a messaging device, a game console, a tablet device, a medical device, fitness equipment, a personal digital assistant, or the like.
Referring to Fig. 6, the device 600 may include one or more of the following components: a processing component 602, a memory 604, a power supply component 606, a multimedia component 608, an audio component 610, an input/output (I/O) interface 612, a sensor component 614, and a communication component 616.
The processing component 602 generally controls the overall operation of the device 600, such as operations associated with display, telephone calls, data communication, camera operation and recording operation. The processing component 602 may include one or more processors 620 to execute instructions, so as to complete all or part of the steps of the above association method for input method candidate words. In addition, the processing component 602 may include one or more modules to facilitate interaction between the processing component 602 and other components. For example, the processing component 602 may include a multimedia module to facilitate interaction between the multimedia component 608 and the processing component 602.
The memory 604 is configured to store various types of data to support operation of the device 600. Examples of such data include instructions for any application or method operating on the device 600, contact data, phone book data, messages, pictures, videos, and so on. The memory 604 may be implemented by any type of volatile or non-volatile storage device or a combination thereof, such as static random access memory (SRAM), electrically erasable programmable read-only memory (EEPROM), erasable programmable read-only memory (EPROM), programmable read-only memory (PROM), read-only memory (ROM), magnetic memory, flash memory, a magnetic disk or an optical disc.
The power supply component 606 supplies power to the various components of the device 600. The power supply component 606 may include a power management system, one or more power supplies, and other components associated with generating, managing and distributing power for the device 600.
The multimedia component 608 includes a screen that provides an output interface between the device 600 and the user. In some embodiments, the screen may include a liquid crystal display (LCD) and a touch panel (TP). If the screen includes a touch panel, the screen may be implemented as a touch screen to receive input signals from the user. The touch panel includes one or more touch sensors to sense touches, slides and gestures on the touch panel. The touch sensor may sense not only the boundary of a touch or slide action, but also the duration and pressure associated with the touch or slide operation. In some embodiments, the multimedia component 608 includes a front camera and/or a rear camera. When the device 600 is in an operating mode, such as a shooting mode or a video mode, the front camera and/or the rear camera may receive external multimedia data. Each front camera and rear camera may be a fixed optical lens system or have focusing and optical zoom capability.
The audio component 610 is configured to output and/or input audio signals. For example, the audio component 610 includes a microphone (MIC) that is configured to receive external audio signals when the device 600 is in an operating mode, such as a call mode, a recording mode or a voice recognition mode. The received audio signals may be further stored in the memory 604 or sent via the communication component 616. In some embodiments, the audio component 610 further includes a speaker for outputting audio signals.
The I/O interface 612 provides an interface between the processing component 602 and peripheral interface modules, which may be a keyboard, a click wheel, buttons, and the like. These buttons may include, but are not limited to, a home button, a volume button, a start button and a lock button.
The sensor component 614 includes one or more sensors for providing status assessments of various aspects of the device 600. For example, the sensor component 614 may detect the open/closed state of the device 600 and the relative positioning of components, for example the display and keypad of the device 600; the sensor component 614 may also detect a change in position of the device 600 or of a component of the device 600, the presence or absence of user contact with the device 600, the orientation or acceleration/deceleration of the device 600, and a change in temperature of the device 600. The sensor component 614 may include a proximity sensor configured to detect the presence of nearby objects without any physical contact. The sensor component 614 may also include an optical sensor, such as a CMOS or CCD image sensor, for use in imaging applications. In some embodiments, the sensor component 614 may further include an acceleration sensor, a gyroscope sensor, a magnetic sensor, a pressure sensor or a temperature sensor.
The communication component 616 is configured to facilitate wired or wireless communication between the device 600 and other devices. The device 600 may access a wireless network based on a communication standard, such as WiFi, 2G or 3G, or a combination thereof. In an exemplary embodiment, the communication component 616 receives a broadcast signal or broadcast-related information from an external broadcast management system via a broadcast channel. In an exemplary embodiment, the communication component 616 further includes a near-field communication (NFC) module to facilitate short-range communication. For example, the NFC module may be implemented based on radio frequency identification (RFID) technology, Infrared Data Association (IrDA) technology, ultra-wideband (UWB) technology, Bluetooth (BT) technology and other technologies.
In an exemplary embodiment, the device 600 may be implemented by one or more application-specific integrated circuits (ASICs), digital signal processors (DSPs), digital signal processing devices (DSPDs), programmable logic devices (PLDs), field-programmable gate arrays (FPGAs), controllers, microcontrollers, microprocessors or other electronic components, for executing the above association method for input method candidate words.
In an exemplary embodiment, a non-transitory computer-readable storage medium including instructions is also provided, for example the memory 604 including instructions, where the instructions can be executed by the processor 620 of the device 600 to complete the above association method for input method candidate words. For example, the non-transitory computer-readable storage medium may be a ROM, a random access memory (RAM), a CD-ROM, a magnetic tape, a floppy disk, an optical data storage device, or the like.
A non-transitory computer-readable storage medium, where instructions in the storage medium, when executed by a processor of a terminal, enable the terminal to perform the following operations:
Obtain the text information that has been committed to the screen;
Judge whether the text information is a proper noun;
If so, identify the classification information of the proper noun;
Obtain at least one candidate word matching the classification information;
Display the at least one candidate word.
The embodiments in this specification are described in a progressive manner; each embodiment focuses on its differences from the other embodiments, and the same or similar parts of the embodiments may be referred to one another.
Those skilled in the art should understand that the embodiments of the present invention may be provided as a method, a device or a computer program product. Therefore, the embodiments of the present invention may take the form of an entirely hardware embodiment, an entirely software embodiment, or an embodiment combining software and hardware. Moreover, the embodiments of the present invention may take the form of a computer program product implemented on one or more computer-usable storage media (including but not limited to magnetic disk storage, CD-ROM, optical storage, and the like) containing computer-usable program code.
The embodiments of the present invention are described with reference to flowcharts and/or block diagrams of the method, terminal device (system) and computer program product according to the embodiments of the present invention. It should be understood that each flow and/or block in the flowcharts and/or block diagrams, and combinations of flows and/or blocks in the flowcharts and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to the processor of a general-purpose computer, a special-purpose computer, an embedded processor or another programmable data processing terminal device to produce a machine, such that the instructions executed by the processor of the computer or other programmable data processing terminal device produce means for implementing the functions specified in one or more flows of the flowcharts and/or one or more blocks of the block diagrams.
These computer program instructions may also be stored in a computer-readable memory capable of directing a computer or other programmable data processing terminal device to operate in a specific manner, such that the instructions stored in the computer-readable memory produce an article of manufacture including instruction means, the instruction means implementing the functions specified in one or more flows of the flowcharts and/or one or more blocks of the block diagrams.
These computer program instructions may also be loaded onto a computer or other programmable data processing terminal device, so that a series of operation steps are executed on the computer or other programmable terminal device to produce computer-implemented processing, whereby the instructions executed on the computer or other programmable terminal device provide steps for implementing the functions specified in one or more flows of the flowcharts and/or one or more blocks of the block diagrams.
Although preferred embodiments of the present invention have been described, those skilled in the art, once they learn of the basic inventive concept, may make additional changes and modifications to these embodiments. Therefore, the appended claims are intended to be interpreted as including the preferred embodiments and all changes and modifications falling within the scope of the embodiments of the present invention.
Finally, it should also be noted that, herein, relational terms such as first and second are used merely to distinguish one entity or operation from another, and do not necessarily require or imply any actual relationship or order between these entities or operations. Moreover, the terms "include", "comprise" or any other variant thereof are intended to cover non-exclusive inclusion, so that a process, method, article or terminal device including a series of elements includes not only those elements but also other elements not explicitly listed, or elements inherent to such a process, method, article or terminal device. Without further limitation, an element defined by the phrase "including a ..." does not exclude the existence of other identical elements in the process, method, article or terminal device including that element.
The association method for input method candidate words and the association device for input method candidate words provided by the present invention have been described in detail above. Specific examples are used herein to explain the principle and implementation of the present invention, and the description of the above embodiments is only intended to help understand the method of the present invention and its core idea. Meanwhile, for those of ordinary skill in the art, there will be changes in the specific implementation and application scope according to the idea of the present invention. In summary, the contents of this specification should not be construed as limiting the present invention.

Claims (10)

1. An association method for input method candidate words, characterized by comprising:
obtaining the text information that has been committed to the screen;
judging whether the text information is a proper noun;
if so, identifying the classification information of the proper noun;
obtaining at least one candidate word matching the classification information;
displaying the at least one candidate word.
2. The method according to claim 1, characterized in that the proper noun includes a personal name and/or a place name.
3. The method according to claim 1, characterized in that the step of judging whether the text information is a proper noun comprises:
traversing a preset database, the preset database including multiple preset proper nouns, each preset proper noun having corresponding classification information;
judging whether any preset proper noun is identical to the text information;
if so, judging that the text information is a proper noun, and obtaining the classification information of the proper noun.
4. The method according to claim 3, characterized in that the preset database includes the address book information of a terminal, and the step of identifying the classification information of the proper noun comprises:
when the text information coincides with any name in the address book, identifying the classification information of the proper noun as the common name category.
5. The method according to claim 3, characterized in that each piece of classification information includes at least one candidate word matching that classification information, and the step of obtaining at least one candidate word matching the classification information comprises:
obtaining, from the preset database, at least one candidate word matching the classification information of the proper noun.
6. The method according to any one of claims 3 to 5, characterized in that the preset database is generated in the following manner:
collecting multiple proper nouns;
classifying the multiple proper nouns to generate multiple pieces of classification information;
obtaining the candidate words having a specific relation with each piece of classification information respectively;
generating the preset database according to the multiple pieces of classification information and the candidate words matching the classification information.
7. The method according to claim 6, characterized in that the specific relation includes a binary relation or an n-ary relation, and the step of obtaining the candidate words having a specific relation with each piece of classification information respectively comprises:
collecting corpus information, each piece of corpus information including multiple word segments;
extracting the proper nouns among the multiple word segments and the classification information corresponding to each proper noun;
counting the candidate words that have a binary relation or an n-ary relation with each proper noun among the multiple word segments, as the candidate words of the classification information corresponding to that proper noun.
8. The method according to claim 1, characterized by further comprising:
when an instruction by which the user selects any candidate word is received, committing that candidate word to the screen.
9. An association device for input method candidate words, characterized by comprising:
a text information obtaining module, configured to obtain the text information that has been committed to the screen;
a proper noun judgment module, configured to judge whether the text information is a proper noun;
a classification information identification module, configured to identify the classification information of the proper noun when the text information is a proper noun;
a candidate word obtaining module, configured to obtain at least one candidate word matching the classification information;
a candidate word display module, configured to display the at least one candidate word.
10. An association device for input method candidate words, characterized by comprising a memory and one or more programs, wherein the one or more programs are stored in the memory and are configured to be executed by one or more processors, the one or more programs including instructions for performing the following operations:
obtaining the text information that has been committed to the screen;
judging whether the text information is a proper noun;
if so, identifying the classification information of the proper noun;
obtaining at least one candidate word matching the classification information;
displaying the at least one candidate word.
CN201710424511.XA 2017-06-07 2017-06-07 Association method and device for candidate words of input method Active CN109002184B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710424511.XA CN109002184B (en) 2017-06-07 2017-06-07 Association method and device for candidate words of input method

Publications (2)

Publication Number Publication Date
CN109002184A true CN109002184A (en) 2018-12-14
CN109002184B CN109002184B (en) 2022-09-23

Family

ID=64573122

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710424511.XA Active CN109002184B (en) 2017-06-07 2017-06-07 Association method and device for candidate words of input method

Country Status (1)

Country Link
CN (1) CN109002184B (en)


Also Published As

Publication number Publication date
CN109002184B (en) 2022-09-23

Similar Documents

Publication Publication Date Title
CN107357779B Method and device for obtaining organization names
CN109002184A (en) A kind of association method and device of input method candidate word
CN106251869B (en) Voice processing method and device
JP2018504727A (en) Reference document recommendation method and apparatus
CN106202150B (en) Information display method and device
CN104735243B (en) Contact list displaying method and device
CN106484138B (en) A kind of input method and device
CN110147467A (en) A kind of generation method, device, mobile terminal and the storage medium of text description
CN108038102A (en) Recommendation method, apparatus, terminal and the storage medium of facial expression image
CN108509412A (en) A kind of data processing method, device, electronic equipment and storage medium
CN110390086A (en) A kind of method, apparatus and storage medium generating text
CN109582768A (en) A kind of text entry method and device
CN105447109A (en) Key word searching method and apparatus
CN111583919A (en) Information processing method, device and storage medium
JP7116088B2 (en) Speech information processing method, device, program and recording medium
CN108650543A (en) The caption editing method and device of video
CN110069143A Method, apparatus and electronic equipment for preventing erroneous correction of information
CN110019885A (en) A kind of expression data recommended method and device
CN110069624A (en) Text handling method and device
CN111739535A (en) Voice recognition method and device and electronic equipment
CN113936697B (en) Voice processing method and device for voice processing
CN113033163A (en) Data processing method and device and electronic equipment
CN108073293A Method and apparatus for determining a target phrase
CN108628461A (en) A kind of input method and device, a kind of method and apparatus of update dictionary
CN105302335B (en) Vocabulary recommends method and apparatus and computer readable storage medium

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant