CN109901725A - A kind of pinyin string cutting method and device - Google Patents

A kind of pinyin string cutting method and device

Info

Publication number
CN109901725A
Authority
CN
China
Prior art keywords
cutting
cutting result
syllable
reasonability
condition
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201711284974.7A
Other languages
Chinese (zh)
Other versions
CN109901725B (en)
Inventor
姚波怀
张扬
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Sogou Technology Development Co Ltd
Original Assignee
Beijing Sogou Technology Development Co Ltd
Application filed by Beijing Sogou Technology Development Co Ltd
Priority to CN201711284974.7A
Publication of CN109901725A
Application granted
Publication of CN109901725B
Legal status: Active


Abstract

The embodiments of the present application disclose a pinyin string cutting method and device. When multiple cutting results can be obtained from an input pinyin string, whether a cutting result meets a reasonability condition can be judged according to the input intervals between adjacent syllable segments in that cutting result. A cutting result that meets the reasonability condition is not only split along syllables but also consistent with the characteristics of the input intervals. Candidate items are determined directly from the cutting results that meet the reasonability condition, so fewer candidate items that are meaningless or unneeded relative to the user's input demand appear among the candidate items shown for the pinyin string. This reduces the time the user spends selecting a candidate item and improves the user's input experience.

Description

A kind of pinyin string cutting method and device
Technical field
The present application relates to the field of input methods, and in particular to a pinyin string cutting method and device.
Background
An input method is an encoding method used to enter various symbols into a computer or other device (such as a mobile phone). With an input method, a user can conveniently enter the required characters into an electronic device. For example, in a Chinese character input method, Chinese characters can be entered into the electronic device by typing a pinyin string.
To determine which text a pinyin string entered by the user corresponds to, the input method needs to cut the pinyin string. Each part after cutting generally corresponds to one syllable, and the parts are separated by a separator. For example, if the input pinyin string is "wom", one cutting result can be "wo'm", in which the syllables "wo" and "m" are separated by the separator "'".
However, the traditional approach cuts a pinyin string on the basis of syllables alone, and there are generally several ways to cut the same pinyin string. When the pinyin string entered by the user contains errors or is long, most of the cutting results obtained may be meaningless relative to the user's input demand. The candidate items shown for these cutting results crowd out the candidate items corresponding to valid cutting results, which makes selection harder for the user, lengthens the time needed to select a candidate item, and degrades the user's input experience.
Summary of the invention
To solve the above technical problem, the present application provides a pinyin string cutting method that reduces the number of candidate items, among those shown for an input pinyin string, that are meaningless or unneeded relative to the user's input demand, thereby reducing the time the user spends selecting a candidate item.
The embodiments of the present application disclose the following technical solutions:
In a first aspect, an embodiment of the present application provides a pinyin string cutting method, the method comprising:
cutting an acquired pinyin string to obtain multiple cutting results, each cutting result including multiple syllable segments;
judging, according to the input intervals between adjacent syllable segments in a cutting result, whether the cutting result meets a reasonability condition;
determining candidate items for the pinyin string according to the cutting results that meet the reasonability condition.
Optionally, judging whether the cutting result meets the reasonability condition according to the input intervals between adjacent syllable segments in the cutting result comprises:
judging whether the cutting result meets the reasonability condition according to both the input intervals between adjacent syllable segments in the cutting result and the number of syllable segments.
Optionally, judging whether the cutting result meets the reasonability condition according to the input intervals between adjacent syllable segments in the cutting result comprises:
obtaining historical input interval data of the user who entered the pinyin string;
judging whether the cutting result meets the reasonability condition according to the historical input interval data and the input intervals between adjacent syllable segments in the cutting result.
Optionally, judging whether the cutting result meets the reasonability condition according to the input intervals between adjacent syllable segments in the cutting result and the number of syllable segments comprises:
obtaining historical input syllable data of the user who entered the pinyin string;
judging whether the cutting result meets the reasonability condition according to the historical input syllable quantity, the input intervals between adjacent syllable segments in the cutting result, and the number of syllable segments.
Optionally, before determining the candidate items for the pinyin string according to the cutting results that meet the reasonability condition, the method further comprises:
performing error correction on the syllable segments in the cutting results that meet the reasonability condition.
Optionally, if the cutting results that meet the reasonability condition include a first cutting result and a second cutting result, determining the candidate items for the pinyin string according to the cutting results that meet the reasonability condition comprises:
ranking the candidate items for the first cutting result according to the degree to which the first cutting result satisfies the reasonability condition;
ranking the candidate items for the second cutting result according to the degree to which the second cutting result satisfies the reasonability condition;
determining the candidate items for the pinyin string and their display order according to the ranking result for the first cutting result and the ranking result for the second cutting result.
In a second aspect, an embodiment of the present application provides a pinyin string cutting device, the device comprising:
a cutting module, configured to cut an acquired pinyin string to obtain multiple cutting results, each cutting result including multiple syllable segments;
a judgment module, configured to judge, according to the input intervals between adjacent syllable segments in a cutting result, whether the cutting result meets a reasonability condition;
a determining module, configured to determine candidate items for the pinyin string according to the cutting results that meet the reasonability condition.
Optionally, the judgment module comprises:
a first judging unit, configured to judge whether the cutting result meets the reasonability condition according to both the input intervals between adjacent syllable segments in the cutting result and the number of syllable segments.
Optionally, the judgment module comprises:
a historical input interval data acquiring unit, configured to obtain historical input interval data of the user who entered the pinyin string;
a second judging unit, configured to judge whether the cutting result meets the reasonability condition according to the historical input interval data and the input intervals between adjacent syllable segments in the cutting result.
Optionally, the first judging unit comprises:
a historical input syllable data acquiring subunit, configured to obtain historical input syllable data of the user who entered the pinyin string;
a first judging subunit, configured to judge whether the cutting result meets the reasonability condition according to the historical input syllable quantity, the input intervals between adjacent syllable segments in the cutting result, and the number of syllable segments.
Optionally, the device further comprises:
a correction module, configured to perform error correction on the syllable segments in the cutting results that meet the reasonability condition.
Optionally, if the cutting results that meet the reasonability condition include a first cutting result and a second cutting result, then for determining the candidate items for the pinyin string according to the cutting results that meet the reasonability condition, the device comprises:
a first sorting module, configured to rank the candidate items for the first cutting result according to the degree to which the first cutting result satisfies the reasonability condition;
a second sorting module, configured to rank the candidate items for the second cutting result according to the degree to which the second cutting result satisfies the reasonability condition;
a candidate item determining module, configured to determine the candidate items for the pinyin string and their display order according to the ranking result for the first cutting result and the ranking result for the second cutting result.
In a third aspect, an embodiment of the present application provides a processing device for pinyin string cutting, comprising a memory and one or more programs, wherein the one or more programs are stored in the memory and are configured to be executed by one or more processors, the one or more programs including instructions for performing the following operations:
cutting an acquired pinyin string to obtain multiple cutting results, each cutting result including multiple syllable segments;
judging, according to the input intervals between adjacent syllable segments in a cutting result, whether the cutting result meets a reasonability condition;
determining candidate items for the pinyin string according to the cutting results that meet the reasonability condition.
In a fourth aspect, an embodiment of the present application provides a machine-readable medium storing instructions that, when executed by one or more processors, cause a device to perform one or more of the pinyin string cutting methods described in the first aspect.
As can be seen from the above technical solutions, when multiple cutting results can be obtained from an input pinyin string, whether a cutting result meets the reasonability condition can be judged according to the input intervals between adjacent syllable segments in that cutting result. A cutting result that meets the reasonability condition is not only split along syllables but also consistent with the characteristics of the input intervals, so cutting results that are split along syllables but whose input intervals between syllable segments are too small can be eliminated. When determining candidate items, the eliminated cutting results need not be considered, and candidate items are determined from the cutting results that meet the reasonability condition. As a result, fewer candidate items that are meaningless or unneeded relative to the user's input demand appear among the candidate items shown for the pinyin string, which reduces the time the user spends selecting a candidate item and improves the user's input experience.
Brief description of the drawings
To describe the technical solutions in the embodiments of the present application or in the prior art more clearly, the drawings needed for describing the embodiments or the prior art are briefly introduced below. Obviously, the drawings described below show only some embodiments of the present application, and a person of ordinary skill in the art can derive other drawings from them without creative effort.
Fig. 1 is a flowchart of a pinyin string cutting method provided by an embodiment of the present application;
Fig. 2 is a flowchart of a method for determining the candidate items of a pinyin string and their display order, provided by an embodiment of the present application;
Fig. 3 is a structural block diagram of a pinyin string cutting device provided by an embodiment of the present application;
Fig. 4 is a block diagram of a device for pinyin string cutting provided by an embodiment of the present application;
Fig. 5 is a block diagram of a server for pinyin string cutting provided by an embodiment of the present application.
Detailed description
The embodiments of the present application are described below with reference to the drawings.
When using an input method, a user usually enters Chinese characters into an electronic device by typing a pinyin string. To determine the Chinese characters corresponding to the pinyin string, the input method generally cuts the string in units of syllables. For example, if the user types the pinyin string "women", the input method cuts it into the two syllables "wo" and "men" and then shows the Chinese characters corresponding to each syllable as candidate items for the user.
However, when the pinyin string entered by the user contains errors or is long, cutting it on the basis of syllables alone yields many cutting results for the pinyin string. Most of these cutting results may be meaningless to the user, and the candidate items corresponding to them crowd out the candidate items corresponding to valid cutting results, so the user cannot quickly find the option that meets the input demand when selecting a candidate item.
For example, the user needs to enter the Chinese word for "we" (pinyin "women") but mistypes "women" as "womwn". Taking syllables as the basis, the input method first cuts "womwn" into "wo", "m", "w" and "n" and shows the Chinese characters corresponding to these syllables as candidate items at the front of the candidate area. The alternative cutting of "womwn" into "wo" and "mwn", with "mwn" corrected to "men", yields candidate items from the Chinese characters corresponding to the syllables "wo" and "men", but these candidate items are shown toward the back of the candidate area. As a result, the user cannot quickly find the option that meets the input demand in the candidate area, and the user experience is poor.
To solve the above problems in the prior art, the present application provides a pinyin string cutting method. When cutting a pinyin string, the method judges whether each cutting result is reasonable according to the input intervals between adjacent syllable segments in that cutting result, and then determines candidate items from the reasonable cutting results only.
Specifically, the acquired pinyin string is cut to obtain cutting results that each include multiple syllable segments. According to the input intervals between adjacent syllable segments in each cutting result, it is judged whether that cutting result meets the reasonability condition. Cutting results that do not meet the reasonability condition are eliminated, and the candidate items of the acquired pinyin string are determined from the cutting results that meet it.
In the pinyin string cutting method provided by the present application, whether each cutting result meets the reasonability condition is judged according to the input intervals between adjacent syllable segments in that cutting result. The cutting results that meet the reasonability condition are not only split along syllables but also consistent with the characteristics of the input intervals, while cutting results that are split along syllables but whose input intervals between syllable segments are too small are eliminated. Therefore, when determining candidate items, the eliminated cutting results need not be considered, and candidate items need only be determined from the cutting results that meet the reasonability condition. Accordingly, fewer candidate items that are meaningless or unneeded relative to the user's input demand appear among the candidate items shown for the pinyin string, which reduces the time the user spends selecting a candidate item and improves the user's input experience.
Embodiment 1
Referring to Fig. 1, which is a flowchart of a pinyin string cutting method provided by this embodiment, the method includes:
Step 101: cut the acquired pinyin string to obtain multiple cutting results, each cutting result including multiple syllable segments.
The pinyin string consists of all the pinyin letters typed by the user in the current input. Taking syllables as the basis, the acquired pinyin string is cut to obtain multiple cutting results, each of which includes multiple syllable segments.
For example, suppose the user types the pinyin string "women" in one input. Taking syllables as the basis, the pinyin string can be cut into multiple cutting results: one cutting result includes the syllable segments "wo" and "men", and another includes the syllable segments "wo", "me" and "n".
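Step 101 can be illustrated with a minimal sketch that enumerates every cutting of a pinyin string against a syllable table. The function name `segmentations` and the tiny `SYLLABLES` set are assumptions made for illustration; the patent does not specify how cutting results are enumerated, and a real input method would use the full pinyin syllable inventory.

```python
from typing import List, Set

# Hypothetical, deliberately tiny syllable table chosen so that the
# "women" example from the text produces exactly two cutting results.
SYLLABLES: Set[str] = {"wo", "men", "me", "n"}


def segmentations(pinyin: str, syllables: Set[str] = SYLLABLES) -> List[List[str]]:
    """Return every way to cut `pinyin` into segments found in `syllables`."""
    if not pinyin:
        return [[]]  # one way to cut the empty string: no segments
    results: List[List[str]] = []
    for end in range(1, len(pinyin) + 1):
        head = pinyin[:end]
        if head in syllables:
            # Keep this head and recurse on the remainder of the string.
            for tail in segmentations(pinyin[end:], syllables):
                results.append([head] + tail)
    return results


if __name__ == "__main__":
    for result in segmentations("women"):
        print("'".join(result))
```

The recursion is exponential in the worst case, which is acceptable for a short illustration; it also shows why long or mistyped strings produce many cutting results that need filtering.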
Step 102: judge whether a cutting result meets the reasonability condition according to the input intervals between adjacent syllable segments in the cutting result.
Step 103: determine the candidate items for the pinyin string according to the cutting results that meet the reasonability condition.
When typing, a user's habits of thought may cause the input interval between adjacent syllables to differ from the input interval between adjacent letters within a syllable. In general, the input interval between adjacent syllables is longer, and the input interval between adjacent letters within a syllable is shorter.
Therefore, the input intervals between adjacent syllable segments in each cutting result are obtained, and they are used to judge whether the intervals between syllable segments in that cutting result match the intervals the user leaves between adjacent syllables. If they match, the syllable segments in the cutting result may be the same as the syllables the user entered. If the input intervals between syllable segments in the cutting result are shorter and do not match the intervals the user leaves between adjacent syllables, the syllable segments may have been produced by splitting two adjacent letters of a single syllable entered by the user during cutting, rather than being syllables the user entered.
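One way to obtain the input intervals at segment boundaries is sketched below. It assumes that the input method records one timestamp per typed letter and that the interval between two adjacent syllable segments is the gap between the last letter of one segment and the first letter of the next. Both the per-letter timestamps and the function name `boundary_intervals` are assumptions; the patent does not state how intervals are measured.

```python
from typing import List


def boundary_intervals(segments: List[str], key_times: List[float]) -> List[float]:
    """Gap in seconds between the last key of each segment and the first key of the next.

    `key_times` holds one timestamp per typed letter, in typing order (assumed).
    """
    if sum(len(seg) for seg in segments) != len(key_times):
        raise ValueError("one timestamp is required per typed letter")
    intervals: List[float] = []
    pos = 0
    for seg in segments[:-1]:
        pos += len(seg)  # index of the first letter of the next segment
        intervals.append(key_times[pos] - key_times[pos - 1])
    return intervals


if __name__ == "__main__":
    # Illustrative timestamps for typing w, o, m, e, n.
    times = [0.0, 0.1, 0.65, 0.85, 1.05]
    print(boundary_intervals(["wo", "men"], times))
    print(boundary_intervals(["wo", "me", "n"], times))
```

With these illustrative timestamps, the cutting "wo", "men" has a single boundary gap of about 0.55 s, while "wo", "me", "n" has gaps of about 0.55 s and 0.2 s, because the second boundary falls inside what the user typed as one syllable.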
The reasonability condition serves as the basis for judging, from the input intervals between syllable segments in each cutting result, whether the syllable segments in that cutting result may be the syllables the user entered. If a cutting result meets the reasonability condition, its syllable segments may be the syllables the user entered; conversely, if a cutting result does not meet the reasonability condition, its syllable segments are unlikely to be the syllables the user entered.
In a specific implementation, the average of the input intervals between adjacent syllable segments in a cutting result can be obtained, and it is judged whether this average meets the reasonability condition. Here the reasonability condition is a condition set on the average input interval between adjacent syllable segments in light of actual conditions.
For example, suppose the reasonability condition is that the average input interval between adjacent syllable segments in a cutting result is greater than or equal to 0.5 s. Cutting the pinyin string "women" gives two cutting results: the first is "wo" and "men", and the second is "wo", "me" and "n". In the first cutting result, the input interval between the syllable segments "wo" and "men" is 0.55 s, so the average input interval for this cutting result is also 0.55 s, which meets the reasonability condition; the syllable segments in the first cutting result may therefore be the syllables the user entered. In the second cutting result, the input interval between "wo" and "me" is 0.55 s and the input interval between "me" and "n" is 0.2 s, so the average is (0.55 + 0.2) / 2 = 0.375 s, which does not meet the reasonability condition; the syllable segments in the second cutting result may therefore not be the syllables the user entered.
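The average-interval check in this example can be written as a short sketch. The 0.5 s threshold and the interval values come from the example above; the function name `is_reasonable` and the choice to accept a cutting result with no boundaries are assumptions.

```python
from statistics import mean
from typing import List

MIN_AVG_INTERVAL_S = 0.5  # threshold used in the example above


def is_reasonable(intervals: List[float], threshold: float = MIN_AVG_INTERVAL_S) -> bool:
    """True when the mean interval between adjacent syllable segments reaches the threshold."""
    if not intervals:
        # Assumption: a cutting result with a single segment has no boundary to test.
        return True
    return mean(intervals) >= threshold


if __name__ == "__main__":
    print(is_reasonable([0.55]))       # "wo" | "men"
    print(is_reasonable([0.55, 0.2]))  # "wo" | "me" | "n"
```

Under these values the first cutting result is kept and the second is eliminated, matching the averages of 0.55 s and 0.375 s worked out in the text.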
Of course, the input intervals between adjacent syllable segments in each cutting result can also be processed in other ways, and it can be judged whether the processing result for each cutting result meets the reasonability condition. In that case, the reasonability condition is a condition set for that processing method. No limitation is imposed here.
Cutting results that do not meet the reasonability condition are disregarded, and the candidate items of the acquired pinyin string are determined from the cutting results that meet it. Specifically, the Chinese characters corresponding to each syllable segment in a cutting result can be determined, it can be judged whether these characters can form a word, and the character combinations that form words are then shown as candidate items. Of course, candidate items can also be determined from the cutting results that meet the reasonability condition in other ways; no limitation is imposed here.
In the above pinyin string cutting method, whether each cutting result meets the reasonability condition is judged according to the input intervals between adjacent syllable segments in that cutting result. The cutting results that meet the reasonability condition are not only split along syllables but also consistent with the characteristics of the input intervals, while cutting results that are split along syllables but whose input intervals between syllable segments are too small are eliminated. Therefore, when determining candidate items, the eliminated cutting results need not be considered, and candidate items need only be determined from the cutting results that meet the reasonability condition. Accordingly, fewer candidate items that are meaningless or unneeded relative to the user's input demand appear among the candidate items shown for the pinyin string, which reduces the time the user spends selecting a candidate item and improves the user's input experience.
For step 102, when judging whether a cutting result meets the reasonability condition according to the input intervals between adjacent syllable segments, the number of syllable segments can also be taken into account in order to improve the accuracy of the cutting results that pass the filter. That is, whether a cutting result meets the reasonability condition is judged according to both the input intervals between adjacent syllable segments in the cutting result and the number of syllable segments.
When a pinyin string is cut, multiple cutting results can be obtained on the basis of syllables, and each cutting result includes multiple syllable segments. When judging whether each cutting result meets the reasonability condition, the number of syllable segments in the cutting result can be considered in addition to the input intervals between adjacent syllable segments. In general, when a user types with an input method, the number of syllables entered in one input falls within a certain range; the user generally does not enter too many or too few syllables at once. Therefore, when the number of syllable segments in a cutting result falls outside the range of normal syllable input quantities, the cutting result may not be the one the user needs, and accordingly its syllable segments may not be the syllables the user entered.
For example, suppose a cutting result includes 50 syllable segments. A user generally does not enter 50 syllables in one input, so the cutting result that includes 50 syllable segments can be considered not to be the cutting result the user needs.
Therefore, combining the input intervals between adjacent syllable segments in a cutting result with the number of syllable segments in that cutting result when judging whether it meets the reasonability condition makes it possible to further narrow the results down to the cutting results the user needs.
An optional method provided by this embodiment is described below. The method judges whether a cutting result meets the reasonability condition according to the input intervals between adjacent syllable segments in the cutting result and the number of syllable segments:
A first function is set that takes as its variable the average of the input intervals between adjacent syllable segments in a cutting result. The closer this average is to the input interval the user leaves between adjacent syllables, the larger the first function value for the cutting result; conversely, the more this average differs from the input interval the user leaves between adjacent syllables, the smaller the first function value for the cutting result.
A second function is set that takes as its variable the number of syllable segments in a cutting result. If the number of syllable segments is within the range of normal input syllable quantities, the second function value for the cutting result is larger; if the number of syllable segments is outside that range, the second function value is smaller, and it decreases further the more the number differs from the range.
The reasonability condition is set as follows: the sum of the first function value and the second function value for a cutting result is greater than or equal to a preset value. The first function value and the second function value for each cutting result are added to obtain the function value sum for that cutting result, and it is then judged whether this sum is greater than or equal to the preset value in the reasonability condition. If it is, the cutting result meets the reasonability condition; otherwise, it does not.
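The two-function scheme above might be sketched as follows. The patent fixes only the monotonic behaviour of the two functions and the "sum greater than or equal to a preset value" test; the concrete function shapes, the typical interval of 0.5 s, the normal range of 1 to 20 segments, and the preset value of 1.8 are all assumptions chosen for illustration.

```python
def interval_score(avg_interval: float, typical_interval: float) -> float:
    """First function (assumed shape): 1.0 at the typical interval, falling linearly to 0."""
    if typical_interval <= 0:
        raise ValueError("typical_interval must be positive")
    return max(0.0, 1.0 - abs(avg_interval - typical_interval) / typical_interval)


def count_score(n_segments: int, lo: int = 1, hi: int = 20) -> float:
    """Second function (assumed shape): 1.0 inside [lo, hi], shrinking with distance outside."""
    if lo <= n_segments <= hi:
        return 1.0
    distance = lo - n_segments if n_segments < lo else n_segments - hi
    return 1.0 / (1.0 + distance)


def meets_condition(avg_interval: float, n_segments: int,
                    typical_interval: float = 0.5, preset: float = 1.8) -> bool:
    """Reasonability condition: the sum of both function values reaches the preset value."""
    total = interval_score(avg_interval, typical_interval) + count_score(n_segments)
    return total >= preset


if __name__ == "__main__":
    print(meets_condition(0.55, 2))   # "wo" | "men"
    print(meets_condition(0.375, 3))  # "wo" | "me" | "n"
    print(meets_condition(0.5, 50))   # plausible intervals, implausible segment count
```

With these assumed parameters, a cutting result whose intervals look right but which contains 50 segments is still rejected, which is the extra filtering the segment count is meant to provide.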
Of course, the input intervals between adjacent syllable segments in a cutting result and the number of syllable segments in it can also be combined in other ways to judge whether the cutting result meets the reasonability condition; no limitation is imposed here.
Combining the input intervals between adjacent syllable segments with the number of syllable segments when judging whether each cutting result meets the reasonability condition can further improve the accuracy of cutting result filtering. Cutting results whose input intervals between syllable segments meet the required condition but whose number of syllable segments is unreasonable are further eliminated, which further reduces the candidate items that are meaningless to the user.
Because different users have different input interval habits, the user's input interval habits can also be taken into account when judging whether each cutting result meets the reasonability condition.
Specifically, the history input interval data of the user of input Pinyin string can be obtained first.Wherein, history input interval When data refer to that user is inputted using input method, input interval between adjacent syllable, for example, certain user is defeated using input method It is fashionable, it inputs among two adjacent syllables and needs to be spaced 0.3s, then the history input interval data of the user is 0.3s.And And it is directed to different users, history inputs interval data may be different.
When the specific history for obtaining user inputs interval data, the mark or input method of available input equipment is current Login account correspondingly according to the current login account of the mark of input equipment or input method, determines the use of input Pinyin string Family, and then obtain history corresponding with the user and input interval data.It is, of course, also possible to obtain going through for user using other methods History inputs interval data, does not do any restriction herein.
It should according to the input interval judgement that the history inputs in interval data and cutting result between each adjacent syllable segmentation Whether cutting result meets reasonability condition.
When different users input a pinyin string, their habitual input intervals between adjacent syllable segments may differ. If the same reasonability condition is set for all users, then, because individual input interval habits differ, the cutting results screened out under that single condition may be inaccurate, and the candidate items determined from them may not be the ones the user needs.
For example, suppose the history input interval data obtained for a user is 0.3s, and the system sets the same reasonability condition for all users: the average input interval between adjacent syllable segments in a cutting result must be greater than or equal to 0.5s. According to this user's input habit, the normal interval between adjacent syllable segments is 0.3s, which is shorter than the 0.5s in the reasonability condition. If 0.5s is used as the criterion, a cutting result that contains the syllables the user actually input may be judged not to meet the reasonability condition because its input intervals are too short. No candidate item is then determined from that cutting result, and the candidate item the user needs cannot be obtained.
To prevent this, the obtained history input interval data of the user can be combined with the input intervals between adjacent syllable segments in the cutting result when judging whether each cutting result meets the reasonability condition.
It should be noted that input interval data collected in some situations has little reference value, and history input interval data determined from it may not accurately reflect the user's input interval habit. For example, when the user types while walking, the input intervals may be longer than usual; or the user may slow down or pause during input because of an external disturbance, which also lengthens the collected input intervals. History input interval data determined from such samples cannot accurately reflect the user's input interval habit. Therefore, when the user's history input interval data is obtained, the data needs to be screened: unreasonable input interval samples are filtered out, and the user's history input interval data is determined from the remaining, relatively reasonable samples.
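The screening step above can be sketched as follows. The source does not say how unreasonable samples are identified, so the rule used here (discard samples longer than a multiple of the median) and the names `typical_interval` and `max_ratio` are assumptions made purely for illustration.

```python
from statistics import mean, median


def typical_interval(samples, max_ratio=3.0):
    """Estimate a user's habitual inter-syllable input interval in seconds.

    Samples longer than max_ratio times the median are treated as
    interruptions (walking, distraction) and discarded. The median rule
    and the default max_ratio are assumptions of this sketch.
    """
    if not samples:
        raise ValueError("no interval samples")
    cutoff = max_ratio * median(samples)
    kept = [s for s in samples if s <= cutoff]
    return mean(kept)


# Four ordinary gaps and one long pause caused by an interruption.
history = [0.3, 0.3, 0.3, 0.3, 2.5]
print(typical_interval(history))
```

With this rule the 2.5s pause is dropped and the estimate stays close to the user's ordinary 0.3s interval; a production system would likely need a more careful outlier test.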
Two optional methods provided in this embodiment for judging whether a cutting result meets the reasonability condition are described below:
In the first method, the history input interval data of the user who inputs the pinyin string is obtained, and a reasonability condition specific to that user is set according to it, or the default reasonability condition is adjusted, so as to obtain a reasonability condition that matches the user's input habit. Using this condition as the criterion, the input intervals between adjacent syllable segments in each cutting result are evaluated to judge whether each cutting result meets the reasonability condition tied to the user's input habit. Candidate items are then determined according to the cutting results that meet the reasonability condition.
For example, suppose the history input interval data obtained for a user is 0.3s; that is, when this user inputs a pinyin string, the input interval between adjacent syllable segments is generally 0.3s. For this user, the reasonability condition can be set as: the average input interval between adjacent syllable segments in a cutting result is greater than or equal to 0.3s. The average input interval between adjacent syllable segments is calculated for each cutting result; cutting results whose average is less than 0.3s are eliminated, and candidate items are determined only from cutting results whose average is greater than or equal to 0.3s.
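A minimal sketch of the first method, assuming each cutting result is represented as a pair of its syllable segments and the input intervals measured at the boundaries between them. The function names and the interval values in the example are hypothetical; only the 0.3s threshold comes from the text.

```python
def average_interval(intervals):
    """Mean input interval (seconds) between adjacent syllable segments."""
    return sum(intervals) / len(intervals)


def filter_by_user_habit(cutting_results, user_interval):
    """Keep cutting results whose average interval is >= user_interval.

    cutting_results is a list of (segments, intervals) pairs, where
    intervals holds one value per boundary between adjacent segments.
    """
    return [
        segments
        for segments, intervals in cutting_results
        if intervals and average_interval(intervals) >= user_interval
    ]


# Interval values below are made up for illustration.
results = [
    (["sou", "gou"], [0.35]),
    (["s", "ou", "g", "ou"], [0.10, 0.35, 0.10]),
]
print(filter_by_user_habit(results, user_interval=0.3))
```

Here the four-segment cutting result is eliminated because two of its boundaries fall inside what was really a single syllable, which pulls its average interval below the user's threshold.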
In the second method, after the history input interval data of the user who inputs the pinyin string is obtained, the input intervals between adjacent syllable segments are adjusted according to the difference between the history input interval data and the preset reasonability condition, and it is then judged whether the adjusted input intervals between adjacent syllable segments in the cutting result meet the reasonability condition.
For example, suppose the history input interval data obtained for a user is 0.3s, and the default reasonability condition is that the average input interval between adjacent syllable segments in a cutting result is greater than or equal to 0.5s. According to this user's input habit, the normal input interval between adjacent syllable segments is 0.3s, which is shorter than the 0.5s in the reasonability condition. The input intervals between adjacent syllable segments in each obtained cutting result can therefore be adjusted according to the difference between the user's history input interval data and the reasonability condition. Because the two values differ by 0.2s, the average input interval between adjacent syllable segments in each cutting result is increased by 0.2s, and whether each cutting result meets the reasonability condition is judged according to the adjusted average.
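The second method can be sketched as below: the preset threshold stays fixed and the measured average is shifted by the gap between the preset value and the user's habit. The function name and the sample intervals are hypothetical; the 0.3s and 0.5s figures are the ones used in the example.

```python
def meets_preset_condition(intervals, user_interval, preset_threshold=0.5):
    """Judge a cutting result against the preset threshold after adjustment.

    The measured average interval is increased by the difference between
    the preset threshold and the user's habitual interval (0.2 s in the
    example), then compared with the unchanged preset threshold.
    """
    offset = preset_threshold - user_interval
    adjusted_average = sum(intervals) / len(intervals) + offset
    return adjusted_average >= preset_threshold


# Illustrative interval values for two cutting results of the same input.
print(meets_preset_condition([0.35, 0.40], user_interval=0.3))
print(meets_preset_condition([0.10, 0.15], user_interval=0.3))
```

In exact arithmetic, shifting the average up by 0.2s and comparing with 0.5s gives the same decision as comparing the raw average with 0.3s, so under these assumptions the two methods differ in where the adjustment is applied rather than in the outcome.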
In addition, when judging whether a cutting result meets the reasonability condition, the number of syllable segments in the cutting result may also be considered. Whether the cutting result meets the reasonability condition is judged according to the input intervals between adjacent syllable segments in the cutting result, the user's history input interval data, and the number of syllable segments in the cutting result, which further improves the accuracy of the cutting results screened as meeting the reasonability condition.
Because the user's input interval habit is taken into account, the syllable segments in the cutting results screened as meeting the reasonability condition are more likely to be the syllables the user input. Correspondingly, the candidate items determined from those cutting results are more likely to be the candidate items the user needs.
In addition, in some cases the reference value of the user's input interval data may not be high. For example, if the user is doing something else while typing, the input intervals may differ from those of the user's ordinary typing: the intervals when typing while walking may differ from those of normal input, and an external disturbance may cause the user to slow down or pause, so that the intervals again differ from those of normal input. In these cases, if cutting results are screened only by the input intervals between syllable segments, the candidate items determined from the cutting results that meet the reasonability condition may not be the candidate items the user wants. Therefore, the user's habit regarding the number of syllables input can also be obtained and combined with the input intervals to judge whether each cutting result meets the reasonability condition.
Specifically, the history input syllable data of the user who inputs the pinyin string may be obtained first. History input syllable data refers to the number of syllables the user usually inputs in a single input. For example, if a user usually inputs 2 syllables in a single input, the history input syllable quantity of that user is 2. History input syllable data may differ from user to user.
To obtain the history input syllable data of a user, the identifier of the input device or the account currently logged in to the input method may be obtained. The user who inputs the pinyin string is determined from that identifier or account, and the history input syllable data corresponding to that user is then obtained. Of course, the history input syllable data of the user may also be obtained by other methods; no limitation is imposed here.
Whether the cutting result meets the reasonability condition is judged according to the history input syllable data, the input intervals between adjacent syllable segments in the cutting result, and the number of syllable segments.
The number of syllable segments in each cutting result is obtained, and the degree to which that number matches the user's habitual number of input syllables is judged according to the user's history input data. The degree to which the input intervals between adjacent syllable segments in the cutting result match the intervals between syllables in normal input is judged according to those input intervals. The two matching degrees are then combined to judge whether the cutting result meets the reasonability condition.
An optional method provided in this embodiment for judging whether a cutting result meets the reasonability condition is described below:
A third function is set in combination with the user's history input syllable data. The function takes the number of syllable segments in the cutting result as its variable, and different numbers of syllable segments yield different third function values. The closer the number of syllable segments in a cutting result is to the user's history input syllable data, the larger the third function value for that cutting result; conversely, the more the number of syllable segments differs from the user's history input syllable quantity, the smaller the third function value for that cutting result.
A fourth function is set, which takes the input intervals between adjacent syllable segments in the cutting result as its variable. The input intervals between adjacent syllable segments in the cutting result are substituted into the function. The better those intervals match the intervals between adjacent syllables in normal input, the larger the fourth function value for the cutting result; conversely, the more they differ from the intervals between adjacent syllables in normal input, the smaller the fourth function value for the cutting result.
The reasonability condition is set as: the sum of the third function value and the fourth function value is greater than or equal to a preset reasonability condition value. If the sum of the third function value and the fourth function value for a cutting result is greater than or equal to the preset reasonability condition value, the cutting result meets the reasonability condition; otherwise, it does not.
Of course, other ways may also be used to judge whether a cutting result meets the reasonability condition according to the history input syllable quantity, the input intervals between adjacent syllable segments in the cutting result, and the number of syllable segments; no limitation is imposed here.
For ease of understanding, the above method is illustrated below:
Suppose the history input syllable data obtained for a user is 2, meaning that the user usually inputs two syllables in a single input. The pinyin string "sougou" input by the user is cut, giving two cutting results: the first consists of the two syllable segments "sou" and "gou", and the second consists of the four syllable segments "s", "ou", "g" and "ou". Let the third function be g(x), where x is the number of syllable segments in a cutting result. Because the number of syllable segments in the first cutting result equals the user's history input syllable quantity, the value g(x1) for the first cutting result is relatively large, namely 200. Because the number of syllable segments in the second cutting result differs considerably from the user's history input syllable quantity, the value g(x2) for the second cutting result is relatively small, namely 50.
Let the fourth function, whose variable is the input interval between adjacent syllable segments in a cutting result, be f(y), where y is the average input interval between adjacent syllable segments in the cutting result. Substituting the average input interval of the first cutting result into f(y) gives f(y1) = 150 for the first cutting result; substituting the average input interval of the second cutting result into f(y) gives f(y2) = 70 for the second cutting result.
The preset reasonability condition is that the sum of the two functions is greater than or equal to 300. The sum for the first cutting result is 200 + 150 = 350, which meets the reasonability condition, while the sum for the second cutting result is only 50 + 70 = 120, which does not.
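The check above can be sketched in code. The source gives only the function values (200, 50, 150, 70) and the threshold 300, not the functions themselves, so the forms of `syllable_count_score` (standing in for g) and `interval_score` (standing in for f), their parameters, and the average intervals 0.3s and 0.14s are all assumptions chosen so that the sketch reproduces those values.

```python
def syllable_count_score(count, habit, peak=200.0, falloff=1.5):
    """Hypothetical g(x): largest when the segment count equals the habit."""
    return peak / (1.0 + falloff * abs(count - habit))


def interval_score(avg_interval, typical, peak=150.0):
    """Hypothetical f(y): falls linearly with relative deviation from typical."""
    deviation = abs(avg_interval - typical) / typical
    return max(0.0, peak * (1.0 - deviation))


def meets_condition(count, avg_interval, habit, typical, threshold=300.0):
    """Reasonability condition: g(x) + f(y) >= threshold."""
    total = syllable_count_score(count, habit) + interval_score(avg_interval, typical)
    return total >= threshold


# "sou" + "gou": 2 segments; assumed average interval 0.3 s.
print(meets_condition(2, 0.30, habit=2, typical=0.3))
# "s" + "ou" + "g" + "ou": 4 segments; assumed average interval 0.14 s.
print(meets_condition(4, 0.14, habit=2, typical=0.3))
```

Any pair of functions that decreases as the segment count and the interval move away from the user's habits would serve the same purpose; the particular shapes here carry no special meaning.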
Here, the user's habit regarding the number of input syllables is combined with the input intervals between adjacent syllable segments in the cutting result to judge whether each cutting result meets the reasonability condition, which further improves the accuracy of cutting result screening. Cutting results that do not match the user's input habit are eliminated, or the candidate items corresponding to cutting results that match the habit less well are placed toward the back of the display area, so that they do not act as distractors and interfere with the user selecting the needed candidate item.
In addition, before the candidate items for the pinyin string are determined according to the cutting results that meet the reasonability condition, error correction may be performed on the syllable segments in each cutting result that meets the reasonability condition.
Specifically, if the pinyin in a cutting result that meets the reasonability condition contains an input error, the syllable segments in that cutting result can be corrected. For example, if the cutting result of the pinyin string "womwn" that meets the reasonability condition includes the syllable segments "wo" and "mwn", "mwn" is corrected to "men", and candidate items are then determined according to the syllable segments "wo" and "men" in the corrected cutting result.
During error correction, cutting results that do not meet the reasonability condition need not be considered; only cutting results that meet the reasonability condition are corrected. Meaningless cutting results are therefore not corrected, which reduces the error correction workload of the system and improves error correction efficiency.
Generally, after the cutting results of a pinyin string are judged against the reasonability condition, several cutting results may all meet it. In that case, the candidate items corresponding to those cutting results can be sorted so that the user can quickly select the required one.
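The source does not describe how the correction itself is performed. One plausible approach, shown here only as an assumption, is to try replacing each letter of an invalid segment with a neighboring key and keep the variants that are valid syllables. The syllable table and the partial QWERTY neighbor map below are tiny hypothetical stand-ins.

```python
# Tiny illustrative syllable table; a real one would list every pinyin syllable.
VALID_SYLLABLES = {"wo", "men", "ni", "hao"}

# Partial QWERTY neighbor map (assumption), covering only the keys used here.
NEIGHBORS = {"w": "qeas", "e": "wrsd", "m": "njk", "n": "bmhj"}


def correct_segment(segment):
    """Return valid syllables reachable by replacing one key with a neighbor."""
    if segment in VALID_SYLLABLES:
        return [segment]
    fixes = []
    for i, ch in enumerate(segment):
        for alt in NEIGHBORS.get(ch, ""):
            candidate = segment[:i] + alt + segment[i + 1:]
            if candidate in VALID_SYLLABLES and candidate not in fixes:
                fixes.append(candidate)
    return fixes


def correct_cutting_result(segments):
    """Correct each segment, keeping the original when no fix is found."""
    corrected = []
    for seg in segments:
        fixes = correct_segment(seg)
        corrected.append(fixes[0] if fixes else seg)
    return corrected


print(correct_cutting_result(["wo", "mwn"]))
```

With a full syllable table a segment may have several plausible corrections, so a real system would need some way to rank them; this sketch simply takes the first.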
Embodiment two
Referring to Fig. 2, which is a flowchart of the method provided in this embodiment for determining the candidate items of a pinyin string and their display order. This embodiment describes the method for the case where two cutting results meet the reasonability condition. Of course, the method can equally be used to determine the candidate items and display order when more than two cutting results meet the reasonability condition.
Step 201: Sort the candidate items for the first cutting result according to the degree to which the first cutting result satisfies the reasonability condition.
Step 202: Sort the candidate items for the second cutting result according to the degree to which the second cutting result satisfies the reasonability condition.
The degree to which each cutting result satisfies the reasonability condition is obtained. If a cutting result satisfies the reasonability condition to a higher degree, its syllable segments are more likely to be the syllables the user input; conversely, if a cutting result satisfies the reasonability condition to a lower degree, its syllable segments are less likely to be the syllables the user input.
When candidate items are determined according to a cutting result that meets the reasonability condition, several candidate items may be determined for the same cutting result. In that case, the candidate items corresponding to the same cutting result can be sorted using other functions of the input method. Specifically, the candidate items can be sorted according to the user's word-formation habit, with candidate items that better match that habit placed first.
It should be noted that step 201 and step 202 are parallel steps with no fixed execution order. Step 201 may be executed before step 202, step 202 may be executed before step 201, or the two steps may be executed at the same time; no limitation is imposed here.
Step 203: Determine the candidate items for the pinyin string and their display order according to the sorting result for the first cutting result and the sorting result for the second cutting result.
The sorting results of the candidate items corresponding to each cutting result are combined to determine the candidate items for the pinyin string input by the user and their display order. In a specific implementation, both the degree to which each cutting result satisfies the reasonability condition and the influence of other input method functions on the candidate items of each cutting result are considered. From the candidate items corresponding to the cutting results that meet the reasonability condition, the candidate items to be shown in the display area are selected, and their display order in the display area is determined.
A specific optional implementation provided in this embodiment for determining the candidate items of a pinyin string and their display order is described below:
A first score is assigned to the candidate items of each cutting result according to the degree to which that cutting result satisfies the reasonability condition; different candidate items of the same cutting result have the same first score Score1. A second score Score2 is assigned to each candidate item of the same cutting result using other functions of the input method. Weights w1 and w2 are set for the first score Score1 and the second score Score2 respectively, and the two scores of each candidate item are combined by linear weighting to obtain the total score of that candidate item: Score = w1*Score1 + w2*Score2. The candidate items of the same cutting result are then sorted by total score. The weights w1 and w2 can be set according to historical experience.
The total scores of the candidate items corresponding to the cutting results that meet the reasonability condition are obtained, and the display order of all candidate items is set according to these total scores. Specifically, a candidate item with a higher total score is given an earlier position in the display order, and a candidate item with a lower total score may be eliminated or given a later position.
For ease of understanding, the above method is illustrated below:
Suppose the pinyin string input by a user is "fangan", and two of its cutting results meet the reasonability condition. The first cutting result includes the syllable segments "fang" and "an", and the second cutting result includes the syllable segments "fan" and "gan".
The candidate items for the first cutting result include "scheme" and "room is dark". Because the first cutting result satisfies the reasonability condition to a higher degree, the first score of its candidate items is 450. A second score is then assigned to each candidate item of the first cutting result according to the user's word-formation habit in the input method. Because "scheme" matches the user's word-formation habit better than "room is dark", the second score of "scheme" is 200 and the second score of "room is dark" is 10. Different weights are assigned to the two scores: the weight of the first score is 0.9 and the weight of the second score is 0.1. By linear weighting, the total score of "scheme" is 0.9*450 + 0.1*200 = 425, and the total score of "room is dark" is 0.9*450 + 0.1*10 = 406. Accordingly, for the first cutting result, "scheme" ranks ahead of "room is dark".
The candidate items for the second cutting result include "dislike" and "tired sense". Because the second cutting result satisfies the reasonability condition to a lower degree, the first score of its candidate items is 440. A second score is then assigned to each candidate item of the second cutting result according to the user's word-formation habit in the input method. Because "dislike" matches the user's word-formation habit better than "tired sense", the second score of "dislike" is 200 and the second score of "tired sense" is 50. With the same weights, the total score of "dislike" is 0.9*440 + 0.1*200 = 416, and the total score of "tired sense" is 0.9*440 + 0.1*50 = 401. Accordingly, for the second cutting result, "dislike" ranks ahead of "tired sense".
The sorting result of the candidate items for the first cutting result is combined with the sorting result of the candidate items for the second cutting result; that is, all candidate items of the two cutting results are sorted from high to low by total score to obtain the display order. From front to back, the display order is "scheme", "dislike", "room is dark", "tired sense".
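The worked example can be reproduced with a short sketch of the linear weighting Score = w1*Score1 + w2*Score2. The scores and weights are the ones given above; the data layout and the names `total_score` and `rank_candidates` are hypothetical.

```python
W1, W2 = 0.9, 0.1  # weights of the first and second scores, as in the example


def total_score(score1, score2, w1=W1, w2=W2):
    """Linear weighting of the two scores."""
    return w1 * score1 + w2 * score2


def rank_candidates(cutting_results):
    """Merge the candidates of all cutting results, highest total score first.

    cutting_results is a list of (score1, {candidate: score2}) pairs;
    score1 is shared by every candidate of the same cutting result.
    """
    scored = []
    for score1, candidates in cutting_results:
        for word, score2 in candidates.items():
            scored.append((word, total_score(score1, score2)))
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


RESULTS = [
    (450, {"scheme": 200, "room is dark": 10}),  # cut as "fang" + "an"
    (440, {"dislike": 200, "tired sense": 50}),  # cut as "fan" + "gan"
]

for word, score in rank_candidates(RESULTS):
    print(word, round(score))
```

Sorting the merged list once gives the same order as sorting each cutting result separately and then merging, since every candidate item ends up with a single comparable total score.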
In the method provided in this embodiment, among the cutting results that meet the reasonability condition, the display order of the candidate items corresponding to each cutting result is further determined according to the degree to which each cutting result satisfies the reasonability condition, so that the user can quickly find the required candidate item when selecting.
Based on the pinyin string cutting method provided in the foregoing embodiments, this embodiment provides a pinyin string cutting device. Fig. 3 shows a structural block diagram of the pinyin string cutting device, which includes:
A cutting module 301, configured to obtain multiple cutting results by cutting the acquired pinyin string, where any one cutting result includes multiple syllable segments;
A judgment module 302, configured to judge whether a cutting result meets the reasonability condition according to the input intervals between adjacent syllable segments in the cutting result;
A determining module 303, configured to determine the candidate items for the pinyin string according to the cutting results that meet the reasonability condition.
Optionally, the judgment module includes:
A first judging unit, configured to judge whether a cutting result meets the reasonability condition according to the input intervals between adjacent syllable segments in the cutting result and the number of syllable segments.
Optionally, the judgment module includes:
A history input interval data acquiring unit, configured to obtain the history input interval data of the user who inputs the pinyin string;
A second judging unit, configured to judge whether a cutting result meets the reasonability condition according to the history input interval data and the input intervals between adjacent syllable segments in the cutting result.
Optionally, the first judging unit includes:
A history input syllable data acquiring subunit, configured to obtain the history input syllable data of the user who inputs the pinyin string;
A first judging subunit, configured to judge whether a cutting result meets the reasonability condition according to the history input syllable quantity, the input intervals between adjacent syllable segments in the cutting result, and the number of syllable segments.
Optionally, the device further includes:
A correction module, configured to perform error correction on the syllable segments in the cutting results that meet the reasonability condition.
Optionally, if the cutting results that meet the reasonability condition include a first cutting result and a second cutting result, then, for determining the candidate items for the pinyin string according to the cutting results that meet the reasonability condition, the device includes:
A first sorting module, configured to sort the candidate items for the first cutting result according to the degree to which the first cutting result satisfies the reasonability condition;
A second sorting module, configured to sort the candidate items for the second cutting result according to the degree to which the second cutting result satisfies the reasonability condition;
A candidate item determining module, configured to determine the candidate items for the pinyin string and their display order according to the sorting result for the first cutting result and the sorting result for the second cutting result.
The above pinyin string cutting device judges whether each cutting result meets the reasonability condition according to the input intervals between adjacent syllable segments in that cutting result. The cutting results that meet the reasonability condition are not only cut along syllable boundaries but also match the characteristics of the input intervals, while cutting results that follow syllable boundaries but have input intervals that are too small between syllable segments are eliminated. Therefore, when candidate items are determined, the eliminated cutting results need not be considered; candidate items are determined only from the cutting results that meet the reasonability condition. Correspondingly, the candidate items displayed for the pinyin string contain fewer items that are meaningless or unneeded with respect to the user's input, which reduces the time the user spends selecting a candidate item and improves the user's input experience.
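The cooperation of the cutting module and the judgment module can be sketched end to end. Everything concrete here is an assumption for illustration: the tiny syllable table, the representation of the input as per-key timestamps in milliseconds, the choice to measure a boundary's input interval as the gap between the last key of one segment and the first key of the next, and the function names.

```python
# Hypothetical table of allowed segments, just large enough for "sougou".
SYLLABLES = {"sou", "gou", "s", "ou", "g"}


def cut(pinyin, start=0):
    """Yield every way to split pinyin[start:] into table entries."""
    if start == len(pinyin):
        yield []
        return
    for end in range(start + 1, len(pinyin) + 1):
        piece = pinyin[start:end]
        if piece in SYLLABLES:
            for rest in cut(pinyin, end):
                yield [piece] + rest


def boundary_intervals(segments, key_times_ms):
    """Gap between the last key of a segment and the first key of the next."""
    gaps, pos = [], 0
    for seg in segments[:-1]:
        pos += len(seg)
        gaps.append(key_times_ms[pos] - key_times_ms[pos - 1])
    return gaps


def reasonable_cuts(pinyin, key_times_ms, threshold_ms):
    """Keep cutting results whose average boundary gap reaches the threshold."""
    kept = []
    for segments in cut(pinyin):
        gaps = boundary_intervals(segments, key_times_ms)
        if gaps and sum(gaps) / len(gaps) >= threshold_ms:
            kept.append(segments)
    return kept


# Keys s-o-u typed quickly, a 400 ms pause, then g-o-u typed quickly.
times = [0, 80, 160, 560, 640, 720]
print(reasonable_cuts("sougou", times, threshold_ms=300))
```

A determining module would then look up candidate items only for the cutting results returned by `reasonable_cuts`, which is where the reduction in meaningless candidate items comes from.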
Fig. 4 is a block diagram of a device 400 for pinyin string cutting according to an exemplary embodiment. For example, the device 400 may be a robot, a mobile phone, a computer, a digital broadcasting terminal, a messaging device, a game console, a tablet device, a medical device, a fitness device, a personal digital assistant, or the like.
Referring to Fig. 4, the device 400 may include one or more of the following components: a processing component 402, a memory 404, a power supply component 406, a multimedia component 408, an audio component 410, an input/output (I/O) interface 412, a sensor component 414, and a communication component 416.
The processing component 402 generally controls the overall operation of the device 400, such as operations associated with display, telephone calls, data communication, camera operation, and recording. The processing component 402 may include one or more processors 420 to execute instructions so as to perform all or part of the steps of the methods described above. In addition, the processing component 402 may include one or more modules that facilitate interaction between the processing component 402 and other components. For example, the processing component 402 may include a multimedia module to facilitate interaction between the multimedia component 408 and the processing component 402.
The memory 404 is configured to store various types of data to support operation of the device 400. Examples of such data include instructions for any application or method operating on the device 400, contact data, phone book data, messages, pictures, videos, and the like. The memory 404 may be implemented by any type of volatile or non-volatile storage device or a combination thereof, such as static random access memory (SRAM), electrically erasable programmable read-only memory (EEPROM), erasable programmable read-only memory (EPROM), programmable read-only memory (PROM), read-only memory (ROM), magnetic memory, flash memory, a magnetic disk, or an optical disc.
The power supply component 406 provides power to the various components of the device 400. The power supply component 406 may include a power management system, one or more power supplies, and other components associated with generating, managing, and distributing power for the device 400.
The multimedia component 408 includes a screen that provides an output interface between the device 400 and the user. In some embodiments, the screen may include a liquid crystal display (LCD) and a touch panel (TP). If the screen includes a touch panel, the screen may be implemented as a touch screen to receive input signals from the user. The touch panel includes one or more touch sensors to sense touches, slides, and gestures on the touch panel. A touch sensor may sense not only the boundary of a touch or slide action but also the duration and pressure associated with the touch or slide operation. In some embodiments, the multimedia component 408 includes a front camera and/or a rear camera. When the device 400 is in an operating mode, such as a shooting mode or a video mode, the front camera and/or the rear camera can receive external multimedia data. Each front camera and rear camera may be a fixed optical lens system or may have focusing and optical zoom capabilities.
The audio component 410 is configured to output and/or input audio signals. For example, the audio component 410 includes a microphone (MIC), which is configured to receive external audio signals when the device 400 is in an operating mode, such as a call mode, a recording mode, or a voice recognition mode. The received audio signal may be further stored in the memory 404 or sent via the communication component 416. In some embodiments, the audio component 410 further includes a loudspeaker for outputting audio signals.
The I/O interface 412 provides an interface between the processing component 402 and peripheral interface modules, which may be a keyboard, a click wheel, buttons, and the like. The buttons may include, but are not limited to, a home button, a volume button, a start button, and a lock button.
The sensor component 414 includes one or more sensors for providing status assessments of various aspects of the device 400. For example, the sensor component 414 can detect the open/closed state of the device 400 and the relative positioning of components, such as the display and keypad of the device 400. The sensor component 414 can also detect a change in position of the device 400 or of a component of the device 400, the presence or absence of user contact with the device 400, the orientation or acceleration/deceleration of the device 400, and a change in temperature of the device 400. The sensor component 414 may include a proximity sensor configured to detect the presence of nearby objects without any physical contact. The sensor component 414 may also include an optical sensor, such as a CMOS or CCD image sensor, for use in imaging applications. In some embodiments, the sensor component 414 may further include an acceleration sensor, a gyroscope sensor, a magnetic sensor, a pressure sensor, or a temperature sensor.
The communication component 416 is configured to facilitate wired or wireless communication between the device 400 and other equipment. The device 400 can access a wireless network based on a communication standard, such as WiFi, 2G, or 3G, or a combination thereof. In one exemplary embodiment, the communication component 416 receives a broadcast signal or broadcast-related information from an external broadcast management system via a broadcast channel. In one exemplary embodiment, the communication component 416 further includes a near-field communication (NFC) module to facilitate short-range communication. For example, the NFC module may be implemented based on radio frequency identification (RFID) technology, Infrared Data Association (IrDA) technology, ultra-wideband (UWB) technology, Bluetooth (BT) technology, and other technologies.
In an exemplary embodiment, the device 400 may be implemented by one or more application-specific integrated circuits (ASIC), digital signal processors (DSP), digital signal processing devices (DSPD), programmable logic devices (PLD), field-programmable gate arrays (FPGA), controllers, microcontrollers, microprocessors, or other electronic components, for executing the method described above.
In an exemplary embodiment, a non-transitory computer-readable storage medium including instructions is also provided, for example the memory 404 including instructions, where the instructions can be executed by the processor 420 of the device 400 to complete the method described above. For example, the non-transitory computer-readable storage medium may be a ROM, a random access memory (RAM), a CD-ROM, a magnetic tape, a floppy disk, an optical data storage device, or the like.
A non-transitory computer-readable storage medium, where instructions in the storage medium, when executed by a processor of a mobile terminal, enable the mobile terminal to perform a pinyin string cutting method, the method comprising:
Obtaining multiple cutting results by cutting the acquired pinyin string, where each cutting result includes multiple syllable segments;
Judging whether a cutting result meets a reasonability condition according to the input intervals between adjacent syllable segments in the cutting result;
Determining candidate items for the pinyin string according to the cutting results that meet the reasonability condition.
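The three steps above can be sketched end to end as follows. This is a hedged illustration only: the toy `SYLLABLES` set and `LEXICON` table, the function names, and the fixed `min_gap_ms` threshold are hypothetical and are not taken from the patent, which does not fix a concrete form for the reasonability condition in this section.

```python
from typing import Dict, List, Sequence, Tuple

# Toy data for illustration only; a real input method would use a full
# syllable table and lexicon.
SYLLABLES = {"fan", "fang", "gan", "an"}
LEXICON: Dict[Tuple[str, ...], List[str]] = {
    ("fang", "an"): ["方案"],
    ("fan", "gan"): ["反感"],
}


def cut_pinyin(pinyin: str) -> List[List[str]]:
    """Step 1: enumerate every way to split the string into known syllables."""
    if not pinyin:
        return [[]]
    results: List[List[str]] = []
    for end in range(1, len(pinyin) + 1):
        head = pinyin[:end]
        if head in SYLLABLES:
            results.extend([head] + tail for tail in cut_pinyin(pinyin[end:]))
    return results


def is_reasonable(segments: Sequence[str], key_times_ms: Sequence[int],
                  min_gap_ms: int) -> bool:
    """Step 2 (assumed condition): every boundary interval is at least min_gap_ms."""
    pos = 0
    for segment in segments[:-1]:
        pos += len(segment)  # index of the first letter of the next segment
        if key_times_ms[pos] - key_times_ms[pos - 1] < min_gap_ms:
            return False
    return True


def candidates_for(pinyin: str, key_times_ms: Sequence[int],
                   min_gap_ms: int = 150) -> List[str]:
    """Step 3: collect candidate items only from reasonable cutting results."""
    candidates: List[str] = []
    for cutting in cut_pinyin(pinyin):
        if is_reasonable(cutting, key_times_ms, min_gap_ms):
            candidates.extend(LEXICON.get(tuple(cutting), []))
    return candidates
```

With per-letter timestamps `[0, 80, 160, 240, 600, 680]` for "fangan", only the cutting "fang'an" survives, so only its candidate item is offered. A fixed threshold is the simplest possible choice; with uniformly fast typing it would reject every cutting, so a practical system would need a fallback or a relative condition.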
Fig. 5 is a schematic structural diagram of a server in an embodiment of the present invention. The server 500 may vary considerably depending on configuration or performance, and may include one or more central processing units (CPU) 522 (for example, one or more processors), a memory 532, and one or more storage media 530 (for example, one or more mass storage devices) storing application programs 542 or data 544. The memory 532 and the storage medium 530 may provide transient or persistent storage. A program stored in the storage medium 530 may include one or more modules (not shown in the figure), and each module may include a series of instruction operations on the server. Further, the central processing unit 522 may be configured to communicate with the storage medium 530 and to execute, on the server 500, the series of instruction operations in the storage medium 530.
The server 500 may also include one or more power supplies 524, one or more wired or wireless network interfaces 550, one or more input/output interfaces 558, one or more keyboards 554, and/or one or more operating systems 541, such as Windows Server™, Mac OS X™, Unix™, Linux™, FreeBSD™, and the like.
Those of ordinary skill in the art will understand that all or some of the steps of the above method embodiments may be implemented by hardware under the control of program instructions. The program may be stored in a computer-readable storage medium and, when executed, performs the steps of the above method embodiments. The storage medium may be at least one of the following media capable of storing program code: a read-only memory (ROM), a RAM, a magnetic disk, an optical disc, and the like.
It should be noted that the embodiments in this specification are described in a progressive manner. For identical or similar parts, the embodiments may be referred to one another, and each embodiment focuses on its differences from the others. In particular, the device and system embodiments are described relatively briefly because they are substantially similar to the method embodiments; for relevant details, refer to the description of the method embodiments. The device and system embodiments described above are merely illustrative. Units described as separate components may or may not be physically separate, and components shown as units may or may not be physical units; that is, they may be located in one place or distributed across multiple network units. Some or all of the modules may be selected according to actual needs to achieve the purpose of the solution of an embodiment. Those of ordinary skill in the art can understand and implement the embodiments without creative effort.
The above is only one specific embodiment of the present application, but the protection scope of the present application is not limited thereto. Any change or substitution that a person skilled in the art could readily conceive within the technical scope disclosed in the present application shall fall within the protection scope of the present application. Therefore, the protection scope of the present application shall be subject to the protection scope of the claims.

Claims (10)

1. A pinyin string cutting method, characterized in that the method comprises:
Obtaining multiple cutting results by cutting the acquired pinyin string, where each cutting result includes multiple syllable segments;
Judging whether a cutting result meets a reasonability condition according to the input intervals between adjacent syllable segments in the cutting result;
Determining candidate items for the pinyin string according to the cutting results that meet the reasonability condition.
2. The method according to claim 1, characterized in that the judging whether a cutting result meets a reasonability condition according to the input intervals between adjacent syllable segments in the cutting result comprises:
Judging whether the cutting result meets the reasonability condition according to the input intervals between adjacent syllable segments in the cutting result and the number of syllable segments.
3. The method according to claim 1, characterized in that the judging whether a cutting result meets a reasonability condition according to the input intervals between adjacent syllable segments in the cutting result comprises:
Obtaining historical input interval data of the user who inputs the pinyin string;
Judging whether the cutting result meets the reasonability condition according to the historical input interval data and the input intervals between adjacent syllable segments in the cutting result.
4. The method according to claim 2, characterized in that the judging whether the cutting result meets the reasonability condition according to the input intervals between adjacent syllable segments in the cutting result and the number of syllable segments comprises:
Obtaining historical input syllable data of the user who inputs the pinyin string;
Judging whether the cutting result meets the reasonability condition according to the historical input syllable quantity, the input intervals between adjacent syllable segments in the cutting result, and the number of syllable segments.
5. The method according to claim 1, characterized in that, before the determining candidate items for the pinyin string according to the cutting results that meet the reasonability condition, the method further comprises:
Performing error correction on the syllable segments in the cutting results that meet the reasonability condition.
6. The method according to claim 1, characterized in that, if the cutting results that meet the reasonability condition include a first cutting result and a second cutting result, the determining candidate items for the pinyin string according to the cutting results that meet the reasonability condition comprises:
Sorting the candidate items for the first cutting result according to the degree to which the first cutting result satisfies the reasonability condition;
Sorting the candidate items for the second cutting result according to the degree to which the second cutting result satisfies the reasonability condition;
Determining the candidate items for the pinyin string and their display order according to the sorting result for the first cutting result and the sorting result for the second cutting result.
7. A pinyin string cutting device, characterized in that the device comprises:
A cutting module, configured to obtain multiple cutting results by cutting the acquired pinyin string, where each cutting result includes multiple syllable segments;
A judgment module, configured to judge whether a cutting result meets a reasonability condition according to the input intervals between adjacent syllable segments in the cutting result;
A determining module, configured to determine candidate items for the pinyin string according to the cutting results that meet the reasonability condition.
8. The device according to claim 7, characterized in that the judgment module comprises:
A first judging unit, configured to judge whether the cutting result meets the reasonability condition according to the input intervals between adjacent syllable segments in the cutting result and the number of syllable segments.
9. A processing device for pinyin string cutting, characterized by comprising a memory and one or more programs, where the one or more programs are stored in the memory and are configured to be executed by one or more processors, the one or more programs including instructions for performing the following operations:
Obtaining multiple cutting results by cutting the acquired pinyin string, where each cutting result includes multiple syllable segments;
Judging whether a cutting result meets a reasonability condition according to the input intervals between adjacent syllable segments in the cutting result;
Determining candidate items for the pinyin string according to the cutting results that meet the reasonability condition.
10. A machine-readable medium having instructions stored thereon which, when executed by one or more processors, cause a device to perform the pinyin string cutting method according to one or more of claims 1 to 6.
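As a hedged illustration of how claims 3 and 6 might fit together, the sketch below derives a per-user threshold from historical input intervals and orders candidate items by how strongly their cutting result satisfies the condition. Everything concrete here is an assumption and does not come from the claims: the median-times-factor rule in `personal_threshold_ms`, the ratio used as the "satisfaction degree", and the merge into a single display order.

```python
from statistics import median
from typing import List, Sequence, Tuple


def personal_threshold_ms(history_gaps_ms: Sequence[int], factor: float = 1.5) -> float:
    """Hypothetical rule: a boundary gap should be at least `factor` times
    the user's median historical keystroke gap."""
    return factor * median(history_gaps_ms)


def satisfaction(segments: Sequence[str], key_times_ms: Sequence[int],
                 threshold_ms: float) -> float:
    """Assumed satisfaction degree: smallest boundary gap divided by the
    threshold. A value of 1.0 or more means the condition is met."""
    pos, gaps = 0, []
    for segment in segments[:-1]:
        pos += len(segment)  # index of the first letter of the next segment
        gaps.append(key_times_ms[pos] - key_times_ms[pos - 1])
    return min(gaps) / threshold_ms if gaps else 1.0


def rank_candidates(cuttings: Sequence[Tuple[Sequence[str], Sequence[str]]],
                    key_times_ms: Sequence[int], threshold_ms: float) -> List[str]:
    """Order candidate items by the satisfaction degree of their cutting result,
    skipping cutting results that do not meet the condition."""
    scored = []
    for segments, candidates in cuttings:
        score = satisfaction(segments, key_times_ms, threshold_ms)
        if score >= 1.0:
            scored.extend((score, candidate) for candidate in candidates)
    scored.sort(key=lambda pair: pair[0], reverse=True)  # stable sort
    return [candidate for _, candidate in scored]


threshold = personal_threshold_ms([90, 100, 110, 100, 120])  # median 100 -> 150.0
ranked = rank_candidates(
    [(["fan", "gan"], ["反感"]), (["fang", "an"], ["方案"])],
    key_times_ms=[0, 80, 160, 400, 900, 980],
    threshold_ms=threshold,
)
print(ranked)  # ['方案', '反感']
```

Both cutting results pass here (gaps of 240 ms and 500 ms against a 150 ms threshold), but the candidate for "fang'an" is shown first because its boundary coincides with the longer pause.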
CN201711284974.7A 2017-12-07 2017-12-07 Pinyin string segmentation method and device Active CN109901725B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201711284974.7A CN109901725B (en) 2017-12-07 2017-12-07 Pinyin string segmentation method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201711284974.7A CN109901725B (en) 2017-12-07 2017-12-07 Pinyin string segmentation method and device

Publications (2)

Publication Number Publication Date
CN109901725A true CN109901725A (en) 2019-06-18
CN109901725B CN109901725B (en) 2022-05-06

Family

ID=66939205

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201711284974.7A Active CN109901725B (en) 2017-12-07 2017-12-07 Pinyin string segmentation method and device

Country Status (1)

Country Link
CN (1) CN109901725B (en)

Citations (18)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101075262A (en) * 2007-06-12 2007-11-21 腾讯科技(深圳)有限公司 Method and system for inputting Chinese character by computer
CN101644961A (en) * 2009-08-14 2010-02-10 北京搜狗科技发展有限公司 Encoded string sequencing method, device and character input method and device
CN102200839A (en) * 2010-03-25 2011-09-28 阿里巴巴集团控股有限公司 Method and system for processing pinyin string in process of inputting Chinese characters
CN102566775A (en) * 2010-12-31 2012-07-11 上海量明科技发展有限公司 Input method and system for generating character interval
CN102866783A (en) * 2011-07-06 2013-01-09 哈尔滨工业大学 Syncopation method of Chinese phonetic string and system thereof
CN102866782A (en) * 2011-07-06 2013-01-09 哈尔滨工业大学 Input method and input method system for improving sentence generating efficiency
CN102955770A (en) * 2011-08-17 2013-03-06 腾讯科技(深圳)有限公司 Method and system for automatic recognition of pinyin
CN103201708A (en) * 2010-12-16 2013-07-10 佐竹靖彦 Input method for Chinese language electronic devices
CN104252484A (en) * 2013-06-28 2014-12-31 重庆新媒农信科技有限公司 Pinyin error correction method and system
US20150025877A1 (en) * 2013-07-19 2015-01-22 Kabushiki Kaisha Toshiba Character input device, character input method, and computer program product
CN104345896A (en) * 2013-07-31 2015-02-11 淘宝(中国)软件有限公司 Alphabetic writing word group inputting method and alphabetic writing word group inputting system
CN104423621A (en) * 2013-08-22 2015-03-18 北京搜狗科技发展有限公司 Pinyin string processing method and device
CN104516522A (en) * 2013-09-29 2015-04-15 北京三星通信技术研究有限公司 Input method and device of nine-rectangle-grid keyboard
US20150269137A1 (en) * 2014-03-19 2015-09-24 Baidu Online Network Technology (Beijing) Co., Ltd Input method and system
CN105335415A (en) * 2014-08-04 2016-02-17 北京搜狗科技发展有限公司 Search method based on input prediction, and input method system
CN105843414A (en) * 2015-01-13 2016-08-10 北京搜狗科技发展有限公司 Input correction method for input method and input method device
CN106484132A (en) * 2015-09-02 2017-03-08 北京搜狗科技发展有限公司 A kind of input error correction method and input subtraction unit
CN106484131A (en) * 2015-09-02 2017-03-08 北京搜狗科技发展有限公司 A kind of input error correction method and input subtraction unit

Also Published As

Publication number Publication date
CN109901725B (en) 2022-05-06

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant