CN101354749B

CN101354749B - Method for making dictionary, hand-written input method and apparatus

Info

Publication number: CN101354749B
Application number: CN 200710130196
Authority: CN
Inventors: 沈利; 吴波; 吴亚栋
Original assignee: Sharp Corp
Current assignee: Sharp Corp
Priority date: 2007-07-24
Filing date: 2007-07-24
Publication date: 2013-01-09
Anticipated expiration: 2027-07-24
Also published as: CN101354749A

Abstract

The invention discloses a method for manufacturing a dictionary, a handwriting input method and equipment. The method for manufacturing the dictionary can predict a character input by strokes so as to relieve the burden of a user. The method comprises the following steps: the feature of the integral character of an integral character sample of characters and the feature of partial strokes of a partial stroke sample of the characters with the number of the strokes larger than a preset value are extracted; and a learning algorithm by a machine is used to carry out learning to the feature of the integral character and the feature of the partial strokes to generate an integral character template and/or a partial stroke template of the characters, which are used as items of the dictionary. The system has the advantages of simple structure, low hardware requirement, rapid identification speed, high identification rate, and so on, and can be realized in an embedded type system and so on.

Description

Dictionary method for making, hand-written inputting method and equipment

Technical field

The present invention relates to the literal input, be specifically related to a kind of dictionary method for making, hand-written inputting method and equipment that is applied to electronic product, the literal that the stroke prediction that can enoughly input will be inputted is in order to alleviate user's burden.

Background technology

For a long time, the quick input of non-alphabetic character is puzzlement people's a difficult problem such as Chinese character, because the keyboard of computing machine is applicable to the input of western language such as English, and is not suitable for the input of Chinese character.

The input of Chinese character is divided into two kinds of keyboard input and handwriting inputs usually.The keyboard input is to give certain coding according to certain coding rule to each Chinese character, identifies Chinese character by the keyboard input coding, for example various spelling input methods and five-stroke character input method.Handwriting input is identified Chinese character by the person's handwriting that the identification people directly write, thereby reaches the purpose of input Chinese character.Because the keyboard input needs the user skillfully to use keyboard and/or keeps various codes or the coding rule of each Chinese character firmly in mind, can't promote the use of in the people that seldom use keyboard.In addition, because the cause of dialect, a lot of people can write certain Chinese character, really can't accurately read its pronunciation, and this is so that spelling input method has run into the difficulty that is difficult to overcome in actual use.

Because do not need the user to carry out the study of input method before using, hand-writing input method has obtained seldom using the people's of keyboard welcome.In principle, handwriting input does not need the user to carry out any study, as long as he can write Chinese character.

Patent documentation 1 (US6028959) has disclosed a kind of method that adopts the stroke forecasting techniques to input Chinese character.According to the stroke of Chinese ideograph of having write, before whole Chinese character writes, the just measurable Chinese character that goes out will write, thus greatly improved the speed of handwriting input.Particularly, the method for patent documentation 1 has adopted time-delay neural network (TDNN) to carry out the stroke prediction in conjunction with the mode of multi-Layer Perceptron Neural Network (MLP).

Patent documentation 2 (Unexamined Patent 2005-25566) has disclosed a kind of method of handwriting input Chinese character, the handwritten stroke centralized stores of wherein input part being inputted is in storage part, with comprising the coordinate feature in the storage part, mating to the searching object information of measure feature and graphic feature etc. and the dictionary that creates in advance, use to comprise the various method for mode matching generation forecast candidates such as OCR, DP.Then, the candidate selection portion literal that selection will be inputted from the candidate of prediction.Input part with the input stroke set as a whole image process, can reduce user's burden.

But the hand-written inputting method prediction steps of patent documentation 1 is too complicated, it adopted comprise two classes totally 68 neural networks relate to many parameters of 5M, cause forecasting process very complicated.Do not need to write complete word although the method for patent documentation 2 has provided, just can want by the coupling prediction method of input results, this invention is processed the integral body of the stroke of input as image, cause inefficiency.

Summary of the invention

In view of the problem of prior art, finished the present invention.The purpose of this invention is to provide a kind of fully dictionary creating method and apparatus, hand-written inputting method and the equipment of innovation, can predict the literal that to input by the stroke of writing, in order to alleviate user's burden.

In a first aspect of the present invention, a kind of method of making dictionary has been proposed, comprise step: extract the whole printed words whole word feature originally of literal, and stroke number is greater than the part stroke feature of the part stroke sample of the literal of predetermined value; And by with machine learning algorithm described whole word feature and described part stroke feature being learnt whole character matrix plate and/or the part stroke template of generating character, as the project in the dictionary.

In a second aspect of the present invention, a kind of equipment of making dictionary has been proposed, comprising: extract the whole printed words whole word feature originally of literal, and stroke number is greater than the device of the part stroke feature of the part stroke sample of the literal of predetermined value; And by with machine learning algorithm described whole word feature and described part stroke feature being learnt whole character matrix plate and/or the part stroke template of generating character, as the device of the project in the dictionary.

In a third aspect of the present invention, a kind of hand-written inputting method has been proposed, comprise the feature of at least part of handwriting of step extraction literal; And calculate described feature and the dictionary that creates according to the method for above-mentioned first aspect in template between distance; And will be apart from the literal of the template representative of less at least one as recognition result.

In a fourth aspect of the present invention, a kind of handwriting input device has been proposed, comprising: the device that extracts at least part of handwriting feature of literal; The device of the distance between the template in the dictionary that calculates described feature and create according to the method for above-mentioned first aspect; And will be apart from the literal of the template representative of less at least one device as recognition result.

The present invention proposes complete printed words basis and part stroke printed words originally, and made on this basis recognition dictionary, by mating with this recognition dictionary, hand-written user is input characters not exclusively, automatic Prediction goes out the literal candidate that will input, reduces writer's burden.That native system has is simple in structure, hardware requirement is low, recognition speed is fast, and the discrimination advantages of higher can realize in embedded system etc.

In addition, in the manufacturing process of recognition dictionary, the feature of extracting and the order of strokes observed in calligraphy of handwriting, connect pen and stroke number is irrelevant, thereby make the user when writing, break away from the order of strokes observed in calligraphy, connect the restriction of pen and stroke number.

In addition, by dimensionality reduction and quantification, both greatly reduce the required internal memory of recognition dictionary, realized miniaturization, reduced again the calculated amount in the identifying, avoided floating-point operation, improved recognition speed, be conducive to the realization of high speed.

In addition, in identifying, adopted the sectional type fast matching method, step by step filtering candidate item is dwindled comparison range, in the situation that affect hardly discrimination, greatly improve recognition speed, finally ensured the realization of hand script Chinese input equipment character identification system high speed.

In addition, the tabulation of ten candidate words provides a kind of more friendly operation interface in conjunction with the background prompting mode, has avoided the frequent transfer of user's sight line between input field and candidate regions, reduce so on the one hand user's working strength, improved again on the other hand handwriting input speed.

In addition, automatically send the word mode by the self-adaptation non-timed, system can write according to user's writing style and institute, adjusts intelligently the interval stand-by period between word and the word, provides a kind of more humane control mode, also so that handwriting input is more efficient.

Description of drawings

By below in conjunction with description of drawings the preferred embodiments of the present invention, will make of the present invention above-mentioned and other objects, features and advantages are clearer, wherein:

Fig. 1 shows the functional block diagram according to the handwriting input device of the embodiment of the invention;

Fig. 2 is the automatic generation of part stroke printed words basis and the schematic diagram of whole word class and part stroke word class;

Fig. 3 is the process flow diagram of describing according to the dictionary method for making of the embodiment of the invention;

Fig. 4 is the schematic diagram that is described in the equidistant re-sampling operations of carrying out in the preprocessing process;

Fig. 5 is the schematic diagram of the barycenter normalization carried out in preprocessing process and non-linear normalizing operation;

Fig. 6 is the schematic diagram of describing the process of extracting the stroke direction distribution characteristics;

Fig. 7 is the schematic diagram of describing the process of extracting the grid stroke feature;

Fig. 8 is the schematic diagram of describing the process of extracting the peripheral direction feature;

Fig. 9 describes the schematic diagram utilize the process that GLVQ learns;

Figure 10 is the detail flowchart according to the identifying in the hand-written inputting method of the embodiment of the invention;

Figure 11 is the schematic diagram of describing the Rapid matching process; And

Figure 12 is that the tabulation of ten candidate words is in conjunction with the schematic diagram of first-selected word background prompting.

Embodiment

To a preferred embodiment of the present invention will be described in detail, having omitted in the description process is unnecessary details and function for the present invention with reference to the accompanying drawings, obscures to prevent that the understanding of the present invention from causing.

Fig. 1 shows the functional block diagram according to the handwriting input device of the embodiment of the invention.As shown in Figure 1, comprise according to the handwriting input device of the embodiment of the invention: handwriting input unit 110, be used for gathering user's person's handwriting, and to its digitizing, as input person's handwriting signal; Handwriting storage unit 120 is used for the input person's handwriting signal that storage handwriting input unit 110 produces; Identification prediction unit 130 is used for extracting from input person's handwriting signal the feature of this signal, the feature of the some or all of stroke of certain word for example, and itself and the template in the dictionary that is pre-created mated, produce the input candidate according to matching degree; Indicative control unit 150, when the user is by handwriting input unit 110 input strokes, show person's handwriting at hand-written display unit 160, present to the user, on the other hand, the candidate item that Identification display predicting unit 130 produces on hand-written display unit 160, the candidate item of ten literal that will input the most approaching for example arranging according to degree of closeness; Candidate item selected cell 140, then the literal that selection will be inputted from ten candidate item under user's operation is shown to the user by hand-written display unit 160.

The below describes the constructive process of the above-mentioned dictionary of mentioning in detail, namely generates process for the template of each literal by machine learning method from hand-written literal sample.

According to the present invention, the handwriting samples of each literal is divided into two classes.One class is called whole word class, and the sample that belongs to such possesses whole strokes, is a complete word.Reasons such as " even pens " can not require that the sample that belongs in such has identical stroke number when considering character calligraph.The another kind of part stroke word class that then is called, the sample stroke disappearance in such is the word that does not write.Some literal, especially single character, such as " people ", " soil " etc., stroke number own seldom, generating portion stroke word class no longer just.On the other hand, a predetermined value STH can be set to the stroke of literal, for example the STH value is 4.Stroke number only has whole word class sample less than or equal to 4 literal, and other literal has two kinds of handwriting samples, i.e. whole word class sample and part stroke word class sample.Fig. 2 is the automatic generation of part stroke printed words basis and the schematic diagram of whole word class and part stroke word class.

As shown in Figure 2, write literal by different users, and record its person's handwriting.The complete person's handwriting of a literal is called aforesaid ' whole word class sample '.By whole word class sample is begun one by one stroke removal from last stroke, obtain aforesaid ' part stroke word class sample '.Row ' ' shown in the left side of Fig. 2 are exactly the whole word class sample of this literal, and the person's handwriting shown in the right side of Fig. 2 is exactly the part stroke word class sample of this literal, and they are to form by remove the part stroke from the whole word class sample in left side.As mentioned above, the stroke number of part stroke word class sample is more than or equal to a predetermined value, and for example 4.

Fig. 3 is the process flow diagram of describing according to the dictionary method for making of the embodiment of the invention.As shown in Figure 3, at step S110, gather each literal, the person's handwriting that produces when namely being write by the user is as the sample set that generates dictionary.At step S120, the sample storage of each literal of collection is the whole word class sample of this literal.

Then, at step S130, judge that whether number of strokes at each literal is greater than predetermined value STH.If stroke number, then means this literal greater than STH and can have whole word class sample and part stroke word class sample.Like this, at step S140, from the whole word class sample generating portion stroke word class sample of this literal, and at step S150, the part stroke word class sample of this literal and this literal and whole word class sample are stored explicitly.

If the stroke number of this literal, then means this literal less than or equal to STH and only have whole word class sample, flow process forwards step S160 to.

At step S160, carry out pre-service for whole word class sample and/or the part stroke word class sample of each literal, operations such as equidistant level and smooth, barycenter normalization and non-linear normalizing is so that so that the feature of this sample becomes regular.

Equidistant smooth operation is that the sampled point to handwriting samples resamples, and makes it the interval even, and Fig. 4 is the schematic diagram that is described in the equidistant re-sampling operations of carrying out in the preprocessing process.

As shown in Figure 4, in order to reduce the operand of smoothing processing, reduce significantly the number of sampled point by equidistant resampling.The original input coordinate sequence of (A) expression of Fig. 4, the result of sampling when it is grade.Therefore, the local sampled point slow in writing speed is very intensive, even has many apart to be zero sampled point, to be called rest point.

Can delete all rest points by equidistant resampling, and reduce significantly the number of sampled point.The equidistant process that resamples is as follows:

Make the starting point P of stroke ₀Be resample points, calculate P ₁And P ₀Distance.If distance is then left out P less than the resampling interval T ₁, continue to calculate P ₂And P ₀Distance.Otherwise, keep P ₁As resample points, continue to calculate P ₂And P ₁Distance, by that analogy.

Because original sample point is very intensive, the resampling interval T is generally much larger than the interval of original sample point.Therefore, needn't consider that the interval of original sample point needs to replenish the situation of sampled point greater than T.(B) of Fig. 4 shows the result of equidistant resampling.As can be seen from the figure, need data volume to be processed greatly to reduce.

Barycenter normalization operation is that sample size is regular to predetermined size, and the center (W/2, H/2) of for example W * H, and barycenter and external square frame overlaps.The normalized purpose of barycenter is to adjust position, the size of input characters, makes it consistent with mode standard in the recognition dictionary, and the geometric center of the boundary rectangle frame of this literal is overlapped with the barycenter of literal.Fig. 5 (A) shows original handwriting samples, and (B) of Fig. 5 shows the schematic diagram of the barycenter normalization process of carrying out in preprocessing process.The height of bidding quasi-mode is H, and wide is W.The height of input characters is h, and wide is w.The computing formula of barycenter is:

\{\begin{matrix} x_{0} = Σ_{k = 1}^{n - 1} (\frac{x_{k} + x_{k + 1}}{2} \cdot \sqrt{{(x_{k} - x_{k + 1})}^{2} + {(y_{k} - y_{k + 1})}^{2}}) \\ y_{0} = Σ_{k = 1}^{n - 1} (\frac{y_{k} + y_{k + 1}}{2} \cdot \sqrt{{(x_{k} - x_{k + 1})}^{2} + {(y_{k} - y_{k + 1})}^{2}}) \end{matrix} \cdot \cdot \cdot (1)

The normalized computing formula of barycenter is:

x^{'} = \{\begin{matrix} x \cdot \frac{W}{2} / x_{0} & if x < x_{0} \\ \frac{W}{2} + (x - x_{0}) \cdot \frac{W}{2} / (w - x_{0}) & if x &GreaterEqual; x_{0} \end{matrix} \cdot \cdot \cdot (2)

y^{'} = \{\begin{matrix} y \cdot \frac{H}{2} / y_{0} & if y < y_{0} \\ \frac{H}{2} + (y - y_{0}) \cdot \frac{H}{2} / (h - y_{0}) & if y &GreaterEqual; y_{0} \end{matrix} \cdot \cdot \cdot (3)

But barycenter normalization operation can only be corrected centroid motion, and it is helpless to the problem out of proportion of the partial structurtes of input characters.

The non-linear normalizing operation can make the interval of sample stroke even.(C) of Fig. 5 shows the schematic diagram of non-linear normalizing operation.Dynamic scaling is then adopted in non-linear normalization in a literal, determine different scalings according to the factors such as stroke distribution density degree of the different parts of input characters.The non-linear normalizing method that is based on dot density that the present embodiment adopts is divided cloth density according to the calculating pen reciprocal of the projection of stroke pixel, dynamically adjusts scaling with this.The local suitably amplification that the stroke distribution density is high, low density place suitably dwindles.The stroke of the literal after adjusting like this distributes and will be tending towards even.Non-linear normalization can more effectively reduce the literal deformation extent, weakens the font difference of different writing styles generations with discrete, can effectively improve the discrimination of hand-written regular literal.

Make the bianry image of f (i, j) expression one hand-written literal, i=1,2 ..., I, j=1,2 ..., J.

H (i) and V (j) are respectively f (i, j) projection function in the horizontal and vertical directions.

H (i) = Σ_{j = 1}^{J} f (i, j)

V (j) = Σ_{i = 1}^{I} f (i, j)

Make g (s, t) be the result behind f (i, the j) non-linear normalizing, s=1,2 ..., S, t=1,2 ..., T, its computing formula is as follows:

s = Σ_{k = 1}^{i} H (k) \times \frac{S}{Σ_{k = 1}^{I} H (k)} \cdot \cdot \cdot (4)

t = Σ_{l = 1}^{j} V (l) \times \frac{T}{Σ_{l = 1}^{J} V (l)} \cdot \cdot \cdot (5)

After pre-treatment step, at step S170, from sample, extract the M dimensional feature: can comprise stroke direction distribution characteristics M ₁Dimension, grid stroke feature M ₂Dimension, peripheral direction feature M ₃The dimension stroke direction, and with irrelevant other features of stroke number, the order of strokes observed in calligraphy.

Fig. 6 is the schematic diagram of describing the process of extracting the stroke direction distribution characteristics.Shown in Fig. 6 (A), the grid that first literal is divided into n * n, calculate each section in each grid to the projection of 8 directions, the projected length of each grid on 8 directions shown in Fig. 6 (B) namely consisted of the stroke direction distribution characteristics of this grid.

Fig. 7 is the schematic diagram of describing the process of extracting the grid stroke feature.The grid that first literal is divided into n * n calculates the shared area of pen section in each grid, namely stroke point count with, just be the grid stroke feature of this grid.

Make bianry image

Then

The grid stroke feature that represents this grid.

Fig. 8 is the schematic diagram of describing the process of extracting the peripheral direction feature.First literal is vertically carried out the n five equilibrium.The length that from left to right searches along horizontal scanning line till half of first stain or literal width is called the left side search length, the average left side search length of the horizontal scanning line of each piecemeal such as grade is the horizontal contour feature in left side of these piecemeals divided by half of literal width, shown in Fig. 8 (A).Similarly, average right side search length can be calculated the horizontal contour feature in right side divided by the literal width.Equally, shown in Fig. 8 (B), literal along horizontal direction n five equilibrium, is calculated upper vertical contour feature and downside vertically profiling feature.

Three kinds of features are connected into following M dimension union feature vector successively

X＝[x ₁，…x _M1，x _M1+1，…，x _M1+M2，x _M1+M2+1，…，x _M1+M2+M3]

M=M wherein ₁+ M ₂+ M ₃

Take Fig. 2 as example, can obtain the complete word characteristic set { X of " " word _{Whole 1}..., X _{Whole 5}And the partial words characteristic set { X of " " word _{Section 1}..., X _{Section 20}}

At step S180, dimensionality reduction and quantification treatment: adopt the KL conversion, the dimension of proper vector is down to the N dimension from the M dimension, again the proper vector behind KL transformation matrix and the dimensionality reduction is quantized, represent its element with WORD (16bits) type and BYTE (8bits) type variable respectively.

The KL conversion is chosen N feature in can selecting from the M dimensional vector, former vectorial dimensionality reduction is become a N dimensional vector (N＜M), and keeping as much as possible in the vector originally to the classification Useful Information.

At first, supposing needs K character of identification now, and the capacity of namely supposing this recognition dictionary is K character, owing to having produced part stroke word class, (K≤W≤2K), the prior probability of all kinds of appearance is P (ω to total W classification _i), i=1,2 ..., W.With X _iExpression is from the M dimensional vector of i class, then the autocorrelation matrix R of i class cluster _iFor

R_{i} = E {X_{i} X_{i}^{T}} \cdot \cdot \cdot (6)

The autocorrelation matrix R of mixed distribution is

R = E {X X^{T}} = Σ_{i = 1}^{W} P (ω_{i}) R_{i} = Σ_{i = 1}^{W} P (ω_{i}) E {X_{i} X_{i}^{T}} \cdot \cdot \cdot (7)

Be that R is the statistical average of all kinds of autocorrelation matrixes.

Secondly, obtain eigenvectors matrix Φ and the eigenvalue matrix Λ of R,

Φ＝[Φ ₁ Φ ₂…Φ _M] …(9)

And require eigenwert to arrange descendingly:

λ ₁≥λ ₂≥…≥λ _M …(10)

Difference character pair vector Φ ₁, Φ ₂..., Φ _M

The 3rd, get top n proper vector Φ _i(i=1,2 ..., N), consist of transformation matrix A

A = {[\begin{matrix} Φ_{1}^{T} \\ Φ_{2}^{T} \\ \cdot \\ \cdot \\ \cdot \\ Φ_{N}^{T} \end{matrix}]}_{N \times M} \cdot \cdot \cdot (11)

Get conversion Y=AX, Y is the N dimensional vector again.

The characteristic set of explanation through obtaining behind the dimensionality reduction as an example of Fig. 2 example, the complete word characteristic set of " " word is { Y _{Whole 1}..., Y _{Whole 5}, each Y wherein _i=[y ₁, y ₂Y _N], the partial words characteristic set for " " word is { Y equally _{Section 1}..., Y _{Section 20}, each Y wherein _i=[y ₁, y ₂Y _N].

Then, matrix A and vectorial Y are quantized:

The quantification of transformation matrix A

Transformation matrix A is a floating-point matrix,

A = [\begin{matrix} a_{11} & a_{12} & \cdot \cdot \cdot & a_{1 M} \\ a_{21} & a_{22} & \cdot \cdot \cdot & a_{2 M} \\ \cdot & \cdot & \cdot \\ \cdot & \cdot & \cdot \\ \cdot & \cdot & \cdot \\ a_{N 1} & a_{N 2} & \cdot \cdot \cdot & a_{NM} \end{matrix}] \cdot \cdot \cdot (12)

Make Q=max (| a _Ij|), K=32767, then quantitative formula is as follows:

a _ij′＝round(a _ij×K/Q) …(13)

Wherein, round () is the round function.Like this, a _Ij' can represent with a WORD (16bits) type variable.

The quantification of new feature vector Y

New feature behind KL conversion dimensionality reduction vector Y, its quantification manner and front similar, difference be Y this as fixed point vector, each element that quantizes rear Y represents with BYTE (8bits) type variable:

Y_{N \times 1} = A_{N \times M} X_{M \times 1} = [\begin{matrix} y_{1} \\ y_{2} \\ \cdot \\ \cdot \\ \cdot \\ y_{N} \end{matrix}] \cdot \cdot \cdot (14)

Template in the recognition dictionary can by the whole word class sampling feature vectors after dimensionality reduction and the quantification and part stroke class sampling feature vectors are carried out respectively the generation of GLVQ Algorithm for Training, be the proper vector T that whole word class sample and part stroke printed words originally generate respectively a N dimension _iAs template, T _i=[t ₁, t ₂T _N].Take Fig. 2 as example, 5 complete word characteristic sets generate a complete character matrix plate through after training, and 20 part stroke word characteristic sets generate a partial words class template through after training.

At step S190, GLVQ (the Generalized Learning VectorQuantization) algorithm after utilization improves is learnt the handwriting samples collection after dimensionality reduction and quantification treatment and is trained, and finally is that each classification generates a template.

If X is a handwritten word sample, P _iAnd P _jBe two templates in the recognition dictionary, and X and P _iBelong to a class, X and P _jDo not belong to a class, P _i' and P _j' be the new template after upgrading, its training formula is as follows:

P_{i}^{'} = P_{i} + α l_{k} (1 - l_{k}) \frac{D_{j}}{{(D_{i} + D_{j})}^{2}} (X - P_{i}) \cdot \cdot \cdot (15)

P_{j}^{'} = P_{j} - α l_{k} (1 - l_{k}) \frac{D_{i}}{{(D_{i} + D_{j})}^{2}} (X - P_{j}) \cdot \cdot \cdot (16)

Wherein, α is learning rate,

D _i＝‖X-P _i‖ …(17)

D _j＝‖X-P _j‖ …(18)

μ_{k} = \frac{D_{i} - D_{j}}{D_{i} + D_{j}} \cdot \cdot \cdot (19)

l_{k} = l_{k} (μ_{k}) = \frac{1}{1 + e^{- μ_{k}}} \cdot \cdot \cdot (20)

D _iAnd D _jBe the matching distance of two patterns, can adopt Euclidean distance, mahalanobis distance and other various distance measures.P _iAnd P _jInitial value can be made as its separately geometric center of class.Take turns circulation, P through this _iClose with X, and P _jPushed away far.All classifications are repeated this process, until the discrimination of training set sample reaches certain level.

Fig. 9 describes the schematic diagram utilize the process that GLVQ learns.In Fig. 9, comprised whole word class and the part stroke word class of " woods " word, and the whole word class of " wood " word.Because the sample of whole word class is compared with part stroke word class sample, the differences between samples less that belongs to such, therefore the feature of the whole word class sample write of different people is comparatively approaching, distribution in the space is comparatively concentrated, shown in the broken circle of Fig. 9, and the differences between samples in the part stroke word class sample of ' woods ' word is larger, so the distribution of part stroke word class sample in the space that different people is write comparatively disperse, shown in the dotted ellipse among Fig. 9.As can be seen from Figure 9, when ' woods ' word had just begun to write, when being write as ' wood ', ' wood ' of its feature and complete word class approached; And work as ' woods ' word when writing soon, its feature approaches with ' woods ' word as complete word class.Therefore, the sample characteristics of part stroke word class distributes and comparatively disperses, and the sample characteristics of whole word class distributes then comparatively concentrated.Accordingly, we have done improvement to the GLVQ of standard.If X is for belonging to whole word class, learning rate is α ₁If X is for belonging to part stroke word class, learning rate then adopts α ₂It satisfies α ₁＞α ₂, in actual applications, get α ₁=4 α ₂Like this can accelerating convergence.

Among Fig. 9, black circle represents all kinds of cluster centres, and it is produced by the GLVQ Algorithm Learning.In when identification, calculate the matching distance of input sample and each cluster centre, apart from reckling as recognition result.

Like this, for each literal generates one or two template, represent respectively complete word class and part stroke word class, finally finish the making of recognition dictionary at step S200.

Although above with the GLVQ arthmetic statement automatic learning process, this only is for purposes of illustration, but not wants to limit the present invention.The present invention also can adopt other machine learning algorithm, SOM (Self-Organizing Maps) network for example, LVQ (Learning VectorQuantization) algorithm, and the modified LVQ1 of LVQ algorithm, LVQ2 and LVQ3 etc.

For the recognition dictionary with forecast function that can identify K character, it comprises W template, and it satisfies K≤W≤2K.Making the proper vector dimension behind the dimensionality reduction is N, and then recognition dictionary is that W N dimensional feature vector is connected in series the one-dimension array that forms from beginning to end:

Dictionary＝[T ₁…T _W]′＝[t ₁₁…t _1N…t _W1…t _WN]′ …(21)

Characteristics of the present invention are miniaturizations, and for the single mode plate dictionary without forecast function that can identify K character, its template number is K, and the proper vector dimension behind the dimensionality reduction is N, and then the size of recognition dictionary is KN (bytes).Make K=7000, N=64, then dictionary size is 448000bytes.

And for the dictionary that creates according to the method for the present embodiment that can identify equally K character, then its template number is W (K≤W≤2K).This moment, the size of recognition dictionary was WN (bytes).Therefore, under the worst case, the data volume of the recognition dictionary that the method for the present embodiment creates can increase to original 2 times.But because the data volume of original dictionary is less, so the data volume of new dictionary is also little.Still with K=7000, N=64 is example, and the data volume maximum of new dictionary is no more than 896000bytes.

Below in conjunction with Figure 10 process according to the hand-written inputting method of the embodiment of the invention is described.Figure 10 is the detail flowchart according to the identifying in the hand-written inputting method of the embodiment of the invention.

As shown in figure 10, at step S310, the user is by the stroke of handwriting input unit 110 certain literal of input.Then, at step S320, stored by handwriting storage unit 120, and be presented on the hand-written display unit 160 by indicative control unit 150.

Be stored in the operation of the stroke execution in step S330～S360 in identification prediction unit 130 in the handwriting storage unit 120.

In view of step S330 basic identical to the operation of the operation of S350 and the step S in the above-mentioned dictionary creating method 160 to S180, so no longer these steps are elaborated here.

At step S360, carry out the feature of input stroke and the Rapid matching of the template in the dictionary.Figure 11 is the schematic diagram of describing the Rapid matching process.Be located at step S350 and adopt the N dimensional feature vector that generates after the KL conversion, each is tieed up contained quantity of information and successively decreases successively.Therefore, the N dimensional feature vector can be divided into the d section, every section comprises respectively N ₁, N ₂..., N _dIndividual element, and satisfy following formula

N＝N ₁+N ₂+…+N _d …(22)

In practice, the segmentation majority meets following condition

N ₁≤N ₂≤…≤N _d …(23)

First round screening: choose first paragraph N ₁Individual element participates in coupling, calculates the characteristic distance between each template in sample to be identified and the recognition dictionary, and threshold value TH1 is set, and keeps the template sustainable participation next round screening greater than threshold value.In practice, threshold value is made as the intermediate value of all distances.Adopt distance measure identical when training with GLVQ herein.

Second takes turns screening: choose second segment N ₂Individual element also participates in coupling, i.e. (N ₁+ N ₂) individual element, calculate the characteristic distance between the remaining template of sample to be identified and last round of screening, threshold value TH2 is set, keep the template sustainable participation next round screening greater than threshold value.

……

Last takes turns screening: choose last remaining N _dIndividual element participates in coupling, and namely whole N elements calculate the characteristic distance between the remaining template of sample to be identified and last round of screening, and minimum 10 are final TOP10 recognition result.

At step S370, upgrade original recognition result, above-mentioned TOP10 recognition result is presented on the hand-written display unit 160, select for the user.

At step S380, if the user has selected certain candidate item among the TOP10 by candidate item selected cell 140, then finished the input of this literal.Otherwise flow process forwards step S310 to, repeats above-mentioned operating process.

Figure 12 is that the tabulation of ten candidate words is in conjunction with the schematic diagram of first-selected word background prompting.As shown in figure 12, inputting interface roughly is divided into Three regions, the zone in left side is used for showing the TOP10 candidate item, select for the user, middle zone is to write the district, shown the in real time stroke of input of user on it, and shown current recognition result in the background of the stroke of inputting, as the prompting in the input process.If the user certain word has been write when a part of and just have been identified this word, then this word need not have been write and just finish input operation.Be the instruction area on the right side of inputting interface, wherein be provided with a plurality of function keys, in input process, carry out other operations such as editor for the user.

In addition, in identifying, can set user's ' lift pen ' and finish after the input, identification prediction unit 130 is after waiting for the appropriate time, automatically the first candidate item is sent to indicative control unit 150, be presented in the candidate item viewing area of hand-written display unit 160.Whether the single character factor is adjusted these time to be entered according to user's writing style and the literal of input in identification prediction unit 130.

In addition, in the superincumbent description, only generated a template for the part stroke word class sample of certain literal, as shown in Figure 2.But also can generate two or more the part stroke word class templates with different priorities to the part stroke word class sample evidence stroke number of this literal, further improve predictive ability.

So far invention has been described in conjunction with the preferred embodiments.Should be appreciated that, those skilled in the art are in the situation that break away from the spirit and scope of the present invention, can carry out various other change, replacement and interpolations.Therefore, scope of the present invention is not limited to above-mentioned specific embodiment, and should be limited by claims.

Claims

1. method of making dictionary comprises step:

Extract the whole printed words whole word feature originally of literal, and stroke number is greater than the part stroke feature of the part stroke sample of the literal of predetermined value; And

By with machine learning algorithm described whole word feature and described part stroke feature being learnt whole character matrix plate and/or the part stroke template of generating character, as the project in the dictionary;

Wherein, described machine learning algorithm comprises GLVQ algorithm, SOM network, LVQ algorithm, LVQ1 algorithm, LVQ2 algorithm and LVQ3 algorithm one of at least, and described machine learning algorithm is four times of the learning rate that adopts for described part stroke sample for this learning rate that adopts of whole printed words.

2. the method for claim 1, wherein said whole word feature and part stroke feature all are the M dimensions, and wherein M is natural number, and described extraction step comprises:

Described M is tieed up whole word feature drop to N dimension, wherein N is that natural number and M are greater than N.

3. the method for claim 1, wherein said whole word feature and described part stroke feature are with stroke number, connect the irrelevant feature of pen and the order of strokes observed in calligraphy.

4. method according to claim 1 also comprises step:

Before extraction step, described whole printed words basis and described part stroke sample are equidistantly resampled, so that the distance between the sampled point of the stroke of sample equates substantially.

5. method according to claim 1 also comprises step:

Before extraction step, with the size of adjusted size for being scheduled to of described whole printed words basis and described part stroke sample, so that the center superposition of the rectangle of the barycenter of sample and preliminary dimension.

6. method according to claim 1 also comprises step:

Before extraction step, divide cloth density according to the calculating pen reciprocal of the projection of stroke pixel, dynamically adjust the scaling of sample.

7. method according to claim 1, wherein said part stroke sample generates from described whole word stroke sample.

8. method according to claim 3, wherein said and stroke number, connect pen and the irrelevant feature of the order of strokes observed in calligraphy comprise following one of at least: stroke direction distribution characteristics, grid stroke feature and peripheral direction feature.

9. method according to claim 2 also comprises step:

Be integer with each Quantification of elements of N dimensional feature.

10. method according to claim 2 wherein drops to the N dimension by the KL conversion with described M dimensional feature.

11. an equipment of making dictionary comprises:

Extract the whole printed words whole word feature originally of literal, and stroke number is greater than the device of the part stroke feature of the part stroke sample of the literal of predetermined value; And

By with machine learning algorithm described whole word feature and described part stroke feature being learnt whole character matrix plate and/or the part stroke template of generating character, as the device of the project in the dictionary;

12. equipment as claimed in claim 11, wherein said whole word feature and part stroke feature all are the M dimensions, wherein M is natural number, and described extraction element comprises:

Described M is tieed up the device that whole word feature drops to N dimension, and wherein N is that natural number and M are greater than N.

13. equipment as claimed in claim 11, wherein said whole word feature and described part stroke feature are the features irrelevant with stroke number, company's pen and the order of strokes observed in calligraphy.

14. a hand-written inputting method comprises step:

Extract the feature of at least part of handwriting of literal; And

Calculate the distance between the template in the dictionary that described feature and method according to claim 1 create; And

Will be apart from the literal of the template representative of less at least one as recognition result.

15. hand-written inputting method according to claim 14, wherein said feature are M dimensions, M is natural number, and described hand-written inputting method also comprises: described M dimensional feature is dropped to the N dimensional feature, and wherein N is that natural number and M are greater than N; And the N dimensional feature vector that will represent described N dimensional feature is divided into a plurality of sections, wherein carries out piecemeal described calculation procedure.

16. hand-written inputting method as claimed in claim 14, wherein said feature are the features irrelevant with stroke number, company's pen and the order of strokes observed in calligraphy.

17. hand-written inputting method according to claim 14 also comprises step:

Before extraction step, described at least part of handwriting is equidistantly resampled, so that the distance between the sampled point of the stroke of at least part of handwriting equates substantially.

18. hand-written inputting method according to claim 14 also comprises step:

Before extraction step, with the size of adjusted size for being scheduled to of described at least part of handwriting, so that the center superposition of the rectangle of the barycenter of described at least part of handwriting and preliminary dimension.

19. hand-written inputting method according to claim 14 also comprises step:

Before extraction step, divide cloth density according to the calculating pen reciprocal of the projection of stroke pixel, dynamically adjust the scaling of described at least part of handwriting.

20. hand-written inputting method according to claim 16, wherein said and stroke number, connect pen and the irrelevant feature of the order of strokes observed in calligraphy comprise following one of at least: stroke direction distribution characteristics, grid stroke feature and peripheral direction feature.

21. hand-written inputting method according to claim 15 also comprises step:

Be integer with each Quantification of elements of N dimensional feature.

22. hand-written inputting method according to claim 15 wherein drops to the N dimension by the KL conversion with described M dimensional feature.

23. hand-written inputting method according to claim 14 also comprises step:

With described at least part of handwriting recognition result of range of a signal minimum synchronously.

24. a handwriting input device comprises:

Extract the device of at least part of handwriting feature of literal, and

Calculate the device of the distance between the template in the dictionary that described feature and method according to claim 1 create; And

Will be apart from the literal of the template representative of less at least one device as recognition result.

25. handwriting input device as claimed in claim 24, wherein said feature are M dimensions, M is natural number, and described handwriting input device also comprises: described M dimensional feature is dropped to the device of N dimensional feature, wherein N is that natural number and M are greater than N; And will represent that the N dimensional feature vector of described N dimensional feature is divided into a plurality of sections device, wherein carry out piecemeal described calculating.

26. handwriting input device as claimed in claim 24, wherein said feature are the features irrelevant with stroke number, company's pen and the order of strokes observed in calligraphy.

27. handwriting input device according to claim 24 also comprises:

Before extracting feature, described at least part of handwriting is equidistantly resampled, so that the device that the distance between the sampled point of the stroke of described at least part of handwriting equates substantially.

28. handwriting input device according to claim 24 also comprises:

Before extracting feature, with the size of adjusted size for being scheduled to of described at least part of handwriting, so that the device of the center superposition of the rectangle of the barycenter of described at least part of handwriting and preliminary dimension.

29. handwriting input device according to claim 24 also comprises:

Before extracting feature, divide cloth density according to the calculating pen reciprocal of the projection of stroke pixel, dynamically adjust the device of the scaling of described at least part of handwriting.

30. handwriting input device according to claim 26, wherein said and stroke number, connect pen and the irrelevant feature of the order of strokes observed in calligraphy comprise following one of at least: stroke direction distribution characteristics, grid stroke feature and peripheral direction feature.

31. handwriting input device according to claim 25 also comprises:

Be the device of integer with each Quantification of elements of N dimensional feature.

32. handwriting input device according to claim 25 wherein drops to the N dimension by the KL conversion with described M dimensional feature.

33. handwriting input device according to claim 24 also comprises:

With described at least part of handwriting device of the recognition result of range of a signal minimum synchronously.