TW200937308A - Method and apparatus for constructing HMM models for recognizing online eastern asian characters - Google Patents

Method and apparatus for constructing HMM models for recognizing online eastern asian characters Download PDF

Info

Publication number
TW200937308A
TW200937308A TW97106872A TW97106872A TW200937308A TW 200937308 A TW200937308 A TW 200937308A TW 97106872 A TW97106872 A TW 97106872A TW 97106872 A TW97106872 A TW 97106872A TW 200937308 A TW200937308 A TW 200937308A
Authority
TW
Taiwan
Prior art keywords
state
hmm
topology
stroke
handwritten
Prior art date
Application number
TW97106872A
Other languages
Chinese (zh)
Inventor
Shi Han
Yu Zou
Ming Chang
Peng Liu
Yi-Jian Wu
Lei Ma
Frank Soong
Dongmei Zhang
Jian Wang
Original Assignee
Microsoft Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Microsoft Corp filed Critical Microsoft Corp
Priority to TW97106872A priority Critical patent/TW200937308A/en
Publication of TW200937308A publication Critical patent/TW200937308A/en

Links

Landscapes

  • Character Discrimination (AREA)

Abstract

The present invention provides a scheme for designing a HMM model for recognizing online East-Asian characters and a method for constructing such a HMM model. With full consideration of the features of East-Asian characters such as the large number of strokes, the diversity of writing order, the complexity of structure, the diversity of writing styles, and the uncertainty between strokes, the present invention addresses the above issues by introducing turning states, providing multi-paths and parallel state in HMM topology. The present invention also reduces data quantity and operation complexity by clustering and merging.

Description

200937308 九、發明說明: 【發明所屬之技術領域】 本發明涉及手寫字元識別技術,更具體地說,涉及手 寫東亞字元的識別技術。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a handwriting recognition technique, and more particularly to a recognition technique for writing East Asian characters.

V ^ 【先前技術】 手寫輸入識別一直是電腦應用技術中的一個重要的研 © 究方向。與靜態的圖像或者文字識別不同的是,在線手寫 輸入識別是一個時間隨機過程,而不是.一個靜態的物件。 因此,在在線手寫識別領域,隱性馬爾可夫(HMM )模型 經常被使用。 首先簡單介紹一下HMM模型,HMM模型即隱性馬爾 可夫模型,是馬爾可夫模型的一種。馬爾可夫模型可以用 來預測未來某一事件出現某種狀態的概率,而這種概率將 僅僅基於該事件當前的狀態。馬爾可夫模型表現爲一個有 限狀態自動機,狀態之間可以互相轉換,並且,每一次將 ¥ 從一個狀態轉換到下一個狀態(可能是其他的狀態,也可 能是該狀態本身)。 - 馬爾可夫模型可以分爲兩種,一種稱之爲顯性馬爾可 夫模型,在顯性馬爾可夫模型中,狀態之間的轉換順序是 v 已知的。 另一種稱爲隱性馬爾可夫模型(HMM ),其中狀態之間 的轉換順序是未知的,所知道的僅僅是狀態之間的轉移概 6 200937308 率。因此,HMM模型可以被定義爲具有如下的特徵: 1 )是一個有限狀態自動機,狀態之間可以互相轉換, 並且,每一次將從一個狀態轉換到下一個狀態(可能是其 他的狀態,也可能是該狀態本身); 2 )狀態之間的轉移由一組轉移概率決定,一組觀測事 件(觀測序列)的出現概率由與狀態相關的轉移概率決定。V ^ [Prior Art] Handwriting input recognition has always been an important research direction in computer application technology. Unlike static image or text recognition, online handwriting input recognition is a time-random process, not a static object. Therefore, in the field of online handwriting recognition, Hidden Markov (HMM) models are often used. First, a brief introduction to the HMM model, the HMM model is a recessive Markov model, which is a kind of Markov model. The Markov model can be used to predict the probability that a certain event will occur in a future state, and this probability will be based only on the current state of the event. The Markov model behaves as a finite state automaton, states can be converted to each other, and each time ¥ is switched from one state to the next (may be other states, it may be the state itself). - The Markov model can be divided into two types, one called the dominant Markov model. In the dominant Markov model, the order of transition between states is known. The other is called the hidden Markov model (HMM), in which the order of transition between states is unknown, and what is known is only the transition between states. Therefore, the HMM model can be defined as having the following characteristics: 1) is a finite state automaton, states can be converted to each other, and each time from one state to the next state (may be other states, also It may be the state itself); 2) The transition between states is determined by a set of transition probabilities, and the probability of occurrence of a set of observed events (observation sequences) is determined by the state-dependent transition probabilities.

比如,舉一個例子說明 HMM模型。有一台飲料售貨 機,提供兩種飲料,分別是可樂和茶。這個飲料售貨機會 具有兩種狀態,“偏好可樂”的狀態和“偏好茶”的狀 態,當投入一牧硬幣購買飲料時,售貨機會根據其所處的 狀態決定所售出的飲料,即,處於“偏好可樂”的狀態時 將出售可樂,而處於“偏好茶”的狀態時將出售茶。該售 貨機會在每次出售之後在兩種狀態之間進行轉換,轉換的 概率如下: 處於“偏好可樂”狀態時,轉換到“偏好茶”狀態的 概率是3 0%,保持在“偏好可樂”狀態的概率是70% ; 出於“偏好茶”的狀態時,轉換到“偏好可樂”狀態 的概率是5 0%,保持在“偏好茶”狀態的概率也是5 0%。 對於上述的飲料售貨機,如果希望確定某一種特定的 出售飲料序列的概率,可以爲上述的飲料售貨機建立HMM 模型。該模型可參考第1 b圖所示,其中包括兩個狀態:“偏 好可樂”狀態和“偏好茶”狀態,根據上面列出的狀態轉 換概率,確定轉態每一條轉移路徑的概率,如第lb圖所 示。當需要確定某一個特定的出售序列時,可以通過計算 7 200937308 所有會出現上述結果的路徑的概率,並將它們求和,就能 夠獲得14 一種特定出售序列的出現概率。 HMM拓撲結構正是反映了在hmm模型中各個狀態之 間的轉移順序連接關係。 HMM模型以及HMM拓撲結構的應用涉及三類主要的 , 問題: 1 )當給出一個HMM模型時,確定某一種特定的“觀 © 測事件(對應一個狀態轉移順序)出現的概率。 2 )當給出一個hmm模型和一個特定的“觀測事件” 時’選擇一個狀態轉移順序,該狀態轉移順序能最好地描 述該特定的“觀測事件’’。 3 )當給出一個特定的“觀測事件”時以及一組可能的 HMM模型空間,確定最佳的hmm模型來描述該“觀測事 件”。 HMM模型的上述特點使得它在解決下列的問題時尤其 ❹ 有用:潛在的事件的出現概率會影響觀測事件。一個典型 的應用是在識別領域,尤其是語言識別,包括語音識別技 術和手寫識別技術》HMM模型是一種可訓練的模型,通過 ' 大量的資料訓練,能夠獲得一組最適合於表示“觀測事 v 件”的HMM模型參數,之後,採用這個從資料訓練中獲 得的HMM模型,就能確定另外一個“觀測事件”與該模 型的匹配程度,根據匹配程度,就能確定一個未知的“觀 測事件”最可能屬於哪一個模型◊這樣,就達到了識別的 目的。 8 200937308 目前,雖然已經開發了多種利用 HMM模型的識別技 術,但是它們大多應用於西文,而對於東方文字,尤其是 東亞地區使用的文字,諸如漢字,卻不能進行有效的識別。 這主要是由於西文,例如英語和東亞字元,例如漢字在字 形結構上的差異所造成的。For example, give an example to illustrate the HMM model. There is a beverage vending machine offering two drinks, cola and tea. This beverage vending opportunity has two states, the status of "preferable cola" and the state of "preference tea". When investing in a grazing coin to purchase a beverage, the vending opportunity determines the beverage to be sold according to its state, that is, Cola will be sold in the "preferable cola" state, while tea will be sold in the "preference tea" state. The vending machine converts between the two states after each sale, and the probability of conversion is as follows: When in the "Preference Coke" state, the probability of transitioning to the "Preference Tea" state is 30%, remaining in "Preference Coke" The probability of the state is 70%; out of the state of "preference tea", the probability of switching to the "preferable cola" state is 50%, and the probability of staying in the "preference tea" state is also 50%. For the above-described beverage vending machine, if it is desired to determine the probability of a particular one of the beverage sequences being sold, an HMM model can be established for the beverage vending machine described above. The model can be referred to Figure 1 b, which includes two states: "Preference cola" status and "Preference Tea" status. According to the state transition probability listed above, the probability of each transition path is determined. The lb diagram shows. When it is necessary to determine a particular sale sequence, it is possible to obtain the probability of occurrence of 14 a particular sale sequence by calculating the probability of all the paths for which the above results will occur in 200937308 and summing them. The HMM topology reflects the transfer order connection between states in the hmm model. The application of the HMM model and the HMM topology involves three main types of problems: 1) When an HMM model is given, the probability of occurrence of a particular “viewing event (corresponding to a state transition order) is determined. 2) Given a hmm model and a specific "observation event", select a state transition sequence that best describes the particular "observation event". 3) When a specific "observation event" is given and a set of possible HMM model spaces, the best hmm model is determined to describe the "observation event". The above characteristics of the HMM model make it particularly useful when solving the following problems: the probability of occurrence of a potential event affects the observed event. A typical application is in the field of recognition, especially speech recognition, including speech recognition technology and handwriting recognition technology. The HMM model is a trainable model. Through a large amount of data training, it is possible to obtain a group that is most suitable for representing "observation matters." The HMM model parameters of the V piece, after which the HMM model obtained from the data training can be used to determine the degree of matching between the other “observation event” and the model. According to the matching degree, an unknown “observation event” can be determined. "Which model is most likely to belong to this, it achieves the purpose of identification. 8 200937308 At present, although a variety of identification techniques using HMM models have been developed, they are mostly used in Western languages, but for Eastern languages, especially those used in East Asia, such as Chinese characters, they cannot be effectively identified. This is mainly due to Western languages such as English and East Asian characters, such as the difference in the glyph structure of Chinese characters.

對於西文來說,每一個字都是由字母組成,而單個的 字母結構簡單,通常都是一筆可以完成,不存在可變的筆 劃順序的問題,同時,字母和字母之間的相似程度相對較 低,除了個別的字母之外,大多數字母具有自己明顯的特 徵。這些特點都給手寫識別帶來了許多的便利。因此,目 前開發的以西文爲主要識別物件的HMM模型一般都具有 左至右的HMM拓撲結構,通過限定HMM拓撲結構的起始 點和終止點基本就能夠描述一般西文字母的特點。 但是東亞字元具有明顯不同的特點,以漢字爲例: 1 )筆劃多,在手寫過程中存在筆劃順序不同的問題; 2)結構複雜,使得字形變化較多,體現在手寫方面, 就是具有多種手寫風格; 3 )存在筆劃間連接的問題,由於漢字筆劃多,結構複 雜,再加上個人的手寫習慣,會導致筆劃之間的連接存在 很多的不確定性; 4 )資料量大,漢字的每一個字都是一個獨立的個體, 而不是像英文那樣可以進一步度拆分成字母這樣一個數量 有限的單位,因此,對於漢字來說,手寫識別模型的資料 量是十分巨大的。 9 200937308 通過上面的分析,就可看到,由於東亞字元 字形特點上存在很多明顯的差異,就導致了目前針對西文 開發的HMM模型並不適用於東亞字元,尤其 =的棋型不能有效地解決筆劃順序、書寫風格的問 提供解決方Γ東亞字元筆劃之間的連接,也無法很好地 2供:决方案’因此,導致了目前在手寫東亞字元識別方 面,尚沒有一種很好的識別技術。 ❹ ❹ 【發明内容】 本發明的目的旨在提供一種構建用於識別手寫東亞字 时法及系'絶,以針對手寫東亞字元 的特點對手“東亞字元進行有效的識別。 查,據本發明的一態樣,針對手寫東亞字元筆劃多,筆 別,的連接過度關係複雜的特.點,設計一帛建立適用於識 立手寫東亞字元的HMM拓撲結構的方法該設計在所建 的HMM拓撲結構中提供描述手寫東亞字元持續筆劃的 通'^及描述手寫東亞字元筆劃間轉角的轉角狀態。 丨入針對筆劃間轉角的轉角狀態,來使得整個 拓撲結構报好地反映手寫東亞字元的特點。 發月的一個實現中,HMM拓撲結構是被設計成左 .拓撲結構,從一起始狀態開始,至一終止狀態 者拓撲結構中的持續狀態可以轉移至下一狀態或 轉移@轉角狀態只能轉移至下態’不能自轉移; 並且,持續狀態和轉角狀態依次交替存在。 10 200937308 根據本發明的另_態樣,針對手寫東 多樣,書寫風格多樣的特點。提供—種多路徑疋筆劃順序 結構^多路徑HMM拓撲結構中的每一條路=腦拓撲 東亞字元的多種筆劃 '、徑對應手寫 中Sj順序中的一個;去 撲結構十的每一你极, 4甲夕路徑HMM拓 W母條路徑對應手寫東亞字元的 中的一個。通過將 夕種手寫風格 久呀夕種筆劃順序或者手 集成到-個ΗΜΜ拓撲結構中 寫風格的路徑 字元筆劃順序多樣和手寫 解決手寫東亞 和手寫風格多樣的問題。 在本發明的—個會 -條路徑是左向右路徑_括撲結構中的每 至-終止狀態結束.…撲結構,從一起始狀態開始, 供描述手寫東亞ί ΗΜΜ拓撲結構令同樣接 亍冩東亞字元持續筆 丨』樣提 亞字元筆割間轉角續狀態Μ及描述手寫東 用的轉角狀態包括持锖壯能 其中持績狀態可以韓# $ ㈣I態和轉角狀態, 只能轉移至下ιΓ 狀態或者自轉移,轉角狀味 狀態,不能自轉移;持讀妝能士 # 態 也是依次交替存在. 也和轉角狀態 狀態,至同-“狀:二中所有路徑起始於同〜 田口狀態結束。同時,本發明 拓撲結構中的路徑進行人 ^ ^ Hm 進仃合併處理,以控制路徑的數量 根據本發明的另一態#,針對手 ° 連接過渡的不確定性,g , 子70筆割之間 疋法’即存在即可能是實筆, 筆的部分,本發明同樣提供了解決的方能巧虛 拓撲結構尹提供平行狀態,對應手寫東亞一’在ηΜμ 虛筆劃,也可能是實筆劃的部分》或者,:二::能是 構應用多空間概率分佈(MSD),對應 · Α 撲結 果亞子π中既可For Western languages, each word is composed of letters, and the individual letters are simple in structure, usually one can be completed, there is no problem of variable stroke order, and the similarity between letters and letters is relative. Lower, except for individual letters, most letters have their own distinct characteristics. These features bring a lot of convenience to handwriting recognition. Therefore, the HMM models developed with Western language as the main identification object generally have left-to-right HMM topologies. The characteristics of general western letters can be described by defining the starting and ending points of the HMM topology. However, East Asian characters have distinct characteristics. Take Chinese characters as an example: 1) There are many strokes, and there are different stroke sequences in the handwriting process; 2) The structure is complicated, which makes the fonts change more, which is reflected in the handwriting. Handwriting style; 3) There is a problem of connection between strokes. Due to the large number of Chinese strokes and complicated structure, coupled with personal handwriting habits, there will be a lot of uncertainty in the connection between strokes; 4) Large amount of data, Chinese characters Each word is an independent individual, rather than a limited number of units that can be further split into letters like English. Therefore, for Chinese characters, the amount of data in the handwriting recognition model is very large. 9 200937308 Through the above analysis, we can see that there are many obvious differences in the characteristics of East Asian characters, which leads to the current HMM model for Western language development is not applicable to East Asian characters, especially the chess type can not Effectively solve the stroke sequence, the writing style of the question provides a solution between the East Asian character strokes, and can not be well provided 2: the program's, thus leading to the current handwritten East Asian character recognition, there is no one Very good identification technology. ❹ ❹ [Summary] The object of the present invention is to provide a method for constructing a handwritten East Asian word and a method for identifying the East Asian character for the handwritten East Asian character. In one aspect of the invention, a method for designing an HMM topology suitable for handwriting East Asian characters is established in accordance with a method of writing a handwritten East Asian character stroke, a pen, and a connection with a complicated relationship. The HMM topology provides a description of the handwritten East Asian character continuous strokes and describes the corner state of the handwritten East Asian character strokes. The corner state of the corner between the strokes is entered to make the entire topology report well. The characteristics of East Asian characters. In an implementation of the moon, the HMM topology is designed as a left topology. From a starting state to a state of termination, the state of the topology can be transferred to the next state or transferred. @角角状态 can only be transferred to the lower state 'cannot be transferred; and the continuous state and the corner state alternately exist. 10 200937308 According to this issue The other _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ Corresponding to one of the Sj sequences in the handwriting; each of the poles of the structure that goes to the structure of the tenth, the path of the HMM extension and the mother of the four mothers corresponds to one of the handwritten East Asian characters. Sequence or hand integration into a ΗΜΜ topology, the writing style of the path character stroke order is diverse and handwritten to solve the problem of handwritten East Asian and handwritten styles. In the present invention - a - path is left to right path _ Every end-to-end state in the structure ends....The structure of the flap, starting from a starting state, is used to describe the handwritten East Asia ΗΜΜ ΗΜΜ topology so that the same type of East Asian characters continue to be written. State Μ and description of the corner state of the handwritten East include the holding of the strong state of the performance state can be Han # $ (4) I state and corner state, can only be transferred to the next ιΓ state or self-transfer, corner Taste, and can not self-transition; Thorens # makeup holding state is read sequentially and alternately present can corners STATUS, to the same - "shape: all paths starting from the two ends with the Taguchi ~ state. At the same time, the path in the topology of the present invention is subjected to a merge process to control the number of paths according to another state of the present invention #, for the uncertainty of the hand-to-connection transition, g, sub-70 strokes The method of 疋 即 即 即 即 即 即 即 即 即 即 即 即 即 即 即 即 即 本 提供 提供 提供 提供 提供 提供 提供 提供 提供 提供 提供 提供 提供 提供 提供 提供 提供 提供 提供 提供 提供 提供 提供 提供 提供 提供 提供 提供 提供 提供 提供 提供 提供 提供 提供Partially, or,: 2:: can be applied to the application of multi-space probability distribution (MSD), corresponding to Α 结果 结果 亚 亚 亚 亚

200937308 能是虛筆劃,也可能是實筆劃的部分。 根據本發明的另一態樣’爲了減少冗餘的資 低運算的複雜程度,本發明還對HMM拓撲結構 進行聚類,使HMM拓撲結構中的至少一組狀態$ 且對於一組共用參數的狀態,在HMM拓撲結構 一套參數。 本發明同樣提供適合於手寫東亞字元識別的 撲結構的實現方法,該多路徑HMM拓撲結構從 字元的訓練資料中自動構建;根據手寫東亞字元 序或者書寫風格,應用—機器自學習的自動分類 練資料進行聚m ’每-個類別的資料對應於一個 或一個書寫風格。 —_ …识,-〇诉疋、 子π的訓練資料自翻+上、 _ 貝丁寸目動生成而不需要人工的干 料包括手寫東亞字元鲞扰样士 t70筆跡樣本,筆跡樣本包 序或者不同手寫風格200937308 can be a virtual stroke, or it may be part of a real stroke. According to another aspect of the present invention, in order to reduce the complexity of redundant low cost operations, the present invention also clusters HMM topologies such that at least one set of states in the HMM topology is $ and for a set of shared parameters State, a set of parameters in the HMM topology. The present invention also provides an implementation method of a pounce structure suitable for handwritten East Asian character recognition. The multi-path HMM topology is automatically constructed from training materials of characters; according to handwritten East Asian character order or writing style, application-machine self-learning Automatic classification and training of materials to gather m 'Each-category data corresponds to one or one writing style. —_ ... ..., 〇 〇 疋, π 训练 training data self-turning + up, _ Beiding inch visual generation without the need for artificial dry materials including handwritten East Asian characters harassing sample t70 handwriting samples, handwriting sample package Preface or different handwriting style

的筆跡樣本,在構建H. 的過程中,可包括& T ^括如下幾個階段:對於每— 按弧長將其分成备彳 、刀攻數個分段,每一分段分別 而將分段特徵順序排 ..,, 負序排列紐成整個筆跡樣本的. 一体 每-個聚類對應於一插‘ 筆劃順序或者每—種=個聚類的資料確定〗 徑的拓撲及初始參數^風格的HMM拓撲; 發月的—個實見中’還建立對應於; 料量,降 中的狀態 t用參數, 中只保存 HMM拓 手寫東亞 的筆劃順 方法對訓 書寫順序 手寫東亞 該訓練資 同筆劃順 拓撲結構 跡樣本, 特徵;從 特徵;對 順序或者 於每一種 中的一路 及特徵的 12 ❹ ❹ 200937308The handwriting sample, in the process of constructing H., may include & T ^ in the following stages: for each - according to the arc length, it is divided into spare parts, knife attack several segments, each segment will be The segmentation feature sequence is ..,, and the negative sequence is arranged into the entire handwriting sample. The integration of each cluster corresponds to a plug-in stroke order or the data of each cluster = the cluster determines the topology and initial parameters of the path. ^Style of the HMM topology; the month of the month - a real view 'also establishes the corresponding; the amount of material, the state of the drop t with parameters, only save the HMM extension handwriting East Asia stroke stroke method for the training writing sequence handwriting East Asia training笔 笔 顺 拓扑 拓扑 拓扑 拓扑 拓扑 拓扑 拓扑 拓扑 拓扑 拓扑 拓扑 拓扑 拓扑 拓扑 拓扑 拓扑 拓扑 拓扑 拓扑 拓扑 ❹ ❹ ❹ ❹ ❹ ❹ ❹ ❹ ❹ ❹ ❹ ❹ ❹ ❹

子序列方向直方圖向量’以幫助建立該HMM 在本發明的-個實現中,還對多路徑麵 的路徑進行合併,以控料徑的數量。 由於手寫東亞字元的資料量巨大,爲了減 降低實現的複雜度’在本發明的一個實現中 腹BM拓撲結構中每兩個狀態之間的狀態相似 定疋否進行聚類的操作。 用二明同樣提供能夠實現上述的各個態標 用H_模型來對手寫東亞字元進行識別。 本發明針對手寫東亞字元的特點提供了— 東:字元進行識料_拓撲結, 了手寫東亞予儿签划夕 風格多樣、筆劃門連:順序多樣、結相 筆劃間連接不確定的特點, 結構中引入轉角狀態1供多路徑提 來解決上述的問題, 資料量,-低運算的複雜程度…和… 【實施方式] 本發明針對手寫東亞字元的特 多樣、結構複雜、書寫風 也7 一任名,m 依筆劃間連招 別“Γ,Γ…_模型來對手寫㈣ 案文進了ΗΜΜ拓撲結構的特點,心 撲結構中加入轉角狀態、提供多路後、提供平 段來很好地反映手寫東亞字元的上述特點:、 ;撲結構。 拓撲結構中 小資料量, ,通過計算 度度量來確 的系統,利 種建立適用 ,充分考慮 複雜、書寫 ΗΜΜ拓撲 狀態的手段 操作來減小 、筆劃順序 不確定。提 字元進行識 在ΗΜΜ拓 行狀態的手 且,通過聚 13 200937308 類和合併的操作來減小資料量,降低運算的複雜程度。本 發明的HMM拓撲結構是從訓練資料中自行生成,並不需 要人工干預。 需要說明的一點是,在下面所要描述的實施例中,是 以漢字爲例進行說明的,但是本發明的範圍不限於漢字, 而是具有和漢字相似特徵的所有的東亞文字字元,包括曰 本漢字,曰本的假名等等。Subsequence Direction Histogram Vector' to help establish the HMM In an implementation of the present invention, the paths of the multipath faces are also combined to control the number of streams. Since the amount of data written by East Asian characters is large, in order to reduce the complexity of implementation', in one implementation of the present invention, the state between every two states in the abdominal BM topology is similarly determined as the operation of clustering. The use of Erming also provides the ability to implement the above-mentioned various criteria using the H_model to identify handwritten East Asian characters. The invention provides the characteristics of handwritten East Asian characters. - East: Characters for knowledge _ topological knots, handwritten East Asia to children signing eve styles, strokes, door series: sequence diversity, phase-to-phase stroke connection uncertainty In the structure, the corner state 1 is introduced for multipath to solve the above problem, the amount of data, the complexity of the low operation, and the like. [Embodiment] The present invention is directed to the handwriting of East Asian characters, the complexity of the structure, and the writing style. 7 A name, m strokes between strokes, "Γ, Γ..._ model to handwriting (4) The text has entered the characteristics of the topology, adding a corner state to the heartbeat structure, providing multiple channels, providing a flat section is very good It reflects the above characteristics of handwritten East Asian characters: , ; flapping structure. The small amount of data in the topological structure, the system determined by the calculation degree, the establishment of the appropriate kind, the full consideration of the complex, writing and topological state means to reduce The order of the strokes is uncertain. The characters are identified by the hand in the state of the extension line, and the amount of data is reduced by the operation of the class 13 200937308 and the merge operation. The complexity of the operation. The HMM topology of the present invention is self-generated from the training data and does not require manual intervention. It should be noted that in the following embodiments, Chinese characters are taken as an example, but The scope of the present invention is not limited to Chinese characters, but all East Asian characters having similar features to Chinese characters, including kanji Chinese characters, katakana's pseudonyms, and the like.

適當的實現環境 第1 a圖說明了適當的計算系統環境10 0的一例,其中 可以實現本發明。計算系統環境1 00僅是適當的計算環境 的一例並且並非意圖限制本發明的使用範圍或功能。計算 環境1 00不應被解釋爲具有與示例性計算環境1 00中所述 的元件的任一或組合有關的從屬性或要求。 本領域的技術人員可以理解,電腦或其他客戶端或伺 服器設備可以作爲部分電腦網路而採用,或者用於分散式 計算環境中。在這點上,本發明屬於具有任意數量記憶體 或存儲單元的任意電腦系統,以及發生在任意數量存儲單 元或容量上的任意數量的應用程式和過程,它們可以與本 發明一起使用。本發明可以應用於在網路環境或分散式計 算環境中採用伺服器電腦和客戶端電腦的環境。本發明還 可以用於獨立計算設備,具有編程語言功能、以及與遠端 或本地服務一起産生、接收和發射資訊的解譯和執行能力。 本發明可以用多種其他通用或專用計算系統環境或配 置來操作。可以適合與本發明一起使用的公知計算系統、 14 200937308 環境和/或配置的示例包括、但不限於:個人電腦、伺 電腦、手提或攜帶型設備、多處理器系統、基於微處 的系統、機頂盒、可編程用戶電子設備、網路P C、小Appropriate Implementation Environment Figure 1a illustrates an example of a suitable computing system environment 100 in which the present invention may be implemented. The computing system environment 100 is only one example of a suitable computing environment and is not intended to limit the scope of use or functionality of the present invention. The computing environment 100 should not be construed as having a dependency or requirement relating to any or combination of the elements described in the exemplary computing environment 100. Those skilled in the art will appreciate that a computer or other client or server device can be employed as part of a computer network or in a distributed computing environment. In this regard, the present invention pertains to any computer system having any number of memory or storage units, and any number of applications and processes occurring in any number of storage units or capacities that can be used with the present invention. The present invention can be applied to an environment in which a server computer and a client computer are employed in a network environment or a distributed computing environment. The present invention can also be used in stand-alone computing devices, with programming language functions, and interpretation and execution capabilities for generating, receiving, and transmitting information with remote or local services. The invention can operate in a variety of other general purpose or special purpose computing system environments or configurations. Examples of well-known computing systems that may be suitable for use with the present invention, 14 200937308 environments and/or configurations include, but are not limited to, personal computers, servos, portable or portable devices, multi-processor systems, micro-based systems, Set-top box, programmable user electronics, network PC, small

腦、大型電腦、包括任一上述系統的分散式計算環境I * 本發明可以用電腦可執行指令的一般上下文來描 . 譬如由電腦執行的程式模組。一般而言,程式模組包 式、程式、物件、元件、資料結構等,它們執行特定 Α 或實現特定的抽象資料類型。本發明還可以實際用於 ◎ 式計算環境中,其中由通過通信網路或其他資料傳輸 連接的遠端處理設備來執行任務。在分散式計算環境 程式模組及其它資料可以位於本地和遠端存儲媒體中 括記憶體存儲設備。分散式計算通過計算設備和系統 直接交換以便於共用電腦資源和服務。這些資源和服 括資訊、快取記憶體、及文件磁片存儲的交換。分散 算利用網路連接性,允許用戶機發揮它們的集體功效 利於整個公司。在這點上,多種設備可以具有應用程 〇 物件或資源,它們可以利用本發明的技術。 參考第1 a圖,用於實現本發明的示例性系統包括 爲電腦110的通用計算設備。電腦110的元件可以包 但不限於:處理單元1 2 0、系統記憶體1 3 0、及把包括 w 記憶體在内的各種系統元件耦合至處理單元1 2 0的系 流排1 2 1。系統匯流排1 2 1可以是多種類型匯流排結 任一種,包括記憶體匯流排或記憶體控制器、週邊設 流排、及使用任一多種匯流排結構的本地匯流排。經 服器 理器 型電 :等。 述’ 括常 任務 分散 媒體 中, ,包 間的 務包 式計 來有 式、 形式 括、 系統 統匯 -構的 備匯 由示 15 200937308 例但非限制,這種結構包括工業標準結構(ISA)匯流排、微 通道結構(MCA)匯流排、増強型IS A(EIS A)匯流排、視頻 電子標準聯盟(VESA)本地匯流排、及週邊元件互速(PCI) 匯流排(也稱爲Mezzanine匯流排)。Brain, large computer, decentralized computing environment including any of the above systems I* The present invention can be described in the general context of computer executable instructions, such as a program module executed by a computer. In general, program modules include packages, programs, objects, components, data structures, etc. that perform specific Α or implement specific abstract data types. The present invention can also be practiced in a computing environment where tasks are performed by remote processing devices that are connected through a communications network or other data transfer. In a decentralized computing environment, program modules and other data can be stored in local and remote storage media, including memory storage devices. Decentralized computing is directly exchanged between computing devices and systems to facilitate sharing of computer resources and services. These resources are exchanged for information, cache memory, and file disk storage. Decentralized computing uses network connectivity to allow users to leverage their collective benefits for the entire company. In this regard, a variety of devices may have application objects or resources that may utilize the techniques of the present invention. Referring to Figure 1a, an exemplary system for implementing the present invention includes a general purpose computing device for computer 110. The components of computer 110 may include, but are not limited to, processing unit 120, system memory 130, and various system components, including w memory, coupled to stream 1 1 1 of processing unit 120. The system bus 1 1 1 can be any of a variety of types of bus bars, including a memory bus or memory controller, a peripheral bus, and a local bus using any of a variety of bus structures. Transceiver processor type: etc. In the "distributed media", the package-based package, the form, the system, and the system are all provided by the company. This structure includes the industry standard structure (ISA). Busbar, Microchannel Structure (MCA) Bus, Reluctant IS A (EIS A) Bus, Video Electronics Standards Alliance (VESA) Local Bus, and Peripheral Component Mutual (PCI) Bus (also known as Mezzanine) Bus bar).

電腦110 —般包括各種電腦可讀媒體。電腦可讀媒體 可以是能由電腦110存取的任何可用媒體並包括揮發性和 非揮發性的媒體、可抽取和不可抽取媒體。通過示例但非 限制’電腦可讀媒體可以包括電腦存儲媒體和通信媒體。 電腦存儲媒體包括揮發性和非揮發性、可抽取和不可抽取 媒體,它們以用於存儲諸如電腦可讀指令、資料結構 '程 式模組或其他資料這樣的資訊的任意方法或技術來實現。 電腦存儲媒體包括、但不限於:RAM、R〇M、EEpR〇M' 快閃記憶體或其他存儲技術、CDR⑽、乡功能數位光碟 (DVD)或其他光碟記憶體、㈣盒、磁帶、磁盤記憶體或 其他磁性存健設備、或用於存儲期望資訊並能由電腦"〇 存取的任意其他媒體。通信媒體—般在諸如載波或其他傳 輸機制這樣的已調資料信號令包含電腦可讀指+、資料結 構:程式棋組或其他資料’並且包括任意資訊傳遞媒體。 術 已調資料仏號意、指其-個或多個特性以對信號内 資訊進仃編碼的方式被設置或改變的信號。通過示例但非 限制通仏媒體包括諸如有線網路或直接線連接這樣的有 線媒體卩及諸如聲音、RF、紅外線這樣的無線媒體及其 它無線媒鱧》i述的任意組合應該包含在電腦可讀媒體的 16 200937308 系統記憶體130包括電腦存儲媒體,其形式爲揮發性 和’或非揮發性記憶體,譬如唯讀記憶體(R〇M)13 1和隨機 存取記憶體(RAM)132。基本輸入/輸出系統i33(BI〇s)—般 存健在ROM 131内’它包含例如啓動期間幫助在電腦ι1〇 内的元件間傳輸資訊的基本常式。RAM 132 —般包含資料 > 和/或程式模組’它們可以立即存取並且/或者當前由處理 單元1 2 0在其上操作。經由示例但非限制,第1 &圊說明了 Q 作業系統134、應用程式135、其他程式模組136和程式資 料 1 3 7。 電腦110還可以包括其他可抽取/不可抽取、揮發性/ 非揮發性電腦存儲媒體。僅僅通過示例,第la圖說明了對 不可抽取、非揮發性磁性媒體進行讀寫的硬碟驅動器 Μ卜對可抽取、非揮發性磁片152進行讀寫的磁碟機151、 以及對可抽取、非揮發性光碟156進行讀寫的光碟媒動器 155,譬如CD ROM或其他光學媒體。示例性操作環境中 可用的其他可抽取/不可抽取、揮發性/非揮發性許瞀六 ^ °τ异存錄 _ 媒體包括、但不限於:磁帶盒、快閃記憶體卡、舫办、s 双证通用 盤、數位視頻磁帶、固態RAM、固態ROM等蓴。成4 ^ 硬碟驅 . 動器141 一般通過如介面140這樣的不可抽取記憶體介面 與系統匯流排121相連,且磁碟機151和光碟驅動器155 一般用如介面150這樣的可抽取記憶體介面與系鉍 ^尔玩匯流排 • 121相連。 上面討論並在第la圖中說明的驅動器和它們沾 1的相關電 腦存儲媒體爲電腦110提供了電腦可讀指令、資料锋構 17 200937308 程式模組和其他資料的存儲。在第la圖中,例如,所述硬 碟驅動器141存儲作業系統144、應用程式145、其他程式 模組1 4 6和程式資料1 4 7。注意到這些元件或者可與作業 系統1 3 4、應用程式1 3 5、其他程式模組1 3 6和程式資料 1 3 7相同,或者與它們不同。這裏爲作業系統1 44、應用程 式145、其他程式模組146和程式資料147給出不同數為 以說明它們至少是不同的副本。用戶可以通過諸如鍵盤 162和指示設備161這樣的輸入設備把命令和資訊輸入到 電腦110中,輸入設備通常稱爲滑鼠、軌跡球或觸板。其 他輸入設備(未示出)可以包括麥克風、遊戲桿、遊戲板、 衛星碟、掃描器等等。這些和其他輸入設備經常通過與系 統匯流排121耦合的用戶輸入介面160與處理單元120相 連,但也可以用其他介面和匯流排結構連接,譬如平行埠、 遊戲埠或通用串列匯流排(USB)。監視器191或其他類型 的顯示設備也通過諸如視頻介面190這樣的介面與系統匯 流排121相連。除了監視器19 1之外,電腦還可以包括其 他外部設備,如揚聲器1 9 7和印表機1 9 6,它們可以通過 輸出周邊介面195連接。 電腦1 1 0可以工作在聯網環境中,該環境使用與諸如 遠端電腦 180這樣的一個或多個遠端電腦之間的邏輯連 接。遠端電腦1 80可以是個人電腦、伺服器、路由器、網 路PC、對等設備或其他公共網路節點,並且一般包括上述 與電腦110有關的許多或全部元件,儘管第la圖中僅說明 了記憶體存儲設備181。第la圖所述的邏輯連接包括區域 18 200937308 網路(LAN) 1 7 1 和廣域網路(WAN) 1 73,但可以還包括其他 網路。這種聯網環境在辦公室、企業範圍電腦網路、企業 内聯網和互聯網中是常見的。Computer 110 typically includes a variety of computer readable media. The computer readable medium can be any available media that can be accessed by computer 110 and includes both volatile and non-volatile media, extractable and non-extractable media. By way of example and not limitation, computer readable media may include computer storage media and communication media. Computer storage media includes volatile and non-volatile, extractable and non-extractable media implemented in any method or technology for storage of information such as computer readable instructions, data structures, or other materials. Computer storage media includes, but is not limited to, RAM, R〇M, EEpR〇M' flash memory or other storage technology, CDR (10), home digital compact disc (DVD) or other disc memory, (4) box, tape, disk memory Body or other magnetic storage device, or any other medium used to store desired information and accessible by the computer. Communication media typically includes a computer readable finger +, data structure: a program chess group or other material' and includes any information delivery media, such as a carrier or other transmission mechanism. A data has been tuned to mean a signal that has one or more characteristics set or changed in such a way as to encode information within the signal. By way of example and not limitation, all media, including wired media such as a wired network or direct-line connection, and any combination of wireless media such as voice, RF, infrared, and other wireless media, should be Media 16 200937308 System Memory 130 includes computer storage media in the form of volatile and 'or non-volatile memory such as read only memory (R〇M) 13 1 and random access memory (RAM) 132. The basic input/output system i33 (BI〇s) is normally stored in the ROM 131. It contains, for example, a basic routine for facilitating the transfer of information between components within the computer ι1〇 during startup. RAM 132 generally contains data > and/or program modules' which are immediately accessible and/or currently being operated by processing unit 120. By way of example and not limitation, Section 1 & 圊 illustrates Q operating system 134, application 135, other programming modules 136, and program information 1 37. Computer 110 may also include other extractable/non-extractable, volatile/non-volatile computer storage media. By way of example only, FIG. 1 illustrates a hard disk drive for reading and writing non-extractable, non-volatile magnetic media, a disk drive 151 for reading and writing non-volatile magnetic sheets 152, and a removable extractor. The disc drive 155 for reading and writing the non-volatile optical disc 156, such as a CD ROM or other optical medium. Other extractable/non-extractable, volatile/non-volatile, non-extractable, non-volatile, and non-volatile memory available in the exemplary operating environment. Media includes, but is not limited to, tape cartridges, flash memory cards, devices, s Dual-certified universal disk, digital video tape, solid-state RAM, solid-state ROM, etc. The hard disk drive 141 is typically connected to the system bus bar 121 by a non-extractable memory interface such as interface 140, and the disk drive 151 and the optical disk drive 155 are typically interfaced with a removable memory interface such as interface 150. Connected to the system 121^尔玩汇排排•121. The drives and their associated computer storage media discussed above and illustrated in FIG. 1 provide computer 110 with computer readable instructions, data source 17 200937308 program modules and other data storage. In the figure la, for example, the hard disk drive 141 stores the operating system 144, the application program 145, the other program modules 146, and the program data 148. It is noted that these components may be the same as or different from the operating system 134, the application 135, the other program modules 136 and the program data 137. Here, the operating system 1 44, the application 145, the other program modules 146, and the program data 147 are given different numbers to indicate that they are at least different copies. The user can enter commands and information into the computer 110 through input devices such as a keyboard 162 and pointing device 161, commonly referred to as a mouse, trackball or touch pad. Other input devices (not shown) may include a microphone, joystick, game board, satellite dish, scanner, and the like. These and other input devices are often coupled to the processing unit 120 via a user input interface 160 coupled to the system bus 121, but may be connected by other interfaces and bus structures, such as parallel ports, gaming ports, or universal serial busses (USB). ). A monitor 191 or other type of display device is also coupled to the system bus 121 via an interface such as video interface 190. In addition to the monitor 19 1 , the computer may include other external devices such as a speaker 197 and a printer 196, which may be connected by an output peripheral interface 195. The computer 110 can operate in a networked environment that uses a logical connection with one or more remote computers, such as the remote computer 180. The remote computer 1 80 can be a personal computer, server, router, network PC, peer device, or other public network node, and generally includes many or all of the components described above in connection with the computer 110, although only illustrated in FIG. A memory storage device 181. The logical connections described in Figure la include the area 18 200937308 Network (LAN) 1 7 1 and Wide Area Network (WAN) 1 73, but may also include other networks. This networking environment is commonplace in offices, enterprise-wide computer networks, corporate intranets, and the Internet.

當用於LAN聯網環境中時,電腦110通過網路介面或 適配器170與LAN 171相連。當用於WAN聯網環境中時, 電腦110 —般包括用於在諸如互聯網這樣的WAN 173上建 立通信的數據機172或其他裝置。數據機172可以是内部 或外部的,它可以通過用戶輸入介面160或其他適當機制 與系統匯流排1 2 1相連。在網路化環境中,關於電腦1 1 0 所述的程式模組或其部分可以存儲在遠端記憶體存儲設備 中。通過示例但非限制,第1 a圖說明了駐留在記憶體設備 181上的遠端應用程式185。可以理解,所示網路連接是示 例性的,也可以使用在電腦間建立通信連接的其他裝置。 術語定義 爲了更簡潔、清楚地描述本發明,在本文中,下列的 術語特指如下的含義, 筆劃:書寫時的運動軌跡。對於識別採樣來說,在書 寫時,筆尖與接觸板接觸時,就會留下“筆跡”,不與接 觸板接觸時,就不會留下“筆跡”。但是,如果考慮書寫 時筆尖在空間的運動,則是一個連續的軌跡。在本發明中, 術語“筆劃”將泛指這種運動的軌跡,無論其是否留下 “筆跡”。 實筆劃:實筆劃指留下筆跡的筆劃,即在書寫時,筆 19 200937308 尖與接觸板接觸時的運動軌跡。 虛筆劃:虛筆劃指不留下筆跡的筆劃,即在書寫時, 筆尖與不接觸板接觸時的運動軌跡,虛筆劃多用於體現實 筆劃之間的連接,需要說明的是,本發明中,虛筆劃主要 體現筆尖的運動趨勢和方向,對於虛筆劃,並不一定要求 完全反映筆尖在這個階段所經過的實際軌跡。 持續筆劃:在一段時間中,方向基本不變的筆劃。持 續筆劃可以是實筆劃,也可以是虛筆劃,持續筆劃反映的 是連續在基本相同方向上的筆尖運動。 轉角:方向明顯變化的筆劃。轉角是持續筆劃之間的 轉換過程,轉角反映具有顯著方向改變的筆尖運動。在筆 劃轉換是會出現轉角是手寫東亞字元的顯著特徵,本發明 爲此專門提供“轉角”來更好地反映這個特徵。 筆跡:筆跡是指手寫東亞字元所留下的具體的痕跡。 字根:一部分的筆劃或者組合。 ❹ 概述 儘管 HMM模型已經被廣泛地用於在線的手寫字元的 ' 識別領域,但是對於在線的手寫東亞字元的識別,仍然具 , 有兩個主要方面的挑戰:手寫東亞字元的筆劃順序多樣和 書寫風格多樣。典型的用於手寫字體識別的HMM拓撲結 構是左向右(left-to-right ) HMM拓撲結構。在本發明中, 涉及對HMM拓撲結構的改進,主要包括: 20 200937308 在HMM拓撲結構中添加“轉角狀態”,以反映手寫東 亞字元手寫過程筆劃之間的“轉角”。“轉角狀態”的自 轉移是受限制的,並且“轉角狀態”與反映手寫東亞字元 手寫過程中持續筆劃的“持續狀態”是交替出現。 在一個HMM中使用多條路徑,以反映同一個手寫東亞 字元的不同的筆劃順序以及書寫風格。 在構建這種HMM拓撲結構的過程中,本發明考慮了如 下的問題: 首先,在一個 HMM模型中,多少數量的路徑是合適 的。對於這個問題,本發明提出了一種新的解決方案,採 用子序列方向直方圖向量 (Subsequence Direction Histogram Vector)來對從字元資料中得到的路徑進行聚 類,以確定合適的路徑數量。 第二,在一個路徑中,多少數量的狀態是合適的。對 於這個問題,本發明利用手寫東亞字元中的“轉角”這一 特性,利用曲率尺度空間中轉角的連接關係來確定一個路 徑中“轉角狀態”的數量,再利用“轉角狀態”和“持續 狀態”是交替出現這一特徵,確定路徑中合適的狀態數。 第三,對於手寫東亞字元中特有的筆劃間連續的不確 定性,本發明採用了在HMM拓撲結構中設置平行狀態(比 如雙高斯混合模型)的方法,或者應用MSD的方法來加以 解決。 下面將分別就上述的問題詳細地加以描述。 21When used in a LAN networking environment, the computer 110 is coupled to the LAN 171 via a network interface or adapter 170. When used in a WAN networking environment, computer 110 typically includes a data machine 172 or other device for establishing communications over a WAN 173, such as the Internet. The data engine 172 can be internal or external and can be coupled to the system bus 1 1 1 via a user input interface 160 or other suitable mechanism. In a networked environment, the program modules described in relation to computer 101 may be stored in a remote memory storage device. By way of example and not limitation, FIG. 1a illustrates a remote application 185 residing on a memory device 181. It will be appreciated that the network connections shown are exemplary and other means of establishing a communication connection between computers may be used. Definition of Terms In order to describe the present invention in a more concise and clear manner, the following terms are specifically referred to herein as strokes: the trajectory of motion when writing. For the identification sampling, when the pen tip is in contact with the contact plate during writing, "handwriting" is left, and when it is not in contact with the contact plate, "writing" is not left. However, if you consider the movement of the nib in space while writing, it is a continuous trajectory. In the present invention, the term "stroke" will generally refer to the trajectory of such motion, whether or not it leaves a "handwriting". Real stroke: The real stroke refers to the stroke that leaves the handwriting, that is, when writing, the movement track of the tip of the pen 19 200937308 when it contacts the contact plate. Virtual strokes: virtual strokes refer to strokes that do not leave handwriting, that is, the movement trajectory when the nib is in contact with the non-contact plate during writing, and the virtual strokes are mostly used for the connection between the realistic strokes. It should be noted that, in the present invention, The virtual stroke mainly reflects the movement trend and direction of the nib. For the virtual stroke, it is not necessarily required to fully reflect the actual trajectory of the nib at this stage. Continuous stroke: A stroke that has a constant orientation over a period of time. The continuous stroke can be a real stroke or a virtual stroke, and the continuous stroke reflects the movement of the nib in the same direction in the same direction. Corner: A stroke that changes direction significantly. The corner is the transition between continuous strokes, and the corner reflects the movement of the tip with a significant change in direction. In the case of stroke conversion, it is a remarkable feature that the corner is a handwritten East Asian character. The present invention specifically provides a "turning angle" to better reflect this feature. Handwriting: Handwriting refers to the specific traces left by handwritten East Asian characters. Root: A part of a stroke or combination.概述 Overview Although the HMM model has been widely used in the 'identification field of online handwriting, the recognition of online handwritten East Asian characters is still there. There are two main challenges: the stroke order of handwritten East Asian characters. Diverse and writing styles. A typical HMM topology for handwritten font recognition is a left-to-right HMM topology. In the present invention, it relates to the improvement of the HMM topology, which mainly includes: 20 200937308 Add a "corner state" in the HMM topology to reflect the "corner" between the handwritten process strokes of the handwritten East Asian character. The self-transition of the "corner state" is restricted, and the "corner state" alternates with the "continuous state" that reflects the continuous stroke of the handwritten East Asian character handwriting. Multiple paths are used in one HMM to reflect the different stroke order and writing style of the same handwritten East Asian character. In the process of constructing such an HMM topology, the present invention considers the following questions: First, how many paths are appropriate in an HMM model. For this problem, the present invention proposes a new solution that uses a Subsequence Direction Histogram Vector to cluster the paths derived from the character data to determine the appropriate number of paths. Second, how many states are appropriate in a path. For this problem, the present invention utilizes the "turning angle" feature in the handwritten East Asian character, and uses the connection relationship of the corners in the curvature scale space to determine the number of "corner states" in a path, and then uses the "corner state" and "continuation". The state is an alternating occurrence of this feature, determining the appropriate number of states in the path. Third, for the continuous uncertainty between the strokes unique to handwritten East Asian characters, the present invention employs a method of setting a parallel state (e.g., a double Gaussian mixture model) in the HMM topology, or applying an MSD method. The above problems will be described in detail below. twenty one

200937308 適於識別手寫東亞字元的HMM拓撲結構 本發明主要針對手寫東亞字元的識別技術,分 東亞字元的特點,可以歸結爲以下幾個主要方面: 1 )筆劃順序多樣,書寫風格多樣。正如上面所S 手寫東亞字元的筆劃多,並且字型結構複雜,導致 個字會出現多種筆劃順序,並且,最終寫完的字會 多種字形或者字體,也就是所謂的手寫風格。 2)由於書寫風格的多樣性,同一個字的不同寫 現如下兩種典型的區別: 2a )對局部連續筆劃的不同的簡化寫法,導 的書寫風格可能具有不同的筆劃數。 2b )在實筆劃之間的連接過程中,可以以實 連,也可以以虛筆劃相連,而對於真正的東亞 使用者來說,無論以哪種筆劃相連,都被認可 的字元,這就對機器識別帶來了難度。 手寫東亞字元識別所要解決的第一個問題,即] 筆劃順序多樣、手寫風格多樣。 同一個漢字,不同的人的書寫習慣決定了會出 筆劃的順序,雖然按照規範漢字的標準,書寫的筆 是固定的,但是實際的統計表明,大多數的漢字在 應用中存在數種書寫順序,並且,採用每一種書寫 人群都佔有相當的比例。作爲手寫識別這樣面向大 術,必須考慮這種情況。據一個例子,參考第3 a圖 其中的“九”字就有兩種不同的筆劃順序,其中, 析手寫 •析的, 了同一 呈現出 法會出 致不同 筆劃相 字元的 爲正確 巧題1 ) 現多種 劃順序 實際的 順序的 衆的技 所示, 用虛線 22 200937308 表示兩個筆劃之間的連接。 同樣,手寫東亞字元的書寫分格更是多種多樣,例如, 參考第3 b圖所示,同樣的漢字“复”,就可能出現很多種 不同的書寫風格。200937308 HMM Topology Suitable for Recognizing Handwritten East Asian Characters The present invention is mainly directed to the recognition technique of handwritten East Asian characters. The characteristics of East Asian characters can be summarized as the following main aspects: 1) The stroke sequence is diverse and the writing style is diverse. Just as the S handwritten East Asian character has many strokes and the font structure is complicated, the word will appear in multiple stroke sequences, and the final written word will have multiple fonts or fonts, which is called handwriting style. 2) Due to the diversity of writing styles, the difference between the same word is as follows: 2a) For different simplified writing of local continuous strokes, the writing style of the guide may have different stroke numbers. 2b) In the process of connecting between real strokes, it can be connected in real or virtual strokes, and for real East Asian users, regardless of which strokes are connected, recognized characters, this is It is difficult to identify the machine. The first problem to be solved in handwritten East Asian character recognition is that the stroke sequence is diverse and the handwriting style is diverse. The same Chinese character, the writing habits of different people determine the order in which the strokes will be drawn. Although the written pens are fixed according to the standard of Chinese characters, the actual statistics show that most Chinese characters have several writing sequences in the application. And, every type of writing population has a considerable proportion. As a handwriting recognition, it is necessary to consider this situation. According to an example, there are two different stroke sequences in the "nine" character in Figure 3a. Among them, the handwriting and analysis of the same presentation method will result in different strokes and characters. 1) Now that the various sequences of the actual sequence are shown, the connection between the two strokes is indicated by the dashed line 22 200937308. Similarly, the handwriting of handwritten East Asian characters is more diverse. For example, referring to Figure 3b, the same Chinese character “complex” may have many different writing styles.

一種實現方式是爲每一種筆劃順序、每一種手寫風格 都構建一個HMM拓撲結構,這樣做的資料量和運算複雜 度比較大,並且,另一個問題是這樣方案將給實現和訓練 都帶來一定的難度。 爲此,本發明提供一種多路徑的HMM拓撲結構,對於 相同的字元,只提供一個HMM拓撲結構(HMM模型), 使用其中的路徑來表現不同的筆劃順序或者是手寫風格。 參考第4a圖,第4a圖示出了根據本發明的一實施例, 所提出的一種適用於進行手寫東亞字元識別的多路徑 HMM拓撲結構的結構圖。參考第4a圖所示,該多路徑HMM 拓撲結構對應於一個手寫東亞字元;多路徑HMM拓撲結 構中的每一條路徑對應手寫東亞字元的多種筆劃順序中的 一個;或者多路徑HMM拓撲結構中的每一條路徑對應手 寫東亞字元的多種手寫風格中的一個。 比如,參考第 4b圖,根據第 3a圖中所使出的漢字 “九”的兩個手寫示例所構建的多路徑HMM拓撲結構具 有2條路徑,其中的每一條對應“九”字的一種筆劃順序。 參考第4c圖,根據第3b圖所示出的漢字“复”的多 個手寫示例,在構建用於漢字“复”的多路徑HMM拓撲 結構之後,聚類出兩種具有典型特徵的手寫風格(聚類的 23 200937308 過程將在下面詳細描述),這樣,該多路徑HMM拓撲結構 具有兩條路徑,其中的每一條對應一種手寫風格的“复” 字。 如上面所說的,由於書寫風格的多樣性所帶來的問 題,其中一個即是2 a )對局部連續筆劃的不同的簡化寫 法,導致不同的書寫風格可能具有不同的筆劃數。每一條 路徑上的狀態數是由這條路徑所對應的書寫風格的筆劃數 確定的。東亞字元在相鄰筆劃間具有明顯的方向轉變特 徵,即“轉角”,而持續筆劃與轉角是交替出現的。 根據本發明,首先需要通過對某一路徑所對應的訓練 樣本進行轉角的檢測,統計出這種書寫風格的典型筆劃 數,從而確定該條路徑的狀態數,具體的步驟會在下面進 行詳細的描述。正是由於持續筆劃和轉角是交替出現這一 特點,可以利用曲率尺度空間(C S S )中的精細化演算法 (coarse-to-fine algorithm )來實現一個路徑中狀態數量的 確定。 對於每一條具體路徑而言,本發明還在路徑中引入了 “轉角狀態”用來描述轉角處的短暫的筆劃轉變特徵。在 同一種書寫風格所對應的一條路徑上,持續筆劃對應於 “持續狀態”,轉角對應於“轉角狀態”,持續狀態與轉 角狀態是交替出現的。由於筆劃轉角處的短時性,本發明 將“轉角狀態”設爲不自轉,這種不自轉的特性結合持續 筆劃特徵與轉角特徵的差異,可以明顯改善訓練樣本的特 徵序列與路徑狀態之間的正確的對應關係。 24 200937308 再研究手寫東亞字元的特徵,可以發現,手寫東亞字 元在完成一個筆劃,開始下一個筆劃之前,會出現一個顯 著的轉向的過程’也就是上面所說的“轉角”。參考第2a 圖所示,第2a圖以一個十分簡單的漢字“上,,的一個手寫 示例爲例。在第 2a圖的示例中,“上”字的實筆劃是 3 劃’就是第2 a圖中所顯示的筆跡,虛筆劃有2劃,在第 2a圖中用帶箭頭的虛線表示。這樣,在總共5個筆劃(3 個實筆劃、2個虛筆劃)之間,出現了 4個轉角2〇2a_2〇2d, 在圖中使用虛線框的圓圈表示。 於是,本發明提供了一種改進的HMM模型,利用這種 HMM模型’能很好地對東亞字元的轉角進行建模。 根據本發明,提供一種利用HM Μ模型識別手寫東亞字 元的方法’提供對應於手寫東亞字元的ΗΜΜ拓撲結構; 其中 在ΗΜΜ拓撲結構中提供描述手寫東亞字元持續筆劃 的持續狀態;以及’在ΗΜΜ拓撲結構中提供描述手寫東 S字元筆劃間轉角的轉角狀態。 上述的ΗΜΜ拓撲結構被構建成左向右ΗΜΜ拓撲結 構從—起始狀態開始,至一終止狀態結束。ΗΜΜ拓撲結 構中的持續狀態可以轉移至下一狀態或者自轉移,而轉角 狀態只能轉移至下一狀態,不能自轉移.持續狀態和轉角 狀態依次交替存在。 上述的這個改進的ΗΜΜ拓撲結構符合手寫東亞字元 的特點,能很好地反映手寫東亞字元的特徵。 25One implementation method is to construct an HMM topology for each stroke order and each handwriting style. The data volume and operation complexity are relatively large, and another problem is that such a scheme will bring certain implementation and training. Difficulty. To this end, the present invention provides a multipath HMM topology in which only one HMM topology (HMM model) is provided for the same character, using the paths therein to represent different stroke sequences or handwritten styles. Referring to Fig. 4a, Fig. 4a shows a block diagram of a proposed multipath HMM topology suitable for handwritten East Asian character recognition, in accordance with an embodiment of the present invention. Referring to FIG. 4a, the multipath HMM topology corresponds to a handwritten East Asian character; each path in the multipath HMM topology corresponds to one of a plurality of stroke sequences of handwritten East Asian characters; or a multipath HMM topology Each of the paths corresponds to one of a plurality of handwritten styles of handwritten East Asian characters. For example, referring to FIG. 4b, the multi-path HMM topology constructed according to the two handwritten examples of the Chinese character "nine" in FIG. 3a has two paths, each of which corresponds to a stroke of the "nine" word. order. Referring to FIG. 4c, according to the multiple handwriting examples of the Chinese character "complex" shown in FIG. 3b, after constructing the multipath HMM topology for the Chinese character "complex", two handwritten styles with typical features are clustered. (The clustering 23 200937308 process will be described in detail below), such that the multipath HMM topology has two paths, each of which corresponds to a handwritten style "complex" word. As mentioned above, one of the problems caused by the diversity of writing styles is 2 a) different simplified writing of local continuous strokes, resulting in different writing styles may have different stroke numbers. The number of states on each path is determined by the number of strokes in the writing style corresponding to this path. East Asian characters have obvious directional transition characteristics between adjacent strokes, that is, "turning angles", while continuous strokes and corners alternate. According to the present invention, it is first necessary to detect the number of strokes of the writing style by detecting the corners of the training samples corresponding to a certain path, thereby determining the number of states of the path, and the specific steps will be detailed below. description. It is precisely because of the fact that continuous strokes and corners alternate, the coarse-to-fine algorithm in the curvature scale space (C S S ) can be used to determine the number of states in a path. For each particular path, the present invention also introduces a "corner state" in the path to describe the short stroke transition feature at the corner. On a path corresponding to the same writing style, the continuous stroke corresponds to the "continuous state", the corner corresponds to the "corner state", and the continuous state and the corner state alternate. Due to the short-term nature at the corner of the stroke, the present invention sets the "corner state" to be non-rotating. This non-rotation characteristic combined with the difference between the continuous stroke feature and the corner feature can significantly improve the relationship between the feature sequence and the path state of the training sample. The correct correspondence. 24 200937308 After studying the characteristics of handwritten East Asian characters, we can find that the handwritten East Asian characters will have a significant turning process before completing a stroke and starting the next stroke, which is the "turning angle" mentioned above. Referring to Fig. 2a, Fig. 2a is an example of a handwritten example of a very simple Chinese character "上,". In the example of Fig. 2a, the real stroke of the word "up" is 3 strokes, which is the 2a. In the handwriting shown in the figure, the virtual stroke has 2 strokes, which is indicated by the dotted line with arrows in Figure 2a. Thus, between the total of 5 strokes (3 real strokes and 2 virtual strokes), 4 appear. The corner 2〇2a_2〇2d is represented by a circle with a dashed box in the figure. Thus, the present invention provides an improved HMM model with which the angle of the East Asian character can be well modeled. The present invention provides a method for recognizing handwritten East Asian characters using the HM Μ model to provide a ΗΜΜ topology corresponding to handwritten East Asian characters; wherein a continuation state describing handwritten East Asian character continuation strokes is provided in the ΗΜΜ topology; The corner state describing the corner between the handwritten East S-character strokes is provided in the topological structure. The above-mentioned ΗΜΜ topology is constructed as a left-to-right ΗΜΜ topology starting from the initial state to a termination state The continuous state in the topology can be transferred to the next state or self-transition, and the corner state can only be transferred to the next state, which cannot be self-transferred. The continuous state and the corner state alternately exist one after the other. The structure conforms to the characteristics of handwritten East Asian characters and can well reflect the characteristics of handwritten East Asian characters.

200937308 參考第2b圖,第2b圖示出了根據本發明的上述 從第2a圖的“上”字筆跡中構建的對應於“上” HMM拓撲結構。 第2a圖所示的“上”字手寫示例中的5個筆劃’ 3個實筆劃和2個虚筆劃分別對應第2b圖中的5個持 態204a-204e,持讀狀態204a-204e的每一個都可以自 或者向下一個狀態轉移° 繼續參考第2b圖’第2a圖所示的4個轉角202a-分別對應第2b圖中的4個轉角狀態206a-206d,轉角 206a-206d不能自轉移’只能轉移到下一狀態。並且 角狀態206a-:206d插入到持續狀態204a-204e之間。 在另一個實施例中’第2b圖所示的該HMM拓撲 還可以起始於一個起始狀態’這個狀態對應於起筆 刻,並且,如果存在起始狀態的話,該起始狀態是不 轉移的。 问栋· 中也可以結束於終止狀態’終止狀態代表收筆的時 果使用終止狀態的話,終止狀態也不能自轉移的。 這樣,第2a圖所示的“上” 上 的手寫示例就被構 第2b圖所示的HMM拓撲結掁, 稱該ΗMM拓撲結構 面的特徵:是左向右ΗΜΜ拓揸蛙进, 撲結構,從一起始狀態 至一終止狀態結束,ΗΜΜ拓捶紝 撲、、口構中的持續狀態可 至下一狀態或者自轉移,而 m ^ φ ^ ^ 锝角狀態只能轉移至 i、’不能自轉移,持續狀態和 轉角狀態依次交替存> 原理 字的 包括 續狀 轉移 202d 狀態 ,轉 結構 的時 能自 施例 ,如 成了 合下 始, 轉移 一狀 第2b圖所示的HMM拓撲結構的另 26 200937308 需要說明的是,在第2 a圖所示的“上”字的手寫示例 中,根據本發明定義的實筆劃的數量與規範漢字寫法中的 筆劃數量是相等的,但是,對於有一些漢字或者漢字筆劃 來說,根據本發明所定義的筆劃數量(包括實筆劃和虛筆 ' 劃)與規範漢字寫法中的筆劃數量可能是不相等的。參考 . 第2 c圖所示出的示例,“ 口”字,根據規範漢字寫法,共 有3劃,但是根據本發明的定義,實筆劃共有4劃,虛筆 ^ 劃共有2劃。主要的區別在於“ π ”這一筆,在規範漢字 〇 中,這是一個筆劃,而根據本發明的定義,“ Π ”包含了 2個實筆劃和1個轉角。 因此,根據本發明的定義,第2c圖所示的“口 ”字的 手寫示例包括實筆劃212a-212d,虛筆劃214a和214b,以 及轉角 2 1 6a-2 1 6e。排列的順序是:實筆劃 2 1 2a、轉角 216a、虛筆劃214a、轉角216b、實筆劃212b、轉角216c、 實筆劃212c、轉角216d、虛筆劃214b、轉角216e、實筆 劃 2 1 2 d。 © 在本發明中,所說的筆劃是按照上面的術語定義中所 定義的筆劃。 回到前面所提到的多路徑的HMΜ拓撲結構,由於主要 適用於手寫東亞字元的識別,因此在每一條ΗΜΜ路徑中, ' 仍然會提供上述的“轉角”狀態。 回到第4a圖,在該多路徑ΗΜΜ拓撲結構中,由於每 一個路徑都是對應於手寫的東亞文字,因此這些路徑中的 每一條都具有如下的特點: 27 200937308 每一條路徑是左向右HMM拓撲結構,從一起始狀態開 始,至一終止狀態結束; ΗMM拓撲結構包括描述手寫東亞字元持續筆劃的持續 狀態,以及描述手寫東亞字元筆劃間轉角的轉角狀態,其 中持續狀態可以轉移至下一狀態或者自轉移,轉角狀態只 能轉移至下一狀態,不能自轉移;持續狀態和轉角狀態依 次交替存在; 對於整個多路徑ΗΜΜ拓撲結構來說,其中所有路徑起 始於同一入口狀態,至同一出口狀態結束。 在第4 a圖中,每一個圓圈代表一個狀態,帶箭頭的線 段表示轉移,其中,指向自己的箭頭表示自轉移,具有狀 態自轉移的狀態是持續狀態,不能自轉移的狀態是轉角狀 態。所有路徑都開始於公共的入口狀態,並結束於公共的 出口狀態。 需要說明的是,雖然在上面所示的示例中,多路徑HMM 拓撲結構中的每一個路徑都是符合前面所定義的特點的路 徑,並且,多路徑HMM拓撲結構中的路徑被示爲用來一 致地對應多種筆劃順序或者手寫風格。但是,本領域的技 術人員能夠理解,本發明的範圍決不限制與此,本發明中 的多路徑HMM拓撲結構的寬泛的限定應該是: 至少有一條路徑是左向右HMM拓撲結構,從一起始狀 態開始,至一終止狀態結束; 對於整個多路徑HMM拓撲結構來說,其中所有路徑起 始於同一入口狀態,至同一出口狀態結束。 28200937308 Referring to Figure 2b, Figure 2b shows the above-described "upper" HMM topology constructed from the "upper" handwriting of Figure 2a in accordance with the present invention. The five strokes in the "upper" handwriting example shown in Fig. 2a '3 real strokes and 2 virtual strokes respectively correspond to the five holding states 204a-204e in the second drawing, and each of the holding states 204a-204e One can be transferred from the next state or the next state. Continue to refer to the four corners 202a shown in Fig. 2b's Fig. 2a - corresponding to the four corner states 206a-206d in Fig. 2b, respectively. The corners 206a-206d cannot be transferred. 'Can only be transferred to the next state. And the angular states 206a-: 206d are inserted between the persistent states 204a-204e. In another embodiment, the HMM topology shown in FIG. 2b may also start in a starting state 'this state corresponds to the styling, and if there is an initial state, the starting state is not transferred. . It is also possible to end in the termination state. The termination state represents the time of the receipt. If the termination state is used, the termination state cannot be transferred. Thus, the handwriting example on the "upper" shown in Fig. 2a is constructed as the HMM topological graph shown in Fig. 2b, which is called the feature of the topological structure of the ΗMM: the left-to-right ΗΜΜ 揸 揸 ,, 扑 结构 structure From the initial state to the end state, the continuous state in the mouth, the mouth state can go to the next state or self-transition, and the m ^ φ ^ ^ corner state can only be transferred to i, 'can't Self-transition, continuous state and corner state are alternately stored> The principle word includes the continuation transition 202d state, and the time of the structure can be self-executed, such as the beginning of the merger, the transfer of the HMM topology shown in Figure 2b Another 26 of the structure 200937308 It should be noted that in the handwritten example of the "upper" word shown in Fig. 2a, the number of real strokes defined according to the present invention is equal to the number of strokes in the canonical Chinese character writing, however, For some Chinese characters or Chinese character strokes, the number of strokes (including real strokes and virtual strokes) defined in accordance with the present invention may not be equal to the number of strokes in standard Chinese characters. Reference. The example shown in Fig. 2c, the word "mouth", according to the canonical Chinese character writing, has a total of three strokes, but according to the definition of the present invention, the real stroke has a total of 4 strokes, and the virtual stroke has a total of 2 strokes. The main difference is the "π" pen, which is a stroke in the canonical Chinese character ,, and according to the definition of the present invention, "Π" contains 2 real strokes and 1 corner. Thus, in accordance with the definition of the present invention, handwritten examples of the "mouth" word shown in Figure 2c include real strokes 212a-212d, virtual strokes 214a and 214b, and corners 2 1 6a-2 1 6e. The order of arrangement is: real stroke 2 1 2a, corner 216a, virtual stroke 214a, corner 216b, real stroke 212b, corner 216c, real stroke 212c, corner 216d, virtual stroke 214b, corner 216e, real stroke 2 1 2 d. © In the present invention, the stroke is a stroke defined in the definition of the above terms. Returning to the multipath HMΜ topology mentioned above, since it is mainly suitable for the recognition of handwritten East Asian characters, 'the above-mentioned 'corner' state will still be provided in each of the ΗΜΜ paths. Returning to Figure 4a, in the multipath ΗΜΜ topology, since each path corresponds to handwritten East Asian characters, each of these paths has the following characteristics: 27 200937308 Each path is left to right The HMM topology starts from an initial state and ends at a termination state; the ΗMM topology includes a description of the continuation state of the handwritten East Asian character continuation stroke, and a description of the corner state of the handwritten East Asian character stroke between corners, wherein the continuation state can be transferred to In the next state or self-transition, the corner state can only be transferred to the next state, and cannot be self-transferred; the continuous state and the corner state alternately exist in sequence; for the entire multi-path topology, in which all paths start in the same entry state, End to the same exit state. In Fig. 4a, each circle represents a state, and the line segment with an arrow indicates a transition, wherein the arrow pointing to itself indicates self-transition, the state with state self-transition is a continuous state, and the state not capable of self-transition is a corner state. All paths begin in a public entry state and end in a public exit state. It should be noted that although in the example shown above, each path in the multipath HMM topology is a path that conforms to the previously defined features, and the path in the multipath HMM topology is shown as being used Consistently corresponds to multiple stroke sequences or handwritten styles. However, those skilled in the art will appreciate that the scope of the present invention is by no means limited thereto. The broad definition of the multipath HMM topology in the present invention should be: At least one path is a left-to-right HMM topology, together The start state begins and ends with a termination state; for the entire multipath HMM topology, where all paths start at the same entry state and end to the same exit state. 28

200937308 並且,描述多種不同的筆劃順序和不同的手寫風格 路徑可以被放置在同一個多路徑HMM拓撲結構中,也 是說,在一個多路徑HMM拓撲結構中,可以由部分的 徑對應不同的筆劃順序,而另一部分的路徑對應不同的 寫風格。 在一種情況中,由於手寫東亞字元中“轉角”特性 作用,至少有一條路徑包括描述手寫東亞字元持續筆劃 持續狀態,以及描述手寫東亞字元筆劃間轉角的轉角 態,其中持續狀態可以轉移至下一狀態或者自轉移,轉 狀態只能轉移至下一狀態,不能自轉移;持續狀態和轉 狀態依次交替存在。 由於本發明的 HMM拓撲結構是從訓練資料中自行 成,而訓練資料中的樣本可能存在很大的冗餘性,或者 中可能記錄了 一些極低概率的偶然性資料,這樣,就可 使得所得到的多路徑HMM拓撲結構中,存在冗餘路徑 或者只有少量手寫資料覆蓋的路徑,這將使得模型資料 大量增加,增加整個HMM拓撲結構的複雜度。爲此, 發明還需要對路徑進行合併處理,消除冗餘的路徑和基 不會使用到的路徑。關於路徑合併的具體實現方法在下 會詳細描述。還需要說明的一點是,對於本領域的技術 員來說,完全可以通過已有的手段來在一個HMM拓撲 構中實現多路徑,因此,本發明在此處主要是提供一種 型的HMM拓撲結構以及實現這種新型的HMM拓撲結構 多路徑HMM模型,因此,本領域的技術人員完全可以 的 就 路 手 的 的 狀 角 角 生 其 能 9 量 本 本 面 人 結 新 的 在 29 200937308 閱讀了本發明的說明書之後實現這裏所述的多路徑 模型而不需要再做出任何的創造性勞動。 現在,本發明的方法已經通過提供“轉角”狀態 地解決了手寫東亞字元的筆劃間連接問題,提供多路 • HMM拓撲結構解決了多種書寫順序和多種書寫風格 . 題。最後一彳ΐ需要解決的問題,也就是問題2 b ),在 劃之間的連接過程中,可以以實筆劃相連,也可以以 ❹ 劃相連,而對於真正的東亞字元的使用者來說,無論 種筆劃相連,都被認可爲正確的字元,這種書寫可變 機器識別帶來了難度。舉例說明,參考第5 a圖所示, “上”字來說,在寫完“ | ” ,到下一筆“一 ”之間 能會出現兩種連接方式,實筆劃連接,例如第5a圖中 的實筆劃5 02,或者虛筆劃連接,例如第5 a圖中所示 筆劃 504,當然,可能還有其他的連接方式,比如部 實筆劃加上部分的虛筆劃,此處,爲了簡明地進行說 暫時以兩種情況爲例進行說明。 〇 第5a圖所示的情況可以稱之爲“虛筆劃/實筆劃 確定性”。這也是本發明所要解決的問題之一。 對於這種情況,本發明提供兩種解決的方案。 ♦ 第一種方式,是在HMM拓撲結構中提供平行狀態 應手寫東亞字元中既可能是虛筆劃,也可能是實筆劃 分。 參考第5b圖所示,其示出了根據本發明的一實施 一種具有平行狀態的HMM拓撲結構的結構圖。其中 HMM 很好 徑的 的問 實筆 虛筆 以哪 性給 對於 ,可 所示 的虛 分的 明, 的不 ,對 的部 例的 ,在 30200937308 Also, it is described that a variety of different stroke sequences and different handwritten style paths can be placed in the same multi-path HMM topology, that is, in a multi-path HMM topology, partial strokes can correspond to different stroke sequences. And the other part of the path corresponds to a different writing style. In one case, due to the "turning" characteristic of the handwritten East Asian character, at least one path includes a description of the continuous stroke state of the handwritten East Asian character, and a corner state describing the angle between the handwritten East Asian character strokes, wherein the continuous state can be transferred. To the next state or self-transition, the transition state can only be transferred to the next state, and cannot be self-transferred; the persistent state and the transition state alternately exist in turn. Since the HMM topology of the present invention is self-contained from the training data, the samples in the training data may have great redundancy, or some extremely low probability contingency data may be recorded, so that the obtained In the multi-path HMM topology, there are redundant paths or paths with only a small amount of handwritten data, which will increase the model data and increase the complexity of the entire HMM topology. To this end, the invention also needs to merge the paths to eliminate redundant paths and paths that are not used by the base. The specific implementation of path merging will be described in detail below. It should also be noted that, for those skilled in the art, multipath can be implemented in an HMM topology by existing means. Therefore, the present invention mainly provides a type of HMM topology and Realizing this new type of HMM topology multi-path HMM model, therefore, those skilled in the art can fully understand the angle of the road hand, and can read the present invention at 29 200937308. The multipath model described herein is implemented after the specification without any additional creative effort. Now, the method of the present invention has solved the problem of inter-stroke connection of handwritten East Asian characters by providing a "corner" state, and provides a multi-channel HMM topology to solve various writing sequences and various writing styles. The last problem that needs to be solved, that is, problem 2 b), can be connected by real strokes or connected by strokes during the connection between strokes, and for users of true East Asian characters. Regardless of the type of strokes, it is recognized as the correct character. This kind of writing variable machine recognition brings difficulty. For example, referring to Figure 5a, for the word "up", two connections can be made between the completion of "|" and the next "one", and the strokes are connected, for example, in Figure 5a. Real strokes 5 02, or virtual strokes, such as strokes 504 shown in Figure 5 a, of course, there may be other connections, such as real strokes and some virtual strokes, here, for the sake of concise Say that the two situations are taken as an example for the time being.情况 The situation shown in Figure 5a can be called “virtual stroke/real stroke certainty”. This is also one of the problems to be solved by the present invention. For this case, the present invention provides two solutions. ♦ The first way is to provide a parallel state in the HMM topology. The handwritten East Asian characters may be either virtual strokes or real strokes. Referring to Figure 5b, there is shown a block diagram of an HMM topology having parallel states in accordance with an implementation of the present invention. Where the HMM is very good, the question of the pen is given by the character, and the imaginary part of the figure can be shown, in the case of the right part, at 30

200937308 狀態 5 0 6和 5 0 8之間的狀態 5 1 0具有一個平 5 1 Ob,狀態5 0 6可以轉移到狀態5 1 0或者5 1 Ob之 一個,狀態5 1 0或者5 1 0 b之中的任何一個也可以 態 50 8。 需要說明的是,本發明所提出的平行狀態的 前所提出的具有持續狀態和轉角狀態的HMM拓 合使用,比如,HMM拓撲結構中的任何一個持續 轉角狀態都可以具有平行的狀態。 上述的平行狀態的概念也可以應用到多路徑 撲結構中,即,在多路徑HMM拓撲結構的任何 中,都可能有一個或者數個的狀態是具有平行狀ί 第5c圖示出了根據第5a圖的手寫示例所構 平行狀態的HMM拓撲機構的示意圖,其中只示 狀態的相關部分。其中,持續狀態5 1 2代表實筆劃 持續狀態5 1 8代表實筆劃“ 一”。對於第5 a圖中 能爲實筆劃也可能爲虛筆劃的部分,在第5c | HMM拓撲結構中,採用持續狀態520代表第5a 筆劃502,用持續狀態520b代表第5b圖中的虛奪 在第5 c圖所示的實施例中,轉角狀態5 1 4和5 1 6 一個平行的轉角狀態5 1 4b和5 1 6b。轉角狀態是 平行狀態與其所對應的轉角在字跡中所處的位置 果轉角的位置更加接近可能爲實筆劃也可能爲虛 分,那麼這個轉角所對應的轉角狀態也會具有一 態,相應的,如果轉角的位置遠離可能爲實筆劃 行的狀態 中的任何 轉移到狀 可以與之 撲結構結 狀態或者 HMM拓 一條路徑 的。 建的具有 出了平行 所示的可 圖所示的 圖中的實 :t>] 5 04 ° 同樣具有 否具有一 有關,如 筆劃的部 個平行狀 也可能爲 31 200937308 虛筆劃的部分,那麼這個轉角所對應的轉移狀態就沒有平 行狀態。通常,在例如第5 c圖所示的情況的Η Μ Μ拓撲結 構中,轉角狀態5 1 4和5 1 6各自具有一個平行狀態5 1 4b 和5 1 6b。在其他的實施例中,可能是5 1 6具有平行的狀態 而5 1 4沒有,或者5 1 4具有平行狀態而5 1 6沒有。當然, 本發明也不排除兩個轉角狀態都沒有平行狀態的情況。200937308 State 5 0 6 and 5 0 8 State 5 1 0 has a flat 5 1 Ob, state 5 0 6 can be transferred to state 5 1 0 or 5 1 Ob one, state 5 1 0 or 5 1 0 b Any one of them can also be 50 8 . It should be noted that the HMM with the continuous state and the corner state proposed in the parallel state proposed by the present invention is used in combination, for example, any one of the continuous corner states in the HMM topology may have a parallel state. The concept of the parallel state described above can also be applied to a multi-path flap structure, that is, in any of the multi-path HMM topologies, there may be one or several states having parallel states. FIG. 5c illustrates A schematic diagram of a HMM topology mechanism in a parallel state constructed by the handwriting example of Figure 5a, in which only the relevant portions of the state are shown. Among them, the continuous state 5 1 2 represents the real stroke. The continuous state 5 1 8 represents the real stroke "one". For the part of Figure 5 that can be a real stroke or a virtual stroke, in the 5c | HMM topology, the persistent state 520 represents the 5th stroke 502, and the persistent state 520b represents the virtual capture in the 5b diagram. In the embodiment shown in Fig. 5c, the corner states 5 1 4 and 5 1 6 have a parallel corner state 5 1 4b and 5 16b. The corner state is a state in which the parallel state and its corresponding corner are closer to the position of the corner of the corner in the handwriting, which may be a real stroke or a virtual point. Then the corner state corresponding to the corner will also have a state, correspondingly, If the position of the corner is far from any state in the state that may be a real stroke, it may be linked to the state of the structure or the HMM. Built in the figure shown in the parallel diagram: t>] 5 04 ° also has a correlation, such as the stroke of the parallel part may also be 31 200937308 virtual stroke part, then The transition state corresponding to this corner has no parallel state. In general, in the Η Μ topology of the case shown, for example, in Fig. 5c, the corner states 5 1 4 and 5 16 each have a parallel state 5 1 4b and 5 1 6b. In other embodiments, it may be that 5 16 has a parallel state and 5 14 does not, or 5 14 has a parallel state and 5 16 does not. Of course, the present invention does not exclude the case where both corner states are not parallel.

提供平行狀態的一種實現形式是採用雙高斯混合模型 (GMM ),即利用高斯函數的分佈特性,使得實筆劃和虛 筆劃分別對應不同的高斯函數的峰值。 第二種解決實筆劃/虛筆劃不確定性的方法是對HMM 拓撲結構應用多空間概率分佈(MSD),對應手寫東亞字元 中既可能是虛筆劃,也可能是實筆劃的部分。MSD是一種 常用的演算法,因此在本發明中不具體描述其詳細的原 理,將MSD應用到HMM拓撲結構上,可以根據下述的方 式來實現:對於每一個HMM拓撲結構,定義兩個空間, 分別對應實筆劃和虛筆劃。通過M S D可以對不確定的部分 分別在兩個空間内進行多空間概率分佈的計算,得到對應 實筆劃的第一度量值和對應虛筆劃的第二度量值,通過對 於度量值的處理來解決實筆劃/虛筆劃不確定性的問題。 這樣,本發明有效地針對手寫東亞字元的主要特點提 供了一種識別的技術,能夠很好地適應手寫東亞字元的特 徵。但是,還面臨的一個問題是模型資料量的問題。就如 上面所描述的,對於每一個手寫東亞字元,會提供一個多 路徑的ΗΜΜ拓撲結構,並且,每一條路徑中,都會提供 32 200937308 數個狀態來對應每一個筆劃,在筆劃之間還需要提供“轉 角”狀態,對於存在實筆劃/虛筆劃不確定性的部分,還需 要提供平行狀態或者應用MSD技術。這些都將導致HMM 模型的資料量非常龐大。爲了有效地節省資料空間,降低 本發明的實現成本,還需要考慮資料壓縮的問題。 雖然東亞字元相互之間的相似性不是很明顯,但是如 果將東亞字元分割成數個部分,這些局部的結構還是具有 不少相似之處的。利用這個特點,能夠實現HMM拓撲結 構中狀態的聚類,從而減小資料量,降低HMM拓撲結構 的複雜度。 根據本發明,在構建HMM拓撲結構之後還會對HMM 拓撲拓撲結構中的狀態進行聚類,使HMM拓撲拓撲結構 中的至少一組狀態共用參數,且對於一組共用參數的狀 態,在HMM拓撲結構中只保存一套參數。 狀態的聚類可能在同一路徑上的狀態之間進行,也可 能在不同路徑上的狀態之間進行,被認爲是可以聚類的狀 態,HMM拓撲結構中將只爲所有這些狀態保留一套參數。 這樣就可以有效地減少資料量,降低HMM拓撲結構的 複雜度。 適於識別手寫東亞字元的HMM拓撲結構的實現方法 對於上面所介紹的HMM拓撲結構,可以按照下面所描 述的方法來實現: 33 200937308 1 )産生字根訓練資料 在進行HMM訓練過程之前,需要對資料進行標記,對 於東亞子元來說’資料通常是採用字元而不是字根來進行 標記。而對於本發明的識別方法來說,建立的HMm拓撲 結構中更需要利用的是由字根進行標記的資料。維特比 (Viterbi )解碼的一個附加效果就能提供筆跡樣本和hmm 狀態之間的對應關係。因此,借助維特比解碼,就能夠自 動從字元資料中獲得對應於字根的筆跡資料,比如,參照 對應字根的HMM模型中對應關係的分界點,將筆跡樣本 進行分解。 爲了做上述的切分得到字根資料,首先要得到一個初 步的HMM模型。此模型雖然可能不一定能非常準確的識 別手寫字元,但能相對準確的得到字根的分界點。該初步 的HMM模型從單一路徑開始構建,並逐漸地分割出越來 越多的路徑直到路徑數足夠爲止,路徑數是否足夠採用路 徑的收斂度量來衡量。該方法的一個示例如下:One implementation that provides a parallel state is to use a double Gaussian mixture model (GMM), which uses the distribution characteristics of the Gaussian function such that the real and virtual strokes correspond to the peaks of different Gaussian functions, respectively. The second method to solve the real stroke/virtual stroke uncertainty is to apply a multi-space probability distribution (MSD) to the HMM topology, which may be either a virtual stroke or a real stroke in the handwritten East Asian character. MSD is a commonly used algorithm, so the detailed principle is not specifically described in the present invention. Applying the MSD to the HMM topology can be implemented according to the following method: For each HMM topology, two spaces are defined. , corresponding to real strokes and virtual strokes. The MSD can calculate the multi-space probability distribution in two spaces for the uncertain part, and obtain the first metric value corresponding to the real stroke and the second metric value corresponding to the virtual stroke, which is solved by processing the metric value. The problem of real stroke/virtual stroke uncertainty. Thus, the present invention effectively provides an identification technique for the main features of handwritten East Asian characters, which is well adapted to the characteristics of handwritten East Asian characters. However, one problem that is still faced is the problem of the amount of data in the model. As described above, for each handwritten East Asian character, a multipath ΗΜΜ topology is provided, and each path provides 32 200937308 states for each stroke, and between strokes. It is necessary to provide a "corner" state, and for the part where there is uncertainty in the real stroke/virtual stroke, it is also necessary to provide a parallel state or apply MSD technology. These will result in a very large amount of data for the HMM model. In order to effectively save the data space and reduce the implementation cost of the present invention, it is also necessary to consider the problem of data compression. Although the similarity between East Asian characters is not obvious, if the East Asian characters are divided into several parts, these partial structures still have many similarities. With this feature, clustering of states in the HMM topology can be achieved, thereby reducing the amount of data and reducing the complexity of the HMM topology. According to the present invention, after constructing the HMM topology, the states in the HMM topology topology are also clustered such that at least one set of states in the HMM topology topology share parameters, and for a set of shared parameter states, in the HMM topology Only one set of parameters is saved in the structure. The clustering of states may be between states on the same path, or between states on different paths, and is considered to be a clusterable state. Only one set of all these states will be reserved in the HMM topology. parameter. This can effectively reduce the amount of data and reduce the complexity of the HMM topology. Implementation Method for HMM Topology Suitable for Recognizing Handwritten East Asian Characters For the HMM topology described above, it can be implemented as described below: 33 200937308 1 ) Generate root training data before the HMM training process is required Marking data, for East Asian children, 'data is usually marked with characters instead of roots. For the identification method of the present invention, it is more necessary to use the data marked by the root in the established HMm topology. An additional effect of Viterbi decoding provides a correspondence between handwriting samples and hmm states. Therefore, with the Viterbi decoding, the handwriting data corresponding to the root can be automatically obtained from the character data, for example, the handwriting sample is decomposed by referring to the boundary point of the corresponding relationship in the HMM model of the corresponding root. In order to do the above-mentioned segmentation to obtain the root data, we must first obtain a preliminary HMM model. Although this model may not be able to identify the handwriting very accurately, it can obtain the demarcation point of the radical relatively accurately. The preliminary HMM model is built from a single path and gradually divides more and more paths until the number of paths is sufficient, and whether the number of paths is sufficient to measure by the convergence measure of the path. An example of this method is as follows:

a) 初始化一具有單一路徑的HMM (n=l); b) 對於HMM中已經存在的n條路徑P1、P2 Pn,計 算它們的收斂度量C(Pl)、c(P2)_..C(Pn),並選擇其中收敛 度量C(Pj)最大的路徑Pj ; c) 如果C(Pj) < T,T爲預定的收斂門限值,則說明對 於目前的訓練資料來說,這個HMM拓撲結構中的路徑數 已經足夠多,不必繼續分割路徑; d)如果 C(Pj) T ’則將路徑Pj複製,增加雜訊後構 34 200937308 建一條新的路蔣·,& μ 的路让,此時,ΗΜΜ模型中具有了 n+1條路徑; ) 1條路從的基礎上進行HMM模型的訓練,直 到無法再提鬲識別精产蛊也_ $ ^ J惰度爲止,至此,已經獲得了初步的 路徑HMM模型,腺空_农Λ.丨 子το資料作字根對齊並切分,得到相 對比較正確的字根資料 理$ιΙ /Λ 頁枓侍到的予根資料將被後面的步驟 用於訓練來得到識別年g宝分& 』予寫子兀的拓撲結構優化的HMM模 型。 2)最優路徑數確定 根據本法明,還提供一種統計特徵來解決對最優路徑 數確定的問題,該特徵稱之爲“子序列方向直方圖向量”。 對於每一個筆跡樣本,按弧長將其分成數個分段,每 一分段分別提取一特徵。其中,這裏所說的弧長是指所有 筆劃長度之和’就是所有的實筆劃加上所有的虛筆劃的長 度之和’分段的過程是基於這個求和之後的弧長。其中的 特徵該分段的形狀特徵。因此’經過分段和提取特徵的操 作之後,能夠獲取一個字跡樣本在每一個分段上的—# ^ ^ —種形 狀特徵。將每個分段上的特徵按順序連接,從而得到該筆 跡樣本的一個特徵。 筆跡樣本的母個为知上的形狀特徵可以實現爲子序列 方向直方圖向量。將筆跡樣本的分段進一步劃分成數個子 段,每^個子段確定一量化的方向特徵,每—個量化的方 向特徵對應一預定角度範圍的方向。每一個分段上的形狀 特徵爲〆子序列方向直方圖,每一個筆跡樣本的特徵爲一 子序列方向直方圖向量° 35 200937308a) Initialize an HMM with a single path (n=l); b) Calculate their convergence metrics C(Pl), c(P2)_..C for the n paths P1, P2 Pn already existing in the HMM Pn), and select the path Pj in which the convergence metric C(Pj) is the largest; c) If C(Pj) < T, T is the predetermined convergence threshold, this HMM topology is described for the current training data. The number of paths in the path is already enough, so you don't have to continue to split the path; d) If C(Pj) T ', copy the path Pj, add noise, and then build a new road, Jiang, & μ At this point, the ΗΜΜ model has n+1 paths;) 1 way to train the HMM model, until it can no longer be raised to identify the elite 蛊 _ $ ^ J inertia, so far, has been obtained The preliminary path HMM model, gland _ Λ Λ 丨 τ τ τ 资料 资料 资料 资料 τ τ τ τ τ τ τ τ τ τ τ τ τ τ τ τ τ τ τ τ τ τ τ τ τ τ τ τ τ τ τ τ τ τ τ τ τ The steps are used for training to obtain a topology-optimized HMM model for identifying the year-old g-scores & 2) Determination of the optimal number of paths According to the present method, a statistical feature is also provided to solve the problem of determining the optimal number of paths, which is called "subsequence direction histogram vector". For each handwriting sample, it is divided into several segments by arc length, and each segment is extracted with a feature. Here, the arc length referred to herein refers to the sum of the lengths of all the strokes 'that is the sum of the lengths of all the real strokes plus all the virtual strokes'. The process of segmentation is based on the arc length after this summation. Among them are the shape features of the segment. Therefore, after the operation of segmenting and extracting features, it is possible to obtain a -^^^-shaped feature of a handwriting sample on each segment. The features on each segment are connected in order to obtain a feature of the handwriting sample. The parent shape of the handwriting sample can be implemented as a subsequence direction histogram vector. The segment of the handwriting sample is further divided into a plurality of sub-segments, each of which determines a quantized directional feature, and each of the quantized directional features corresponds to a direction of a predetermined angular range. The shape feature on each segment is the histogram sequence direction histogram, and the feature of each handwriting sample is a subsequence direction histogram vector ° 35 200937308

參考第6a圖所示,說明了子序列方向直方圖的建立過 程。首先,字跡樣本“王” 6 0 2被按照弧長分割成了數個 分段,其中該弧長是實筆劃和虛筆劃相連之後的總弧長 6 04,在這些分段中,有一些只包括實筆劃,比如分段 604a,有一些只包括虛筆劃,比如分段604c,還有一些既 有實筆劃,也有虛筆劃,比如分段604b。以分段604b爲 例,其又可以被分割成數個字分段,對於這些子分段,根 據它們的方向(參考方向指標 606)和筆劃密度得到了子 序列方向直方圖。比如,604b的虛筆劃部分的一個分段 605a對應的子序列方向直方圖爲605b,604b的實筆劃部 分的一個分段607a對應的子序列方向直方圖爲607b。將 所有子分段的子序列方向直方圖組合在一起,就能得到對 應於每一個分段的子序列方向直方圖,進一步將每一個分 段的子序列方向直方圖組合,就能得到對應筆跡樣本的子 序列方向直方圖向量,比如第6a圖中的608。 在獲得了子序列方向直方圖向量之後,就能容易地對 上述特徵進行聚類,每一個聚類對應於一種筆劃順序或者 一種手寫風格。聚類的操作可以通過諸如高斯混合模型 (Guassian Mixture Model )來實現。 最終,將獲得m個聚類後的結果,m也是對應於一個 手寫東亞字元的優化的路徑數量。然後,基於每一個聚類 的資料確定對應於每一種筆劃順序或者每一種手寫風格的 HMM拓撲結構中的一路徑的參數。 參考第6b圖,第6b圖是進行訓練資料自動分類,更 36 200937308 確切地說是經過聚類操作的實例。第6 b圖中,從訓練資料 中總共提供了 9個樣本資料。通過資料自動分類,將它們 聚類成2個典型的類,代表兩個經過聚類處理之後的書寫 風格的“复”字。 ' 3)最優狀態數量的確定 • 在確定了路徑的數量之後,還需要確定路徑中狀態的 數量,根據本發明,依然採用來自資料的機器自學習的方 φ 法來確定狀態的數量。確定一個路徑中的狀態的數量時, 採用的資料是來自上述的步驟2)中同一個類中的筆跡樣 本。HMM中的狀態反映的是筆跡樣本的形態和變化,因 此,狀態包括兩類:對於方向變化不明顯(彎曲度不大) 的部分,稱之爲“持續筆劃”,對應HMM中的“持續狀 態”:對於方向變化明顯(彎曲度大)的部分,就是“轉 角”,對應於“轉角狀態”。一個路徑中狀態的數量就是 “持續筆劃”與“轉角”的數量之和,也就是“持續狀 態”的數量與“轉角狀態”的數量之和。逐漸精細化的曲 © 率尺度空間演算法(coarse-to-fine Curvature Scale Space ) 可被用於進行轉角的檢測,從而確定轉角的數量,同時, 由於轉角和持續筆劃是交替出現的,又能從轉角的數量推 算出持續筆劃的數量,這樣,狀態的數量就能被確定。 4)狀態連接設計 如前面所述的“虛筆劃/實筆劃的不確定性”的問 題,本發明提供兩種解決的方案。 第一種方式,是在HMM拓撲結構中提供平行狀態,對 37 200937308 應手寫東亞字元中既可能是虛筆劃,也可能是實筆劃的部 分。提供平行狀態的一種實現形式是採用雙高斯混合模型 (GMM ),即利用高斯函數的分佈特性,使得實筆劃和虛 筆劃分別對應不同的高斯函數的峰值,利用這樣的雙高斯 混合模型,就能夠解決解決實筆劃/虚筆劃不確定性的問 題。參考第6c圖,對於“木”字,有連筆和不連筆兩種寫 法,連筆的寫法對應“實筆劃”的情況,而不連筆的寫法 對應“虛筆劃”的情況,參考第6 c圖,實筆劃和虛筆劃分 別具有各自的高斯函數分佈,它們的峰值不同,因此將它 們組合可以獲得雙高斯混合模型,具有兩個不同的峰值, 從而實現平行狀態。 第二種方法,對 HMM拓撲結構應用多空間概率分佈 (MSD),對應手寫東亞字元中既可能是虛筆劃,也可能是 實筆劃的部分。 5)路徑合併 在上述的步驟完成之後,獲得了初步的多路徑的HMM 拓撲結構,其中包括了對應於訓練資料中所有情況的路 徑,包括反映筆劃順序和手寫風格的路徑,但是,由於訓 練資料中存在的冗餘資訊和極低概率的資訊,需要對所獲 得的路徑進行合併處理。路徑的合併處理可以包括兩個方 面:將出現概率極低的路徑去除;以及,將類似的路徑進 行合併。 將出現概率極低的路徑,也就是該路徑所對應的訓練 資料中的資料量較小,占訓練資料總量的比例低去除,去 38Referring to Figure 6a, the process of establishing a subsequence direction histogram is illustrated. First, the handwriting sample "King" 6 0 2 is divided into several segments according to the arc length, where the arc length is the total arc length 6 04 after the real stroke and the virtual stroke are connected. Among these segments, some are only Including real strokes, such as segment 604a, some include only virtual strokes, such as segment 604c, and some have both real strokes and virtual strokes, such as segment 604b. Taking segment 604b as an example, it can be further divided into a number of word segments for which a subsequence direction histogram is obtained based on their direction (reference direction indicator 606) and stroke density. For example, the subsequence direction histogram corresponding to one segment 605a of the virtual stroke portion of 604b is 605b, and the subsequence direction histogram corresponding to one segment 607a of the real stroke portion of 604b is 607b. Combining the subsequence direction histograms of all sub-segments, the sub-sequence direction histogram corresponding to each segment can be obtained, and the sub-sequence direction histogram of each segment can be further combined to obtain the corresponding handwriting. The subsequence direction histogram vector of the sample, such as 608 in Figure 6a. After obtaining the sub-sequence direction histogram vector, the above features can be easily clustered, each cluster corresponding to a stroke order or a handwritten style. The operation of clustering can be achieved by, for example, a Guassian Mixture Model. Finally, the results of m clusters will be obtained, and m is also the number of optimized paths corresponding to a handwritten East Asian character. Then, based on the data of each cluster, parameters of a path in the HMM topology corresponding to each stroke order or each handwritten style are determined. Referring to Figure 6b, Figure 6b is an automatic classification of training data, more 36 200937308 is exactly an example of clustering operations. In Figure 6b, a total of nine sample data were provided from the training materials. By automatically classifying the data, they are clustered into two typical classes, representing the two complex words of the written style after clustering. '3) Determination of the optimal number of states • After determining the number of paths, it is also necessary to determine the number of states in the path. According to the present invention, the number of states of the machine self-learning from the data is still used to determine the number of states. When determining the number of states in a path, the data used is the handwriting sample from the same class in step 2) above. The state in the HMM reflects the shape and variation of the handwriting sample. Therefore, the state includes two categories: the part that is not obvious in direction change (the curvature is not large), called the "continuous stroke", corresponding to the "continuous state" in the HMM. ": For the part where the direction changes significantly (large curvature), it is the "corner", which corresponds to the "corner state". The number of states in a path is the sum of the number of "continuous strokes" and "corners", that is, the sum of the number of "continuous states" and the number of "corner states". The gradually refined texture-to-fine Curvature Scale Space can be used to detect the corners to determine the number of corners. At the same time, because the corners and continuous strokes alternate, The number of continuous strokes is derived from the number of corners so that the number of states can be determined. 4) State Connection Design As described above, the problem of "virtual stroke/real stroke uncertainty", the present invention provides two solutions. The first way is to provide a parallel state in the HMM topology. For the 2009 2009308, you should write a part of the East Asian character that may be either a virtual stroke or a real stroke. One implementation form that provides a parallel state is to use a double Gaussian mixture model (GMM), that is, using the distribution characteristics of the Gaussian function, so that the real stroke and the virtual stroke correspond to the peaks of different Gaussian functions, respectively, and by using such a double Gaussian mixture model, Solve the problem of solving the uncertainty of real strokes/virtual strokes. Referring to Figure 6c, for the word "wood", there are two ways of writing a pen and a pen, and the writing of the pen corresponds to the case of "real strokes", and the case of not writing a pen corresponds to the case of "virtual strokes". In the 6 c-picture, the real stroke and the virtual stroke have their own Gaussian function distributions, and their peaks are different, so they can be combined to obtain a double Gaussian mixture model with two different peaks to achieve a parallel state. In the second method, a multi-space probability distribution (MSD) is applied to the HMM topology, which may be a virtual stroke or a part of a real stroke in the handwritten East Asian character. 5) Path Merging After the above steps are completed, a preliminary multipath HMM topology is obtained, which includes paths corresponding to all cases in the training data, including paths reflecting stroke order and handwritten style, but due to training data Redundant information and very low probability information exist in the process, and the obtained paths need to be combined. The merging of paths can include two aspects: removing paths with very low probability of occurrence; and merging similar paths. A path with a very low probability will appear, that is, the amount of data in the training data corresponding to the path is small, and the proportion of the total amount of training data is removed.

200937308 除這些路徑可以通過設置預定門限的方法實現。 而類似的路徑合併的操作可以通過爲路徑計 度量的方式來實現,通過相似度度量來表示路徑 似程度,當兩條路徑足夠相似時,就把路徑合併 相似度度量可以用 Kullback-Leibler 差值与 Kullbaek-Leibler差值表示路徑之間相似的程 Kullback-Leibler差值低於一預定值,就表示兩條 相似,則可以把它們合併,合併操作將對兩條路 衡化的處理,以得到一條能夠很好地反映原來兩 的主要特點的路徑。 6 )狀態聚類 之後,還需要對HMM拓撲結構中狀態進行聚 將東亞字元分割成數個部分,這些局部的結構還 少相似之處的。利用這個特點,能夠實現HMM 中狀態的聚類,從而減小資料量,降低HMM拓 複雜度。狀態的聚類可能在同一路徑上的狀態之 也可能在不同路徑上的狀態之間進行,被認爲是 的狀態,HMM拓撲結構中將只爲所有屬於同一個 態保留一套參數。 根據本發明,聚類操作包括:計算該HMM拓 每兩個狀態之間的狀態相似度度量,當狀態相似 示兩個狀態足夠相似時,使該兩個狀態共用參數 當兩個以上的狀態相互之間的狀態相似度度量都 時,使該這些狀態都共用參數。在本發明的一個 算相似度 之間的相 。比如, -表示, 变。如果 路徑足夠 徑進行平 條路徑中 類。如果 是具有不 拓撲結構 撲結構的 間進行, 可以聚類 聚類的狀 撲結構中 度度量表 ;其中, 足夠相似 實現中, 39 200937308 狀態相似度度量通過 Kullback-Leibler 差值表示, Kullback-Leibler差值低於一預定值,則表示這兩個狀態足 夠相似。比如一個具體的聚類操作的過程被顯示如下: 初始化階段: 設定所有狀態的總數爲Μ個,將第m個狀態歸類到第 m個類中200937308 In addition to these paths can be achieved by setting a predetermined threshold. The similar path merging operation can be implemented by measuring the path meter. The similarity metric is used to represent the path degree. When the two paths are sufficiently similar, the path merging similarity measure can use the Kullback-Leibler difference. The difference between the Kullback-Leibler and the Kullbaek-Leibler difference indicates that the difference is less than a predetermined value, indicating that the two are similar, then they can be combined, and the merge operation will process the two paths to obtain A path that best reflects the main features of the original two. 6) After state clustering, it is also necessary to aggregate the states in the HMM topology to divide the East Asian characters into several parts, and these local structures have few similarities. With this feature, clustering of states in the HMM can be achieved, thereby reducing the amount of data and reducing the complexity of the HMM. The clustering of states may be between states on the same path or between states on different paths. It is considered to be the state. In the HMM topology, only one set of parameters will be reserved for all belonging to the same state. According to the present invention, the clustering operation includes: calculating a state similarity measure between each two states of the HMM extension, and when the states are similar, indicating that the two states are sufficiently similar, the two states share the parameter when two or more states are mutually When the state similarity measures are between, the parameters are shared by the states. A phase between the similarities in the present invention. For example, - indicates, changes. If the path is sufficient for the class in the flat path. If there is an inter-topological structure with a non-topological structure, the clustered cluster-like structure can be clustered; wherein, in a sufficiently similar implementation, 39 200937308 state similarity measure is represented by Kullback-Leibler difference, Kullback-Leibler difference A value below a predetermined value indicates that the two states are sufficiently similar. For example, the process of a specific clustering operation is shown as follows: Initialization phase: Set the total number of all states to one, and classify the mth state into the mth class.

對於任意的兩個狀態 m 和 η,計算它們之間的 Kullback-Leibler 差值,並表示爲 D( m,n)。 聚類階段: 尋找具有最小的 Kullback-Leibler 差值的一組狀態 (m’,n’),可通過下面的函數實現:(m’,n’)=argmin( m,n)。 通過在Kullback-Leibler差值矩陣中將對應於m’和η’ 的行與列相加,合併第m ’和η ’狀態。 將狀態Μ的總數減1。 迴圈操作: 如果Μ的數量大於一預定值,則重復上面的操作,否 則完成合併操作。 通過上面的步驟 1 )· 5 ),就能從手寫東亞字元的訓練 資料中自動構建對應於手寫東亞字元的多路徑ΗΜΜ拓撲 結構,其中的每一條路徑對應手寫東亞字元的多種筆劃順 序中的一個,或者,其中的每一條路徑對應手寫東亞字元 的多種手寫風格中的一個; 在ΗΜΜ拓撲結構中提供描述手寫東亞字元持續筆劃 40 200937308 的持續狀態,和描述手寫東亞字元筆劃間轉角的轉角狀態; 其中,多路徑HMM拓撲結構中的每一條路徑是左向右 HMM拓撲結構,從一起始狀態開始,至一終止狀態結束; 其中持續狀態可以轉移至下一狀態或者自轉移,轉角狀態 ’ 只能轉移至下一狀態,不能自轉移;HMM拓撲結構中持續 • 狀態和轉角狀態依次交替存在;以及其中所有路徑起始於 同一入口狀態,至同一出口狀態結束。 〇 可能的硬體實現形式 本發明可以通過軟體的形式實現,比如通過一通用計 算系統運行實現本發明所述的方法的軟體,就能實現本發 明。本發明也可以以指令或者程式的形式被實現,這些指 令或者程式可以保存在一個存儲介質上,當一計算設備從 存儲介質上獲取這些指令或者程式並執行之後,就能夠實 現本發明。 此外,本發明也可以使用硬體的形式實現,需要說明 的是,對於本領域的技術人員來說,很顯然在本發明所處 的領域中,軟硬體之間的轉換是具有多種形式的,即具有 . 不同形式的硬體可以實現相同的功能,因此,本發明下面 所列舉的可能的硬體實現形式是限定硬體的功能,而不限 定其具體實現形式,對於本領域的技術人員來說,根據這 些功能實現各種形式的功能是顯而意見的。 參考第7圖所示,本發明的一示例的利用HMM模型識 別手寫東亞字元的系統700包括: 41 ❹ 200937308 HMM拓撲結構構建裴置7〇2,構建對應於手寫^ 元的HMM拓撲結構; 持續狀態設置裝置7〇4,在HMM拓撲結構中提胡 手寫東亞字元持續筆劃的持續狀態; 轉角狀態設置裝置7〇6,在HMM拓撲結構中提供 手寫東亞字元筆劃間轉角的轉角狀態。 其中,HMM拓撲妹M^ 供結構構建裝置702構建的Ημμ 結構是左向右HMM括撲結構,從一起始狀態開始, 終止狀態結束;並且,Η M M拓撲結構中的持續狀態可 移至下-狀態或者自轉移’轉角狀態只能轉移至下 態,不能自轉移;持續狀態和轉角狀態依次交替存在 在另一實施例中,該H蘭拓撲結構構建裝置702 多路徑HMM拓撲結構,對應於手寫東亞字元 一條路徑對應手寫東亞车分沾夕仏 ’、 内果亞予70的多種筆劃順序令的一個 者,其中的每一條路徑對應手寫東亞字元的多種手寫 中的一個。 于舄 其中,HMM抬撲牡姓磁决# 撰釔構構建裝置702構建的多路 拓撲結構中的每一條$彳;a I Θ + τ 阶峪亿疋左向右ΗΜΜ拓撲結 起始狀態開始,至一铢止壯能& & 丹 、、止狀態結束;並且,其中持續 可以轉移至下-狀態或者自轉移,轉角狀態只能轉移 -狀態’不能自轉移;ΗΜΜ拓撲結構令持續狀態和轉 態依次交替存纟及其中所有路徑起始於同一入 態,至同一出口狀態結束。 在一個實施例中,命备过 μ系統7〇〇還包括平行狀態提 -亞字 「描述 描述 拓撲 至一 以轉 -狀 〇 構建 的每 :或 風格 ημμ 從一 狀態 至下 角狀 口狀 供裝 42 200937308 置708,在HMM拓撲結構中提供平行狀態,對應手寫東亞 字元中既可能是虛筆劃,也可能是實筆劃的部分。該平行 狀態提供裝置 7 0 8可以應用雙高斯混合模型實現平行狀 態。 或者,包括多空間概率分佈(MSD)處理裝置710,對 HMM拓撲結構應用多空間概率分佈,對應手寫東亞字元中 既可能是虛筆劃,也可能是實筆劃的部分。For any two states m and η, calculate the Kullback-Leibler difference between them and denote it as D( m,n). Clustering phase: Finding a set of states (m', n') with the smallest Kullback-Leibler difference can be achieved by the following function: (m', n') = argmin( m, n). The m'th and η' states are merged by adding rows and columns corresponding to m' and η' in the Kullback-Leibler difference matrix. Decrement the total number of states 1 by 1. Loop operation: If the number of defects is greater than a predetermined value, repeat the above operation, otherwise the merge operation is completed. Through the above steps 1)· 5), a multi-path ΗΜΜ topology corresponding to handwritten East Asian characters can be automatically constructed from the training materials of handwritten East Asian characters, each of which corresponds to multiple stroke sequences of handwritten East Asian characters. One of them, or each of the paths corresponds to one of a plurality of handwritten styles of handwritten East Asian characters; providing a description of the persistence state of the handwritten East Asian character continuous stroke 40 200937308 in the topological structure, and describing the handwritten East Asian character strokes The corner state of the corners; wherein each path in the multipath HMM topology is a left-to-right HMM topology, starting from an initial state to a termination state; wherein the persistent state can be transferred to the next state or self-transfer The corner state can only be transferred to the next state and cannot be self-transferred; the persistence in the HMM topology • the state and the corner state alternately exist in turn; and all of the paths start at the same entry state and end at the same exit state. 〇 Possible hardware implementations The present invention can be implemented in the form of software, such as by running a software that implements the method of the present invention by a general purpose computing system. The present invention can also be implemented in the form of instructions or programs which can be stored on a storage medium which can be implemented by a computing device after it has been executed and executed from the storage medium. In addition, the present invention can also be implemented in the form of a hardware. It should be noted that it will be apparent to those skilled in the art that in the field of the present invention, the conversion between soft and hard bodies has various forms. That is, having different forms of hardware can achieve the same function, and therefore, the possible hardware implementations listed below of the present invention are functions that define the hardware, and are not limited to the specific implementation thereof, and are known to those skilled in the art. In fact, it is obvious to implement various forms of functions based on these functions. Referring to FIG. 7, a system 700 for recognizing handwritten East Asian characters using an HMM model according to an example of the present invention includes: 41 ❹ 200937308 HMM topology construction device 7〇2, constructing an HMM topology corresponding to handwritten elements; The persistent state setting means 7〇4, in the HMM topology, mentions the continuous state of the handwritten East Asian character continuous stroke; the corner state setting means 7〇6 provides the corner state of the handwritten East Asian character stroke between the corners in the HMM topology. The Ημμ structure constructed by the HMM topology device 702 is a left-to-right HMM structure, starting from an initial state, ending the state; and, the continuation state in the MM topology can be moved to the next- The state or self-transition 'corner state can only be transferred to the lower state, and cannot be self-transferred; the continuous state and the corner state are alternately present in another embodiment. The H-blue topology construction device 702 has a multi-path HMM topology corresponding to handwriting. One path of the East Asian character corresponds to one of the various stroke order orders of the handwritten East Asian car, the singer, and the endogenous squad, each of which corresponds to one of the various handwriting of the handwritten East Asian character. In the middle of the , , , H H H H H H 牡 牡 牡 牡 牡 牡 牡 牡 构建 构建 构建 构建 构建 构建 构建 702 702 702 702 702 702 702 702 702 702 702 702 702 702 702 702 702 702 702 702 702 702 702 702 702 702 702 702 , to the end of the strong &&& Dan, the end state; and, where the continuous can be transferred to the next - state or self-transition, the corner state can only be transferred - the state 'can not be transferred; ΗΜΜ topology to make the state And the transition state alternates in turn, and all the paths in it start at the same state, and the same exit state ends. In one embodiment, the surviving μ system 7〇〇 also includes a parallel state-sub-word "description describing the topology to a per-turn-like configuration: or a style ημμ from a state to a lower-angle mouth-like supply 42 200937308 708, providing a parallel state in the HMM topology, corresponding to the handwritten East Asian character may be either a virtual stroke or a real stroke part. The parallel state providing device 708 can apply a double Gaussian mixture model to achieve parallel Alternatively, a multi-space probability distribution (MSD) processing device 710 is applied to apply a multi-space probability distribution to the HMM topology, which may be a virtual stroke or a part of a real stroke in the handwritten East Asian character.

需要說明的是,平行狀態提供裝置7 0 8和多空間概率 分佈處理裝置710是二選其一的。 在一個實施例中,該系統 700還包括狀態聚類裝置 712,對HMM拓撲結構中的狀態進行聚類,使HMM拓撲 結構中的至少一組狀態共用參數,且對於所述一組共用參 數的狀態,在HMM拓撲結構中只保存一套參數。 需要說明的是,此處描述的利用HMM模型識別手寫東 亞字元的系統 700的各個裝置可用於實現上面結合附圖 2 · 5所描述的方法,其中的各個細節特徵都對應,因此這 裏就不再重復地進行描述。 第8圖示出了根據本發明的另一實施例的利用HMM模 型識別手寫東亞字元的系統800的結構圖,該系統800包 括: HMM拓撲結構構建裝置802,構建對應於手寫東亞字 元的多路徑HMM拓撲結構,其中的每一條路徑對應手寫 東亞字元的多種筆劃順序中的一個,或者,其中的每一條 路徑對應手寫東亞字元的多種手寫風格中的一個; 43 200937308 持續狀態設置裝置804,在HMM拓撲結構中設置描述 手寫東亞字元持續筆劃的持續狀態; 轉角狀態設置裝置806,在HMM拓撲結構中設置和描 述手寫東亞字元筆劃間轉角的轉角狀態; 其中,多路徑HM Μ拓撲結構中的每一條路徑是左向右 ΗΜΜ拓撲結構,從一起始狀態開始,至一終止狀態結束; 持續狀態可以轉移至下一狀態或者自轉移,轉角狀態只能 轉移至下一狀態,不能自轉移;持續狀態和轉角狀態依次 交替存在;並且所有路徑起始於同一入口狀態,至同一出 口狀態結束;以及 下列兩個裝置的其中之一: 平行狀態提供裝置808,在ΗΜΜ拓撲結構中提供平行 狀態,對應手寫東亞字元中既可能是虛筆劃,也可能是實 筆劃的部分;平行狀態提供裝置8 0 8可以採用雙高斯混合 模型(GMM ),即利用高斯函數的分佈特性,使得實筆劃 和虛筆劃分別對應不同的高斯函數的峰值,利用這樣的雙 高斯混合模型,也能夠解決解決實筆劃/虚筆劃不確定性的 問題。 多空間概率分佈處理裝置8 1 0,對ΗΜΜ拓撲結構應用 多空間概率分佈,對應手寫東亞字元中既可能是虛筆劃, 也可能是實筆劃的部分。 根據第8圖所示的系統800,其中的ΗΜΜ拓撲結構構 建裝置802從手寫東亞字元的訓練資料中自動構建多路徑 ΗΜΜ拓撲結構;以及該ΗΜΜ拓撲結構構建裝置802根據 44 200937308 手寫東亞字元的 習的自動分類 '$者書寫風格,應、用-機器自學 法對訓練資料進行分類。 繼續參考第8圖 寫東亞字元筆跡搂 系統8〇0採用的訓練資料包括手 同手寫風格的筆跡筆跡樣本包括不同筆劃順序或者不 包括, 跡樣本;該《MM拓撲結構構建裝置8〇2It should be noted that the parallel state providing means 708 and the multi-space probability distribution processing means 710 are selected one by one. In one embodiment, the system 700 further includes a state clustering device 712 that clusters states in the HMM topology such that at least one set of states in the HMM topology share parameters and for the set of shared parameters State, only one set of parameters is saved in the HMM topology. It should be noted that the various devices of the system 700 for identifying handwritten East Asian characters using the HMM model described herein can be used to implement the method described above in connection with FIG. 2.5, wherein each detail feature corresponds, so Description will be repeated again. 8 is a block diagram showing a system 800 for identifying handwritten East Asian characters using an HMM model, the system 800 comprising: an HMM topology construction device 802, which is constructed to correspond to handwritten East Asian characters, in accordance with another embodiment of the present invention. A multi-path HMM topology in which each path corresponds to one of a plurality of stroke sequences of handwritten East Asian characters, or each of which corresponds to one of a plurality of handwritten styles of handwritten East Asian characters; 43 200937308 Continuous state setting device 804, setting a continuous state describing the handwritten East Asian character continuous stroke in the HMM topology; the corner state setting device 806, setting and describing a corner state of the handwritten East Asian character stroke between the corners in the HMM topology; wherein, the multipath HM Μ Each path in the topology is a left-to-right ΗΜΜ topology, starting from an initial state to a termination state; the continuation state can be transferred to the next state or self-transition, and the corner state can only be transferred to the next state, Self-transition; continuous state and corner state alternately exist; and all paths start In the same entry state, to the end of the same exit state; and one of the following two devices: Parallel state providing device 808, providing parallel states in the topological structure, corresponding to handwritten East Asian characters may be either virtual strokes, or It is a part of the real stroke; the parallel state providing device 800 can adopt a double Gaussian mixture model (GMM), that is, using the distribution characteristic of the Gaussian function, so that the real stroke and the virtual stroke respectively correspond to the peaks of different Gaussian functions, using such a double The Gaussian mixture model can also solve the problem of solving the uncertainty of real stroke/virtual stroke. The multi-space probability distribution processing device 8 10 0 applies a multi-space probability distribution to the topological structure, which may be a virtual stroke or a part of a real stroke in the handwritten East Asian character. According to the system 800 shown in FIG. 8, the ΗΜΜ topology construction apparatus 802 automatically constructs a multi-path ΗΜΜ topology from the training material of the handwritten East Asian character; and the ΗΜΜ topology construction apparatus 802 writes the East Asian character according to 44 200937308 The automatic classification of the habits of the '$ person's writing style, should be, using the - machine self-study method to classify the training materials. Continue to refer to Figure 8 to write the East Asian character handwriting 搂 The training data used by the system 8〇0 includes the handwriting style handwriting style handwriting sample including different stroke sequences or not included, the track sample; the MM topology construction device 8〇2

筆跡樣本分段I ❹ 將其分成數個分段1、士 對於母一個筆跡樣本,按弧長 和,就是所有的實二所說的弧長是指所有筆劃長度之 段的過程是基於這個求:有的虛筆劃的長度之和,分 调衣和之後的弧長。 特徵提取裝置822,對每一八 的特徵該分段的形狀特徵 厂刀別提取-特徵;其中 操作之後,㊣夠獲K固字/過分段和提取特徵的 形狀特徵。將每個分段上的:::個分段上的—種 筆跡樣本的一個特徵。 筏填序連接,從而得到該 聚類裝置824,對上述 狩徵進;f干;^ # 應於一種筆劃順序或者一 丁聚類,母一個聚類對 裡手寫風格; 刪拓撲結構構建裝置8Q2基 定對應於每一種筆劃順序 、每一個聚類的資料確 撲結構中的一路徑的參數。 悝予寫風格的HMM拓Handwriting sample segmentation I ❹ Divide it into several segments 1 , for a parent handwriting sample, according to the arc length sum, that is, all the actual two said arc length refers to the segment length of all strokes based on this : The sum of the lengths of some virtual strokes, the length of the clothes and the length of the arc. The feature extraction means 822 extracts the features of the shape of each segment for each of the eight features; wherein, after the operation, the K-shaped/over-segmented and extracted feature features are extracted. A feature of the handwriting sample on the ::: segments on each segment.筏 筏 连接 , , , , , , 824 824 824 824 824 824 824 824 824 824 824 824 824 824 824 824 824 824 824 824 824 824 824 824 824 824 824 824 824 824 824 824 824 824 824 824 824 824 ^ 824 ^ ^ ^ The base corresponds to each stroke order, and the data of each cluster is a parameter of a path in the structure.悝Write style HMM extension

拓撲結構構建裝置8〇2還可包括 拫據—實施例,HMM 立裝置826,將上述的筆跡===列方向直方圖向量建 實現爲子序列方向直方圖向量^其母個分段上的形狀特徵 對於每一個筆跡樣本,將實 ”中,筆跡樣本分段裝置 貫筆劃和虛筆劃相連,並將實筆 45The topology construction apparatus 8〇2 may further include an embodiment, the HMM vertical apparatus 826, which implements the above-described handwriting===column direction histogram vector as a subsequence direction histogram vector^ on the parent segment Shape feature For each handwriting sample, the real stroke, the handwriting sample segmentation device is connected to the virtual stroke and the virtual stroke is 45

200937308 劃和虛筆劃相連後的弧長分成數段; 而特徵爲量化的方向,特徵提取裝置822使每一個 徵對應一預定角度範圍的方向; 該子序列方向直方圖向量建立裝置826建立對應於 段及特徵的子序列方向直方圖向量,該子序列方向直方 向量建立裝置8 2 6將筆跡樣本的分段進一步劃分成數個 段,每一個子段確定一量化的方向特徵,每一個量化的 向特徵對應一預定角度範圍的方向。每一個分段上的形 特徵爲一子序列方向直方圖,每一個筆跡樣本的特徵爲 子序列方向直方圖向量。 繼續參考第8圖,在一實施例中,該系統8 0 0還包泰 路徑合併裝置814,對多路徑HMM拓撲結構中的路 進行合併,以控制路徑的數量。該路徑合併裝置8 1 4判 路徑所對應的訓練資料的資料量,刪除對應的資料量小 一預定值的路徑;計算多路徑HMM拓撲結構中每兩個 徑之間的路徑相似度度量,當路徑相似度度量表示兩個 態足夠相似時,合併該兩個路徑,其中,當兩個以上的 徑的路徑相似度度量都足夠相似時,合併這些路徑。在 個實現中,該路徑相似度度量用 Kullback-Leibler差值 示,當Kullback-Leibler差值低於一預定值時,表示路 足夠相似,可以合併這些路徑。 該系統800還可包括狀態聚類裝置8 1 2,對HMM拓 結構中狀態進行聚類,使HMM拓撲結構中的至少一組 態共用參數,且對於一組共用參數的狀態,在HMM拓 特 分 圖 子 方 狀 徑 斷 於 路 狀 路 表 徑 撲 狀 撲 46 200937308 結構中只保存一套參數。狀態聚類裝置812計算該HMM 拓撲結構中每兩個狀態之間的狀態相似度度量,當狀態相 似度度量表示兩個狀態足夠相似時,使該兩個狀態共用參 數;其中’當兩個以上的狀態相互之間的狀態相似度度量 都足夠相似時,使該這些狀態都共用參數《並且狀態聚類 裝置812狀態通過Kullback-Leibler差值表示相似度度 量,當Kullback-Leibler差值低於一預定值時,表示這些 狀態足夠相似,可以合併這些狀態。 需要說明的是,此處利用HMN1模型識別手寫東亞字元 的系統800的各個裝置可用於實現上面結合第6圖所描述 的方法’包括上面所描述的步驟1)_5)來構建多路徑 抬撲結構,其中的各個細節特徵都對應,因此這裏就不再 重復地進行描述。 本發明針對手寫東亞字元的特點提供了一種利用改進 Q 的HMM模型來對手寫東亞字元進行識別的方案,充分考 慮了手寫東亞字元筆劃多、筆劃順序多樣、結構複雜、書 寫風格多樣、筆劃間連接不確定的特點,改進了 HMΜ拓 • 撲結構的特點,通過在HMM拓撲結構中加入轉角狀態、 . 提供多路徑、提供平行狀態的手段來解決上述的問題,並 且’通過聚類和合併的操作來減小資料量,降低運算的複 雜程度。 上面結合本發明的一實施例對本發明進行了詳細的描 迷’但是需要說明的是,這並不是對於本發明的範圍作出 47 200937308 任何的限制,對於上面所說的實施例的所作出的各種不需 要創造性勞動的變化,修改,都應被視爲是在本發明的範 圍之内,對於本發明而言,應當擴展到符合申請專利範圍 所限定之發明原理的最寬泛之範圍。 【圖式簡單說明】 本發明的上述的以及其他的特徵,性質和優勢將通過 如下結合附圖對實施例的描述而變得更加明顯,在附圖 中,相同的附圖標記始終表示相同的特徵,其中, 第1 a圖是可實現本發明的適當的計算系統環境的一示 例; 第lb圖示出了一 HMM模型的示例; 第2a圖示出了手寫東亞字元中的“轉角”; 第 2b圖示出了能體現手寫東亞字元中轉角特性的 HMM拓撲結構; 第2c圖示出了手寫東亞字元中持續筆劃與轉角的區分 規則; 第3a圖示出了手寫東亞字元的筆劃順序的多樣性; 第3b圖示出了手寫東亞字元的手寫風格的多樣性; 第4a圖示出了根據本發明的一實施例的多路徑HMM 拓撲結構的結構圖; 第 4b圖示出了每一路徑對應一種筆劃順序的多路徑 HMM拓撲結構的示例; 48 200937308 第 4c圖示出了每一路徑對應一種手寫風格的多路徑 HMM拓撲結構的示例; 第5a圖示出了手寫東亞字元書寫過程中實筆劃/虛筆 劃不確定性的情況; 第5 b圖示出了根據本發明的一實施例的具有平行狀態 的HMM拓撲結構的示意圖; 第5c圖示出了根據第5a圖所示的手寫示例構建的具 有平行狀態的HMM拓撲機構的示意圖; 第6a圖示出了對筆跡樣本應用子序列方向直方圖的一 個實例; 第6b圖示出了對訓練資料進行聚類操作的一個實例; 第6 c圖示出了採用雙高斯混合模型實現平行狀態的一 個實例; 第7圖示出了根據本發明的一實施例的利用HMM模型 對手窝東亞字元進行識別的系統的結構圖; 第8圖示出了根據本發明的另一實施例的利用 HMM 模型對手寫東亞字元進行識別的系統的結構圖。 【主要元件符號說明】 1 3 0系統存儲器 1 3 4作業系統 1 35應用程式 1 3 6其他程式模組 49 200937308 137程式數據 144作業系統 1 4 5應用程式 1 3 6其他程式模組 147程式數據 120處理單元 140不可抽取,非揮發性存儲器介面200937308 The arc length after the stroke and the virtual stroke are connected into a plurality of segments; and the feature is the direction of quantization, the feature extraction means 822 causes each sign to correspond to a direction of a predetermined angle range; the sub-sequence direction histogram vector establishing means 826 establishes a correspondence corresponding to a subsequence direction histogram vector of segments and features, the subsequence direction direct direction amount establishing means 286 further dividing the segment of the handwriting sample into a plurality of segments, each sub segment determining a quantized directional feature, each quantized The feature corresponds to a direction of a predetermined range of angles. The shape feature on each segment is a subsequence direction histogram, and each hand sample feature is a subsequence direction histogram vector. With continued reference to Fig. 8, in one embodiment, the system 800 also includes a path merging device 814 that combines the paths in the multipath HMM topology to control the number of paths. The path merging device 8 1 4 determines the data amount of the training data corresponding to the path, deletes the path of the corresponding data amount smaller than the predetermined value; calculates the path similarity measure between each two paths in the multi-path HMM topology, when The path similarity measure indicates that the two paths are merged when the two states are sufficiently similar, wherein the paths are merged when the path similarity measures of the two or more paths are sufficiently similar. In one implementation, the path similarity metric is represented by a Kullback-Leibler difference. When the Kullback-Leibler difference is below a predetermined value, the paths are sufficiently similar and the paths can be merged. The system 800 can also include a state clustering device 8 1 2 that clusters states in the HMM topology to make at least one configuration common parameter in the HMM topology, and for a set of shared parameter states, in HMM TNT The sub-picture sub-path is broken in the road path. The path is only 46. 37308 Only one set of parameters is saved in the structure. The state clustering means 812 calculates a state similarity measure between each two states in the HMM topology, and when the state similarity measure indicates that the two states are sufficiently similar, the two states share parameters; wherein 'when more than two When the state similarity metrics are sufficiently similar to each other, the states are all shared parameters "and the state clustering means 812 state represents the similarity metric by the Kullback-Leibler difference, when the Kullback-Leibler difference is lower than one When the value is predetermined, it means that these states are sufficiently similar that they can be merged. It should be noted that the various devices of the system 800 for recognizing handwritten East Asian characters using the HMN1 model herein can be used to implement the method described above in connection with FIG. 6 including step 1)_5) described above to construct a multipath uplift. The structure in which each detail feature corresponds is therefore not described repeatedly here. The invention provides a scheme for recognizing handwritten East Asian characters by using the HMM model of improved Q for the characteristics of handwritten East Asian characters, and fully considers that the handwritten East Asian character has many strokes, various stroke sequences, complicated structure, and diverse writing styles. The characteristics of the connection between the strokes are uncertain, and the characteristics of the HM • • • structure are improved. By adding the corner state in the HMM topology, providing multiple paths and providing parallel states to solve the above problems, and 'through clustering and The combined operation reduces the amount of data and reduces the complexity of the operation. The present invention has been described in detail above with reference to an embodiment of the present invention, but it should be noted that this is not to be construed as limiting the scope of the invention to any of the limitations of the above-mentioned embodiments. It is intended that the present invention be construed as being limited to the scope of the invention as defined by the appended claims. BRIEF DESCRIPTION OF THE DRAWINGS The above and other features, aspects and advantages of the present invention will become more apparent from the description of the appended claims Features, wherein Figure 1a is an example of a suitable computing system environment in which the present invention may be implemented; Figure lb shows an example of an HMM model; Figure 2a shows "turning corners" in handwritten East Asian characters Figure 2b shows the HMM topology that reflects the cornering characteristics of handwritten East Asian characters; Figure 2c shows the distinction between continuous strokes and corners in handwritten East Asian characters; Figure 3a shows handwritten East Asian characters Diversity of stroke sequences; Figure 3b shows the diversity of handwritten styles of handwritten East Asian characters; Figure 4a shows the structure of a multipath HMM topology according to an embodiment of the invention; An example of a multi-path HMM topology in which each path corresponds to one stroke order is shown; 48 200937308 Figure 4c shows an example of a multi-path HMM topology corresponding to one hand-written style for each path; Figure 5a shows a case where the real stroke/virtual stroke uncertainty is written in the East Asian character writing process; Figure 5b shows a schematic diagram of the HMM topology having a parallel state according to an embodiment of the present invention; Figure 5c shows a schematic diagram of a HMM topology with parallel states constructed according to the handwritten example shown in Figure 5a; Figure 6a shows an example of applying a subsequence direction histogram to handwriting samples; Figure 6b shows An example of a clustering operation on training data; Figure 6c shows an example of implementing a parallel state using a double Gaussian mixture model; Figure 7 shows a nest using a HMM model according to an embodiment of the present invention. A structural diagram of a system for identifying East Asian characters; FIG. 8 is a structural diagram of a system for recognizing handwritten East Asian characters using the HMM model according to another embodiment of the present invention. [Main component symbol description] 1 3 0 system memory 1 3 4 operating system 1 35 application 1 3 6 other program module 49 200937308 137 program data 144 operating system 1 4 5 application 1 3 6 other program module 147 program data 120 processing unit 140 is not extractable, non-volatile memory interface

1 5 0可抽取,非揮發性存儲器接介面 1 9 0視頻介面 1 60用戶輸入介面 1 9 5輸出外圍介面 1 7 0網路介面 172數據機 162鍵盤 1 6 1指示設備 163話筒 1 9 1監視器 196印表機 197揚聲器 1 7 1區域網路 1 7 3廣域網路 180遠端電腦 50 200937308 185遠端應用程式 7 02 HMM拓撲結構構建裝置 7 04持續狀態設置裝置 706轉角狀態設置裝置 708平行狀態提供裝置 710多空間概率分佈處理裝置 7 1 2狀態聚類裝置 8 02 HMM拓撲結構構建裝置 8 2 0筆跡樣本分段裝置 8 22特徵提取裝置 824聚類裝置 826子序列方向直方圖向量建立裝置 804持續狀態設置裝置 8 0 6轉角狀態設置裝置 8 0 8平行狀態提供裝置 810多空間概率分佈處理裝置 8 1 4路徑合併裝置 8 1 2狀態聚類裝置 511 5 0 extractable, non-volatile memory interface 1 90 video interface 1 60 user input interface 1 9 5 output peripheral interface 1 7 0 network interface 172 data machine 162 keyboard 1 6 1 indicating device 163 microphone 1 9 1 monitoring 196 printer 197 speaker 1 7 1 area network 1 7 3 wide area network 180 remote computer 50 200937308 185 remote application 7 02 HMM topology construction device 7 04 continuous state setting device 706 corner state setting device 708 parallel state Providing device 710 multi-space probability distribution processing device 7 1 2 state clustering device 8 02 HMM topology construction device 8 2 handwriting sample segmentation device 8 22 feature extraction device 824 clustering device 826 sub-sequence direction histogram vector establishing device 804 Continuous state setting means 86 6 corner state setting means 8 0 8 parallel state providing means 810 multi-space probability distribution processing means 8 1 4 path combining means 8 1 2 state clustering means 51

Claims (1)

200937308 十、申請專利範圍: 1. 一種建立適用於識別手寫東亞字元的隱性馬爾可夫 (HMM )模型的方法,其特徵在於: 提供一 HMM拓撲結構,該HMM拓撲結構用於識別手 寫東亞字元; * 在該HMM拓撲結構中提供一持續狀態,該持續狀態用 於描述手寫東亞字元的持續筆劃; © 在HMM拓撲結構中提供一轉角狀態,該轉角狀態用於 描述手寫東亞字元的筆劃間轉角。 2. 如申請專利範圍第1項所述之方法,其特徵在於: 該HMM拓撲結構是左向右HMM拓撲結構,從一起始 狀態開始,至一終止狀態結束; 該 HMM拓撲結構中的持續狀態被設置爲能夠轉移至 下一狀態或者自轉移,轉角狀態被設置爲只能轉移至下一 φ 狀態,不能自轉移; 該 HMM拓撲結構中之持續狀態和轉角狀態被設置爲 依次交替存在。 • 3·如申請專利範圍第1項所述之方法,其特徵在於,還包 括: 提供多路徑HMM拓撲結構,對應於手寫東亞字元; 該多路徑 HMM拓撲結構中的每一條路徑被構建爲手 52 200937308 寫東亞字元的多種筆劃順序中的一個;或者 該多路徑 HMM拓撲結構中的每一條路徑被構建爲手 寫東亞字元的多種手寫風格中的一個。 4·如申請專利範圍第3項所述之方法,其特徵在於, ' 該多路徑 HMM拓撲結構中的每一條路徑是左向右 HMM拓撲結構,從一起始狀態開始,至一終止狀態結束; 〇 該HMM拓撲結構包括持續狀態和轉角狀態,其中該持 續狀態可以轉移至下一狀態或者自轉移,該轉角狀態只能 轉移至下一狀態,不能自轉移; 該 HMM拓撲結構中之持續狀態和轉角狀態依次交替 存在;以及 其中所有路徑起始於同一入口狀態,至同一出口狀態 結束。 φ 5·如申請專利範圍第3項所述之方法,其特徵在於,還包 括: 在該HMM拓撲結構中提供平行狀態,其對應手寫東亞 ' 字元中既可能是虛筆劃,也可能是實筆劃的部分。 6 .如申請專利範圍第3項所述之方法,其特徵在於,還包 括: 對該HMM拓撲結構應用多空間概率分佈(MSD ),對 53 200937308 應手寫東亞字元中既可能是虚筆劃,也可能是實筆劃的部 分。 7.如申請專利範圍第3項所述之方法,其特徵在於,還包 括: • 對該HMM拓撲結構中的狀態進行聚類,使該HMM拓 撲結構中的至少一組狀態共用參數,且對於該一組共用參 〇 數的狀態,在HMM拓撲結構中只保存一套參數。 8 · —種利用HMM模型識別手寫東亞字元的方法,其特徵 在於: 提供對應於手寫東亞字元的一多路徑 HMM拓撲結 構,其中的每一條路徑對應手寫東亞字元的多種筆劃順序 中的一個,或者,其中的每一條路徑對應手寫東亞字元的 多種手寫風格中的一個; q 在該 HMM拓撲結構中提供描述手寫東亞字元持續筆 劃的一持續狀態,和描述手寫東亞字元筆劃間轉角的一轉 角狀態; • 其中,該多路徑HMM拓撲結構中的每一條路徑是左向 • 右HMM拓撲結構,從一起始狀態開始,至一終止狀態結 束;其中該持續狀態可以轉移至下一狀態或者自轉移,該 轉角狀態只能轉移至下一狀態,不能自轉移;該HMM拓 撲結構中之持續狀態和轉角狀態依次交替存在;以及其中 54 200937308 所有路徑起始於同一入口狀態,至同一出口狀態結束; 在該HMM拓撲結構中提供一平行狀態,或者對該HMΜ 拓撲結構應用多空間概率分佈(M S D ),對應手寫東亞字元 中既可能是虛筆劃,也可能是實筆劃的部分。 • 9·如申請專利範圍第8項所述之方法,其特徵在於: 該多路徑 ΗΜΜ拓撲結構從手寫東亞字元的訓練資料 © 中自動構建; 根據手寫東亞字元的筆劃順序或者書寫風格,應用一 機器自學習的自動分類方法對該訓練資料進行分類。 1〇·如申請專利範圍第9項所述之方法,其特徵在於: 該訓練資料包括手寫東亞字元筆跡樣本,該筆跡樣本 包括不同筆劃順序或者不同手寫風格的筆跡樣本; 對於每一個筆跡樣本,按弧長將其分成數個分段,每 φ 一分段分別提取一特徵,將每個分段上的特徵按順序連 接,從而得到該筆跡樣本的一個特徵; 對該特徵進行聚類,每一個聚類對應於一種筆劃順序 或者一種手寫風格; ' 基於每一個聚類的資料確定對應於每一種筆劃順序或 者每一種手寫風格的ΗΜΜ拓撲結構中的一路徑的拓撲和 初始參數。 55 200937308 11·如申請專利範圍第10項所述之方法,其特徵在於,還 包括: 建立對應於分段及特徵的子序列方向直方圖向量, 其中,對於每一個筆跡樣本,將實筆劃和虛筆劃相連, * 並將實筆劃和虛筆劃相連後的弧長分成數段, * 每一個子段確定一量化的方向特徵,每一個量化的方 向特徵對應一預定角度範圍的方向; 〇 每一個分段上的形狀特徵爲一子序列方向直方圖,每 一個筆跡樣本的特徵爲一子序列方向直方圖向量。 12.如申請專利範圍第9項所述之方法,其特徵在於,還 包括: 對該多路徑ΗΜΜ拓撲結構中的路徑進行合併,以控制 路徑的數量。 φ 13.如申請專利範圍第12項所述之方法,其特徵在於,該 路徑之合併包括: 判斷路徑所對應的訓練資料的資料量,刪除對應的資 料量小於一預定值的路徑; • 計算該多路徑 ΗΜΜ拓撲結構中每兩個路徑之間的路 徑相似度度量,當路徑相似度度量表示兩個路徑足夠相似 時,合併該兩個路徑,其中,當兩個以上路徑的路徑相似 度度量表示這些路徑都足夠相似時,合併這些路徑。 56 200937308 14·如申請專利範圍第13項所述之方法,其特徵在於,該 路徑相似度度量使用Kullback-Leibler差值表示, 當Kullback-Leibler差值低於一預定值時,表示該路徑 足夠相似。 15_如申請專利範圍第12項所述之方法,其特徵在於,還 〇 包括: 對該HMM拓撲結構中狀態進行聚類,使該HMM拓撲 結構中的至少一組狀態共用參數,且對於該一組共用參數 的狀態,在該HMM拓撲結構令只保存一套參數。 16.如申請專利範圍第15項所述之方法,其特徵在於,聚 類多個HMM拓撲結構中的狀態包括: 計算該 HMM拓撲結構中每兩個狀態之間的狀態相似 0 度度量,當該狀態相似度度量表示兩個狀態足夠相似時, 使該兩個狀態共用參數; 其中,當兩個以上的狀態相互之間的狀態相似度度量 ' 都足夠相似時,使該這些狀態都共用參數。 17·如申請專利範圍第16項所述之方法,其特徵在於, 狀態相似度度量通過 Kullback-Leibler差值表示,當 Kullback-Leibler差值低於一予頁定值時,表示該狀態足夠相 57 200937308 似。 …―HMM拓撲結構構建裝置,其係構建對應於手寫東亞 子元的HMM拓撲結構; Ο 持續狀態叹置裝置,其係在該HMM拓撲結構中提供 描述手寫東亞字元持續筆劃的一持續狀態; 轉角狀態戎置裝置,其係在該HMM拓撲結構中提供 述手寫東亞字元筆劃間轉角的一轉角狀態。 如申請專利範圍第18項所述之系統,其特徵在於: 該HMM拓撲結構構建裝置所構建的hmm拓撲結構是 左向右HMM拓撲結構’從_起始狀態開始,至—終止狀 態结束; 並且,該HMM拓撲結構中的持續狀態被設置爲能夠轉 移至下一狀態或者自轉移’轉角狀態被設置爲只能轉移至 下〜狀態’不能自轉移; 該HMM拓撲結構中持續狀態和轉角狀態被設置爲依 次交替存在。 如申請專利範圍第U項所述之系統,其特徵在於: 該HMM拓撲結構構建裝置構建多路徑拓撲結 58 200937308 構,對應於手寫東亞字元;其中的每一條路徑對應手寫東 亞字元的多種筆劃順序中的一個;或者,其中的每一條路 徑對應手寫東亞字元的多種手寫風格中的一個。 2 1 ·如申請專利範圍第20項所述之系統,其特徵在於: * 該HMM拓撲結構構建裝置所構建的多路徑HMM拓撲 結構中的每一條路徑是左向右HMM拓撲結構,從一起始 〇 狀態開始,至一終止狀態結束; 並且,其中持續狀態可以轉移至下一狀態或者自轉 移,轉角狀態只能轉移至下一狀態,不能自轉移; 該 HMM拓撲結構中持續狀態和轉角狀態依次交替存 在;以及 其中所有路徑起始於同一入口狀態,至同一出口狀態 結束。 φ 22 .如申請專利範圍第20項所述之系統,其特徵在於,還 包括: 一平行狀態提供裝置,在HMM拓撲結構中提供平行狀 * 態,對應手寫東亞字元中既可能是虛筆劃,也可能是實筆 、 劃的部分。 2 3 ·如申請專利範圍第20項所述之系統,其特徵在於,還 包括: 59 200937308 一多空間概率分佈(MSD )處理裝置,對HMM拓撲結 構應用多空間概率分佈,對應手寫東亞字元中既可能是虛 筆劃,也可能是實筆劃的部分。 24 .如申請專利範圍第2 0項所述之系統,其特徵在於,還 • 包括: 一狀態聚類裝置,對HMM拓撲結構中的狀態進行聚 0 類,使該HMM拓撲結構中的至少一組狀態共用參數,且 對於該一組共用參數的狀態,在該HMM拓撲結構中只保 存一套參數。 25 · —種利用 HMM模型識別手寫東亞字元的系統,其特 徵在於,包括: 一 HMM拓撲結構構建裝置,其係構建對應於手寫東亞 字元的一多路徑HMM拓撲結構,其中的每一條路徑對應 ^ 手寫東亞字元的多種筆劃順序中的一個,或者,其中的每 一條路徑對應手寫東亞字元的多種手寫風格中的一個; 一持續狀態設置裝置,在該HMM拓撲結構中設置描述 • 手寫東亞字元持續筆劃的一持續狀態; - 一轉角狀態設置裝置,在該HMM拓撲結構中設置和描 述手寫東亞字元筆劃間轉角的一轉角狀態; 其中,該多路徑HMM拓撲結構中的每一條路徑是左向 右HMM拓撲結構,從一起始狀態開始,至一終止狀態結 60 200937308 束;該持續狀態可以轉移至下一狀態或者自轉移, 狀態只能轉移至下一狀態,不能自轉移;持續狀態 狀態依次交替存在;並且所有路徑起始於同一入口 至同一出口狀態結束;以及 下列兩個裝置的其中之一: 一平行狀態提供裝置,在該HMM拓撲結構中提 行狀態,對應手寫東亞字元中既可能是虛筆劃,也 實筆劃的部分; 一多空間概率分佈處理裝置,對該HMM拓撲結 多空間概率分佈,對應手寫東亞字元中既可能是虛 也可能是實筆劃的部分。 26 ·如申請專利範圍第25項所述之系統,其特徵在 該 HMM拓撲結構構建裝置從手寫東亞字元的 料中自動構建多路徑HMM拓撲結構;以及 該 HMM拓撲結構構建裝置根據手寫東亞字元 順序或者書寫風格,應用一機器自學習的自動分類 所述訓練資料進行分類。 2 7 ·如申請專利範圍第26項所述之系統,其特徵在 該訓練資料包括手寫東亞字元筆跡樣本,該筆 包括不同筆劃順序或者不同手寫風格的筆跡樣本;玄 拓撲結構構建裝置包括: 該轉角 和轉角 狀態, 供一平 可能是 構應用 筆劃, 於: 訓練資 的筆劃 方法對 於: 跡樣本 HMM 61 200937308 一筆跡樣本分段裝置,對於每一個筆跡樣本,按弧長 將其分成數個分段,將每個分段上的特徵按順序連接,從 而得到該筆跡樣本的一個特徵; 一特徵提取裝置,對每一分段分別提取一特徵; 一聚類裝置,對該分段和特徵進行聚類,每一個聚類 對應於一種筆劃順序或者一種手寫風格; ❹ 該 HMΜ拓撲結構構建裝置基於每一個聚類的資料確 定對應於每一種筆劃順序或者每一種手寫風格的ΗΜΜ拓 撲結構中的一路徑的拓撲和初始參數。 2 8 .如申請專利範圍第2 7項所述之系統,其特徵在於,還 包括: 一子序列方向直方圖向量建立裝置, 其中,該筆跡樣本分段裝置對於每一個筆跡樣本,將 實筆劃和虛筆劃相連,並將實筆劃和虛筆劃相連後的弧長 分成數段; 每一個子段確定一量化的方向特徵,每一個量化的方 向特徵對應一預定角度範圍的方向; 該子序列方向直方圖向量建立裝置建立每一個分段上 的形狀特徵爲一子序列方向直方圖,每一個筆跡樣本的特 徵爲一子序列方向直方圖向量。 2 9 ·如申請專利範圍第2 5項所述之系統,其特徵在於,還 62 200937308 包括: 一路徑合併裝置,對該多路徑Η MM拓撲結構中的路徑 進行合併,以控制路徑的數量。 3 0 ·如權利要求2 9所述的系統,其特徵在於,該路徑合併 • 裝置: 判斷路徑所對應的訓練資料的資料量,刪除對應的資 © 料量小於一預定值的路徑; 計算該多路徑 HMM拓撲結構中每兩個路徑之間的路 徑相似度度量,當路徑相似度度量表示兩個路徑足夠相似 時,合併該兩個路徑,其中,當兩個以上路徑的路徑相似 度度量表示這些路徑都足夠相似時,合併這些路徑。200937308 X. Patent application scope: 1. A method for establishing a hidden Markov (HMM) model suitable for recognizing handwritten East Asian characters, characterized in that: an HMM topology is provided, which is used to identify handwritten East Asia Character; * Provides a persistent state in the HMM topology that describes the continuous stroke of the handwritten East Asian character; © Provides a corner state in the HMM topology that describes the handwritten East Asian character The corner between the strokes. 2. The method of claim 1, wherein the HMM topology is a left-to-right HMM topology, starting from an initial state to a termination state; a persistent state in the HMM topology It is set to be able to shift to the next state or self-transition, and the corner state is set to be transferred only to the next φ state, and cannot be self-transferred; the persistent state and the corner state in the HMM topology are set to alternately exist in sequence. 3. The method of claim 1, further comprising: providing a multipath HMM topology corresponding to handwritten East Asian characters; each path in the multipath HMM topology is constructed as Hand 52 200937308 Write one of a plurality of stroke sequences for East Asian characters; or each of the multipath HMM topologies is constructed as one of a plurality of handwritten styles of handwritten East Asian characters. 4. The method of claim 3, wherein each path in the multipath HMM topology is a left-to-right HMM topology, starting from an initial state to a termination state; The HMM topology includes a persistent state and a corner state, wherein the persistent state can be transferred to a next state or a self-transition, the corner state can only be transferred to the next state, and cannot be self-transferred; the persistent state in the HMM topology and The corner states alternate in sequence; and all of the paths start at the same entry state and end at the same exit state. The method of claim 3, further comprising: providing a parallel state in the HMM topology, which may be a virtual stroke or a real one in the handwritten East Asian character. The part of the stroke. 6. The method of claim 3, further comprising: applying a multi-space probability distribution (MSD) to the HMM topology, the pair of 2009 2009308 handwritten East Asian characters may be virtual strokes, It may also be part of a real stroke. 7. The method of claim 3, further comprising: • clustering states in the HMM topology such that at least one set of states in the HMM topology share parameters, and The state of the set of shared parameters, only one set of parameters is saved in the HMM topology. 8 - A method for recognizing handwritten East Asian characters using an HMM model, characterized in that: a multi-path HMM topology corresponding to handwritten East Asian characters is provided, wherein each path corresponds to a plurality of stroke sequences of handwritten East Asian characters One, or each of the paths corresponds to one of a plurality of handwritten styles of handwritten East Asian characters; q providing a continuous state describing the handwritten East Asian character continuous stroke in the HMM topology, and describing the handwritten East Asian character strokes a corner state of the corner; • wherein each path in the multipath HMM topology is a leftward/right HMM topology, starting from an initial state to an end state; wherein the persistent state can be transferred to the next state State or self-transition, the corner state can only be transferred to the next state, and cannot be self-transferred; the persistent state and the corner state in the HMM topology alternately exist in turn; and wherein 54 200937308 all paths start at the same entry state, to the same Ending the exit state; providing a parallel state in the HMM topology, or The spatial structure of a multi-topology application HMΜ probability distribution (M S D), corresponding to the handwritten characters in both East may be a dummy stroke, it may be part of the real stroke. • The method of claim 8, wherein the multi-path topology is automatically constructed from handwritten East Asian character training material ©; according to the stroke order or writing style of the handwritten East Asian character, The training data is classified by an automatic classification method of machine self-learning. The method of claim 9, wherein the training material comprises a handwritten East Asian character handwriting sample, the handwriting sample comprising handwriting samples of different stroke sequences or different handwriting styles; for each handwriting sample Dividing it into several segments according to the arc length, extracting a feature for each segment of φ, and connecting the features on each segment in order, thereby obtaining a feature of the handwriting sample; clustering the feature Each cluster corresponds to a stroke order or a handwritten style; 'Based on each clustered data, the topology and initial parameters of a path in the topological structure corresponding to each stroke order or each handwritten style are determined. The method of claim 10, further comprising: establishing a subsequence direction histogram vector corresponding to the segment and the feature, wherein for each handwriting sample, the real stroke is The virtual strokes are connected, * and the arc length after the real stroke and the virtual stroke are connected is divided into several segments, * each sub-segment determines a quantized directional feature, and each quantized directional feature corresponds to a predetermined angular range direction; The shape feature on the segment is a subsequence direction histogram, and each hand sample feature is a subsequence direction histogram vector. 12. The method of claim 9, further comprising: merging the paths in the multipath ΗΜΜ topology to control the number of paths. The method of claim 12, wherein the merging of the path comprises: determining a data amount of the training material corresponding to the path, and deleting a path corresponding to the data amount less than a predetermined value; A path similarity measure between each two paths in the multipath topology, when the path similarity measure indicates that the two paths are sufficiently similar, the two paths are merged, wherein when two or more paths have path similarity measures Merge these paths when they are sufficiently similar. The method of claim 13 is characterized in that the path similarity measure is represented by a Kullback-Leibler difference value, and when the Kullback-Leibler difference is lower than a predetermined value, the path is sufficient similar. The method of claim 12, further comprising: clustering states in the HMM topology to make at least one set of states in the HMM topology share parameters, and The state of a set of shared parameters in which the HMM topology allows only one set of parameters to be saved. 16. The method of claim 15, wherein the clustering the states in the plurality of HMM topologies comprises: calculating a state similarity 0 degree metric between each of the two states in the HMM topology, when The state similarity measure indicates that when the two states are sufficiently similar, the two states share parameters; wherein when the state similarity measures ' between two or more states are sufficiently similar, the states are shared . 17. The method of claim 16, wherein the state similarity measure is represented by a Kullback-Leibler difference value, and when the Kullback-Leibler difference is lower than a predetermined value, the state is sufficiently phased. 57 200937308 Like. ...-HMM topology construction device, which constructs an HMM topology corresponding to the handwritten East Asian sub-element; Ο a continuous state sag device, which provides a continuous state describing the handwritten East Asian character continuous stroke in the HMM topology; The corner state setting device provides a corner state of the handwritten East Asian character stroke between the HMM topologies. The system of claim 18, wherein: the hmm topology constructed by the HMM topology construction device is a left-to-right HMM topology starting from an initial state and ending at a termination state; The persistent state in the HMM topology is set to be able to transition to the next state or the self-transition 'corner state is set to only transfer to the next ~ state' cannot be self-transferred; the persistent state and the corner state in the HMM topology are Set to alternate in order. The system of claim U is characterized in that: the HMM topology construction device constructs a multipath topology node 58 200937308, corresponding to handwritten East Asian characters; each of the paths corresponds to a plurality of handwritten East Asian characters One of the stroke sequences; or each of the paths corresponds to one of a plurality of handwritten styles of handwritten East Asian characters. 2 1 · The system of claim 20, characterized in that: * each path in the multi-path HMM topology constructed by the HMM topology construction device is a left-to-right HMM topology, starting from The 〇 state begins and ends at a termination state; and, wherein the continuation state can be transferred to the next state or self-transition, the corner state can only be transferred to the next state, and cannot be self-transferred; the persistent state and the corner state in the HMM topology are sequentially Alternate; and where all paths start at the same entry state, ending at the same exit state. φ 22. The system of claim 20, further comprising: a parallel state providing device that provides a parallel state in the HMM topology, which may be a virtual stroke in the corresponding handwritten East Asian character It may also be a part of real and stroked. The system according to claim 20, characterized in that it further comprises: 59 200937308 A multi-space probability distribution (MSD) processing device, which applies a multi-space probability distribution to the HMM topology, corresponding to the handwritten East Asian character It may be a virtual stroke or a part of a real stroke. 24. The system of claim 20, wherein the method further comprises: a state clustering device that aggregates states in the HMM topology to at least one of the HMM topologies The group state shares parameters, and for the state of the set of shared parameters, only one set of parameters is saved in the HMM topology. A system for recognizing handwritten East Asian characters using an HMM model, comprising: an HMM topology construction device, which constructs a multipath HMM topology corresponding to handwritten East Asian characters, each of which paths Corresponding to one of the various stroke sequences of the handwritten East Asian character, or each of the paths corresponds to one of a plurality of handwritten styles of handwritten East Asian characters; a continuous state setting device, setting a description in the HMM topology • Handwriting a continuous state of the East Asian character continuation stroke; - a corner state setting means for setting and describing a corner state of the handwritten East Asian character stroke between the HMM topologies; wherein each of the multipath HMM topologies The path is a left-to-right HMM topology, starting from an initial state to a termination state junction 60 200937308 bundle; the persistent state can be transferred to the next state or self-transition, and the state can only be transferred to the next state, and cannot be self-transferred; The state of the persistent state alternates in turn; and all paths start at the same entrance to the same The exit state ends; and one of the following two devices: a parallel state providing device that raises the state in the HMM topology, corresponding to the portion of the handwritten East Asian character that may be both a virtual stroke and a real stroke; The spatial probability distribution processing device, the multi-space probability distribution of the HMM topology, corresponds to a part of the handwritten East Asian character that may be either a virtual or a real stroke. 26. The system of claim 25, wherein the HMM topology construction device automatically constructs a multi-path HMM topology from handwritten East Asian characters; and the HMM topology construction device is based on handwritten East Asian characters Meta-order or writing style, using a machine self-learning automatic classification of the training materials for classification. The system according to claim 26, wherein the training material comprises a handwritten East Asian character handwriting sample, the pen comprising a handwriting sample of different stroke order or different handwriting style; the metatop topology construction device comprises: The corner and corner state, for a flat may be the application stroke, in: Training stroke method for: Trace sample HMM 61 200937308 One trace sample segmentation device, for each handwriting sample, divide it into several points according to the arc length a segment, the features on each segment are sequentially connected to obtain a feature of the handwriting sample; a feature extraction device that extracts a feature for each segment; a clustering device that performs the segment and feature Clustering, each cluster corresponds to a stroke order or a handwritten style; ❹ the HMΜ topology construction device determines one of the topologies corresponding to each stroke order or each handwritten style based on the data of each cluster The topology and initial parameters of the path. The system of claim 27, further comprising: a subsequence direction histogram vector establishing device, wherein the handwriting sample segmentation device performs a real stroke for each handwriting sample Connected to the virtual stroke, and divide the arc length after the real stroke and the virtual stroke are divided into several segments; each sub-segment determines a quantized directional feature, and each quantized directional feature corresponds to a predetermined angular range direction; the sub-sequence direction The histogram vector establishing means establishes a shape feature on each segment as a subsequence direction histogram, and each of the handwriting samples has a subsequence direction histogram vector. The system of claim 25, wherein the system further comprises: a path merging device that merges the paths in the multipath Η MM topology to control the number of paths. The system according to claim 29, wherein the path merging device: determining the amount of the training material corresponding to the path, deleting the path of the corresponding resource amount less than a predetermined value; A path similarity measure between every two paths in a multipath HMM topology. When the path similarity measure indicates that the two paths are sufficiently similar, the two paths are merged, wherein when two or more paths have path similarity metrics When these paths are sufficiently similar, merge the paths. 3 1 .如申請專利範圍第3 0項所述之系統,其特徵在於,該 路徑合併裝置路徑用 Kullback-Leibler差值表示相似度度 量,當Kullback-Leibler差值低於一預定值時,表示該路 徑足夠相似。 3 2 .如申請專利範圍第2 9項所述之系統,其特徵在於,還 包括: 一狀態聚類裝置,對該 HMM拓撲結構中狀態進行聚 類,使該HMM拓撲結構中的至少一組狀態共用參數,且 對於該一組共用參數的狀態,在HMM拓撲結構中只保存 63 200937308 一套參數。 3 3 .如申請專利範圍第3 2項所述之系統,其特徵在於: 該狀態聚類裝置計算該HMM拓撲結構中每兩個狀態 之間的狀態相似度度量,當狀態相似度度量表示兩個狀態 * 足夠相似時,使該兩個狀態共用參數;其中,當兩個以上 的狀態相互之間的狀態相似度度量都足夠相似時,使該這 〇 些狀態都共用參數。 3 4 .如申請專利範圍第3 3項所述之系統,其特徵在於: 狀態聚類裝置通過Kullback-Leibler差值表示狀態相似度 度量,當Kullback-Leibler差值低於一預定值時,表示該 狀態足夠相似。 64The system according to claim 30, wherein the path merging device path uses a Kullback-Leibler difference to represent a similarity metric, and when the Kullback-Leibler difference is lower than a predetermined value, The path is similar enough. The system of claim 29, further comprising: a state clustering device that clusters states in the HMM topology to make at least one of the HMM topologies The state shares parameters, and for the state of the set of shared parameters, only 63 200937308 sets of parameters are saved in the HMM topology. 3. The system of claim 3, wherein the state clustering device calculates a state similarity measure between each of the two states in the HMM topology, and the state similarity measure represents two When the states* are sufficiently similar, the two states share the parameters; wherein when the state similarity metrics of the two or more states are sufficiently similar to each other, the states are shared. 3: The system according to claim 3, wherein the state clustering device indicates the state similarity measure by the Kullback-Leibler difference, and when the Kullback-Leibler difference is lower than a predetermined value, This state is sufficiently similar. 64
TW97106872A 2008-02-27 2008-02-27 Method and apparatus for constructing HMM models for recognizing online eastern asian characters TW200937308A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
TW97106872A TW200937308A (en) 2008-02-27 2008-02-27 Method and apparatus for constructing HMM models for recognizing online eastern asian characters

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
TW97106872A TW200937308A (en) 2008-02-27 2008-02-27 Method and apparatus for constructing HMM models for recognizing online eastern asian characters

Publications (1)

Publication Number Publication Date
TW200937308A true TW200937308A (en) 2009-09-01

Family

ID=44867020

Family Applications (1)

Application Number Title Priority Date Filing Date
TW97106872A TW200937308A (en) 2008-02-27 2008-02-27 Method and apparatus for constructing HMM models for recognizing online eastern asian characters

Country Status (1)

Country Link
TW (1) TW200937308A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TWI685796B (en) * 2018-05-31 2020-02-21 國立中興大學 A method for character pattern recognition

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TWI685796B (en) * 2018-05-31 2020-02-21 國立中興大學 A method for character pattern recognition

Similar Documents

Publication Publication Date Title
Ranjan et al. Deep learning for understanding faces: Machines may be just as good, or better, than humans
Sener et al. Unsupervised learning and segmentation of complex activities from video
CN108960409B (en) Method and device for generating annotation data and computer-readable storage medium
US11398062B2 (en) Face synthesis
KR101312804B1 (en) Two tiered text recognition
US8977042B2 (en) Rotation-free recognition of handwritten characters
Ellis et al. Exploring the trade-off between accuracy and observational latency in action recognition
US9639746B2 (en) Systems and methods of detecting body movements using globally generated multi-dimensional gesture data
Kirac et al. Hierarchically constrained 3D hand pose estimation using regression forests from single frame depth data
JP2669583B2 (en) Computer-based method and system for handwriting recognition
Zeng et al. Soft-ranking label encoding for robust facial age estimation
TWI437448B (en) Radical-based hmm modeling for handwritten east asian characters
CN110232373A (en) Face cluster method, apparatus, equipment and storage medium
Ding et al. Profile HMMs for skeleton-based human action recognition
US6256410B1 (en) Methods and apparatus for customizing handwriting models to individual writers
CN112818670B (en) Segmentation grammar and semantics in a decomposable variant automatic encoder sentence representation
Zhao et al. Deep imitator: Handwriting calligraphy imitation via deep attention networks
WO2008104130A1 (en) Method and system for establishing hmm topology adapted to recognize hand-written east-asian character
Battistoni et al. AI at the edge for sign language learning support
Chen et al. Temporal hierarchical dictionary guided decoding for online gesture segmentation and recognition
Zhang et al. Identity-and-pose-guided generative adversarial network for face rotation
CN113408418A (en) Calligraphy font and character content synchronous identification method and system
TW200937308A (en) Method and apparatus for constructing HMM models for recognizing online eastern asian characters
CN115035463B (en) Behavior recognition method, behavior recognition device, behavior recognition equipment and storage medium
Vieira et al. Online human moves recognition through discriminative key poses and speed-aware action graphs