TW476033B

TW476033B - System and method for increasing word recognition rate

Info

Publication number: TW476033B
Application number: TW086117317A
Authority: TW
Inventors: Dai-Shuei He; Shr-Guang Tsai; Wei-Jiun Pan
Original assignee: Inventec Corp
Priority date: 1997-11-19
Filing date: 1997-11-19
Publication date: 2002-02-11

Abstract

A system and method for increasing the word recognition rate when words or documents are input through a handwriting tablet or a scanner. A recognition data storage device is added to a conventional word recognition system to store the data needed for word recognition and correction. The characters of each input word are recognized by glyph comparison. When the recognition result contains a non-existent word, the result is corrected by replacing characters or strings with the similar characters or strings stored in a correction rule index table. For each character in the word, replacement strings or characters can be looked up in the correction rule index table to form new words, and the most likely candidate among the new words is selected and displayed.

Description

5-1 Field of the Invention:

The present invention relates to a system and method for improving the word recognition rate, and in particular to a recognition system, and its method of operation, that uses glyph comparison and similar-letter substitution to improve the word recognition rate when words are entered on a handwriting tablet or documents are input through a scanner.

5-2 Background of the Invention:

Computers and related electronic products are now widely used in every industry and in daily life. Of all the ways a user communicates with a computer, the most important and most basic is to enter instructions or data with a keyboard or mouse. For text input, speech synthesizers have also been developed recently, and handwriting tablets and scanners are used as alternative text input tools. The recognition efficiency of text recognition devices, such as handwriting recognition devices and optical character recognition (OCR) devices, has therefore become a matter of great concern.

Conventional word recognition systems, such as English word recognition systems, either use spelling correction to recognize the input word (for example, the input "telefone" can be recognized as "telephone") or use the overall shape of the input word to find the closest word (the example given in the source is illegible). The required algorithms are therefore complex, recognition is slow, and the recognition rate is not high. Furthermore, because each person's handwriting and each scanner's quality differ, the input glyphs vary greatly, so recognizing words by spelling correction or by overall shape is not a suitable method. A system and method that can be used in handwriting recognition devices and optical character recognition devices to improve the word recognition rate is therefore needed to overcome the problems of the conventional technology.

5-3 Object and Summary of the Invention:

As described in the background above, conventional word recognition systems rely on spelling correction or overall-shape correction, which is both slow and complex. The present invention addresses these shortcomings by providing a system and method for improving the word recognition rate that overcomes the difficulties of the conventional technology.

The disclosed system and method add a recognition data storage device to a conventional word recognition device to store the data needed for glyph recognition and correction. Every letter of each input word is recognized by glyph comparison. When the recognition result of a word needs correction, the correction is performed by replacing letters or strings with the similar letters or strings stored in a correction rule index table. For the letters contained in each word, replacement strings or letters can be looked up in the correction rule index table to form new words, and the most likely candidate among the new words is then selected and displayed.

5-4 Brief Description of the Drawings:

The preferred embodiment of the present invention is described in more detail below with reference to the following figures:

Fig. 1 is a block diagram of the architecture of the system for improving the word recognition rate in the preferred embodiment;
Fig. 2 is a flowchart of the disclosed method for improving the word recognition rate;
Fig. 3 is a flowchart of the word correction operation;
Fig. 4 is a flowchart of single-letter correction of a word;
Fig. 5 is a flowchart of multi-letter correction of a word;
Fig. 6 is the single-letter correction rule index table used in the preferred embodiment;
Fig. 7 is the multi-letter correction rule index table used in the preferred embodiment;
Fig. 8A depicts an English word entered on the handwriting tablet;
Fig. 8B depicts the recognition result produced by the preferred embodiment for the input word of Fig. 8A;
Fig. 9 is a schematic diagram of the recognition distance between letters;
Fig. 10A depicts another English word entered by handwriting; and
Fig. 10B depicts the recognition result produced by the preferred embodiment for the input word of Fig. 10A.

5-5 Detailed Description of the Invention:

Fig. 1 depicts the architecture of the system for improving the word recognition rate in the preferred embodiment. The pen input device 102 receives the user's handwritten word. The input word is sent to the processing device 103 for recognition, and the recognition data required is read from the recognition information storage device 105. When the recognition result needs no correction, it is shown on the display device 104. When the recognition result needs correction, the correction data required is likewise read from the recognition information storage device 105 and sent to the processing device 103, which produces a correction result that is then shown on the display device 104. Transient information generated by the processing device 103 during recognition and correction is stored in the memory device 101.

The memory device 101 may be random access memory (RAM); the pen input device 102 may be a handwriting tablet; the processing device 103 includes a central processing unit (CPU); and the recognition information storage device 105 may use non-volatile memory such as read-only memory (ROM), programmable read-only memory (PROM), erasable programmable read-only memory (EPROM), or electrically erasable programmable read-only [...]

[...] and the center point of the letter is B, then the recognition distance between "c" and "l" is the length of the line segment AB. Accordingly, when the length of segment AB is greater than the tolerance limit, Fig. 9 shows two separate letters, "c" and "l"; when the length of segment AB is less than the tolerance limit, Fig. 9 shows a single letter "d".

In the disclosed method, when an English word needs correction, both a single-letter correction procedure and a multi-letter correction procedure are performed. Figs. 3, 4, and 5 show, respectively, an overview of the correction flow, the single-letter correction flow, and the multi-letter correction flow. Because every person's handwriting is different, and even the same person writes differently each time, recognition results vary greatly. The invention therefore classifies the letter combinations that handwritten words may produce, and from this classification builds a single-letter correction table and a multi-letter correction table.

In the single-letter correction table for English letters depicted in Fig. 6, every letter has at least one similar letter. For example, a handwritten "e" is often close in shape to "c" (and to a second letter that is illegible in the source), so the probability that it is recognized as one of those letters is relatively high. Similarly, the letter "d" may be recognized as "a", "u", or "q".

The multi-letter correction table lists single letters that resemble multi-letter strings. Such entries arise because users often write the strokes of a single letter too far apart, so that it is recognized as two English letters. In the multi-letter correction table depicted in Fig. 7, a recognized "cl" is often the result of a "d", "a", or "u" written too openly. Likewise, a "cc" in the recognition result may actually be "a", "u", or "d".

Fig. 3 is an overview flowchart of the correction of an English word. First, a word sequence is generated from the recognition result according to letter similarity and the collocation of adjacent letters (step 301). Single-letter correction (step 302) and multi-letter correction (step 303) are then performed on the word sequence to produce a sequence of candidate English words (step 304), and the English word with the highest probability is selected from the candidates and output (step 305). The single-letter and multi-letter corrections of steps 302 and 303 use the single-letter and multi-letter correction tables of Figs. 6 and 7, respectively, to generate the new word sequences.

The single-letter correction flow is shown in Fig. 4. It corrects the recognition result letter by letter until the last letter has been corrected (step 401). In step 402, the similarity rate of each letter is computed to decide whether the letter may have been misrecognized, and the letter is also checked for consistency with the letters adjacent to it. If the similarity rate of a letter is higher than a tolerance limit and its combination with the preceding and following letters conforms to the grammar rules, the flow proceeds directly to the next letter (step 406). If the similarity rate is lower than the tolerance limit, or the letter does not fit its neighbours, the single-letter correction table is queried for the letter's similar letters (step 403), the similar letters are used to generate new words (step 404), non-existent words are removed (step 405), and the next letter is corrected (step 406).

The multi-letter correction flow is shown in Fig. 5. The strings that need correction are first located in the recognition result (step 501), each string is replaced by its corresponding similar string (step 502) to generate new words (step 503), and all non-existent words are then removed from the new words (step 504).

Some examples illustrate the flow. In Fig. 8A, "data" is entered on the handwriting tablet, but the first "a" is written somewhat like a "d", so step 202 is very likely to recognize "data" as "ddta". Because "ddta" is not an existing English word, step 204 must be performed to correct the recognition result. The single-letter correction procedure of Fig. 4 is performed first. Step 402 finds that the second "d" has a low recognition rate, so new words are generated from the similar letters of "d". The similar letters of "d" are "a", "u", and "q", which yield the three new words "data", "duta", and "dqta". Of these, only "data" is an existing word, so the correction result is "data", which is output to the user as shown in Fig. 8B.
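The single-letter correction flow of Fig. 4, applied to the "ddta" example, can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the function name `single_letter_correct`, the per-letter confidence scores, the tolerance value, and the small `LEXICON` are hypothetical, and only the table entry for "d" comes from the worked example in the text. The check of each letter against its neighbouring letters (part of step 402) is omitted.

```python
# Hypothetical excerpt of the single-letter correction table (Fig. 6).
# Only the entry for "d" is taken from the worked example in the text.
SINGLE_LETTER_TABLE = {"d": ["a", "u", "q"]}

# Stand-in dictionary (assumption); a real system would use a full word list.
LEXICON = {"data", "date", "double"}


def single_letter_correct(recognized, confidences, tolerance,
                          table=SINGLE_LETTER_TABLE, lexicon=LEXICON):
    """Return existing words reachable by replacing one low-confidence letter."""
    candidates = []
    # Step 401: walk through the recognition result letter by letter.
    for i, (letter, confidence) in enumerate(zip(recognized, confidences)):
        # Step 402: a letter above the tolerance limit is treated as reliable
        # (the neighbouring-letter consistency check is not modelled here).
        if confidence > tolerance:
            continue  # step 406: go on to the next letter
        # Step 403: look up the similar letters of the doubtful letter.
        for similar in table.get(letter, []):
            # Step 404: build a new word with the similar letter substituted.
            new_word = recognized[:i] + similar + recognized[i + 1:]
            # Step 405: keep the new word only if it actually exists.
            if new_word in lexicon:
                candidates.append(new_word)
    return candidates


# The second letter of "ddta" is given a low (made-up) confidence score.
print(single_letter_correct("ddta", [0.9, 0.4, 0.9, 0.9], tolerance=0.6))
# prints ['data']
```

In this sketch the three substitutions produce "data", "duta", and "dqta", and the lexicon filter leaves only "data", which matches the outcome described for Fig. 8B.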

As a further example, suppose the word "double" is entered on the handwriting tablet as in Fig. 10A, but the "d" is written with its strokes apart, so the recognition result produced in step 202 is "clouble". Because "clouble" is not an English word, the correction procedure of step 204 must be performed. The single-letter correction procedure (step 302) cannot produce a correct result, so the result of the multi-letter correction procedure (step 303) is used as the output. In multi-letter correction, "cl" is looked up in Fig. 7 and found to correspond to the similar letters "d", "a", and "u", which yield the three new words "double", "aouble", and "uouble". Of these, only "double" is a correct word, so "double" is output as the corrected result, displayed as shown in Fig. 10B.

Clearly, the invention needs only three letter substitutions on the handwritten word of Fig. 8A (replacing "d" with "a", "u", or "q") to obtain the correction result "data". The invention also obtains the correct result "double" for the handwritten input of Fig. 10A, so the recognition rate is greatly improved.

In summary, the disclosed device and method for improving the word recognition rate perform recognition and correction by glyph comparison and similar-letter substitution. When the recognition result of an input word needs correction, the input word is compared with the glyphs stored in the correction rule index table. For the letters contained in each input word, replacement strings or letters can be found in the correction rule index table to form new words, and the most likely candidate among the new words is then determined and displayed.

The above is only a preferred embodiment of the present invention and is not intended to limit the scope of the claims. All equivalent changes or modifications made without departing from the spirit disclosed by the invention are intended to fall within the scope of the claims below.
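The multi-letter correction flow of Fig. 5, applied to the "clouble" example, can be sketched in Python as follows. This is a minimal sketch under stated assumptions rather than the implementation described by the patent: the function name `multi_letter_correct` and the small `LEXICON` are hypothetical, and the table holds only the "cl" entry quoted in the text. The final choice of the most probable candidate (step 305) is not modelled.

```python
# Hypothetical excerpt of the multi-letter correction table (Fig. 7).
# Only the entry for "cl" is taken from the worked example in the text.
MULTI_LETTER_TABLE = {"cl": ["d", "a", "u"]}

# Stand-in dictionary (assumption); a real system would use a full word list.
LEXICON = {"data", "double"}


def multi_letter_correct(recognized, table=MULTI_LETTER_TABLE, lexicon=LEXICON):
    """Return existing words obtained by merging a multi-letter string into one letter."""
    candidates = []
    for pattern, replacements in table.items():
        # Step 501: locate every occurrence of a string that may need correction.
        start = recognized.find(pattern)
        while start != -1:
            end = start + len(pattern)
            # Step 502: replace the string with each similar single letter.
            for single in replacements:
                # Step 503: form the new word.
                new_word = recognized[:start] + single + recognized[end:]
                # Step 504: discard words that do not exist.
                if new_word in lexicon and new_word not in candidates:
                    candidates.append(new_word)
            start = recognized.find(pattern, start + 1)
    return candidates


print(multi_letter_correct("clouble"))
# prints ['double']
```

Here the three substitutions give "double", "aouble", and "uouble", and only "double" survives the lexicon filter, as in Fig. 10B. Keeping the table as a plain mapping from string to replacement letters mirrors the index-table lookup the description relies on, but any equivalent structure would serve.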

Claims

476033 Department of the year // month / positive charge Α8 Β8 C8 D8 VI. Application for patent scope # Λ7νΓ; Λ ·, 'ν .'Γ.ν'ν Printed by the Consumer Cooperatives of the Intellectual Property Bureau of the Ministry of Economic Affairs 1. A way to improve the word recognition rate Method, the method includes at least the following steps: inputting a word; generating a recognition result by comparing the words with a glyph; when the recognition result contains a non-existent word, using the glyph comparison with a similar glyph Instead, the recognition result is corrected to generate a correction result; and the word contained in the correction result is displayed. 2. For the method of claim 1 in the scope of patent application, wherein the above step of generating the recognition result is generated by letter-by-letter comparison of the letters contained in the word. 3. The method according to item 1 of the scope of patent application, wherein the step of correcting the recognition result by using glyph comparison and similar glyph replacement includes at least the following steps: Performing correction on the recognition result by means of single-letter verbatim replacement. ; And using a single letter instead of a multi-letter string to perform correction on the recognition result. 4. The method according to item 2 of the scope of patent application, wherein the above-mentioned step of using the single-letter verbatim instead of performing the correction on the recognition result includes at least the following steps. 12 This paper size applies the Chinese National Standard (CNS) A4 specification (210 X 297 mm) (Please read the notes on the back before filling out this page) -------- Order · ------- »476 Hope to amend and add A8 B8 C8 D8 | .Ministry of Economic Affairs! 
Scope of patent application When the similarity rate of each it early mother is higher than a tolerance limit (Tolerance), and now, there is no inconsistency between ± 礅 early mother and the single letter connected to the front and back, lose \ Xi lift, edge π field, * ^ ^ perform the eighth μ consciousness every time ... If the next letter of the army, continue to perform Λ 利 # 时-子 instead of performing the step of correcting the recognition result; use this Zhuofeng mother ^ Shi ^ new words; Like the mother and mother replace the single letter, used to generate renunciation to remove all non-existent in the new single dog to correct the next single /, and give the mother to correct. 5 · If the patent applicant is poor, siege. The method of the above item, wherein the step of using the single letter to replace the multi-letter string material 2. ^ The step of performing correction by Kono's recognition result includes at least the following steps: Find the multi-character that needs to be corrected based on the recognition result + #% ^ ^ Alphabetic string; replace the multi-word: whole d > with the single letter to generate a plurality of new words; and ^ remove the new word $ 中户 kg if the word does not exist. 6. The method of claiming item 5 of the patent, wherein the above is used to replace the multi-letter string and the single letter. When the single letter is broken into a plurality of parts, it will become a similar word of the multi-letter string. string. 7 · The method according to item 1 of the scope of patent application, in which the identification knot is generated as above 13 The paper size is applicable to the Chinese National Standard (CNS) A4 specification (210 X 297 mm) — I1IIIIIIIII · 1 IIIIII «ΙΙΙΙΙ! — — (Please Please read the notes on the back before filling this page) 476083% year ((P ”it supplement 丨 A8 B8 C8 D8 Printed by the Intellectual Property Bureau Employees Consumer Cooperatives of the Ministry of Economic Affairs 6. 
Printing patent scope steps, including when the identification result exists When the word is displayed, the step of presenting the word is displayed. 8 — A method for improving the recognition rate of the word, the method includes at least the following steps: input a word; compare the word with a glyph to generate a recognition result; when the When the recognition result includes a word, the method of improving the recognition rate of the word is ended after displaying the existing word; the recognition result is corrected by using a single-letter word-by-word replacement method; a single-letter word is used to replace a multi-letter string, Perform correction on the recognition result; and display the word contained in the correction result. The method of item 8, wherein the step of generating the recognition result is generated by comparing the letters contained in the word letter by letter. 1 0. The method of item 8 in the scope of patent application, wherein the above uses The one-letter verbatim replaces the step of performing correction on the recognition result, including at least the following steps: When the similarity rate of each one-letter is higher than a tolerance limit, and the one-letter is connected to the one-to-one When there is no inconsistency between the letters, enter the single letter below the recognition result to continue to execute the 14 paper standards applicable to China National Standard (CNS) A4 (210 X 297 mm) — III ----- -------- II. I — — — — — — — (Please read the notes on the back before filling this page) 476㈣修, π: " 介补天 Α8 Β8 C8 D8 六、 Scope of patent application Trouble reading and reading the second pan-educator &) rfr 1 year quasi-performing the single letter verbatim instead of performing the step of correcting the recognition result; replacing the single letter with a similar letter of the single letter to generate multiple new ones Word All non-existent words in the new word; and correction of the next single letter. 
1 1. The method of claim 8 in the scope of the patent application, wherein the single letter is used instead of the multi-letter string to perform the recognition result The step of correcting includes at least the following steps: finding the multi-letter string to be corrected in the recognition result; replacing the multi-letter string with the single letter to generate a plurality of new words; and removing all non-words in the new word The word exists. Printed by the Consumer Cooperatives of the Intellectual Property Bureau of the Ministry of Economic Affairs 12. If the method of applying for item 10 of the patent scope, wherein the above is used to replace the single letter of the multi-letter string, when the single letter is broken into multiple parts Will become a similar string of the multi-letter string. 1 3. A device for improving word recognition rate, the word recognition rate raising device includes at least: an input device for inputting a word; a recognition information storage device for storing identification information and correction information; a processing device for inputting the word and The identification information is used to make 15 words on this word. Paper size is applicable to Chinese National Standard (CNS) A4 (210 X 297 mm). — ^ ---- 1 ------------- Order * -------- (Please read the notes on the back before filling this page), apply for patent range recognition to generate a recognition result, and use the input of the word, the recognition result and the correction information to generate a correction Results; and recognize the display with g. Use the results to verify the results, identify the inputs, and input the results. If the results are displayed, verify the inputs. Members are requested to indicate whether there is any change in the amendments mentioned in the year and month. The styles of the birth and the birth of the students are as follows. 
The Fang Zhi's generation rate is based on the recognition of Shan Shan's word recognition. Contrast (please read the precautions on the back before filling this page) Prescribing the new words as similar to the contrasting glyphs, and using the Li-Fan model for the Liguo specialization, please apply to the school for more details. Take the item of the promotion of the form of the word recognition 3 11 The item of the promotion of the form of the recognition of the word 3 3 i—x Fan Li special request for application 6 If the word list in production does not contain a package Fruiting should be identified in the department of fruiting. According to the Ministry of Economic Affairs, Intellectual Property Bureau, Employees' Cooperatives, Printed Agriculture 17. If the device for improving the word recognition rate of item 13 of the patent application scope, the above device for improving the word recognition rate further includes a storage device for storing the improved word recognition Rate information generated during the identification process. 18. If the device for improving the word recognition rate according to item 13 of the patent application, the input device described above may be a handwriting tablet. 16 This paper size applies the Chinese National Standard (CNS) A4 specification (210 X 297 mm) 4760 Ή month 修正 f amends and supplements A8 B8 C8 D8 patent application scope 19. If the word recognition device of item 13 of the patent scope is applied, where The above input device may be a scanner. 
20. The device for improving word recognition rate of claim 13, wherein the recognition information storage device is composed of the following memory: read-only memory (ROM); programmable read-only memory (Programmable ROM, PROM); erasable programmable read-only memory (Erasable PROM, EPROM); electrically erasable programmable read-only memory (Electrically EPROM, EEPROM); and flash memory.

21. A device for improving word recognition rate, comprising at least: an input device for inputting a word; a recognition information storage device for storing recognition information and correction information; a processing device that takes the word and the recognition information as input, performs recognition on the word by means of glyph comparison and similar-glyph substitution to generate a recognition result, and, when the recognition result includes a word that does not exist, takes the word, the recognition result, and the correction information as input to generate a correction result; and a display device that takes the recognition result as input to display the recognition result and takes the correction result as input to display the correction result.

22. The device for improving word recognition rate of claim 21, wherein the correction result is generated by means of glyph comparison and substitution with similar glyphs.

23. The device for improving word recognition rate of claim 21, further comprising a storage device for storing transient information generated by the device during the recognition process.

24. The device for improving word recognition rate of claim 21, wherein the input device may be a handwriting tablet.

25. The device for improving word recognition rate of claim 21, wherein the input device may be a scanner.

26. The device for improving word recognition rate of claim 21, wherein the recognition information storage device is composed of the following memory: read-only memory (ROM); programmable read-only memory (Programmable ROM, PROM); erasable programmable read-only memory (Erasable PROM, EPROM); electrically erasable programmable read-only memory (Electrically EPROM, EEPROM); and flash memory.
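The multi-letter correction recited in claim 11, gated by the non-existent-word condition of claim 21, might be sketched roughly as follows. This is only an illustration under stated assumptions, not the patented implementation: the rule table, the word list, and the function names `correct_multi_letter` and `recognize_and_correct` are hypothetical, and the sample rules ("rn" to "m", "cl" to "d", "vv" to "w") are invented examples of a single letter that resembles a multi-letter string when broken into parts.

```python
# Hypothetical rule table (not taken from the patent): each multi-letter string
# maps to single letters that resemble it when broken into several parts.
MULTI_LETTER_RULES = {"rn": ["m"], "cl": ["d"], "vv": ["w"]}

# Hypothetical word list standing in for the stored recognition information.
LEXICON = {"modern", "clear", "dear", "word"}


def correct_multi_letter(result, rules=MULTI_LETTER_RULES, lexicon=LEXICON):
    """Replace one multi-letter string at a time with a similar single letter
    and keep only the new words that exist in the lexicon."""
    candidates = set()
    for multi, singles in rules.items():
        # Find every occurrence of the multi-letter string in the result.
        start = result.find(multi)
        while start != -1:
            for single in singles:
                # Build a new word by substituting this one occurrence.
                new_word = result[:start] + single + result[start + len(multi):]
                candidates.add(new_word)
            start = result.find(multi, start + 1)
    # Remove all new words that do not exist.
    return sorted(word for word in candidates if word in lexicon)


def recognize_and_correct(result, rules=MULTI_LETTER_RULES, lexicon=LEXICON):
    """Return the recognition result and, only when it is not an existing
    word, the list of corrected candidates to display alongside it."""
    if result in lexicon:
        return result, []
    return result, correct_multi_letter(result, rules, lexicon)


if __name__ == "__main__":
    print(recognize_and_correct("rnodern"))
    print(recognize_and_correct("clear"))
```

The sketch substitutes a single occurrence per candidate and does not rank the surviving words; choosing the most likely candidate among them would need the recognition distances described elsewhere in the specification.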