TW476033B - System and method for increasing word recognition rate - Google Patents

System and method for increasing word recognition rate Download PDF

Info

Publication number
TW476033B
TW476033B TW086117317A TW86117317A TW476033B TW 476033 B TW476033 B TW 476033B TW 086117317 A TW086117317 A TW 086117317A TW 86117317 A TW86117317 A TW 86117317A TW 476033 B TW476033 B TW 476033B
Authority
TW
Taiwan
Prior art keywords
word
recognition
letter
correction
recognition result
Prior art date
Application number
TW086117317A
Other languages
Chinese (zh)
Inventor
Dai-Shuei He
Shr-Guang Tsai
Wei-Jiun Pan
Original Assignee
Inventec Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Inventec Corp filed Critical Inventec Corp
Priority to TW086117317A priority Critical patent/TW476033B/en
Application granted granted Critical
Publication of TW476033B publication Critical patent/TW476033B/en

Links

Abstract

Provided is a system and method for increasing word recognition rate for increasing word recognition rate when inputting words or documents by the handwriting board or the scanner. It adds a recognition data storage device into the known word recognition system for storing necessary related data of word recognition and correction. Characters which each input word includes use method of word comparison for recognizing. When the recognition results of input words include un-existed words, it corrects the recognition results. Words which the recognition results included will proceed correction with words which the correction rule index table is stores by means of replacement of similar characters or stings. Characters which each word included is able to find out replaced strings or characters in the correction rule index table to form new words and find out the mostly candidate words as new ones to display it.

Description

476033 A7 B7 經濟部中央標準局員工消費合作社印製 五、發明説明( 5-1發明領域: 本發明係關於一種用以提昇單詞辨識率《系統與方 法,特别是指可於手寫板輸入單詞或於#描器(Scanner)輸 入文件時,利用字形比對與相似字母取代的方式’來提昇 單詞辨識率之辨識系統與其運作方法。 5-2發明背景: 目前的電腦與電子相關產品已廣泛的應用在各種產業 以及人們的曰常生活之中,而所有的使用者與電腦間之溝 通媒介中,最重要亦爲最基本的方法即利用鍵盤或滑鼠, 來輸入相關的指令或資料給電腦。此外,在文字輸入方面, 近來亦發展語音合成器,以及運用手寫板或掃描器來作爲 另一種文字輸入工具,因此丰寫辨識裝置或光學符號辨識 (Optical Character Recognition,OCR)裝置等文字辨識裝冒 的辨識效率,便成爲一極受關切的課題。 然而習知的單詞辨識系統,諸如英文單詞辨識系統等, 其使用單詞的拼音枱正來辨識所輸入的軍Ύ ’ u 、w早€ (例如輸入 “telefone”時,可辨識爲 “telephone”),或以於、 、 ’ 4以輸入早詞之整許 字形來取得一最相近的單詞(例如“ 4 a uuta將被辨視良 “dute”),故所需的演算法不僅複雜,辨 馬 丹硪連度慢且辨識 本紙張尺度適用中國國家標準(CNS ) A4規格(210X297公釐 (請先閱讀背面之注意事項再填寫本頁} -裝 '發明説明( A7 B7 經濟部中央標準局員工消費合作社印製 =高。再者’由於每個人所手寫的字形或掃描器品質的不同,將使輸入的字形產斗枋丄沾至田 7 # τ . ^ ^ ^ ^ 產生極大的差異性,所以利用拼音 校正或整體字形來辨識娇二 辫謐古达^ 的早巧’便不是-個合適的 于識方法。因此’函 Φ _接A匕夕A* /土 m 學斿妹種能夠使用於手寫辨識裝置與光 號辨識裝“中,用以提昇單字辨識率之系統與方法, 用以克服習知技術所面臨的問題。 、、 S_3發明目的及概述: 鑒於上述之發明背景中,傳統的單詞辨識系統使 的拼音或整體字形校正來辨識 度俨且雜,· w 早詞,不僅辨識速 2: 而,本發明即針對上述缺點,提出-用 以提昇單字辨識率之手统盘 臨的難題。 系統…’用以克服習知技術所面 本發明所揭露的用以提昇單詞辨識率m *方法,在 宮知的單詞辨識裝置中增加-辨識資料儲存裝置…儲 存字形辨識與校正所需的相關資料十個輸入單詞的字 母皆利用字形比對的方式進行辨識,而在需要對單詞之辨 識結果進行校正時’則將該單詞與校正規則索引表所儲存 的字形,&相似字母或字串取代的方式來執行,而每個單 詞所包含的字’,皆可在校正規則索引表找出替代序列或 子母以組成新單詞,再由該新單詞中尋找一最可能之候選 表纸張尺度適用悄’標準(CNS )八4規格(210Χ297公釐 —-— (請先閲讀背面之注意事項再填寫本頁) •裴· 、=%» Μ 476033 Λ7 B7 五、發明説明( 經濟部中央標隼局員工消費合作社印製 單詞並顯示之。5-4圈式簡單説明: 本發明的較佳實施例將於往後之説 圖形做更詳細的闡述: 第一圖爲本發明較佳實施例中,提昇單 方塊圖; 第二圖爲一流程圖,描述本潑明所揭露之提昇單詞辨識率的 方法; 第三圖描繪對單詞進行校正之操作流程圖; 第四圖描述對單詞進行單字母校正之操作流程圖; 第五圖描繪對單詞進行多字母校正之操作流程圖; 第六圖爲本發明較佳實施例中所使用的單字母校正規則索 引表; , 第七圖爲本發明較佳實施例中所使用的多字母校正規則索 引表; 第八A圖描繪於手寫板輸入一芩文單字之情形; 第八B圖描繪以第八a圖爲輸·入單詞,於本發明較佳實施 例中所產生之辨識結果; 第九圖係字母之辨識距離的示意圖; 第十A圖描繪於手寫輸入另一個一英文單字之情形;及 第十B圖描繪以第十a圖爲輸入單詞,於本發明較佳實施 明文字中輔以下列 詞辨識率系統的架構 (請先閲讀背面之注意事項再填寫本頁) 裝· -7 、! 本紙張尺度顧 476033 、發明説明( 例中所產生之辨識結果 5_5發明詳細説明: 第一圖描繪本發明較佳會> μ 士 ^ 的加搂、 父佳實他例中,提昇單詞辨識=了 =,其Γ筆式輸入裝置102輸入使用 ”·* °所輸入的單^被送,往處理裝置1 03中加以 理’所需的辨識資料將由辨識資訊儲存裝^ 读 :辨:結果不須校正之時:’該辨識結果將被顯示: 置.104中。當辨識結果需要校正時,該辨識結果所 正資料,亦由辨識資訊儲存裝置105讀出,並送往 置1〇_3做處理以產生校正結果,最後亦將該校正= 於顯示裝置104中。此外,處理裝置1〇3在辨識與 程中所產生的暫態資訊則儲存於記憶裝置i 〇丨之中 率系统 者之手 辨識處 出。當 顯示裝 需的校 處理裝 果顯示 校正過 (請先閱讀背面之注意事項再填寫本頁〕 -裝· m -丁, 經濟部中央標準局員工消費合作社印製 上述之記憶裝置1〇1可以是隨機存取記憶體(Rand〇m Access Memory,RAM);筆式輸入裝置1〇2可以是一手寫 板;處理裝置1〇3則包含一中央處理單元entw Processing Unit,CPU);辨識資訊儲存裝置1〇5則可以利用 非揮發性記憶體(Non-volatile memory),諸如唯讀記憶體 (Read Only Memory, ROM)、、可程式化唯讀記憶體 (Programmable ROM,PROM)、可清除式可程式化唯讀纪憶 體(Erasable PROM, EPROM)、電子可清除式可程式化唯讀 本紙張尺度適用中國國家標準(CNS ) A4規格(210X 297公釐) 476033 A7476033 A7 B7 Printed by the Consumer Cooperatives of the Central Standards Bureau of the Ministry of Economic Affairs V. Invention Description (5-1 Field of Invention: The present invention relates to a system and method for improving the word recognition rate, in particular, it can input words on a tablet or When inputting a file with #Scanner (Scanner), a glyph comparison and the replacement of similar letters are used to improve the recognition system and operation method of word recognition. 5-2 Background of the Invention: The current computer and electronic related products have been widely used. It is used in various industries and people ’s daily lives. The most important and most basic method of communication between users and computers is to use the keyboard or mouse to enter relevant instructions or information to the computer. In addition, in terms of text input, speech synthesizers have also recently been developed, as well as the use of a handwriting tablet or scanner as another text input tool. Therefore, text recognition such as text recognition or optical character recognition (OCR) devices The recognition efficiency of pretense has become a subject of great concern. However, the conventional word recognition system Such as the English word recognition system, which uses the pinyin of the word to identify the military input 'u, w as early as € (for example, when you type "telefone", you can recognize it as "telephone"), or use,,' 4 Use the full glyph of the early word to get a closest word (for example, "4 a uuta will be recognized as a" dute "), so the required algorithm is not only complicated, but the recognition rate is slow and the recognition text The paper size applies the Chinese National Standard (CNS) A4 specification (210X297 mm (please read the precautions on the back before filling out this page)-Installation of the invention description (A7 B7 Printed by the Consumer Standards Cooperative of the Central Standards Bureau of the Ministry of Economic Affairs = High. Because the quality of each person ’s handwritten glyphs or scanners will be different, the input glyphs will have a great difference to Tian 7 # τ. ^ ^ ^ ^ Identifying Zao Qiao of Jiao Erzha 谧 Gu Da ^ is not a suitable method of understanding. Therefore, the function of Φ __A 夕夕 A * / 土 m 学 斿 妹 species can be used for handwriting recognition devices and light number recognition Installed "medium for promotion The system and method of word recognition rate are used to overcome the problems faced by the conventional technology. The purpose and summary of the invention: In view of the above background of the invention, the traditional word recognition system uses pinyin or overall glyph correction to recognize the degree. And miscellaneous, · w Early words, not only identifying speed 2: However, the present invention addresses the above-mentioned shortcomings, and presents the problem of hand-made control to improve the recognition rate of single words. The system ... 'is used to overcome the conventional technology. The method for improving the word recognition rate m * disclosed in the invention is to add a recognition data storage device to the word recognition device of Gongzhi ... to store the relevant data required for glyph recognition and correction. The letters of the ten input words are compared using glyphs. When the recognition result of a word needs to be corrected, 'the word is replaced with the glyph stored in the correction rule index table, & similar letters or strings are executed, and each word contains "Words" can be found in the correction rule index table to find alternative sequences or sons and daughters to form a new word, and then find the most likely candidate from the new word Paper size applies to the standard (CNS) 8 4 specifications (210 × 297 mm —-— (please read the precautions on the back before filling out this page) • Bae ·, =% »Μ 476033 Λ7 B7 5. Invention Description (Economic The Ministry of Standards and Standards Bureau employee consumer cooperatives printed the words and displayed them. 5-4 circle type brief description: The preferred embodiment of the present invention will be explained in more detail in the following: The first figure is a single block diagram of the upgrade in the preferred embodiment of the present invention; the second figure is a Flowchart describing the method for improving the word recognition rate disclosed by Ben Poming; the third figure depicts the operation flowchart of correcting words; the fourth figure depicts the operation flowchart of single-letter correction of words; the fifth figure depicts the words Operation flowchart for performing multi-letter correction; The sixth figure is a single-letter correction rule index table used in the preferred embodiment of the present invention; and the seventh figure is the multi-letter correction rule index used in the preferred embodiment of the present invention The eighth picture A depicts the case where a single word is entered on the tablet; the eighth picture B depicts the recognition result produced in the preferred embodiment of the present invention by using the eighth picture a as the input word; the ninth The figure is a schematic diagram of the recognition distance of letters; Figure 10A depicts the case where another English word is input by handwriting; and Figure 10B depicts the input word using Figure 10a as the input word in the preferred embodiment of the present invention. Use the following words to identify the architecture of the rate system (please read the precautions on the back before filling this page). -7,! This paper scale Gu 476033, invention description (recognition results produced in the example 5_5 Detailed description of the invention: First The figure depicts the best case of the present invention. In the example of the plus and minus ^ of the present invention, the word recognition is promoted ===, and the Γ pen type input device 102 uses the input form "** °" to be sent , Processing to the processing device 103, 'required identification data will be stored by the identification information ^ read: identification: when the result does not need to be corrected:' the identification result will be displayed: set in .104. When the identification result needs to be corrected At this time, the data corrected by the recognition result is also read out by the recognition information storage device 105 and sent to 10_3 for processing to generate a correction result. Finally, the correction is also stored in the display device 104. In addition, the processing device The transient information generated during the identification and process is stored in the memory device i 〇 丨 the identification of the system's hands. When the required correction is displayed, the display is corrected (please read the back first) Note for refilling This page]-Install · m-D. The above-mentioned memory device 101 printed by the Consumer Cooperatives of the Central Standards Bureau of the Ministry of Economic Affairs may be random access memory (Random Access Memory, RAM); pen input device 1〇 2 can be a tablet; the processing device 10 includes a central processing unit (entw Processing Unit, CPU); the identification information storage device 105 can use non-volatile memory, such as read-only Memory (Read Only Memory, ROM), Programmable ROM (PROM), Erasable PROM (EPROM), Erasable Programmable The size of this paper is only applicable to China National Standard (CNS) A4 (210X 297 mm) 476033 A7

裝 訂· 476033 A7 B7 五、發明説明( 字的中心點爲B,則“c”與“i,,的辨識距離將爲線段的長 經濟部中央榡準局員工消費合作社印製 度。因此,當線段AB的長度較容差限度爲大時,表示第九 圖所不的爲“ c ”與‘‘ 1 ’’兩個分開的字母;而當線段AB的長度 較谷差限度爲小之時,表示第九圖所示的應爲單一的“ d,,字 母。 /在本發明所揭露的方法 '中,當英文單詞需要校正時,需 執行單字母校正與多字母莩·正之程序,而第三、四、五圖 則^别描述校正流程之總觀、以及單字母與多字母之校正 流私此外,由於每個人所寫出的字體皆不相同,甚至每 次書寫的字體亦不盡相&,因此所產生的辨識結果亦有極 大的變化。故本發明料 對所有手寫單詞所可能產生的字母組 合加以分類,並進一每、 , 、八& 梦產生早竽母校正表與多字母校正表 之分類表。 t 以第六圖所描繪的念^ /rfi 六_、 央又子母之單字母校正表而言,每個 在早字母校正表中的罩 早予母皆至少包含一個相似竽母,例 如手寫的字母“e”往并也、 ^ 被猻u , 饪與竽母“c”、,,相近,所以在辨識時 被辨硪爲“c”、“丄,,的 . 可沪、+ 1 ]幾率就,相對提高;同樣的,字母“d,,亦 了此破辨識成“a”、‘‘υ,, ^ ςς 口 靼客& ^ , 、或g” 6此外,多字母校正表係指 興多竽母字串相似的單 , 往把〜個單字母寫~早子母,其形成原因即在於使用者往 *件較開,以致吟被辨識爲兩個英文字 本紙張尺^ 210X297 公釐) C请先閱讀背面之注意事項#填寫本頁) •装' 訂 476033 五、發明説明( 母。以第七圖所描繪的英文 辨識的“cl”往往是將“d,,、“γ^‘字母校正表而言,所 r--------—· {請先閲讀背面之注意事項再填寫本頁j 以在辨識時就可能被辨識ci,^ u字寫太開的結果,所 “CC”也有可能是“a”、“ 同樣的,辨識結果中的 u 、或 d” 〇 第三圖描繪對英文單詞杆 於辨識結果中,依據字母相似;^總觀流程圖,首先將 詞序列(步驟301),接著再對該^詞=後子母=搭配產生單 驟3 02)與多字母校正(步驟· °列執仃單字母校正(步 早詞序列(步驟304),並於咳候選 座候選 文軍™山 1中挑選機率最大的英 又早Η爲候選子並輸出之(步骤3 挪,Λ 〇 驟305)。此外,步驟302與步 驟3 03之單字母與多字母之枋 ^ 签二固&欠 炙校正,即分别運用描繪於 弟六圖與第七圖單字母盥多李签分 、 列。 …母校正表來產生.新單詞序 經濟部中央標準局員工消費合作社印製 單字母校正流程係利用第四圖來描述,該單字母校正流 2係對辨識結果以逐字校正的方式進行,一直到最後一個 字母被校正爲止(步驟401)。在步驟402中,除了計算每個 字母的相似率,以決定該字母是否可能被辨識錯誤之外, 亦將檢查該字母是否與前後連接之字母產生不相符之情 形。當然,若一字母之相似率高务一容差限度(T〇lerance), 而且4冬母與前後字母之搭配皆符合文法規則時,將直接 進行下一個字母之校正(步驟406)。而當相似率低於該容差 本紙張尺度適用中國國家標準(CNS ) A4規格(210X297公釐) 經濟部中央標準局員工消費合作社印製 、發明説明( ,度或前後字母不搭配時,將查詢—單字母校正表,以承 孩字母之相似字母(步骤403),隨後再利用該相似字母產 =新單詞(步驟4〇4),並於除去不存、在的單詞後(步驟4〇5), 野下一個字母進行校正(步骤4〇6)。 多字母校正流程係利用第五圖來描述,首先將於辨識处 果中找尋需校正的字串(步驟、501),並將該字串以相對應: 相似字串取代之(步驟5〇2)·,進而產生一些新單詞(步驟 503) ,隨後並於新單詞中、除去所有不存在的 504) 。 叫V歹綠 謹列舉一些實例以對上述之流程做進一步之説明。以 八A圖爲例,其係在手寫板上輸入“ d a t a,,,但是第一個‘‘ 字被寫得有點像“d”,所以在步驟202極可能將“data,,辨 爲“ddt a”,其爲不存在之英支單詞,所以需執行步驟2 以對辨識結果“ d d t a ”進行校正。首先進行的爲第四圖所 的單字母校正程序,其將在步骤402中發現第二個‘‘d,,之 識率較低,所以將以“d”的相似字母來產生新單詞,由於“ 的相似字母有“a,,、“u”、或“q”來產生“data”、‘‘duta,,、 “dqta”三個新單詞,由於在上述三個新單詞僅‘‘data,,爲 正存在的單詞,所以校正結果即爲“ d a t a ”,並將校正名士 “data”輸出給使用者,一如第八B圖之所示。 本紙張尺度適用中國國家標準(CNS ) A4規格(210X297公釐) (請先閱讀背面之注意事項再填寫本頁) -裝· 訂 斗/()033 經濟部中央標準局員工消費合作社印製 、發明説明(Binding · 476033 A7 B7 V. Description of the invention (The center point of the character is B, then the recognition distance between "c" and "i," will be the line segment of the long-term Ministry of Economic Affairs Central Procurement Bureau employee consumer cooperative stamp system. Therefore, when When the length of the line segment AB is larger than the tolerance limit, it means that "c" and "1" are two separate letters that are not shown in the ninth figure; and when the length of the line segment AB is smaller than the valley limit, It means that the letter "d," shown in the ninth figure should be single. / In the method disclosed in the present invention, when the English word needs to be corrected, a single-letter correction and multi-letter 莩 · zheng procedure must be performed. Three, four, and five plans ^ Don't describe the general overview of the correction process, and the correction flow of single-letter and multi-letter corrections. In addition, because each person writes different fonts, even each time they write different fonts &, the recognition result produced by it has also changed greatly. Therefore, the present invention is to classify the combinations of letters that can be generated by all handwritten words, and add each one,,, eight & dream generation early mother correction table and more Classification of letter correction table For the single letter correction table described in the sixth figure, / rfi 六 _, central and mother-in-law, each mask in the early letter correction table contains at least one similar mother, for example The hand-written letter "e" goes and also, ^ is 狲 u, cooking is similar to the aunt "c" ,,, so it is recognized as "c", "丄 ,," in recognition. Ke Hu, + 1 ] The probability is relatively increased; similarly, the letter "d," is also identified as "a", "υ ,," ^ ς 口 靼 客 & ^,, or g "6 In addition, the multi-letter correction table Refers to a single with a similar Xingduo mother string, write ~ a single letter ~ early son, the reason for the formation is that the user is more open to the *, so that the chant is recognized as two English characters paper ruler ^ 210X297 mm) C Please read the notes on the back #Fill this page) • Binding '476033 5. Description of the invention (Mother. The "cl" identified in English as depicted in the seventh figure is often the "d ,,, "Γ ^ 'letter correction table, so r ---------- · {Please read the precautions on the back before filling in this page j to identify Can be recognized ci, ^ u character is written too far, so "CC" may also be "a", "the same, u, or d in the recognition result". The third picture depicts the recognition result of the English word. In accordance with the similarity of the letters; ^ Overview flowchart, first the word sequence (step 301), and then the ^ word = post-mother = collocation to generate a single step 3 02) and multi-letter correction (step · ° column execution) One-letter correction (step early word sequence (step 304)), and select the English and Chinese early in the candidate clan candidate Wenjun ™ mountain 1 as the candidate and output it (step 3, Λ 〇〇305). In addition, in step 302 and step 03, the single-letter and multi-letter sign ^ ^ Ergu & undercorrection correction, that is, the single-letter sign and column depicted in Figure 6 and Figure 7 are used respectively. … The mother correction table is generated. The new word order is printed using the fourth figure to describe the single-letter correction process of the Consumer Cooperatives of the Central Standards Bureau of the Ministry of Economic Affairs. The single-letter correction flow 2 is used to correct the recognition results verbatim. Until the last letter is corrected (step 401). In step 402, in addition to calculating the similarity rate of each letter to determine whether the letter is likely to be misidentified, it is also checked whether the letter has a mismatch with the connected letters. Of course, if the similarity rate of a letter is higher than the tolerance limit, and if the combination of the 4 letters and the preceding and following letters conform to the grammar rules, the next letter will be corrected directly (step 406). And when the similarity rate is lower than the tolerance, the paper size applies the Chinese National Standard (CNS) A4 specification (210X297 mm). When printed by the Consumer Cooperatives of the Central Standards Bureau of the Ministry of Economic Affairs, the description of the invention (, the degree or the letters before and after will not match, will be Query—Single-letter correction table, using the similar letters of the Chengyu alphabet (step 403), and then using the similar letters to generate new words (step 404), and after removing the non-existent, existing words (step 4〇) 5), correct the next letter in the wild (step 406). The multi-letter correction process is described using the fifth figure. First, the character string to be corrected will be found in the recognition result (step, 501), and the The strings correspond to: the similar strings are replaced (step 502) ·, then some new words are generated (step 503), and then all the non-existent 504 are removed from the new words. It's called V 歹 green. I would like to list some examples to further explain the above process. Take Figure 8A as an example, it is entered "data," on the tablet, but the first word is written a bit like "d", so it is highly likely that "data," is identified as "ddt" in step 202. a ", which is a non-existent English word, so step 2 needs to be performed to correct the recognition result" ddta ". The first one-letter correction procedure performed in the fourth figure will find the second `` d, '' in step 402, and the recognition rate is low, so new words will be generated with similar letters of "d". Similar letters of "have" a ,,, "u", or "q" to generate three new words "data", "duta", and "dqta", because in the above three new words only `` data, , Is a positive word, so the correction result is "data", and the correction master "data" is output to the user, as shown in Figure 8B. This paper size applies to Chinese National Standard (CNS) A4 (210X297 mm) (Please read the precautions on the back before filling out this page)-Packing and ordering / () 033 Printed by the Staff Consumer Cooperative of the Central Standards Bureau of the Ministry of Economic Affairs, Invention Description (

再者,右於手寫板輸入如第十A ;:Uble”’但是“d”被寫的較分開,因此在…02中;= 生的辨識結果即爲“cl〇uble,,。由^“ci〇ubie,,不二 2文單詞,所以必須執行步驟2〇4的校正程序。因 平字母校正程序(即步驟3〇2)中無法獲得正確的校正姓果, ^以將以^母校正程序(㈣3G3)所獲得的校正料爲輸In addition, the input on the right of the tablet is like the tenth A;: Uble "', but" d "is written separately, so in ... 02; = The recognition result of the raw is" clOuble, ". Since ^ "ci〇ubie," the two words, you must perform the correction procedure of step 204. Because the correct alphabetic correction procedure (that is, step 30) cannot be obtained, ^ will be based on ^ The calibration material obtained by the parent calibration program (㈣3G3) is lost.

出。在多字母校正過程中,由於“ J τ·田於Cl可於第七圖中查 應的相似字母“d”、“a”、盥“u ” 一 n “ ? ’故可產生“ d 〇 u b 1 e ”、 a〇Uble”、與“uouble”三個新單詞,而在上述三個新單詞 中僅“d〇uble”爲正確的單詞,所以將輸出“d〇ubie,,爲校正 後之結果,其顯示圖形如第十B圖所示。 一明顯的,在本發明中只需對第八A圖之手寫單詞進行 三次字母替代之步驟(即利用“a,,、“u,,、或“q,,來取代“d”), 即可獲得“data”之校正結果‘。再者,本發明能於第十A圖 之手寫輸入下,仍可獲得正確的校正結果“d〇uble ”,故辨 識率能大爲提高。 综合以上所述,本發明所揭露的用以提昇單詞辨識率之 裝置與方法’係利用字形比對與相似字母取代的方式來執 行辨識與校正,當需對輸入單詞、辨識結果進行校正時,將 該輸入單詞與校正規則索引表所儲存的字形進行比對,而 每個輸入單詞所包含的字母,皆可在該校正規則索引表找 10 本紙張尺度適用中國國家標準(CNS ) A4規格(210 X 297公釐) (請先閱讀背面之注意事項再填寫本頁j -裴· 1Γ 476033 A7 B7 五、發明説明( 之 詞 單 新 成 組 所 斷 判 在卓 °ΡΓ ,選 詞候 單d 新能 成可 組最 以一 母出 字找 或再 串, 字後 代性 替確 出正 之 示 顯 並 詞 佳t 較其 之凡 明·, 發圍 本範 爲利 僅專 所申 上之 以明 發 本 已 而 例 施 實 本 離 脱 未 它 定精 限之 以示 用揭 非所 並明 發 專 請 中 之 述 下 在 含 包 應 均 飾 修 或 變 改 效 等 之 成。 完内 所圍 下範 神 ί (請先閱讀背面之注意事項再填寫本頁) 經濟部中央標準局員工消費合作社印製 本紙張尺度適用中國國家標準(CNS ) Α4規格(210X 297公釐)Out. In the process of multi-letter correction, because "J τ · Tian Yu Cl can be looked up in the seventh picture for similar letters" d "," a ", and" u "-n"? 'Therefore, three new words "d 〇ub 1 e", a〇Uble ", and" uouble "can be generated, and only" d〇uble "is the correct word among the above three new words, so" d 〇ubie, the result after correction, its display graph is shown in Figure 10B. Obviously, in the present invention, only the step of replacing the letter of the handwritten word in the eighth figure A with three letters is required (ie, using "a ,,," u ,, or "q," instead of "d"), that is, The “correction result of“ data ”can be obtained.” Furthermore, the present invention can still obtain the correct correction result “double” under the handwriting input of the tenth figure A, so the recognition rate can be greatly improved. It is described that the device and method for improving the word recognition rate disclosed in the present invention is to perform recognition and correction by means of glyph comparison and substitution of similar letters. When input words and recognition results need to be corrected, the input The words are compared with the glyphs stored in the correction rule index table, and the letters contained in each input word can be found in the correction rule index table. The paper size is applicable to the Chinese National Standard (CNS) A4 specification (210 X 297). (Mm) (Please read the notes on the back before filling in this page j -Pei · 1Γ 476033 A7 B7 V. Description of the invention (The list of words in the new group was judged in Zhuo ° PΓ), and the candidate list d group Find or re-string the word with a mother, and the descendants of the word must be positive, and the word is better. T is better than the ordinary. The actual release has not set its limits to show the use of non-existent and expressly stated in the special request, including the package should be decorated or changed, etc. Fan Shen ί (please first surround (Please read the notes on the back and fill in this page) The paper size printed by the Employees' Cooperatives of the Central Standards Bureau of the Ministry of Economic Affairs applies the Chinese National Standard (CNS) Α4 specification (210X 297 mm)

Claims (1)

476033 系年//·月/ 正 充 Α8 Β8 C8 D8 六、申請專利範圍 #Λ7νΓ;Λ·,'ν .'Γ.ν'ν 經濟部智慧財產局員工消費合作社印製 1. 一種提昇單詞辨識率之方法,該方法至少包含下列步驟·· 輸入一單詞; 對該單詞以字形比對之方式產生一辨識結果; 當該辨識結果所包含的係不存在之單詞時,利用字形比 對與相似字形取代之方式對該辨識結果進行校正,以產生一 校正結果;及 顯示該校正結杲所包含之該單詞。 2. 如申請專利範圍第1項之方法,其中上述產生該辨識結果 之步驟,係對該單詞所包含之字母,以逐字母比對的方式所 產生。 3 .如申請專利範圍第1項之方法,其中上述利用字形比對與 相似字形取代對該辨識結果進行校正之步骤至少包含下列 步驟: 利用單字母逐字取代之方式,對該辨識結果執行校正; 及 利用單字母取代多字母字串之方式,對該辨識結果執行 校正。 4.如申請專利範圍第2項之方法,其中上述利用該單字母之 逐字取代對該辨識結果執行校正之步驟,至少包含下列步 12 本紙張尺度適用中國國家標準(CNS)A4規格(210 X 297公釐) (請先閱讀背面之注意事項再填寫本頁) --------訂·-------» 476盼 修正補充 A8 B8 C8 D8 |_1驗隨禱襲 .經濟部! 申請專利範圍 當每個該i t 早予母的相似率高於一容差限肩 (Tolerance),而 今时、 ± 礅早竽母與前後連接之該單字母並無不相 符之情形時,輸\兮掀、边π田、 * ^ ^ 行該箪丰每之逐八μ識…果〈下一個該軍字母,以繼續執 Λ利#时—子取代對該辨識結果執行校正之步驟; 利用該卓丰母^ 士 ^ 個新單詞; 似子母取代該單字母,用以產生複教 除去該新單狗中所有不存在 對下一個該單/ , 卞予母進行校正。 5 ·如申請專利範園贫, 圍呆。項之方法,其中上述利用該單字母取 代該多字母字串料2、^ 甲野琢辨識結果執行校正之步骤至少包含下 列步驟: 於該辨識結果+ # % ^ ^ 來肀找尋需校正的該多字母字串; 將該多字通:全d> 予串以該單字母取代之以產生複數個新單 詞;及 ^ 除去該新單$中户斤有不存在的該單詞。 6.如申明專利範圍第5項之方法,其中上述用以取代該多字 母字串二該單字母,當該單字母被拆開成複數個部份時將成 爲該多字母字串之相似字串。 7 ·如申請專利範圍第1項之方法,其中上述產生該辨識結 13 本紙張尺度適用中國國家標準(CNS)A4規格(210 X 297公釐) — I1IIIIIIIII · 1 I I I I I I «ΙΙΙΙΙ! — — (請先閱讀背面之注意事項再填寫本頁) 476083 %年((P” it 補充丨 A8 B8 C8 D8 經濟部智慧財產局員工消費合作社印製 六、申請專利範圍 之步骤,包含當該辨識結果係存在之該單詞時,顯示該存在 單詞之步骤。 8 —種提昇單詞辨識率之方法,該方法至少包含下列步驟: 輸入一單詞; 對該單詞以字形比對之方式以產生一辨識結果; 當該辨識結果包含存在單詞時,則於顯示該存在單詞後 結束該提昇單詞辨識率之方法; 利用單字母逐字取代之方式,對該辨識結果執行校正; 利用單字母取代多字母字串之方式,對該辨識結果執行 校正;及 顯示該校正結果所包含之該單詞。 9.如申請專利範圍第8項之方法,其中上述產生該辨識結果 之步驟,係對該單詞所包含之字母,以逐字母比對的方式所 產生。 1 0 .如申請專利範圍第8項之方法,其中上述利用該單字母 之逐字取代對該辨識結果執行校正之步驟,至少包含下列步 驟: 當每個該單字母的相似率高於一容差限度 (Tolerance),而且該單字母與前後連接之該單字母並無不相 符之情形時,輸入該辨識結果之下一個該單字母,以繼續執 14 本紙張尺度適用中國國家標準(CNS)A4規格(210 X 297公釐) — III-------------I I . I — — — — — — — (請先閱讀背面之注意事項再填寫本頁) 476㈣ 修,π: "介補天 Α8 Β8 C8 D8 六、申請專利範圍 修煩 讀 夺二泛 育員 & )rfr 1年 准 行該單字母之逐字取代對該辨識結果執行校正之步驟; 利用該單字母之相似字母取代該單字母,用以產生複數 個新單詞; 除去該新單詞中所有不存在的單詞;及 對下一個該單字母進行校正。 1 1 .如申請專利範圍第8項之方法,其中上述利用該單字母 取代該多字母字串對該辨識結果執行校正之步驟至少包含 下列步骤: 於該辨識結果中找尋需校正的該多字母字串; 將該多字母字串以該單字母取代之以產生複數個新單 詞;及 除去該新單詞中所有不存在的該單詞。 經濟部智慧財產局員工消費合作社印製 12.如申請專利範圍第10項之方法,其中上述用以取代該多 字母字串之該單字母,當該單字母被拆開成複數個部份時將 成爲該多字母字串之相似字串。 1 3 . —種提昇單詞辨識率裝置,該提昇單詞辨識率裝置至少 包含: 輸入裝置,用以輸入一單詞; 辨識資訊儲存裝置,用以儲存辨識資訊與校正資訊; 處理裝置,輸入該單詞與該辨識資訊,以對該單詞進行 15 本紙張尺度適用中國國家標準(CNS)A4規格(210 X 297公釐) —^----1-------------訂 *-------- (請先閱讀背面之注意事項再填寫本頁) 、申請專利範圍 辨識以產生一辨識結果,以及利用輸入該單詞、該辨識結果 與該校正資訊以產生一校正結果;及 及 以 g又 結 識 辨 該 示 顯。 以果 用結 果正 結校 識該 辨入 該輸 入以 輸用 ,果 置結 裝正 示校 顯該 入 輸 煩請委員明示 > 年"月所提之 修正本有無變更實質内容是否准予修正。 述生 上產 中來 其式 , 方 置之 裝代 率取 識杉 辨字 詞 單 昇 提 之 項 形 3 1 字 第用 圍利 範係 利果 專結 青 i 古口識 申辨 如之 4 似 相 與 對 比 (請先閱讀背面之注意事項再填寫本頁) 裝 述生 字 似 相 與 對 比 形 字 第用 圍利 範係 利果 專結 請正 申校 如之 上產 中來 其式 , 方 置之 裝代 率取 識形 辨 詞 單 昇 提 之 項 3 11 上 中 其 置 裝 率 識 辨 詞 單 昇 提 之 項 3 i—x 第 圍 範 利 專 請 申 如 6 生 產 時 詞 單 該 之 在 存 不 含 包 果 結 識 辨 該 在 係 果 結 正 校 之。 述之 經濟部智慧財產局員工消費合作社印制农 17·如申請專利範圍第13項之提昇單詞辨識率裝置,其中上 述之提昇單詞辨識率裝置更包含一儲存裝置,用以儲存該提 昇單詞辨識率裝置於辨識過程中所產生的暫態資訊。 18·如申請專利範圍第13項之提昇單詞辨識率裝置,其中上 述之輸入裝置可為一手寫板。 16 本紙張尺度適用中國國家標準(CNS)A4規格(210 X 297公釐) 4760Ή 月〆 f修正 補充 A8 B8 C8 D8 申請專利範圍 19.如申請專利範圍第13項之提昇單詞辨識率裝置,其中上 述之輸入裝置可為一掃描器(Scanner)。 20·如申請專利範圍第13項之提昇單詞辨識率裝置,其中上 述之辨識資訊儲存裝置係由下列記憶體所組成: 唯讀記憶體(Read Only Memory,ROM); J ^ 可程式化唯讀記憶體(Programmable ROM,PROM); 可清除式可程式化唯讀記憶體(Erasable PROM, f/^EPROM); 修規 正請 内 電子可清除式可程式化唯讀記憶體(Eleetricaliy EPROM a月 EEPROM);及 IS 修所 快閃記憶體(Flash memory) 經濟部智慧財產局員工消費合作社印製 21·—種提昇單詞辨識率裝置,該提昇單詞辨識率裝置至少包 含: 輸入裝置,用以輸入一單詞; 辨識資訊儲存裝置,用以儲存辨識資訊與校正資訊; 處理裝置,輸入該單詞與該辨識資訊,利用字形比對與 相似字形取代之方式對該單詞執行辨識用以產生一辨識結 果,以及當該辨識結果包含不存在之該單詞時,輸入該單詞、 該辨識結果與該校正資訊以產生一校正結果;及 顯示裝置,輸入該辨識結果用以顯示該辨識 以及 17 本紙張尺度適用中國國家標準(CNS)A4規格(210 X 297公釐) 1 ·ϋ n n n n n n n mi · i n n n —J n n 一 K · n n n I I n n I (請先閱讀背面之注意事項再填寫本頁) 4/OUJJ476033 Department of the year // month / positive charge Α8 Β8 C8 D8 VI. Application for patent scope # Λ7νΓ; Λ ·, 'ν .'Γ.ν'ν Printed by the Consumer Cooperatives of the Intellectual Property Bureau of the Ministry of Economic Affairs 1. A way to improve the word recognition rate Method, the method includes at least the following steps: inputting a word; generating a recognition result by comparing the words with a glyph; when the recognition result contains a non-existent word, using the glyph comparison with a similar glyph Instead, the recognition result is corrected to generate a correction result; and the word contained in the correction result is displayed. 2. For the method of claim 1 in the scope of patent application, wherein the above step of generating the recognition result is generated by letter-by-letter comparison of the letters contained in the word. 3. The method according to item 1 of the scope of patent application, wherein the step of correcting the recognition result by using glyph comparison and similar glyph replacement includes at least the following steps: Performing correction on the recognition result by means of single-letter verbatim replacement. ; And using a single letter instead of a multi-letter string to perform correction on the recognition result. 4. The method according to item 2 of the scope of patent application, wherein the above-mentioned step of using the single-letter verbatim instead of performing the correction on the recognition result includes at least the following steps. 12 This paper size applies the Chinese National Standard (CNS) A4 specification (210 X 297 mm) (Please read the notes on the back before filling out this page) -------- Order · ------- »476 Hope to amend and add A8 B8 C8 D8 | .Ministry of Economic Affairs! Scope of patent application When the similarity rate of each it early mother is higher than a tolerance limit (Tolerance), and now, there is no inconsistency between ± 礅 early mother and the single letter connected to the front and back, lose \ Xi lift, edge π field, * ^ ^ perform the eighth μ consciousness every time ... If the next letter of the army, continue to perform Λ 利 # 时-子 instead of performing the step of correcting the recognition result; use this Zhuofeng mother ^ Shi ^ new words; Like the mother and mother replace the single letter, used to generate renunciation to remove all non-existent in the new single dog to correct the next single /, and give the mother to correct. 5 · If the patent applicant is poor, siege. The method of the above item, wherein the step of using the single letter to replace the multi-letter string material 2. ^ The step of performing correction by Kono's recognition result includes at least the following steps: Find the multi-character that needs to be corrected based on the recognition result + #% ^ ^ Alphabetic string; replace the multi-word: whole d > with the single letter to generate a plurality of new words; and ^ remove the new word $ 中 户 kg if the word does not exist. 6. The method of claiming item 5 of the patent, wherein the above is used to replace the multi-letter string and the single letter. When the single letter is broken into a plurality of parts, it will become a similar word of the multi-letter string. string. 7 · The method according to item 1 of the scope of patent application, in which the identification knot is generated as above 13 The paper size is applicable to the Chinese National Standard (CNS) A4 specification (210 X 297 mm) — I1IIIIIIIII · 1 IIIIII «ΙΙΙΙΙ! — — (Please Please read the notes on the back before filling this page) 476083% year ((P ”it supplement 丨 A8 B8 C8 D8 Printed by the Intellectual Property Bureau Employees Consumer Cooperatives of the Ministry of Economic Affairs 6. Printing patent scope steps, including when the identification result exists When the word is displayed, the step of presenting the word is displayed. 8 — A method for improving the recognition rate of the word, the method includes at least the following steps: input a word; compare the word with a glyph to generate a recognition result; when the When the recognition result includes a word, the method of improving the recognition rate of the word is ended after displaying the existing word; the recognition result is corrected by using a single-letter word-by-word replacement method; a single-letter word is used to replace a multi-letter string, Perform correction on the recognition result; and display the word contained in the correction result. The method of item 8, wherein the step of generating the recognition result is generated by comparing the letters contained in the word letter by letter. 1 0. The method of item 8 in the scope of patent application, wherein the above uses The one-letter verbatim replaces the step of performing correction on the recognition result, including at least the following steps: When the similarity rate of each one-letter is higher than a tolerance limit, and the one-letter is connected to the one-to-one When there is no inconsistency between the letters, enter the single letter below the recognition result to continue to execute the 14 paper standards applicable to China National Standard (CNS) A4 (210 X 297 mm) — III ----- -------- II. I — — — — — — — (Please read the notes on the back before filling this page) 476㈣ 修, π: " 介 补天 Α8 Β8 C8 D8 六 、 Scope of patent application Trouble reading and reading the second pan-educator &) rfr 1 year quasi-performing the single letter verbatim instead of performing the step of correcting the recognition result; replacing the single letter with a similar letter of the single letter to generate multiple new ones Word All non-existent words in the new word; and correction of the next single letter. 1 1. The method of claim 8 in the scope of the patent application, wherein the single letter is used instead of the multi-letter string to perform the recognition result The step of correcting includes at least the following steps: finding the multi-letter string to be corrected in the recognition result; replacing the multi-letter string with the single letter to generate a plurality of new words; and removing all non-words in the new word The word exists. Printed by the Consumer Cooperatives of the Intellectual Property Bureau of the Ministry of Economic Affairs 12. If the method of applying for item 10 of the patent scope, wherein the above is used to replace the single letter of the multi-letter string, when the single letter is broken into multiple parts Will become a similar string of the multi-letter string. 1 3. A device for improving word recognition rate, the word recognition rate raising device includes at least: an input device for inputting a word; a recognition information storage device for storing identification information and correction information; a processing device for inputting the word and The identification information is used to make 15 words on this word. Paper size is applicable to Chinese National Standard (CNS) A4 (210 X 297 mm). — ^ ---- 1 ------------- Order * -------- (Please read the notes on the back before filling this page), apply for patent range recognition to generate a recognition result, and use the input of the word, the recognition result and the correction information to generate a correction Results; and recognize the display with g. Use the results to verify the results, identify the inputs, and input the results. If the results are displayed, verify the inputs. Members are requested to indicate whether there is any change in the amendments mentioned in the year and month. The styles of the birth and the birth of the students are as follows. The Fang Zhi's generation rate is based on the recognition of Shan Shan's word recognition. Contrast (please read the precautions on the back before filling this page) Prescribing the new words as similar to the contrasting glyphs, and using the Li-Fan model for the Liguo specialization, please apply to the school for more details. Take the item of the promotion of the form of the word recognition 3 11 The item of the promotion of the form of the recognition of the word 3 3 i—x Fan Li special request for application 6 If the word list in production does not contain a package Fruiting should be identified in the department of fruiting. According to the Ministry of Economic Affairs, Intellectual Property Bureau, Employees' Cooperatives, Printed Agriculture 17. If the device for improving the word recognition rate of item 13 of the patent application scope, the above device for improving the word recognition rate further includes a storage device for storing the improved word recognition Rate information generated during the identification process. 18. If the device for improving the word recognition rate according to item 13 of the patent application, the input device described above may be a handwriting tablet. 16 This paper size applies the Chinese National Standard (CNS) A4 specification (210 X 297 mm) 4760 Ή month 修正 f amends and supplements A8 B8 C8 D8 patent application scope 19. If the word recognition device of item 13 of the patent scope is applied, where The above input device may be a scanner. 20 · The device for improving the word recognition rate according to item 13 of the scope of patent application, wherein the above-mentioned recognition information storage device is composed of the following memory: Read Only Memory (ROM); J ^ Programmable Read Only Memory (Programmable ROM, PROM); erasable programmable read-only memory (Erasable PROM, f / ^ EPROM); amendments to the internal electronic programmable erasable programmable read-only memory (Eleetricaliy EPROM a month EEPROM ); And the IS flash memory (Flash memory) printed by the Intellectual Property Bureau of the Ministry of Economic Affairs's Consumer Cooperatives 21 · —a device for improving the word recognition rate, the device for improving the word recognition rate includes at least: an input device for inputting a A word; an identification information storage device for storing identification information and correction information; a processing device for inputting the word and the identification information, performing identification on the word by means of glyph comparison and similar glyph substitution to generate a recognition result, When the recognition result includes the word that does not exist, input the word, the recognition result, and the correction information to generate a correction And a display device, inputting the recognition result to display the recognition and 17 paper sizes applicable to the Chinese National Standard (CNS) A4 specification (210 X 297 mm) 1 · nn nnnnnnn mi · innn —J nn a K · nnn II nn I (Please read the notes on the back before filling out this page) 4 / OUJJ 六、申請專利範圍 輸入該校正結果用以輸入該校正結果。 22.如申請專利範圍第21項之提昇單詞辨識率裝置,其中上 述之校正結果係利用字形比對與相似字形取代之方式來產 生。 修煩 ^ 荣望·如凊專利範圍第21項之提昇單詞辨識率裝置,其中上 賴述,提昇單詞辨識率裝置更包含一儲存裝置,用以儲存該提 If昇單詞辨識率裝置於辨識過程中所產生的暫態資訊。 24·如申請專利範圍第21項之提昇單詞辨識率裝置,其中上 述之輸入裝置可為一手寫板。 25·如申請專利範圍第2ι項之提昇單詞辨識率裝置,其中上 0之述之輸入裝置可為一掃描器(Scanner)。 26·如申請專利範圍第21項之提昇單詞辨識率裝置,其中上 述之辨識資訊儲存裝置係由下列記憶體所組成: 唯讀記憶體(Read Only Memory, ROM); 可程式化唯讀記憶體(Programmable R〇m,PROM); 可清除式可程式化唯讀記憶體(Erasable PROM; EPROM); 電子可清除式可程式化唯讀記憶體(Electrically 18 本紙張尺度適用中國國家標準(CNS)A4規格(210 X 297公釐) (請先閱讀背面之注意事項再填寫本頁)6. Scope of Patent Application Enter the correction result to input the correction result. 22. The device for improving the word recognition rate according to item 21 of the patent application, wherein the correction result is generated by means of glyph comparison and substitution with similar glyphs. Repair trouble ^ Rongwang · Rugao patent scope item 21 to improve the word recognition rate device, which is described above, the word recognition rate improvement device further includes a storage device for storing the If word promotion rate recognition device in the recognition process Transient information generated in. 24. If the device for improving the word recognition rate according to item 21 of the patent application, the above input device may be a tablet. 25. The device for improving the word recognition rate according to item 2m of the scope of patent application, wherein the input device described in the above 0 may be a scanner. 26. If the device for increasing the word recognition rate of item 21 in the scope of patent application, the above-mentioned recognition information storage device is composed of the following memory: Read Only Memory (ROM); Programmable Read Only Memory (Programmable ROM, PROM); Erasable PROM; EPROM; Electronically Programmable Read-Only Memory (Electrically 18) This paper standard applies to Chinese National Standards (CNS) A4 size (210 X 297 mm) (Please read the notes on the back before filling this page) 經濟部智慧財產局員工消費合作社印製 D8 經濟部智慧財產局員工消費合作社印製 猶无丨 、申請專利範圍 EPROM, EEPROM);及 快閃記憶體(Flash memory) 煩請委員明示 >年"月2Z&:听提之 修正本有無變更實質内容是否准予修正。 19 n HI n n n n vn n n d · m n tfa · (請先閱讀背面之注意事項再填寫本頁) 1T—— 聲 本紙張尺度適用中國國家標準(CNS)A4規格(210 X 297公釐)Printed by the Consumer Cooperative of the Intellectual Property Bureau of the Ministry of Economic Affairs D8 Printed by the Consumer Cooperative of the Intellectual Property Bureau of the Ministry of Economic Affairs has never been printed, patent application scope EPROM, EEPROM); and flash memory (Flash memory) Members are requested to indicate > year " Month 2Z &: Whether there is any change to the substance of the amendment, whether the amendment is allowed to be amended. 19 n HI n n n n vn n n d · m n tfa · (Please read the notes on the back before filling out this page) 1T—— Sound This paper size is applicable to China National Standard (CNS) A4 (210 X 297 mm)
TW086117317A 1997-11-19 1997-11-19 System and method for increasing word recognition rate TW476033B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
TW086117317A TW476033B (en) 1997-11-19 1997-11-19 System and method for increasing word recognition rate

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
TW086117317A TW476033B (en) 1997-11-19 1997-11-19 System and method for increasing word recognition rate

Publications (1)

Publication Number Publication Date
TW476033B true TW476033B (en) 2002-02-11

Family

ID=21627258

Family Applications (1)

Application Number Title Priority Date Filing Date
TW086117317A TW476033B (en) 1997-11-19 1997-11-19 System and method for increasing word recognition rate

Country Status (1)

Country Link
TW (1) TW476033B (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8583440B2 (en) 2002-06-20 2013-11-12 Tegic Communications, Inc. Apparatus and method for providing visual indication of character ambiguity during text entry
US8606582B2 (en) 2004-06-02 2013-12-10 Tegic Communications, Inc. Multimodal disambiguation of speech recognition
US8782568B2 (en) 1999-12-03 2014-07-15 Nuance Communications, Inc. Explicit character filtering of ambiguous text entry
US8938688B2 (en) 1998-12-04 2015-01-20 Nuance Communications, Inc. Contextual prediction of user words and user actions
TWI635406B (en) * 2016-11-25 2018-09-11 英業達股份有限公司 Method for string recognition and machine learning

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8938688B2 (en) 1998-12-04 2015-01-20 Nuance Communications, Inc. Contextual prediction of user words and user actions
US9626355B2 (en) 1998-12-04 2017-04-18 Nuance Communications, Inc. Contextual prediction of user words and user actions
US8782568B2 (en) 1999-12-03 2014-07-15 Nuance Communications, Inc. Explicit character filtering of ambiguous text entry
US8972905B2 (en) 1999-12-03 2015-03-03 Nuance Communications, Inc. Explicit character filtering of ambiguous text entry
US8990738B2 (en) 1999-12-03 2015-03-24 Nuance Communications, Inc. Explicit character filtering of ambiguous text entry
US8583440B2 (en) 2002-06-20 2013-11-12 Tegic Communications, Inc. Apparatus and method for providing visual indication of character ambiguity during text entry
US8606582B2 (en) 2004-06-02 2013-12-10 Tegic Communications, Inc. Multimodal disambiguation of speech recognition
US9786273B2 (en) 2004-06-02 2017-10-10 Nuance Communications, Inc. Multimodal disambiguation of speech recognition
TWI635406B (en) * 2016-11-25 2018-09-11 英業達股份有限公司 Method for string recognition and machine learning

Similar Documents

Publication Publication Date Title
CN102298582B (en) Data search and matching process and system
US8019769B2 (en) System and method for determining valid citation patterns in electronic documents
US6721451B1 (en) Apparatus and method for reading a document image
US10963717B1 (en) Auto-correction of pattern defined strings
JP2005173730A (en) Business form ocr program, method, and device
US8725497B2 (en) System and method for detecting and correcting mismatched Chinese character
TW476033B (en) System and method for increasing word recognition rate
CN114494679A (en) Double-layer PDF generation and correction method and device
KR100757372B1 (en) Database system and its handling method for ideogram
US20220138416A1 (en) Dictionary editing apparatus, dictionary editing method, and recording medium recording thereon dictionary editing program
CN112364632B (en) Book checking method and device
TW420774B (en) Method and apparatus for automatically correcting documents in chinese language
JP5550959B2 (en) Document processing system and program
CN112149402B (en) Document matching method, device, electronic equipment and computer readable storage medium
JP7315420B2 (en) How to adapt and modify text
JPH08263587A (en) Method and device for document input
CN113868411A (en) Contract comparison method and device, storage medium and computer equipment
JP7293693B2 (en) Information processing device and program
JP3958722B2 (en) Image data document retrieval system
US20230099764A1 (en) Information processing apparatus, information processing method, and non-transitory computer readable medium
JP6325218B2 (en) Character recognition result verification device and character reading system
Vo-Nguyen et al. An Efficient Method to Extract Data from Bank Statements Based on Image-Based Table Detection
JP3663865B2 (en) Form processing apparatus and storage medium
JP7430219B2 (en) Document information structuring device, document information structuring method and program
WO2023047570A1 (en) Information processing device, information processing method, and information processing program

Legal Events

Date Code Title Description
GD4A Issue of patent certificate for granted invention patent