TWI239200B - System for capturing and identifying image, method for identifying images and method for acquiring additional remarks about test paper


Publication number
TWI239200B
Authority
TW
Taiwan
Prior art keywords
image
patent application
scope
item
document
Prior art date
Application number
TW93100029A
Other languages
Chinese (zh)
Other versions
TW200524399A (en)
Inventor
Jang-Ping Sheu
Chih-Yung Chang
Gwo-Jong Yu
Kuei-Ping Shih
Original Assignee
Univ Nat Central
Application filed by Univ Nat Central
Priority to TW93100029A
Publication of TW200524399A
Application granted
Publication of TWI239200B

Landscapes

  • Character Discrimination (AREA)
  • Character Input (AREA)

Abstract

A system for capturing and identifying an image includes an image-capturing unit and an image-identifying unit. The image-capturing unit captures a single image in a single shot. The image-identifying unit is electrically connected to the image-capturing unit; it identifies the single image and, based on the result, provides the related learning-assistance information the reader wants. Identifying the captured image includes estimating the parameters of a lighting surface and removing the lighting effect, segmenting characters, normalizing characters, and comparing features. The system can also be used to obtain information about a question on a test paper. Each question on the test paper carries a mark-code image. Once captured, the mark-code image is identified to obtain the mark-code characters, and related information is provided on the basis of those characters.

Description

1239200, Case No. 93100029 (amended)
V. Description of the Invention

[Technical Field of the Invention]

The present invention relates to a document image capture and recognition system and a recognition method, and more particularly to a system and method that can capture a single image of a document and recognize that single image.

[Prior Art]

Learners past and present still mostly read physical printed books, because a printed book is tangible and easier on the eyes. Reading a printed book suits people's natural reading habits and is conducive to thinking. Printed books have many advantages but also one serious drawback: they are static. Their content is fixed once it leaves the printing plant and nothing can be added, so when a learner fails to understand a passage, the book cannot offer any further explanation on the spot.

Because printed books are limited in length, they cannot quickly give readers related information to learn from. Readers are confined to the text on the page, which tends to produce poor learning results and an understanding that lacks depth and breadth. Many manufacturers have therefore developed electronic learning products, but neither their functions nor their ease of use are satisfactory. The products of several manufacturers, and their shortcomings, are discussed next.

I. The EBOOK 02 of Taiwan's Ji Ding Technology (極鼎科技) and the LeapPad e-book product of Baoer Technology (包爾科技) work as follows. A card printed with text and pictures, or a thin printed book, is placed on a plastic hardware platform of the same size as the book. The user presses a stylus on the text or pictures on the card, so that

the plastic hardware platform detects the position of the pen tip by pressure sensing (or by electromagnetic change) and plays the corresponding spoken explanation. The drawbacks are:

1. The plastic hardware platform must be carried along in order to read, which is inconvenient.
2. The book must lie flat on a table to be operated; it cannot be picked up and read comfortably.
3. Book production is heavily constrained. The length and width of the book are fixed to match the plastic hardware platform, so the platform works only with specific books.
4. These products can only play compressed audio. After decompression the sound quality is poor, and the storage capacity is so small that no multimedia effects can be offered. They are generally suitable only for young children.

II. With a bar code reader pen, bar codes are printed in a printed book. The user scans a bar code with the pen, the storage location of the corresponding supplementary teaching material is decoded from the bar code, and the material is played. The drawbacks are:

1. The learner must use the wrist to sweep in a consistently horizontal direction and at a steady speed to read the whole bar code correctly. To raise the success rate, these products prescribe how the user must hold and move the pen. Poor pen handling causes frequent errors and wrong messages, so the user has to rescan many times, which is frustrating and far from ideal.
2. The book must be printed with many large, long rectangular bar codes. The bar code symbols mean nothing to the user, spoil the visual design and feel of the page, and cause discomfort. They are only

suitable for young children's books with few annotations. For books beyond that level, which carry more annotations, it is hard to print many rectangular bar codes in the narrow space between the lines of text to give learners the notes they need.
3. These products usually require the book to lie flat on a table during scanning to work well; the book cannot be picked up and read comfortably.

III. The C-Pen optical scanning pen of Sweden's C Technologies scans text printed in a printed book. It photographs the text passing under the pen several dozen times per second as the user moves the pen, recognizes the scanned text from the resulting sequence of images, and sends the text to a computer for uses such as looking up new words, document editing, and form filling. The drawbacks are:

1. The user must hold the pen upright with its body pressed flat against the book (the square pen tip must not leave the paper) and use the wrist to sweep in a consistently horizontal direction and at a steady speed in order to scan a whole line correctly. To raise the success rate, the C-Pen prescribes a particular pen movement and posture. Poor pen handling causes frequent errors and wrong messages, so the user has to rescan many times, which is frustrating and far from ideal.
2. Because the user must align the square pen tip vertically with the first character of the passage before sweeping, the C-Pen is designed mainly for scanning a whole line as one long string (sparing the user from typing it in character by character). It is not suited to picking out a single character or word from dense book text.

IV. The Meng Tian scanning translation pen of Taiwan's Meng Tian Technology (蒙恬科技) and the Super Pen of WizCom

Technologies both sense the image with the CIS linear roller-type optical element of an ordinary desktop scanner. An optical scanning pen scans the text printed in a printed book, acquires a series of line images during the sweep, combines them, recognizes the scanned text, and sends it to a computer for looking up new words, document editing, and form filling. They are operated in the same way as the C-Pen of Sweden's C Technologies and share similar drawbacks and inconveniences.

V. The ChatPen developed by Sweden's Ericsson requires special paper printed with a proprietary dot-matrix pattern. It captures and recognizes what the user writes by hand directly on this paper and sends it to a computer. Because the ChatPen works only on the dedicated paper, it adds considerable inconvenience.

In summary, the pen-type input scanners used by the learning-assistance systems now on the market achieve a high recognition rate only under good lighting and with no character rotation or character skew. The capture posture therefore strongly affects the recognition rate, and users of these products face many inconveniences.

[Summary of the Invention]

In view of this, one object of the present invention is to provide a document image capture and recognition method that uses a digital camera to capture the image of a document in a single shot. The user does not need to sweep a pen across the page, which simplifies operation and raises the chance of successful recognition. It is worth noting that the

digital camera optical pen used by the present invention does not require the book to lie flat on a table when an image is captured. The user can lean back on a sofa, hold the book, read comfortably, and use the digital camera optical pen to capture text, numbers, or codes in the book, which makes operation very convenient.

Another object of the present invention is to provide a document image capture and recognition method whose digital camera optical pen can be combined with Bluetooth wireless technology. Users can then sit or lie anywhere at home to read and obtain rich, high-quality electronic media explanations at any time, turning the reading of a printed book into a refined pleasure.

Another object of the present invention is to provide a document image capture and recognition method whose digital camera optical pen can be used with a computer. The pen captures an image of an annotation code printed in a book, and the image is analyzed and compared to recognize the information it contains (for example a page code and/or a sentence code). A database device is then searched for the related file corresponding to this information, which can be put to other uses, or for the supplementary teaching material file corresponding to this information, which is played back. This helps the user understand the content of the printed book at the place marked by the annotation code more effectively and so serves the purpose of assisted learning.

A further object of the present invention is to provide a new kind of personal exam-practice environment. The document image capture and recognition method of the present invention and its digital camera optical pen can be combined with various computer learning aids and applied to specially designed paper test sheets, so that a student's answers on paper interact with the multimedia teaching material and mock questions on the learning aid, so as to achieve a personalized exam-practice

environment.

This application is designed mainly as a mechanism that lets students obtain related information in real time while taking a test. Traditional paper test questions are limited in length and cannot offer the test taker related information to learn from. During a test, students are confined to the text on the paper, which tends to produce poor learning results, an understanding of the problems that lacks depth and breadth, and gaps in what is learned. A traditional paper test also cannot automatically record the learner's answers in real time and analyze them in order to give the learner appropriate further help.

The system design of this new personal exam-practice environment is therefore proposed. Its main technical features are:

1. It combines the advantages of paper test questions and electronic files, offering a richness of information that paper test questions alone cannot provide.
2. The new kind of test is dynamic. The parts of the questions that have been practiced too little or are not well understood can be explored in more depth, and the next, personalized test sheet is prepared according to how the test taker answered.
3. Because the new test application retrieves the information related to a question by linking the digital camera optical pen to a database, the database will be built from many sources.
4. Teachers and parents can also use this application to obtain the learner's answers and their analysis on the learning aid, in order to give the learner appropriate further help.

To achieve the above objects, the present invention proposes a document image capture and recognition system suitable for capturing and recognizing an image of a document. The document image capture and recognition

system includes at least an image capture unit and an image recognition unit. The image capture unit captures a single image of the document. The image recognition unit is electrically connected to the image capture unit, recognizes the single image, and can provide related information according to the recognition result.

According to a preferred embodiment of the present invention, the data of the single image can be sent to a host computer by, for example, wireless or wired communication. The image capture unit is, for example, a digital camera, and the document is, for example, a test paper, a book, print media, or a text-and-graphics image on the screen of an information device.

The image recognition method of the present invention includes the following steps:

1. Estimate the lighting-surface parameters of the captured image and remove the lighting effect, to obtain a corrected image.
2. Segment characters in the corrected image, to obtain a character image.
3. Normalize the character image, to obtain a normalized character image.
4. Perform feature comparison on the normalized character image.

When the captured image contains recognizable characters, the normalized character image is additionally linearly transformed by linear discriminant analysis before feature comparison. When the captured image contains special symbols, the normalized character image is additionally converted into a feature code before feature comparison; these special symbols conform, for example, to the definition of the Hadamard code.

The step of estimating the lighting-surface parameters and removing the lighting effect includes:

1. Compute the six surface parameters of a quadric surface by minimum mean square error.
2. Reconstruct a standard quadric surface from these parameters.
3. Subtract the standard quadric surface from the image, to obtain a corrected image.
4. Raise the contrast of the corrected image with an equalization technique.
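
The four lighting-correction steps above can be sketched as follows. This is a minimal illustration under stated assumptions, not the patent's implementation: the surface is parameterized as z(x, y) = a·x² + b·y² + c·x·y + d·x + e·y + f, the image is a list of rows of grey levels, the function names are hypothetical, and a plain min-max contrast stretch stands in for the unspecified equalization technique.

```python
def basis(x, y):
    """The six quadric terms for pixel (x, y): x^2, y^2, xy, x, y, 1."""
    return [x * x, y * y, x * y, x, y, 1.0]


def solve(a, b):
    """Solve the linear system a * p = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    p = [0.0] * n
    for i in reversed(range(n)):
        s = sum(m[i][j] * p[j] for j in range(i + 1, n))
        p[i] = (m[i][n] - s) / m[i][i]
    return p


def fit_quadric(image):
    """Step 1: least-squares estimate of the six surface parameters.

    Accumulates the normal equations over every pixel; the image must be
    at least 3x3 pixels for the system to be solvable.
    """
    ata = [[0.0] * 6 for _ in range(6)]
    atb = [0.0] * 6
    for y, row in enumerate(image):
        for x, value in enumerate(row):
            phi = basis(float(x), float(y))
            for i in range(6):
                atb[i] += phi[i] * value
                for j in range(6):
                    ata[i][j] += phi[i] * phi[j]
    return solve(ata, atb)


def remove_lighting(image):
    """Steps 2-4: rebuild the surface, subtract it, then stretch the contrast."""
    params = fit_quadric(image)
    residual = [
        [value - sum(p * t for p, t in zip(params, basis(float(x), float(y))))
         for x, value in enumerate(row)]
        for y, row in enumerate(image)
    ]
    lo = min(min(row) for row in residual)
    hi = max(max(row) for row in residual)
    span = (hi - lo) or 1.0
    # Assumption: min-max stretch to 0..255 in place of the patent's equalization.
    return [[255.0 * (v - lo) / span for v in row] for row in residual]
```

Because the fit runs over every pixel, dark ink pulls the estimated surface down slightly; a more careful version might fit only pixels classified as background, which the source does not discuss.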

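Character normalization, step 3 of the method above, is broken down in the passage that follows into a covariance matrix, its eigen-decomposition, a coordinate transform, and a rotation found by complex moments. A minimal sketch of that idea on a set of foreground pixel coordinates is given here. Several choices are assumptions rather than the patent's stated method: the sketch divides by the square root of each eigenvalue (the usual whitening choice, whereas the source says the eigenvectors are divided by the eigenvalues), it uses the third-order complex moment Σ z·|z|² to fix the rotation, it omits resampling back into a pixel grid, and all function names are hypothetical.

```python
import cmath
import math


def whiten(points):
    """Centre the points and rescale them along the covariance eigenvectors.

    The result has zero mean and an identity covariance matrix. The points
    must not all lie on one line, otherwise an eigenvalue is zero.
    """
    n = len(points)
    mx = sum(x for x, _ in points) / n
    my = sum(y for _, y in points) / n
    centred = [(x - mx, y - my) for x, y in points]
    sxx = sum(x * x for x, _ in centred) / n
    syy = sum(y * y for _, y in centred) / n
    sxy = sum(x * y for x, y in centred) / n
    # Closed-form eigen-decomposition of the symmetric 2x2 covariance matrix:
    # (c, s) is the first eigenvector, (-s, c) the second.
    theta = 0.5 * math.atan2(2.0 * sxy, sxx - syy)
    c, s = math.cos(theta), math.sin(theta)
    lam1 = sxx * c * c + 2.0 * sxy * c * s + syy * s * s
    lam2 = sxx * s * s - 2.0 * sxy * c * s + syy * c * c
    return [((c * x + s * y) / math.sqrt(lam1),
             (-s * x + c * y) / math.sqrt(lam2)) for x, y in centred]


def rotate_to_standard(points):
    """Rotate so the complex moment sum(z * |z|^2) lies on the positive real axis."""
    c21 = sum(complex(x, y) * (x * x + y * y) for x, y in points)
    angle = cmath.phase(c21)
    c, s = math.cos(-angle), math.sin(-angle)
    return [(c * x - s * y, s * x + c * y) for x, y in points]


def normalize(points):
    """Whitening followed by rotation to the standard angle."""
    return rotate_to_standard(whiten(points))
```

Under these assumptions, a shape and a translated, uniformly scaled, rotated copy of it map to the same normalized coordinates, provided the shape is asymmetric enough for the complex moment to be non-zero.
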
The step of normalizing the character image to obtain the normalized character image includes:

1. Compute a covariance matrix from the coordinates of each point in the image.
2. Compute the eigenvalues and the original eigenvectors of the covariance matrix.
3. Divide each eigenvector by its corresponding eigenvalue, to obtain the normalized eigenvectors.
4. Transform the coordinates of the image with the normalized eigenvectors, to form a characterized image.
5. Compute a rotation angle by the complex-moment method.
6. Build a rotation matrix from the rotation angle and use it to rotate the characterized image to a standard angle, to obtain the normalized character image.

To make the above objects, features, and advantages of the present invention easier to understand, a preferred embodiment is described in detail below with reference to the accompanying drawings.

[Embodiments]

FIG. 1A and FIG. 1B are schematic diagrams of a document image capture and recognition system according to a preferred embodiment of the present invention. Referring to FIG. 1A and FIG. 1B, the present invention proposes a document image capture and recognition system whose digital camera optical pen 110 can be used with, for example, a host computer 120 to assist learning from printed books. The digital camera optical pen 110 includes an image capture unit 112 and an image recognition unit 114, as shown in FIG. 1A; alternatively, the image recognition unit 114 can be installed in the host computer 120, as shown in FIG. 1B. The image capture unit 112 captures a still image of an annotation code printed on a document 130, where the document 130 is, for example, a test paper, a book, print media, or a text-and-graphics image on the screen of an information device. The image recognition unit 114 is electrically connected to the image capture unit 112 and performs an image-processing procedure,

a text-and-graphics analysis procedure, and a character recognition procedure.

Referring to FIG. 1A, the image capture unit 112 is, for example, a digital camera that first captures a page-code image in a single shot. After receiving the page-code image, the image recognition unit 114 inside the digital camera optical pen 110 recognizes it and can output the page-code characters to the host computer 120 by wireless or wired transmission. The image capture unit 112 then captures a sentence-code image, again in a single shot. After receiving the sentence-code image, the image recognition unit 114 recognizes it and can output the sentence-code characters to the host computer 120 by wireless or wired transmission.

Alternatively, referring to FIG. 1B, after the image capture unit 112 captures the page-code image in a single shot, it sends the page-code image to the host computer 120 by wireless or wired transmission. The image recognition unit 114 located in the host computer 120 receives the page-code image, recognizes it, and outputs the page-code characters. The image capture unit 112 then captures a sentence-code image in a single shot and sends it to the host computer 120 by wireless or wired transmission. The image recognition unit 114 in the host computer 120 receives the sentence-code image, recognizes it, and outputs the sentence-code characters.

Based on the page-code characters and the sentence-code characters, the corresponding supplementary teaching material file can be retrieved from a database of the host computer 120 and played back, helping the user understand the content of the printed book at the marked place more effectively and so serving the purpose of assisted learning.
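
The method summary earlier notes that the special symbols may conform to the Hadamard code, and the description goes on to state that any two codewords of length n are n/2 positions apart. The following minimal sketch illustrates that property and how it supports nearest-codeword matching. It assumes the Sylvester construction (n a power of two), which the source does not specify; the function names are hypothetical.

```python
def hadamard(n):
    """Rows of an n x n Sylvester Hadamard matrix with entries +1 / -1.

    Assumption: n is a power of two. Each row serves as one codeword.
    """
    if n < 1 or n & (n - 1):
        raise ValueError("n must be a power of two")
    rows = [[1]]
    while len(rows) < n:
        # H_2k = [[H_k, H_k], [H_k, -H_k]]
        rows = [r + r for r in rows] + [r + [-v for v in r] for r in rows]
    return rows


def hamming(a, b):
    """Number of positions in which two codewords differ."""
    return sum(1 for u, v in zip(a, b) if u != v)


def nearest_code(observed, codebook):
    """Index of the codeword closest to the observed symbol in Hamming distance."""
    return min(range(len(codebook)), key=lambda i: hamming(observed, codebook[i]))


def as_pattern(code, width):
    """Lay a codeword out as a two-dimensional pattern, `width` cells per row."""
    return [code[i:i + width] for i in range(0, len(code), width)]
```

With n = 16, every pair of codewords differs in 8 cells, so a captured 4x4 symbol with up to 3 misread cells is still closer to its own codeword than to any other.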

As described above, the present invention proposes a document image capture and recognition method. A document containing at least one image is provided, a digital camera captures the image of the document in a single shot, and the image is then recognized.

The captured image may consist of recognizable characters, such as Arabic numerals, English letters, punctuation marks, mathematical operators, icons, geometric figures (for example squares, rectangles, or triangles), or Chinese characters. In a conventional recognizable character set, however, the characters do not differ enough from one another, so recognition may fail because characters are too similar. The recognition rate can be raised by capturing images of special symbols that differ strongly from one another, for example symbols that conform to the definition of the Hadamard code. The Hadamard code is a kind of error-correcting code. For a code of length n, the Hamming distance between any two codewords is n/2, which means that the difference between any two codewords is maximal. Patterns that differ strongly reduce the chance of misjudgment, and Hadamard codes can be represented as two-dimensional patterns to raise the recognition rate of the system. For details of the Hadamard code, see Gwo-Jong Yu, Chun-Shien Lu, Hong-Yuan Mark Liao, "A Message-based cocktail watermarking system", in Pattern Recognition, vol. 36, pp. 957-968, 2003. All content disclosed in that publication may serve as reference material for this case. For an example of the correspondence between the recognizable character set and the special-symbol character set, see FIG. 2.

FIG. 3 is a flowchart of image capture and recognition for recognizable characters according to the present invention. After a digital camera captures, in a single shot,

the recognizable image of the document (S201), the lighting-surface parameters of the image are estimated and the lighting effect is removed, to obtain a corrected image (S202). Characters are then segmented in the corrected image, to obtain a character image (S203). The character image is normalized, to obtain a normalized character image (S204). The normalized character image is linearly transformed by linear discriminant analysis (S205). Finally, feature comparison is performed on the linearly transformed normalized character image (S206).

FIG. 4 is a flowchart of image capture and recognition for special-symbol characters according to the present invention. After a digital camera captures the special-symbol image of the document in a single shot (S301), the lighting-surface parameters of the image are estimated and the lighting effect is removed, to obtain a corrected image (S302). Characters are then segmented in the corrected image, to obtain a character image (S303). The character image is normalized, to obtain a normalized character image (S304). The normalized character image is converted into a feature code (S305). Finally, feature comparison is performed on the normalized character image that has been converted into a feature code (S306).

In both of the above flows, for recognizable characters and for special-symbol characters, the lighting-surface parameters of the image are estimated and the lighting effect is removed. Because the lighting is not strictly controlled during capture, the text and the background may lack sufficient contrast and cannot be segmented correctly. If the effect of differences in the light source


Correcting the effect caused by differences in the light source helps the text to be segmented correctly. Under a point light source, the brightness variation projected onto a plane is a quadric surface, which can be described by six parameters. The six surface parameters that minimize the difference between the surface model and the image are estimated by least mean square error, and the influence of the quadric surface is then subtracted from the image to yield an image that is less affected by lighting.

Please refer to FIG. 5, which shows a flowchart of light-surface parameter estimation and light-effect removal performed on an image according to the present invention. First, the six surface parameters of the quadric surface are computed from the input image by least mean square error (S401). Next, a standard quadric surface is reconstructed from these surface parameters (S402). Next, the standard quadric surface is subtracted from the image to obtain a corrected image (S403). Finally, an equalization technique is used to raise the contrast of the corrected image (S404). In this way, an image that is less affected by the light source is obtained.
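The fit in steps S401 to S403 can be sketched as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the text says only that the quadric surface has six parameters, so the explicit form z = a*x^2 + b*y^2 + c*x*y + d*x + e*y + f, the function names, and the choice of normal equations solved by Gaussian elimination are all assumptions made here. The equalization of step S404 is not shown.

```python
def solve_linear(a, b):
    """Solve a*x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]  # fails if the system is singular
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for i in range(n - 1, -1, -1):
        tail = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - tail) / m[i][i]
    return x


def fit_quadric(image):
    """S401: least-squares estimate of (a, b, c, d, e, f) for the assumed
    model z = a*x^2 + b*y^2 + c*x*y + d*x + e*y + f."""
    n = 6
    ata = [[0.0] * n for _ in range(n)]  # normal-equation matrix A^T A
    atz = [0.0] * n                      # right-hand side A^T z
    for y, row in enumerate(image):
        for x, z in enumerate(row):
            basis = (x * x, y * y, x * y, x, y, 1.0)
            for i in range(n):
                atz[i] += basis[i] * z
                for j in range(n):
                    ata[i][j] += basis[i] * basis[j]
    return solve_linear(ata, atz)


def quadric_surface(params, width, height):
    """S402: rebuild the fitted surface on the pixel grid."""
    a, b, c, d, e, f = params
    return [[a * x * x + b * y * y + c * x * y + d * x + e * y + f
             for x in range(width)] for y in range(height)]


def remove_lighting(image):
    """S403: subtract the fitted surface from the image."""
    height, width = len(image), len(image[0])
    params = fit_quadric(image)
    surface = quadric_surface(params, width, height)
    corrected = [[image[y][x] - surface[y][x] for x in range(width)]
                 for y in range(height)]
    return params, corrected
```

An image smaller than 3 x 3 pixels does not determine all six parameters, so the solver would fail on it; real captures are far larger than that.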
In both of the above capture-and-recognition processes, for recognizable characters and for special-symbol characters, a character normalization step is performed. The reason is that the distance between the digital-camera optical pen and the paper, the angle and tilt at which the image is captured, and differences in the size of the characters on the paper all cause the captured characters to be rotated or scaled. Character normalization converts the characters to the same orientation, size, and aspect ratio, which helps raise the recognition rate.

Please refer to FIG. 6, which shows a flowchart of the character normalization steps according to the present invention.

The steps of performing character normalization on the character image to obtain the normalized character image include:
1. Computing a covariance matrix from the coordinates of each point in the image (S501).
2. Computing the eigenvalues and the original eigenvectors of the covariance matrix (S502).
3. Dividing each eigenvector by its corresponding eigenvalue to obtain the normalized eigenvectors (S503).
4. Applying a coordinate transformation to the image with the normalized eigenvectors to form a characterized image (S504).
5. Computing a rotation angle by the complex moments method (S505).
6. Generating a rotation matrix from the rotation angle and using it to rotate the characterized image to a standard angle, to obtain the normalized character image (S506).
7. Finally, adjusting the character image to a standard size and a standard rotation angle (S507).

For details of character normalization, see Soo-Chang Pei and Chao-Nan Lin, "Image normalization for pattern recognition," Image and Vision Computing, vol. 13, no. 10, pp. 711-723, 1995; and Dinggang Shen and Horace H. S. Ip, "Generalized affine invariant image normalization," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 19, no. 5, May 1997.
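Steps S501 to S504 amount to whitening the pixel coordinates of a character. The sketch below is a minimal illustration under stated assumptions: the foreground pixels are given as (x, y) pairs, the function names are hypothetical, and each principal axis is scaled by the square root of its eigenvalue, which is the usual whitening choice because it gives unit variance along both axes, whereas the text above states the division in terms of the eigenvalue itself. The rotation and resizing of steps S505 to S507 are not shown.

```python
import math


def covariance(points):
    """S501: mean and covariance terms of the point coordinates."""
    n = len(points)
    mx = sum(x for x, _ in points) / n
    my = sum(y for _, y in points) / n
    sxx = sum((x - mx) ** 2 for x, _ in points) / n
    syy = sum((y - my) ** 2 for _, y in points) / n
    sxy = sum((x - mx) * (y - my) for x, y in points) / n
    return (mx, my), (sxx, sxy, syy)


def eigen_2x2(sxx, sxy, syy):
    """S502: closed-form eigenvalues and eigenvectors of a symmetric
    2x2 matrix [[sxx, sxy], [sxy, syy]]."""
    mean = (sxx + syy) / 2.0
    spread = math.hypot((sxx - syy) / 2.0, sxy)
    theta = 0.5 * math.atan2(2.0 * sxy, sxx - syy)  # principal-axis angle
    e1 = (math.cos(theta), math.sin(theta))
    e2 = (-math.sin(theta), math.cos(theta))
    return (mean + spread, mean - spread), (e1, e2)


def compact(points):
    """S503 and S504: project the centered points onto the eigenvectors
    and scale each axis to unit variance (assumed sqrt scaling)."""
    (mx, my), (sxx, sxy, syy) = covariance(points)
    (l1, l2), (e1, e2) = eigen_2x2(sxx, sxy, syy)
    s1, s2 = 1.0 / math.sqrt(l1), 1.0 / math.sqrt(l2)  # needs l2 > 0
    out = []
    for x, y in points:
        dx, dy = x - mx, y - my
        out.append(((dx * e1[0] + dy * e1[1]) * s1,
                    (dx * e2[0] + dy * e2[1]) * s2))
    return out
```

After this transformation the point set has zero mean and an identity covariance matrix, so differences in stretch and shear between captures are removed and only a rotation remains to be resolved by the later steps.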

All of the content disclosed in the two references cited above for character normalization may serve as reference material for this application.

In the above capture-and-recognition process for recognizable characters, a linear transformation is performed by linear discriminant analysis. Applying this linear transformation to the feature vectors makes the feature differences between samples of the same letter smaller and the feature differences between different letters larger, which raises the recognition rate of the system.
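The effect described above can be illustrated with the textbook two-class Fisher discriminant. This is a simplified sketch, not the LDA variant that the patent relies on: it assumes two-dimensional feature vectors and exactly two classes, the function names are hypothetical, and the projection direction is taken as the inverse within-class scatter matrix applied to the difference of the class means.

```python
def mean(vectors):
    """Component-wise mean of a list of 2-D vectors."""
    n = len(vectors)
    return [sum(v[i] for v in vectors) / n for i in range(2)]


def scatter(vectors, m):
    """Scatter matrix: sum of outer products of deviations from the mean."""
    s = [[0.0, 0.0], [0.0, 0.0]]
    for v in vectors:
        d = [v[0] - m[0], v[1] - m[1]]
        for i in range(2):
            for j in range(2):
                s[i][j] += d[i] * d[j]
    return s


def fisher_direction(class_a, class_b):
    """Direction w = Sw^-1 (mean_b - mean_a), where Sw is the pooled
    within-class scatter. Sw must be invertible."""
    ma, mb = mean(class_a), mean(class_b)
    sa, sb = scatter(class_a, ma), scatter(class_b, mb)
    sw = [[sa[i][j] + sb[i][j] for j in range(2)] for i in range(2)]
    det = sw[0][0] * sw[1][1] - sw[0][1] * sw[1][0]
    inv = [[sw[1][1] / det, -sw[0][1] / det],
           [-sw[1][0] / det, sw[0][0] / det]]
    diff = [mb[0] - ma[0], mb[1] - ma[1]]
    return [inv[0][0] * diff[0] + inv[0][1] * diff[1],
            inv[1][0] * diff[0] + inv[1][1] * diff[1]]


def project(vectors, w):
    """Linear transformation of each feature vector onto direction w."""
    return [v[0] * w[0] + v[1] * w[1] for v in vectors]
```

For two classes that vary widely along one feature but differ only slightly along the other, the direction returned by fisher_direction weights the second feature heavily, so the projected values of each class cluster tightly while the two clusters move apart.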

For details of the linear transformation step using linear discriminant analysis, see Li-Fen Chen, Hong-Yuan Mark Liao, Ming-Tat Ko, Ja-Chen Lin, and Gwo-Jong Yu, "A new LDA-based face recognition system which can solve the small sample size problem," Pattern Recognition, vol. 33, pp. 1713-1726, 2000.

As for applications of the present invention, the document image capture and recognition system described above can, for example, be used together with test papers, so that students can easily obtain information related to the test questions, increasing the depth and breadth of their learning. With an ordinary test paper, students learn only the answer to a question. If they want a more detailed explanation, they must spend considerable extra time searching the relevant reference books, which lowers their learning efficiency.

To explain more clearly how the document image capture and recognition system of the present invention is used as learning-assistance material for test papers, refer to FIG. 7, which shows a flowchart of using the system to obtain test-paper information according to the present invention. First, a test paper is provided (S601). When a student meets a question that he or she cannot answer, the student aims the optical pen at the question number and/or the mark code of an option and captures a question-number image and/or an option image in a single shot with the digital camera (S602). The recognition method described above is then applied to the question-number image and/or the option image to obtain the question-number characters and/or the mark-code characters of the option (S603). Based on these characters, the teaching-aid file for the corresponding question is retrieved from the database of the computer host and played back, helping the user understand the principle behind the question. When needed, other similar applied questions related to this question can also be supplied, so that the student becomes more familiar with this type of question, thereby assisting learning (S604).
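The retrieval in step S604 is, at its core, a keyed lookup from recognized characters to a teaching-aid record. The sketch below is purely illustrative: the record layout, the sample key "Q12", the file name, and the function name are hypothetical, since the text does not describe how the database on the computer host is organized.

```python
from dataclasses import dataclass, field


@dataclass
class TeachingAid:
    """Hypothetical record stored for one test question."""
    media_file: str                       # explanation to play back
    related_questions: list = field(default_factory=list)


# Hypothetical database keyed by the recognized mark-code characters.
DATABASE = {
    "Q12": TeachingAid("q12_explanation.mp4", ["Q13", "Q27"]),
}


def retrieve_teaching_aid(mark_code, database=DATABASE):
    """Return the record for the recognized characters, or None when the
    characters do not match any stored question."""
    return database.get(mark_code)
```

Returning None for an unknown code leaves it to the caller to decide whether to prompt the user to capture the image again.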

Alongside the playback in step S604, the answers that the user selects on the paper test with the digital-camera optical pen can be transmitted to the computer host in real time. The computer host automatically records the learner's answers and analyzes them, so that it can give the learner further appropriate help.

In summary, the present invention has at least the following advantages:

1. The document image capture and recognition method of the present invention captures the image of a document in a single shot with a digital camera. The user does not need to sweep a pen across the text to scan it, which simplifies operation and raises the probability of successful recognition. Notably, the digital-camera optical pen used in the present invention does not require the book to lie flat on a desk before an image can be captured. The user can lean back on a sofa, pick up a printed book, read comfortably, and use the digital-camera optical pen to capture the text, numbers, or codes in the book, which is very convenient.

2. The digital-camera optical pen used in the document image capture and recognition method of the present invention can be paired with Bluetooth wireless technology. Users can read sitting or lying anywhere at home and obtain rich, high-quality electronic media commentary at any time, which makes reading a printed book a pleasant experience.

3. The digital-camera optical pen used in the document image capture and recognition method of the present invention can be used with a computer. The pen captures an image of an annotation code printed in a book, and the image is analyzed and compared to recognize the page code and sentence code that it contains. A teaching-aid file corresponding to that page code and sentence code is then retrieved from a database device and played back, helping the user understand more effectively the content of the physical book at the place marked by the annotation code, thereby assisting learning.

4. The document image capture and recognition method of the present invention can be applied to obtaining test-paper information. The test paper has at least one mark code. The document image capture and recognition system of the present invention captures an image at the marked location on the test paper, recognizes the mark-code image, and outputs mark-code characters. Associated learning-assistance information can then be provided according to these characters. The present invention lets a student's answers on paper interact with the multimedia teaching materials and practice questions on a learning aid, which creates a new kind of personalized test-practice environment and improves on the traditional paper test.

Although the present invention has been disclosed above by way of a preferred embodiment, the embodiment is not intended to limit the invention. Anyone skilled in the art may make minor changes and refinements without departing from the spirit and scope of the invention, so the scope of protection of the invention is defined by the appended claims.

Brief Description of the Drawings

FIG. 1A and FIG. 1B are schematic diagrams of a document image capture and recognition system according to a preferred embodiment of the present invention.
FIG. 2 shows an example of the correspondence between recognizable characters and the special-symbol character set.
FIG. 3 shows a flowchart of image capture and recognition for recognizable characters according to the present invention.
FIG. 4 shows a flowchart of image capture and recognition for special-symbol characters according to the present invention.
FIG. 5 shows a flowchart of light-surface parameter estimation and light-effect removal performed on an image according to the present invention.
FIG. 6 shows a flowchart of the character normalization steps according to the present invention.
FIG. 7 shows a flowchart of using the document image capture and recognition system to obtain test-paper information according to the present invention.

Description of Reference Numerals

110 digital-camera optical pen
112 image capture unit
114 image recognition unit
120 computer host
130 document


Claims (1)

1. A document image capture and recognition system, adapted to capture and recognize images of a document, the system comprising at least:
an image capture unit that captures a single image of the document with a single photograph; and
an image recognition unit electrically connected to the image capture unit, the image recognition unit being adapted to recognize the single image and to provide related information according to the recognition result, wherein the method by which the image recognition unit recognizes the single image comprises:
performing light-surface parameter estimation and light-effect removal on the single image to obtain a corrected image;
performing character segmentation on the corrected image to obtain a character image;
performing character normalization on the character image to obtain a normalized character image; and
performing feature comparison on the normalized character image.

2. The document image capture and recognition system according to claim 1, wherein the image capture unit and the image recognition unit are disposed in an optical pen.

3. The document image capture and recognition system according to claim 2, wherein the information of the single image is transmitted from the image recognition unit to a computer host by either wireless communication or wired communication.

4. The document image capture and recognition system according to claim 1, wherein the image capture unit is disposed in an optical pen and the image recognition unit is disposed in a computer host.

5. The document image capture and recognition system according to claim 4, wherein the information of the single image is transmitted from the image capture unit to the image recognition unit by either wireless communication or wired communication.

6. The document image capture and recognition system according to claim 1, wherein the image capture unit is a digital camera.

7. The document image capture and recognition system according to claim 1, wherein the document is a paper test paper.

8. The document image capture and recognition system according to claim 1, wherein the document is a paper book.

9. The document image capture and recognition system according to claim 1, wherein the document is a print medium.

10. The document image capture and recognition system according to claim 1, wherein the document is an image of text and graphics on the screen of an information hardware device.

11. A recognition method, adapted to recognize an image captured from a document, the recognition method comprising at least:
performing light-surface parameter estimation and light-effect removal on the image to obtain a corrected image;
performing character segmentation on the corrected image to obtain a character image;
performing character normalization on the character image to obtain a normalized character image; and
performing feature comparison on the normalized character image.

12. The recognition method according to claim 11, wherein when the image includes at least one recognizable character, a linear transformation is applied to the normalized character image by linear discriminant analysis before the feature comparison is performed on the normalized character image.

13. The recognition method according to claim 11, wherein when the image includes at least one special symbol, the normalized character image is converted into a feature code before the feature comparison is performed on the normalized character image.

14. The recognition method according to claim 13, wherein the special symbol conforms to the definition of a Hadamard code.

15. The recognition method according to claim 11, wherein the step of performing light-surface parameter estimation and light-effect removal on the image comprises:
computing a plurality of surface parameters of a quadric surface by least mean square error;
reconstructing a standard quadric surface from the surface parameters; and
subtracting the standard quadric surface from the image to obtain the corrected image.

16. The recognition method according to claim 15, further comprising, after the corrected image is obtained, using an equalization technique to raise the contrast of the corrected image.

17. The recognition method according to claim 11, wherein the step of performing character normalization on the character image to obtain the normalized character image comprises:
computing a covariance matrix from the coordinates of each point in the image;
computing a plurality of eigenvalues and a plurality of original eigenvectors of the covariance matrix;
dividing the eigenvectors by the corresponding eigenvalues to obtain a plurality of normalized eigenvectors;
applying a coordinate transformation to the image with the normalized eigenvectors to form a [...]

[...] at least one recognizable character, a linear transformation is applied to the normalized character image by linear discriminant analysis before the feature comparison is performed on the normalized character image.

21. The method of obtaining test-paper information using a document image capture and recognition system according to claim 19, wherein when the mark-code image includes at least one special symbol, the normalized character image is converted into a feature code before the feature comparison is performed on the normalized character image.

22. The method of obtaining test-paper information using a document image capture and recognition system according to claim 21, wherein the special symbol conforms to the definition of a Hadamard code.

23. The method of obtaining test-paper information using a document image capture and recognition system according to claim 19, wherein the step of performing light-surface parameter estimation and light-effect removal on the mark-code image to obtain the corrected image comprises:
computing a plurality of surface parameters of a quadric surface by least mean square error;
reconstructing a standard quadric surface from the surface parameters; and
subtracting the standard quadric surface from the image to obtain the corrected image.

24. The method of obtaining test-paper information using a document image capture and recognition system according to claim 23, further comprising, after the corrected image is obtained:
using an equalization technique to raise the contrast of the corrected image.

25. The method of obtaining test-paper information using a document image capture and recognition system according to claim 19, wherein the step of performing character normalization on the character image to obtain the normalized character image comprises:
computing a covariance matrix from the coordinates of each point in the image;
computing a plurality of eigenvalues and a plurality of original eigenvectors of the covariance matrix;
dividing the eigenvectors by the corresponding eigenvalues to obtain a plurality of normalized eigenvectors;
applying a coordinate transformation to the image with the normalized eigenvectors to form a characterized image;
computing a rotation angle by the complex moments method; and
generating a rotation matrix from the rotation angle and using the rotation matrix to rotate the characterized image to a standard angle, to obtain the normalized character image.

26. The method of obtaining test-paper information using a document image capture and recognition system according to claim 18, wherein the document image capture and recognition system comprises:
an image capture unit adapted to capture the mark-code image; and
an image recognition unit electrically connected to the image capture unit, the image recognition unit being adapted to recognize the mark-code image and to provide related information according to the recognition result.

27. The method of obtaining test-paper information using a document image capture and recognition system according to claim 26, wherein the image capture unit and the image recognition unit are disposed in an optical pen.

28. The method of obtaining test-paper information using a document image capture and recognition system according to claim 27, wherein the information of the single image is transmitted from the image recognition unit to a computer host by either wireless communication or wired communication.

29. The method of obtaining test-paper information using a document image capture and recognition system according to claim 26, wherein the image capture unit is disposed in an optical pen and the image recognition unit is disposed in a computer host.

30. The method of obtaining test-paper information using a document image capture and recognition system according to claim 29, wherein the information of the single image is transmitted from the image capture unit to the image recognition unit by either wireless communication or wired communication.

31. The method of obtaining test-paper information using a document image capture and recognition system according to claim 26, wherein the image capture unit is a digital camera.
TW93100029A 2004-01-02 2004-01-02 System for capturing and identifying image, method for identifying images and method for acquring additional remarks about test paper TWI239200B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
TW93100029A TWI239200B (en) 2004-01-02 2004-01-02 System for capturing and identifying image, method for identifying images and method for acquring additional remarks about test paper

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
TW93100029A TWI239200B (en) 2004-01-02 2004-01-02 System for capturing and identifying image, method for identifying images and method for acquring additional remarks about test paper

Publications (2)

Publication Number Publication Date
TW200524399A TW200524399A (en) 2005-07-16
TWI239200B true TWI239200B (en) 2005-09-01

Family

ID=37001228

Family Applications (1)

Application Number Title Priority Date Filing Date
TW93100029A TWI239200B (en) 2004-01-02 2004-01-02 System for capturing and identifying image, method for identifying images and method for acquring additional remarks about test paper

Country Status (1)

Country Link
TW (1) TWI239200B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TWI745724B (en) * 2019-07-25 2021-11-11 國泰人壽保險股份有限公司 Mobile Document Recognition System

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
BRPI0800754A2 (en) * 2008-03-25 2020-09-24 Sicpa Holding S.A. PRODUCTION CONTROL SYSTEM INTEGRATED BY IMAGE PROCESSING AND AUTOMATED CODING


Also Published As

Publication number Publication date
TW200524399A (en) 2005-07-16

Similar Documents

Publication Publication Date Title
JP4560299B2 (en) Generating memo documents to be increased
Shilkrot et al. FingerReader: a wearable device to explore printed text on the go
US8794978B2 (en) Educational material processing apparatus, educational material processing method, educational material processing program and computer-readable recording medium
Baker et al. Tactile graphics with a voice: using QR codes to access text in tactile graphics
CN101685482A (en) Electric marking system capable of automatically processing marking results and method thereof
KR20060004916A (en) Scanning apparatus
US20160162137A1 (en) Interactive Digital Workbook Using Smart Pens
Grahame Digital note-taking: Discussion of evidence and best practices
JP5729058B2 (en) Foreign language teaching material creation system
CN205177198U (en) System of going over examination papers on line infraduction line
Zeinullin et al. Tactile audio responsive intelligent system
JP2007005950A (en) Image processing apparatus and network system
CN112396897A (en) Teaching system
TWI239200B (en) System for capturing and identifying image, method for identifying images and method for acquring additional remarks about test paper
TWI281134B (en) Real-time distant teaching aided method and system bundled with physical plane book and computer-readable storage media
Shilkrot et al. FingerReader: A finger-worn assistive augmentation
JP2013011705A (en) Information terminal, information processing method and education support system
TW201209729A (en) Scoring system
Gordon Lanning et al. Traces of humanity: Echoes of social and cultural experience in physical objects and digital surrogates in the University of Victoria Libraries
Rai et al. MyOcrTool: visualization system for generating associative images of Chinese characters in smart devices
Stothert-Maurer et al. Read by touch: Stewarding the reading and writing collection at the perkins school for the blind
JP7333526B2 (en) Comic machine translation device, comic parallel database generation device, comic machine translation method and program
CN212276609U (en) Learning analysis machine
Le et al. Document image collection using Amazon's Mechanical Turk
Fruchterman Accessing books and documents

Legal Events

Date Code Title Description
MM4A Annulment or lapse of patent due to non-payment of fees