TW201301874A - Method and device of document scanning and portable electronic device - Google Patents

Method and device of document scanning and portable electronic device Download PDF

Info

Publication number
TW201301874A
TW201301874A TW100122242A TW100122242A TW201301874A TW 201301874 A TW201301874 A TW 201301874A TW 100122242 A TW100122242 A TW 100122242A TW 100122242 A TW100122242 A TW 100122242A TW 201301874 A TW201301874 A TW 201301874A
Authority
TW
Taiwan
Prior art keywords
images
adjusted
image
file
feature points
Prior art date
Application number
TW100122242A
Other languages
Chinese (zh)
Inventor
Chang-Ming Lee
Kai-Chen Lin
Chien-An Chen
Original Assignee
Wistron Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Wistron Corp filed Critical Wistron Corp
Priority to TW100122242A priority Critical patent/TW201301874A/en
Priority to CN201110192655XA priority patent/CN102843479A/en
Priority to US13/311,569 priority patent/US20120327486A1/en
Publication of TW201301874A publication Critical patent/TW201301874A/en

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N1/00Scanning, transmission or reproduction of documents or the like, e.g. facsimile transmission; Details thereof
    • H04N1/387Composing, repositioning or otherwise geometrically modifying originals
    • H04N1/3876Recombination of partial images to recreate the original image
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N1/00Scanning, transmission or reproduction of documents or the like, e.g. facsimile transmission; Details thereof
    • H04N1/00002Diagnosis, testing or measuring; Detecting, analysing or monitoring not otherwise provided for
    • H04N1/00007Diagnosis, testing or measuring; Detecting, analysing or monitoring not otherwise provided for relating to particular apparatus or devices
    • H04N1/00018Scanning arrangements
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N1/00Scanning, transmission or reproduction of documents or the like, e.g. facsimile transmission; Details thereof
    • H04N1/00002Diagnosis, testing or measuring; Detecting, analysing or monitoring not otherwise provided for
    • H04N1/00026Methods therefor
    • H04N1/00034Measuring, i.e. determining a quantity by comparison with a standard
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N1/00Scanning, transmission or reproduction of documents or the like, e.g. facsimile transmission; Details thereof
    • H04N1/04Scanning arrangements, i.e. arrangements for the displacement of active reading or reproducing elements relative to the original or reproducing medium, or vice versa
    • H04N1/10Scanning arrangements, i.e. arrangements for the displacement of active reading or reproducing elements relative to the original or reproducing medium, or vice versa using flat picture-bearing surfaces
    • H04N1/107Scanning arrangements, i.e. arrangements for the displacement of active reading or reproducing elements relative to the original or reproducing medium, or vice versa using flat picture-bearing surfaces with manual scanning
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N1/00Scanning, transmission or reproduction of documents or the like, e.g. facsimile transmission; Details thereof
    • H04N1/04Scanning arrangements, i.e. arrangements for the displacement of active reading or reproducing elements relative to the original or reproducing medium, or vice versa
    • H04N1/19Scanning arrangements, i.e. arrangements for the displacement of active reading or reproducing elements relative to the original or reproducing medium, or vice versa using multi-element arrays
    • H04N1/195Scanning arrangements, i.e. arrangements for the displacement of active reading or reproducing elements relative to the original or reproducing medium, or vice versa using multi-element arrays the array comprising a two-dimensional array or a combination of two-dimensional arrays
    • H04N1/19594Scanning arrangements, i.e. arrangements for the displacement of active reading or reproducing elements relative to the original or reproducing medium, or vice versa using multi-element arrays the array comprising a two-dimensional array or a combination of two-dimensional arrays using a television camera or a still video camera
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N1/00Scanning, transmission or reproduction of documents or the like, e.g. facsimile transmission; Details thereof
    • H04N1/40Picture signal circuits
    • H04N1/401Compensating positionally unequal response of the pick-up or reproducing head
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N1/00Scanning, transmission or reproduction of documents or the like, e.g. facsimile transmission; Details thereof
    • H04N1/00127Connection or combination of a still picture apparatus with another apparatus, e.g. for storage, processing or transmission of still picture signals or of information associated with a still picture
    • H04N1/00204Connection or combination of a still picture apparatus with another apparatus, e.g. for storage, processing or transmission of still picture signals or of information associated with a still picture with a digital computer or a digital computer system, e.g. an internet server
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N1/00Scanning, transmission or reproduction of documents or the like, e.g. facsimile transmission; Details thereof
    • H04N1/00127Connection or combination of a still picture apparatus with another apparatus, e.g. for storage, processing or transmission of still picture signals or of information associated with a still picture
    • H04N1/00281Connection or combination of a still picture apparatus with another apparatus, e.g. for storage, processing or transmission of still picture signals or of information associated with a still picture with a telecommunication apparatus, e.g. a switched network of teleprinters for the distribution of text-based information, a selective call terminal
    • H04N1/00307Connection or combination of a still picture apparatus with another apparatus, e.g. for storage, processing or transmission of still picture signals or of information associated with a still picture with a telecommunication apparatus, e.g. a switched network of teleprinters for the distribution of text-based information, a selective call terminal with a mobile telephone apparatus
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N1/00Scanning, transmission or reproduction of documents or the like, e.g. facsimile transmission; Details thereof
    • H04N1/21Intermediate information storage
    • H04N1/2104Intermediate information storage for one or a few pictures
    • H04N1/2112Intermediate information storage for one or a few pictures using still video cameras
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N2201/00Indexing scheme relating to scanning, transmission or reproduction of documents or the like, and to details thereof
    • H04N2201/0096Portable devices
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N2201/00Indexing scheme relating to scanning, transmission or reproduction of documents or the like, and to details thereof
    • H04N2201/04Scanning arrangements
    • H04N2201/0402Arrangements not specific to a particular one of the scanning methods covered by groups H04N1/04 - H04N1/207
    • H04N2201/0436Scanning a picture-bearing surface lying face up on a support
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N2201/00Indexing scheme relating to scanning, transmission or reproduction of documents or the like, and to details thereof
    • H04N2201/04Scanning arrangements
    • H04N2201/0402Arrangements not specific to a particular one of the scanning methods covered by groups H04N1/04 - H04N1/207
    • H04N2201/0458Additional arrangements for improving or optimising scanning resolution or quality

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Biomedical Technology (AREA)
  • General Health & Medical Sciences (AREA)
  • Image Processing (AREA)
  • Editing Of Facsimile Originals (AREA)

Abstract

A document scanning method includes receiving a plurality of images of a plurality of blocks of a document by an image receiver, adjusting characteristics of the plurality of images according to distances between the image receiver and the plurality of blocks when the image receiver is receiving the plurality of images, to generate a plurality of adjusted images, determining a plurality of characteristic points of each of the plurality of adjusted images and a plurality of characteristic vectors corresponding to the plurality of characteristic points, and combining the plurality of adjusted images according to the plurality of characteristic vectors corresponding to the plurality of characteristic points of each of the plurality of adjusted images, to generate a scanning result.

Description

文件掃描方法、文件掃描裝置及可攜式電子裝置Document scanning method, document scanning device and portable electronic device

本發明係指一種文件掃描方法、文件掃描裝置及可攜式電子裝置,尤指一種可根據影像特徵點,合併不同影像,以提供文件掃描功能之文件掃描方法、文件掃描裝置及可攜式電子裝置。The invention relates to a document scanning method, a document scanning device and a portable electronic device, in particular to a document scanning method, a document scanning device and a portable electronic device capable of combining different images according to image feature points to provide a document scanning function. Device.

可攜式電子裝置,如筆記型電腦、平版電腦、智慧型手機等,有著體積小、重量輕、攜帶方便等特性,能夠讓使用者隨時隨地都能擁有強大的計算能力與文書處理功能,因此已成為商務人士必備的工具之一。在此情形下,如何提升可攜式電子裝置的功能,以應付各式需求,也就成為業界所努力的目標之一。Portable electronic devices, such as notebook computers, lithographic computers, smart phones, etc., have the characteristics of small size, light weight, and easy portability, so that users can have powerful computing power and paper processing functions anytime, anywhere. Has become one of the necessary tools for business people. Under this circumstance, how to improve the functions of portable electronic devices to meet various needs has become one of the goals of the industry.

舉例來說,商務人士於外出時通常會攜帶筆記型電腦,用以展示產品資訊或記錄會議資料等。然而,當臨時遇到需要將紙本文件掃描成電子檔案的情況時,由於一般使用筆記型電腦的情況下不會隨身攜帶掃描器,因此,使用者往往需到附近的便利商店或回公司後再掃描,不但耗費金錢與時間,甚至錯失此份資料的即時性。因此,若筆記型電腦擁有掃描文件的功能就會方便許多。For example, business people usually carry a laptop when they go out to display product information or record meeting materials. However, when it is temporarily encountered that a paper document needs to be scanned into an electronic file, since the scanner is not carried with the notebook computer in general, the user often needs to go to a nearby convenience store or return to the company. Scanning again, not only costs money and time, but also misses the immediacy of this information. Therefore, it is much more convenient if the notebook has the function of scanning files.

因此,本發明主要提供一種文件掃描方法、文件掃描裝置及可攜式電子裝置。Therefore, the present invention mainly provides a document scanning method, a document scanning device, and a portable electronic device.

本發明揭露一種文件掃描方法,包含有由一影像擷取裝置擷取一文件之複數個區塊的複數個影像;根據該影像擷取裝置擷取該複數個影像時與該複數個區塊的距離,調整該複數個影像之特性,以產生複數個調整後影像;判斷該複數個調整後影像中每一調整後影像的複數個特徵點及該複數個特徵點的複數個特徵向量;以及根據該複數個調整後影像之每一調整後影像的該複數個特徵向量,結合該複數個調整後影像,以產生該文件的一掃描結果。The present invention discloses a file scanning method, which includes a plurality of images of a plurality of blocks of a file captured by an image capturing device; and the image capturing device captures the plurality of images and the plurality of blocks according to the image capturing device a distance, adjusting a characteristic of the plurality of images to generate a plurality of adjusted images; determining a plurality of feature points of each adjusted image in the plurality of adjusted images and a plurality of feature vectors of the plurality of feature points; The plurality of feature vectors of each adjusted image of the plurality of adjusted images are combined with the plurality of adjusted images to generate a scan result of the file.

本發明另揭露一種文件掃描裝置,包含有一影像擷取裝置;一測距單元;一處理器;以及一儲存單元,該儲存單元用來儲存一程式碼,該程式碼指示該處理器執行以下步驟:控制該影像擷取裝置擷取一文件之複數個區塊的複數個影像;控制該測距單元測量該影像擷取裝置擷取該複數個影像時與該複數個區塊的距離;根據該影像擷取裝置擷取該複數個影像時與該複數個區塊的距離,調整該複數個影像之特性,以產生複數個調整後影像;判斷該複數個調整後影像中每一調整後影像的複數個特徵點及該複數個特徵點的複數個特徵向量;以及根據該複數個調整後影像之每一調整後影像的該複數個特徵向量,結合該複數個調整後影像,以產生該文件的一掃描結果。The invention further discloses a document scanning device, comprising: an image capturing device; a ranging unit; a processor; and a storage unit, wherein the storage unit is configured to store a code, the code indicating that the processor performs the following steps Controlling the image capturing device to capture a plurality of images of a plurality of blocks of a file; controlling the ranging unit to measure a distance from the plurality of blocks when the image capturing device captures the plurality of images; The image capture device captures the distance from the plurality of blocks when the plurality of images are captured, and adjusts the characteristics of the plurality of images to generate a plurality of adjusted images; and determines each of the adjusted images in the plurality of adjusted images. a plurality of feature points and a plurality of feature vectors of the plurality of feature points; and combining the plurality of adjusted images according to the plurality of feature vectors of each adjusted image of the plurality of adjusted images to generate the file A scan result.

本發明另揭露一種可攜式電子裝置,包含有一處理器;一儲存單元;一影像擷取裝置;以及一文件掃描裝置,包含有一測距單元;一程式碼,儲存於該儲存單元中,該程式碼指示該處理器執行以下步驟:控制該影像擷取裝置擷取一文件之複數個區塊的複數個影像;控制該測距單元測量該影像擷取裝置擷取該複數個影像時與該複數個區塊的距離;根據該影像擷取裝置擷取該複數個影像時與該複數個區塊的距離,調整該複數個影像之特性,以產生複數個調整後影像;判斷該複數個調整後影像中每一調整後影像的複數個特徵點及該複數個特徵點的複數個特徵向量;以及根據該複數個調整後影像之每一調整後影像的該複數個特徵向量,結合該複數個調整後影像,以產生該文件的一掃描結果。The present invention further discloses a portable electronic device including a processor, a storage unit, an image capture device, and a file scanning device, including a ranging unit, and a code stored in the storage unit. The code indicates that the processor performs the following steps: controlling the image capturing device to capture a plurality of images of a plurality of blocks of a file; and controlling the ranging unit to measure the image capturing device to capture the plurality of images The distance between the plurality of blocks; the distance between the plurality of blocks when the image capturing device captures the plurality of blocks, and adjusting the characteristics of the plurality of images to generate a plurality of adjusted images; determining the plurality of adjustments a plurality of feature points of each adjusted image in the post-image and a plurality of feature vectors of the plurality of feature points; and the plurality of feature vectors according to each adjusted image of the plurality of adjusted images, combined with the plurality of feature vectors The adjusted image is used to produce a scan of the file.

筆記型電腦、平版電腦、智慧型手機等可攜式電子裝置通常配備有照像機,用以提供照相、攝影、視訊電話等影像擷取功能。在此情形下,本發明利用可攜式電子裝置的影像擷取功能實現掃描功能,並搭配一系列演算流程,有效將同一文件的不同影像部分結合為完整文件或影像資料。Portable electronic devices such as notebook computers, lithographic computers, and smart phones are usually equipped with a camera for providing image capture functions such as photography, photography, and video telephony. In this case, the present invention utilizes the image capturing function of the portable electronic device to implement the scanning function, and cooperates with a series of calculation processes to effectively combine different image portions of the same file into a complete file or image data.

請參考第1圖,第1圖為本發明實施例一文件掃描裝置10之示意圖。文件掃描裝置10可設置於筆記型電腦、平版電腦、智慧型手機等可攜式電子裝置,其係由一影像擷取裝置100、一測距單元102、一提示單元114及一處理模組104所組成。影像擷取裝置100可以是可攜式電子裝置原始設置之照相機、視訊設備等,用來擷取影像。測距單元102係利用紅外線、超音波等測距原理,測量影像擷取裝置100擷取影像時與該影像的實際距離。提示單元114可顯示測距單元102的測距結果,其可以是燈號、聲響或是顯示在一螢幕之訊息等。處理模組104係由一處理器106及一儲存單元108所組成,儲存單元108中儲存有一文件掃描程式碼110,用以指示處理器106執行文件掃描功能。Please refer to FIG. 1. FIG. 1 is a schematic diagram of a document scanning device 10 according to an embodiment of the present invention. The file scanning device 10 can be disposed in a portable electronic device such as a notebook computer, a lithographic computer, or a smart phone, and is provided by an image capturing device 100, a ranging unit 102, a prompting unit 114, and a processing module 104. Composed of. The image capturing device 100 can be a camera, a video device, or the like originally set by the portable electronic device for capturing images. The ranging unit 102 measures the actual distance from the image capturing device 100 when capturing an image by using a distance measuring principle such as infrared rays or ultrasonic waves. The prompting unit 114 can display the ranging result of the ranging unit 102, which can be a light number, a sound, or a message displayed on a screen. The processing module 104 is composed of a processor 106 and a storage unit 108. The storage unit 108 stores a file scanning code 110 for instructing the processor 106 to perform a file scanning function.

當要進行文件掃描功能時,文件掃描裝置10係利用影像擷取裝置100擷取一待掃描文件112之區塊BLK_1~BLK_n的影像IMG_1~IMG_n,而在影像擷取裝置100擷取影像IMG_1~IMG_n時,測距單元102會測量影像擷取裝置100與區塊BLK_1~BLK_n的距離DT_1~DT_n,並透過提示單元114顯示對應的測量結果。處理模組104則根據距離DT_1~DT_n,先調整影像IMG_1~IMG_n之特性,再判斷其中之特徵點及特徵向量,最後據以結合調整後的影像,以產生文件112的掃描結果SCN。When the file scanning function is to be performed, the image scanning device 10 captures the images IMG_1 to IMG_n of the blocks BLK_1 to BLK_n of the file to be scanned 112 by the image capturing device 100, and captures the image IMG_1~ at the image capturing device 100. In the case of IMG_n, the ranging unit 102 measures the distances DT_1 DT DT_n of the image capturing device 100 and the blocks BLK_1 ~ BLK_n, and displays the corresponding measurement results through the prompting unit 114. The processing module 104 first adjusts the characteristics of the images IMG_1 to IMG_n according to the distances DT_1 ~ DT_n, and then determines the feature points and feature vectors therein, and finally combines the adjusted images to generate the scan result SCN of the file 112.

關於處理模組104的運作原理,以下分不同步驟詳細說明。Regarding the operation principle of the processing module 104, the following steps are described in detail.

一、影像調整:First, image adjustment:

如前所述,影像擷取裝置100可以是可攜式電子裝置原始設置之照相機等,亦可以是額外新增之設備,由於可攜式電子裝置之空間較小,且為維持影像擷取品質,因此文件掃描裝置10係將文件112依不同區塊BLK_1~BLK_n依次擷取。其中,需注意的是,區塊BLK_1~BLK_n中相鄰區塊間需部分重疊,始可進行影像合併,而關於如何組合影像及當無法組合時的處理方式,於後詳述。此外,在擷取區塊BLK_1~BLK_n之影像IMG_1~IMG_n時,較佳地係由使用者手動平移,但不限於此,而為確保使用者的平移方式符合系統要求,提示單元114可將最佳距離告知使用者,以避免大小不一的情況。As described above, the image capturing device 100 may be a camera or the like originally set by the portable electronic device, or may be an additional device. The space of the portable electronic device is small, and the image capturing quality is maintained. Therefore, the file scanning device 10 sequentially extracts the file 112 according to different blocks BLK_1 to BLK_n. It should be noted that the adjacent blocks in the blocks BLK_1 to BLK_n need to be partially overlapped, and image combination can be performed at the beginning, and the processing manners when how to combine images and when they cannot be combined will be described in detail later. In addition, when the images IMG_1~IMG_n of the blocks BLK_1-BLK_n are captured, it is preferably manually translated by the user, but is not limited thereto, and the prompting unit 114 can be the most to ensure that the user's translation mode conforms to the system requirements. The distance is informed to the user to avoid situations of varying sizes.

由此可知,在進行掃描文件112時,使用者拿著待掃描的文件112在影像擷取裝置100前移動,由影像擷取裝置100連續拍下文件112的區塊BLK_1~BLK_n的影像IMG_1~IMG_n。由於手持方式拿著文件112進行平移,移動過程中會因為外在環境因素等造成拍攝出來的影像IMG_1~IMG_n距離遠近不一致,使得影像IMG_1~IMG_n的大小會有所不同,可能造成接合後的文件影像內容中會有明顯的區塊差異或段差。因此,文件112與影像擷取裝置100之間的垂直距離就顯得相當重要,太遠或太近都會影響到所拍攝出來之文字內容的清晰度。在此情形下,當使用者進行掃描的時候,測距單元102可將文件112之各區塊與影像擷取裝置100之間最佳拍攝距離的值,透過提示單元114告知使用者。Therefore, when the file 112 is scanned, the user moves the file 112 to be scanned before the image capturing device 100, and the image capturing device 100 continuously captures the image IMG_1 of the blocks BLK_1 to BLK_n of the file 112. IMG_n. Since the handheld mode is carried by the file 112, the distances of the images IMG_1 to IMG_n that are captured due to external environmental factors are inconsistent, and the sizes of the images IMG_1 to IMG_n may be different, which may result in the files after the bonding. There will be significant block differences or step differences in the image content. Therefore, the vertical distance between the file 112 and the image capturing device 100 is quite important, and too far or too close will affect the sharpness of the captured text content. In this case, when the user performs scanning, the ranging unit 102 can notify the user of the value of the optimal shooting distance between each block of the file 112 and the image capturing device 100 through the prompting unit 114.

此外,影像擷取裝置100拍攝影像IMG_1~IMG_n的同時,測距單元102也會同步紀錄每次拍攝時區塊BLK_1~BLK_n與影像擷取裝置100間的垂直距離DT_1~DT_n。而所記錄下來的距離DT_1~DT_n,將成為接下來接合區塊BLK_1~BLK_n的影像IMG_1~IMG_n時,進行每一影像縮小或放大、銳利化、對比度調整等後處理微調之重要參考的依據。同時,若處理模組104偵測出文件112某一區塊的影像拍攝效果不佳或無法使用,則可建議使用者只針對此區塊進行重新拍攝的動作,如此一來最後所接合出來的影像內容將更趨一致,掃描出成品的品質也較為良好。In addition, while the image capturing device 100 captures the images IMG_1 to IMG_n, the ranging unit 102 also synchronously records the vertical distances DT_1 to DT_n between the blocks BLK_1 to BLK_n and the image capturing device 100 at each shooting. The recorded distances DT_1 to DT_n are the basis for performing an important reference for post-processing fine-tuning such as image reduction, enlargement, sharpening, and contrast adjustment when the images IMG_1 to IMG_n of the subsequent blocks BLK_1 to BLK_n are next. At the same time, if the processing module 104 detects that the image of a certain block of the file 112 is not good or can not be used, the user may be recommended to perform the re-shooting action only for the block, so that the final stitching is performed. The content of the image will be more consistent, and the quality of the finished product will be better.

因此,處理模組104在取得影像IMG_1~IMG_n時,皆會記錄下對應區塊BLK_1~BLK_n與影像擷取裝置100之間的距離DT_1~DT_n,以做為縮放、調整影像IMG_1~IMG_n的依據。調整影像IMG_1~IMG_n的方式有許多種,舉例來說,可以拍攝第偶數個區塊(BLK_2、BLK_4...)時之距離(DT_2、DT_4...)加以平均後的數值做為基準點,做為縮放基準。例如,若總共拍攝四個區塊BLK_1~BLK_4的影像IMG_1~IMG_4,而第二張影像IMG_2拍攝距離DT_2為2公分,第四張影像IMG_4拍攝距離DT_4為3公分,將兩者的平均2.5公分做為縮放基準;因此,第二張影像IMG_2需放大2/2.5=0.8倍,而第四張影像IMG_4則需放大3/2.5=1.2倍。Therefore, when acquiring the images IMG_1 - IMG_n, the processing module 104 records the distances DT_1 - DT_n between the corresponding blocks BLK_1 - BLK_n and the image capturing device 100 as the basis for scaling and adjusting the images IMG_1 - IMG_n. . There are many ways to adjust the images IMG_1 to IMG_n. For example, the average value of the distance (DT_2, DT_4, ...) at the even-numbered blocks (BLK_2, BLK_4...) can be used as the reference point. , as a scaling benchmark. For example, if the images IMG_1 to IMG_4 of the four blocks BLK_1 to BLK_4 are taken in total, and the shooting distance DT_2 of the second image IMG_2 is 2 cm, the shooting distance DT_4 of the fourth image IMG_4 is 3 cm, and the average of the two is 2.5 cm. As a zoom reference; therefore, the second image IMG_2 needs to be enlarged by 2/2.5=0.8 times, while the fourth image IMG_4 needs to be enlarged by 3/2.5=1.2 times.

另外,更進一步地,可使用雙線性內插法做為縮放處理之演算法。在所有被拍攝的影像都被縮放調整至同一基準後,才進行文件各區塊影像接合的處理。In addition, further, bilinear interpolation can be used as an algorithm for scaling processing. After all the captured images are zoomed and adjusted to the same reference, the image joint processing of each block of the file is performed.

二、影像合併:Second, the image merger:

影像合併的基本概念是先尋找出相鄰影像中相同的影像內容,藉此合倂影像。為了判斷影像內容,需先選取具代表性之特徵點,而通常影像的特徵有很多種型式,諸如紋理、顏色、形狀、輪廓等,而本發明係在影像中尋找具代表性的角點(corner point)特徵。The basic concept of image merging is to first find the same image content in adjacent images, thereby combining the images. In order to judge the content of the image, a representative feature point needs to be selected first, and usually the image has many types of features, such as texture, color, shape, contour, etc., and the present invention searches for representative corner points in the image ( Corner point) feature.

(a)尋找特徵點:(a) Looking for feature points:

為了判斷角點特徵,首先使用積分影像(integral image)技術以加快運算的速度,並使用量化後的高斯函數二階偏微分,來計算影像的海斯矩陣(Hessian matrix)行列式值,最後利用不同之變異數σ來產生不同的海斯行列式矩陣,以此尋找特徵點。In order to judge the corner feature, the integral image technique is first used to speed up the calculation, and the quantized Gaussian function second-order partial differential is used to calculate the Hessian matrix determinant value of the image. The variation σ produces different Hays determinant matrices to find feature points.

詳細來說,若I(x,y)表示一影像I中座標(x,y)之像素,則此點的海斯矩陣在變異數為σ的前提下,其表示式為,其中h(x,y,σ)為像素I(x,y)所對應的海斯矩陣;L xx (x,y,σ)為像素I(x,y)與高斯函數g(σ)在x軸方向的二階導數之迴旋積值;而L xy (x,y,σ)與L yy (x,y,σ)則分別為像素I(x,y)與進行迴旋積之值,而g(σ)為高斯函數。每一元素H(x,y,σ)為影像I在座標(x,y)之海斯矩陣的行列式值,則可寫成H(x,y,σ)=L xx (x,y,σ)*L yy (x,y,σ)-(L xy (x,y,σ))2,可稱H為影像I對應於σ值之海斯行列式矩陣。使用不同的σ值可得到不同的海斯行列式矩陣H,例如若選取σ1=1.2、σ2=2、σ3=2.8,可得到不同尺度空間下的H 1H 2H 3。當影像I其數個對應於不同σ值之海斯行列式矩陣產生後,接著便從這些海斯行列式矩陣中尋找特徵點。In detail, if I ( x , y ) represents a pixel of coordinates ( x , y ) in an image I , the Hayes matrix of this point is represented by the variability of σ. Where h ( x , y , σ) is the Hayesian matrix corresponding to the pixel I ( x , y ); L xx ( x , y , σ) is the pixel I ( x , y ) and the Gaussian function g (σ) Second derivative in the x-axis direction The gyro product; L xy ( x , y , σ ) and L yy ( x , y , σ) are the pixels I ( x , y ) and The value of the convolution product is performed, and g (σ) is a Gaussian function. Each element H ( x , y , σ) is the determinant value of the Hayes matrix of the image I at coordinates ( x , y ), which can be written as H ( x , y , σ ) = L xx ( x , y , σ * L yy ( x , y , σ) - ( L xy ( x , y , σ)) 2 , which can be called H is a Hayesian matrix of the image I corresponding to the σ value. Different Hays determinant matrices H can be obtained by using different σ values. For example, if σ 1 =1.2, σ 2 =2, and σ 3 =2.8, H 1 , H 2 , and H 3 in different scale spaces can be obtained. When the number of images I corresponds to different σ values of the Hayes determinant matrix, then the feature points are searched from these Hays determinant matrices.

假設有一像素點其座標為(x,y),為第2圖中的X點,H 1H 2H 3分別為第2圖中對應至σ1、σ2、σ3之海斯行列式矩陣,則X點其26個鄰近像素(上、下尺度各九個鄰近點與本身尺度的八個鄰近點)即如第2圖中之圓點所示。若X點在其26個鄰近相素中,具有最大的海斯行列式值,則稱X點為特徵點。Suppose there is a pixel whose coordinates are ( x , y ), which is the X point in Fig. 2, and H 1 , H 2 , and H 3 are the Hayes determinant matrix corresponding to σ1, σ2, and σ3 in Fig. 2 , respectively. Then X points its 26 adjacent pixels (the nine adjacent points of the upper and lower scales and the eight adjacent points of its own scale) are as shown by the dots in Figure 2. If the X point has the largest Hays determinant value among its 26 neighboring pixels, the X point is called the feature point.

(b)決定特徵向量:(b) Determine the feature vector:

找出特徵點後,由於單一特徵點無法確切描述特徵點周圍區域的資料,故需要在特徵點附近劃出一塊鄰近區域,並利用此鄰近區域,來產生代表此特徵點之特徵向量。首先,在計算描述一特徵點X之特徵向量前,為了使特徵向量對旋轉具有抵抗力,也就是影像旋轉後,仍能有效找出兩張影像吻合之特徵點,因此需要找出特徵點X的主要方向。因此,可將特徵點X為中心之鄰近正方區域(如20*20)像素劃出一個區域為R,將區域R分別與哈爾(Haar)水平、垂直方向濾波器做迴旋積,並以dx(x,y)與dy(x,y)代表完成迴旋積後的結果,而dx(x,y)、dy(x,y)為與區域R相同大小之矩陣(如20*20)。因此,對區域R中每一像素點,在dx(x,y)與dy(x,y)中都有對應的值。After finding the feature points, since a single feature point cannot accurately describe the data of the area around the feature point, it is necessary to draw a neighboring area near the feature point, and use the adjacent area to generate a feature vector representing the feature point. First, before calculating the feature vector describing a feature point X, in order to make the feature vector resistant to rotation, that is, after the image is rotated, the feature points of the two images can be effectively found, so it is necessary to find the feature point X. The main direction. Therefore, it is possible to draw a region from the neighboring square region (such as 20*20) with the feature point X as the center as R, and to circulate the region R with the Haar horizontal and vertical filters, respectively, and dx. ( x , y ) and dy ( x , y ) represent the result of completing the convolution product, and dx ( x , y ), dy ( x , y ) are matrices of the same size as the region R (eg 20*20). Therefore, for each pixel in the region R, there are corresponding values in dx ( x , y ) and dy ( x , y ).

接著,可以特徵點X為中心,將0至60度為一個區域,計算此區域中所有像素dx(x,y)與dy(x,y)的總合,即可得到一個序對(order pair),也就是(Σdx(x,y),Σdy(x,y)),以此序對做為一個向量,計算該向量的長度L=,此即是以特徵點X為中心0至60度區域的長度。接下來,分別計算60度至120度、120度至180、180度至240度、240至300度、300度至360度的長度。假設經過統計之後其在60至120間具有最高長度,則可將特徵點X之主要方向設定為90度(因為(120+60)/2)。在此例中,是以每60度為一個區間對區域R中所有像素做統計,但在實際應用上可用較小的區間對區域R中像素進行統計,以找出較正確的方向。換句話說,角度範圍的取捨,會影響到是否能精確找出特徵點主要方向的能力。Then, the feature point X can be centered, and 0 to 60 degrees is an area. The total of all the pixels dx ( x , y ) and dy ( x , y ) in the region can be calculated to obtain a sequence pair. ), that is, (Σ dx ( x , y ), dy dy ( x , y )), using this sequence as a vector, calculating the length of the vector L = This is the length of the region from 0 to 60 degrees centered on the feature point X. Next, lengths of 60 degrees to 120 degrees, 120 degrees to 180, 180 degrees to 240 degrees, 240 to 300 degrees, and 300 degrees to 360 degrees, respectively, are calculated. Assuming that it has the highest length between 60 and 120 after statistics, the main direction of the feature point X can be set to 90 degrees (because (120+60)/2). In this example, all pixels in the region R are counted every 60 degrees, but in practical applications, the pixels in the region R can be counted in a smaller interval to find a more correct direction. In other words, the choice of angle range will affect the ability to accurately find the main direction of the feature point.

在得到特徵點X的主要方向後,以特徵點X之主要方向為主,將影像旋轉至X之主要方向為正北方,劃定一個新的區域R’,其中區域R’也是以特徵點X為中心之區域(如20*20之正方區域),目的是為了能產生具旋轉不變之特徵向量,其結果分別以dx'(x,y)、dy'(x,y)表示之,其中dx'(x,y)、dy'(x,y)同樣為與區域R’之大小相同的矩陣。After the main direction of the feature point X is obtained, the main direction of the feature point X is dominant, and the main direction of the image is rotated to the north, and a new region R' is defined, wherein the region R' is also the feature point X. The centered area (such as the square area of 20*20) is designed to generate a eigenvector with rotation invariance, and the result is represented by dx '( x , y ), dy '( x , y ), respectively. Dx '( x , y ), dy '( x , y ) is also a matrix of the same size as the region R'.

接著,將區域R’分割成複數個子區塊,如4*4個,則每個子區塊大小為5*5,並利用dx'(x,y)與dy'(x,y)對每個子區塊計算下列四個分量,(Σdx'(x,y),Σdy'(x,y),Σ|dx'(x,y)|,Σ∣dy'(x,y)|),並以此四個分量代表一個子區塊。在此例中,由於區域R’有4*4個區塊,每個子區塊有四個分量,故能以特徵點X之主要方向為基準,由左至右、由上至下分別將代表各子區域的4個分量串接起來,形成一個64維的特徵向量,並以此特徵向量用來代表特徵點X。Next, the region R' is divided into a plurality of sub-blocks, such as 4*4, each sub-block size is 5*5, and each of the sub-blocks is represented by dx '( x , y ) and dy '( x , y ) The block calculates the following four components, (Σ dx '( x , y ), Σ dy '( x , y ), Σ| dx '( x , y )|, Σ∣ dy '( x , y )|), And four components represent one sub-block. In this example, since the region R' has 4*4 blocks, each sub-block has four components, so it can be represented by the main direction of the feature point X, from left to right and top to bottom respectively. The four components of each sub-area are concatenated to form a 64-dimensional feature vector, and this feature vector is used to represent the feature point X.

(c)影像比對與合併:(c) Image comparison and merger:

當進行兩張影像比對時,首先將所有自兩張影像中找出的特徵點,經上述(b)方式產生每個特徵點之特徵向量後,利用這些特徵點的特徵向量互相進行比對並找出最吻合的特徵點。其比對方式如下,若X為一特徵點,在特徵點的比對過程中,利用特徵點X的特徵向量與其他特徵點的特徵向量計算兩者間的歐氏距離。假設特徵點X1與X2分別為與X最接近與次接近的特徵點,且其距離分別為d1與d2,若d1<r*d2則認定X1與X最為吻合,其中r為自訂之係數,如0.5。最後,將兩個最接近的特徵點加以疊合,即完成兩張影像的合併,這也是為何需確保相鄰兩張影像有部分重疊的原因。When comparing two images, firstly, all the feature points found from the two images are generated by the feature vectors of each feature point through the above (b) method, and the feature vectors of the feature points are compared with each other. And find the most consistent feature points. The comparison method is as follows. If X is a feature point, the Euclidean distance between the feature vector of the feature point X and the feature vector of the other feature points is calculated during the alignment of the feature points. It is assumed that the feature points X1 and X2 are the feature points closest to X and closest to X, and the distances are d 1 and d 2 respectively. If d 1 < r * d 2 , it is determined that X1 and X are the most consistent, where r is self. Set the coefficient, such as 0.5. Finally, the two closest feature points are superimposed, that is, the merging of the two images is completed, which is why it is necessary to ensure that the two adjacent images partially overlap.

(d)色彩轉換:(d) Color conversion:

在開始判斷特徵點前,本發明另可對每一區塊影像做前置處理,讓影像中相似的顏色利用量化歸類成同一種顏色,以便更有效的提取全域特徵,主要目的是要讓相似度較大的影像內容,能被歸類至同一類,以減少影像合併時,兩個影像具有相似內容,卻因光線少許差異而造成兩者內容被判斷為不同之情況。色彩轉換的方式有許多種,例如可先將影像從RGB色彩空間轉換至CIEL*a*b*色彩空間,再對影像CIEL*a*b*中,a*、b*兩個子頻做量化,最後將影像從CIEL*a*b*轉換回RGB色彩空間。Before starting to judge the feature points, the present invention can also perform pre-processing on each block image, so that the similar color in the image is quantized into the same color, so as to extract the global features more effectively, the main purpose is to let Image content with similar similarity can be classified into the same category to reduce the situation where the two images have similar content when the image is merged, but the content of the two is judged to be different due to a slight difference in light. There are many ways to convert colors. For example, you can convert images from RGB color space to CIEL*a*b* color space, and then quantize the two sub-frequency of a* and b* in image CIEL*a*b*. Finally, convert the image from CIEL*a*b* back to the RGB color space.

色彩量化之目的是希望量化完後,讓影像色彩的差異降低,也就是說差異不大的顏色都被正規化為同一種顏色。為了讓使用者在掃描文件時,有理想的正確率與效率,特徵的挑選上必須講求特徵提取時的穩定性或是符合處理速度上的要求,因此挑選特徵的原則是當區塊影像之間差異較大時,可明確表示出各特徵之間的差異,反之,當區塊影像僅有些微的差異時,則忽略或使其不明顯。在此情形下,單純使用前述(a)之角點特徵無法提供較準確的鑑別力,因此需要另外加入更多描述影像細節的特徵,來與上述特徵結合成代表影像的特徵向量,例如可利用索貝爾(sobel)邊緣特徵、不變矩(invariant moment)、RGB色彩的標準差與均值特徵等,與前述之角點特徵組合成一個特徵向量,而每一張影像的每個特徵點所產生的特徵向量長度皆相同,做為合併影像時的依據。換句話說,本發明可利用前述(a)之方法所找出特徵點,以該特徵點位置為中心圈選出一方形區域(如15*15),分別就此方形區域算出該區域之索貝爾特徵(225維)、7個不變矩特徵(7維),及RGB色彩特徵(6維),最後取得維度為238維之特徵向量。The purpose of color quantization is to reduce the difference in image color after quantization, that is, the colors with little difference are normalized to the same color. In order to allow users to have the correct accuracy and efficiency when scanning documents, the selection of features must be based on the stability of feature extraction or the processing speed. Therefore, the principle of selecting features is between block images. When the difference is large, the difference between the features can be clearly indicated. Conversely, when the block image is only slightly different, it is ignored or made inconspicuous. In this case, simply using the corner feature of (a) above does not provide a more accurate discriminating power, so it is necessary to add more features describing the details of the image to combine with the above features to represent the feature vector of the image, for example, available. Sobel edge features, invariant moments, standard deviations and mean features of RGB colors, etc., combined with the aforementioned corner features into one feature vector, and each feature point of each image is generated. The feature vector lengths are the same, and are used as the basis for merging images. In other words, the present invention can use the method of the above (a) to find the feature point, and select a square area (such as 15*15) with the feature point as the center circle, and calculate the Sobel characteristics of the area for the square area respectively. (225 dimensions), 7 invariant moment features (7 dimensions), and RGB color features (6 dimensions), and finally obtain a feature vector with a dimension of 238 dimensions.

加入色彩轉換之處理後,相關影像比對與合併之運作方式仍是依照前述(c)之方法,於此不贅述。After the color conversion process is added, the operation mode of the related image comparison and combination is still in accordance with the method of the foregoing (c), and will not be described herein.

(e)除錯:(e) Debugging:

若在處理合併的計算過程中,有任何其中一張影像發生該影像的特徵點無法與其他影像的特徵點進行合併的情形,或是相符的特徵點少於一設定值(如20個),則本發明之文件掃描裝置10可透過提示單元114發出警告,讓使用者針對文件112上的對應的區塊進行重新拍攝,接著再找出重拍影像的所有特徵點與特徵向量,並與其他之前已經合併好的部分進行合併。If in the process of processing the merge, there is any case where the feature points of the image cannot be merged with the feature points of other images, or if the matching feature points are less than a set value (such as 20), Then, the document scanning device 10 of the present invention can issue a warning through the prompting unit 114, and let the user re-shoot the corresponding block on the file 112, and then find all the feature points and feature vectors of the retake image, and other The previously merged parts are merged.

因此,由上述說明可知,本發明之文件掃描裝置10可將文件112分為區塊BLK_1~BLK_n進行拍攝,最後再透過一系列演算法合併為掃描結果SCN。同時,在進行合併時,若發生合併失敗的情形時,可有效進行除錯,以確保掃描結果SCN的正確性。Therefore, as apparent from the above description, the document scanning apparatus 10 of the present invention can divide the file 112 into the blocks BLK_1 to BLK_n for photographing, and finally merge them into the scan result SCN through a series of algorithms. At the same time, when the merge is performed, if the merge fails, the debug can be effectively performed to ensure the correctness of the scan result SCN.

上述關於文件掃描裝置10之運作方式可進一步歸納為一文件掃描流程30,如第3圖所示。文件掃描流程30包含以下步驟:The above described operation of the document scanning device 10 can be further summarized into a document scanning process 30, as shown in FIG. The file scanning process 30 includes the following steps:

步驟300:開始。Step 300: Start.

步驟302:影像擷取裝置100擷取文件112之區塊BLK_1~BLK_n的影像IMG_1~IMG_n。Step 302: The image capturing device 100 captures the images IMG_1 - IMG_n of the blocks BLK_1 - BLK_n of the file 112.

步驟304:測距單元102測量影像擷取裝置100擷取影像IMG_1~IMG_n時與區塊BLK_1~BLK_n的距離DT_1~DT_n。Step 304: The distance measuring unit 102 measures distances DT_1 DT DT_n from the blocks BLK_1 ~ BLK_n when the image capturing device 100 captures the images IMG_1 ~ IMG _n.

步驟306:處理模組104根據距離DT_1~DT_n,調整影像IMG_1~IMG_n之特性。Step 306: The processing module 104 adjusts the characteristics of the images IMG_1 - IMG_n according to the distances DT_1 - DT_n.

步驟308:處理模組104判斷每一調整後之影像IMG_1~IMG_n的特徵點及對應的特徵向量。Step 308: The processing module 104 determines the feature points of each of the adjusted images IMG_1 to IMG_n and the corresponding feature vectors.

步驟310:處理模組104根據調整後之影像IMG_1~IMG_n的特徵向量,結合調整後之影像IMG_1~IMG_n,以產生文件112的掃描結果SCN。Step 310: The processing module 104 combines the adjusted images IMG_1 - IMG_n according to the feature vectors of the adjusted images IMG_1 - IMG_n to generate the scan result SCN of the file 112.

步驟312:結束。Step 312: End.

文件掃描流程30係為文件掃描裝置10之運作方式的歸納,詳細運作及變化方式,如判斷特徵點及特徵向量的方法、除錯機制等,可參考前述。此外,在文件掃描裝置10中,所有運作邏輯係被編譯為文件掃描程式碼110,此應係本領域具通常知識者實現電腦軟體之固有知識。然而,需注意的是,文件掃描程式碼110不限於任何程式語言,凡能由處理器106執行而達成對應功能之程式語言皆可用於本發明。The file scanning process 30 is a summary of the operation mode of the file scanning device 10, detailed operations and changes, such as methods for determining feature points and feature vectors, debugging mechanisms, etc., refer to the foregoing. In addition, in the file scanning device 10, all operational logic is compiled into the file scanning program code 110, which should be the intrinsic knowledge of the computer software for those skilled in the art. However, it should be noted that the file scanning program code 110 is not limited to any programming language, and any programming language that can be executed by the processor 106 to achieve a corresponding function can be used in the present invention.

再者,文件掃描裝置10中各元件係為實現本發明之最基本元件。然而,根據不同系統需求,各元件可能為系統原有之配備,經轉用而實現本發明。例如,筆記型電腦、智慧型手機等可攜式電子裝置中原配備有中央處理器、記憶體、照相機等,其即可用來實現文件掃描裝置10之處理器106、儲存單元108、影像擷取裝置100;同理,測距單元102亦可由接近感應器(Proximity Sensor)所實現,而提示單元114則可由螢幕上的訊息窗、喇叭發出的聲響等實現。換句話說,當文件掃描裝置10應用於可攜式電子裝置時,可能僅需增加文件掃描程式碼110即可實現對應的功能,如此一來,可有效提升使用者採用的意願。Furthermore, each component of the document scanning device 10 is the most basic component of the present invention. However, depending on the requirements of different systems, each component may be the original equipment of the system, and the invention is implemented by being transferred. For example, a portable electronic device such as a notebook computer or a smart phone is originally equipped with a central processing unit, a memory, a camera, etc., which can be used to implement the processor 106, the storage unit 108, and the image capturing device of the document scanning device 10. 100. Similarly, the ranging unit 102 can also be implemented by a Proximity Sensor, and the prompting unit 114 can be implemented by a message window on the screen, a sound emitted by a speaker, or the like. In other words, when the file scanning device 10 is applied to the portable electronic device, the file scanning program code 110 may only be added to implement the corresponding function, so that the user's willingness to use is effectively improved.

在習知技術中,當臨時遇到需要將紙本文件掃描成電子檔案的情況時,由於一般使用筆記型電腦的情況下不會隨身攜帶掃描器,因此,使用者往往需到附近的便利商店或回公司後再掃描,不但耗費金錢與時間,甚至錯失此份資料的即時性。在此情形下,本發明可於可攜式電子裝置中提供文件掃描功能,因而可有效提升便利性。In the prior art, when a temporary need to scan a paper document into an electronic file, since the scanner is not carried with the notebook computer in general, the user often needs to go to a nearby convenience store. Or after scanning back to the company, it will not only cost money and time, but also miss the immediacy of this information. In this case, the present invention can provide a file scanning function in the portable electronic device, thereby effectively improving convenience.

綜上所述,本發明係根據影像特徵點,合併不同影像,因而可於一可攜式電子裝置中提供文件掃描功能,以提升可攜式電子裝置的功能性及便利性。In summary, the present invention combines different images according to image feature points, thereby providing a file scanning function in a portable electronic device to improve the functionality and convenience of the portable electronic device.

以上所述僅為本發明之較佳實施例,凡依本發明申請專利範圍所做之均等變化與修飾,皆應屬本發明之涵蓋範圍。The above are only the preferred embodiments of the present invention, and all changes and modifications made to the scope of the present invention should be within the scope of the present invention.

10...文件掃描裝置10. . . Document scanning device

100...影像擷取裝置100. . . Image capture device

102...測距單元102. . . Ranging unit

104...處理模組104. . . Processing module

106...處理器106. . . processor

108...儲存單元108. . . Storage unit

110...文件掃描程式碼110. . . File scanning code

112...文件112. . . file

114...提示單元114. . . Prompt unit

BLK_1~BLK_n...區塊BLK_1~BLK_n. . . Block

IMG_1~IMG_n...影像IMG_1~IMG_n. . . image

DT_1~DT_n...距離DT_1~DT_n. . . distance

SCN...掃描結果SCN. . . Scan result

X...特徵點X. . . Feature points

30...文件掃描流程30. . . File scanning process

300、302、304、306、308、310、312...步驟300, 302, 304, 306, 308, 310, 312. . . step

第1圖為本發明實施例一文件掃描裝置之示意圖。FIG. 1 is a schematic diagram of a document scanning device according to an embodiment of the present invention.

第2圖為本發明實施例海斯行列式矩陣之示意圖。2 is a schematic diagram of a Hays determinant matrix according to an embodiment of the present invention.

第3圖為本發明實施例一文件掃描流程之示意圖。FIG. 3 is a schematic diagram of a file scanning process according to an embodiment of the present invention.

10...文件掃描裝置10. . . Document scanning device

100...影像擷取裝置100. . . Image capture device

102...測距單元102. . . Ranging unit

104...處理模組104. . . Processing module

106...處理器106. . . processor

108...儲存單元108. . . Storage unit

110...文件掃描程式碼110. . . File scanning code

112...文件112. . . file

114...提示單元114. . . Prompt unit

BLK_1~BLK_n...區塊BLK_1~BLK_n. . . Block

IMG_1~IMG_n...影像IMG_1~IMG_n. . . image

DT_1~DT_n...距離DT_1~DT_n. . . distance

SCN...掃描結果SCN. . . Scan result

Claims (17)

一種文件掃描方法,包含有:由一影像擷取裝置擷取一文件之複數個區塊的複數個影像;根據該影像擷取裝置擷取該複數個影像時與該複數個區塊的距離,調整該複數個影像之特性,以產生複數個調整後影像;判斷該複數個調整後影像中每一調整後影像的複數個特徵點及該複數個特徵點的複數個特徵向量;以及根據該複數個調整後影像之每一調整後影像的該複數個特徵向量,結合該複數個調整後影像,以產生該文件的一掃描結果。A file scanning method includes: capturing, by an image capturing device, a plurality of images of a plurality of blocks of a file; and extracting, by the image capturing device, a distance from the plurality of blocks when capturing the plurality of images, Adjusting characteristics of the plurality of images to generate a plurality of adjusted images; determining a plurality of feature points of each adjusted image in the plurality of adjusted images and a plurality of feature vectors of the plurality of feature points; and according to the complex number The plurality of feature vectors of each adjusted image of the adjusted image are combined with the plurality of adjusted images to generate a scan result of the file. 如請求項1所述之文件掃描方法,其中該複數個影像之特性係選自放大倍率、銳利化、對比度及彩度。The document scanning method of claim 1, wherein the characteristics of the plurality of images are selected from the group consisting of magnification, sharpness, contrast, and chroma. 如請求項1所述之文件掃描方法,其中判斷該複數個調整後影像中每一調整後影像的複數個特徵點及該複數個特徵點的複數個特徵向量之步驟,包含有:計算每一調整後影像之每一像素對應於複數個變異數的複數個海斯矩陣(Hessian matrix);根據每一調整後影像之複數個像素的海斯矩陣,由該複數個像素中選取該複數個特徵點;計算該複數個特徵點之每一特徵點之複數個鄰近像素之哈爾(Haar)運算結果;以及根據該複數個特徵點之每一特徵點之該複數個鄰近像素之哈爾算結果,判斷該複數個特徵向量。The file scanning method of claim 1, wherein the step of determining a plurality of feature points of each of the plurality of adjusted images and a plurality of feature vectors of the plurality of feature points includes: calculating each Each pixel of the adjusted image corresponds to a plurality of Hessian matrices of a plurality of variograms; and the plurality of features are selected from the plurality of pixels according to a Hayes matrix of a plurality of pixels of each adjusted image a Haar operation result of calculating a plurality of neighboring pixels of each of the plurality of feature points; and a result of calculating a plurality of neighboring pixels according to each feature point of the plurality of feature points And determining the plurality of feature vectors. 如請求項1所述之文件掃描方法,其中根據該複數個調整後影像之每一調整後影像的該複數個特徵向量,結合該複數個調整後影像,以產生該文件的該掃描結果之步驟,包含有:比較該複數個調整後影像之每一調整後影像的該複數個特徵向量,以判斷該複數個調整後影像中相似之特徵點;以及以該複數個調整後影像中相似之特徵點為基準,依序結合該複數個調整後影像,以產生該文件的該掃描結果。The file scanning method of claim 1, wherein the step of generating the scan result of the file is performed according to the plurality of feature vectors of each adjusted image of the plurality of adjusted images combined with the plurality of adjusted images The method includes: comparing the plurality of feature vectors of each adjusted image of the plurality of adjusted images to determine similar feature points in the plurality of adjusted images; and similar features in the plurality of adjusted images The point is a reference, and the plurality of adjusted images are sequentially combined to generate the scan result of the file. 如請求項4所述之文件掃描方法,其另包含於該複數個調整後影像中一調整後影像與其他調整後影像間無相似之特徵點時,重新擷取該調整後影像所對應之一區塊的影像。The method for scanning a document according to claim 4, further comprising: retrieving one of the adjusted images after the adjusted image has no similar feature points between the adjusted image and the other adjusted images; The image of the block. 一種文件掃描裝置,包含有:一影像擷取裝置;一測距單元;一處理器;以及一儲存單元,該儲存單元用來儲存一程式碼,該程式碼指示該處理器執行以下步驟:控制該影像擷取裝置擷取一文件之複數個區塊的複數個影像;控制該測距單元測量該影像擷取裝置擷取該複數個影像時與該複數個區塊的距離;根據該影像擷取裝置擷取該複數個影像時與該複數個區塊的距離,調整該複數個影像之特性,以產生複數個調整後影像;判斷該複數個調整後影像中每一調整後影像的複數個特徵點及該複數個特徵點的複數個特徵向量;以及根據該複數個調整後影像之每一調整後影像的該複數個特徵向量,結合該複數個調整後影像,以產生該文件的一掃描結果。A file scanning device includes: an image capturing device; a ranging unit; a processor; and a storage unit for storing a code, the code indicating that the processor performs the following steps: controlling The image capturing device captures a plurality of images of a plurality of blocks of a file; and the distance measuring unit controls the distance of the plurality of blocks when the image capturing device captures the plurality of images; according to the image Taking the distance between the plurality of blocks and the plurality of blocks, adjusting the characteristics of the plurality of images to generate a plurality of adjusted images; determining a plurality of the adjusted images in the plurality of adjusted images a feature point and a plurality of feature vectors of the plurality of feature points; and combining the plurality of adjusted images according to the plurality of feature vectors of each adjusted image of the plurality of adjusted images to generate a scan of the file result. 如請求項6所述之文件掃描裝置,其中該複數個影像之特性係選自放大倍率、銳利化、對比度及彩度。The document scanning device of claim 6, wherein the plurality of images are selected from the group consisting of magnification, sharpness, contrast, and chroma. 如請求項6所述之文件掃描裝置,其中判斷該複數個調整後影像中每一調整後影像的複數個特徵點及該複數個特徵點的複數個特徵向量之步驟,包含有:計算每一調整後影像之每一像素對應於複數個變異數的複數個海斯矩陣(Hessian matrix);根據每一調整後影像之複數個像素的海斯矩陣,由該複數個像素中選取該複數個特徵點;計算該複數個特徵點之每一特徵點之複數個鄰近像素之哈爾(Haar)運算結果;以及根據該複數個特徵點之每一特徵點之該複數個鄰近像素之哈爾算結果,判斷該複數個特徵向量。The file scanning device of claim 6, wherein the step of determining a plurality of feature points of each of the plurality of adjusted images and a plurality of feature vectors of the plurality of feature points comprises: calculating each Each pixel of the adjusted image corresponds to a plurality of Hessian matrices of a plurality of variograms; and the plurality of features are selected from the plurality of pixels according to a Hayes matrix of a plurality of pixels of each adjusted image a Haar operation result of calculating a plurality of neighboring pixels of each of the plurality of feature points; and a result of calculating a plurality of neighboring pixels according to each feature point of the plurality of feature points And determining the plurality of feature vectors. 如請求項6所述之文件掃描裝置,其中根據該複數個調整後影像之每一調整後影像的該複數個特徵向量,結合該複數個調整後影像,以產生該文件的該掃描結果之步驟,包含有:比較該複數個調整後影像之每一調整後影像的該複數個特徵向量,以判斷該複數個調整後影像中相似之特徵點;以及以該複數個調整後影像中相似之特徵點為基準,依序結合該複數個調整後影像,以產生該文件的該掃描結果。The document scanning device of claim 6, wherein the step of generating the scan result of the file is performed according to the plurality of feature vectors of each adjusted image of the plurality of adjusted images combined with the plurality of adjusted images The method includes: comparing the plurality of feature vectors of each adjusted image of the plurality of adjusted images to determine similar feature points in the plurality of adjusted images; and similar features in the plurality of adjusted images The point is a reference, and the plurality of adjusted images are sequentially combined to generate the scan result of the file. 如請求項9所述之文件掃描裝置,其另包含於該複數個調整後影像中一調整後影像與其他調整後影像間無相似之特徵點時,重新擷取該調整後影像所對應之一區塊的影像。The document scanning device of claim 9, further comprising: in the plurality of adjusted images, if one of the adjusted images has no similar feature points between the other adjusted images, retrieving one of the adjusted images The image of the block. 如請求項6所述之文件掃描裝置,其另包含一提示單元,該程式碼另指示該處理器透過該提示單元指示一使用者平移該文件,使該影像擷取裝置依序擷取該複數個區塊的該複數個影像。The file scanning device of claim 6, further comprising a prompting unit, the code further instructing the processor to instruct the user to translate the file through the prompting unit, so that the image capturing device sequentially captures the plurality The plurality of images of the blocks. 一種可攜式電子裝置,包含有:一處理器;一儲存單元;一影像擷取裝置;以及一文件掃描裝置,包含有:一測距單元;一程式碼,儲存於該儲存單元中,該程式碼指示該處理器執行以下步驟:控制該影像擷取裝置擷取一文件之複數個區塊的複數個影像;控制該測距單元測量該影像擷取裝置擷取該複數個影像時與該複數個區塊的距離;根據該影像擷取裝置擷取該複數個影像時與該複數個區塊的距離,調整該複數個影像之特性,以產生複數個調整後影像;判斷該複數個調整後影像中每一調整後影像的複數個特徵點及該複數個特徵點的複數個特徵向量;以及根據該複數個調整後影像之每一調整後影像的該複數個特徵向量,結合該複數個調整後影像,以產生該文件的一掃描結果。A portable electronic device includes: a processor; a storage unit; an image capture device; and a file scanning device, including: a ranging unit; a code stored in the storage unit, The code indicates that the processor performs the following steps: controlling the image capturing device to capture a plurality of images of a plurality of blocks of a file; and controlling the ranging unit to measure the image capturing device to capture the plurality of images The distance between the plurality of blocks; the distance between the plurality of blocks when the image capturing device captures the plurality of blocks, and adjusting the characteristics of the plurality of images to generate a plurality of adjusted images; determining the plurality of adjustments a plurality of feature points of each adjusted image in the post-image and a plurality of feature vectors of the plurality of feature points; and the plurality of feature vectors according to each adjusted image of the plurality of adjusted images, combined with the plurality of feature vectors The adjusted image is used to produce a scan of the file. 如請求項12所述之可攜式電子裝置,其中該複數個影像之特性係選自放大倍率、銳利化、對比度及彩度。The portable electronic device of claim 12, wherein the plurality of images are selected from the group consisting of magnification, sharpness, contrast, and chroma. 如請求項12所述之可攜式電子裝置,其中判斷該複數個調整後影像中每一調整後影像的複數個特徵點及該複數個特徵點的複數個特徵向量之步驟,包含有:計算每一調整後影像之每一像素對應於複數個變異數的複數個海斯矩陣(Hessian matrix);根據每一調整後影像之複數個像素的海斯矩陣,由該複數個像素中選取該複數個特徵點;計算該複數個特徵點之每一特徵點之複數個鄰近像素之哈爾(Haar)運算結果;以及根據該複數個特徵點之每一特徵點之該複數個鄰近像素之哈爾算結果,判斷該複數個特徵向量。The portable electronic device of claim 12, wherein the step of determining a plurality of feature points of each of the plurality of adjusted images and a plurality of feature vectors of the plurality of feature points comprises: calculating Each pixel of each adjusted image corresponds to a plurality of Hessian matrices of a plurality of variograms; and the complex number is selected from the plurality of pixels according to a Hayes matrix of a plurality of pixels of each adjusted image a feature point; a Haar operation result of a plurality of neighboring pixels of each of the plurality of feature points; and a plurality of neighboring pixels of each feature point of the plurality of feature points Calculate the result and determine the plurality of feature vectors. 如請求項12所述之可攜式電子裝置,其中根據該複數個調整後影像之每一調整後影像的該複數個特徵向量,結合該複數個調整後影像,以產生該文件的該掃描結果之步驟,包含有:比較該複數個調整後影像之每一調整後影像的該複數個特徵向量,以判斷該複數個調整後影像中相似之特徵點;以及以該複數個調整後影像中相似之特徵點為基準,依序結合該複數個調整後影像,以產生該文件的該掃描結果。The portable electronic device of claim 12, wherein the plurality of eigenvectors of each adjusted image of the plurality of adjusted images are combined with the plurality of adjusted images to generate the scan result of the file. The step of: comparing the plurality of feature vectors of each adjusted image of the plurality of adjusted images to determine similar feature points in the plurality of adjusted images; and comparing the plurality of adjusted images The feature point is a reference, and the plurality of adjusted images are sequentially combined to generate the scan result of the file. 如請求項15所述之可攜式電子裝置,其另包含於該複數個調整後影像中一調整後影像與其他調整後影像間無相似之特徵點時,重新擷取該調整後影像所對應之一區塊的影像。The portable electronic device of claim 15 further comprising: when the adjusted image has no similar feature points between the adjusted image and the other adjusted images, retrieving the adjusted image An image of one of the blocks. 如請求項12所述之可攜式電子裝置,其另包含一提示單元,該程式碼另指示該處理器透過該提示單元指示一使用者平移該文件,使該影像擷取裝置依序擷取該複數個區塊的該複數個影像。The portable electronic device of claim 12, further comprising a prompting unit, the code further instructing the processor to instruct the user to translate the file through the prompting unit, so that the image capturing device sequentially captures The plurality of images of the plurality of blocks.
TW100122242A 2011-06-24 2011-06-24 Method and device of document scanning and portable electronic device TW201301874A (en)

Priority Applications (3)

Application Number Priority Date Filing Date Title
TW100122242A TW201301874A (en) 2011-06-24 2011-06-24 Method and device of document scanning and portable electronic device
CN201110192655XA CN102843479A (en) 2011-06-24 2011-07-11 File scanning method, file scanning device and portable electronic device
US13/311,569 US20120327486A1 (en) 2011-06-24 2011-12-06 Method and Device of Document Scanning and Portable Electronic Device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
TW100122242A TW201301874A (en) 2011-06-24 2011-06-24 Method and device of document scanning and portable electronic device

Publications (1)

Publication Number Publication Date
TW201301874A true TW201301874A (en) 2013-01-01

Family

ID=47361594

Family Applications (1)

Application Number Title Priority Date Filing Date
TW100122242A TW201301874A (en) 2011-06-24 2011-06-24 Method and device of document scanning and portable electronic device

Country Status (3)

Country Link
US (1) US20120327486A1 (en)
CN (1) CN102843479A (en)
TW (1) TW201301874A (en)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9270838B2 (en) * 2010-07-20 2016-02-23 Kodak Alaris Inc. Verifying accuracy of a scanned document
GB2528429B (en) * 2014-05-13 2016-12-14 String Labs Ltd Border detection
GB2544786A (en) * 2015-11-27 2017-05-31 Univ Of East Anglia Method and system for generating an output image from a plurality of corresponding input image channels
US10924620B2 (en) * 2017-05-26 2021-02-16 Fuji Xerox Co., Ltd. Document reading guidance for operator using feature amount acquired from image of partial area of document

Family Cites Families (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6389179B1 (en) * 1996-05-28 2002-05-14 Canon Kabushiki Kaisha Image combining apparatus using a combining algorithm selected based on an image sensing condition corresponding to each stored image
US6097854A (en) * 1997-08-01 2000-08-01 Microsoft Corporation Image mosaic construction system and apparatus with patch-based alignment, global block adjustment and pair-wise motion-based local warping
US6571022B2 (en) * 1997-08-06 2003-05-27 Minolta Co., Ltd. Image processing apparatus for generating a single image of an object using overlapping partial images
JP2002208005A (en) * 2001-01-12 2002-07-26 Minolta Co Ltd Image processor, image processing method, image processing program, and computer readable recording medium with image processing program recorded
WO2009072264A1 (en) * 2007-12-03 2009-06-11 Panasonic Corporation Image processing device, photographing device, reproducing device, integrated circuit, and image processing method
US8194936B2 (en) * 2008-04-25 2012-06-05 University Of Iowa Research Foundation Optimal registration of multiple deformed images using a physical model of the imaging distortion
JP4991887B2 (en) * 2010-01-13 2012-08-01 シャープ株式会社 Captured image processing system, control method for captured image processing system, program, and recording medium
US20120182442A1 (en) * 2011-01-14 2012-07-19 Graham Kirsch Hardware generation of image descriptors

Also Published As

Publication number Publication date
US20120327486A1 (en) 2012-12-27
CN102843479A (en) 2012-12-26

Similar Documents

Publication Publication Date Title
US10798359B2 (en) Generating hi-res dewarped book images
US11210516B2 (en) AR scenario processing method and device, and computer storage medium
US10289924B2 (en) System and method for scanned document correction
US10915998B2 (en) Image processing method and device
JP6176598B2 (en) Dimension measurement program, dimension measurement apparatus, and dimension measurement method
US9014459B2 (en) Identification method for valuable file and identification device thereof
US9589333B2 (en) Image correction apparatus for correcting distortion of an image
CN103019537B (en) A kind of image preview method and device
RU2631765C1 (en) Method and system of correcting perspective distortions in images occupying double-page spread
US10482571B2 (en) Dual fisheye, hemispherical image projection and stitching method, device and computer-readable medium
JP6739937B2 (en) Information processing apparatus, control method of information processing apparatus, and program
JP2016212784A (en) Image processing apparatus and image processing method
TW201301874A (en) Method and device of document scanning and portable electronic device
CN109361850A (en) Image processing method, device, terminal device and storage medium
JP6669390B2 (en) Information processing apparatus, information processing method, and program
US9098746B2 (en) Building texture extracting apparatus and method thereof
CN108234868B (en) Intelligent shooting system and method based on case reasoning
US20220198814A1 (en) Image dewarping with curved document boundaries
JP6161874B2 (en) Imaging apparatus, length measurement method, and program
CN112615993A (en) Depth information acquisition method, binocular camera module, storage medium and electronic equipment
JP2005182098A (en) Device and method for generating and displaying composite image
CN114615480B (en) Projection screen adjustment method, apparatus, device, storage medium, and program product
CN114640833B (en) Projection picture adjusting method, device, electronic equipment and storage medium
JP5353299B2 (en) Image search system, image search device, and image search method
JP6203070B2 (en) Scanning system, control method therefor, and program