TW202001598A

TW202001598A - Image search method

Info

Publication number: TW202001598A
Application number: TW107120502A
Authority: TW
Inventors: 陳瑞麟
Original assignee: 宏碁股份有限公司
Priority date: 2018-06-14
Filing date: 2018-06-14
Publication date: 2020-01-01

Abstract

An image search method is provided. A selection is performed on a display frame, and a region of interest (ROI) is obtained from the display frame based on the selection. A face detection is performed on the ROI. When the ROI contains a human face, an image corresponding to the human face is obtained from the display frame and the image is input to search engines.

Description

Image search method

本發明是有關於一種搜尋方法，且特別是有關於一種可即時圈選感興趣區域（Region of Interest，ROI）進行搜尋的圖像搜尋方法。The present invention relates to a search method, and in particular, to an image search method that can search regions of interest (ROI) in real time.

一般搜尋引擎提供了一個介面，使用戶能夠指定關於感興趣的專案的標準，並讓搜尋引擎找到符合的結果。除了輸入文字作為關鍵字來進行搜尋之外，還可進一步輸入圖片至搜尋引擎來進行搜尋。而目前搜尋引擎在進行圖片搜尋時，只能預先準備好圖片，並將整張圖片輸入至搜尋引擎。General search engines provide an interface that allows users to specify criteria for projects of interest and let search engines find matching results. In addition to entering text as a keyword to search, you can also enter pictures into the search engine to search. At present, when searching for pictures, search engines can only prepare pictures in advance and input the entire picture into the search engine.

目前圖片搜尋的方式並無法針對圖片中的特定區域進行搜尋。倘若欲針對圖片中的特定區域進行搜尋，使用者必需先手動對圖片進行裁切，以獲得只包括特定區域的圖像，再以所述圖像進行搜尋。此外，目前圖片搜尋的方式也無法直接針對視頻進行圖片搜尋。使用者必需在視頻播放中，事先擷取下感興趣的圖像來進行搜尋。據此，在使用上並不便利。The current image search method cannot search for specific areas in the image. If you want to search for a specific area in the picture, the user must manually crop the picture to obtain an image that includes only the specific area, and then search with the image. In addition, the current image search method cannot directly perform image search for videos. The user must capture the images of interest to search during the video playback. Accordingly, it is not convenient in use.

本發明提供一種圖像搜尋方法，可即時於顯示畫面中圈選感興趣區域並自動進行搜尋。The invention provides an image search method, which can circle the region of interest in the display screen and search automatically.

本發明的圖像搜尋方法，包括：在顯示畫面中執行圈選動作；基於圈選動作自顯示畫面中獲得感興趣區域；對感興趣區域執行人臉偵測；在判定感興趣區域具有人臉時，自顯示畫面中獲得對應於人臉的圖像；以及輸入對應於人臉的圖像至搜尋引擎進行搜尋。The image search method of the present invention includes: performing a circled action in the display screen; obtaining a region of interest from the display screen based on the circled action; performing face detection on the region of interest; determining that the region of interest has a human face , The image corresponding to the human face is obtained from the display screen; and the image corresponding to the human face is input to the search engine for searching.

在本發明的一實施例中，在獲得感興趣區域之後，更包括：執行影像變形校正。In an embodiment of the invention, after obtaining the region of interest, it further includes: performing image distortion correction.

在本發明的一實施例中，在獲得感興趣區域之後，更包括：對感興趣區域對應的圖像執行解析度補償。In an embodiment of the present invention, after obtaining the region of interest, it further includes: performing resolution compensation on the image corresponding to the region of interest.

在本發明的一實施例中，上述圖像搜尋方法更包括：在判定感興趣區域不具有人臉時，輸入感興趣區域對應的圖像至搜尋引擎進行搜尋。In an embodiment of the invention, the above image search method further includes: when it is determined that the region of interest does not have a human face, input an image corresponding to the region of interest to the search engine for searching.

在本發明的一實施例中，上述對感興趣區域執行人臉偵測的步驟包括：執行膚色分割，以獲得膚色區域；判斷膚色區域是否存在人臉特徵；以及當膚色區域中存在人臉特徵，則判定感興趣區域包括人臉。In an embodiment of the present invention, the step of performing face detection on the region of interest includes: performing skin color segmentation to obtain a skin color region; determining whether a facial feature exists in the skin color region; and when a facial feature exists in the skin color region , It is determined that the region of interest includes a human face.

在本發明的一實施例中，上述圖像搜尋方法更包括：在輸入對應於人臉的圖像至搜尋引擎進行搜尋之後，判斷搜尋到的各影像是否符合感興趣區域中的人臉。In an embodiment of the present invention, the above image search method further includes: after inputting an image corresponding to a human face to the search engine for searching, determining whether each of the searched images matches the human face in the region of interest.

在本發明的一實施例中，上述圖像搜尋方法更包括：當在感興趣區域偵測到N個人臉時，自顯示畫面中獲得基於N個人臉的N個圖像，其中N為大於1的整數；以及分別將N個圖像輸入至搜尋引擎進行搜尋。In an embodiment of the present invention, the above image search method further includes: when N faces are detected in the region of interest, N images based on N faces are obtained from the display screen, where N is greater than 1 Integer; and input N images to the search engine to search.

在本發明的一實施例中，上述顯示畫面包括視頻畫面、圖片以及瀏覽器畫面其中一個。In an embodiment of the invention, the display screen includes one of a video screen, a picture, and a browser screen.

基於上述，本發明提供一種供使用者即時於顯示畫面中圈選感興趣區域，在自動將圖像進行前處理以提升或還原圖像的解析度之後，將圖像輸入至搜尋引擎來進行搜尋，以提高使用的便利性。Based on the above, the present invention provides a method for the user to circle the region of interest in the display screen in real time, after automatically pre-processing the image to improve or restore the resolution of the image, input the image to the search engine for searching To improve the convenience of use.

為讓本發明的上述特徵和優點能更明顯易懂，下文特舉實施例，並配合所附圖式作詳細說明如下。In order to make the above-mentioned features and advantages of the present invention more obvious and understandable, the embodiments are specifically described below in conjunction with the accompanying drawings for detailed description as follows.

圖1是依照本發明一實施例的電子裝置的方塊圖。電子裝置100包括處理器110、儲存單元120以及顯示器130。處理器110耦接至儲存單元120以及顯示器130。FIG. 1 is a block diagram of an electronic device according to an embodiment of the invention. The electronic device 100 includes a processor 110, a storage unit 120, and a display 130. The processor 110 is coupled to the storage unit 120 and the display 130.

處理器110可以是中央處理單元（Central Processing Unit，CPU）、圖像處理單元（Graphic Processing Unit，GPU）、物理處理單元（Physics Processing Unit，PPU）、可程式化之微處理器（Microprocessor）、嵌入式控制晶片、數位訊號處理器（Digital Signal Processor，DSP）、特殊應用積體電路（Application Specific Integrated Circuits，ASIC）或其他類似裝置。The processor 110 may be a central processing unit (Central Processing Unit, CPU), an image processing unit (Graphic Processing Unit, GPU), a physical processing unit (Physics Processing Unit, PPU), a programmable microprocessor (Microprocessor), Embedded control chip, digital signal processor (DSP), application specific integrated circuits (ASIC) or other similar devices.

儲存單元120為任意型式的固定式或可移動式隨機存取記憶體（Random Access Memory，RAM）、唯讀記憶體（Read-Only Memory，ROM）、快閃記憶體（Flash memory）、安全數位卡（Secure Digital Memory Card，SD）、硬碟或其他類似裝置或這些裝置的組合。儲存單元120儲存有至少一程式碼片段，而處理器110透過讀取儲存單元120內的程式碼片段來執行對應的指令來實現下述圖像搜尋方法。The storage unit 120 is any type of fixed or removable random access memory (RAM), read-only memory (ROM), flash memory (Flash memory), secure digital Card (Secure Digital Memory Card, SD), hard disk or other similar devices or a combination of these devices. The storage unit 120 stores at least one code segment, and the processor 110 reads the code segment in the storage unit 120 to execute corresponding instructions to implement the following image search method.

顯示器130可採用液晶顯示器（Liquid Crystal Display，LCD）、電漿顯示器（Plasma Display）等來實現，或者亦可以使用具有觸控模組的觸控螢幕來作為顯示器130。The display 130 may be implemented by a liquid crystal display (LCD), a plasma display (Plasma Display), or the like, or a touch screen with a touch module may be used as the display 130.

圖2是依照本發明一實施例的圖像搜尋方法的流程圖。請同時參照圖1以及圖2，在步驟S205中，在顯示畫面中執行圈選動作。例如，處理器110可在顯示器130的顯示畫面為呈現為視頻畫面、圖片或瀏覽器畫面時，藉由偵測輸入裝置所執行的動作，而在顯示器130的顯示畫面中執行對應的圈選動作。接著，在步驟S210中，處理器110基於圈選動作自顯示畫面中獲得感興趣區域（Region of Interest，ROI）。2 is a flowchart of an image search method according to an embodiment of the invention. Please refer to FIG. 1 and FIG. 2 at the same time. In step S205, a circle action is performed on the display screen. For example, the processor 110 may perform the corresponding circled action on the display screen of the display 130 by detecting the action performed by the input device when the display screen of the display 130 is presented as a video screen, picture, or browser screen. . Next, in step S210, the processor 110 obtains a region of interest (ROI) from the display screen based on the circled action.

圖3是依照本發明一實施例的圈選感興趣區域的示意圖。本實施例中以筆記型電腦為例來進行說明。請參照圖3，假設筆記型電腦正在播放視頻，使用者可以利用滑鼠、觸控筆、手寫板等輸入裝置在顯示畫面300（即，視頻畫面）上圈選感興趣區域310。3 is a schematic diagram of circled a region of interest according to an embodiment of the invention. In this embodiment, a notebook computer is used as an example for description. Referring to FIG. 3, assuming that the notebook computer is playing a video, the user may use an input device such as a mouse, a stylus pen, and a tablet to circle the region of interest 310 on the display screen 300 (ie, the video screen).

返回圖2，在獲得感興趣區域之後，處理器110進一步執行影像變形校正。例如，倘若感興趣區域為扭曲或翻轉過的畫面，則處理器110執行影像變形校正來修正感興趣區域對應的圖像。Returning to FIG. 2, after obtaining the region of interest, the processor 110 further performs image distortion correction. For example, if the region of interest is a distorted or flipped picture, the processor 110 performs image distortion correction to correct the image corresponding to the region of interest.

之後，在步驟S215中，處理器110對感興趣區域執行人臉偵測。這是為了避免當感興趣區域中包括一位以上的人物時，影響圖像的搜尋結果。人臉偵測是指在一張影像中去搜尋一個或多個可能為人臉的區域。在人臉偵測過程中，大致上可以利用臉部膚色以及人臉特徵等資訊來判斷。例如，處理器110執行膚色分割，以獲得膚色區域，之後判斷膚色區域是否存在人臉特徵。例如，判斷膚色區域中是否存在眼睛特徵、嘴巴特徵等。當膚色區域中存在人臉特徵，則判定感興趣區域包括人臉。另外，倘若所圈選的感興趣區域為半邊臉部的畫面，則處理器110會進一步將感興趣區域的範圍擴展至包括整個臉部區域甚至是全身的區域。Thereafter, in step S215, the processor 110 performs face detection on the region of interest. This is to avoid affecting the search result of the image when more than one person is included in the region of interest. Face detection refers to searching for one or more areas that may be faces in an image. In the process of face detection, it can be roughly judged by using information such as facial skin color and facial features. For example, the processor 110 performs skin color segmentation to obtain a skin color area, and then determines whether a human face feature exists in the skin color area. For example, it is determined whether there are eye features, mouth features, etc. in the skin color area. When there is a face feature in the skin color area, it is determined that the area of interest includes a face. In addition, if the circled area of interest is a half-face image, the processor 110 will further extend the range of the area of interest to include the entire face area or even the whole body area.

在步驟S220中，在判定感興趣區域具有人臉時，處理器110自顯示畫面中獲得對應於人臉的圖像。之後，在步驟S225中，處理器110輸入對應於人臉的圖像至搜尋引擎進行搜尋。另一方面，在判定感興趣區域不具有人臉時，處理器110直接輸入感興趣區域對應的圖像至搜尋引擎進行搜尋。In step S220, when it is determined that the region of interest has a human face, the processor 110 obtains an image corresponding to the human face from the display screen. After that, in step S225, the processor 110 inputs the image corresponding to the face to the search engine for searching. On the other hand, when it is determined that the region of interest does not have a human face, the processor 110 directly inputs the image corresponding to the region of interest to the search engine for searching.

而當在感興趣區域偵測到N個人臉時，處理器110自顯示畫面中獲得基於N個人臉的N個圖像，並且分別將N個圖像輸入至搜尋引擎進行搜尋。其中，N為大於1的整數。例如，若感興趣區域存在2個人臉，則獲得對應的2張圖像。When N faces are detected in the region of interest, the processor 110 obtains N images based on the N faces from the display screen, and inputs the N images to the search engine for searching. Where N is an integer greater than 1. For example, if there are two faces in the region of interest, then two corresponding images are obtained.

舉例來說，圖4A與圖4B是依照本發明另一實施例的圈選感興趣區域的示意圖。在圖4A中，圈選了感興趣區域410。在經過人臉偵測之後，於感興趣區域410找到2個人臉，而自顯示畫面中獲得基於各人臉的圖像411與圖像413。之後，分別將圖像411與圖像413輸入至搜尋引擎進行搜尋。在此，還可進一步設定為：當感興趣區域包括的人臉數量超過一門檻值時，仍然以整個感興趣區域對應的圖像來進行搜尋。例如，在人臉數量超過10個的情況下則不會分別以各個人臉進行搜尋，而是直接以整張圖像進行搜尋。For example, FIGS. 4A and 4B are schematic diagrams of circled regions of interest according to another embodiment of the present invention. In FIG. 4A, the region of interest 410 is circled. After face detection, two faces are found in the region of interest 410, and images 411 and 413 based on each face are obtained from the display screen. After that, input the image 411 and the image 413 to the search engine for searching. Here, it can be further set that when the number of faces included in the region of interest exceeds a threshold, the search is still performed with the image corresponding to the entire region of interest. For example, when the number of faces exceeds 10, each face is not searched separately, but the entire image is searched directly.

而在獲得感興趣區域之後，處理器110還可進一步對感興趣區域對應的圖像執行解析度補償。首先，將圖像轉換為灰階影像Gary₁ 。接著，對灰階影像Gary₁ 執行邊緣偵測演算法以求出邊緣數。在此，使用Sobel演算法來作為邊緣偵測演算法。After obtaining the region of interest, the processor 110 may further perform resolution compensation on the image corresponding to the region of interest. First, convert the image to grayscale image Gary ₁ . Next, an edge detection algorithm is executed on the grayscale image Gary _{1 to} find the number of edges. Here, the Sobel algorithm is used as the edge detection algorithm.

底下公式1、公式2的Gx ₁ 、 Gy ₁ 分別代表經橫向及縱向邊緣檢測的圖像。公式1

公式2

基於每一個像素的橫向及縱向梯度近似值，可使用以下公式3來計算梯度的大小（Magnitude）G_M 。公式3

Gx ₁ and Gy _{1 in} Equation 1 and Equation 2 below represent images detected by horizontal and vertical edges, respectively. Formula 1

Formula 2

Based on the approximate horizontal and vertical gradients of each pixel, the following formula 3 can be used to calculate the magnitude of the gradient (Magnitude) G _M. Formula 3

接著，利用與灰階影像Gary₁ 大小相同的空白影像Edge 來記錄邊緣（參照公式4）。其中，門檻值T預設為100。公式4

Next, use the blank image Edge that is the same size as the grayscale image Gary ₁ to record the edge (refer to Equation 4). The threshold value T is preset to 100. Formula 4

然後，參照下述公式5、公式6來計算銳度比（sharpness ratio）S_R ，並且根據銳度比S_R 來獲得增強比（Enhancement Ratio）E_R 。公式5

公式6

Then, the sharpness ratio S _R is calculated with reference to the following formulas 5 and 6, and the enhancement ratio (Erancement Ratio) E _R is obtained according to the sharpness ratio S _R. Formula 5

Formula 6

之後，參照公式7，依據所獲得的增強比E_R 來求出新的門檻值T。之後以新的門檻值T來重新獲得另一張邊緣影像Edge，並與原始的圖像疊加以進行優化。公式7

Thereafter, the enhancement ratio E _R obtains the new reference threshold Equation 7, based on the obtained T. Then, a new threshold value T is used to obtain another edge image Edge, which is superimposed with the original image for optimization. Formula 7

另外，在輸入對應於人臉的圖像至搜尋引擎進行搜尋之後，處理器110可進一步判斷搜尋到的各影像是否符合感興趣區域中的人臉。舉例來說，可利用眼距以及眼距中心至嘴巴的距離來進行比對。圖5是依照本發明一實施例的分析人臉的示意圖。參照圖5，利用人臉偵測在圖像中找出人臉區域510，之後進一步在人臉區域中找出眼部區域511、512以及嘴巴區域513。計算眼部區域511、512之間的距離L，並且計算自眼距中心至嘴巴區域513之間的距離R。In addition, after inputting the image corresponding to the human face to the search engine for searching, the processor 110 may further determine whether each of the searched images matches the human face in the region of interest. For example, the eye distance and the distance from the center to the mouth can be used for comparison. FIG. 5 is a schematic diagram of analyzing a human face according to an embodiment of the invention. Referring to FIG. 5, face detection is used to find the face area 510 in the image, and then the eye areas 511 and 512 and the mouth area 513 are further found in the face area. The distance L between the eye regions 511 and 512 is calculated, and the distance R from the center of the eye to the mouth region 513 is calculated.

自感興趣區域獲得距離L、R，並依據L:R來獲得一比例R_ROI 。並且，自搜尋引擎所搜尋到的影像獲得距離L、R，並依據L:R來獲得一比例R_Output 。當比對結果符合下述公式8時，判定搜尋到的影像與感興趣區域中為相同人物。公式8

Obtain distances L and R from the region of interest, and obtain a ratio R _ROI according to L:R. Moreover, the images searched by the search engine obtain distances L and R, and obtain a ratio R _Output according to L:R. When the comparison result meets the following formula 8, it is determined that the searched image and the region of interest are the same person. Formula 8

另外，可將感興趣區域對應的圖像輸入至多個搜尋引擎，並將自各搜尋引擎所獲得的結果進行統整。例如，針對場景、動植物、人物等來進行搜尋，不同的搜尋引擎的搜尋準確率皆不同。因此，整合各個搜尋引擎所收集到的資料，並且比對所獲得的資料，以將正確率最高（資料數最多）的資料匯整給使用者。In addition, images corresponding to the region of interest can be input to multiple search engines, and the results obtained from each search engine can be integrated. For example, when searching for scenes, animals, plants, characters, etc., the search accuracy of different search engines is different. Therefore, integrate the data collected by various search engines, and compare the obtained data to aggregate the data with the highest accuracy (the most data) to the user.

綜上所述，本發明提供一種供使用者即時於顯示畫面中圈選感興趣區域，在自動經過影像處理之後將圖像輸入至搜尋引擎來進行搜尋，以提高使用的便利性。並且，在進行搜尋之前提高圖像的解析度，可提高搜尋的準確度。此外，統整各搜尋引擎的結果，而回傳正確率最高的資料給使用者，在使用上可打破以往使用單一搜尋引擎的限制。In summary, the present invention provides a method for a user to circle a region of interest in a display screen in real time, and after automatically undergoing image processing, input an image to a search engine for searching, so as to improve the convenience of use. Moreover, before the search is performed, the resolution of the image is improved, and the accuracy of the search can be improved. In addition, the results of each search engine are consolidated, and the data with the highest accuracy is returned to the user, which can break the restrictions of using a single search engine in the past.

雖然本發明已以實施例揭露如上，然其並非用以限定本發明，任何所屬技術領域中具有通常知識者，在不脫離本發明的精神和範圍內，當可作些許的更動與潤飾，故本發明的保護範圍當視後附的申請專利範圍所界定者為準。Although the present invention has been disclosed as above with examples, it is not intended to limit the present invention. Any person with ordinary knowledge in the technical field can make some changes and modifications without departing from the spirit and scope of the present invention. The scope of protection of the present invention shall be subject to the scope defined in the appended patent application.

100‧‧‧電子裝置110‧‧‧處理器120‧‧‧儲存單元130‧‧‧顯示器300‧‧‧顯示畫面310、410‧‧‧感興趣區域411、413‧‧‧圖像510‧‧‧人臉區域511、512‧‧‧眼部區域513‧‧‧嘴巴區域S205～S225‧‧‧圖像搜尋方法的各步驟100‧‧‧Electronic device 110‧‧‧Processor 120‧‧‧Storage unit 130‧‧‧Display 300‧‧‧Display screen 310,410‧‧‧Interest area 411,413‧‧‧Image 510‧‧‧ Face area 511, 512‧‧‧Eye area 513‧‧‧ Mouth area S205～S225‧‧‧Steps of image search method

圖1是依照本發明一實施例的電子裝置的方塊圖。圖2是依照本發明一實施例的圖像搜尋方法的流程圖。圖3是依照本發明一實施例的圈選感興趣區域的示意圖。圖4A與圖4B是依照本發明另一實施例的圈選感興趣區域的示意圖。圖5是依照本發明一實施例的分析人臉的示意圖。FIG. 1 is a block diagram of an electronic device according to an embodiment of the invention. 2 is a flowchart of an image search method according to an embodiment of the invention. 3 is a schematic diagram of circled a region of interest according to an embodiment of the invention. 4A and 4B are schematic diagrams of circled regions of interest according to another embodiment of the invention. FIG. 5 is a schematic diagram of analyzing a human face according to an embodiment of the invention.

S205~S225‧‧‧圖像搜尋方法的各步驟 S205~S225‧‧‧‧Image search method steps

Claims

An image search method includes: performing a circle selection action in a display screen; obtaining an area of interest from the display screen based on the circle selection action; performing a face detection on the area of interest; When the interest area has a human face, an image corresponding to the human face is obtained from the display screen; and the image corresponding to the human face is input to a search engine for searching.

The image search method as described in item 1 of the patent application scope, wherein after obtaining the region of interest, further comprising: performing an image distortion correction.

The image search method as described in item 1 of the patent application scope, wherein after obtaining the region of interest, further comprising: performing a resolution compensation on the image corresponding to the region of interest.

The image search method described in item 1 of the scope of the patent application further includes: when it is determined that the region of interest does not have a human face, input the image corresponding to the region of interest to the search engine for searching.

The image search method as described in item 1 of the patent application scope, wherein the step of performing the face detection on the region of interest includes: performing a skin color segmentation to obtain a skin color region; determining whether a human face exists in the skin color region Feature; and when there is a face feature in the skin color area, it is determined that the area of interest includes a face.

The image search method described in item 1 of the scope of the patent application further includes: after inputting the image corresponding to the face to the search engine for searching, judging whether each of the searched images matches the area of interest The face of the person.

The image search method as described in item 1 of the scope of the patent application further includes: when N faces are detected in the region of interest, N images based on the faces of the N persons are obtained from the display screen, wherein N is an integer greater than 1; and the N images are respectively input to the search engine for searching.

The image search method as described in item 1 of the patent application scope, wherein the display screen includes one of a video screen, a picture and a browser screen.