TW202004516A - Optimization method for searching exclusive personalized pictures - Google Patents

Optimization method for searching exclusive personalized pictures Download PDF

Info

Publication number
TW202004516A
TW202004516A TW107117400A TW107117400A TW202004516A TW 202004516 A TW202004516 A TW 202004516A TW 107117400 A TW107117400 A TW 107117400A TW 107117400 A TW107117400 A TW 107117400A TW 202004516 A TW202004516 A TW 202004516A
Authority
TW
Taiwan
Prior art keywords
key combination
search key
content
semantic database
search
Prior art date
Application number
TW107117400A
Other languages
Chinese (zh)
Other versions
TWI693524B (en
Inventor
陳世興
Original Assignee
正修學校財團法人正修科技大學
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 正修學校財團法人正修科技大學 filed Critical 正修學校財團法人正修科技大學
Priority to TW107117400A priority Critical patent/TWI693524B/en
Publication of TW202004516A publication Critical patent/TW202004516A/en
Application granted granted Critical
Publication of TWI693524B publication Critical patent/TWI693524B/en

Links

Images

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

An optimization method for searching exclusive personalized pictures is revealed herein. This method includes steps of obtaining characteristics of at least one user by automatic establishment of users' data or recording users' behavior and machine learning, allowing the users' characteristics and keywords the users input in a search engine to become a search key combination, and comparing the search key combination and a label of a picture to determine whether to keep and manifest the picture or not. The picture may be kept and manifested if the label matches with the search key combination, so as to get search hits for exclusive personalized pictures.

Description

專屬個性化圖片搜尋優化方法Exclusive personalized image search optimization method

本發明係有關於一種專屬個性化圖片搜尋優化方法,尤其是指一種資料搜尋者在進行圖片搜尋時,能針對資料搜尋者的本身的喜好自動篩選出與其喜好契合性高的圖片,使搜尋結果更符合資料搜尋者的需求的方法。The present invention relates to an exclusive and personalized image search optimization method, in particular to a data searcher who can automatically filter out images with a high degree of preference for the data searcher's own preferences when performing image search, so that the search results A method that better meets the needs of data seekers.

在資訊科技的蓬勃發展下,資訊爆炸的現象與日俱增,同時也改變了人們查找資料、學習、互動…等關係。在海量的網路資料物件中,主要是透過在一搜尋引擎輸入關鍵字詞後,篩選出與該關鍵字詞相匹配的資料,以得到一搜尋結果。With the vigorous development of information technology, the phenomenon of information explosion is increasing day by day, and at the same time, it has changed the relationship between people looking for information, learning, interacting... and so on. In the massive network data objects, mainly by inputting a keyword in a search engine, the data matching the keyword is selected to obtain a search result.

然而,搜尋引擎回傳的搜尋結果中雖有符合用戶所輸入的關鍵字詞之網路資料物件,其中仍夾帶大量非用戶所需要的網路資料物件,因此,用戶在檢索之後仍必須再進一步於回傳的檢索結果中進行資料的篩選,即現有的搜尋方式皆未因應用戶個人的需求進行網路資料物件的選取,不僅造成搜尋上的困擾,而且也相當耗時。However, although the search results returned by the search engine include network data objects that match the keywords entered by the user, a large number of network data objects that are not required by the user are still included in the search results. Therefore, the user must still go further after searching Filtering the data in the returned search results, that is, none of the existing search methods select the network data objects according to the user's personal needs, not only causes trouble in search, but also is quite time-consuming.

請參TW I468956發明專利案,其係關於一種個人化搜尋排序方法包含:取得數個預設屬性;接收一關鍵字;根據關鍵字,搜尋數個候選資訊;自一客戶端接收一客戶端識別資訊;根據客戶端識別資訊,取得每一預設屬性之一喜愛屬性權重以及一不喜愛屬性權重;取得每一候選資訊與預設屬性間之一對應關係;根據對應關係、喜愛屬性權重以及不喜愛屬性權重,計算每一候選資訊之一候選資訊權重;根據候選資訊之候選資訊權重,排序候選資訊,並傳送排序後之候選資訊至客戶端。Please refer to the TW I468956 invention patent case, which relates to a personalized search ranking method including: obtaining several preset attributes; receiving a keyword; searching for several candidate information based on the keyword; receiving a client identification from a client Information; based on the client identification information, obtain one of the favorite attribute weights and a disliked attribute weight for each preset attribute; obtain a correspondence between each candidate information and the default attribute; according to the correspondence relationship, favorite attribute weights and no Love attribute weights, calculate one candidate information weight for each candidate information; sort candidate information according to the candidate information weight of candidate information, and send the sorted candidate information to the client.

請再參TW I421713發明專利案,其係有關於一種提供強化內容搜尋的系統及/或方法,主要是根據指示使用者對像是一網站之資訊點喜好的矩陣所進行。使用者拜訪一特定網站的資訊可以被累積,例如,其與位在該發佈者網站上的一信標或其他追蹤器有關。該強化內容可以由該發佈者網站所提供,或是以其他方式產生。Please refer again to the TW I421713 invention patent case, which relates to a system and/or method for providing enhanced content search, mainly based on a matrix indicating the user's preference for information points like a website. The information that a user visits a specific website can be accumulated, for example, it is related to a beacon or other tracker located on the publisher's website. The enhanced content may be provided by the publisher's website or generated in other ways.

由於網路資訊的發達,發佈者希望提供資訊給使用者,但並不希望造成對使用者負面反向反應,上述先前技術即是在避免打擾使用者的前提下提供資訊檢視,而本發明亦是朝此方向開發。Due to the development of network information, publishers hope to provide information to users, but they do not want to cause a negative reaction to users. The aforementioned prior art is to provide information review on the premise of avoiding disturbing users, and the present invention also It is developed in this direction.

本發明提出一種在進行圖片搜尋時,能針對資料搜尋者本身的偏好自動篩選出與其喜好契合性高的圖片,以進一步使搜尋結果更符合資料搜尋者的需求,此即為本發明之主要目的。The present invention proposes that when performing a picture search, it can automatically filter out the pictures with high preferences according to the preferences of the data searcher, so as to further make the search results more meet the needs of the data searcher, which is the main purpose of the present invention .

一種專屬個性化圖片搜尋優化方法,係包括以下步驟:An exclusive personalized image search optimization method, including the following steps:

藉由用戶自主的資料建立或透過用戶的行為記錄與機器學習,得到至少一用戶特徵;At least one user characteristic is obtained through the user's independent data creation or through the user's behavior records and machine learning;

所述用戶特徵與用戶在一搜尋引擎輸入之關鍵字組織成一搜尋關鍵組合;The user characteristics and keywords entered by the user in a search engine are organized into a search key combination;

解析目標圖片中的標籤內容,並將所述搜尋關鍵組合與所述標籤內容比對,當所述標籤內容與所述搜尋關鍵組合相符合時,將所述目標圖片保留並於一顯示裝置顯示。Parse the tag content in the target picture, and compare the search key combination with the tag content, and when the tag content matches the search key combination, retain the target picture and display it on a display device .

如上所述之專屬個性化圖片搜尋優化方法,其中,解析所述目標圖片的標籤內容的過程中,是透過連結Google Cloud Vision API存取所述目標圖片中的標籤資訊。The exclusive personalized image search optimization method as described above, in which the tag information in the target image is accessed by linking the Google Cloud Vision API during the process of parsing the tag content of the target image.

如上所述之專屬個性化圖片搜尋優化方法,其中,在將所述搜尋關鍵組合與所述標籤內容比對時,係直接以所述搜尋關鍵組合的文字與所述標籤內容的文字進行比對。The exclusive personalized image search optimization method as described above, wherein when comparing the search key combination with the label content, the text of the search key combination is directly compared with the text of the label content .

如上所述之專屬個性化圖片搜尋優化方法,其中,在將所述搜尋關鍵組合與所述標籤內容比對時,係先將所述搜尋關鍵組合的文字轉換為搜尋關鍵組合向量,且所述標籤內容的文字也同時轉換為標籤內容向量,再將所述搜尋關鍵組合向量與所述標籤內容向量進行比對。The exclusive personalized image search optimization method as described above, wherein when comparing the search key combination with the tag content, the text of the search key combination is first converted into a search key combination vector, and the The text of the label content is also converted into a label content vector, and then the search key combination vector is compared with the label content vector.

一種專屬個性化圖片搜尋優化方法,其步驟包括:An exclusive personalized image search optimization method, the steps include:

藉由用戶自主的資料建立或透過用戶的行為記錄與機器學習,得到至少一用戶特徵;At least one user characteristic is obtained through the user's independent data creation or through the user's behavior records and machine learning;

建立一語意庫;Build a semantic library;

選取所述語意庫中具有與所述關鍵字所包含的所有同義字的詞彙;Select words in the semantic database that have all synonyms that are included in the keyword;

所述用戶特徵與所述語意庫中具有與所述關鍵字所包含的所有同義字的詞彙組織成一搜尋關鍵組合;The user characteristics and the words in the semantic database that have all synonyms that are included in the keyword are organized into a search key combination;

解析目標圖片中的標籤內容,並將所述搜尋關鍵組合與所述標籤內容比對,當所述標籤內容與所述搜尋關鍵組合相符合時,將所述目標圖片保留並於一顯示裝置顯示。Parse the tag content in the target picture, and compare the search key combination with the tag content, and when the tag content matches the search key combination, retain the target picture and display it on a display device .

如上所述之專屬個性化圖片搜尋優化方法,其中,所述建立語意庫的步驟包括:匯入訓練圖庫的步驟、解析所述圖庫中各圖片可能出現的標籤內容的步驟、收集所述可能出現的標籤內容並儲存於語意庫的步驟。The exclusive personalized image search optimization method as described above, wherein the steps of creating a semantic database include: a step of importing a training gallery, a step of parsing the tag content that may appear in each picture in the gallery, and collecting the possible occurrences And store the content of the tag in the semantic database.

如上所述之專屬個性化圖片搜尋優化方法,其中,在所述解析目標圖片中的標籤內容的步驟中,係先計算所述圖庫中的多張圖片總共產生k個標籤的適合度Sm ,且m = 1,…,k,之後由Sj 的高低決定是否保留該標籤(Sj 表示圖j的適合度);假設每張圖片取得q個標籤內容與其對應分數(q出現在j的範圍中),對應分數以Yij 表示,其中i為圖片編號(i = 1,…,n以及j = 0,…,q),當j為0代表該圖片並未產出任何標籤,在計算時加總所有k個標籤的總分,如果出現次數多且品質分數高,所計算出的Sm 就會比較高,因此該標籤代表這是可保留至語意庫,其運算公式如下:The exclusive personalized image search optimization method as described above, wherein, in the step of parsing the label content in the target image, the suitability S m of the plurality of images in the gallery to generate a total of k labels is calculated first, And m = 1,...,k, then the level of S j decides whether to keep the label (S j represents the suitability of image j); suppose each image gets q label content and its corresponding score (q appears in the range of j Middle), the corresponding score is represented by Y ij , where i is the picture number (i = 1,...,n and j = 0,...,q), when j is 0 means that the picture does not produce any labels, when calculating Summing up the total score of all k tags, if there are many occurrences and the quality score is high, the calculated S m will be relatively high, so the tag represents that this can be retained to the semantic database, and its calculation formula is as follows:

Figure 02_image001
Figure 02_image001
.

如上所述之專屬個性化圖片搜尋優化方法,其中,所述建立語意庫的步驟係包括:輸入中文關鍵字的步驟、以及以一翻譯應用程式將所述輸入中文關鍵字轉譯為英文並儲存於語意庫的步驟。The exclusive personalized image search optimization method as described above, wherein the step of creating a semantic database includes the steps of inputting Chinese keywords, and translating the input Chinese keywords into English with a translation application and storing them in Semantic steps.

如上所述之專屬個性化圖片搜尋優化方法,其中,所述建立語意庫的步驟還包括一擴充語意庫的步驟,包括透過既有語意庫尋找同義字的步驟、將所述同義字加入所述語意庫中儲存。The exclusive personalized image search optimization method as described above, wherein the step of creating a semantic database further includes a step of expanding the semantic database, including a step of finding synonyms through an existing semantic database, and adding the synonyms to the Stored in the semantic database.

如上所述之專屬個性化圖片搜尋優化方法,其中,在解析所述目標圖片的標籤內容的過程中,是透過連結Google Cloud Vision API存取所述影像中的標籤資訊。The exclusive personalized image search optimization method as described above, wherein, in the process of analyzing the tag content of the target image, the tag information in the image is accessed by connecting to the Google Cloud Vision API.

為令本發明所運用之技術內容、發明目的及其達成之功效有更完整且清楚的揭露,茲於下詳細說明之,並請一併參閱所揭之圖式及圖號:In order to make the technical content, the purpose of the invention and the effect achieved by the invention more complete and clear disclosure, it is described in detail below, and please refer to the drawings and figures disclosed:

請參看第一圖,其係揭示本發明專屬個性化圖片搜尋優化方法的其一較佳實施例流程圖。Please refer to the first figure, which is a flow chart of a preferred embodiment of the method for searching and optimizing the personalized image of the present invention.

本發明之專屬個性化圖片搜尋優化方法,係包括以下步驟:The exclusive personalized picture search optimization method of the present invention includes the following steps:

步驟一(S11):建立用戶特徵,藉由用戶自主的資料建立或透過用戶的行為記錄與機器學習,得到至少一用戶特徵;即用戶透過一介面自主性建立至少一行為,或是由用戶使用機器的習慣透過機器學習知悉用戶行為;前述行為可以是某種偏好,例如:對顏色、運動消息、娛樂消息、公眾人士、或藝文團體... 等人、事、時、地、物各項資訊;Step one (S11): Create user characteristics, obtain at least one user characteristic through the user's independent data creation or through the user's behavior records and machine learning; that is, the user autonomously establishes at least one behavior through an interface, or is used by the user The habit of the machine knows the user's behavior through machine learning; the aforementioned behavior can be a preference, such as: color, sports news, entertainment news, public people, or art groups... etc. people, things, times, places, things Item of information

步驟二(S12):建立搜尋關鍵組合,將步驟一(S11)所得到的用戶特徵與用戶在一搜尋引擎輸入之關鍵字組織成一搜尋關鍵組合;Step 2 (S12): Create a search key combination, and organize the user characteristics obtained in step 1 (S11) and the keywords entered by the user in a search engine into a search key combination;

步驟三(S13):解析目標圖片之標籤內容並與搜尋關鍵組合比對,解析目標圖片中的標籤內容,並將經步驟二(S12)得到的搜尋關鍵組合與該目標圖片中的標籤內容比對,當該目標圖片中的標籤內容與該搜尋關鍵組合相符合時,將該目標圖片保留並於一顯示裝置顯示,若有未能解析之圖片,代表該圖片是過去沒有訓練過,或是較為複雜無法產生對應標籤,因此無法被解析,這方面的資料內容將無法呈現於搜尋列表中。Step three (S13): analyze the label content of the target picture and compare it with the search key combination, analyze the label content in the target picture, and compare the search key combination obtained in step two (S12) with the label content in the target picture Yes, when the tag content in the target picture matches the search key combination, the target picture is retained and displayed on a display device. If there is a picture that cannot be parsed, it means that the picture has not been trained in the past, or It is more complicated and cannot generate the corresponding label, so it cannot be parsed. The data content in this respect will not be displayed in the search list.

作為本發明之專屬個性化圖片搜尋優化方法的優選方案,可以進一步令被保留並顯示在顯示裝置的目標圖片透過一日期篩選器篩選或排序。As a preferred solution of the exclusive personalized picture search optimization method of the present invention, the target pictures retained and displayed on the display device can be further filtered or sorted by a date filter.

作為本發明之專屬個性化圖片搜尋優化方法的優選方案,其中,在解析目標圖片的標籤內容的過程中,是透過連結Google Cloud Vision API存取所述影像中的標籤資訊。As a preferred solution of the exclusive personalized image search optimization method of the present invention, in the process of parsing the label content of the target image, the label information in the image is accessed through the link to the Google Cloud Vision API.

作為本發明之專屬個性化圖片搜尋優化方法的優選方案,其中,在將搜尋關鍵組合與標籤內容比對時,係直接以搜尋關鍵組合的文字與標籤內容的文字進行比對;也可以先將搜尋關鍵組合的文字轉換為搜尋關鍵組合向量,且標籤內容的文字也同時轉換為標籤內容向量,再將轉換後的搜尋關鍵組合向量與轉換後的標籤內容向量進行比對。As a preferred solution of the exclusive personalized image search optimization method of the present invention, when comparing the search key combination with the label content, the text of the search key combination is directly compared with the text of the label content; The text of the search key combination is converted into the search key combination vector, and the text of the label content is also converted into the label content vector. The converted search key combination vector is then compared with the converted label content vector.

由於Google Cloud Vision API產生圖片的註解會有多種變化的可能性,而且每個人描述影像的文字也不盡相同,以致於造成語意間隙現象,為解決此一問題,本發明進一步提供一建立語意庫的步驟,以在執行專屬個性化圖片搜尋之前,先建立語意庫來解決此問題。Since the annotations of the images generated by the Google Cloud Vision API have many possibilities of change, and each person’s text describing the image is not the same, resulting in a semantic gap phenomenon. To solve this problem, the present invention further provides a semantic database Steps to solve this problem by creating a semantic database before performing an exclusive personalized image search.

該建立語意庫的步驟係在步驟一建立用戶特徵或步驟二建立搜尋關鍵組合的步驟之前執行;其中,第二圖所示係顯示在步驟二建立搜尋關鍵組合的步驟之前執行該建立語意庫的步驟;具體而言,本發明專屬個性化圖片搜尋優化方法其二較佳實施例,其步驟包括:The step of creating a semantic database is performed before the step of creating a user feature in step one or the step of creating a search key combination in step two; wherein, the second figure shows the step of creating the semantic database before the step of creating a search key combination in step two. Steps; specifically, the second preferred embodiment of the method for optimizing the search for a personalized image of the present invention, the steps include

步驟一(S21):建立用戶特徵,藉由用戶自主的資料建立或透過用戶的行為記錄與機器學習,得到至少一用戶特徵;即用戶透過一介面自主性建立至少一行為,或是由用戶使用機器的習慣透過機器學習知悉用戶行為;前述行為可以是某種偏好,例如:對顏色、運動消息、娛樂消息、公眾人士、或藝文團體... 等人、事、時、地、物各項資訊;Step one (S21): Create user characteristics, obtain at least one user characteristic through the user's independent data creation or through the user's behavior records and machine learning; that is, the user autonomously establishes at least one behavior through an interface, or is used by the user The habit of the machine knows the user's behavior through machine learning; the aforementioned behavior can be a preference, such as: color, sports news, entertainment news, public people, or art groups... etc. people, things, times, places, things Item of information

步驟二(S22):建立一語意庫,係將所有具有相同字義的詞彙加以蒐集整合,以避免目標圖片的標籤內容因為註解描述上的差異所產生的語意間隙現象影響後續的目標圖片檢索的準確性;Step 2 (S22): Establish a semantic database to collect and integrate all words with the same meaning, so as to avoid the semantic gap caused by the difference in the description of the target picture from affecting the accuracy of subsequent target picture retrieval Sex

步驟三(S23):選擇語意庫中與關鍵字的同義字詞彙,在語意庫中選取具有與用戶在一搜尋引擎所輸入之關鍵字所包含的所有同義字詞彙;Step 3 (S23): Select the synonyms of the words in the semantic database and the keywords, and select all the words of the same meaning in the semantic database with the keywords entered by the user in a search engine;

步驟四(S24):建立搜尋關鍵組合,將步驟一(S21)所得到的用戶特徵與步驟三(S23)中所選取之具有和用戶在搜尋引擎輸入之關鍵字所包含的所有同義字詞彙組織成一搜尋關鍵組合;Step 4 (S24): Create a search key combination, organize the user characteristics obtained in step 1 (S21) and all the synonyms with the keywords selected in step 3 (S23) and included in the keywords entered by the user in the search engine Search key combinations into one;

步驟五(S25):解析目標圖片之標籤內容並與搜尋關鍵組合比對,解析目標圖片中的標籤內容,並將經步驟四(S24)得到的搜尋關鍵組合與該目標圖片中的標籤內容比對,當該目標圖片中的標籤內容與該搜尋關鍵組合相符合時,將該目標圖片保留並於一顯示裝置顯示。Step 5 (S25): Analyze the label content of the target image and compare it with the search key combination, analyze the label content in the target image, and compare the search key combination obtained in step 4 (S24) with the label content in the target image Yes, when the content of the tag in the target picture matches the search key combination, the target picture is retained and displayed on a display device.

而該建立語意庫的方式可分為二種,分別為:(一)利用匯入少量的圖片來建立所需的語意庫以及(二)直接輸入關鍵字來建立所需的語意庫。The method of creating a semantic database can be divided into two types, namely: (1) using a small amount of imported pictures to create a desired semantic database and (2) directly entering keywords to create a desired semantic database.

其中,第一種建立方式請參看第三圖,其步驟係包括:Among them, please refer to the third figure for the first establishment method. The steps include:

步驟一(S31):匯入訓練圖庫;Step one (S31): import training library;

步驟二(S32):解析該圖庫中各圖片可能出現的標籤內容;在本步驟中,係先計算所述圖庫中的多張圖片總共產生k個標籤的適合度Sm ,且m = 1,…,k,之後由Sj 的高低決定是否保留該標籤(Sj 表示圖j的適合度);假設每張圖片取得q個標籤內容與其對應分數(q出現在j的範圍中),對應分數以Yij 表示,其中i為圖片編號(i = 1,…,n以及j = 0,…,q),當j為0代表該圖片並未產出任何標籤,在計算時加總所有k個標籤的總分,如果出現次數多且品質分數高,所計算出的Sm 就會比較高,因此該標籤代表這是可保留至語意庫,其運算公式如下:Step two (S32): Analyze the label content that may appear in each picture in the gallery; in this step, first calculate the suitability S m of the multiple pictures in the gallery to generate a total of k tags, and m = 1, …,K, and then the level of S j decides whether to keep the label (S j represents the suitability of figure j); suppose each image gets q label content and its corresponding score (q appears in the range of j), the corresponding score Represented by Y ij , where i is the picture number (i = 1,...,n and j = 0,...,q), when j is 0 means that the picture does not produce any labels, and all k are added in the calculation The total score of the label, if there are many occurrences and the quality score is high, the calculated S m will be relatively high, so the label represents that it can be retained to the semantic database, and its calculation formula is as follows:

Figure 02_image001
Figure 02_image001
.

步驟三(S33):收集該可能出現的標籤內容並儲存於語意庫。Step three (S33): Collect the content of the possible tags and store them in the semantic database.

第二種建立語意庫的方式請參看第四圖,其步驟係包括:For the second way to build a semantic database, please refer to the fourth figure. The steps include:

步驟一(S41):輸入中文關鍵字;Step one (S41): Enter Chinese keywords;

步驟二(S42):以一翻譯應用程式將輸入的中文關鍵字轉譯為英文;Step 2 (S42): Translate the entered Chinese keywords into English with a translation application;

步驟三(S43):收集該轉譯後的英文並儲存於語意庫。Step three (S43): Collect the translated English and store it in the semantic database.

另外,也可將既有語意庫的資料來進行本發明之語意庫的擴充,請參看第五圖,其步驟係包括:In addition, the data of the existing semantic database can also be used to expand the semantic database of the present invention. Please refer to the fifth figure. The steps include:

步驟一(S51):透過既有語意庫(例如:詞網(WordNet))尋找同義字;Step one (S51): Find synonymous words through the existing semantic database (for example: WordNet);

步驟二(S52):將同義字加入語意庫中儲存。Step two (S52): add synonyms to the semantic database for storage.

又,取得圖片的註解(Image Annotation)或是標籤化(Labelling)的目的,係為了可與建立搜尋關鍵組合進行比對,以識別該目標圖片是否符合用戶專屬偏好的所需圖片,進而提升專屬個性化圖片的搜尋效率。In addition, the purpose of obtaining image annotations or labeling is to compare with the establishment of a search key combination to identify whether the target image meets the user's specific preference of the desired image, thereby enhancing the exclusive Search efficiency of personalized pictures.

標籤化是建立機器學習的重要模式,需要匯入大量訓練的圖片,提供機器學習演算法(Machine Learning algorithms)學習,尤其圖片若沒有訓練過時,該圖片就無法辨識進而提供標籤,因此在進行標籤化之前,需要大量的訓練資料,然而,因為訓練機器相當耗費時間與成本,於是本發明為了有效解決這方面的問題,採用Google Cloud Vision API1,該系統可提供進行圖片標籤化(Label Detection)的功能。Tagging is an important model for building machine learning. A large number of training pictures need to be imported to provide machine learning algorithms (Machine Learning algorithms) for learning. Especially if the pictures are not trained out of date, the pictures cannot be identified and provide tags. Before the conversion, a large amount of training data is required. However, because the training machine is quite time-consuming and costly, in order to effectively solve this problem, the present invention adopts Google Cloud Vision API1, which provides image labeling (Label Detection). Features.

第六圖是從ImageNet [J. Deng, W. Dong, R. Socher, L.-J. Li, K. Li, and L. Fei-Fei, “Imagenet: A large-scale hierarchical image database,” in Computer Vision and Pattern Recognition, 2009. CVPR 2009. IEEE Conference on. IEEE, 2009, pp. 248–255.]測試案例所提供北極熊的圖片,待上傳完成圖片至Google Cloud Vision API辨識,並回傳3筆的標籤化運算結果,該API會傳回指定筆數的JSON資料。The sixth picture is from ImageNet [J. Deng, W. Dong, R. Socher, L.-J. Li, K. Li, and L. Fei-Fei, "Imagenet: A large-scale hierarchical image database," in Computer Vision and Pattern Recognition, 2009. CVPR 2009. IEEE Conference on. IEEE, 2009, pp. 248–255.] The picture of the polar bear provided in the test case, to be uploaded to the Google Cloud Vision API for identification, and return 3 strokes The result of the tagging operation, the API will return a specified number of JSON data.

由第六圖測試結果得知,Google Cloud Vision API可以正確的解析出此圖片是北極熊,並且帶來98%的符合程度,另外,該張圖片也有可能是哺乳類動物或者是動物,分別有96%與95%的信心程度,這些信賴度可用於區別標籤來源信賴權重。According to the test results of the sixth picture, the Google Cloud Vision API can correctly parse out that this picture is a polar bear, and brings 98% compliance. In addition, the picture may also be a mammal or an animal, 96% respectively With a confidence level of 95%, these trust levels can be used to distinguish the trust weight of the label source.

即本發明能透過採用基於人工智慧的Google Cloud Vision API,存取其圖片中的標籤資訊,藉由建立搜尋關鍵組合與圖片中的標籤關鍵字進行篩選比對,以達到精準搜尋的目的。That is, the present invention can access the tag information in its pictures by using the Google Cloud Vision API based on artificial intelligence, and achieve the purpose of accurate search by establishing a search key combination and tag keywords in the pictures for screening and comparison.

以上所舉者僅係本發明之部份實施例,並非用以限制本發明,致依本發明之創意精神及特徵,稍加變化修飾而成者,亦應包括在本專利範圍之內。The above mentioned are only some of the embodiments of the present invention, and are not intended to limit the present invention. Therefore, those who have been modified slightly according to the creative spirit and features of the present invention should also be included in the scope of this patent.

綜上所述,本發明實施例確能達到所預期之使用功效,又其所揭露之具體技術手段,不僅未曾見諸於同類產品中,亦未曾公開於申請前,誠已完全符合專利法之規定與要求,爰依法提出發明專利之申請,懇請惠予審查,並賜准專利,則實感德便。In summary, the embodiments of the present invention can indeed achieve the expected use effect, and the specific technical means disclosed by it have not only not been seen in similar products, nor have they been disclosed before application, and have fully complied with the patent law. Regulations and requirements, I filed an application for an invention patent in accordance with the law, pleaded for the examination, and granted the patent.

(S11)‧‧‧步驟一(S11)‧‧‧Step 1

(S12)‧‧‧步驟二(S12)‧‧‧Step 2

(S13)‧‧‧步驟三(S13)‧‧‧Step 3

(S21)‧‧‧步驟一(S21)‧‧‧Step 1

(S22)‧‧‧步驟二(S22)‧‧‧Step 2

(S23)‧‧‧步驟三(S23)‧‧‧Step 3

(S24)‧‧‧步驟四(S24)‧‧‧Step 4

(S25)‧‧‧步驟五(S25)‧‧‧Step 5

(S31)‧‧‧步驟一(S31)‧‧‧Step 1

(S32)‧‧‧步驟二(S32)‧‧‧Step 2

(S33)‧‧‧步驟三(S33)‧‧‧Step 3

(S41)‧‧‧步驟一(S41)‧‧‧Step 1

(S42)‧‧‧步驟二(S42)‧‧‧Step 2

(S43)‧‧‧步驟三(S43)‧‧‧Step 3

(S51)‧‧‧步驟一(S51)‧‧‧Step 1

(S52)‧‧‧步驟二(S52)‧‧‧Step 2

第一圖:本發明專屬個性化圖片搜尋優化方法的其一較佳實施例步驟流程圖Figure 1: Step flow chart of a preferred embodiment of a method for searching and optimizing a personalized image of the present invention

第二圖:本發明專屬個性化圖片搜尋優化方法的其二較佳實施例步驟流程圖Figure 2: The flow chart of the second preferred embodiment of the method for optimizing the search for an exclusive personalized picture of the present invention

第三圖:第一種建立語意庫的步驟流程圖Figure 3: The flow chart of the first step to build a semantic database

第四圖:第二種建立語意庫的步驟流程圖Figure 4: Flow chart of the second step to build a semantic database

第五圖:將既有語意庫的資料進行擴充語意庫的步驟流程圖Figure 5: Steps to expand the semantic database of the existing semantic database

第六圖:將一北極熊圖片上傳至Google Cloud Vision API辨識所回傳的標籤化運算結果Figure 6: Upload a polar bear image to the Google Cloud Vision API to identify the results of the tagging operation

(S11)‧‧‧步驟一 (S11)‧‧‧Step 1

(S12)‧‧‧步驟二 (S12)‧‧‧Step 2

(S13)‧‧‧步驟三 (S13)‧‧‧Step 3

Claims (6)

一種專屬個性化圖片搜尋優化方法,係包括以下步驟:   藉由用戶自主的資料建立或透過用戶的行為記錄與機器學習,得到至少一用戶特徵;   所述用戶特徵與用戶在一搜尋引擎輸入之關鍵字組織成一搜尋關鍵組合;   解析目標圖片中的標籤內容,並將所述搜尋關鍵組合與所述標籤內容比對,當所述標籤內容與所述搜尋關鍵組合相符合時,將所述目標圖片保留並於一顯示裝置顯示; 其中,在將所述搜尋關鍵組合與所述標籤內容比對時,係直接以所述搜尋關鍵組合的文字與所述標籤內容的文字進行比對;或在將所述搜尋關鍵組合與所述標籤內容比對時,係先將所述搜尋關鍵組合的文字轉換為搜尋關鍵組合向量,且所述標籤內容的文字也同時轉換為標籤內容向量,再將所述搜尋關鍵組合向量與所述標籤內容向量進行比對。An exclusive personalized image search optimization method, which includes the following steps:   Create at least one user characteristic by user's independent data or through user's behavior record and machine learning; Words are organized into a search key combination;    Analyze the tag content in the target picture, and compare the search key combination with the tag content, and when the tag content matches the search key combination, the target picture Reserved and displayed on a display device; wherein, when comparing the search key combination with the label content, the text of the search key combination is directly compared with the text of the label content; or When the search key combination is compared with the label content, the text of the search key combination is first converted into a search key combination vector, and the text of the label content is also converted into a label content vector, and then the The search key combination vector is compared with the label content vector. 如申請專利範圍第1項所述之專屬個性化圖片搜尋優化方法,其中,解析所述目標圖片的標籤內容的過程中,是透過連結Google Cloud Vision API存取所述目標圖片中的標籤資訊。The exclusive personalized image search optimization method as described in item 1 of the patent application scope, wherein in the process of parsing the label content of the target image, the label information in the target image is accessed by linking the Google Cloud Vision API. 一種專屬個性化圖片搜尋優化方法,其步驟包括:   藉由用戶自主的資料建立或透過用戶的行為記錄與機器學習,得到至少一用戶特徵;   建立一語意庫;   選取所述語意庫中具有與所述關鍵字所包含的所有同義字的詞彙;   所述用戶特徵與所述語意庫中具有與所述關鍵字所包含的所有同義字的詞彙組織成一搜尋關鍵組合;   解析目標圖片中的標籤內容,並將所述搜尋關鍵組合與所述標籤內容比對,當所述標籤內容與所述搜尋關鍵組合相符合時,將所述目標圖片保留並於一顯示裝置顯示;   其中,所述建立一語意庫之步驟可在所述得到至少一用戶特徵之步驟之前執行;所述建立語意庫的步驟包括:匯入訓練圖庫的步驟、解析所述圖庫中各圖片可能出現的標籤內容的步驟、收集所述可能出現的標籤內容並儲存於語意庫的步驟;   在所述解析目標圖片中的標籤內容的步驟中,係先計算所述圖庫中的多張圖片總共產生k個標籤的適合度Sm ,且m = 1,…,k,之後由Sj 的高低決定是否保留該標籤(Sj 表示圖j的適合度);假設每張圖片取得q個標籤內容與其對應分數(q出現在j的範圍中),對應分數以Yij 表示,其中i為圖片編號(i = 1,…,n以及j = 0,…,q),當j為0代表該圖片並未產出任何標籤,在計算時加總所有k個標籤的總分,如果出現次數多且品質分數高,所計算出的Sm 就會比較高,因此該標籤代表這是可保留至語意庫,其運算公式如下:
Figure 03_image001
An exclusive personalized image search optimization method, the steps of which include: creating at least one user feature through user-independent data creation or through user behavior records and machine learning; creating a semantic database; selecting the semantic database that has Vocabulary of all synonymous words contained in the keywords; the user characteristics and the vocabulary in the semantic database with all the synonyms of the keywords are organized into a search key combination; parsing the label content in the target picture, Comparing the search key combination with the tag content, and when the tag content matches the search key combination, the target picture is retained and displayed on a display device; wherein, the creation of a semantic The step of the library may be performed before the step of obtaining at least one user feature; the step of creating a semantic database includes the steps of importing a training gallery, analyzing the tag content that may appear in each picture in the gallery, collecting sites The step of describing the content of tags that may appear and storing in the semantic database; in the step of parsing the content of tags in the target picture, the suitability S m of the k tags generated by multiple pictures in the gallery is calculated first, And m = 1,...,k, then the level of S j decides whether to keep the label (S j represents the suitability of figure j); suppose each image gets q label content and its corresponding score (q appears in the range of j Middle), the corresponding score is represented by Y ij , where i is the picture number (i = 1,...,n and j = 0,...,q), when j is 0 means that the picture does not produce any labels, when calculating Summing up the total score of all k tags, if there are many occurrences and the quality score is high, the calculated S m will be relatively high, so the tag represents that this can be retained to the semantic database, and its calculation formula is as follows:
Figure 03_image001
.
如申請專利範圍第3項所述之影像自動分類的方法,其中,所述建立語意庫的步驟係包括:輸入中文關鍵字的步驟、以及以一翻譯應用程式將所述輸入中文關鍵字轉譯為英文並儲存於語意庫的步驟。The automatic image classification method as described in item 3 of the patent application scope, wherein the step of creating a semantic database includes the steps of inputting Chinese keywords and translating the input Chinese keywords into a translation application Steps of storing English in the semantic database. 如申請專利範圍第4項所述之專屬個性化圖片搜尋優化方法,其中,所述建立語意庫的步驟還包括一擴充語意庫的步驟,包括透過既有語意庫尋找同義字的步驟、將所述同義字加入所述語意庫中儲存。The exclusive personalized image search optimization method as described in item 4 of the patent application scope, wherein the step of creating a semantic database also includes a step of expanding the semantic database, including the step of finding synonyms through the existing semantic database The synonymous words are added to the semantic database for storage. 如申請專利範圍第3、4或5項所述之專屬個性化圖片搜尋優化方法,其中,在解析所述目標圖片的標籤內容的過程中,是透過連結Google Cloud Vision API存取所述影像中的標籤資訊。The exclusive personalized image search optimization method as described in items 3, 4 or 5 of the patent application scope, wherein in the process of parsing the tag content of the target image, the image is accessed through the link Google Cloud Vision API Tag information.
TW107117400A 2018-05-22 2018-05-22 Optimization method for searching exclusive personalized pictures TWI693524B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
TW107117400A TWI693524B (en) 2018-05-22 2018-05-22 Optimization method for searching exclusive personalized pictures

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
TW107117400A TWI693524B (en) 2018-05-22 2018-05-22 Optimization method for searching exclusive personalized pictures

Publications (2)

Publication Number Publication Date
TW202004516A true TW202004516A (en) 2020-01-16
TWI693524B TWI693524B (en) 2020-05-11

Family

ID=69942228

Family Applications (1)

Application Number Title Priority Date Filing Date
TW107117400A TWI693524B (en) 2018-05-22 2018-05-22 Optimization method for searching exclusive personalized pictures

Country Status (1)

Country Link
TW (1) TWI693524B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116431855A (en) * 2023-06-13 2023-07-14 荣耀终端有限公司 Image retrieval method and related equipment

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8165406B2 (en) * 2007-12-12 2012-04-24 Microsoft Corp. Interactive concept learning in image search
WO2009148422A1 (en) * 2008-06-06 2009-12-10 Thomson Licensing System and method for similarity search of images
CN103136228A (en) * 2011-11-25 2013-06-05 阿里巴巴集团控股有限公司 Image search method and image search device
TW201435627A (en) * 2013-03-12 2014-09-16 Hon Hai Prec Ind Co Ltd System and method for optimizing search results
KR20160015838A (en) * 2014-07-31 2016-02-15 삼성전자주식회사 Method and device for classifying contents

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116431855A (en) * 2023-06-13 2023-07-14 荣耀终端有限公司 Image retrieval method and related equipment
CN116431855B (en) * 2023-06-13 2023-10-20 荣耀终端有限公司 Image retrieval method and related equipment

Also Published As

Publication number Publication date
TWI693524B (en) 2020-05-11

Similar Documents

Publication Publication Date Title
US10394878B2 (en) Associating still images and videos
US11782970B2 (en) Query categorization based on image results
US9589208B2 (en) Retrieval of similar images to a query image
KR102422977B1 (en) Natural language image search
CN111026671B (en) Test case set construction method and test method based on test case set
Wang et al. Towards a relevant and diverse search of social images
JP5749279B2 (en) Join embedding for item association
CN110362660A (en) A kind of Quality of electronic products automatic testing method of knowledge based map
WO2017070656A1 (en) Video content retrieval system
US20150178321A1 (en) Image-based 3d model search and retrieval
WO2023108980A1 (en) Information push method and device based on text adversarial sample
WO2021196541A1 (en) Method, apparatus and device used to search for content, and computer-readable storage medium
US20230086735A1 (en) Systems and methods for retrieving videos using natural language description
CN103778206A (en) Method for providing network service resources
US20200065395A1 (en) Efficient leaf invalidation for query execution
CN109271624A (en) A kind of target word determines method, apparatus and storage medium
CN116010552A (en) Engineering cost data analysis system and method based on keyword word library
TWI693524B (en) Optimization method for searching exclusive personalized pictures
JP2020042770A (en) Data search method and data search system
Hybridised OntoKnowNHS: Ontology Driven Knowledge Centric Novel Hybridised Semantic Scheme for Image Recommendation Using Knowledge Graph
Qian et al. What is happening in the video?—Annotate video by sentence
CN111753861B (en) Automatic image labeling system and method for active learning
Li et al. Research on hot news discovery model based on user interest and topic discovery
TW202004519A (en) Method for automatically classifying images
KR20150096848A (en) Apparatus for searching data using index and method for using the apparatus

Legal Events

Date Code Title Description
MM4A Annulment or lapse of patent due to non-payment of fees