JP2022044687A

JP2022044687A - Image search method and apparatus

Info

Publication number: JP2022044687A
Application number: JP2022006614A
Authority: JP
Inventors: 勝權崔; Seung Kwon Choe; 相淵金; Sang Yeon Kim; ▲みん▼▲ちょる▼ 辛; Min Chul Shin; 盛皓咸; Sung Ho Ham; 相赫朴; Sang Hyuk Park; 澤潤金; Taek Yoon Kim; 澤秀金; Taek Soo Kim; 柳薫姜; Yoo Hoon Kang
Original assignee: Line Corp; Naver Corp
Current assignee: Z Intermediate Global Corp; Naver Corp
Priority date: 2019-01-28
Filing date: 2022-01-19
Publication date: 2022-03-17
Anticipated expiration: 2040-01-20
Also published as: TWI755666B; JP2020119566A; JP7013497B2; JP7282218B2; KR102221122B1; KR20200093321A; TW202029054A

Abstract

PROBLEM TO BE SOLVED: To provide an image search apparatus and a method thereof.

SOLUTION: An image search method includes: a step of obtaining reference feature vectors corresponding to multiple reference images containing the same attribute; a step of obtaining a representative feature vector corresponding to the same attribute from a generative model trained based on the obtained reference feature vectors; and a step of comparing the feature vector of a target image stored in a database with the representative feature vector, so as to assign attribute tags corresponding to the same attribute to at least some of the target images.

SELECTED DRAWING: Figure 2

Description

本発明は、検索の技術分野に関し、さらに具体的には、イメージに基づく検索装置及びその方法に関する。 The present invention relates to the technical field of search, and more specifically to an image-based search device and a method thereof.

情報通信技術の発達と共に、スマートフォン、個人用コンピュータのような多様な端末を利用した情報検索が幅広く利用されている。従来は、ユーザが入力したテキストを基に文書などを検索して提供していたが、最近は、写真や絵のようなイメージを検索する場合が増加している。 With the development of information and communication technology, information retrieval using various terminals such as smartphones and personal computers is widely used. In the past, documents and the like were searched and provided based on the text entered by the user, but recently, the number of cases of searching for images such as photographs and pictures is increasing.

衣類や風景のようなイメージをテキストだけで検索することは、容易ではない。例えば、ユーザが自分が所望するスタイルの衣類を検索する場合、テキスト検索では、適切なイメージを見い出し難い場合がある。 Searching for images such as clothing and landscapes by text alone is not easy. For example, when a user searches for clothing in the style he / she desires, it may be difficult to find a suitable image by text search.

最近では、ユーザからテキストではない検索イメージが入力され、検索イメージと類似したイメージを検索するサービスも試みられているが、そのためには、データベースに保存された多くのイメージに対して、一つ一つ手動的にタグを割り当てなければならないという困難さが伴う。 Recently, a service has been tried to search for images similar to the search image by inputting a search image that is not text from the user, but for that purpose, for many images stored in the database, one by one. It comes with the difficulty of having to manually assign tags.

韓国登録特許第１０－１７６８５２１号公報Korean Registered Patent No. 10-178521

本発明の一実施形態によるイメージ検索装置及びその方法は、データベースに保存されたイメージに対する属性タグの割り当てを簡便に行うことができるようにすることを技術的課題とする。 An image search device according to an embodiment of the present invention and a method thereof have a technical problem of making it possible to easily assign attribute tags to images stored in a database.

また、一実施形態によるイメージ検索装置及びその方法は、代表特徴ベクトルを利用し、ユーザが所望する種類のイメージを、さらに正確に検索することができるようにすることを技術的課題とする。 Further, the image search device according to one embodiment and the method thereof have a technical problem of being able to more accurately search for an image of a type desired by a user by using a representative feature vector.

一実施形態によるイメージ検索方法は、同一属性を含む複数の参照イメージに対応する参照特徴ベクトルを獲得する段階と、前記獲得した参照特徴ベクトルに基づいて訓練された生成モデルから、前記同一属性に対応する代表特徴ベクトルを獲得する段階と、データベースに保存されたターゲットイメージの特徴ベクトルと、前記代表特徴ベクトルとを比較し、前記ターゲットイメージのうち少なくとも一部のターゲットイメージに、前記同一属性に対応する属性タグを割り当てる段階と、を含んでもよい。 The image search method according to one embodiment corresponds to the same attribute from the stage of acquiring the reference feature vector corresponding to a plurality of reference images including the same attribute and the generation model trained based on the acquired reference feature vector. At the stage of acquiring the representative feature vector to be used, the feature vector of the target image stored in the database is compared with the representative feature vector, and at least a part of the target images corresponds to the same attribute. It may include the stage of assigning attribute tags.

一実施形態によるイメージ検索方法は、ユーザ端末から複数の検索イメージを受信する段階と、前記複数の検索イメージに対応する特徴ベクトルを獲得する段階と、前記獲得した特徴ベクトルでもって生成モデルを訓練させる段階と、前記特徴ベクトルでもって訓練された前記生成モデルから出力される代表特徴ベクトルと、データベースに保存されたターゲットイメージの特徴ベクトルとを比較し、前記複数の検索イメージに対応するターゲットイメージを決める段階と、前記決定されたターゲットイメージを、前記ユーザ端末に伝送する段階と、を含んでもよい。 In the image search method according to one embodiment, a stage of receiving a plurality of search images from a user terminal, a stage of acquiring a feature vector corresponding to the plurality of search images, and a step of training a generative model with the acquired feature vectors. The stage, the representative feature vector output from the generated model trained by the feature vector, and the feature vector of the target image stored in the database are compared, and the target image corresponding to the plurality of search images is determined. It may include a step and a step of transmitting the determined target image to the user terminal.

一実施形態によるイメージ検索装置は、プロセッサと、少なくとも１つのプログラム、及び生成モデルを保存するメモリと、ターゲットイメージを保存するデータベースと、を含むが、前記プロセッサは、前記少なくとも１つのプログラムが実行されることにより、同一属性を含む複数の参照イメージに対応する参照特徴ベクトルを獲得し、前記獲得した参照特徴ベクトルに基づいて訓練された前記生成モデルから、前記同一属性に対応する代表特徴ベクトルを獲得し、前記ターゲットイメージの特徴ベクトルと、前記代表特徴ベクトルとを比較し、前記ターゲットイメージのうち少なくとも一部のターゲットイメージに、前記同一属性に対応する属性タグを割り当てることができる。 An image retrieval device according to an embodiment includes a processor, at least one program, a memory for storing a generation model, and a database for storing a target image, wherein the processor is executed with the at least one program. By doing so, a reference feature vector corresponding to a plurality of reference images including the same attribute is acquired, and a representative feature vector corresponding to the same attribute is acquired from the generated model trained based on the acquired reference feature vector. Then, the feature vector of the target image and the representative feature vector can be compared, and an attribute tag corresponding to the same attribute can be assigned to at least a part of the target images of the target images.

他の実施形態によるイメージ検索方法は、イメージ検索装置に接続して属性を選択する段階と、前記イメージ検索装置に保存されたターゲットイメージのうち、前記選択した属性に対応する属性タグが割り当てられたターゲットイメージを、前記イメージ検索装置から受信する段階と、を含むが、前記イメージ検索装置に保存されたターゲットイメージは、複数の属性それぞれに対応する代表特徴ベクトルと、前記ターゲットイメージの特徴ベクトルとの比較結果に基づいて、前記複数の属性に対応する複数の属性タグのうち少なくとも一部の属性タグが割り当てられる。 In the image search method according to another embodiment, a step of connecting to an image search device and selecting an attribute and an attribute tag corresponding to the selected attribute among the target images stored in the image search device are assigned. The target image stored in the image search device includes a step of receiving the target image from the image search device, and the target image stored in the image search device includes a representative feature vector corresponding to each of a plurality of attributes and a feature vector of the target image. Based on the comparison result, at least a part of the attribute tags corresponding to the plurality of attributes is assigned.

本発明の一実施形態によるイメージ検索装置及びその方法は、データベースに保存されたイメージに対する属性タグの割り当てを簡便に行うことができる。 The image search device according to the embodiment of the present invention and the method thereof can easily assign attribute tags to images stored in a database.

また、本発明の一実施形態によるイメージ検索装置及びその方法は、代表特徴ベクトルを利用し、ユーザが所望する種類のイメージをさらに正確に検索するようにできる。 Further, the image search device and the method thereof according to the embodiment of the present invention can use the representative feature vector to search the image of the kind desired by the user more accurately.

ただし、本発明の一実施形態によるイメージ検索装置及びその方法が達成することができる効果は、前述のところに制限されるものではなく、言及されていない他の効果は、以下の記載から、本開示が属する技術分野で当業者に明確に理解されるであろう。 However, the effects that can be achieved by the image search device and the method thereof according to one embodiment of the present invention are not limited to those described above, and other effects not mentioned are described in the following description. It will be clearly understood by those skilled in the art in the technical field to which the disclosure belongs.

一実施形態によるイメージ検索装置及びユーザ端末を図示する図面である。It is a drawing which illustrates the image search apparatus and the user terminal by one Embodiment. 一実施形態による属性タグの割り当て方法について説明するためのフローチャートである。It is a flowchart for demonstrating the method of allocating the attribute tag by one Embodiment. 生成モデルから代表特徴ベクトルを獲得する方法について説明するための図面である。It is a drawing for demonstrating the method of acquiring the representative feature vector from a generative model. ターゲットイメージに対する属性タグの割り当て方法について説明するための例示的な図面である。It is an exemplary drawing for demonstrating the method of assigning an attribute tag to a target image. ユーザが属性を選択することができるようにするＵＩ（user interface）画面を図示する図面である。It is a drawing which illustrates the UI (user interface) screen which allows a user to select an attribute. 一実施形態によるイメージ検索方法について説明するためのフローチャートである。It is a flowchart for demonstrating the image search method by one Embodiment. 他の実施形態によるイメージ検索方法について説明するためのフローチャートである。It is a flowchart for demonstrating the image search method by another embodiment. 検索イメージから抽出される属性別特徴ベクトルを示す図面である。It is a drawing which shows the feature vector by attribute extracted from the search image. 検索イメージの属性別特徴ベクトルのうち少なくとも一つを、代表特徴ベクトルに交換する方法について説明するための図面である。It is a drawing for demonstrating the method of exchanging at least one of the attribute-specific feature vectors of a search image with a representative feature vector. 検索イメージの属性別特徴ベクトルと、ターゲットイメージの属性別特徴ベクトルと、を比較する方法について説明するための図面である。It is a drawing for demonstrating the method of comparing the attribute-specific feature vector of a search image, and the attribute-specific feature vector of a target image. 複数のオブジェクトを含む検索イメージを図示する例示的な図面である。It is an exemplary drawing which illustrates the search image which contains a plurality of objects. 図１１に図示された検索イメージの特徴ベクトルシーケンスのうち少なくとも一つを、代表特徴ベクトルに交換する方法について説明するための図面である。11 is a drawing for explaining a method of exchanging at least one of the feature vector sequences of the search image shown in FIG. 11 for a representative feature vector. 図１１に図示された検索イメージに対応する検索結果イメージを図示する例示的な図面である。FIG. 11 is an exemplary drawing illustrating a search result image corresponding to the search image illustrated in FIG. 一実施形態によるイメージ検索装置の構成を図示するブロック図である。It is a block diagram which illustrates the structure of the image search apparatus by one Embodiment.

本開示は、多様な変更を加えることができ、さまざまな実施形態を有することができるが、特定実施形態を図面に例示し、それを詳細な説明を介して説明する。しかし、それは、本開示を特定の実施形態について限定するものではなく、本開示の思想及び技術範囲に含まれる全ての変更、均等物あるいは代替物を含むものであると理解されなければならない。 Although various modifications can be made and various embodiments can be included in the present disclosure, specific embodiments are illustrated in the drawings, which will be described in detail. However, it should be understood that the present disclosure is not limited to a particular embodiment and includes all modifications, equivalents or alternatives contained within the ideas and technical scope of the present disclosure.

本実施形態についての説明において、関連公知技術についての具体的な説明が要旨を必要以上に不明確にすると判断される場合、その詳細な説明を省略する。また、本実施形態の説明過程において利用される数あるいは序数（例えば、第１、第２など）は、１つの構成要素を他の構成要素と区分するための識別記号に過ぎない。 In the description of the present embodiment, if it is determined that the specific description of the related publicly known technology makes the gist unnecessarily unclear, the detailed description thereof will be omitted. Further, the number or ordinal number (for example, first, second, etc.) used in the description process of the present embodiment is merely an identification symbol for distinguishing one component from another component.

また、本明細書において、一構成要素が他の構成要素と「連結されている」としたり「接続されている」としたりして言及されたときには、前記一構成要素が前記他の構成要素と直接連結されるか、あるいは直接接続されてもいるが、特に反対となる記載が存在しない以上、中間に他の構成要素を媒介させて連結されるか、あるいは接続されもすると理解されなければならないのである。 Further, in the present specification, when one component is referred to as being "connected" or "connected" to another component, the one component is referred to as the other component. It must be understood that they are directly connected or even directly connected, but are connected or even connected via other components in the middle, as long as there is no particular opposite statement. It is.

また、本明細書において、「～部（ユニット）」、「モジュール」と表現される構成要素は、２個以上の構成要素が１つの構成要素に合わされるか、あるいは１つの構成要素がさらに細分化された機能別に２個以上に分化されもする。また、以下で説明する構成要素それぞれは、自体が担当する株機能以外にも、他の構成要素が担当する機能のうち一部または全部をさらに遂行することもでき、構成要素それぞれが担当する主機能のうち一部機能が他の構成要素によって専担されても遂行されるということは言うまでもない。 Further, in the present specification, the components expressed as "-part (unit)" and "module" are such that two or more components are combined into one component, or one component is further subdivided. It is also differentiated into two or more according to the converted functions. In addition to the stock functions that each component is in charge of, each of the components described below can also perform some or all of the functions that other components are in charge of, and each component is in charge of the main. It goes without saying that some of the functions are performed even if they are exclusively occupied by other components.

以下、本開示の技術的思想による実施形態について順に詳細に説明する。 Hereinafter, embodiments based on the technical ideas of the present disclosure will be described in detail in order.

図１は、一実施形態によるイメージ検索装置１００及びユーザ端末１０を図示する図面である。 FIG. 1 is a drawing illustrating an image search device 100 and a user terminal 10 according to an embodiment.

ユーザ端末１０は、イメージ検索装置１００に、検索条件（例えば、検索イメージ、属性など）を選択または入力して伝送し、イメージ検索装置１００は、検索条件に対応する検索結果イメージを、ユーザ端末１０に伝送する。一実施形態において、イメージ検索装置１００は、検索条件に対応する検索結果イメージを決定するためのターゲットイメージをデータベースに保存することができる。イメージ検索装置１００は、ターゲットイメージのうち検索条件に対応するターゲットイメージを、検索結果イメージとしてユーザ端末１０に伝送することができる。 The user terminal 10 selects or inputs a search condition (for example, a search image, an attribute, etc.) to the image search device 100 and transmits the image, and the image search device 100 transmits a search result image corresponding to the search condition to the user terminal 10. To transmit to. In one embodiment, the image search device 100 can store a target image for determining a search result image corresponding to the search condition in the database. The image search device 100 can transmit the target image corresponding to the search condition among the target images to the user terminal 10 as a search result image.

ユーザ端末１０は、ネットワークを介して、イメージ検索装置１００に接続することができるデスクトップＰＣ（personal computer）、タブレット、スマートフォン、ノート型パソコンなどを含んでもよいが、それらに限定されるものではない。 The user terminal 10 may include, but is not limited to, a desktop PC (personal computer), a tablet, a smartphone, a notebook computer, and the like that can be connected to the image search device 100 via a network.

一実施形態において、イメージ検索装置１００及びユーザ端末１０は、ネットワークを介して互いに連結され、互いにデータを送受信することができる。ここで、該ネットワークは、有線ネットワークと無線ネットワークとを含んでもよく、具体的には、近距離ネットワーク（ＬＡＮ：local area network）、都市圏ネットワーク（ＭＡＮ：metropolitan area network）、広域ネットワーク（ＷＡＮ：wide area network）のような多様なネットワークを含んでもよい。また、ネットワーク１０は、公知のワールドワイドウェブ（ＷＷＷ：worldwide web）を含んでもよい。しかし、本発明によるネットワーク１０は、前記列挙されたネットワークに限られるものではなく、公知の無線データネットワークや公知の電話ネットワーク、公知の有無線テレビネットワークを少なくとも一部として含んでもよい。 In one embodiment, the image search device 100 and the user terminal 10 are connected to each other via a network and can transmit and receive data to and from each other. Here, the network may include a wired network and a wireless network, and specifically, a short-range network (LAN: local area network), a metropolitan area network (MAN: metropolitan area network), and a wide area network (WAN:). It may include various networks such as wide area network). Further, the network 10 may include a known worldwide web (WWW: worldwide web). However, the network 10 according to the present invention is not limited to the networks listed above, and may include a known wireless data network, a known telephone network, and a known known wireless television network as at least a part thereof.

一実施形態によるイメージ検索装置１００は、あらかじめ保存されたターゲットイメージに対して属性タグを割り当てることができる。本明細書において、「属性」とは、イメージに含まれたオブジェクトの特徴を意味する。前記属性は、カテゴリー別に区分される。例えば、ある衣類は、色相カテゴリーに属した赤色の属性、柄カテゴリーに属した縞模様属性、形状カテゴリーに属したワンピース属性を有することができる。また、本明細書において、「属性タグ」とは、イメージ内オブジェクトの特徴を示すためにイメージに割り当てられた情報を意味する。例えば、ある衣類の属性タグは、当該衣類の色相が赤色であり、柄は、縞模様で、形状は、ワンピースであるということ示すことができる。 The image search device 100 according to one embodiment can assign an attribute tag to a target image stored in advance. As used herein, the term "attribute" means the characteristics of the objects contained in the image. The attributes are classified by category. For example, a piece of clothing can have a red attribute that belongs to the hue category, a striped pattern attribute that belongs to the pattern category, and a one-piece attribute that belongs to the shape category. Further, in the present specification, the “attribute tag” means information assigned to an image to indicate the characteristics of an object in the image. For example, an attribute tag of a piece of clothing can indicate that the hue of the piece of clothing is red, the pattern is striped, and the shape is one piece.

イメージ検索装置１００が、ターゲットイメージに対して属性タグを割り当てる方法について、図２を参照して説明する。 A method of assigning an attribute tag to a target image by the image search device 100 will be described with reference to FIG.

図２は、一実施形態による属性タグの割り当て方法について説明するためのフローチャートである。 FIG. 2 is a flowchart for explaining a method of assigning attribute tags according to an embodiment.

Ｓ２１０段階において、イメージ検索装置１００は、同一属性を有しているか、あるいは同一属性タグが割り当てられた複数の参照イメージに対応する参照特徴ベクトルを獲得する。一実施形態において、管理者は、複数の参照イメージに対して、同一属性タグを割り当てることができる。例えば、複数の参照イメージに対して、「赤色」、「串柄」または「女性に人気ある衣類」のような同一属性タグが割り当てられる。 At the stage S210, the image search device 100 acquires reference feature vectors corresponding to a plurality of reference images having the same attribute or to which the same attribute tag is assigned. In one embodiment, the administrator can assign the same attribute tag to a plurality of reference images. For example, multiple reference images are assigned the same attribute tag, such as "red," "skewers," or "clothes popular with women."

複数の参照イメージに対応する参照特徴ベクトルは、マシンラーニングモデルによって出力されたものであってもよい。前記マシンラーニングモデルは、例えば、ＣＮＮ（convolutional neural network）を含んでもよい。また、該参照特徴ベクトルは、Ｎ（Ｎは、１以上の整数である）次元のデータであってもよい。一実施形態において、複数の参照イメージに対応する参照特徴ベクトルは、マシンラーニングモデルによって出力された後、後処理（post processing）を介して生成されたものでもある。後処理は、例えば、マシンラーニングモデルから出力された特徴ベクトルに含まれた複数の元素をサイズ基準で整列した後、上位ｋ個の要素の特徴ベクトル内での位置を示すインデックスを獲得し、獲得したインデックスを要素として有するインデックス特徴ベクトルを、参照特徴ベクトルとして生成する処理を含んでもよい。 The reference feature vector corresponding to the plurality of reference images may be output by the machine learning model. The machine learning model may include, for example, a CNN (convolutional neural network). Further, the reference feature vector may be N (N is an integer of 1 or more) dimension data. In one embodiment, the reference feature vector corresponding to the plurality of reference images is also generated by post processing after being output by the machine learning model. In the post-processing, for example, after arranging a plurality of elements contained in the feature vector output from the machine learning model on a size basis, an index indicating the position of the upper k elements in the feature vector is acquired and acquired. It may include a process of generating an index feature vector having the index as an element as a reference feature vector.

Ｓ２２０段階において、イメージ検索装置１００は、複数の参照イメージに対応する参照特徴ベクトルに基づいて訓練された生成モデルから、同一属性ないし同一属性タグに対応する代表特徴ベクトルを獲得する。該生成モデルは、複数の参照イメージの参照特徴ベクトルに基づいて訓練されることにより、参照特徴ベクトルと類似した代表特徴ベクトルを生成することができる。一実施形態において、生成モデルは、敵対的生成ネットワーク（ＧＡＮ：generative adversarial network）を含んでもよい。 In the S220 step, the image search device 100 acquires a representative feature vector corresponding to the same attribute or the same attribute tag from the generated model trained based on the reference feature vectors corresponding to the plurality of reference images. The generative model can generate a representative feature vector similar to the reference feature vector by training based on the reference feature vectors of the plurality of reference images. In one embodiment, the generative model may include a generative adversarial network (GAN).

図３を参照すれば、生成モデル３００は、参照イメージ３１０ａ，３１０ｂ，３１０ｃに対応する参照特徴ベクトル３２０ａ，３２０ｂ，３２０ｃによって訓練された後、例えば、ランダムノイズを入力データとして入力し、参照特徴ベクトル３２０ａ，３２０ｂ，３２０ｃと類似した代表特徴ベクトルを生成することができる。一実施形態において、生成モデル３００に該当するＧＡＮは，複数のＦＣＬ（fully connected layer）からなる。ＦＣＬとは、それぞれのレイヤが少なくとも１つのノードで構成されるが、あるレイヤの各ノードが、以前レイヤ及び次のレイヤに含まれた全てのノード全部と個別的に連結されたネットワーク構造である。該ＧＡＮは、イメージではない参照特徴ベクトルによって訓練されるので、ＣＮＮを含む必要がなく、ＦＣＬだけで、参照特徴ベクトルと類似した代表特徴ベクトルを生成することができる。すなわち、イメージ生成のためのＧＡＮの代わりに、代表特徴ベクトル生成のための複数のＦＣＬからなるＧＡＮが利用されることにより、訓練が迅速になされ、処理速度も、非常に速くなる。 Referring to FIG. 3, the generative model 300 is trained by the reference feature vectors 320a, 320b, 320c corresponding to the reference images 310a, 310b, 310c, and then, for example, input random noise as input data and the reference feature vector. Representative feature vectors similar to 320a, 320b, 320c can be generated. In one embodiment, the GAN corresponding to the generative model 300 is composed of a plurality of FCLs (fully connected layers). FCL is a network structure in which each layer is composed of at least one node, but each node of one layer is individually connected to all the nodes contained in the previous layer and the next layer. .. Since the GAN is trained by a reference feature vector that is not an image, it is not necessary to include a CNN, and the FCL alone can generate a representative feature vector similar to the reference feature vector. That is, by using a GAN composed of a plurality of FCLs for generating a representative feature vector instead of a GAN for image generation, training is speeded up and the processing speed is also very fast.

一実施形態において、該生成モデルは、参照特徴ベクトル元素の平均値を獲得し、獲得した平均値を元素として有する代表特徴ベクトルを生成することもできる。 In one embodiment, the generative model can also acquire an average value of a reference feature vector element and generate a representative feature vector having the acquired average value as an element.

再び図２を参照すれば、Ｓ２３０段階において、イメージ検索装置１００は、データベースに保存されたターゲットイメージの特徴ベクトルと、代表特徴ベクトルとを比較する。一例において、イメージ検索装置１００は、ターゲットイメージの特徴ベクトルそれぞれと、代表特徴ベクトルとのディスタンス（例えば、コサインディスタンス（cosine distance））を算出することができる。特徴ベクトル間の類似性を判断するために算出されるディスタンスは、公知されたものであるので、本明細書において、詳細な説明は、省略する。 Referring to FIG. 2 again, in the step S230, the image search device 100 compares the feature vector of the target image stored in the database with the representative feature vector. In one example, the image search device 100 can calculate the distance (for example, cosine distance) between each feature vector of the target image and the representative feature vector. Since the distance calculated for determining the similarity between the feature vectors is known, detailed description thereof is omitted in the present specification.

Ｓ２４０段階において、イメージ検索装置１００は、ターゲットイメージのうち少なくとも一部のターゲットイメージに、複数の参照イメージが有している同一属性に対応する属性タグを割り当てる。一実施形態において、イメージ検索装置１００は、代表特徴ベクトル対比で既設定の値以下のディスタンスを有する特徴ベクトルに対応するターゲットイメージに、同一属性に対応する属性タグを割り当てることができる。 At the stage S240, the image search device 100 assigns an attribute tag corresponding to the same attribute possessed by the plurality of reference images to at least a part of the target images among the target images. In one embodiment, the image search device 100 can assign an attribute tag corresponding to the same attribute to a target image corresponding to a feature vector having a distance equal to or less than a preset value in comparison with a representative feature vector.

図４は、ターゲットイメージに対する属性タグ割り当て方法について説明するための図面であり、図４を参照すれば、ターゲットイメージ４１０ａ，４２０ａ，４３０ａの特徴ベクトル４１０ｂ，４２０ｂ，４３０ｂそれぞれと、代表特徴ベクトルとの比較結果により、第１ターゲットイメージ４１０ａ及び第２ターゲットイメージ４２０ａに「ジグザグ柄」の属性タグ４１０ｃ，４２０ｃが割り当てられ、第３ターゲットイメージ４３０ａには、「パーガンディ色相」の属性タグ４３０ｃが割り当てられる。図４は、１つのターゲットイメージに１つの属性タグが割り当てられるように図示しているが、１つのターゲットイメージに複数の属性タグが割り当てられてもよい。 FIG. 4 is a drawing for explaining a method of assigning an attribute tag to a target image. With reference to FIG. 4, the feature vectors 410b, 420b, 430b of the target images 410a, 420a, 430a and the representative feature vector are shown. According to the comparison result, the attribute tags 410c and 420c of the "zigzag pattern" are assigned to the first target image 410a and the second target image 420a, and the attribute tag 430c of the "pergandy hue" is assigned to the third target image 430a. Although FIG. 4 is shown so that one attribute tag is assigned to one target image, a plurality of attribute tags may be assigned to one target image.

一実施形態によれば、相対的に少数の参照イメージに基づいて、多量のターゲットイメージに対して属性タグを割り当てることができるので、管理者が一つ一つデータベースに保存されたターゲットイメージに対して属性タグを割り当てる必要がなくなる。 According to one embodiment, attribute tags can be assigned to a large number of target images based on a relatively small number of reference images, so that the administrator can assign attribute tags to each target image stored in the database. Eliminates the need to assign attribute tags.

また、一実施形態によれば、新たな属性の開発、流行などにより、ターゲットイメージに新たな属性タグを割り当てる必要が発生した場合にも、速かに新たな属性タグをターゲットイメージに割り当てることができる。例えば、管理者によって新たな属性を有する少数の参照イメージが選択されれば、イメージ検索装置１００は、選択された参照イメージの参照特徴ベクトルでもって生成モデルを訓練させ、生成モデルから出力される代表特徴ベクトルと、ターゲットイメージの特徴ベクトルの比較結果により、ターゲットイメージに対し、新たな属性に対応する属性タグを割り当てることができる。 Further, according to one embodiment, even when it becomes necessary to assign a new attribute tag to the target image due to the development of a new attribute, fashion, etc., the new attribute tag can be quickly assigned to the target image. can. For example, if the administrator selects a small number of reference images with new attributes, the image search device 100 trains the generative model with the reference feature vector of the selected reference image, and the representative output from the generative model. Based on the comparison result between the feature vector and the feature vector of the target image, the attribute tag corresponding to the new attribute can be assigned to the target image.

図５は、ユーザが属性を選択することができるようにするユーザインターフェース（ＵＩ）画面５００を図示する図面である。 FIG. 5 is a drawing illustrating a user interface (UI) screen 500 that allows the user to select attributes.

前述のように、データベースに保存されたターゲットイメージに対し、属性タグが割り当てられれば、ユーザは、ユーザ端末１０のディスプレイに表示されたＵＩ画面５００において、特定属性を選択され行えば、簡便にイメージ検索が可能である。 As described above, if the attribute tag is assigned to the target image stored in the database, the user can easily select the specific attribute on the UI screen 500 displayed on the display of the user terminal 10. Search is possible.

図５を参照すれば、ユーザ端末１０のディスプレイに検索画面５００が表示されれば、ユーザは、検索画面５００において、特定カテゴリー（例えば、パターン）５２０に属した多くの属性５３０のうち一部（例えば、チェック）を選択することにより、当該属性に対応する属性タグが割り当てられたターゲットイメージを、検索結果に受信することができる。 Referring to FIG. 5, if the search screen 500 is displayed on the display of the user terminal 10, the user can use a part of the many attributes 530 belonging to the specific category (for example, pattern) 520 on the search screen 500 (for example, a pattern). For example, by selecting (check), the target image to which the attribute tag corresponding to the attribute is assigned can be received in the search result.

イメージ検索装置１００は、データベースに保存されたターゲットイメージのうち、ユーザ端末１０によって選択された属性に対応するタグが割り当てられたターゲットイメージを、検索結果イメージとして、ユーザ端末１０に伝送することができる。 The image search device 100 can transmit to the user terminal 10 as a search result image a target image to which a tag corresponding to the attribute selected by the user terminal 10 is assigned among the target images stored in the database. ..

具現例として、ユーザは、検索画面５００に、属性として検索語５１０を入力することもできる。イメージ検索装置１００は、データベースに保存されたターゲットイメージのうち、検索語５１０に対応する属性と係わる属性タグを有するターゲットイメージを、検索結果イメージとしてユーザ端末１０に伝送することができる。 As an embodiment, the user can also input the search term 510 as an attribute on the search screen 500. Among the target images stored in the database, the image search device 100 can transmit the target image having the attribute tag related to the attribute corresponding to the search term 510 to the user terminal 10 as the search result image.

一実施形態において、イメージ検索装置１００は、ユーザ端末１０によって選択された属性に対応する代表特徴ベクトルと、データベースに保存されたターゲットイメージの特徴ベクトルとを比較し、代表特徴ベクトルと同一であるか、あるいは類似した特徴ベクトルに対応するターゲットイメージを、検索結果イメージとして、ユーザ端末１０に伝送することができる。イメージ検索装置１００は、代表特徴ベクトルとの間で既設定の値以下であるディスタンスを有する特徴ベクトルに対応するターゲットイメージを検索結果イメージとしてユーザ端末１０に伝送することができる。 In one embodiment, the image search device 100 compares the representative feature vector corresponding to the attribute selected by the user terminal 10 with the feature vector of the target image stored in the database, and is it the same as the representative feature vector? , Or a target image corresponding to a similar feature vector can be transmitted to the user terminal 10 as a search result image. The image search device 100 can transmit a target image corresponding to a feature vector having a distance equal to or less than a preset value with the representative feature vector to the user terminal 10 as a search result image.

代表特徴ベクトルとの比較対象になる特徴ベクトルは、代表特徴ベクトルを出力する生成モデルを訓練させるのに利用された参照特徴ベクトルを、いかなる種類のマシンラーニングモデルが出力したかということによって相違し得る。例えば、生成モデルを訓練させるのに利用した参照特徴ベクトルが、従属（dependant）モデルによって出力された場合、比較対象になるターゲットイメージの特徴ベクトルも、従属モデルによって出力されたものでもある。該従属モデルは、特定カテゴリーの属性に対応する特徴ベクトルを出力するように訓練されたマシンラーニングモデルを意味する。例えば、該従属モデルは、色相カテゴリー専用のモデル、柄カテゴリー専用のモデルなどを含んでもよい。管理者は、ターゲットイメージに対して、色相属性のタグを割り当てる場合、参照イメージを、色相カテゴリー専用モデルに入力させ、参照イメージに対応する参照特徴ベクトルを獲得し、獲得した参照特徴ベクトルでもって生成モデルを訓練させることができる。 The feature vector to be compared with the representative feature vector can differ depending on what kind of machine learning model outputs the reference feature vector used to train the generative model that outputs the representative feature vector. .. For example, if the reference feature vector used to train the generative model is output by the dependent model, then the feature vector of the target image to be compared is also output by the dependent model. The dependent model means a machine learning model trained to output feature vectors corresponding to a particular category of attributes. For example, the dependent model may include a model dedicated to the hue category, a model dedicated to the pattern category, and the like. When assigning a hue attribute tag to a target image, the administrator inputs the reference image into the model dedicated to the hue category, acquires the reference feature vector corresponding to the reference image, and generates it with the acquired reference feature vector. You can train the model.

また、生成モデルを訓練させるのに利用した特徴ベクトルが、独立（independent）モデルによって出力された場合、比較対象になるターゲットイメージの特徴ベクトルも、独立モデルによって出力されたものでもある。該独立モデルは、属性のカテゴリーと無関係な特徴ベクトルを出力するように訓練されたマシンラーニングモデルを意味する。 Further, when the feature vector used for training the generative model is output by the independent model, the feature vector of the target image to be compared is also output by the independent model. The independent model means a machine learning model trained to output feature vectors independent of attribute categories.

図６は、一実施形態によるイメージ検索方法について説明するためのフローチャートである。 FIG. 6 is a flowchart for explaining an image search method according to an embodiment.

Ｓ６１０段階において、イメージ検索装置１００は、ユーザ端末１０から複数の検索イメージを受信する。イメージ検索装置１００は、ユーザ端末１０から、ネットワークを介して、複数の検索イメージを直接受信することもでき、ユーザ端末１０から、複数の検索イメージを特定することができる情報（例えば、複数の検索イメージが保存されたＵＲＬ、複数の検索イメージの識別情報など）を受信し、受信された情報に基づいて、複数の検索イメージを獲得することもできる。 At the stage S610, the image search device 100 receives a plurality of search images from the user terminal 10. The image search device 100 can also directly receive a plurality of search images from the user terminal 10 via the network, and information capable of identifying the plurality of search images from the user terminal 10 (for example, a plurality of searches). It is also possible to receive a URL in which an image is stored, identification information of a plurality of search images, etc., and acquire a plurality of search images based on the received information.

Ｓ６２０段階において、イメージ検索装置１００は、複数の検索イメージに対応する特徴ベクトルを獲得する。イメージ検索装置１００は、マシンラーニングモデルに複数の検索イメージを入力させ、マシンラーニングモデルから出力される特徴ベクトルを獲得することができる。前記マシンラーニングモデルは、従属モデルまたは独立モデルでもある。 At the stage S620, the image search device 100 acquires feature vectors corresponding to a plurality of search images. The image search device 100 can input a plurality of search images to the machine learning model and acquire the feature vector output from the machine learning model. The machine learning model is also a dependent model or an independent model.

Ｓ６３０段階において、イメージ検索装置１００は、複数の検索イメージに対応する特徴ベクトルでもって生成モデルを訓練させる。生成モデルがＧＡＮである場合、ＧＡＮは、生成器（generator）と区別器（discriminator）との相互作用を基に、訓練データと類似したデータを生成するように訓練される。 At the S630 stage, the image search device 100 trains the generative model with feature vectors corresponding to a plurality of search images. When the generative model is GAN, the GAN is trained to generate data similar to the training data based on the interaction between the generator and the discriminator.

Ｓ６４０段階において、イメージ検索装置１００は、複数の検索イメージの特徴ベクトルでもって訓練された生成モデルから出力される代表特徴ベクトルと、ターゲットイメージの特徴ベクトルとを比較する。イメージ検索装置１００は、ターゲットイメージの特徴ベクトルそれぞれと、代表特徴ベクトルとのディスタンスを算出することができる。 In the S640 step, the image search device 100 compares the representative feature vector output from the generated model trained with the feature vectors of the plurality of search images with the feature vector of the target image. The image search device 100 can calculate the distance between each feature vector of the target image and the representative feature vector.

前述のように、代表特徴ベクトルとの比較対象になる特徴ベクトルは、代表特徴ベクトルを出力する生成モデルを訓練させるのに利用された特徴ベクトルが、いかなる種類のマシンラーニングモデルによって出力されたかということによっても異なる。 As mentioned above, the feature vector to be compared with the representative feature vector is what kind of machine learning model output the feature vector used to train the generative model that outputs the representative feature vector. It also depends on.

Ｓ６５０段階において、イメージ検索装置１００は、データベースに保存されたターゲットイメージのうち、複数の検索イメージに対応するターゲットイメージを検索結果イメージとしてユーザ端末１０に伝送する。 At the S650 stage, the image search device 100 transmits the target images corresponding to the plurality of search images among the target images stored in the database to the user terminal 10 as search result images.

イメージ検索装置１００は、代表特徴ベクトルとの間で既設定の値以下のディスタンスを有する特徴ベクトルに対応するターゲットイメージを、検索結果イメージとしてユーザ端末１０に伝送することができる。 The image search device 100 can transmit a target image corresponding to a feature vector having a distance equal to or less than a preset value with the representative feature vector to the user terminal 10 as a search result image.

図６に図示された実施形態によれば、例えば、ユーザが、自分の趣向に符合する複数の検索イメージを選択すれば、イメージ検索装置１００は、ユーザの趣向に符合する検索イメージ以外の他のイメージを、検索結果として提供することができる。 According to the embodiment illustrated in FIG. 6, for example, if the user selects a plurality of search images that match his / her taste, the image search device 100 uses the image search device 100 other than the search images that match the user's taste. The image can be provided as a search result.

図７は、他の実施形態によるイメージ検索方法について説明するためのフローチャートである。 FIG. 7 is a flowchart for explaining an image search method according to another embodiment.

Ｓ７１０段階において、イメージ検索装置１００は、ユーザ端末１０から検索イメージを受信する。イメージ検索装置１００は、ユーザ端末１０から、ネットワークを介して、検索イメージを直接受信することもでき、ユーザ端末１０から、検索イメージを特定することができる情報（例えば、検索イメージが保存されたＵＲＬ、検索イメージの識別情報など）を受信し、受信された情報に基づいて、検索イメージを獲得することもできる。 At the S710 stage, the image search device 100 receives the search image from the user terminal 10. The image search device 100 can also directly receive the search image from the user terminal 10 via the network, and the information that can identify the search image from the user terminal 10 (for example, the URL in which the search image is stored) is stored. , The identification information of the search image, etc.) is received, and the search image can be acquired based on the received information.

Ｓ７２０段階において、イメージ検索装置１００は、検索イメージの属性別特徴ベクトルを獲得する。イメージ検索装置１００は、複数の従属モデルそれぞれに検索イメージを入力させ、複数の従属モデルから出力される特徴ベクトルを獲得することができる。例えば、図８に図示されているように、検索イメージ８１０を、第１従属モデル８２１に入力させ、第１属性（例えば、色相属性）に対応する特徴ベクトル８３１を獲得し、検索イメージ８１０を、第２従属モデル８２３に入力させ、第２属性（例えば、柄属性）に対応する特徴ベクトル８３３を獲得し、検索イメージ８１０を、第３従属モデル８２５に入力させ、第３属性（例えば、形状属性）に対応する特徴ベクトル８３５を獲得することができる。 At the stage S720, the image search device 100 acquires the attribute-specific feature vector of the search image. The image search device 100 can input a search image to each of a plurality of dependent models and acquire feature vectors output from the plurality of dependent models. For example, as shown in FIG. 8, the search image 810 is input to the first dependent model 821, the feature vector 831 corresponding to the first attribute (for example, the hue attribute) is acquired, and the search image 810 is obtained. The second dependent model 823 is input to acquire the feature vector 833 corresponding to the second attribute (for example, the pattern attribute), the search image 810 is input to the third dependent model 825, and the third attribute (for example, the shape attribute) is input. ) Corresponding to the feature vector 835 can be obtained.

一実施形態において、ユーザ端末１０から受信された検索イメージが２個である場合、いずれか１つの検索イメージを第１従属モデル８２１に入力させ、第１属性に対応する特徴ベクトルを獲得し、他の１つの検索イメージを、第２従属モデル８２３及び第３従属モデル８２５に入力させ、第２属性に対応する特徴ベクトルと、第３属性に対応する特徴ベクトルとを獲得することもできる。 In one embodiment, when the number of search images received from the user terminal 10 is two, any one of the search images is input to the first dependent model 821, the feature vector corresponding to the first attribute is acquired, and the other. It is also possible to input one search image of the above into the second dependent model 823 and the third dependent model 825, and acquire the feature vector corresponding to the second attribute and the feature vector corresponding to the third attribute.

他の実施形態において、ユーザ端末１０から受信された検索イメージが３個である場合、いずれか１つの検索イメージを第１従属モデル８２１に入力させ、第１属性に対応する特徴ベクトルを獲得し、他の１つの検索イメージを第２従属モデル８２３に入力させ、第２属性に対応する特徴ベクトルを獲得し、さらに他の１つの検索イメージを第３従属モデル８２５に入力させ、第３属性に対応する特徴ベクトルを獲得することもできる。 In another embodiment, when the number of search images received from the user terminal 10 is three, any one of the search images is input to the first dependent model 821, and the feature vector corresponding to the first attribute is acquired. The other one search image is input to the second dependent model 823 to acquire the feature vector corresponding to the second attribute, and the other search image is input to the third dependent model 825 to correspond to the third attribute. It is also possible to acquire the feature vector to be used.

すなわち、イメージ検索装置１００は、複数の検索イメージに対応する特徴ベクトルを組み合わせ、属性別特徴ベクトルを獲得することができる。 That is, the image search device 100 can combine feature vectors corresponding to a plurality of search images to acquire attribute-specific feature vectors.

図８は、属性別特徴ベクトルを抽出するために、３個の従属モデル８２１，８２３，８２５が利用されるように図示されているが、それは、１つの例示であるにすぎず、検索イメージとして、多様な個数の属性の特徴ベクトルを抽出するための多様な個数の従属モデルが利用され得る。 FIG. 8 is illustrated such that three dependent models 821,823,825 are used to extract attribute-specific feature vectors, but this is only an example and as a search image. , Various numbers of dependent models can be used to extract feature vectors of various numbers of attributes.

再び図７を参照すれば、Ｓ７３０段階において、イメージ検索装置１００は、属性別特徴ベクトルのうち、ユーザ端末１０によって選択された属性の特徴ベクトルを、ユーザ端末１０によって選択された属性に対応する代表特徴ベクトルに変更する。ユーザは、代表特徴ベクトルに変更する属性を、ユーザ端末１０を介して選択することができる。ユーザ端末１０は、検索イメージに含まれたオブジェクトの属性のうち、変更する属性（例えば、色相属性）と、変更する属性の代わりとなる他の属性とを選択することができる。それに対応し、イメージ検索装置１００は、属性別特徴ベクトルのうちユーザ端末１０によって選択された属性の特徴ベクトルを、他の属性に対応する代表特徴ベクトルに変更することができる。 Referring to FIG. 7 again, in the S730 stage, the image search device 100 uses the feature vector of the attribute selected by the user terminal 10 among the attribute-specific feature vectors as a representative corresponding to the attribute selected by the user terminal 10. Change to a feature vector. The user can select the attribute to be changed to the representative feature vector via the user terminal 10. The user terminal 10 can select an attribute to be changed (for example, a hue attribute) and another attribute to be replaced with the attribute to be changed among the attributes of the objects included in the search image. Correspondingly, the image search device 100 can change the feature vector of the attribute selected by the user terminal 10 among the feature vectors for each attribute to the representative feature vector corresponding to the other attribute.

例えば、図９に図示されているように、第１属性の特徴ベクトル８３１、第２属性の特徴ベクトル８３３、及び第３属性の特徴ベクトル８３５において、ユーザ端末１０によって選択された第２属性の特徴ベクトル８３３を、代表特徴ベクトル９３３に交換することができる。ここで、代表特徴ベクトル９３３は、第２属性専用の従属モデル８２３から出力される参照特徴ベクトルでもって訓練された生成モデルから出力された特徴ベクトルでもある。 For example, as shown in FIG. 9, in the feature vector 831 of the first attribute, the feature vector 833 of the second attribute, and the feature vector 835 of the third attribute, the feature of the second attribute selected by the user terminal 10 The vector 833 can be exchanged for the representative feature vector 933. Here, the representative feature vector 933 is also a feature vector output from the generative model trained with the reference feature vector output from the dependent model 823 dedicated to the second attribute.

Ｓ７４０段階において、イメージ検索装置１００は、代表特徴ベクトルを含む属性別特徴ベクトルと、データベースに保存されたターゲットイメージの属性別特徴ベクトルとを比較する。ここで、データベースに保存されたターゲットイメージの属性別特徴ベクトルは、従属モデルから出力されたものでもある。 At the stage S740, the image search device 100 compares the attribute-specific feature vector including the representative feature vector with the attribute-specific feature vector of the target image stored in the database. Here, the attribute-specific feature vector of the target image stored in the database is also output from the dependent model.

一実施形態において、イメージ検索装置１００は、図１０に図示されているように、代表特徴ベクトル９３３を含む属性別特徴ベクトル８３１，９３３，８３５と、データベースに保存されたターゲットイメージの属性別特徴ベクトル１０３１，１０３３，１０３５との比較にあたり、同一カテゴリーに属した属性の特徴ベクトル同士比較し、属性別ディスタンスを獲得することができる。イメージ検索装置１００は、ターゲットイメージの属性別特徴ベクトル１０３１，１０３３，１０３５それぞれに対し、第１属性に対応するディスタンス、第２属性に対応するディスタンス、及び第３属性に対応するディスタンスを算出することができる。 In one embodiment, as shown in FIG. 10, the image search device 100 includes an attribute-specific feature vector 831, 933, 835 including a representative feature vector 933, and an attribute-specific feature vector of a target image stored in a database. In comparison with 1031, 1033, 1035, feature vectors of attributes belonging to the same category can be compared with each other to obtain a distance for each attribute. The image search device 100 calculates the distance corresponding to the first attribute, the distance corresponding to the second attribute, and the distance corresponding to the third attribute for each of the attribute-specific feature vectors 1031, 1033, 1035 of the target image. Can be done.

Ｓ７５０段階において、イメージ検索装置１００は、データベースに保存されたターゲットイメージのうち、ユーザ端末１０に伝送するターゲットイメージを、前記算出された属性別ディスタンスに基づいて決定することができる。 At the stage S750, the image search device 100 can determine the target image to be transmitted to the user terminal 10 among the target images stored in the database based on the calculated distance for each attribute.

イメージ検索装置１００は、属性別ディスタンスを組み合わせた結果に基づいて、データベースに保存されたターゲットイメージのうち、代表特徴ベクトルを含む属性別特徴ベクトルと、同一であるか、あるいは類似したターゲットイメージを決定することができる。一例において、イメージ検索装置１００は、属性別ディスタンスを組み合わせた結果値が大きい順に、所定個数のターゲットイメージを選択することができ、または、属性別ディスタンスを組み合わせた結果値が既設定の値以上であるターゲットイメージを選択することもできる。 The image search device 100 determines a target image that is the same as or similar to the attribute-specific feature vector including the representative feature vector among the target images stored in the database, based on the result of combining the attribute-specific distances. can do. In one example, the image search device 100 can select a predetermined number of target images in descending order of the result value of combining the distances by attribute, or when the result value of combining the distances by attribute is equal to or more than the preset value. You can also select a target image.

Ｓ７６０段階において、イメージ検索装置１００は、Ｓ７５０段階で選択されたターゲットイメージを、ユーザ端末１０に伝送する。 In the S760 stage, the image search device 100 transmits the target image selected in the S750 stage to the user terminal 10.

図７に図示された実施形態によれば、ユーザは、検索イメージをイメージ検索装置１００に伝送し、検索イメージに含まれたオブジェクトの特定属性を他の属性に変更することにより、変更された属性に対応する検索結果を受信することができる。例えば、ユーザは、赤色及び縞模様のワンピースを含む検索イメージを、イメージ検索装置１００に伝送し、赤色の代わりに青色を選択し、青色及び縞模様のワンピースを含むイメージを検索結果として受けることができるのである。 According to the embodiment illustrated in FIG. 7, the user transmits the search image to the image search device 100 and changes the specific attribute of the object included in the search image to another attribute. You can receive the search results corresponding to. For example, the user may transmit a search image containing red and striped dresses to the image search device 100, select blue instead of red, and receive an image containing blue and striped dresses as a search result. You can.

図１１は、複数のオブジェクト１１１０，１１３０，１１５０を含む検索イメージ１１００を図示する例示的な図面である。 FIG. 11 is an exemplary drawing illustrating a search image 1100 that includes a plurality of objects 1110, 1130, 1150.

一実施形態において、ユーザ端末１０からイメージ検索装置１００に送信される検索イメージ１１００は、図１１に図示されているように、複数のオブジェクト１１１０，１１３０，１１５０を含んでもよい。図１１において、複数のオブジェクト１１１０，１１３０，１１５０は、上着オブジェクト１１１０、下衣（bottoms）オブジェクト１１３０及びかばんオブジェクト１１５０を含む。この場合、イメージ検索装置１００は、複数のオブジェクト１１１０，１１３０，１１５０それぞれに対応する部分イメージをクロッピング（cropping）して従属モデルに入力させ、従属モデルから出力されるそれぞれのオブジェクトの属性別特徴ベクトルを獲得することができる。例えば、イメージ検索装置１００は、図１１に図示された検索イメージ１１００から、上着オブジェクト１１１０の属性別特徴ベクトル、下衣オブジェクト１１３０の属性別特徴ベクトル、及びかばんオブジェクト１１５０の属性別特徴ベクトルを獲得することができる。 In one embodiment, the search image 1100 transmitted from the user terminal 10 to the image search device 100 may include a plurality of objects 1110, 1130, 1150 as shown in FIG. In FIG. 11, the plurality of objects 1110, 1130, 1150 include a coat object 1110, a bottoms object 1130, and a bag object 1150. In this case, the image search device 100 crops the partial images corresponding to each of the plurality of objects 1110, 1130, and 1150 and causes them to be input to the dependent model, and the attribute-specific feature vector of each object output from the dependent model. Can be acquired. For example, the image search device 100 acquires the attribute-specific feature vector of the coat object 1110, the attribute-specific feature vector of the undergarment object 1130, and the attribute-specific feature vector of the bag object 1150 from the search image 1100 shown in FIG. can do.

イメージ検索装置１００は、検索イメージ１１００に含まれた複数のオブジェクト１１１０，１１３０，１１５０それぞれの属性別特徴ベクトルを含む特徴ベクトルシーケンスにおいて、ユーザ端末１０によって選択されたオブジェクトの特定属性の特徴ベクトルを、ユーザ端末１０が選択した属性に対応する代表特徴ベクトルに変更することができる。例えば、ユーザが、図１１に図示された検索イメージ１１００において、かばんオブジェクト１１５０を選択し、かばんの色相属性を他の色相に変更する場合、イメージ検索装置１００は、かばんオブジェクト１１５０の色相属性の特徴ベクトルの代わりに、ユーザが変更する色相に対応する代表特徴ベクトルを、特徴ベクトルシーケンスに含めることができるのである。 The image search device 100 sets the feature vector of the specific attribute of the object selected by the user terminal 10 in the feature vector sequence including the attribute-specific feature vectors of each of the plurality of objects 1110, 1130, 1150 included in the search image 1100. It can be changed to the representative feature vector corresponding to the attribute selected by the user terminal 10. For example, when the user selects the bag object 1150 in the search image 1100 illustrated in FIG. 11 and changes the hue attribute of the bag to another hue, the image search device 100 is characterized by the hue attribute of the bag object 1150. Instead of a vector, a representative feature vector corresponding to the hue changed by the user can be included in the feature vector sequence.

例えば、図１２に図示されているように、第１オブジェクトの属性別特徴ベクトル１２１０、第２オブジェクトの属性別特徴ベクトル１２２０及び第３オブジェクトの属性別特徴ベクトル１２３０のうち、第３オブジェクト１２３０の第１属性の特徴ベクトル１２３１を、代表特徴ベクトル１２４１に交換することができる。 For example, as shown in FIG. 12, of the attribute-specific feature vector 1210 of the first object, the attribute-specific feature vector 1220 of the second object, and the attribute-specific feature vector 1230 of the third object, the third object 1230 is the third object. The one-attribute feature vector 1231 can be exchanged for the representative feature vector 1241.

イメージ検索装置１００は、代表特徴ベクトル１２４１を含む特徴ベクトルシーケンス１２５０と、データベースに保存されたターゲットイメージの特徴ベクトルシーケンスとを比較し、検索イメージ１１００に対応するターゲットイメージを決定することができる。イメージ検索装置１００は、代表特徴ベクトル１２４１を含む特徴ベクトルシーケンス１２５０と、ターゲットイメージの特徴ベクトルシーケンスとを比較するのにあたり、同一オブジェクトの同一カテゴリーの属性の特徴ベクトル同士比較し、各オブジェクトに係わる属性別ディスタンスを算出し、算出されたディスタンスを基に、検索結果に該当するターゲットイメージを決定することができる。一実施形態において、イメージ検索装置１００は、代表特徴ベクトル１２４１を含む特徴ベクトルシーケンス１２５０と、ターゲットイメージの特徴ベクトルシーケンスとを比較した結果として算出された各オブジェクトに係わる属性別ディスタンスを組み合わせ、組み合わせ結果値に基づいて、検索結果に該当するターゲットイメージを決定することができる。 The image search device 100 can compare the feature vector sequence 1250 including the representative feature vector 1241 with the feature vector sequence of the target image stored in the database to determine the target image corresponding to the search image 1100. The image search device 100 compares the feature vector sequence 1250 including the representative feature vector 1241 with the feature vector sequence of the target image, compares the feature vectors of the attributes of the same category of the same object, and attributes related to each object. Another distance can be calculated, and the target image corresponding to the search result can be determined based on the calculated distance. In one embodiment, the image search device 100 combines the attribute-specific distances related to each object calculated as a result of comparing the feature vector sequence 1250 including the representative feature vector 1241 with the feature vector sequence of the target image, and the combination result. Based on the value, the target image corresponding to the search result can be determined.

一実施形態において、イメージ検索装置１００は、検索結果として選択されたターゲットイメージを、ユーザ端末１０に伝送することができる。 In one embodiment, the image search device 100 can transmit the target image selected as the search result to the user terminal 10.

他の実施形態において、イメージ検索装置１００は、検索結果として選択されたターゲットイメージ内において、ユーザ端末１０が選択したオブジェクトに対応する部分イメージを抽出し、抽出された部分イメージを検索イメージに合成し、検索結果として、ユーザ端末１０に伝送することもできる。例えば、図１１及び図１３に図示されているように、イメージ検索装置１００は、検索イメージ１１００において、かばんオブジェクト１１５０を含む部分を、ターゲットイメージ内かばんオブジェクトを含む部分イメージ１３１１，１３２１，１３３１に交換した合成イメージ１３１０，１３２０，１３３０を、検索結果イメージとして、ユーザ端末１０に伝送することができる。 In another embodiment, the image search device 100 extracts a partial image corresponding to the object selected by the user terminal 10 in the target image selected as the search result, and synthesizes the extracted partial image into the search image. As a search result, it can also be transmitted to the user terminal 10. For example, as shown in FIGS. 11 and 13, the image search device 100 replaces the portion including the bag object 1150 in the search image 1100 with the partial images 1311, 1321, 1331 including the bag object in the target image. The combined images 1310, 1320, and 1330 can be transmitted to the user terminal 10 as search result images.

ユーザは複数のオブジェクトを含む検索イメージを、イメージ検索装置１００に伝送し、自らの趣向に符合するコーディネーションスタイルを検索結果を介して、容易に把握することができる。 The user can transmit a search image including a plurality of objects to the image search device 100, and easily grasp the coordination style that matches his / her taste through the search result.

図１４は、一実施形態によるイメージ検索装置１００の構成を図示するブロック図である。 FIG. 14 is a block diagram illustrating the configuration of the image search device 100 according to the embodiment.

図１４を参照すれば、一実施形態に他のイメージ検索装置１００は、メモリ１４１０、データベース１４３０、通信部１４５０及びプロセッサ１４７０を含んでもよい。 Referring to FIG. 14, another image retrieval device 100 may include a memory 1410, a database 1430, a communication unit 1450 and a processor 1470 in one embodiment.

メモリ１４１０は、少なくとも１つのプログラムを保存する。メモリ１４１０、データベース１４３０、通信部１４５０及びプロセッサ１４７０は、少なくとも１つのプログラムによって動作することができる。また、メモリ１４１０は、代表特徴ベクトルを生成するための生成モデルをさらに保存することができる。また、メモリ１４１０は、特徴ベクトル抽出のためのマシンラーニングモデルをさらに保存することができる。 Memory 1410 stores at least one program. The memory 1410, the database 1430, the communication unit 1450 and the processor 1470 can be operated by at least one program. In addition, the memory 1410 can further store a generative model for generating a representative feature vector. In addition, the memory 1410 can further store a machine learning model for feature vector extraction.

データベース１４３０は、ターゲットイメージ、及びターゲットイメージの特徴ベクトルを保存する。該ターゲットイメージの特徴ベクトルは、マシンラーニングモデルによって出力されたものでもある。例えば、該ターゲットイメージの特徴ベクトルは、従属モデルによって出力されたものを含んでもよく、独立モデルによって出力されたものを含んでもよい。また、複数のオブジェクトを含むターゲットイメージに対し、前記特徴ベクトルは、それぞれのオブジェクトの属性別特徴ベクトルを含む特徴ベクトルシーケンスを含んでもよい。一実施形態において、データベース１４３０は、ハードディスク（ＨＤＤ）、ＳＳＤなどの保存装置からもなる。 Database 1430 stores the target image and the feature vector of the target image. The feature vector of the target image is also output by the machine learning model. For example, the feature vector of the target image may include the one output by the dependent model or may include the one output by the independent model. Further, for a target image including a plurality of objects, the feature vector may include a feature vector sequence including the attribute-specific feature vectors of the respective objects. In one embodiment, the database 1430 also comprises a storage device such as a hard disk (HDD) or SSD.

通信部１４５０は、ユーザ端末１０とデータ送受信をする。通信部１４５０は、ユーザ端末１０から検索条件を受信し、検索条件に対応する検索結果、例えば、検索結果イメージをユーザ端末１０に伝送する。 The communication unit 1450 transmits / receives data to / from the user terminal 10. The communication unit 1450 receives the search condition from the user terminal 10 and transmits the search result corresponding to the search condition, for example, the search result image to the user terminal 10.

プロセッサ１４７０は、同一属性を有するか、あるいは同一属性タグが割り当てられた複数の参照イメージに対応する参照特徴ベクトルを獲得し、参照特徴ベクトルに基づいて訓練された生成モデルから、同一属性ないし同一属性タグに対応する代表特徴ベクトルを獲得し、ターゲットイメージの特徴ベクトルと代表特徴ベクトルとを比較し、ターゲットイメージのうち少なくとも一部のターゲットイメージに、同一属性に対応する属性タグを割り当てることができる。 The processor 1470 acquires a reference feature vector corresponding to a plurality of reference images having the same attribute or assigned the same attribute tag, and has the same attribute or the same attribute from a generation model trained based on the reference feature vector. It is possible to acquire a representative feature vector corresponding to a tag, compare the feature vector of the target image with the representative feature vector, and assign an attribute tag corresponding to the same attribute to at least a part of the target images.

また、プロセッサ１４７０は、ユーザ端末１０から受信された検索条件に対応するターゲットイメージを選択することができる。プロセッサ１４７０は、ユーザ端末１０が選択または入力した属性に対応する属性タグが割り当てられたターゲットイメージを選択するか、あるいはユーザ端末１０が選択または入力した属性の代表特徴ベクトルと同一／類似の特徴ベクトルを有したターゲットイメージを選択することができる。 Further, the processor 1470 can select the target image corresponding to the search condition received from the user terminal 10. The processor 1470 selects a target image to which an attribute tag corresponding to the attribute selected or input by the user terminal 10 is assigned, or a feature vector identical / similar to the representative feature vector of the attribute selected or input by the user terminal 10. You can select the target image with.

また、プロセッサ１４７０は、ユーザ端末１０から受信された複数の検索イメージに対応するターゲットイメージを選択することができる。また、プロセッサ１４７０は、ユーザ端末１０から受信された検索イメージの属性別特徴ベクトルのうち一部を、他の属性の代表特徴ベクトルで置換し、置換された結果に基づいて、ターゲットイメージを選択することもできる。 Further, the processor 1470 can select a target image corresponding to a plurality of search images received from the user terminal 10. Further, the processor 1470 replaces a part of the attribute-specific feature vectors of the search image received from the user terminal 10 with the representative feature vectors of other attributes, and selects the target image based on the replaced result. You can also do it.

一方、前述の本開示の実施形態は、コンピュータで実行されるプログラムに作成可能であり、作成されたプログラムは、媒体にも保存される。 On the other hand, the above-described embodiment of the present disclosure can be created in a program executed by a computer, and the created program is also stored in a medium.

該媒体は、コンピュータで実行可能なプログラムを続けて保存するか、あるいは実行またはダウンロードのために臨時保存するものでもある。また、該媒体は、単一またはいくつかのハードウェアが結合された形態の多様な記録手段または保存手段でもあるが、あるコンピュータシステムに直接接続される媒体に限定されず、ネットワーク上に分散存在するでもある。該媒体の例示としては、ハードディスク、フロッピーディスク及び磁気テープのような磁気媒体；ＣＤ－ＲＯＭ（compact disc read only memory）及びＤＶＤ（digital versatile disc）のような光記録媒体、フロプティカルディスク（floptical disk）のような磁気・光媒体（magneto-optical medium）；及びＲＯＭ（read-only memory）、ＲＡＭ（random access memory）、フラッシュメモリなどを含み、プログラム命令語が保存されるように構成されたものでもある。また、他の媒体の例示として、アプリケーションを流通するアプリストアや、その他多様なソフトウェアを供給したり流通したりするサイト、サーバなどで管理する記録媒体ないし記録媒体も挙げることができる。 The medium is also a continuous storage of computer-executable programs, or a temporary storage for execution or download. The medium is also a variety of recording or storage means in the form of a single piece or a combination of several pieces of hardware, but is not limited to a medium directly connected to a computer system and is distributed on a network. It also does. Examples of such media include magnetic media such as hard disks, floppy disks and magnetic tapes; optical recording media such as CD-ROMs (compact disc read only memory) and DVDs (digital versatile discs), and floptical discs. Includes magnetic and optical medium (disk); and ROM (read-only memory), RAM (random access memory), flash memory, etc., and is configured to store program command words. It is also a thing. Further, as an example of other media, a recording medium or a recording medium managed by an app store that distributes applications, a site that supplies or distributes various other software, a server, or the like can be mentioned.

以上、本開示の技術的思想について、望ましい実施形態を挙げて詳細に説明したが、本開示の技術的思想は、前記実施形態に限定されるものではなく、本開示の技術的思想の範囲内で、当分野で当業者によってさまざまな変形及び変更が可能である。 The technical idea of the present disclosure has been described in detail with reference to desirable embodiments, but the technical idea of the present disclosure is not limited to the above-described embodiment and is within the scope of the technical idea of the present disclosure. Therefore, various modifications and changes can be made by those skilled in the art in this field.

（付記１）
イメージ検索装置によるイメージ検索方法において、
同一属性を含む複数の参照イメージに対応する参照特徴ベクトルを獲得する段階と、
前記獲得した参照特徴ベクトルに基づいて訓練された生成モデルから、前記同一属性に対応する代表特徴ベクトルを獲得する段階と、
データベースに保存されたターゲットイメージの特徴ベクトルと、前記代表特徴ベクトルとを比較し、前記ターゲットイメージのうち少なくとも一部のターゲットイメージに、前記同一属性に対応する属性タグを割り当てる段階と、を含むことを特徴とするイメージ検索方法。
（付記２）
前記イメージ検索方法は、
ユーザ端末から属性の選択を受ける段階と、
前記ターゲットイメージのうち、前記選択を受けた属性に対応する属性タグが割り当てられたターゲットイメージを決定する段階と、
前記決定されたターゲットイメージを、前記ユーザ端末に伝送する段階と、をさらに含むことを特徴とする付記１に記載のイメージ検索方法。
（付記３）
前記イメージ検索方法は、
ユーザ端末から属性の選択を受ける段階と、
前記選択された属性に対応する代表特徴ベクトルと、前記データベースに保存されたターゲットイメージの特徴ベクトルと、を比較する段階と、
前記ターゲットイメージのうち、前記比較結果に基づいて選択されたターゲットイメージを、前記ユーザ端末に伝送する段階と、をさらに含むことを特徴とする付記１に記載のイメージ検索方法。
（付記４）
前記比較する段階は、
前記ユーザ端末が選択した属性に対応する代表特徴ベクトルが、従属モデルから出力される参照特徴ベクトルに基づいて訓練された前記生成モデルから出力されたものである場合、前記選択された属性に対応する代表特徴ベクトルと、前記従属モデルから出力された前記ターゲットイメージの特徴ベクトルと、を比較し、
前記ユーザ端末が選択した属性に対応する代表特徴ベクトルが、独立モデルから出力される参照特徴ベクトルに基づいて訓練された前記生成モデルから出力されたものである場合、前記選択された属性に対応する代表特徴ベクトルと、前記独立モデルから出力された前記ターゲットイメージの特徴ベクトルと、を比較する段階を含むことを特徴とする付記３に記載のイメージ検索方法。
（付記５）
前記イメージ検索方法は、
ユーザ端末から検索イメージを受信する段階と、
前記検索イメージの属性別特徴ベクトルを獲得する段階と、
前記属性別特徴ベクトルのうち、前記ユーザ端末が選択した属性の特徴ベクトルを、前記ユーザ端末が選択した属性に対応する代表特徴ベクトルに変更する段階と、
前記代表特徴ベクトルを含む属性別特徴ベクトルと、前記データベースに保存されたターゲットイメージの属性別特徴ベクトルと、を比較する段階と、
前記データベースに保存されたターゲットイメージのうち、前記比較結果に基づいて選択されたターゲットイメージを、前記ユーザ端末に伝送する段階と、を含むことを特徴とする付記１に記載のイメージ検索方法。
（付記６）
前記比較する段階は、
前記代表特徴ベクトルを含む属性別特徴ベクトルと、前記ターゲットイメージの属性別特徴ベクトルと、を同一カテゴリーの属性同士比較し、属性別ディスタンスを獲得する段階と、
前記獲得した属性別ディスタンスの大きさを組み合わせた結果により、前記データベースに保存されたターゲットイメージのうち、前記ユーザ端末に伝送するターゲットイメージを選択する段階と、を含むことを特徴とする付記５に記載のイメージ検索方法。
（付記７）
前記イメージ検索方法は、
ユーザ端末から複数のオブジェクトを含む検索イメージを受信する段階と、
前記検索イメージに含まれた複数のオブジェクトそれぞれの属性別特徴ベクトルを含む特徴ベクトルシーケンスを獲得する段階と、
前記複数のオブジェクトのうち、前記ユーザ端末からオブジェクトの選択を受ける段階と、
前記選択されたオブジェクトの属性別特徴ベクトルのうち、前記ユーザ端末が選択した属性の特徴ベクトルを、前記ユーザ端末が選択した属性に対応する代表特徴ベクトルに変更する段階と、
前記代表特徴ベクトルを含む特徴ベクトルシーケンスと、前記データベースに保存されたターゲットイメージの特徴ベクトルシーケンスと、を比較する段階と、
前記データベースに保存されたターゲットイメージのうち、前記比較結果に基づいて選択されたターゲットイメージを、前記ユーザ端末に伝送する段階と、をさらに含むことを特徴とする付記１に記載のイメージ検索方法。
（付記８）
前記選択されたターゲットイメージを、前記ユーザ端末に伝送する段階は、
前記選択されたターゲットイメージそれぞれに含まれた複数のオブジェクトのうち、前記ユーザ端末が選択したオブジェクトに対応する部分イメージを、前記検索イメージに合成する段階と、
前記部分イメージが合成された検索イメージを、検索結果として、前記ユーザ端末に伝送する段階と、を含むことを特徴とする付記７に記載のイメージ検索方法。
（付記９）
イメージ検索装置によるイメージ検索方法において、
ユーザ端末から複数の検索イメージを受信する段階と、
前記複数の検索イメージに対応する特徴ベクトルを獲得する段階と、
前記獲得した特徴ベクトルでもって生成モデルを訓練させる段階と、
前記特徴ベクトルでもって訓練された前記生成モデルから出力される代表特徴ベクトルと、データベースに保存されたターゲットイメージの特徴ベクトルと、を比較し、前記複数の検索イメージに対応するターゲットイメージを決定する段階と、
前記決定されたターゲットイメージを、前記ユーザ端末に伝送する段階と、を含むことを特徴とするイメージ検索方法。
（付記１０）
前記生成モデルは、
複数のＦＣＬ（fully connected layer）を含むＧＡＮ（generative adversarial network）を含むことを特徴とする付記１または９に記載のイメージ検索方法。
（付記１１）
付記１ないし１０のうちいずれか１項に記載のイメージ検索方法を装置のコンピュータに実行させるコンピュータプログラム。
（付記１２）
プロセッサと、
少なくとも１つのプログラム、及び生成モデルを保存するメモリと、
ターゲットイメージを保存するデータベースと、を含むが、
前記プロセッサは、前記少なくとも１つのプログラムが実行されることにより、
同一属性を含む複数の参照イメージに対応する参照特徴ベクトルを獲得し、
前記獲得した参照特徴ベクトルに基づいて訓練された前記生成モデルから、前記同一属性に対応する代表特徴ベクトルを獲得し、
前記ターゲットイメージの特徴ベクトルと、前記代表特徴ベクトルとを比較し、前記ターゲットイメージのうち少なくとも一部のターゲットイメージに、前記同一属性に対応する属性タグを割り当てることを特徴とするイメージ検索装置。
（付記１３）
ユーザ端末装置によるイメージ検索方法において、
イメージ検索装置に接続して属性を選択する段階と、
前記イメージ検索装置に保存されたターゲットイメージのうち、前記選択した属性に対応する属性タグが割り当てられたターゲットイメージを、前記イメージ検索装置から受信する段階と、を含むが、
前記イメージ検索装置に保存されたターゲットイメージは、
複数の属性それぞれに対応する代表特徴ベクトルと、前記ターゲットイメージの特徴ベクトルとの比較結果に基づいて、前記複数の属性に対応する複数の属性タグのうち少なくとも一部の属性タグが割り当てられたことを特徴とするイメージ検索方法。 (Appendix 1)
In the image search method using the image search device,
At the stage of acquiring reference feature vectors corresponding to multiple reference images containing the same attribute,
From the generated model trained based on the acquired reference feature vector, the stage of acquiring the representative feature vector corresponding to the same attribute, and
A step of comparing the feature vector of the target image stored in the database with the representative feature vector and assigning an attribute tag corresponding to the same attribute to at least a part of the target images of the target images is included. An image search method featuring.
(Appendix 2)
The image search method is
At the stage of receiving attribute selection from the user terminal,
Among the target images, the stage of determining the target image to which the attribute tag corresponding to the selected attribute is assigned, and
The image search method according to Appendix 1, further comprising a step of transmitting the determined target image to the user terminal.
(Appendix 3)
The image search method is
At the stage of receiving attribute selection from the user terminal,
A step of comparing the representative feature vector corresponding to the selected attribute with the feature vector of the target image stored in the database.
The image search method according to Appendix 1, further comprising a step of transmitting a target image selected based on the comparison result to the user terminal among the target images.
(Appendix 4)
The comparison stage is
When the representative feature vector corresponding to the attribute selected by the user terminal is output from the generated model trained based on the reference feature vector output from the dependent model, it corresponds to the selected attribute. Comparing the representative feature vector with the feature vector of the target image output from the dependent model,
When the representative feature vector corresponding to the attribute selected by the user terminal is output from the generated model trained based on the reference feature vector output from the independent model, it corresponds to the selected attribute. The image search method according to Appendix 3, further comprising a step of comparing the representative feature vector and the feature vector of the target image output from the independent model.
(Appendix 5)
The image search method is
At the stage of receiving the search image from the user terminal,
At the stage of acquiring the attribute-specific feature vector of the search image,
Among the attribute-specific feature vectors, the stage of changing the feature vector of the attribute selected by the user terminal to the representative feature vector corresponding to the attribute selected by the user terminal, and
A step of comparing the attribute-specific feature vector including the representative feature vector with the attribute-specific feature vector of the target image stored in the database.
The image search method according to Appendix 1, wherein the target image selected based on the comparison result among the target images stored in the database is transmitted to the user terminal.
(Appendix 6)
The comparison stage is
At the stage of comparing the attribute-specific feature vector including the representative feature vector and the attribute-specific feature vector of the target image between the attributes of the same category and acquiring the attribute-specific distance.
Addendum 5 is characterized by including a step of selecting a target image to be transmitted to the user terminal from the target images stored in the database based on the result of combining the acquired distance sizes for each attribute. The described image search method.
(Appendix 7)
The image search method is
At the stage of receiving a search image containing multiple objects from the user terminal,
The stage of acquiring the feature vector sequence including the attribute-specific feature vectors of each of the plurality of objects included in the search image, and
Among the plurality of objects, the stage of receiving an object selection from the user terminal and
Among the attribute-specific feature vectors of the selected object, the stage of changing the feature vector of the attribute selected by the user terminal to the representative feature vector corresponding to the attribute selected by the user terminal, and
A step of comparing the feature vector sequence including the representative feature vector with the feature vector sequence of the target image stored in the database.
The image search method according to Appendix 1, further comprising a step of transmitting a target image selected based on the comparison result to the user terminal among the target images stored in the database.
(Appendix 8)
The stage of transmitting the selected target image to the user terminal is
Among the plurality of objects included in each of the selected target images, the stage of synthesizing the partial image corresponding to the object selected by the user terminal into the search image, and
The image search method according to Appendix 7, wherein a search image in which the partial images are combined is transmitted to the user terminal as a search result.
(Appendix 9)
In the image search method using the image search device,
At the stage of receiving multiple search images from the user terminal,
The stage of acquiring the feature vectors corresponding to the plurality of search images, and
At the stage of training the generative model with the acquired feature vector,
A step of comparing the representative feature vector output from the generated model trained with the feature vector and the feature vector of the target image stored in the database to determine the target image corresponding to the plurality of search images. When,
An image search method comprising a step of transmitting the determined target image to the user terminal.
(Appendix 10)
The generative model is
The image search method according to Appendix 1 or 9, wherein a GAN (generative adversarial network) including a plurality of FCLs (fully connected layers) is included.
(Appendix 11)
A computer program that causes the computer of the apparatus to execute the image search method according to any one of Supplementary note 1 to 10.
(Appendix 12)
With the processor
A memory for storing at least one program and a generative model,
Contains a database to store the target image, but
The processor is executed by executing the at least one program.
Acquire the reference feature vector corresponding to multiple reference images containing the same attribute.
A representative feature vector corresponding to the same attribute is obtained from the generated model trained based on the obtained reference feature vector.
An image search device characterized by comparing a feature vector of the target image with the representative feature vector and assigning an attribute tag corresponding to the same attribute to at least a part of the target images of the target images.
(Appendix 13)
In the image search method by the user terminal device,
At the stage of connecting to the image search device and selecting attributes,
Among the target images stored in the image search device, the step of receiving the target image to which the attribute tag corresponding to the selected attribute is assigned from the image search device is included.
The target image saved in the image search device is
At least some of the attribute tags corresponding to the plurality of attributes are assigned based on the comparison result between the representative feature vector corresponding to each of the plurality of attributes and the feature vector of the target image. An image search method featuring.

１０ユーザ端末
１００イメージ検索装置
３００生成モデル
８１０，１１００検索イメージ
１３１０，１３２０，１３３０合成イメージ
１４１０メモリ
１４３０データベース
１４５０通信部
１４７０プロセッサ
10 User terminal 100 Image search device 300 Generation model 810, 1100 Search image 1310, 1320, 1330 Composite image 1410 Memory 1430 Database 1450 Communication unit 1470 Processor

Claims

In the image search method using the image search device,
The stage of receiving a search image or information related to the search image from the user terminal, and
The stage of receiving the selection for the attribute from the user terminal and
At the stage of acquiring the attribute-specific feature vector of the search image,
Among the attribute-specific feature vectors, the stage of changing the feature vector related to the attribute selected by the user terminal to the representative feature vector corresponding to the attribute selected by the user terminal, and
At the stage of comparing the attribute-specific feature vector including the representative feature vector with the attribute-specific feature vector of the target image stored in the database.
An image search method comprising a step of transmitting a target image selected based on the comparison result to the user terminal among the target images stored in the database.

The image search method is
Further including a step of receiving a selection for any one of a plurality of objects included in the search image from the user terminal.
The stage of acquiring the attribute-specific feature vector of the search image is
At the stage of acquiring the attribute-specific feature vectors related to multiple objects in the search image,
Including a step of constructing a feature vector sequence including attribute-specific feature vectors relating to the plurality of objects.
The stage of the change is
Among the attribute-specific feature vectors of the object selected by the user terminal, the step of changing the feature vector associated with the attribute selected by the user terminal to the representative feature vector corresponding to the attribute selected by the user terminal is included. The image search method according to claim 1, which is characterized.

The stage of transmitting the selected target image to the user terminal is
Among the plurality of objects included in each of the selected target images, the stage of synthesizing the partial image corresponding to the object selected by the user terminal into the search image, and
The image search method according to claim 2, further comprising a step of transmitting a search image in which the partial images are combined to the user terminal as a search result.

In the image search method using the image search device,
At the stage of receiving the first search image and the second search image from the user terminal,
The stage of acquiring the feature vector of the first attribute from the first search image and acquiring the feature vector of the second attribute from the second search image, and
A step of comparing the attribute-specific feature vector including the first attribute feature vector and the second attribute feature vector with the attribute-specific feature vector of the target image stored in the database.
An image search method comprising a step of transmitting a target image selected by the comparison result to the user terminal.

The image search method is
At the stage of acquiring reference feature vectors corresponding to multiple reference images containing the same attribute,
The stage of acquiring the representative feature vector corresponding to the same attribute from the generated model trained based on the acquired reference feature vector, and
It further includes a step of comparing the feature vector of the target image stored in the database with the representative feature vector and assigning an attribute tag corresponding to the same attribute to at least a part of the target images among the target images. The image search method according to any one of claims 1 to 4, wherein the image search method is characterized.

The comparison stage is
At the stage of comparing the attribute-specific feature vector including the representative feature vector and the attribute-specific feature vector of the target image between the attributes of the same category and acquiring the attribute-specific distance.
Claim 1 is characterized by including a step of selecting a target image to be transmitted to the user terminal from the target images stored in the database based on the result of combining the acquired distance sizes for each attribute. The image search method according to any one of 4 to 4.

A program for executing the image search method according to any one of claims 1 to 6.

With the processor
Memory to store at least one program,
Contains a database to store the target image, but
The processor is executed by executing the at least one program.
Receive a search image or information related to the search image from the user terminal,
Acquire the attribute-specific feature vector of the search image and
Among the attribute-specific feature vectors, the feature vector related to the attribute selected by the user terminal is changed to the representative feature vector corresponding to the attribute selected by the user terminal.
The attribute-specific feature vector including the representative feature vector is compared with the attribute-specific feature vector of the target image stored in the database.
An image search device characterized in that a target image selected based on the comparison result among the target images stored in the database is transmitted to the user terminal.