KR100701163B1

KR100701163B1 - Methods for Tagging Person Identification Information to Digital Data and Recommending Additional Tag by Using Decision Fusion

Info

Publication number: KR100701163B1
Application number: KR1020060077416A
Authority: KR
Inventors: 유중희
Original assignee: (주)올라웍스
Priority date: 2006-08-17
Filing date: 2006-08-17
Publication date: 2007-03-29
Also published as: KR20060101359A; US20090256678A1; WO2008020663A1; US7634115B2

Abstract

본 발명은 디시젼 퓨전(decision fusion)법을 사용하여 디지털 데이터로부터 인물 식별 정보를 추출할 때 보다 높은 정확도로 추출할 수 있도록 하고, 기존 태그 이외의 부가 태그를 추천하는 방법에 관한 것이다. 보다 상세하게는, 본 발명은, 디지털 데이터 관리 시스템에서, 디지털 데이터의 속성 및 내용에 따라 디지털 데이터로부터 인물 식별 정보를 추출하여 이를 기초로 디지털 데이터에 자동적으로 인물 태그를 부여할 때에, 다양한 부가 정보를 이용하여 보다 정확한 인물 식별 정보를 획득하고 이를 디지털 데이터에 태깅하는 방법과 새로 생성된 디지털 데이터에 부여되어 있는 태그와 동일한 태그를 부여받은 기존의 디지털 데이터에 자주 사용된 태그를 파악하여 이를 디지털 데이터에 추가적으로 부여하도록 하는 방법에 관한 것이다. The present invention relates to a method for extracting with higher accuracy when extracting person identification information from digital data using decision fusion and recommending additional tags other than the existing tags. More specifically, in the digital data management system, when the person identification information is extracted from the digital data according to the attributes and contents of the digital data and automatically assigns the person tag to the digital data, various additional information is provided. Acquiring more accurate person identification information by using and tagging it into digital data, and identifying the tag frequently used in existing digital data assigned the same tag as the tag given to newly created digital data It relates to a method to give in addition to.

디지털 데이터, 자동 태깅, 얼굴 인식, 디시젼 퓨전, 베이지안 분석 Digital Data, Auto Tagging, Face Recognition, Decision Fusion, Bayesian Analysis

Description

Method for Tagging Person Identification Information to Digital Data and Recommending Additional Tag by Using Decision Fusion}

도 1은, 본 발명의 바람직한 실시예에 따라, 동일 클러스터 내의 복수의 사진 중 어느 하나에 특정 인물이 나타나면 나머지 사진에도 특정 인물과 동일한 인물이 사진에 나타날 확률이 높다는 점에 착안하여 인물 인식에 대한 정확도를 높일 수 있다는 것을 나타내는 도면이다.FIG. 1 is a view illustrating a person having a high probability of appearing in a photo the same person as a specific person in a picture when a specific person appears in any one of a plurality of pictures in the same cluster according to a preferred embodiment of the present invention. It is a figure which shows that accuracy can be improved.

도 2는, 본 발명의 다른 바람직한 실시예에 따라, 2인 이상의 인물들이 만남을 가지기 전에 이메일, 전화 통화, SMS 메시지 등을 이용하여 통신을 하기 마련이므로, 만남 중에 촬영된 사진에 나타난 인물은 통신 시의 발신자 또는 수신자에 해당될 확률이 높다는 점에 착안하여 인물 인식에 대한 정확도를 높일 수 있다는 것을 나타내는 도면이다.Figure 2, according to another preferred embodiment of the present invention, before the two or more people have a meeting to communicate using e-mail, phone calls, SMS messages, etc., the person shown in the picture taken during the meeting is a communication The figure shows that the accuracy of person recognition can be improved by paying attention to the high probability of being the sender or the receiver of the city.

도 3a 및 3b는, 본 발명의 또 다른 바람직한 실시예에 따라, 복수의 사진에 나타난 적이 있는 모든 인물 중 일부 친분있는 인물만이 사진에 자주 나타난다는 점에 착안하여 인물 인식에 대한 정확도를 높일 수 있다는 것을 나타내는 도면이 다.Figures 3a and 3b, according to another preferred embodiment of the present invention, can improve the accuracy of the person recognition by focusing on the fact that only some familiar people among all the people who have been shown in the plurality of pictures often appear in the picture Figure showing that there is.

도 4는 본 발명의 바람직한 실시예에 따라, 인물 인식에 대한 각각의 결과를 베이지안 분석법에 의하여 결합하여 Top n 리스트를 만드는 것을 나타내는 도면이다.4 is a diagram illustrating a list of top n lists by combining respective results of person recognition by Bayesian analysis according to a preferred embodiment of the present invention.

도 5는 본 발명의 바람직한 실시예에 따라, 부가 태그를 추천하는 방법에 대한 플로우 차트이다.5 is a flow chart of a method for recommending additional tags, in accordance with a preferred embodiment of the present invention.

도 6은 얼굴 인식의 정확도를 높이기 위한 종래 기술을 나타내는 도면이다.6 is a diagram illustrating a prior art for improving the accuracy of face recognition.

본 발명은 디지털 데이터로부터 인물 식별 정보를 추출할 때 보다 높은 정확도로 추출할 수 있도록 하는 방법 및 부가 태그를 추천하는 방법에 관한 것이다.The present invention relates to a method for extracting with higher accuracy when extracting person identification information from digital data and a method for recommending additional tags.

보다 상세하게는, 본 발명은, 디지털 데이터 관리 시스템에서, 디지털 데이터의 속성 및 내용에 따라 디지털 데이터로부터 인물 식별 정보를 추출하여 이를 기초로 디지털 데이터에 자동적으로 인물 태그를 부여할 때에 다양한 부가 정보를 이용하여 보다 정확한 인물 식별 정보를 획득하고 이를 디지털 데이터에 태깅하는 방법과 새로 생성된 디지털 데이터에 부여되어 있는 태그와 동일한 태그를 가지고 있는 복수의 기존의 디지털 데이터에 자주 사용된 태그를 파악하여 이를 새로 생성된 디지털 데이터에 추가적으로 부여하도록 하는 방법에 관한 것이다. More specifically, in the digital data management system, when the person identification information is extracted from the digital data according to the attributes and contents of the digital data and automatically assigns the person tag to the digital data, various additional information may be applied. To obtain more accurate person identification information and tag it into digital data, and to identify tags that are frequently used in a plurality of existing digital data that have the same tag as that given to newly created digital data. The present invention relates to a method for additionally giving to generated digital data.

근래에 들어서 디지털 카메라, 카메라 부착형 이동 통신 기기, 디지털 캠코더 및 MP3 플레이어 등의 디지털 기기가 널리 보급되면서 사용자들이 디지털 사진이나 디지털 동영상과 같은 디지털 데이터를 생성하여 교류하는 양과 빈도가 크게 증가되고 있다. 이러한 디지털 데이터의 생성 및 교류의 확대는 필연적으로 데이터의 관리 문제를 수반한다. 그러나, 통상적으로 디지털 데이터의 경우, 그 정보량이 방대한 바, 데이터의 분류 및 통합 등에 의한 데이터 관리(검색 및 정보 추출 등)가 용이하지 않을 수 있다.Recently, as digital devices such as digital cameras, camera-attached mobile communication devices, digital camcorders, and MP3 players are widely used, the amount and frequency of users generating and exchanging digital data such as digital photographs and digital videos are greatly increased. The expansion of the generation and exchange of digital data inevitably involves data management problems. However, in the case of digital data, since the amount of information is huge, it may not be easy to manage data (such as searching and extracting information) by classifying and integrating data.

디지털 데이터 관리를 위한 하나의 종래기술로서, 태그를 사용하여 데이터를 분류하거나 통합하는 등의 데이터 관리를 행하는 기술이 널리 알려져 있다. "태그"는 데이터에 대한 신속한 액세스 또는 검색을 위해 디지털 데이터에 부착되는 부가 데이터로 이해될 수 있다. 이러한 태그는 일련의 문자, 숫자, 또는 문자 및 숫자의 조합으로 이루어지는 것이 일반적이다.As one conventional technique for digital data management, a technique for performing data management such as classifying or integrating data using tags is widely known. A "tag" can be understood as additional data attached to digital data for quick access or retrieval of the data. Such tags typically consist of a series of letters, numbers, or a combination of letters and numbers.

이러한 태그의 종류로는 공간에 관한 태그, 인물에 관한 태그, 사물에 관한 태그, 시간에 관한 태그 등이 있으며, 이 중 특히 인물에 관한 태그를 보다 높은 정확도로 디지털 데이터로부터 추출하기 위한 시도가 종래부터 있었다.Types of such tags include tags on space, tags on people, tags on objects, tags on time, and the like. Among these, attempts to extract tags on people from digital data with higher accuracy are known. Since.

도 6은 얼굴 인식을 위한 얼굴 모양 특성(face appearance feature)과 콘텍스트 특성(contextual features)을 개시하는데, 이와 관련된 논문으로서 마이크로소프트 리서치 아시아(Microsoft Research Asia)에서 2003년 발표된 논문인 "가족 앨범 내의 인간 얼굴에 대한 자동 주석달기(Automated Annotation of Human Faces in Family Albums)" 에서 볼 수 있듯이, 사진 속의 인물에 대한 얼굴 인식률을 높 이기 위해 일반적인 얼굴 인식 알고리즘에 콘텍스트 특성을 부가한 내용이 개시되어 있다. 콘텍스트 특성이란 동일한 날 또는 동일한 이벤트에 찍은 복수의 사진에 나타나는 개인은 동일한 옷을 입고 있을 확률이 크다는 것으로서, 이러한 콘텍스트 특성에 의해 앨범 내의 한정된 수의 개인을 구별하는 데에 사용될 수 있다는 것이다. 이 논문에 따르면, 서로 멀리 떨어진 다른 날(상기 논문에 따르면, 이틀이 넘는 간격)에 찍힌 두 개의 사진에는 이러한 콘텍스트 특성에 근거한 얼굴 인식은 적용되지 않는다는 것이며, 이러한 유사성을 적용하기 위해서는 적어도 이틀 내에 찍힌 사진들일 것을 조건으로 한다.FIG. 6 discloses a face appearance feature and a contextual feature for face recognition. As a related paper, FIG. 6 is a paper published in 2003 by Microsoft Research Asia, entitled "Family Album." As described in "Automated Annotation of Human Faces in Family Albums", a context-sensitive feature has been added to a general face recognition algorithm in order to increase face recognition rate for a person in a picture. The context characteristic is that an individual appearing in a plurality of photos taken on the same day or at the same event is likely to be wearing the same clothes, and this context characteristic can be used to distinguish a limited number of individuals in an album. According to the paper, face recognition based on this context characteristic does not apply to two pictures taken on different days that are far apart from each other (according to the above paper, more than two days apart). Subject to photographs.

그러나, 상기 논문에 개시된 이러한 종래의 얼굴 인식 기술은 일반적인 얼굴 모양 특성에 대한 유사성 파악에 더하여 동일 시간 범위 내에 존재하는 인물들의 의상에 대한 유사성 파악만을 기초로 얼굴 인식을 행하고 있으므로 그 정확도에서 상당히 미흡한 면이 있다. 예를 들면, 동일 시간 범위를 하루 또는 이틀로 정하여도 동일 인물이 다른 의상을 입고 있는 경우가 자주 있을 수 있다. 특히, 여름을 제외한 봄, 가을, 겨울의 경우 실외에서는 셔츠 위에 점퍼를 입고 있던 사람이 실내에서는 셔츠만을 입고 있는 일이 비일비재하다. 또한, 인물의 모습이 정면이 아닐 때 얼굴의 모양에 대한 인식 뿐만 아니라 입고 있는 의상의 모양에 대한 인식의 정확도가 떨어질 수 밖에 없다는 치명적인 단점이 존재하게 된다.However, the conventional face recognition technology disclosed in the above paper is quite inferior in accuracy because it performs face recognition only on the basis of the similarity of the clothes of the persons within the same time range, in addition to the similarity of the general face shape characteristics. There is this. For example, even if the same time range is set for one or two days, the same person may often wear different clothes. In particular, in the spring, autumn, and winter, except for the summer, people who were wearing jumpers on a shirt are wearing only a shirt indoors. In addition, when the figure of the person is not the front face, there is a fatal disadvantage that not only the recognition of the shape of the face but also the accuracy of the recognition of the shape of the clothes to be worn is deteriorated.

한편, 종래에는 디지털 데이터가 새로 생성되었을 때 부여된 태그(예를 들면, 장소에 관한 태그, 인물에 관한 태그, 사물에 관한 태그, 시간에 관한 태그 중 적어도 하나)외에 부가 태그를 추가적으로 부여하고자 할 때 수동으로 일일이 입력 해야 하는 불편이 있었다. 이에 대해, 종래의 디지털 데이터 관리 시스템으로 웹 기반 시스템인 Flickr^TM(http://www.flickr.com)(이하, "플리커")를 예로 들어 설명하면, 이 시스템에 개에 관한 사진을 업로드하면서 "dog"라고 태그를 붙인 후 이 개에 관한 부가 태그(예를 들면, "뽀삐")를 추가로 부여하고 싶을 때, 수동으로 일일이 입력해야 하는 불편이 있었으며, 이는 추가로 부여하고 싶은 부가 태그가 늘어날수록 큰 단점으로 부각될 수 밖에 없었다. Meanwhile, in the related art, an additional tag is to be additionally added in addition to a tag (for example, at least one of a tag about a place, a tag about a person, a tag about an object, and a tag about a time) assigned when digital data is newly generated. It was inconvenient when you had to enter it manually one by one. On the other hand, a conventional digital data management system is described using Flickr ^TM ( http://www.flickr.com ) (hereinafter referred to as "Flickr") which is a web-based system. When I tagged "dog" and wanted to add an additional tag (e.g. "Poppie") about this dog, I had to manually enter it manually. Increasingly, it was inevitable to be a big disadvantage.

따라서, 본 발명의 목적은, 종래의 얼굴 인식 방법보다 현저히 높은 정확도로 얼굴 인식을 행함으로써 생성된 인물 식별 정보를 디지털 데이터에 자동적으로 태깅하는 방법을 제공함으로써 사용자가 보다 편리하게 디지털 데이터를 관리하고 타 사용자와 공유할 수 있도록 하는 것이다.Accordingly, it is an object of the present invention to provide a method for automatically tagging digital data to person identification information generated by performing face recognition with a significantly higher accuracy than a conventional face recognition method. It is to share with other users.

더 나아가, 본 발명의 추가적인 목적은, 디시젼 퓨전(decision fusion)을 사용하여 새로 생성된 디지털 데이터에 이미 부여되어 있는 태그와 동일한 태그를 가지고 있는 기존의 디지털 데이터에 자주 사용된 태그를 파악하여 이를 디지털 데이터에 추가적으로 부여함으로써 사용자의 입력 부담을 상당히 경감시킬 수 있도록 하는 것이다.Furthermore, an additional object of the present invention is to identify and use tags frequently used in existing digital data having the same tag as that already assigned to newly generated digital data using decision fusion. In addition to digital data, the user's input burden can be significantly reduced.

후술하는 본 발명에 대한 상세한 설명은, 본 발명이 실시될 수 있는 특정 실 시예를 예시로서 도시하는 첨부 도면을 참조하여 설명한다.　 이들 실시예는 당업자가 본 발명을 실시할 수 있기에 충분하도록 상세히 설명된다. 본 발명의 다양한 실시예는 서로 다르지만 상호 배타적일 필요는 없음이 이해되어야 한다.　 예를 들어, 여기에 기재되어 있는 특정 형상, 구조 및 특성은 일 실시예에 관련하여 본 발명의 정신 및 범위를 벗어나지 않으면서 다른 실시예로 구현될 수 있다.　 또한, 각각의 개시된 실시예 내의 개별 구성요소의 위치 또는 배치는 본 발명의 정신 및 범위를 벗어나지 않으면서 변경될 수 있음이 이해되어야 한다.　 따라서, 후술하는 상세한 설명은 한정적인 의미로서 취하려는 것이 아니며, 본 발명의 범위는, 적절하게 설명된다면, 그 청구항들이 주장하는 것과 균등한 모든 범위와 더불어 첨부된 청구항에 의해서만 한정된다.DETAILED DESCRIPTION OF THE INVENTION The following detailed description of the invention will be described with reference to the accompanying drawings, which show, by way of illustration, specific embodiments in which the invention may be practiced. These embodiments are described in sufficient detail to enable those skilled in the art to practice the invention. It should be understood that the various embodiments of the present invention are different but need not be mutually exclusive. For example, certain shapes, structures, and characteristics described herein may be embodied in other embodiments without departing from the spirit and scope of the invention with respect to one embodiment. In addition, it is to be understood that the location or arrangement of individual components within each disclosed embodiment may be changed without departing from the spirit and scope of the invention. The following detailed description, therefore, is not to be taken in a limiting sense, and the scope of the present invention, if properly described, is defined only by the appended claims, along with the full range of equivalents to which such claims are entitled.

상기한 바와 같은 본 발명의 목적을 달성하고, 후술하는 본 발명의 특징적인 기능을 수행하기 위한, 본 발명의 특징적인 구성은 하기와 같다:In order to achieve the object of the present invention as described above, and to carry out the characteristic functions of the present invention described below, the characteristic configuration of the present invention is as follows:

본 발명의 일 태양에 따르면, 디지털 데이터 관리 시스템에서 디지털 기기에 의해 생성된 제1 디지털 데이터에 제1 인물의 식별 정보를 태그로서 부여하는 방법 - 상기 디지털 데이터 관리 시스템은 상기 디지털 기기의 사용자의 식별 정보 및 생활 패턴 정보를 포함하고, 상기 제1 디지털 데이터는 상기 제1 인물의 이미지를 포함함 - 으로서, 상기 제1 인물에 대한 부가 정보 중 적어도 하나를 이용하여 상기 제1 인물을 식별하는 단계, 및 상기 제1 인물의 식별 정보를 상기 제1 디지털 데이터에 태그로서 자동적으로 부여하는 단계를 포함하되, 상기 부가 정보는, 상기 제1 디지털 데이터와 동일 클러스터 내에 분포되는, 상기 제1 디지털 데이터와 다 른 적어도 하나의 디지털 데이터로 구성되는 제1 디지털 데이터군이 제2 인물의 이미지를 포함하는 경우, 상기 제1 인물이 상기 제2 인물과 동일 인물일 확률이 높음을 지시하는 정보, 상기 제1 디지털 데이터가 2인 이상의 만남 중에 생성된 것으로 판별되는 경우, 상기 제1 인물의 이미지는 상기 만남과 연관된 통신의 발신자 또는 수신자의 것일 확률이 높음을 지시하는 정보, 상기 제1 디지털 데이터와 다른 적어도 하나의 디지털 데이터로 구성되는 제2 디지털 데이터군에 제3 인물의 이미지가 m회 포함되는 경우, 상기 m의 회수에 기초하여 상기 제1 인물이 상기 제3 인물과 동일 인물일 확률이 높음을 지시하는 정보, 상기 제1 디지털 데이터의 생성 시각 및 장소 중 적어도 하나를 상기 디지털 기기의 사용자의 생활 패턴 정보와 함께 고려함으로써 획득되는 상기 제1 인물에 관한 정보, 상기 제1 인물의 이미지 내의 상기 제1 인물의 얼굴의 위치 및 사이즈에 따라 상기 제1 인물을 특정 인물로 식별할 확률을 지시하는 정보, 상기 디지털 기기가 듀얼 카메라인 경우, 상기 제1 디지털 데이터가 상기 듀얼 카메라의 렌즈가 상기 디지털 기기의 사용자 쪽을 향할 때에 생성된 것이면 상기 제1 인물은 상기 디지털 기기의 사용자일 확률이 높다는 정보, 및 상기 디지털 기기가 듀얼 카메라인 경우, 상기 제1 디지털 데이터가 상기 디지털 기기의 사용자 쪽을 향하지 않을 때에 생성된 것이면 상기 제1 인물은 상기 디지털 기기의 사용자가 아닐 확률이 높다는 정보 중 적어도 하나를 포함하는 방법이 제공된다.According to an aspect of the present invention, a method for assigning identification information of a first person as a tag to first digital data generated by a digital device in a digital data management system, wherein the digital data management system identifies the user of the digital device. Information and life pattern information, wherein the first digital data includes an image of the first person, wherein the first person is identified using at least one of additional information about the first person; And automatically assigning identification information of the first person as a tag to the first digital data, wherein the additional information is different from the first digital data, which is distributed in the same cluster as the first digital data. If the first digital data group consisting of at least one other digital data includes an image of the second person, If it is determined that the first person is likely to be the same person as the second person and the first digital data is generated during two or more meetings, the image of the first person is associated with the meeting. M, when the image of the third person is included in the second digital data group composed of at least one digital data that is different from the first digital data, indicating that the communication party is the sender or the receiver of the communication; At least one of information indicating a probability that the first person is the same person as the third person, a generation time and a location of the first digital data based on the number of times; According to the information about the first person obtained by considering together, the position and size of the face of the first person in the image of the first person Information indicating a probability of identifying the first person as a specific person; when the digital device is a dual camera, the first digital data is generated when a lens of the dual camera is directed toward a user of the digital device. The first person is information that the first person is likely to be a user of the digital device, and the first person is generated when the first digital data is not directed toward the user of the digital device when the digital device is a dual camera. A method is provided that includes at least one of information that a probability of not being a user of a digital device is high.

본 발명의 다른 태양에 따르면, 디지털 데이터 관리 시스템에 포함된 데이터베이스에 복수의 기존 디지털 데이터가 저장되어 있고 특정 사용자가 디지털 기기 를 사용하여 특정 장소 S 에서 하루 중의 특정 시간대 T 에 특정 인물 P 또는 특정 사물 O 의 이미지 중 적어도 하나를 포함하는 제1 디지털 데이터를 생성할 때 - 상기 제1 디지털 데이터에 상기 장소 S, 상기 인물 P, 상기 사물 O, 상기 시간대 T 중 적어도 하나가 1차 태그로서 부여되어 있음 - , 상기 1차 태그와 다른 2차 태그를 식별 정보로서 추가적으로 상기 제1 디지털 데이터에 부여하는 방법으로서, 상기 기존 디지털 데이터 중 상기 1차 태그와 동일한 태그를 가지는 동일_태그_디지털 데이터를 검색하는 단계, 상기 동일_태그_디지털 데이터에 부여되어 있는 모든 태그 중 상기 1차 태그와 동일한 태그를 제외한 나머지 태그가 상기 동일_태그_디지털 데이터에 얼마나 자주 부여되었는지 각각의 빈도를 계산하는 단계, 상기 나머지 태그 중 상기 빈도가 가장 높은 N 개의 태그를 상기 디지털 기기의 화면에 표시하는 단계, 상기 사용자가 상기 화면에 표시된 N 개의 태그 중 적어도 하나를 상기 제1 디지털 데이터의 상기 2차 태그로서 선택하는 단계, 및 상기 선택된 태그를 상기 제1 디지털 데이터에 상기 2차 태그로서 부여하는 단계를 포함하는 방법이 제공된다.According to another aspect of the present invention, a plurality of existing digital data is stored in a database included in a digital data management system, and a specific user P or a specific object is stored at a specific time zone T of a day at a specific place S using a digital device. When generating first digital data including at least one of the images of O-at least one of the place S, the person P, the object O, and the time zone T is assigned to the first digital data as a primary tag. A second tag different from the primary tag as additional identification information to the first digital data, wherein the same_tag_digital data having the same tag as the primary tag among the existing digital data is searched for Step, the same as the primary tag of all tags assigned to the same tag tag digital data Calculating each frequency of how often the remaining tags except the tag are assigned to the same_tag_digital data, displaying N tags having the highest frequency among the remaining tags on the screen of the digital device; A user selecting at least one of the N tags displayed on the screen as the secondary tag of the first digital data, and assigning the selected tag as the secondary tag to the first digital data; A method is provided.

[본 발명의 바람직한 실시예][Preferred Embodiments of the Invention]

인물 식별을 위한 부가 정보 1Additional information for person identification 1

도 1은 본 발명의 바람직한 실시예에 따라 동일 클러스터 내에는 동일한 인물이 사진에 찍힐 확률이 높다는 점에 착안하여 인물 인식에 대한 정확도를 높일 수 있다는 것을 나타내는 도면이다. 여기서, 클러스터는 디지털 사진과 같은 디지털 데이터가 생성된 시간에 따라 구분지어질 수도 있고, 디지털 데이터가 생성된 장소에 따라 구분지어질 수도 있다.1 is a view showing that the accuracy of the recognition of the person can be improved by paying attention to the high probability that the same person is taken in the picture in the same cluster according to a preferred embodiment of the present invention. Here, the clusters may be classified according to the time at which the digital data such as the digital picture is generated or may be classified according to the place where the digital data is generated.

이와 같이 클러스터 개념을 도입할 수 있는 근거는 동일한 출원인이 출원한 한국특허출원 제2006-14040호의 덩어리 태깅 부분의 설명에서 찾아볼 수 있다. 이에 따르면, 각 디지털 기기의 사용자는 본인이 원하는 시간 및 장소에서 디지털 데이터를 생성할 수 있다. 그러나, 이러한 디지털 데이터의 생성 시간이나 생성 장소 등을 살펴보면 실제로는 매우 불연속적인 분포를 갖는 경우가 많다. 즉, 디지털 기기의 사용자가 디지털 데이터를 매우 편중된 시간 및 장소에서만 생성하는 경향이 있기 마련이다. 따라서, 여러 디지털 데이터에 공통되는 장소나 시간대 등을 파악함으로써, 해당 디지털 데이터를 하나의 덩어리로 묶고 해당 디지털 데이터 모두에 공통되는 태그를 일괄적으로 부여하는 덩어리 태깅 방식이 존재할 수 있다는 것이다.The basis for introducing the cluster concept as described above can be found in the description of the mass tagging portion of Korean Patent Application No. 2006-14040 filed by the same applicant. Accordingly, a user of each digital device can generate digital data at a time and place desired by the user. However, when the generation time and the place of generation of such digital data are examined, there are many cases where the distribution is very discontinuous. That is, a user of a digital device tends to generate digital data only at a very decent time and place. Accordingly, by identifying a place, a time zone, and the like that are common to various digital data, there may be a mass tagging method that bundles the digital data into one chunk and collectively assigns tags common to all the digital data.

이와 같은 덩어리 태깅의 아이디어를 응용하여 인물에 관한 태그를 높은 정확도로 추출하도록 할 수 있다. 즉, 동일 클러스터 내에 복수의 디지털 데이터가 생성된 경우, 어느 하나의 디지털 데이터에 특정 인물의 이미지가 포함되었다고 한다면 나머지 디지털 데이터에도 동일한 인물의 이미지가 포함되어 있을 확률이 높다고 할 수 있다. 이는 짧은 시간대 내에서는 동일 인물에 관해 포즈를 달리하여 다량의 디지털 데이터가 생성될 가능성이 높고, 가까운 지역 내에서는 동일 인물과 만남을 가지면서 근방의 이곳 저곳을 함께 돌아다니면서 배경을 달리하여 다량의 디지털 데이터를 생성할 가능성이 높기 때문인데, 이는 사람들의 일반적인 경향이라고 말할 수 있다.This idea of lump tagging can be applied to extract tags about people with high accuracy. That is, when a plurality of digital data is generated in the same cluster, if one digital data includes an image of a specific person, it is likely that the remaining digital data includes an image of the same person. It is likely that a large amount of digital data is generated by different poses about the same person within a short time period, and a large amount of digital data is generated by traveling around the neighborhood while meeting with the same person in a nearby area. This is because they are more likely to generate data, which is a general trend for people.

따라서, 이와 같은 클러스터 개념에 입각하여, 동일 시간대 또는 동일 지역권 내에 편중되어 생성된 디지털 데이터에 그 이미지가 포함되어 있는 인물들에 대한 정보를 디지털 데이터에 태깅함에 있어서, 상기와 같은 이유로 동일 클러스터 내에서는 동일 인물이 반복하여 나타날 확률이 높다는 점을 이용하면 인물 인식의 정확도를 상당 부분 높일 수 있게 된다.Therefore, based on such a cluster concept, in digital data tagging information about persons whose images are included in digital data generated by being concentrated in the same time zone or in the same region, the same cluster is used within the same cluster. Using the fact that the same person is likely to appear repeatedly can greatly increase the accuracy of person recognition.

한편, 동일 클러스터 내에서 이미 생성된 디지털 데이터의 이미지에 나타난 인물들의 출현 빈도에 대한 분석을 함으로써 동일 클러스터 내에서 가장 자주 사진이 찍혔다고 판단되는 후보군 리스트를 생성한 후, 동일 클러스터 내에서 디지털 데이터가 새로 생성되면 이미 생성되어 있는 후보군 리스트 중 가장 확률이 높게 배정되어 있는 인물 중의 하나가 새로 생성된 디지털 데이터의 이미지에 나타나는 인물로서 판단될 가능성이 상당히 높다고 볼 수 있다. 이에 따라, 새로 생성된 디지털 데이터의 이미지에 포함되는 인물의 정체가 밝혀지면 후보군 리스트에 포함된 각각의 인물들에 배정된 확률값에 대한 실시간 업데이트가 가능함은 물론이다. 이와 같은 프로세스는 이하에서 설명될 디시젼 퓨전, 예를 들면, 베이지안 분석법을 참조로 행해질 수 있다.On the other hand, by analyzing the frequency of appearance of people appearing in the image of the digital data already generated in the same cluster to generate a list of candidate groups that are determined to have been taken most often in the same cluster, and then digital data in the same cluster When newly created, one of the most probable persons among the candidate groups already generated is most likely to be determined as the person appearing in the newly generated digital data image. Accordingly, if the identity of the person included in the newly generated image of the digital data is revealed, it is of course possible to update in real time the probability value assigned to each person included in the candidate group list. This process can be done with reference to decision fusion, for example Bayesian analysis, which will be described below.

인물 인식을 위한 부가 정보 2Additional Information for Recognizing People 2

도 2는 본 발명의 다른 바람직한 실시예에 따라 2인 이상의 인물들이 만남을 가지기 전에 이메일, 전화 통화, SMS 메시지, 채팅(메신저 채팅) 등을 통해 의사 교환을 하기 마련이므로, 만남 중에 생성된 디지털 데이터의 이미지에 나타난 인물은 의사 교환 시의 발신자 또는 수신자에 해당될 확률이 높다는 점에 착안하여 인물 인식에 대한 정확도를 높일 수 있다는 것을 나타내는 도면이다.Figure 2 is a digital data generated during the meeting because the two or more persons according to another preferred embodiment of the present invention is to communicate through email, phone calls, SMS messages, chat (messenger chat), etc. before having a meeting This figure shows that the person represented in the image can increase the accuracy of the person recognition by paying attention to the high probability that the person corresponds to the sender or the receiver during the communication.

이는 도 2에 도시된 문자의 예시에서도 볼 수 있듯이, 일반적으로 사람들은 지인을 만나기 전에 약속 장소나 시간 등에 관해 결정을 하기 위하여 연락을 하기 쉽상이다. 이러한 연락은 이메일, 전화 통화, 핸드폰을 이용한 SMS 메시지 교환, 메신저 서비스를 이용한 채팅 등을 통해서 가능할 것이다. 도 2에 나타나 있는 문자 교환을 구체적으로 예로 들어 설명해 보면, Jiro와 Tanaka 사이에 약속 시간을 몇 시로 할 것인가에 대한 문자 연락이 있었고, 실제로 그 문자 연락 후 Jiro와 Tanaka가 같이 찍힌 사진들이 생성되었음을 알 수 있다. 따라서, 이와 같은 상관 관계를 십분 이용하면 사진 속에 나타나는 인물에 대한 인식의 정확도를 높일 수 있게 된다.As can be seen in the example of the characters shown in Figure 2, it is generally easy for people to contact you to make a decision about an appointment location, time, etc. before meeting a friend. Such contact may be possible through e-mail, telephone call, SMS message exchange using a mobile phone, and chat using a messenger service. Referring to the text exchange shown in FIG. 2 as an example, there was a text contact between Jiro and Tanaka on what time the appointment was scheduled, and after the text contact, it was found that the pictures taken by Jiro and Tanaka were generated. Can be. Therefore, by using the correlation as much as possible, it is possible to increase the accuracy of recognition of the person appearing in the picture.

이와 같은 부가 정보는 사람들의 일반적인 경향을 고려해 볼 때 상당히 강력한 툴이 될 수 있다. 이는 공간적으로 떨어져 있는 2인 이상의 사람이 만나 사진을 찍기 위해서는 공간적으로 떨어져 있는 사람들이 일정 시간이 흐른 후 어느 특정 장소에서 만나야 할 것을 필요조건으로 하는데, 공간적으로 떨어져 있는 사람들이 어느 특정 장소에서 만나기 위해서는 공간적으로 떨어져 있는 사람들 사이에서 장소 및 시간을 정하기 위한 의사 교환이 행해져야 하기 때문이다. 이와 같은 인간의 일반적인 경향을 적절히 사용하여 디지털 데이터의 이미지에 나타나는 인물에 대한 인식의 정확도를 상당히 높게 유지시킬 수 있음을 알 수 있다.This additional information can be a very powerful tool given the general trends of people. In order to meet two or more people who are separated from each other and take pictures, it is a requirement that the people who are spaced apart must meet at a certain place after a certain period of time. This is because communication must be made to establish a place and time between people who are spaced apart. It can be seen that the general tendency of human beings can be used appropriately to keep the accuracy of recognition of a person appearing in an image of digital data fairly high.

상기와 같이 디지털 데이터가 생성되기 전의 일정 기간 동안에 행하여진 의사 소통의 발신자 및 수신자를 파악함으로써 후보군 리스트가 생성된 후, 디지털 데이터가 새로 생성되면 이미 생성되어 있는 후보군 리스트 중 가장 확률이 높게 배정되어 있는 인물 중의 하나가 새로 생성된 디지털 데이터의 이미지에 나타나는 인물로서 판단될 가능성이 상당히 높다고 볼 수 있다. 이에 따라, 새로 생성된 디지털 데이터의 이미지에 포함되는 인물의 정체가 밝혀지면 후보군 리스트에 포함된 각각의 인물들에 배정된 확률 값에 대하여 실시간 업데이트 일어날 수 있음은 물론이며, 향후 또다시 새롭게 생성된 디지털 데이터의 이미지에 인물이 포함되어 있는 경우 업데이트된 확률 값을 가지고 있는 후보군 리스트를 참조로 하여 디지털 데이터의 이미지 내의 인물의 정체를 판단하게 된다. 이와 같은 프로세스는 이하에서 설명될 디시젼 퓨전, 예를 들면 베이지안 분석법을 참조로 행해질 수 있다.After the candidate group list is generated by identifying the sender and the receiver of the communication performed during the predetermined period of time before the digital data is generated as described above, when the digital data is newly generated, the most likely candidate group list already generated is assigned. It is very likely that one of the people is judged as the person appearing in the image of the newly generated digital data. Accordingly, if the identity of the person included in the newly generated digital data image is revealed, it may be possible to update in real time with respect to the probability value assigned to each person included in the candidate group list. When a person is included in the image of the digital data, the identity of the person in the image of the digital data is determined by referring to the candidate group list having the updated probability value. This process can be done with reference to decision fusion, for example Bayesian analysis, described below.

인물 인식을 위한 부가 정보 3Additional Information for Recognizing People 3

도 3a 및 3b는 본 발명의 또 다른 바람직한 실시예에 따라 복수의 사진에 나타난 적이 있는 모든 인물 중 일부 친분있는 인물만이 사진에 자주 나타난다는 점에 착안하여 인물 인식에 대한 정확도를 높일 수 있다는 것을 나타내는 도면이다.3A and 3B illustrate that only some familiar persons among all persons who have been shown in the plurality of photographs frequently appear in the photographs according to another preferred embodiment of the present invention, thereby improving accuracy of person recognition. It is a figure which shows.

도 3a에서 x축은 모든 사진에 나타나는 개별 인물의 총수를 100으로 둘 때 모든 사진에 가장 자주 등장하는 인물을 내림차순으로 소팅한 것이며, y축은 모든 사진을 100%으로 가정할 때의 사진 수를 나타내는데, 이 때 xy평면에 도시된 그래 프 상의 좌표 값(X, Y)들의 궤적은 사진에 가장 자주 등장하는 인물 상위 X명이 등장하고 있는 사진 수가 총 Y %라는 것을 나타내는 것이다.In FIG. 3A, when the total number of individual people in all pictures is 100, the x-axis is a sorting order in descending order of the most frequently appearing people in all pictures, and the y-axis indicates the number of pictures when all pictures are assumed to be 100%. At this time, the trajectory of the coordinate values (X, Y) on the graph shown on the xy plane indicates that the total number of Y pictures of the top X people who appear most frequently in the picture appears.

예를 들면, 도 3a에 따르면, 모든 사진을 통틀어 볼 때 가장 자주 등장하는 상위 9명이 나타나는 사진은 총 사진의 37%에 해당함을 알 수 있다. 이는 총 인물수가 100명임을 고려할 때 9명은 9%에 불과한 수치이지만, 이러한 9%의 인물들이 차지하는 비중은 총 사진의 37%에까지 이르게 된다는 것으로서 도 3a에서 볼 수 있듯이 그래프가 상당히 좌상 측으로 치우쳐져 있음을 알 수 있다.For example, according to FIG. 3A, it can be seen that the photos of the top nine who appear most frequently in all photographs correspond to 37% of the total photographs. This figure is only 9%, considering that the total number of people is 100, but the percentage of these 9% accounts for up to 37% of the total picture. As shown in FIG. 3A, the graph is very biased to the upper left side. It can be seen.

xy평면에서 정비례하는 그래프라면 모든 인물이 동일한 빈도로 사진에 나타나고 있다는 것이므로 얼굴 인식을 위한 부가 정보로서 전혀 효용 가치가 없는 것이지만 실제로 사람들은 자신과 친분관계에 있는 자들과 보다 많은 사진을 찍는 것이 일반적인 경향이므로 도 3a에서와 같이 정비례 그래프로부터 상당히 벗어난 그래프의 모양을 가지게 된다면 얼굴 인식을 위한 부가 정보로서 상당한 효용 가치를 지니게 되며, 이와 같은 상관 관계에 근거하여 사진 속에 나타나는 인물에 대한 인식의 정확도를 높일 수 있게 된다. A graph that is directly proportional to the xy plane is that all the people appear in the photo at the same frequency, so it's not useful at all as additional information for face recognition, but in fact, people tend to take more photos with people they're acquainted with. Therefore, as shown in FIG. 3A, if the graph has a shape that deviates significantly from the direct proportional graph, it has significant utility value as additional information for face recognition, and it is possible to increase the accuracy of recognition of the person in the picture based on the correlation. Will be.

도 3b는 x축이 사진 수를 나타내고 y축이 사람수를 나타낼 때의 막대그래프(X, Y)이다. 여기서, 막대그래프(X, Y)는 Y 명은 자신의 얼굴이 나타나는 사진이 X매 있음을 나타낸다. 예를 들면, X 값이 1에 해당할 때 Y 값이 24라는 것은 모든 사진을 고려할 때 24명은 자신의 얼굴이 나온 사진을 1 매 가지고 있다는 것이고, X 값이 40에 해당할 때의 Y 값이 1 이라는 것은 모든 사진을 고려할 때 1명은 자신의 얼굴이 나온 사진을 무려 40 매나 가지고 있다는 것을 나타낸다.3B is bar graphs (X, Y) when the x-axis represents the number of pictures and the y-axis represents the number of people. Here, the bar graphs (X, Y) indicate that Y people have X pictures of their faces. For example, a Y value of 24 for an X value of 1 means that in all photos, 24 people have one photo of their face, and a Y value of 40 for an X value of 1 means that when we consider all the pictures, one person has as many as 40 pictures of his face.

도 3b를 보면 3개의 점선 박스 모양이 나타나는데, 가장 오른쪽 점선 박스는 가장 높은 빈도로 사진이 찍힌 사람이므로 셀프 샷(self shot)이나 가족 사진일 확률이 높고, 가운데 점선 박스는 그 다음 높은 빈도로 사진이 찍힌 사람이므로 가장 친한 친구들일 확률이 높으며, 왼쪽 점선 박스는 그 다음 높은 빈도로 사진이 찍힌 사람이므로 친한 친구들일 확률이 높다. 왼쪽 점선 박스 보다도 왼쪽에 있는 막대그래프에 속하는 사람들은 별로 친분관계가 없는 사람에 해당될 확률이 높다고 할 수 있는데, 이에 대해서는 당연히 개인차가 있을 수 있으며 이러한 개인차는 본 명세서 곳곳에서 언급되고 있는 업데이트 과정을 통하여 반영될 수 있음은 물론이다. 예를 들면, 가장 오른쪽 점선 박스에 해당하는 사람이 셀프 샷이나 가족이기 보다는 가장 친한 친구들일 수도 있으며, 이는 특정 인물의 주변 인간 관계에 대해 피드백을 받아, 즉 이전에 찍혔던 사진들에 대한 인물 분석을 통하여 피드백을 받아, 각각의 점선 박스에 해당하는 인물 그룹의 정체에 대해 수정을 가할 수 있음은 물론이며, 이에 따라 사진에 나타나는 인물들에 대한 얼굴 인식의 정확도를 높일 수 있게 된다.3B, three dotted box shapes appear. Since the rightmost dotted box is the person who is photographed at the highest frequency, it is more likely to be a self shot or a family picture, and the middle dotted box is photographed at the next higher frequency. This person is most likely a best friend because the person is taken, and the left dotted box is most likely a best friend since the person was taken the next highest frequency. Those who belong to the bar graph on the left side of the left dashed box are more likely to be people who do not have a close relationship with each other. Of course, there may be individual differences. Of course, it can be reflected through. For example, the person in the right-most dotted box may be a best friend rather than a self-shot or family member, which receives feedback about the surrounding relationships of a particular person, ie, portrait analysis of previously taken pictures. By receiving the feedback through, it is possible to modify the identity of the group of people corresponding to each dotted box, as well as to improve the accuracy of face recognition for the people appearing in the picture.

요컨대, 이미 생성되어 있는 디지털 데이터에 나타난 모든 인물 중 가장 높은 빈도로 나타난 상위 인물을 명단으로 하는 후보군 리스트가 생성된 후, 디지털 데이터가 새로 생성되면 이미 생성되어 있는 후보군 리스트 중 가장 확률이 높게 배정되어 있는 인물 중의 하나가 새로 생성된 디지털 데이터의 이미지에 나타나는 인물로서 판단될 가능성이 상당히 높다고 볼 수 있다. 이와 같은 프로세스를 거쳐 새로 생성된 디지털 데이터의 이미지에 나타나는 인물의 정체가 밝혀지면 후보군 리스트에 포함된 각각의 인물들에 배정된 확률 값에 대하여 실시간 업데이트 일어날 수 있음은 물론이며, 향후 또다시 새롭게 생성된 디지털 데이터의 이미지에 인물이 포함되어 있는 경우 업데이트된 확률 값을 가지고 있는 후보군 리스트를 참조로 하여 디지털 데이터의 이미지 내의 인물의 정체를 판단하게 된다. 이와 같은 프로세스는 이하에서 설명될 디시젼 퓨전, 예를 들면 베이지안 분석법을 참조로 행해질 수 있다.In short, after a candidate list is created that lists the top person who has the highest frequency among all the characters in the digital data already created, and the digital data is newly generated, the probability is most likely among the already created candidate list. It is very likely that one of the figures is judged as the person appearing in the image of the newly generated digital data. Through this process, if a person's identity appears in the newly created digital data image, it is possible to update in real time with respect to the probability value assigned to each person included in the candidate group list. When a person is included in the image of the digital data, the identity of the person in the image of the digital data is determined by referring to the candidate group list having the updated probability value. This process can be done with reference to decision fusion, for example Bayesian analysis, described below.

인물 인식을 위한 부가 정보 4Additional Information for Recognizing People 4

사진 속에 나타나는 인물의 얼굴에 대한 인식의 정확도를 높이기 위해서는 해당 사진에 자동 또는 수동 태깅되어 있는 태그를 참조할 수 있다. 예를 들어, 자동 또는 수동 태깅되어 있는 공간 태그나 시간 태그를 참조할 수 있다. 이와 같이 태그를 참조로 하고 이에 특정 인물의 생활 패턴 정보를 부가하여 사진 속의 인물에 대한 인식을 행할 수 있다.In order to increase the accuracy of recognition of the face of the person in the picture, a tag automatically or manually tagged in the picture may be referred to. For example, you can refer to spatial tags or time tags that are automatically or manually tagged. As described above, the tag may be referred to and life pattern information of a specific person may be added to recognize the person in the picture.

예를 들면, 사진이 주말에 생성된 것일 때(즉, 디지털 데이터의 시간 태그가 주말에 해당될 경우) 사진 속의 인물이 가족이나 친구일 확률이 높다고 가중치를 둘 수도 있고, 사진이 주중에 생성된 것일 때(즉, 디지털 데이터의 시간 태그가 주중에 해당될 경우) 사진 속의 인물이 동료일 확률이 높다고 가중치를 둘 수 있으며, 술집에서 사진이 찍힌 것일 때(즉, 디지털 데이터의 공간 태그가 술집에 해당될 경우) 사진 속의 인물이 친구나 동료일 확률이 높다고 가중치를 둘 수도 있고, 집안에서 사진이 찍힌 것일 때(즉, 디지털 데이터의 공간 태그가 집에 해당될 경 우) 사진 속의 인물이 가족일 확률이 높다고 가중치를 둘 수도 있다.For example, if a photo was created on the weekend (that is, if the time tag in the digital data falls on the weekend), you can weight the person in the photo most likely to be a family or friend, or the photo was created during the week. Can be weighted if the person in the picture is more likely to be a colleague, and when the picture was taken at the bar (i.e., the spatial tag of the digital data is at the bar). If applicable, you can weight the person in the picture to be more likely to be a friend or colleague, or if the picture was taken at home (i.e. if the spatial tag of the digital data is at home), the person in the picture is a family member. You can also weight the probability.

마찬가지로, 다른 생활 패턴을 가지고 있는 사람이라면, 예를 들어, 특정 친구와는 술집에서 만나지 않고 놀이 공원에서 주로 만나는 생활 패턴이라든지 주말에는 가족이나 친구와 같이 있지 않고 혼자 있는 자취생과 같은 경우라면, 그 사람의 과거 사진들의 생활 패턴을 자동 분석하여 그 생활 패턴에 맞게 확률의 가중치를 수정할 수 있음은 물론이다. Similarly, if you have a different lifestyle, for example, a lifestyle where you don't meet at a bar with a friend, usually at an amusement park, or if you are a single student who is not alone with your family or friends on the weekend, Of course, by automatically analyzing the life patterns of the past photos of the weight of the probability can be modified according to the life pattern.

요컨대, 이미 생성되어 있는 디지털 데이터에 나타난 특정 인물의 생활 패턴 또는 특수한 습성을 분석 및 파악하여 특정 시간대 및 특정 지역에서 생성된 디지털 데이터의 이미지에 나타나는 인물로 판단될 가능성이 높은 인물을 명단으로 하는 후보군 리스트를 생성한 후, 디지털 데이터가 새로 생성되면 이미 생성되어 있는 후보군 리스트 중 가장 확률이 높게 배정되어 있는 인물 중의 하나가 새로 생성된 디지털 데이터의 이미지에 나타나는 인물로서 판단될 가능성이 상당히 높다고 볼 수 있다. 이와 같은 프로세스를 거쳐 새로 생성된 디지털 데이터의 이미지에 나타나는 인물의 정체가 밝혀지면 후보군 리스트에 포함된 각각의 인물들에 배정된 확률값에 대하여 실시간 업데이트가 일어날 수 있음은 물론이며, 향후 또다시 새롭게 생성된 디지털 데이터의 이미지에 인물이 포함되어 있는 경우 업데이트된 확률 값을 가지고 있는 후보군 리스트를 참조로 하여 디지털 데이터의 이미지 내의 인물의 정체를 판단하게 된다. 이와 같은 프로세스는 이하에서 설명될 디시젼 퓨전, 예를 들면 베이지안 분석법을 참조로 행해질 수 있다.In short, a list of candidates whose names are most likely to be judged as people who appear in images of digital data generated at specific time zones and regions by analyzing and understanding the life patterns or special habits of specific people appearing in the digital data already generated. After the list is generated, when digital data is newly generated, it is highly likely that one of the most probable persons among the candidate groups already generated is judged as the person appearing in the image of the newly created digital data. . Through this process, if a person's identity appears in the newly created digital data image, the real-time update may occur for the probability value assigned to each person included in the candidate group list. When a person is included in the image of the digital data, the identity of the person in the image of the digital data is determined by referring to the candidate group list having the updated probability value. This process can be done with reference to decision fusion, for example Bayesian analysis, described below.

인물 인식을 위한 부가 정보 5Additional Information for Recognizing People 5

사진 속에 나타난 얼굴의 위치 및 크기를 분석하여 보면 얼굴 인식의 정확도를 높이는 데에 사용될 수 있음을 알 수 있다. 예를 들어, 사진의 센터에 하나의 큰 얼굴이 나타나면, 그것은 자신을 찍은 셀프샷일 수 있으며, 사진의 센터에 하나의 큰 얼굴이, 그 주변에 아주 작은 얼굴이 나타나면 센터의 큰 얼굴은 셀프샷이고 그 주변의 얼굴들은 모르는 사람들(예를 들면, 보행자)이 될 확률이 높다고 할 수 있다. 또한, 비슷한 크기의 얼굴이 두 개 나타나고, 그 두 얼굴 중 하나가 이미 나 자신으로 태깅되어 있는 상태라면 나머지 한 얼굴의 정체는 나의 친한 친구이거나 여자 친구일 확률이 높을 수 있다. 이 때, 상기에서 설명된 바와 같이, 과거 사진에 나타나는 인물들에 대한 분석에 따라, 즉 사진 주인공의 일반적인 경향 또는 특수한 습성에 따라, 확률의 가중치에 대하여 실시간 업데이터가 일어날 수 있음은 물론이다. 이와 같은 프로세스는 이하에서 설명될 디시젼 퓨전, 예를 들면 베이지안 분석법을 참조로 행해질 수 있다.Analyzing the position and size of the face in the picture shows that it can be used to increase the accuracy of face recognition. For example, if one big face appears in the center of the picture, it could be a self-shot of the person. If a big face appears in the center of the picture, and a very small face around it, the big face in the center is a selfie. Faces around them are more likely to become strangers (eg pedestrians). Also, if two faces of the same size appear and one of the two faces is already tagged with myself, the identity of the other face may be my best friend or girlfriend. At this time, as described above, according to the analysis of the people appearing in the past picture, that is, the real-time updater may occur with respect to the weight of the probability, depending on the general tendency or special behavior of the main character of the picture. This process can be done with reference to decision fusion, for example Bayesian analysis, described below.

이 밖에도 얼굴 인식에 대한 정확도를 높이기 위한 여러 방법이 있을 수 있는데, 예를 들면, 듀얼 카메라의 경우, 카메라 사용자 쪽을 향하고 있는 렌즈를 통해 찍힌 인물의 정체는 자신일 확률이 높을 수 있고 그렇지 않은 쪽을 향하고 있는 렌즈를 통해 찍힌 인물의 정체는 나를 제외한 친구일 확률이 높을 수 있다. 셀프샷이 전체 사진의 상당한 부분을 차지한다고 가정할 때 렌즈가 카메라 사용자 쪽을 향하지 않을 경우 셀프샷을 배제하게 되므로 얼굴 인식의 정확도가 높아지게 된다.There are many other ways to improve the accuracy of face recognition. For example, in the case of dual cameras, the identity of the person shot through the lens facing the camera user may be more likely to be himself or herself. The identity of the person shot through the lens facing the camera may be a friend except me. Assuming self-shots make up a significant portion of the entire picture, the accuracy of face recognition increases because the self-shots are excluded if the lens is not facing the camera user.

디시젼Decision 퓨전을 통한 인물 인식을 위한 부가 정보의 결합 Combining Additional Information for Recognizing People through Fusion

도 4는 상기에서 언급된 여러 개의 바람직한 실시예에 나타난 단서를 디시젼 퓨전, 예를 들어 베이지안 분석법을 통하여 결합하여 총합적으로 모든 팩터를 고려함으로써 새롭게 생성된 디지털 데이터의 이미지에 포함되는 인물에 해당될 확률을 가장 높게 배정받은 상위 n 명의 리스트, 일명 Top n 리스트를 만드는 프로세스를 나타내는 도면이다. n은 자연수이며, 여기서는 예를 들어 n = 9로 한다.4 corresponds to a person included in an image of digitally generated digital data by combining the clues shown in the several preferred embodiments mentioned above through decision fusion, e.g. Bayesian analysis, taking into account all the factors in total. This is a diagram illustrating a process of making a list of the top n people, aka top n lists, with the highest probability of becoming a candidate. n is a natural number, for example, let n = 9.

디시젼 퓨전이란 시간의 흐름에 따라 축적되는 정보를 결합하여 과학적 질문에 답하기 위한 반복적인 프로세스로서, 이슈가 되고 있는 주제에 대한 현재 상태를 평가하고 나머지 질문에 대한 답을 찾기 위해 새로운 데이터를 수집하며, 그 후 수집된 새로운 데이터 및 예전 데이터를 결합함으로써 지식을 업데이트하고 오차를 줄이는 효과를 가지는 분석법이다. 요컨대, 복수의 단서(multiple cue)와 선험적 확률(a priori probability)로부터 특정 이슈에 대해 판단을 내리는(decision making) 프로세스를 일컫는다.Decision Fusion is an iterative process for answering scientific questions by combining information that accumulates over time, evaluating the current state of the subject matter and collecting new data to find answers to the remaining questions. It is an analysis method that has the effect of updating knowledge and reducing errors by combining new data collected with old data. In short, it refers to the process of making a decision on a specific issue from multiple cues and a priori probability.

디시젼 퓨전에는 여러가지 방법이 있는데, 각각의 단서들로부터 확률을 구하고, 이 확률을 합쳐서 (예를 들면, 가중 확률 합(weighted probability sum)) 최종 확률을 구하여 Top n 리스트를 구하는 애드 혹(ad-hoc) 방법부터 애드 혹 방법과는 달리 각각의 단서들을 모두 고려한 확률을 계산하여 Top n 리스트를 구하는 베이지안 분석법(http://en.wikipedia.org/wiki/Bayesian_inference)에 이르기까지 여러가지 방법이 있다. 일반적으로 단서들은 상호 독립이 아니므로 (시간과 장소는 관련(correlate)되어 있는 경우가 있다. 예를 들면, 샐러리맨의 경우 주중 낮(=시 간)에는 보통 사무실(=장소)에 있을 것이다), 애드 혹 방법에서와 같이 각각의 단서들로부터 확률을 구하고, 이 확률을 합쳐서 최종 확률을 구하여 Top n 리스트를 구하는 것은 최적이 아닐수도 있으므로 본 발명은 베이지안 분석법에 의한 구현을 선호하지만 그렇다고 애드 혹 방법에 의한 구현을 배제하는 것은 아님을 명확히 밝혀둔다. There are several ways of decision fusion, which involves taking the probabilities from each of the clues, then combining those probabilities (e.g., weighted probability sum) to get the final probabilities to find the top n list. Unlike the hoc method and the ad hoc method, there are various methods ranging from Bayesian analysis ( http://en.wikipedia.org/wiki/Bayesian_inference ) to calculate the probability considering each clue and obtain the Top n list. In general, clues are not mutually independent (time and place are often correlated; for example, a salaryman will usually be in the office during the day of the week). As in the ad hoc method, it may not be optimal to obtain a probability from each of the clues, and then combine the probabilities to obtain the final probability, so that the present invention prefers the implementation by Bayesian analysis, but the ad hoc method It is clear that this does not preclude implementation.

이와 같은 베이지안 분석법을 통하여 인물 인식을 위한 부가 정보(상기에서 언급된 실시예들로부터 얻어진 후보군 리스트) 각각을 결합하여 Top 9 리스트를 생성할 수 있다. 도 4에서 볼 수 있듯이 왼편의 박스의 내용은, 사진 속에 나타난 인물이 100명이라고 가정할 때 상기에서 언급되었던 인물 인식을 위한 부가 정보 모두를 이용하여 계산된 인물 100명 각각의 확률(사진에 나타난 인물로서 판단될 확률)을 나타낸다. 이와 같은 박스 내에 기재된 확률은 베이지안 분석법 특유의 성질상 디지털 데이터가 새로이 생성될 때마다 계속 업데이트되며, 업데이트된 확률을 근거로 추후에 생성될 디지털 데이터의 이미지에 나타나는 인물의 정체를 추정하게 되는 바, 각각의 확률이 동적으로 변경되어 정확한 예측을 가능하게 해준다. 이와 같은 베이지안 분석법에 의해 오른쪽 박스와 같은 Top 9 리스트를 생성하게 되는 것이다. 생성된 Top 9 리스트를 참조하면 사진 속에 나타난 인물에 대한 얼굴 인식 시 생길 수 있는 오류를 상당 부분 제거할 수 있게 된다.Through the Bayesian analysis method, the Top 9 list may be generated by combining each of the additional information (list of candidate groups obtained from the above-mentioned embodiments) for person recognition. As shown in FIG. 4, the content of the box on the left shows the probability of each of the 100 persons calculated using all of the above-mentioned additional information for recognizing the person, assuming that there are 100 people in the picture (represented in the picture). Probability of being judged as a person). Due to the unique nature of Bayesian methods, the probability described in this box is continuously updated whenever new digital data is generated, and based on the updated probability, the identity of the person appearing in the image of digital data to be generated is estimated. Each probability is changed dynamically to allow accurate prediction. This Bayesian analysis generates the Top 9 list as the right box. Referring to the created Top 9 list, it is possible to remove a large part of errors that may occur when face recognition for the person in the picture.

부가 태그 추천 기능Additional tag recommendation function

도 5는 본 발명의 바람직한 실시예에 따라, 부가 태그를 추천하는 방법에 대 한 플로우 차트이다.5 is a flow chart for a method for recommending additional tags, in accordance with a preferred embodiment of the present invention.

디지털 데이터 관리 시스템의 DB에 복수의 기존 디지털 데이터가 저장되어 있는 상태에서 특정 사용자가 디지털 기기를 사용하여 특정 장소에서 하루 중의 특정 시간대에 특정 인물 또는 특정 사물의 이미지 중 적어도 하나를 포함하는 디지털 데이터를 생성한다(단계 501). 이 때, 생성된 디지털 데이터에는 장소, 인물, 사물, 시간대 중 적어도 하나가 태그로서 부여되어 있으며, 이들 태그는 디지털 데이터의 생성과 함께 자동적으로 디지털 데이터에 부여된다(동일한 출원인이 출원한 한국특허출원 제2006-14040호 참조). When a plurality of existing digital data are stored in a DB of a digital data management system, a specific user uses a digital device to display digital data including at least one of an image of a specific person or a specific object at a specific time of day in a specific place. Create (step 501). At this time, at least one of a place, a person, an object, and a time zone is assigned as a tag to the generated digital data, and these tags are automatically assigned to the digital data together with the generation of the digital data (the Korean patent application filed by the same applicant). No. 2006-14040).

그 후, 생성된 디지털 데이터에 자동적으로 부여된 태그와 동일한 태그를 가지는 디지털 데이터를 DB에서 검색한다(단계 502).Thereafter, the digital data having the same tag as the tag automatically assigned to the generated digital data is retrieved from the DB (step 502).

그 다음, 검색이 완료된 1 개 이상의 디지털 데이터에 부여되어 있는 모든 태그로부터 단계 502의 동일한 태그를 제외한 것을 나머지 태그라고 일컬을 때, 나머지 태그 각각이 검색된 디지털 데이터에 얼마나 자주 부여되었는지 각각의 빈도를 계산한다(단계 503). Then, when all of the tags assigned to the one or more digital data for which the search is completed are referred to as the remaining tags except for the same tag in step 502, the frequency of each is calculated how often each of the remaining tags is assigned to the retrieved digital data. (Step 503).

그 후, 나머지 태그 중 빈도가 가장 높은 N 개의 태그를 디지털 기기의 화면에 표시하여 사용자로 하여금 선택할 수 있는 기회를 주며(단계 504), 사용자는 화면에 표시된 N 개의 태그 중 적어도 하나를 선택한다(단계 505).Thereafter, the N tags with the highest frequency among the remaining tags are displayed on the screen of the digital device to give the user a chance to select (step 504), and the user selects at least one of the N tags displayed on the screen ( Step 505).

이에 따라 선택된 항목은 새로 생성된 디지털 데이터에 추가적으로 부여되어 부가 태그로서 기능하게 된다(단계 506). Accordingly, the selected item is added to the newly generated digital data to function as an additional tag (step 506).

예를 들면, 특정 사용자가 자신의 여자 친구와 사진을 찍을 때 대부분 특정 백화점에서 쇼핑을 하고 있을 때가 많다는 이유로 디지털 데이터에 '쇼핑'이라는 태그를 붙일 때가 많고, 또한 디지털 데이터에 '데이트'라는 태그를 붙일 때가 많다고 가정한다. 상기에서 언급된 한국특허출원 제2006-14040호의 명세서에서 볼 수 있듯이 특정 사용자에 의해 생성된 디지털 데이터에는 GPS에 의한 디지털 기기의 위치 추적으로 디지털 데이터가 생성된 장소를 알아내어 '특정 백화점'이라고 자동적으로 태그가 붙을 수 있으며, 본 발명에 의해 사진 속의 인물에 대한 얼굴 인식을 행함으로써 사진 속의 인물이 여자 친구임이 밝혀지면 여자 친구의 이름이 자동적으로 태그로서 붙을 수 있다. 이와 같이 자동적으로 부여된 장소 태그와 인물 태그 등을 검색어로 하여 DB에서 이와 동일한 태그를 가지고 있는 디지털 데이터를 검색하면, 이들 디지털 데이터에 가장 높은 빈도로 출현하는 태그는 아마도 '쇼핑' 이라는 태그 또는 '데이트'라는 태그일 것이다. 이와 같은 빈도를 디시젼 퓨전, 예를 들면 베이지안 분석법에 의해 계산하여 가장 빈도가 높은 n 개의 태그인 Top n 리스트, 예를 들면 Top 9 리스트를 생성할 수 있으며, 이들 리스트를 디지털 기기의 화면에 표시하여 사용자로 하여금 선택할 수 있게 해 준다. 따라서, 종래 기술인 플리커 등에서는 구현될 수 없었던 사용자 친화적인 태그 입력 방법을 구현할 수 있게 되는 것이다.For example, when a particular user takes a picture with her girlfriend, most of the time they are shopping at a particular department store, they often tag the digital data with "shopping" and also tag the digital data with "dating." Assume that there are many times to attach. As can be seen in the specification of Korean Patent Application No. 2006-14040 mentioned above, the digital data generated by a specific user is automatically identified as a 'specific department store' by finding a place where the digital data is generated by tracking the position of the digital device by GPS. The name of the girlfriend can be automatically tagged as a tag if it is found that the person in the picture is a girlfriend by performing face recognition on the person in the picture according to the present invention. When searching for digital data having the same tag in the DB using search terms such as place tag and person tag automatically assigned, the tag that appears most frequently in these digital data is probably called 'shopping' or ' It might be a tag called "date." This frequency can be calculated by decision fusion, e.g. Bayesian analysis, to generate the top n list, e.g., the Top 9 list, of the most frequent n tags, and display them on the screen of the digital device. To allow the user to make choices. Therefore, the user-friendly tag input method that could not be implemented in the prior art flicker, etc. can be implemented.

한편, 베이지안 분석법에 의해 Top n 리스트는 실시간으로 업데이트되게 되므로, 또 다른 디지털 데이터가 생성되면 그 직전에 생성된 디지털 데이터의 태그를 반영하여 업데이트 된 Top n 리스트로부터 부가 태그를 선택할 수 있는 기회를 사용자에게 제공할 수 있게 된다.Meanwhile, the top n list is updated in real time by Bayesian analysis, so when another digital data is generated, the user can select an additional tag from the updated top n list by reflecting the tag of the digital data generated immediately before. It can be provided to.

본 발명은 디지털 데이터로부터 인물 식별 정보를 추출할 때 보다 높은 정확도로 추출할 수 있도록 하며, 구체적으로는 디지털 데이터 관리 시스템에서, 디지털 데이터의 속성 및 내용에 따라 디지털 데이터로부터 인물 식별 정보를 추출하여 이를 기초로 디지털 데이터에 자동적으로 태그를 부여할 때, 사진 속의 인물들에 대한 제반 부가 정보를 활용하여 인물 식별 정보가 보다 정확하게 태깅될 수 있도록 한다.The present invention enables to extract with higher accuracy when extracting the person identification information from the digital data, specifically, in the digital data management system, by extracting the person identification information from the digital data according to the attributes and content of the digital data When automatically tagging digital data on the basis, the additional information on the people in the picture is utilized so that the person identification information can be tagged more accurately.

또한, 본 발명은, 디시젼 퓨전을 사용하여 새로 생성된 디지털 데이터에 부여되어 있는 태그와 동일한 태그를 가지고 있는 기존의 디지털 데이터에 자주 사용된 태그를 파악하여 이를 디지털 데이터에 추가적으로 부여함으로써 사용자의 입력 부담을 상당히 경감시키는 역할을 한다. In addition, the present invention, by using the Fusion Fusion to identify the tag frequently used in the existing digital data having the same tag as the tag given to the newly created digital data and additionally assign it to the digital data input of the user It is a very relieving burden.

비록 본 발명이 상술된 바와 같은 여러 실시예들과 관련하여 설명되었지만, 본 발명이 속하는 기술분야의 당업자가 이해할 수 있는 본 발명의 정신 및 범위를 벗어나지 않고 다른 변형예 및 변경예가 구현될 수 있다는 점이 이해되어야 할 것이다.　 그러한 변형예 및 변경예는 본 발명과 첨부된 특허청구범위의 범위 내에 속하는 것으로 고려되어야 할 것이다.Although the present invention has been described in connection with various embodiments as described above, it is to be understood that other variations and modifications may be implemented without departing from the spirit and scope of the invention as would be understood by those skilled in the art. It should be understood. Such modifications and variations are to be considered as falling within the scope of the invention and the appended claims.

Claims

A method of assigning identification information of a first person as a tag to first digital data generated by a digital device in a digital data management system, wherein the digital data management system includes identification information and life pattern information of a user of the digital device. The first digital data comprises an image of the first person,

Identifying the first person using at least one of the additional information about the first person, and

Automatically assigning identification information of the first person as a tag to the first digital data;

The additional information,

(a) the first person when the first digital data group consisting of at least one digital data different from the first digital data distributed in the same cluster as the first digital data includes an image of a second person; Information indicating a high probability of being the same person as the second person,

(b) if it is determined that the first digital data was generated during two or more encounters, information indicating that the image of the first person is likely to be a sender or a receiver of the communication associated with the encounter,

(c) when the image of the third person is included m times in the second group of digital data composed of at least one digital data different from the first digital data, the number of times of the m is increased. Information indicating that there is a high probability of being the same person as 3 persons, wherein m is a natural number;

(d) information about the first person obtained by considering at least one of a generation time and a place of the first digital data together with the life pattern information of the user of the digital device;

(e) information indicating a probability of identifying the first person as a specific person according to the position and size of the face of the first person in the image of the first person,

(f) if the digital device is a dual camera, the first person is a user of the digital device if the first digital data is generated when the lens of the dual camera is directed toward the user of the digital device. Information is high, and

(g) if the digital device is a dual camera, the first person is likely to be not the user of the digital device if the first digital data is generated when it is not directed toward the user of the digital device;

Containing at least one of

Way.

The method of claim 1,

Identifying the first person comprises combining the additional information by decision fusion,

The decision fusion is a generic term for the process of making a decision on a particular issue from multiple cues and a priori probability.

Way.

The method of claim 2,

The decision fusion may include a Bayesian assay or an ad hoc assay.

Way.

The method of claim 3,

The identifying of the first person may include generating a Top n list that is a list of n candidate groups having the highest probability of being the first person based on a result of combining the additional information by the Bayesian analysis or the ad hoc analysis. Containing more

Way.

The method of claim 4, wherein

The Top n list is updated in real time by the Bayesian method or the ad hoc method.

Way.

The method according to claim 4 or 5,

The step of automatically assigning the identification information of the first person as a tag to the first digital data includes: information on the person having the highest probability of being the first person among the candidate groups of the Top n list of the first person; Determining as identification information

Way.

The communication system of claim 1, wherein the communication is at least one of email communication, wired or wireless telephone call communication, SMS message communication, and messenger communication.

Way.

The method of claim 1,

The (b) information—information about the recipient or originator of the communication associated with the encounter—is obtained from a communication device used in connection with the encounter.

Way.

The method of claim 1,

The (b) information-information about the recipient or originator of the communication associated with the encounter-is obtained by recognizing the content of the communication.

Way.

The method of claim 1,

The (d) information-information about the first person obtained by considering at least one of a generation time and a place of the first digital data together with the life pattern information of the user of the digital device;

When the generation time of the first digital data corresponds to a holiday, it is information indicating that the probability that the first person is a family or friend of the user of the digital device is high.

When the generation time of the first digital data does not correspond to a holiday, the information indicates that the first person is more likely to be a coworker of the user of the digital device.

Way.

The method of claim 1,

When the place where the first digital data is generated corresponds to the bar, the first person is information indicating that the probability of the first person being a friend or a workmate of the user of the digital device is high.

When the place where the first digital data is generated corresponds to a home, the first person is information indicating that the probability of the first person being a family of the user of the digital device is high.

Way.

The method of claim 11,

The generation place of the first digital data is identified by referring to a tag of the first digital data.

Way.

The method of claim 1,

And when the first digital data further comprises an image of a fourth person, identifying the first person according to the information and the position and size of the face of the first person in the image of the first person. Information indicating the probability of identifying by-

If the face of the fourth person in the image of the fourth person is located in the center of the entire image of the first digital data, and the face of the fourth person is significantly larger than the face of the first person, the first person Information that indicates a low probability of identifying as a particular person.

Way.

The method of claim 1,

And when the first digital data further comprises an image of a fifth person, identifying the first person according to the information and the position and size of the face of the first person in the image of the first person. Information indicating the probability of identifying by-

If the size of the face of the first person is similar to the size of the face in the image of the fifth person, the first person is information indicating that the probability of being a friend of the fifth person is high.

Way.

The method of claim 14,

The fifth person is identified by referring to a tag of the first digital data.

Way.

A plurality of existing digital data is stored in a database included in the digital data management system, and a specific user includes at least one of an image of a specific person P or a specific object O at a specific time S of the day at a specific place S using a digital device. Generating first digital data, wherein at least one of the place S, the person P, the object O, and the time zone T is assigned as a primary tag to the first digital data; A method of adding a secondary tag as identification information to the first digital data,

Searching for identical_tag_digital data having the same tag as the primary tag among the existing digital data;

Calculating the frequency of each frequency of how often the remaining tags except the same tag as the primary tag among the tags assigned to the same tag digital data are assigned to the same tag digital data,

Displaying the N tags having the highest frequency among the remaining tags on a screen of the digital device;

The user selecting at least one of the N tags displayed on the screen as the secondary tag of the first digital data, and

Assigning the selected tag to the first digital data as the secondary tag;

Way.

The method of claim 16,

The frequency is calculated by decision fusion,

Way.

The method of claim 17,

The decision fusion includes Bayesian analysis or ad hoc analysis

Way.

The method of claim 18,

The N tags are updated in real time

Way.