KR101086773B1

KR101086773B1 - Method and apparatus for clustering of digital photos obtained from multiple cameras

Info

Publication number: KR101086773B1
Application number: KR1020080138033A
Authority: KR
Inventors: 조환규; 장철진
Original assignee: 포항공과대학교 산학협력단; 부산대학교 산학협력단
Priority date: 2008-12-31
Filing date: 2008-12-31
Publication date: 2011-11-25
Also published as: KR20100079514A

Abstract

본 발명은 여러대의 디지털 카메라로 촬영된 사진들을 구조화하고 그래프 알고리듬을 적용하는 것에 의해 사진들을 상황 기반으로 유효하게 분류할 수 있도록 한 다중 카메라로 촬영된 디지털 사진의 클러스터링을 위한 방법 및 장치에 관한 것으로, 다수의 카메라, 촬영자로부터 입수된 사진을 클러스터링하는 방법에 있어서, 디지털 사진의 이미지 내용으로부터 내용기반 특징값을 추출하고, 디지털 사진의 메타데이터 및 파일정보로부터 부가정보 특징값을 추출하는 단계;사진들을 그룹화하고 특징값에 따라 구조화하는 단계;사진의 특징값 또는 해당사진이 포함된 그룹의 특징값을 이용하여 클러스터를 생성하는 단계;를 포함한다.The present invention relates to a method and an apparatus for clustering digital photographs taken with multiple cameras to enable effective classification of photographs based on situation by structuring photographs taken with multiple digital cameras and applying a graph algorithm. A method of clustering photographs obtained from a plurality of cameras and photographers, the method comprising: extracting content-based feature values from image contents of a digital photograph and extracting additional information feature values from metadata and file information of the digital photograph; Grouping them and structuring them according to feature values; generating clusters using feature values of a photo or feature values of a group including the corresponding photo.

디지털 사진, 클러스터링, 상황기반, 그래프 알고리듬, 구조화, 특징값 Digital Photography, Clustering, Context Based, Graph Algorithms, Structured, Feature Values

Description

Method and apparatus for clustering of digital photos taken with multiple cameras

본 발명은 디지털 사진의 클러스터링에 관한 것으로, 구체적으로 여러대의 디지털 카메라로 촬영된 사진들을 구조화하고 그래프 알고리듬을 적용하는 것에 의해 사진들을 상황 기반으로 유효하게 분류할 수 있도록 한 다중 카메라로 촬영된 디지털 사진의 클러스터링을 위한 방법 및 장치에 관한 것이다.The present invention relates to the clustering of digital photographs. Specifically, digital photographs photographed by multiple cameras can be effectively classified based on situation by structuring photographs photographed by several digital cameras and applying graph algorithms. A method and apparatus for clustering of an apparatus are provided.

디지털 사진 클러스터링의 경우 이전의 방법에서는 주로 한 카메라나 한 명의 개인 사용자가 촬영한 사진에 대하여 사진의 이미지 내용 자체나 메타데이터 등의 부가 정보를 이용하여 클러스터링을 적용한 것이 대부분이었다. 특히 메타데이터 중 촬영시각을 클러스터링을 위한 주요 기준으로 사용하는 것이 보통이다.In the case of digital photo clustering, in the previous method, clustering was mainly applied to photographs taken by one camera or one individual user using additional information such as image content itself or metadata of the photo. In particular, it is common to use shooting time as the main criterion for clustering.

이는 일반 개인이 촬영한 사진의 경우 시간적으로 인접한 사진은 그 내용에 있어서도 의미상 유사할 가능성이 많다는 사항이 전제가 된 것이다. 또한 사람들이 사진을 관리할 때 사용하는 주요 기준 중 하나가 바로 해당 사진을 언제 어떠한 상황이나 행사에서 촬영했는가에 따라 사진을 분류하는 것이다.This is based on the premise that in the case of photographs taken by ordinary individuals, the photographs adjacent to each other in time are likely to be similar in meaning. In addition, one of the main criteria people use to manage their photos is to classify them according to when and under what circumstances and events.

이같은 방법들은 주로 사진을 시간순으로 정렬해놓고 시간차이와 더불어 내용이 많이 바뀌는 구간을 선형적으로 분할하는 방법을 주로 사용한다.These methods mainly use the method of arranging photos in chronological order and linearly dividing the section where the contents change much with time difference.

메타데이터를 접근하고 분석하는 데에는 계산량이 많이 소요되지 않기에 처리 시간이 빠르다는 장점도 있었지만 이 같은 전제는 개인 혹은 한 카메라로 촬영된 일반적인 사진에 국한되는 제약이 있다.Although the processing time is fast because it does not require much computation to access and analyze the metadata, this premise is limited to general photographs taken by individuals or by one camera.

따라서, 요즘과 같이 카메라가 대중화되고 보급률이 높아진 상황에서 어떤 동일한 행사나 여행에 있어서도 여러대의 카메라를 이용해서 촬영하게 되는 경우 동일한 시점에 서로 다른 대상을 촬영하거나, 같은 대상 혹은 같은 의미를 가지는 사물을 촬영하는 경우에도 사진간의 시간차이가 발생하는 경우가 발생하기 때문에 시간만을 주요 기준으로 하여 클러스터링 하는데 어려움이 따른다.Therefore, when the camera is popularized and the penetration rate is increased in recent years, when shooting with multiple cameras at the same event or travel, shooting different objects at the same time point, or taking the same object or objects with the same meaning. Even when shooting, the time difference between the pictures occurs, so it is difficult to cluster on the basis of time only.

특히 여러대의 카메라로 촬영하는 경우 자신의 카메라로 상대방의 사진을 촬영해주는 경우도 흔히 발생하고 있기에 이들 사진을 관리하기 위해서 추후에 여러대의 카메라 혹은 촬영자의 사진을 취합하여 사진을 분류하고자 하는 경우에 있어서 어려움이 많다. In particular, when taking pictures with multiple cameras, it is common to take pictures of the other party with one's own camera. In order to manage these pictures, in the future, when collecting pictures of multiple cameras or photographers and classifying pictures There are many difficulties.

이와 같이 종래 기술에서는 클러스터링 대상이 특정 개인의 단일 카메라로 촬영된 사진으로 대상이 국한되기 때문에 동일한 상황에서 촬영된 사진이라 하더라도 여러 카메라로부터 얻은 촬영시각이나 기타 메타데이터 내용등이 일관적이지 않은 사진에 대해서는 적용이 불가능한 문제가 있다.As such, in the related art, the clustering target is limited to a photograph taken by a single camera of a specific individual, so even if the photograph is taken in the same situation, the photographing time or other metadata contents obtained from multiple cameras are not consistent. There is a problem that is not applicable.

본 발명은 이와 같은 종래 기술의 문제를 해결하기 위한 것으로, 디지털 사진의 클러스터링시에 여러대의 디지털 카메라로 촬영된 사진들을 구조화하고 그래프 알고리듬을 적용하는 것에 의해 사진들을 상황 기반으로 유효하게 분류할 수 있도록 한 다중 카메라로 촬영된 디지털 사진의 클러스터링을 위한 방법 및 장치를 제공하는데 그 목적이 있다.The present invention is to solve such a problem of the prior art, to organize the photos taken with multiple digital cameras in the clustering of digital photos and to effectively classify the photos based on the situation by applying a graph algorithm It is an object of the present invention to provide a method and apparatus for clustering digital photographs taken with a multi-camera.

본 발명은 사진의 이미지에서 추출한 특징값과 파일정보 및 메타데이터에서 추출한 특징값을 바탕으로 사진을 그래프 구조화하고 사진을 정점으로 하는 그래프에서 에지의 가중치를 부여한 다음 그래프 알고리즘을 적용하여 가장 연결 강도가 떨어지는 부분을 분할하는 것을 반복하여 클러스터링하는 방법 및 장치를 제공하는데 그 목적이 있다.The present invention constructs a graph based on the feature value extracted from the image of the photo and the feature value extracted from the file information and metadata, and assigns the weight of the edge to the graph with the photo as a vertex, and then applies the graph algorithm to obtain the most connection strength. It is an object of the present invention to provide a method and apparatus for iteratively clustering the divided portions.

본 발명은 사용자가 분류하고자 하는 클러스터의 기준 및 허용하고자 하는 시간간격을 설정할 수 있도록 하여 보다 적합한 클러스터 사진을 얻을 수 있고, 여러 촬영자에 의해 모인 사진들을 한가지의 분류 결과로 도출하는 것뿐만 아니라, 각자의 취향에 맞게 개별적으로 분류함으로써 자신의 원하는 방식대로 사진 관리를 수행할 수 있도록 한 디지털 사진의 클러스터링을 위한 방법 및 장치를 제공하는데 그 목적이 있다.The present invention enables the user to set the criteria of the cluster to be classified and the time interval to be allowed to obtain a more suitable cluster picture, and not only to derive the photos collected by several photographers as a single classification result, respectively. The purpose of the present invention is to provide a method and apparatus for clustering digital photographs, which can be classified according to their tastes so that photo management can be performed in a desired manner.

이와 같은 목적을 달성하기 위한 본 발명에 따른 다중 카메라로 촬영된 디지 털 사진의 클러스터링을 위한 방법은 다수의 카메라, 촬영자로부터 입수된 사진을 클러스터링하는 방법에 있어서, 디지털 사진의 이미지 내용으로부터 내용기반 특징값을 추출하고, 디지털 사진의 메타데이터 및 파일정보로부터 부가정보 특징값을 추출하는 단계;사진들을 그룹화하고 특징값에 따라 구조화하는 단계;사진의 특징값 또는 해당사진이 포함된 그룹의 특징값을 이용하여 클러스터를 생성하는 단계;를 포함하는 것을 특징으로 한다.In order to achieve the above object, a method for clustering digital photographs taken by multiple cameras according to the present invention is a method of clustering photographs obtained from a plurality of cameras and photographers. Extracting a value and extracting additional information feature values from metadata and file information of the digital picture; grouping the pictures and structuring them according to the feature value; characterizing the picture or feature value of the group including the corresponding picture Generating a cluster using the same;

여기서, 상기 사진의 내용기반 특징값은, 사진의 픽셀정보를 이용하여 추출한, 색상, 질감, 형태 값을 포함하는 것을 특징으로 한다.Here, the content-based feature value of the picture is characterized by including the color, texture, and shape values extracted by using the pixel information of the picture.

그리고 상기 사진의 내용기반 특징값은, 픽셀정보에서 추출한 색상, 질감, 형태 값에 기반한 통계적 분석값을 포함하는 것을 특징으로 한다.And the content-based feature value of the picture, characterized in that it comprises a statistical analysis value based on the color, texture, shape value extracted from the pixel information.

그리고 상기 통계적 분석값은, 영상의 히스토그램 정보 및 웨이블릿을 이용한 질감 분석 정보, 퓨리에 변환을 이용한 주파수 정보를 포함하는 것을 특징으로 한다.The statistical analysis value may include histogram information of an image, texture analysis information using a wavelet, and frequency information using a Fourier transform.

그리고 상기 부가정보 특징값은, 운영체제에서 제공하는 파일정보 및 디지털 사진에서 제공하는 EXIF 또는 IPTC와 같은 메타데이터에서 제공하는 사진 정보 필드값을 포함하는 것을 특징으로 한다.The additional information feature may include file information provided by an operating system and a picture information field value provided by metadata such as EXIF or IPTC provided by a digital picture.

그리고 상기 사진들을 구조화하는 단계에서, 사진을 특징값에 따라 그래프 구조로 변환하는 것을 포함하는 것을 특징으로 한다.In the structuring of the photographs, the photographs may be converted into a graph structure according to a feature value.

그리고 상기 사진을 그래프 구조로 변환하는데 있어서, 사진의 촬영자 또는 촬영된 카메라 별로 사진들을 정점으로 삼아 시간 순서로 정렬하여 이웃한 정점들 을 간선으로 연결함을 포함하는 것을 특징으로 한다.In converting the photo into a graph structure, the photographs are taken as vertices by the photographer or the photographed camera, and the neighboring vertices are connected by edges in a chronological order.

그리고 사진들을 변환한 그래프 구조에 최대 플로우를 계산하는 그래프 알고리즘을 적용하여 최대 플로우를 가지는 에지를 최소 컷으로 그래프를 분할하는 것을 특징으로 한다.In addition, a graph algorithm for calculating the maximum flow is applied to the graph structure in which the pictures are converted to divide the graph with the minimum cut at the edge having the maximum flow.

그리고 분할된 그래프를 입력으로하여 최대 플로우 계산을 반복 적용하여 기준치를 만족할때까지 그래프를 분할해 나가는 것을 특징으로 한다.And it is characterized by dividing the graph until the reference value is satisfied by repeatedly applying the maximum flow calculation by inputting the divided graph.

그리고 상기 사진 정점들을 간선로 연결할 때 에지의 가중치는 두 사진간의 특징값을 기반으로한 유사한 정도로 설정하는 것을 특징으로 한다.When the photo vertices are connected by edges, edge weights are set to a similar degree based on feature values between the two photographs.

그리고 사진 정점들을 간선으로 연결할 때 같은 카메라로 촬영한 사진들은 시간 순서상 이웃한 정점끼리 간선으로 연결하는데 있어 유사도 이외의 일정한 수치를 적용하여 그래프 알고리즘 적용시 분할이 여러 곳에서 일어나지 못하도록 하는 것을 특징으로 한다.And when the photo vertices are connected by edges, the pictures taken by the same camera are connected to each other by edges in order of time so that the division does not occur in several places when applying the graph algorithm by applying a certain value other than similarity. do.

그리고 상기 사진을 그래프 구조로 변환하는데 있어서, 각 사진은 그래프의 노드가 되며, 사진들간의 유사도를 고려하여 노드들간의 에지가 되는 방향 그래프를 구성하기 위하여,사진을 촬영한 카메라에 따라 롤 별로 나누고 촬영 시각에 따라서 정렬하는 과정;같은 롤에 속하는 사진은 시간상 이웃 노드들 사이를 촬영시각이 빠른 노드에서 다음 노드로 시간유사도 가중치를 가진 에지를 연결하는 과정;개별 사진들은 시간 및 내용 유사도를 같이 고려하여, 다른 롤에서 촬영시각이 더 늦은 것들 중 가장 유사한 사진 한장씩과 에지가 연결되도록 하는 과정;플로우를 흘려보내고 받아들이기 위한 시작 노드와 종료노드를 각 롤의 가장 첫 사진과 맨 마 지막 사진에 연결하는 과정;을 포함하는 것을 특징으로 한다.In converting the picture into a graph structure, each picture becomes a node of the graph, and in order to form a direction graph that becomes an edge between nodes in consideration of the similarity between the pictures, the picture is divided by roll according to the camera where the picture was taken. Sorting according to the shooting time; Photos in the same roll is the process of connecting the edges with time similarity weights from neighboring nodes in time to the next node in time; individual photos take time and content similarity together Making the edges connect with the closest one of the later shots on different rolls; connecting the start and end nodes to the first and last photo of each roll to flow and accept the flow. The process of; characterized in that it comprises a.

다른 목적을 달성하기 위한 다중 카메라로 촬영된 디지털 사진의 클러스터링을 위한 장치는 디지털 사진의 클러스터링 장치에 있어서, 디지털 사진의 이미지 내용으로부터 내용기반 특징값을 추출하는 내용기반 특징값 추출부;디지털 사진의 메타데이터 및 파일정보로부터 부가정보 특징값을 추출하는 부가정보 특징값 추출부;사진들을 그룹화하는 그룹 정보 생성부상기 사진의 이미지 내용 특징점과 파일정보 및 메타데이터의 부가정보 특징점을 이용하여 그래프 구조로 변환하는 그래프 구조 변환부;사진의 특징값 또는 해당사진이 포함된 그룹의 특징값을 이용하여 클러스터를 생성하는 클러스터 생성부;를 포함하는 것을 특징으로 한다.An apparatus for clustering digital photos taken with multiple cameras for achieving another object may include: a content-based feature value extractor for extracting content-based feature values from an image content of a digital photo; An additional information feature value extracting unit for extracting additional information feature values from metadata and file information; a group information generation unit for grouping pictures into a graph structure using image content feature points of the pictures and additional information feature points of file information and metadata; And a graph structure converter for converting; a cluster generator for generating a cluster using feature values of a picture or feature values of a group including the corresponding picture.

이와 같은 본 발명에 따른 다중 카메라로 촬영된 디지털 사진의 클러스터링을 위한 방법 및 장치는 다음과 같은 효과를 갖는다.Such a method and apparatus for clustering digital photos taken with multiple cameras according to the present invention have the following effects.

첫째, 여러대의 디지털 카메라로 촬영된 사진들을 구조화하고 그래프 알고리듬을 적용하는 것에 의해 사진들을 상황 기반으로 유효하게 분류할 수 있다.First, by structuring pictures taken by multiple digital cameras and applying graph algorithms, pictures can be effectively classified based on context.

즉, 여러명의 촬영자에 의해 다수 카메라로 촬영된 디지털 사진을 상황기반으로 효과적으로 클러스터링할 수 있다.That is, digital photographs taken by multiple cameras by multiple photographers can be effectively clustered on a situation-based basis.

둘째, 여러명의 촬영자에 의해 다수 카메라로 촬영된 디지털 사진을 사진의 이미지 내용 특징점과 파일정보 및 메타데이터의 부가정보 특징점을 이용하여 그래프 구조로 변환하여 나타낼 수 있다.Second, digital photographs photographed by multiple cameras by multiple photographers may be converted into a graph structure by using image content feature points of the picture and additional information feature points of file information and metadata.

따라서, 촬영시각이 일정하지 않은 다수 카메라로 촬영된 사진들을 그래프 알고리즘을 이용하여 효과적으로 분류할 수 있다.Therefore, it is possible to effectively classify the pictures taken by a plurality of cameras whose shooting time is not constant using a graph algorithm.

셋째, 촬영된 많은 사진들을 사용자가 일일이 분류하는 번거로움 없이 자동으로 계산이 가능함으로써 사용자 편의를 도모할 수 있다.Third, it is possible to automatically calculate a large number of photographed pictures without the user hassle of classifying each one can facilitate user convenience.

넷째, 사용자가 분류하고자 하는 클러스터의 기준 및 허용하고자 하는 시간간격을 설정할 수 있도록 하여 자신의 목적에 보다 적합한 클러스터 사진을 얻을 수 있다.Fourth, the user can set the criteria of the cluster to be classified and the time interval to be allowed to obtain a cluster picture more suitable for their purpose.

따라서, 여러 촬영자에 의해 모인 사진들을 한가지의 분류 결과로 도출하는 것뿐만 아니라, 각자의 취향에 맞게 개별적으로 분류함으로써 자신의 원하는 방식대로 사진 관리를 수행할 수 있다.Therefore, not only the photos collected by the various photographers are derived as a single classification result, but also the photos can be managed according to their desired manner by classifying them individually according to their tastes.

이하, 본 발명에 따른 다중 카메라로 촬영된 디지털 사진의 클러스터링을 위한 방법 및 장치의 바람직한 실시예에 관하여 상세히 설명하면 다음과 같다.Hereinafter, a preferred embodiment of a method and apparatus for clustering digital photos taken with multiple cameras according to the present invention will be described in detail.

본 발명에 따른 다중 카메라로 촬영된 디지털 사진의 클러스터링을 위한 방법 및 장치의 특징 및 이점들은 이하에서의 각 실시예에 대한 상세한 설명을 통해 명백해질 것이다.Features and advantages of the method and apparatus for clustering digital photographs taken with multiple cameras according to the present invention will become apparent from the following detailed description of each embodiment.

도 1은 디지털 사진의 시간 유사도 함수 그래프이고, 도 2내지 도 4는 본 발명에 따른 디지털 사진 클러스터링을 위한 플로우 네트워크이다.1 is a time similarity function graph of a digital picture, and FIGS. 2 to 4 are flow networks for digital picture clustering according to the present invention.

그리고 도 5는 본 발명에 따른 다중 카메라로 촬영된 디지털 사진의 클러스터링을 위한 장치의 구성도이고, 도 6은 본 발명에 따른 다중 카메라로 촬영된 디 지털 사진의 클러스터링을 위한 플로우 차트이다.5 is a block diagram of an apparatus for clustering digital photos taken with multiple cameras according to the present invention, and FIG. 6 is a flowchart for clustering digital photos taken with multiple cameras according to the present invention.

본 발명은 디지털 사진의 클러스터링시에 여러대의 디지털 카메라로 촬영된 사진들을 구조화하고 그래프 알고리듬을 적용하는 것에 의해 사진들을 상황 기반으로 유효하게 분류할 수 있도록 한 것이다.The present invention enables to classify photographs effectively based on situation by structuring photographs taken by several digital cameras and applying a graph algorithm when clustering digital photographs.

본 발명에 따른 다중 카메라로 촬영된 디지털 사진의 클러스터링을 위한 장치는 도 5에서와 같이 디지털 사진의 이미지 내용으로부터 내용기반 특징값을 추출하는 내용기반 특징값 추출부(51)와, 디지털 사진의 메타데이터 및 파일정보로부터 부가정보 특징값을 추출하는 부가정보 특징값 추출부(52)와, 사진들을 촬영자 또는 촬영에 이용된 카메라별로 그룹화하는 그룹 정보 생성부(53)와, 상기 구해진 사진의 이미지 내용 특징점과 파일정보 및 메타데이터의 부가정보 특징점을 이용하여 그래프 구조로 변환하는 그래프 구조 변환부(54)와, 사진의 특징값 또는 해당사진이 포함된 그룹의 특징값을 이용하여 클러스터를 생성하는 클러스터 생성부(55)를 포함하고 구성된다.The apparatus for clustering digital photographs taken with multiple cameras according to the present invention includes a content-based feature value extractor 51 for extracting content-based feature values from image contents of a digital photograph as shown in FIG. An additional information feature value extracting unit 52 for extracting additional information feature values from the data and file information, a group information generating unit 53 for grouping pictures by a photographer or a camera used for photographing, and image contents of the obtained picture A cluster structure converting unit 54 converts the graph into a graph structure by using the feature points and additional information feature points of the file information and metadata, and a cluster that generates a cluster using the feature values of the photo or the feature values of the group including the photo. It comprises a generator 55 and is configured.

이와 같은 구성을 갖는 본 발명에 따른 다중 카메라로 촬영된 디지털 사진의 클러스터링을 위한 장치를 이용하여 사진 클러스터링을 하는 과정을 구체적으로 설명하면 다음과 같다.Hereinafter, a process of clustering photos using the apparatus for clustering digital photos taken with multiple cameras according to the present invention having the above configuration will be described in detail.

본 발명은 여러대의 카메라 혹은 여러명의 촬영자로부터 입수된 사진을 클러스터링하기 위해 도 6에서와 같은 과정을 수행한다.The present invention performs the same process as in FIG. 6 to cluster the photos obtained from multiple cameras or multiple photographers.

크게 사진 입력이 이루어지면(S601), 디지털 사진의 이미지 내용으로부터 내용기반 특징값을 추출하고 디지털 사진의 메타데이터 및 파일정보로부터 부가정보 특징값을 추출한다.(S602)When the photo input is made large (S601), the content-based feature value is extracted from the image content of the digital picture, and the additional information feature value is extracted from the metadata and file information of the digital picture (S602).

그리고 사진들을 촬영자 또는 촬영에 이용된 카메라별로 그룹화하고, 이미지 내용 특징점과 파일정보 및 메타데이터의 부가정보 특징점을 이용하여 그래프 구조로 변환한다.(S604)The photographs are grouped by the photographer or the camera used for photographing, and are converted into a graph structure by using the image content feature points, the file information, and the additional information feature points of the metadata (S604).

그리고 클러스터를 생성하기 위하여, 그래프 알고리듬을 적용하고 그래프 분할을(S604) 기준값을 충족시킬때까지 반복한다.(S605)In order to generate the cluster, the graph algorithm is applied and the graph segmentation is repeated until the reference value is satisfied (S604).

여기서, 디지털 사진으로부터 추출되는 내용기반 특징값은 디지털 사진 이미지의 색상, 질감, 형태등을 분석하여 얻은 수치값을 포함한다.Here, the content-based feature value extracted from the digital photograph includes a numerical value obtained by analyzing the color, texture, and shape of the digital photograph image.

또한, 이들 수치값을 통계적으로 분석한 히스토그램이나 퓨리에 변환을 통한 주파수 분석값, 웨이블릿을 이용한 통계적 분석값 등을 포함할 수 있다. In addition, it may include a histogram statistically analyzing these numerical values, frequency analysis values through Fourier transform, statistical analysis values using wavelets, and the like.

그리고 디지털 사진으로부터 부가정보 특징값을 추출하는 것은 운영체제에서 제공되는 파일정보 속성, 즉 파일 생성시각, 수정시각, 크기, 파일속성 등을 포함한다.Extracting the additional information feature value from the digital picture includes a file information attribute provided by an operating system, that is, a file creation time, a modification time, a size, a file attribute, and the like.

또한, 디지털 사진에서 제공하는 메타데이터 정보, 예를 들면 EXIF나 IPTC와 같은 규격에서 제공하는 사진에 관한 촬영 정보 및 내용 기술 정보를 포함한다. It also includes metadata information provided by digital photographs, for example, photographing information and content description information about photographs provided by standards such as EXIF and IPTC.

EXIF의 정보의 경우 사진이 촬영된 시각을 비롯하여, 조리개 설정, 셔터스피드, 초점거리, 플래쉬 설정, 화이트 밸런스 등과 같은 촬영당시의 카메라 설정에 관한 정보들이 다수 포함되어 있으며 이들은 사진 이미지 내용 비교시에 활용될 수 있다.EXIF's information includes information about the camera's settings at the time of shooting, including the time the picture was taken, as well as the aperture setting, shutter speed, focal length, flash setting, and white balance. Can be.

이에 더하여 부가정보 특징값은 메타데이터 정보를 확률 통계적으로 분석한 수치를 포함할 수 있다. 메타데이터의 속성값을 통해 베이지언(bayesian) 네트웍과 같은 확률적 분석 및 기계학습(machine learning)을 통해 얻은 사진의 내용 유추 결과 수치값을 부가정보 특징값에 포함시킬 수 있다. 이러한 값들은 디지털 사진의 분석 및 이미지 내용비교시에 참조값으로 적용가능하다.In addition, the additional information feature may include a numerical value obtained by statistically analyzing metadata information. Through the attribute values of the metadata, the numerical value of the result of analogy of the contents of the photograph obtained through probabilistic analysis and machine learning, such as a Bayesian network, may be included in the additional information feature value. These values are applicable as reference values when analyzing digital photographs and comparing image content.

상기한 사진을 카메라별로 그룹화하고 구조화 하는 것은 입력된 사진들을 그래프 형태로 구성함으로써 가능하다.Grouping and structuring the above pictures by cameras is possible by constructing the input pictures in the form of a graph.

그래프 알고리즘을 적용할 수 있는데, 그래프에서 구하고자 하는 것은 각 그래프에서의 사진 간의 유사도를 나타내는 에지를 최소한으로 끊으면서 그래프를 두 영역으로 분할하는 것이다.The graph algorithm can be applied, and the graph wants to divide the graph into two regions with the minimum cutting edges representing the similarity between the pictures in each graph.

즉, 사진들간의 유사도가 가장 떨어지는 부분을 찾아서 둘로 갈라놓는 것으로 이는 그래프에서의 최소 컷(minimum cut)을 구하는 것에 해당한다.In other words, it finds the parts with the lowest similarity between the pictures and divides them into two pieces, which corresponds to finding the minimum cut in the graph.

최소 컷 문제는 그래프에서 최대 플로우(max flow)를 찾는 것과 동일한 문제이다.The minimum cut problem is the same problem as finding the max flow in the graph.

따라서, 본 발명에서는 각각의 입력 사진이 노드로 이루어진 그래프를 최대 플로우 알고리즘을 적용시킬 수 있는 그래프 형태로 먼저 구성해야 한다.Therefore, in the present invention, a graph consisting of nodes of each input picture should be configured first in a graph form to which a maximum flow algorithm can be applied.

최대 플로우에서 그래프는 물을 흘려보낼 수 있는 파이프들의 망으로 비유될 수 있으며 방향성이 있는 각각의 파이프 연결(에지)들은 처리량과 플로우 값을 가진다.At maximum flow, the graph can be likened to a network of pipes capable of flowing water, and each directional pipe connection (edge) has throughput and flow values.

처리량(cap, capacity)은 해당 에지에서 물을 흘려보낼수 있는 수치이며, 각 파이프를 흐르는 처리량 이하의 물의 흐름이 플로우(flow)이다.Capacity (cap, capacity) is a number that can flow water at the edge, the flow of water below the throughput flowing through each pipe is a flow (flow).

시작 노드(s)에서 종료노드(t)로 물을 흘려보낼 때 최대로 흐르는 물의 양(max flow)을 구하는 것은 곧 최대량의 플로우를 흘려보내는 에지들을 컷으로 가지는 최소 컷(minimum cut)문제와 같다.Finding the maximum flow of water when flowing water from the start node (s) to the end node (t) is the same as the minimum cut problem with cuts at the edges that flow the maximum flow. .

사진을 그래프 형태로 구성하는 방식은 다음과 같은 과정을 거친다.The method of organizing pictures in graph form is as follows.

여기서 각 사진은 그래프의 노드가 되며, 사진들간의 유사도는 노드들간의 에지가 되는 방향 그래프를 구성한다.Each picture becomes a node of the graph, and the similarity between the pictures constitutes a direction graph that becomes an edge between the nodes.

(1)우선 사진을 촬영한 카메라에 따라 롤 별로 나누고 촬영 시각에 따라서 정렬한다.(1) First, divide the picture by roll according to the camera where the picture was taken and sort according to the shooting time.

(2)같은 롤에 속하는 사진(같은 카메라로 촬영한 사진)은 시간상 이웃 노드들 사이를 촬영시각이 빠른 노드에서 다음 노드로 시간유사도 가중치를 가진 에지를 연결한다.(2) Pictures belonging to the same roll (pictures taken by the same camera) connect edges with time similarity weights between nodes in time with neighboring nodes from the node having the quickest time to the next node.

(3)개별 사진들은 다른 롤에서 촬영시각이 더 늦은 것들 중 가장 유사한 사진 한장씩과 에지가 연결되며 이때 유사도는 시간 및 내용 유사도를 같이 고려한다.(3) The individual photos are connected with the edges of the most similar ones of the slower shooting time on different rolls, and the similarity takes into account the time and content similarity.

(4)플로우를 흘려보내고 받아들이기 위한 시작 노드와 종료노드는 각 롤의 가장 첫 사진과 맨 마지막 사진에 연결된다.(4) The start and end nodes for flowing and accepting flows are connected to the first and last pictures of each roll.

상기한 사진간 유사도를 에지의 가중치로 연결하는데 있어서 사진간 유사도 중 시간유사도는 다음과 같이 구한다.The time similarity among the similarities between the pictures in connecting the similarity between the pictures by the weight of the edge is calculated as follows.

입력되는 모든 사진 집합을

라고 정의한다. 이때 u번째 롤에 해당하는 사진은 명시적으로

와 같이 표시한다

.A set of all the photos

It is defined as. In this case, the photo corresponding to the u roll is explicitly

Display as

.

두 사진 pi와 pj 사이의 시간 유사도 ST는 수학식 1에서와 같이 같이 로지스틱 분포 함수(logistic distribution function)를 이용하여 시간간격이 멀어질수록 유사도 값이 떨어지도록 한다.The time similarity ST between the two pictures pi and pj uses a logistic distribution function as shown in Equation 1 so that the similarity value decreases as the time interval increases.

여기서 μ는 중간값의 위치를 나타내며 0보다 큰 실수 s는 스케일이며, ts(.)는 사진의 타임스탬프를 나타낸다. 함수의 출력값은 0부터 1까지이며 μ시간차가 날때 0.5의 값을 가진다. 본 발명에서 μ는 사용자 입력값을 사용하였으며 스케일 s는 주로 μ/5값을 사용하였으나 사용자의 의도에 따라 얼마든지 다른 값으로 사용가능하다.Where μ represents the position of the median, real s greater than zero is the scale, and ts (.) Represents the timestamp of the photo. The output value of the function is from 0 to 1 and has a value of 0.5 when there is a μ time difference. In the present invention, μ is used as a user input value, and scale s is mainly used as μ / 5 value, but may be used as any other value according to the user's intention.

시간 유사도 비교에서는 μ값과 s값을 조정함으로써 얼마만큼 시간간격이 떨어진 사진들을 같은 사진으로 용인할 것인지 그리고 시간차에 따른 유사도 값의 변화 정도를 조정할 수 있다.In the temporal similarity comparison, by adjusting the μ value and the s value, it is possible to adjust how much time-lapsed pictures are accepted as the same picture and the degree of change in the similarity value according to time difference.

도 1은 시간 유사도 함수의 예를 나타낸 것이다.1 shows an example of a time similarity function.

두 사진 간의 내용 유사도 SH는 앞서 구한 디지털 사진의 특징값을 이용한 비교를 통해서 구한다. 두 사진간의 통합 유사도 $S_C$는 수학식 2에서와 같이 시 간 유사도와 내용 유사도를 같이 고려한다.The content similarity SH between two photographs is obtained through comparison using the feature values of the digital photograph obtained above. The integrated similarity $ S_C $ between the two pictures considers the time similarity and the content similarity as in Equation 2.

그래프의 플로우 처리량 설정 및 에지 연결 과정을 구체적으로 설명하면 다음과 같다.The flow throughput setting and the edge connection process of the graph will be described in detail as follows.

사진들의 촬영 순서가 반영되도록 하기 위해 그래프 G(V, E)의 각 노드(사진)를 연결하는데 있어 같은 롤에 속하는 이웃 사진(노드)들을 그래프로 구성하는 과정은 다음과 같다.In order to reflect the shooting order of the pictures, the process of constructing a graph of neighboring pictures (nodes) belonging to the same role in connecting each node (picture) of the graph G (V, E) is as follows.

여기서 그래프에서 노드들의 집합 V는 그래프를 구성하기 위해 입력된 사진들과 소스(s), 싱크(t) 노드를 포함한다.Here, the set V of nodes in the graph includes the input pictures, the source (s) and the sink (t) nodes to construct the graph.

우선 시간적으로 이웃한 노드들 끼리 에지를 연결하며 에지 가중치(플로우 처리량,cap)는 시간 유사도만으로 적용한다.First, neighboring nodes connect edges in time, and edge weights (flow throughput, cap) are applied only with time similarity.

이는 단일 카메라로 촬영한 사진의 경우 시간적으로 인접한 사진의 경우 의미상으로도 유사한 경우가 많은 상황에 근거한 것으로 이미 다수의 발표된 이전 연구에서 이같은 시간 분석 방식을 활용하였다[Loui et al.(2003), Cooper et al(2005). Grapham et al.(2002)].This is based on a situation where a photograph taken with a single camera is often similar in meaning to a temporally adjacent photograph, and many previous studies have used this time analysis method [Loui et al. (2003). , Cooper et al (2005). Grapham et al. (2002)].

다음은 어떤 롤 u에 속하는 이웃 노드들간의 에지 연결을 나타낸 것이다. 같은 롤에서 앞뒤 사진이 기준치 Ts 이상으로 유사할 경우 이는 거의 동일한 사물을 연속하여 찍은것으로 간주하여 이 둘 사이에 컷이 생기지 못하도록 충분히 큰 상수 값 Cs를 더해준다.The following shows the edge connections between neighboring nodes belonging to a certain role u. If the front and back pictures on the same roll are more than the reference value Ts, it is assumed that the same thing is taken in succession, adding a constant value Cs that is large enough to prevent a cut between the two.

를 만족하는 각 사진에 대해서 수학식 3에서와 같이 에지 가중치를 부여한다.

Edge weights are assigned to each photo satisfying the equation as in Equation 3.

다음으로 수학식 4에서와 같이 특정 사진과 유사한 각 롤별 사진들 간의 에지를 연결한다. 이 때에는 통합 유사도인 $S_C$를 이용하여 두 사진을 비교하게 된다.Next, as shown in Equation 4, the edges between the pictures of each roll similar to the specific picture are connected. In this case, the two pictures are compared using the integrated similarity $ S_C $.

이는 다른 롤에 포함되는 사진끼리 연결하는 경우 여러 카메라간의 응집도를 높여서 해당 사진들간에 컷이 지나지 않도록 함으로써 같은 클러스터에 포함되도록 하기 위함이다.This is to increase the cohesion between several cameras when the pictures included in different rolls are connected so that the cuts do not pass between the pictures so that they are included in the same cluster.

다만 네트워크 플로우 그래프가 방향그래프이기 때문에 에지의 연결 방향은 사진의 촬영시간을 고려하여 컷을 구하고 그래프 상에 사이클이 생기지 않도록 하기위해 타임스탬프(ts)가 빠른 사진에서 늦은 사진으로 연결되도록 한다.However, since the network flow graph is a directional graph, the connection direction of the edge is to take a cut in consideration of the shooting time of the photo and to connect the time stamp (ts) from the quick photo to the late photo so as not to cause a cycle on the graph.

특정 사진에서의 SC 가중치를 가지는 에지 연결은 각 롤별로 가장 유사한 사 진 하나씩만 연결되도록 한다. 이때 같은 롤에 포함되는(u = v인) 사진도 연결지어준다.Edge concatenation with SC weights in a particular photo ensures that only one of the most similar photographs is attached to each roll. It also links the photos that are on the same roll (u = v).

사진들 간의 이제 연결이 마무리 되었으면 이제 플로를 실제 보내고 받을 소스(s)와 싱크(t) 노드를 연결해줘야 한다 (s, t ∈ V). 소스와 싱크는 도 2 에서 보여지는 바와 같이 각 롤의 첫번째 사진과 마지막 사진과 연결된다.Now that the connection between the pictures is complete, we need to connect the source (s) and sink (t) nodes to send and receive the flow (s, t ∈ V). The source and sink are connected with the first and last pictures of each roll as shown in FIG.

위와 같이 구성된 플로우 네트워크 상에서 최대 플로우 알고리즘을 적용하여 최소 컷을 구하게 되는데 일반적인 경우 도 2에서와 같은 분할을 기대하지만 그래프의 구성 상태에 따라서 때로는 도 3과 같이 한 롤 내에서도 컷이 여러 부분을 분할하게 되는 경우가 발생한다.The minimum cut is obtained by applying the maximum flow algorithm on the flow network configured as described above. In general, the cut is expected as shown in FIG. 2, but depending on the configuration of the graph, the cut is sometimes divided into several parts within one roll as shown in FIG. 3. The case occurs.

물론 좁은 시간대의 사진들을 구성하는 그래프에서 앞뒤로 유사한 사진이 흩어져 있는 경우에는 이 같은 컷이 가능하지만 일반적인 경우처럼 롤 내에서 사진이 촬영된 순서를 유지하고자 하는 경우에는 그림과 같은 컷이 발생하지 않도록 그래프의 에지 가중치를 조정해줄 필요가 있다.Of course, such a cut is possible if the similar pictures are scattered back and forth in the graph of the pictures in the narrow time zone, but if you want to maintain the order in which the pictures are taken in the roll as usual, the graph does not occur as shown. We need to adjust the edge weights of.

한가지 방법은 도 4에서처럼 같은 롤에 해당되는 사진들 이웃간의 에지 연결 에 더하여 충분히 가중치로 상수값 Cd를 가지는 에지를 소스에서부터 싱크까지 각각 중첩 연결함으로써 롤 내의 이웃 노드를 지나는 플로우를 매우 높여 같은 롤 내에서 컷이 여러 곳을 분할하지 못하도록 조정하는 것이다.One method is to increase the flow through the neighboring nodes in the roll by significantly overlapping edges with constant values Cd from the source to the sink, with sufficient weights in addition to the edge connections between neighbors of the same roll as in Figure 4 Is to prevent the cut from splitting several places.

이와 같은 본 발명에 따른 다중 카메라로 촬영된 디지털 사진의 클러스터링을 위한 방법에서의 최대 플로우 알고리듬 적용에 관하여 설명하면 다음과 같다.The application of the maximum flow algorithm in the method for clustering digital photographs taken with multiple cameras according to the present invention will be described below.

상기한 방법과 같이 구성된 그래프에서 최대 플로우를 구하기 위해서는 이미 사용되고 있는 최대 플로우 구현방법을 활용할 수 있다.In order to obtain the maximum flow in the graph configured as described above, the maximum flow implementation method that is already used may be utilized.

이같은 구현 방법에는 Goldberg와 Tarjan의 preflow-push 방법을 비롯하여 relabeling과 gap 휴리스틱이 등이 적용된 몇가지 방법이 알려져 있으며, 어떠한 구현 방법을 사용하여도 본 발명의 의미와 동작방식은 동일하다.Some implementation methods such as reflowing and gap heuristics are known as goldberg and Tarjan preflow-push methods, and the meaning and operation of the present invention are the same using any implementation method.

최대 플로우 알고리즘을 통해서는 최대 플로우의 양과 네트워크를 흐르는 에지별 플로우의 양을 알 수 있다. 이들을 이용하면 최소 컷과 이를 기준으로 양쪽으로 나뉜 노드들 역시 구해낼 수 있다.The maximum flow algorithm shows the maximum flow amount and the amount of flow for each edge flowing through the network. Using them, you can also find the minimum cut and the nodes split into two based on it.

최대 플로우는 결과 수치를 [0, 1]로 정규화하기 위해서 다음과 같이 그래프에 최대 입력 가능한 플로우 양으로 나누어준다.The maximum flow is divided by the maximum amount of flows that can be entered into the graph to normalize the result to [0, 1].

만약, 최대 플로우 결과가 사용자 입력 기준치 Tm보다 크면 그래프를 구성하는 사진들이 서로간에 충분히 유사도가 큰 것으로 고려하여 최종적인 사진 클러스 터로 인식하고 기준치보다 작은 경우 그래프는 F와 R로 분할된다.If the maximum flow result is larger than the user input reference value Tm, the pictures constituting the graph are considered to be the final photo clusters considering that the similarities between each other are large enough, and if the size is smaller than the reference value, the graph is divided into F and R.

그리고 각각 F와 R에 속하는 사진들을 따로 플로우 네트워크 그래프를 형성하여 최대 플로우 알고리즘이 기준치 Tm을 만족하거나 분할된 한쪽 그래프가 s나 t 노드 밖에 남아있지 않을 때까지 반복 적용한다. Each of the pictures belonging to F and R is separately formed in a flow network graph, and it is repeatedly applied until the maximum flow algorithm satisfies the reference value Tm or there is only one segmented graph remaining outside the s or t nodes.

더 이상 분할되지 않는 서브 그래프에 남아있는 사진 노드들은 사진 클러스터를 형성하고 실제로 사용자에게 표시할 때는 이들을 시간 순 또는 사진의 내용 유사도순 혹은 사용자가 순서대로 정렬하여 제공한다.Photo nodes that remain in the subgraph that are no longer segmented form photo clusters and, when actually displayed to the user, provide them in chronological order or similarity of the contents of the photo or in the user's order.

이상에서와 같이 본 발명은 사진의 이미지에서 추출한 특징값과 파일정보 및 메타데이터에서 추출한 특징값을 바탕으로 사진을 그래프 구조화하고 사진을 정점으로 하는 그래프에서 에지의 가중치를 부여한 다음 그래프 알고리즘을 적용하여 가장 연결 강도가 떨어지는 부분을 분할하는 것을 반복하여 클러스터링한다.As described above, the present invention constructs the graph based on the feature value extracted from the image of the picture and the feature value extracted from the file information and metadata, and assigns the weight of the edge to the graph with the picture as a vertex, and then applies the graph algorithm. The clustering is repeated by dividing the portion having the least connection strength.

또한, 사용자가 분류하고자 하는 클러스터의 기준 및 허용하고자 하는 시간간격을 설정할 수 있도록 하여 자신의 목적에 보다 적합한 클러스터 사진을 얻을 수 있다.In addition, by allowing the user to set the criteria of the cluster to be classified and the time interval to allow, it is possible to obtain a cluster picture more suitable for their purpose.

이상 설명한 내용을 통해 당업자라면 본 발명의 기술 사상을 일탈하지 아니하는 범위에서 다양한 변경 및 수정이 가능함을 알 수 있을 것이다.It will be apparent to those skilled in the art that various modifications and variations can be made in the present invention without departing from the spirit or scope of the invention.

따라서, 본 발명의 기술적 범위는 실시예에 기재된 내용으로 한정되는 것이 아니라 특허 청구의 범위에 의하여 정해져야 한다.Therefore, the technical scope of the present invention should not be limited to the contents described in the embodiments, but should be defined by the claims.

도 1은 디지털 사진의 시간 유사도 함수 그래프1 is a time similarity function graph of a digital photograph

도 2내지 도 4는 본 발명에 따른 디지털 사진 클러스터링을 위한 플로우 네트워크2-4 are flow networks for digital photo clustering according to the present invention.

도 5는 본 발명에 따른 다중 카메라로 촬영된 디지털 사진의 클러스터링을 위한 장치의 구성도5 is a block diagram of an apparatus for clustering digital photos taken with multiple cameras according to the present invention;

도 6은 본 발명에 따른 다중 카메라로 촬영된 디지털 사진의 클러스터링을 위한 플로우 차트6 is a flow chart for clustering digital photos taken with multiple cameras in accordance with the present invention.

도면의 주요 부분에 대한 부호의 설명Explanation of symbols for the main parts of the drawings

51. 내용기반 특징값 추출부 52. 부가정보 특징값 추출부51. Content-based feature value extractor 52. Additional information feature value extractor

53. 그룹정보 생성부 54. 그래프 구조 변환부53. Group information generator 54. Graph structure converter

55. 클러스터 생성부55. Cluster Generator

Claims

In a method for clustering photos obtained from a plurality of cameras, photographers,

Extracting content-based feature values from the image content of the digital picture and extracting additional information feature values from metadata and file information of the digital picture;

A structured step of grouping photos, using the content-based feature value and the additional information feature value to align pictures in time order by taking photographs as photographic or photographed cameras as vertices and connecting neighboring vertices with edges to convert them into a graph structure. ;

And generating a cluster by using the content-based feature value and the additional information feature value of a group including a picture or a corresponding picture.

The content-based feature of claim 1, wherein

A method for clustering digital photographs taken with multiple cameras comprising color, texture, and shape values extracted using pixel information of a photograph.

The content-based feature of claim 1, wherein

A method for clustering digital photographs taken with multiple cameras, comprising statistical analysis based on color, texture, and shape values extracted from pixel information.

The method of claim 3, wherein the statistical analysis value,

A method for clustering digital photographs taken with multiple cameras, comprising histogram information of an image, texture analysis information using wavelets, and frequency information using Fourier transform.

The method of claim 1, wherein the additional information feature value,

A method for clustering digital photographs taken with multiple cameras, comprising: file information provided by an operating system and picture information field values provided by EXIF or IPTC metadata provided by a digital photograph.

delete

The method of claim 1, wherein a graph algorithm for calculating the maximum flow is applied to the graph structure in which the pictures are converted to divide the graph with the minimum cut at the edge having the maximum flow. Way.

10. The method of claim 8, wherein the divided graph is input and the maximum flow calculation is repeatedly applied to divide the graph until the reference value is satisfied.

The method of claim 1, wherein the weight of an edge when connecting the photo vertices by a trunk line is set based on the content-based feature value and the additional information feature value between two pictures. Way.

11. The method of claim 10, wherein when the photo vertices are connected by edges, the photographs taken by the same camera are adjusted by applying a constant value to connect neighboring vertices to the edges in time order to adjust the splitting when the graph algorithm is applied. A method for clustering digital photos taken with multiple cameras.

The method of claim 1, wherein in converting the photograph into a graph structure,

Dividing the picture by roll according to the camera where the picture was taken and sorting according to the shooting time;

A photograph belonging to the same roll is a process of connecting edges having weights based on the photographing time from neighboring nodes to nodes from the node having the fastest photographing time in time;

Allowing individual pictures to be edge-connected with one of the photographs having a later recording time on another roll based on time and content extracted using the content-based feature value and the additional information feature value;

Connecting the start node and the end node to the first and last pictures of each roll to flow and accept the flow; and constructing a direction graph that is an edge between the nodes, where the nodes of the graph are each picture Method for clustering digital photos taken with multiple cameras, characterized in that.

In the clustering device of digital photography,

A content-based feature value extraction unit for extracting content-based feature values from the image contents of the digital photograph;

An additional information feature value extracting unit for extracting additional information feature values from metadata and file information of the digital photograph;

Group information generator for grouping photos

Using the content-based feature value of the photo and the additional information feature value of the file information and metadata, the photographs are taken as vertices for each photographer or photographed camera and arranged in chronological order to connect neighboring vertices with edges to form a graph. A graph structure converter to convert;

Apparatus for clustering digital photos taken with multiple cameras comprising a; cluster generation unit for generating a cluster by using the content-based feature value of the photo or group, the additional information feature value.