KR102095814B1

KR102095814B1 - System and Method for Analyzing Video Data Using Change Pattern of Shot Size

Info

Publication number: KR102095814B1
Application number: KR1020180126299A
Authority: KR
Inventors: 임양미; 엄주언; 박서린
Original assignee: 덕성여자대학교 산학협력단
Priority date: 2018-10-22
Filing date: 2018-10-22
Publication date: 2020-04-01

Abstract

According to one aspect of the present invention, an image data analysis system using a shot change pattern capable of analyzing image data using a shot size change pattern of the image data comprises: a shot boundary detection unit which detects a boundary between shots from image data using a shot boundary detection algorithm; a shot size determination unit which determines a shot size of a corresponding shot based on a size of a representative object extracted from at least one of images constituting each shot; a climax section determination unit for determining a climax section of the image data using a change pattern of the shot size; and a database generation unit which maps and stores the determined shot size change pattern of the climax section with identification information of the image data.

Description

System and Method for Analyzing Video Data Using Change Pattern of Shot Size}

본 출원은 영상 데이터 분석 시스템 및 방법에 관한 것이다.The present application relates to an image data analysis system and method.

최근 한국의 드라마, 영화 등의 영상 데이터가 아시아, 유럽 각지에서 선광적인 인기를 끌고 있다. 이러한 인기는 드라마, 영화 등의 시나리오 기획력과 촬영 기술, 편집 기술, 특수효과 등의 다양한 기술등이 종합적으로 잘 융합되어 발현되고 있기 때문이다. 특히 드라마, 영화 등의 클라이맥스는 몰입감을 올려주는 중요한 부분으로 드라마, 영화 등의 흥행을 위한 결정적인 요인으로 볼 수 있다.Recently, video data such as Korean dramas and movies have gained popularity in Asia and Europe. This popularity is due to the fact that scenario planning powers such as dramas and movies, and various technologies such as shooting techniques, editing techniques, and special effects are combined and expressed in a comprehensive manner. In particular, climax in dramas and movies is an important part of enhancing immersion, and can be seen as a decisive factor for the success of dramas and movies.

따라서, 이러한 인기를 지속적으로 유지하기 위해서는 흥행에 성공한 영상 데이터의 특성을 분석하고, 분석된 특성을 기초로 영상 데이터를 생성할 필요가 있다. 이때 영상 데이터의 특성 분석을 위해 영상 데이터로부터 해당 영상 데이터의 클라이맥스를 선정하고, 클라이맥스에 적용된 촬영기법이나 편집기술 등을 확인하기 위한 시도가 이루어 지고 있다.Therefore, in order to maintain this popularity, it is necessary to analyze the characteristics of the video data that has been successful, and generate the image data based on the analyzed characteristics. At this time, to analyze the characteristics of the video data, attempts have been made to select a climax of the video data from the video data, and to confirm a shooting technique or editing technique applied to the climax.

하지, 영상 데이터의 클라이맥스의 선정은 분석자의 경험에 의해 이루어지는 것일 일반적이므로, 동일한 영상 데이터라 하더라도 분석자 마다 서로 다른 클라이맥스가 선정될 수 있어 영상 데이터의 분석이 정확하게 수행될 수 없을 뿐만 아니라, 클라이맥스 선정에도 많은 시간이 소요될 수 있다는 문제점이 있다.However, since the selection of the climax of the image data is generally performed by the experience of the analyst, different climaxes may be selected for each analyst even for the same image data, and analysis of the image data may not be accurately performed. There is a problem that it can take a lot of time.

본 발명은 영상 데이터의 샷 사이즈 변화 패턴을 이용하여 영상 데이터를 분석할 수 있는 샷 사이즈 변화패턴을 이용하여 영상 데이터 분석 시스템 및 방법을 제공하는 것을 그 기술적 과제로 한다.An object of the present invention is to provide an image data analysis system and method using a shot size change pattern capable of analyzing image data using a shot size change pattern of image data.

또한, 본 발명은 클라이맥스 구간의 샷 사이즈 변화패턴을 해당 영상 데이터의 식별정보와 매핑시켜 저장함으로써 영상분석용 데이터베이스를 생성할 수 있는 샷 사이즈 변화패턴을 이용한 영상 데이터 분석 시스템 및 방법을 제공하는 것을 또 다른 기술적 과제로 한다.In addition, the present invention provides a system and method for analyzing image data using a shot size change pattern that can generate a database for image analysis by mapping and storing the shot size change pattern in a climax section with identification information of corresponding image data. Another technical task.

또한, 본 발명은 영상분석용 데이터베이스에 저장된 영상 데이터의 샷 사이즈 변화패턴을 이용하여 특정 영상 데이터의 흥행 성공 여부를 미리 판단할 수 있는 샷 사이즈 변화패턴을 이용한 영상 데이터 분석 시스템 및 방법을 제공하는 것을 또 다른 기술적 과제로 한다. In addition, the present invention provides an image data analysis system and method using a shot size change pattern capable of determining in advance whether or not the success of specific image data is previously performed using a shot size change pattern of image data stored in a database for image analysis. Another technical task.

또한, 본 발명은 영상분석용 데이터베이스를 이용하여 타겟 영상을 생성할 수 있는 샷 사이즈 변화패턴을 이용한 영상 데이터 분석 시스템 및 방법을 제공하는 것을 또 다른 기술적 과제로 한다.Another object of the present invention is to provide an image data analysis system and method using a shot size change pattern capable of generating a target image using a database for image analysis.

상술한 기술적 과제를 달성하기 위한 본 발명의 일 측면에 따른 샷 변화패턴을 이용한 영상 데이터 분석 시스템은, 샷(Shot) 경계 검출 알고리즘을 이용하여 영상 데이터로부터 샷들 간의 경계를 검출하는 샷 경계 검출부; 각 샷을 구성하는 이미지들 중 적어도 하나의 이미지로부터 추출되는 대표객체의 크기를 기초로 해당 샷의 샷 사이즈를 결정하는 샷 사이즈 결정부; 상기 샷 사이즈의 변화패턴을 이용하여 상기 영상 데이터의 클라이맥스 구간을 결정하는 클라이맥스 구간 결정부; 및 상기 결정된 클라이맥스 구간의 샷 사이즈 변화 패턴을 상기 영상 데이터의 식별정보와 매핑시켜 저장하는 데이터베이스 생성부를 포함하는 것을 특징으로 한다.An image data analysis system using a shot change pattern according to an aspect of the present invention for achieving the above-described technical problem includes: a shot boundary detection unit that detects a boundary between shots from image data using a shot boundary detection algorithm; A shot size determination unit that determines a shot size of a corresponding shot based on a size of a representative object extracted from at least one of the images constituting each shot; A climax section determination unit for determining a climax section of the image data using the change pattern of the shot size; And a database generation unit that maps and stores the determined shot size change pattern of the climax section with the identification information of the image data.

상술한 기술적 과제를 달성하기 위한 본 발명의 일 측면에 따른 샷 변화패턴을 이용한 영상 데이터 분석 방법은, 샷 경계 검출 알고리즘을 이용하여 영상 데이터로부터 샷들 간의 경계를 검출하는 단계; 각 샷을 구성하는 이미지들 중 적어도 하나의 이미지로부터 추출되는 대표객체의 크기를 기초로 해당 샷의 샷 사이즈를 결정하는 단계; 및 상기 샷 사이즈의 변화패턴을 이용하여 상기 영상 데이터의 클라이맥스 구간을 결정하는 단계를 포함하는 것을 특징으로 한다.An image data analysis method using a shot change pattern according to an aspect of the present invention for achieving the above-described technical problem includes: detecting a boundary between shots from image data using a shot boundary detection algorithm; Determining a shot size of a corresponding shot based on a size of a representative object extracted from at least one image among images constituting each shot; And determining a climax section of the image data using the change pattern of the shot size.

본 발명에 따르면, 영상 데이터의 샷 사이즈 변화 패턴을 이용하여 영상 데이터로부터 클라이맥스를 자동으로 추출할 수 있기 때문에, 영상 데이터로부터 클라이맥스를 빠른 시간 내에 정확하게 추출할 수 있다는 효과가 있다.According to the present invention, since the climax can be automatically extracted from the image data by using the shot size change pattern of the image data, there is an effect that the climax can be accurately extracted from the image data in a short time.

또한, 본 발명에 따르면 각 영상 데이터의 클라이맥스에 대응되는 샷 사이즈 변화패턴을 이용하여 영상 데이터 분석용 데이터베이스를 생성할 수 있기 때문에 영상 데이터 분석이 용이해 진다는 효과가 있다.In addition, according to the present invention, since it is possible to create a database for analyzing image data using a shot size change pattern corresponding to a climax of each image data, there is an effect that analysis of image data is facilitated.

또한, 본 발명에 따르면 영상분석용 데이터베이스에 저장된 흥행에 영상 데이터의 샷 사이즈 변화패턴을 이용하여 특정 영상 데이터의 흥행 성공 여부를 예측할 수 있어 영화제작사의 관점에서 효과적인 비즈니스 플랜을 설계할 수 있다는 효과가 있다.In addition, according to the present invention, it is possible to predict whether or not the success of specific video data can be predicted by using the shot size change pattern of the video data in the box office stored in the database for image analysis. have.

또한, 본 발명에 따르면 타겟 영상데이터 생성시 영상분석용 데이터베이스에 저장된 특정 장르의 영상 데이터의 샷 사이즈 변화패턴을 이용할 수 있어 타겟 영상데이터 생성의 용이성을 증대시킴과 동시에 타겟 영상데이터의 흥행 가능성을 높일 수 있다는 효과가 있다.In addition, according to the present invention, when the target image data is generated, a shot size change pattern of image data of a specific genre stored in a database for image analysis can be used, thereby increasing the ease of generating the target image data and increasing the likelihood of the success of the target image data. It has the effect of being able to.

도 1은 본 발명의 일 실시예에 따른 샷 사이즈 변화패턴을 이용한 영상 데이터 분석 시스템의 구성을 보여주는 블록도이다.
도 2a 내지 도 2c는 하나의 샷 경계 검출 알고리즘을 이용하여 검출된 샷 경계의 부정확성을 보여주는 도면이다.
도 3은 도 1에 도시된 샷 사이즈 결정부의 구성을 구체적으로 보여주는 블록도이다.
도 4a는 이미지 내에서 검출된 복수개의 타겟영역의 예를 보여주는 도면이다.
도 4b는 복수개의 타겟영역들 중 결정된 대표객체의 예를 보여주는 도면이다.
도 4c는 샷 사이즈가 클로즈업샷인 대표객체의 예를 보여주는 도면이다.
도 5a는 정면에서 촬영된 사람이 포함되어 있지 않은 타겟영역이 포함된 이미지의 예를 보여주는 도면이다.
도 5b는 도 5a에 도시된 이미지에서 결정된 대표객체의 예를 보여주는 도면이다.
도 6은 사람이 포함되지 않은 이미지의 예를 보여주는 도면이다.
도 7은 샷 사이즈 변화패턴에 따라 결정된 클라이맥스 구간의 예를 보여주는 그래프이다.
도 8은 본 발명의 일 실시예에 따른 샷 사이즈 변화패턴을 이용한 영상 데이터 분석방법을 보여주는 플로우 차트이다.1 is a block diagram showing the configuration of an image data analysis system using a shot size change pattern according to an embodiment of the present invention.
2A to 2C are diagrams showing inaccuracies in shot boundaries detected using one shot boundary detection algorithm.
3 is a block diagram specifically showing the configuration of the shot size determination unit illustrated in FIG. 1.
4A is a diagram showing an example of a plurality of target areas detected in an image.
4B is a diagram showing an example of a representative object determined among a plurality of target areas.
4C is a view showing an example of a representative object whose shot size is a close-up shot.
5A is a view showing an example of an image including a target area that does not include a person photographed from the front.
5B is a diagram showing an example of a representative object determined from the image shown in FIG. 5A.
6 is a diagram showing an example of an image in which a person is not included.
7 is a graph showing an example of a climax section determined according to a shot size change pattern.
8 is a flowchart illustrating an image data analysis method using a shot size change pattern according to an embodiment of the present invention.

본 출원의 이점 및 특징, 그리고 그것들을 달성하는 방법은 첨부되는 도면과 함께 상세하게 후술되어 있는 일 예들을 참조하면 명확해질 것이다. 그러나 본 출원은 이하에서 개시되는 일 예들에 한정되는 것이 아니라 서로 다른 다양한 형태로 구현될 것이며, 단지 본 출원의 일 예들은 본 출원의 개시가 완전하도록 하며, 본 출원이 속하는 기술분야에서 통상의 지식을 가진 자에게 발명의 범주를 완전하게 알려주기 위해 제공되는 것이며, 본 출원은 청구항의 범주에 의해 정의될 뿐이다.Advantages and features of the present application, and a method of achieving them will be clarified with reference to examples described below in detail together with the accompanying drawings. However, the present application is not limited to the examples disclosed below, but will be implemented in various different forms, and only the examples of the present application make the disclosure of the present application complete, and common knowledge in the art to which this application belongs It is provided to fully inform the person having the scope of the invention, and the present application is only defined by the scope of the claims.

본 출원의 일 예를 설명하기 위한 도면에 개시된 형상, 크기, 비율, 각도, 개수 등은 예시적인 것이므로 본 출원이 도시된 사항에 한정되는 것은 아니다. 명세서 전체에 걸쳐 동일 참조 부호는 동일 구성 요소를 지칭한다. 또한, 본 출원을 설명함에 있어서, 관련된 공지 기술에 대한 구체적인 설명이 본 출원의 요지를 불필요하게 흐릴 수 있다고 판단되는 경우 그 상세한 설명은 생략한다. The shapes, sizes, ratios, angles, numbers, etc. disclosed in the drawings for explaining an example of the present application are exemplary, and the present application is not limited to the illustrated matters. The same reference numerals refer to the same components throughout the specification. In addition, in the description of the present application, when it is determined that a detailed description of related known technologies may unnecessarily obscure the subject matter of the present application, the detailed description will be omitted.

본 명세서에서 언급된 '포함한다', '갖는다', '이루어진다' 등이 사용되는 경우 '~만'이 사용되지 않는 이상 다른 부분이 추가될 수 있다. 구성 요소를 단수로 표현한 경우에 특별히 명시적인 기재 사항이 없는 한 복수를 포함하는 경우를 포함한다. When 'include', 'have', 'consist of' and the like mentioned in this specification are used, other parts may be added unless '~ man' is used. When a component is expressed as a singular number, the plural number is included unless otherwise specified.

구성 요소를 해석함에 있어서, 별도의 명시적 기재가 없더라도 오차 범위를 포함하는 것으로 해석한다.In analyzing the components, it is interpreted as including the error range even if there is no explicit description.

위치 관계에 대한 설명일 경우, 예를 들어, '~상에', '~상부에', '~하부에', '~옆에' 등으로 두 부분의 위치 관계가 설명되는 경우, '바로' 또는 '직접'이 사용되지 않는 이상 두 부분 사이에 하나 이상의 다른 부분이 위치할 수도 있다.In the case of the description of the positional relationship, for example, when the positional relationship of two parts is described as '~ top', '~ upper', '~ bottom', '~ side', etc., 'right' Alternatively, one or more other parts may be located between the two parts unless 'direct' is used.

시간 관계에 대한 설명일 경우, 예를 들어, '~후에', '~에 이어서', '~다음에', '~전에' 등으로 시간적 선후 관계가 설명되는 경우, '바로' 또는 '직접'이 사용되지 않는 이상 연속적이지 않은 경우도 포함할 수 있다.In the case of a description of a time relationship, for example, 'after', 'following', '~ after', '~ before', etc., when the temporal sequential relationship is described, 'right' or 'direct' It may also include cases that are not continuous unless it is used.

제1, 제2 등이 다양한 구성요소들을 서술하기 위해서 사용되나, 이들 구성요소들은 이들 용어에 의해 제한되지 않는다. 이들 용어들은 단지 하나의 구성요소를 다른 구성요소와 구별하기 위하여 사용하는 것이다. 따라서, 이하에서 언급되는 제1 구성요소는 본 출원의 기술적 사상 내에서 제2 구성요소일 수도 있다.The first, second, etc. are used to describe various components, but these components are not limited by these terms. These terms are only used to distinguish one component from another component. Accordingly, the first component mentioned below may be the second component within the technical spirit of the present application.

"제1 수평 축 방향", "제2 수평 축 방향" 및 "수직 축 방향"은 서로 간의 관계가 수직으로 이루어진 기하학적인 관계만으로 해석되어서는 아니 되며, 본 출원의 구성이 기능적으로 작용할 수 있는 범위 내에서보다 넓은 방향성을 가지는 것을 의미할 수 있다. "First horizontal axis direction", "Second horizontal axis direction" and "Vertical axis direction" should not be interpreted only as a geometric relationship in which the relationship between each other is vertical, and the scope in which the configuration of the present application can function functionally It can mean having a wider directionality than within.

"적어도 하나"의 용어는 하나 이상의 관련 항목으로부터 제시 가능한 모든 조합을 포함하는 것으로 이해되어야 한다. 예를 들어, "제 1 항목, 제 2 항목 및 제 3 항목 중에서 적어도 하나"의 의미는 제 1 항목, 제 2 항목 또는 제 3 항목 각각 뿐만 아니라 제 1 항목, 제 2 항목 및 제 3 항목 중에서 2개 이상으로부터 제시될 수 있는 모든 항목의 조합을 의미할 수 있다. It should be understood that the term “at least one” includes all possible combinations from one or more related items. For example, the meaning of “at least one of the first item, the second item, and the third item” means 2 of the first item, the second item, and the third item, as well as the first item, the second item, and the third item, respectively. It can mean any combination of items that can be presented from more than one dog.

본 출원의 여러 예들의 각각 특징들이 부분적으로 또는 전체적으로 서로 결합 또는 조합 가능하고, 기술적으로 다양한 연동 및 구동이 가능하며, 각 예들이 서로에 대하여 독립적으로 실시 가능할 수도 있고 연관 관계로 함께 실시할 수도 있다.Each of the features of the various examples of the present application may be partially or wholly combined with or combined with each other, technically various interlocking and driving may be possible, and each of the examples may be independently implemented with respect to each other or may be implemented together in an associative relationship. .

이하 도면을 참조하여 본 발명에 따른 샷 사이즈 변화패턴을 이용한 영상 데이터 분석 시스템에 대해 구체적으로 설명한다. 각 도면의 구성요소들에 참조부호를 부가함에 있어서, 동일한 구성요소들에 대해서는 비록 다른 도면상에 표시되더라도 가능한 한 동일한 부호를 가질 수 있다.Hereinafter, an image data analysis system using a shot size change pattern according to the present invention will be described in detail with reference to the drawings. In adding reference numerals to the components of each drawing, the same components may have the same reference numerals as possible even though they are displayed on different drawings.

도 1은 본 발명의 일 실시예에 따른 샷 사이즈 변화패턴을 이용한 영상 데이터 분석 시스템의 구성을 개략적으로 보여주는 블록도이다. 도 1에 도시된 바와 같이, 본 발명에 따른 샷 사이즈 변화패턴을 이용하여 영상 데이터 분석 시스템(100, 이하, '분석 시스템'이라 함)은 영상 데이터 수신부(110), 샷 경계 검출부(120), 샷 사이즈 결정부(130), 클라이맥스 구간 결정부(140), 및 데이터베이스 생성부(150)를 포함한다.1 is a block diagram schematically showing a configuration of an image data analysis system using a shot size change pattern according to an embodiment of the present invention. As shown in FIG. 1, the image data analysis system (hereinafter referred to as 'analysis system') using the shot size change pattern according to the present invention includes an image data receiving unit 110, a shot boundary detection unit 120, It includes a shot size determining unit 130, a climax section determining unit 140, and a database generating unit 150.

영상 데이터 수신부(110)는 분석대상이 되는 영상 데이터를 수신한다. 일 실시예에 있어서, 영상 데이터 수신부(110)를 통해 수신되는 영상 데이터는 영화, 드라마, 또는 애니메이션과 같이 시나리오 기반의 영상 데이터일 수 있다. 다른 예로, 영상 데이터 수신부(110)를 통해 수신되는 영상 데이터는 뮤직비디오 등과 같은 영상 데이터를 포함할 수도 있다.The image data receiving unit 110 receives image data to be analyzed. In one embodiment, the image data received through the image data receiving unit 110 may be scenario-based image data such as a movie, drama, or animation. As another example, the image data received through the image data receiving unit 110 may include image data such as a music video.

영상 데이터 수신부(110)는 인터넷과 같은 네트워크를 통해 외부로부터 영상 데이터를 수신할 수 있다. 다른 예로, 영상 데이터 수신부(110)는 분석 시스템(100)에 포함된 영상 데이터베이스(미도시)로부터 영상 데이터를 수신하거나, 모바일 디바이스, 하드디스크, 또는 USB(Universal Serial Bus)와 같은 별도의 저장매체로부터 영상 데이터를 수신할 수도 있을 것이다.The image data receiving unit 110 may receive image data from the outside through a network such as the Internet. As another example, the image data receiving unit 110 receives image data from an image database (not shown) included in the analysis system 100, or a separate storage medium such as a mobile device, hard disk, or Universal Serial Bus (USB). It may be possible to receive image data from.

영상 데이터 수신부(110)는 수신된 영상 데이터를 샷 경계 검출부(120)로 전달한다.The image data receiving unit 110 transmits the received image data to the shot boundary detection unit 120.

샷 경계 검출부(120)는 영상 데이터 수신부(110)를 통해 수신된 영상 데이터로부터 샷(Shot) 경계를 검출한다. 여기서, 샷이란 카메라의 연속적인 움직임으로 만들어지는 영상 데이터를 의미하는 것으로서 연속되는 복수개의 이미지들로 구성될 수 있다. 복수개의 샷들이 모여 하나의 씬(Scene)이 구성되고, 복수개의 씬이 모여 하나의 시퀀스(Sequence)가 구성된다.The shot boundary detection unit 120 detects a shot boundary from the image data received through the image data reception unit 110. Here, the shot refers to image data generated by continuous movement of the camera, and may be composed of a plurality of consecutive images. A plurality of shots are gathered to form a scene, and a plurality of scenes are gathered to form a sequence.

샷 샹계 검출부(120)는 미리 정해진 샷 경계 검출 알고리즘을 이용하여 영상 데이터로부터 샷 경계를 검출할 수 있다. 본 발명에 따른 샷 경계 검출부(120)는 복수개의 샷 경계 검출 알고리즘을 혼합하여 샷 경계를 검출한다.The shot chance detection unit 120 may detect a shot boundary from image data using a predetermined shot boundary detection algorithm. The shot boundary detection unit 120 according to the present invention detects a shot boundary by mixing a plurality of shot boundary detection algorithms.

본 발명에 따른 샷 경계 검출부(120)가 복수개의 샷 경계 검출알고리즘을 혼합하여 샷 경계를 검출하는 이유는, 영화와 같은 영상 데이터의 경우 샷과 샷의 연결부분에 장면전환기법(Video Transition Effects)이 많이 사용되고, 조명의 다양한 사용으로 인해 동일한 장면임에도 불구하고 조명의 밝기가 극심하게 변화할 수 있으며, 익스트림 클로즈업(Extreme Close-up) 상태에 있을 때 객체가 이동하면 화면 전체의 영상이 변경되는 것으로 판단될 수 있는 특성이 있기 때문에, 하나의 샷 경계 알고리즘만을 이용하여 샷 경계를 검출하게 되면 정확한 샷 경계가 검출되지 않기 때문이다.The reason why the shot boundary detection unit 120 according to the present invention detects a shot boundary by mixing a plurality of shot boundary detection algorithms is that in the case of video data such as a movie, a scene transition technique is used for the connection between the shot and the shot (Video Transition Effects). It is used a lot, and even though it is the same scene due to various uses of lighting, the brightness of the lighting can change dramatically, and when the object moves while in extreme close-up, the entire image of the screen changes. Because there is a characteristic that can be judged, if a shot boundary is detected using only one shot boundary algorithm, an accurate shot boundary is not detected.

예컨대, 샷 경계 검출부(120)가 하나의 샷 경계 검출알고리즘을 이용하여 샷 경계를 검출하게 되면, 도 2a에 도시된 바와 같이, 배경과 배우가 변경되지 않았음에도 불구하고 조명의 밝기가 급격히 변경되는 경우 샷 경계로 검출될 수 있다.For example, when the shot boundary detection unit 120 detects a shot boundary using a single shot boundary detection algorithm, as shown in FIG. 2A, the brightness of the lighting changes rapidly even though the background and the actor are not changed. In case, it can be detected as a shot boundary.

다른 예로, 샷 경계 검출부(120)가 하나의 샷 경계 검출알고리즘을 이용하여 샷 경계를 검출하게 되면, 도 2b에 도시된 바와 같이 디졸브(Dissove)나 페이드 인/아웃(Fade in/out)과 같은 장면전환기법이 적용되어 1초 이상 두 프레임이 연속적으로 섞이는 경우에도 셧 경계로 검출될 수 있다.As another example, when the shot boundary detection unit 120 detects a shot boundary using a single shot boundary detection algorithm, as shown in FIG. 2B, such as Dissove or Fade in / out. Even if two frames are continuously mixed for more than 1 second by applying the scene change technique, it can be detected as a shut boundary.

또 다른 예로, 샷 경계 검출부(120)가 하나의 샷 경계 검출알고리즘을 이용하여 샷 경계를 검출하게 되면, 도 2c에 도시된 바와 같이 과도한 클로즈업 상태에서 카메라 앞으로 큰 객체의 움직임이 발생되는 경우에도 샷 경계로 검출될 수 있다.As another example, when the shot boundary detection unit 120 detects a shot boundary using a single shot boundary detection algorithm, even when a large object moves in front of the camera in an excessive close-up state as illustrated in FIG. 2C, the shot is shot. It can be detected as a boundary.

따라서, 본 발명에 따른 샷 경계 검출부(120)는 상술한 바와 같은 샷 경계 검출의 부정확성을 해결하기 위해 샷 경계 검출시 복수개의 샷 경계 검출 알고리즘을 혼합 적용하여 샷 경계를 검출한다. 일 실시예에 있어서, 샷 경계 검출부(120)는 컬러 히스토그램(Color Histogram) 알고리즘, 에지 히스토그램(Edge Histrogram) 알고리즘, 및 옵티컬 플로우 트래킹(Optical Flow Tracking) 알고리즘을 혼합하여 샷 경계를 검출할 수 있다.Therefore, the shot boundary detection unit 120 according to the present invention detects the shot boundary by mixing and applying a plurality of shot boundary detection algorithms when detecting the shot boundary in order to solve the inaccuracy of the shot boundary detection as described above. In one embodiment, the shot boundary detection unit 120 may detect a shot boundary by mixing a color histogram algorithm, an edge histrogram algorithm, and an optical flow tracking algorithm.

구체적으로, 본 발명에 따른 샷 경계 검출부(120)는 조명의 밝기가 급변함으로 인해 샷 경계가 잘못 검출되는 것을 방지하기 위해 컬러 히스토그램(Color Histogram) 알고리즘과 에지 히스토그램(Edge Histrogram) 알고리즘을 혼합하여 사용한다. 이는 조명의 밝기가 급변하는 경우 컬러 히스토그램 변화는 크지만 엣지 히스토그램의 변화는 달라지지 않기 때문이다.Specifically, the shot boundary detection unit 120 according to the present invention is used by mixing a color histogram algorithm and an edge histogram algorithm to prevent the shot boundary from being incorrectly detected due to a sudden change in the brightness of the lighting. do. This is because the color histogram change is large when the brightness of the light changes rapidly, but the edge histogram does not change.

특히, 본 발명에 따른 샷 경계 검출부(120)는 옵티컬 플로우 트래킹 알고리즘에서 이용되는 KCF 필터(Kernelized Correlation Filters)를 추가로 이용함으로써 디졸브나 페이드 인/아웃과 같은 장면전환기법이 적용되는 경우에도 샷 경계를 정확하게 검출하게 된다. 이때, KCF 필터는 트랙커 함수의 하나로 대부분 목표물과 주변환경을 구별하기 위해 사용되는 특정분류 모델로서, KCF필터의 사용으로 인해 디졸브나 페이드 인/아웃과 같은 장면전환기법이 적용된 점진적 변화의 샷들에 대해 중복된 이미지들 및 겹치는 픽셀들을 제한할 수 있게 된다.Particularly, the shot boundary detection unit 120 according to the present invention additionally uses KCF filters (Kernelized Correlation Filters) used in the optical flow tracking algorithm, so that even when scene transition techniques such as dissolve or fade in / out are applied, the shot boundary is applied. Will accurately detect. At this time, the KCF filter is a specific classification model that is mostly used to distinguish the target from the surrounding environment as one of the tracker functions.For the shots of gradual change applied with a scene change technique such as dissolve or fade in / out due to the use of the KCF filter It is possible to limit overlapping images and overlapping pixels.

다시 도 1을 참조하면, 샷 샤이즈 결정부(130)는 샷 경계 검출부(120)에 의해 샷 경계가 검출되면, 각 샷을 구성하는 이미지들 중 적어도 하나의 이미지를 기초로 해당 샷의 샷 사이즈를 결정한다. 일 실시예에 있어서, 샷 사이즈 결정부(130)는 각 샷들 중 샷 경계에 위치하는 샷 경계 이미지(예컨대, 샷의 시작부분 이미지나 샷의 종료부분 이미지)를 기초로 해당 샷의 샷 사이즈를 결정할 수 있다. 이하, 설명의 편의를 위해 샷 사이즈 결정부(130)는 샷 경계 이미지를 기초로 샷 사이즈를 결정하는 것으로 설명하기로 한다. Referring back to FIG. 1, when the shot boundary is detected by the shot boundary detection unit 120, the shot sizing determiner 130 determines the shot size of the shot based on at least one image among images constituting each shot. Decide. In one embodiment, the shot size determination unit 130 determines the shot size of the shot based on a shot boundary image (eg, a beginning image of a shot or an end image of a shot) positioned at a shot boundary among each shot You can. Hereinafter, for convenience of description, the shot size determination unit 130 will be described as determining the shot size based on the shot boundary image.

샷 사이즈 결정부(130)는 샷 경계 이미지로부터 대표객체를 추출하고, 추출된 대표객체의 크기를 기초로 해당 샷의 샷 사이즈를 결정할 수 있다. 샷 사이즈 결정부(130)는 각 샷 별로 결정된 샷 사이즈를 해당 샷의 샷 경계 이미지와 함께 샷 사이즈 데이터베이스(135)에 저장한다.The shot size determination unit 130 may extract the representative object from the shot boundary image and determine the shot size of the corresponding shot based on the size of the extracted representative object. The shot size determination unit 130 stores the shot size determined for each shot in the shot size database 135 together with the shot boundary image of the corresponding shot.

이하, 도 3을 참조하여 본 발명에 따른 샷 사이즈 결정부(130)의 구성을 보다 구체적으로 설명한다.Hereinafter, the configuration of the shot size determination unit 130 according to the present invention will be described in more detail with reference to FIG. 3.

도 3은 본 발명의 일 실시예에 따른 샷 사이즈 결정부의 구성을 보여주는 블록도이다. 도 3에 도시된 바와 같이, 본 발명의 일 실시예에 따른 샷 사이즈 결정부(130)는 타겟영역 검출부(310), 대표객체 결정부(320), 및 샷 사이즈 판단부(330)를 포함한다. 또한, 본 발명에 따른 샷 사이즈 결정부(130)는 후보 타겟영역 결정부(340)를 더 포함할 수 있다.3 is a block diagram showing the configuration of a shot size determination unit according to an embodiment of the present invention. 3, the shot size determination unit 130 according to an embodiment of the present invention includes a target area detection unit 310, a representative object determination unit 320, and a shot size determination unit 330 . In addition, the shot size determination unit 130 according to the present invention may further include a candidate target area determination unit 340.

타겟영역 검출부(310)는 샷 경계 이미지 내에서 미리 정해진 크기를 갖는 바운딩 박스(Bounding box) 이미지를 이용하여 샷 경계 이미지로부터 사람에 해당하는 적어도 하나의 타겟영역을 추출한다. 예컨대, 타겟영역 검출부(310)는 도 4a에 도시된 바와 같이 샷 경계 이미지 내에서 바운딩 박스의 상단왼쪽 꼭지점과 하단 오른쪽 꼭지점을 정하여 각 사람의 움직임 방향에 가깝게 바운딩 박스들을 그려 각 사람에 해당하는 타겟영역들(410~440)을 추출한다. 도 4a의 경우 샷 경계 이미지 내에 복수의 사람이 존재하므로, 복수개의 타겟영역(410~440)이 추출된다.The target region detection unit 310 extracts at least one target region corresponding to a person from the shot boundary image using a bounding box image having a predetermined size in the shot boundary image. For example, the target area detection unit 310 determines the upper left vertex and lower right vertex of the bounding box within the shot boundary image as shown in FIG. 4A, draws bounding boxes close to each person's movement direction, and targets each person. Regions 410 to 440 are extracted. 4A, a plurality of target areas 410 to 440 are extracted because a plurality of people exist in the shot boundary image.

대표객체 결정부(320)는 타겟영역 추출부(310)에 의해 추출된 적어도 하나의 타겟영역들 중 크기가 가장 큰 타겟영역을 대표객체로 결정한다. 예컨대, 도 4b에 도시된 바와 같이 복수개의 타겟영역들(410~440)이 추출된 경우 대표객체 결정부(320)는 검출된 타겟영역들(410~440) 중에서 길이의 너비가 가장 큰 타겟영역(410)을 대표객체로 결정한다.The representative object determination unit 320 determines a target region having the largest size among at least one target region extracted by the target region extraction unit 310 as a representative object. For example, as illustrated in FIG. 4B, when a plurality of target regions 410 to 440 are extracted, the representative object determining unit 320 is a target region having the largest width among the detected target regions 410 to 440. 410 is determined as a representative object.

샷 사이즈 판단부(330)는 대표객체 결정부(320)에 의해 결정된 대표객체의 샷 사이즈를 검출하고, 검출된 대표객체의 샷 사이즈를 해당 샷의 샷 사이즈로 결정한다. 예컨대, 샷 사이즈는 익스트림 클로즈업(Extreme Close up), 클로즈업(Close up), 바스트샷(Bust Shot), 웨스트샷(West Shot), 미디엄샷(Midium Shot), 니샷(Knee Shot), 풀샷(Full Shot), 롱샷(Lone Shot), 및 익스트림 롱샷(Extreme Long Shot) 중 적어도 하나를 포함할 수 있다.The shot size determination unit 330 detects the shot size of the representative object determined by the representative object determination unit 320 and determines the detected shot size of the representative object as the shot size of the corresponding shot. For example, the shot size is Extreme Close up, Close up, Bust Shot, West Shot, Medium Shot, Knee Shot, Full Shot ), Long Shot (Lone Shot), and Extreme Long Shot (Extreme Long Shot).

일 실시예에 있어서, 샷 사이즈 판단부(330)는 딥러닝 기반의 학습모델을 이용하여 각 샷 사이즈 별로 설정된 기준 이미지 내의 객체와 대표객체를 비교함으로써 대표객체의 샷 사이즈를 검출할 수 있다. 예컨대, 도 4a 및 도 4b에 도시된 샷 경계 이미지의 경우 대표객체의 샷 사이즈는 풀샷이므로, 샷 사이즈 판단부(330)는 해당 샷의 샷 사이즈를 풀샷로 결정할 수 있다. 다른 예로, 도 4c에 도시된 샷 경계 이미지의 경우 대표객체의 샷 사이즈는 클로즈업이므로, 샷 사이즈 판단부(330)는 해당 샷의 샷 사이즈를 클로즈업샷으로 결정할 수 있다.In one embodiment, the shot size determination unit 330 may detect the shot size of the representative object by comparing the object in the reference image set for each shot size with the representative object using a deep learning-based learning model. For example, in the case of the shot boundary images illustrated in FIGS. 4A and 4B, since the shot size of the representative object is a full shot, the shot size determination unit 330 may determine the shot size of the corresponding shot as a full shot. As another example, in the case of the shot boundary image illustrated in FIG. 4C, since the shot size of the representative object is close-up, the shot size determination unit 330 may determine the shot size of the corresponding shot as a close-up shot.

한편, 영화의 흐름상 크기가 가장 큰 타겟영역이 대표객체에 해당하지 않을 수 있다. 예컨대, 도 5a 및 도 5b에 도시된 바와 같이 크기가 가장 큰 타겟영역(510)이 사람의 정면을 포함하고 있지 않은 경우 사람의 정면이 포함된 두 번째로 크기가 큰 타겟영역(520)이 대표객체에 해당할 수 있다. 따라서, 본 발명에 따른 샷 사이즈 결정부(130)는 대표객체를 정확하게 결정하기 위해, 도 3에 도시된 바와 같이 후보타겟영역 결정부(340)를 더 포함할 수 있다.Meanwhile, the target area having the largest size in the flow of the movie may not correspond to the representative object. For example, as illustrated in FIGS. 5A and 5B, when the target area 510 having the largest size does not include the front of the person, the second largest target area 520 including the front of the person is representative. It can correspond to an object. Accordingly, the shot size determining unit 130 according to the present invention may further include a candidate target region determining unit 340 as illustrated in FIG. 3 to accurately determine a representative object.

후보타겟영역 결정부(340)는 타겟영역 검출부(310)에 의해 복수개의 타겟영역이 검출되면, 검출된 복수개의 타겟영역들 중 정면에서 촬영된 사람이 포함되어 있는 후보 타겟영역들을 결정한다. 일 실시예에 있어서, 후보타겟영역 결정부(340)는 색인식 기법을 이용하여 타겟영역들 중 정면에서 촬영된 사람이 포함되어 있는 타겟영역들을 선별할 수 있다.When a plurality of target areas are detected by the target area detection unit 310, the candidate target area determination unit 340 determines candidate target areas including a person photographed from the front among the detected target areas. In one embodiment, the candidate target area determining unit 340 may select target areas including a person photographed from the front of the target areas using an indexing technique.

예컨대, 후보타겟영역 결정부(340)는 도 5a 및 도 5b에서는 제1 타겟영역(510)은 정면에서 촬영된 사람이 포함되어 있지 않기 때문에 후보타겟영역으로 결정하지 않고, 정면에서 촬영된 사람이 포함되어 있는 제2 타겟영역(520)만을 후보 타겟영역으로 결정한다. 이에 따라, 크기가 가장 크지만 정면에서 촬영된 사람이 포함되어 있지 않은 제1 타겟영역(510)은 대표객체로 결정되지 않고, 크기는 두 번째로 크지만 정면에서 촬영된 사람이 포함되어 있는 제2 타겟영역(520)이 대표객체로 결정된다. For example, the candidate target area determining unit 340 does not determine the candidate target area in FIG. 5A and 5B because the first target area 510 does not include a person photographed from the front, and the person photographed from the front is not determined. Only the included second target area 520 is determined as a candidate target area. Accordingly, the first target area 510 having the largest size but not including the person photographed from the front is not determined as a representative object, and the size is the second largest but includes the person including the person photographed from the front. 2 The target area 520 is determined as a representative object.

이러한 실시예에 따르는 경우 대표객체 결정부(320)는 후보타겟영역이 복수개이면 복수개의 후보타겟영역들 중 크기가 가장 큰 후보타겟영역을 대표객체로 결정하게 된다. 예컨대, 도 5a 및 도 5b에서는 후보타겟영역이 제2 타겟영역(520) 1개 이므로 대표객체 결정부(320)는 제2 타겟영역(520)을 대표객체로 결정하며 샷 사이즈 결정부(330)는 결정된 대표객체의 샷 사이즈인 미디엄 샷을 해당 샷의 샷 사이즈로 결정한다.According to this embodiment, if the representative object determining unit 320 has a plurality of candidate target regions, the candidate target region having the largest size among the plurality of candidate target regions is determined as the representative object. For example, in FIGS. 5A and 5B, since there is one candidate target area, the second target area 520, the representative object determination unit 320 determines the second target area 520 as a representative object, and the shot size determination unit 330 Determines the medium shot, which is the shot size of the determined representative object, as the shot size of the shot.

한편, 상술한 실시예에 있어서 타겟영역 검출부(310)는 샷 경계 이미지 내에서 사람이 포함된 영역만을 타겟영역으로 검출하는 것으로 설명하였다. 하지만, 변형된 실시예에 있어서, 도 6에 도시된 바와 같이 샷 경계 이미지 내에서 사람에 해당하는 영역이 검출되지 않는 경우, 타겟영역 검출부(310)는 미리 정해진 객체가 포함된 영역을 타겟영역으로 검출할 수도 있을 것이다.Meanwhile, in the above-described embodiment, it has been described that the target area detection unit 310 detects only the area including the person in the shot boundary image as the target area. However, in a modified embodiment, as illustrated in FIG. 6, when an area corresponding to a person is not detected in the shot boundary image, the target area detection unit 310 sets the area including the predetermined object as the target area. It may be detectable.

다시 도 1을 참조하면, 클라이맥스 구간 결정부(140)는 샷 사이즈 데이터베이스(135)로부터 샷 사이즈 결정부(130)에 의해 결정된 각 샷들의 샷 사이즈 변화패턴을 추출하고, 추출된 각 샷들의 샷 사이즈 변화 패턴을 기초로 영상 데이터의 클라이맥스 구간을 결정한다.Referring back to FIG. 1, the climax section determination unit 140 extracts a shot size change pattern of each shot determined by the shot size determination unit 130 from the shot size database 135, and the shot size of each extracted shot The climax section of the image data is determined based on the change pattern.

구체적으로, 클라이맥스 구간 결정부(140)는 각 샷들의 샷 사이즈를 기초로 하나의 씬(Scene) 또는 하나의 시퀀스(Sequence)를 구성하는 연속된 샷들의 샷 사이즈 변화패턴을 산출한다. 이를 위해, 클라이맥스 구간 결정부(140)는 샷 사이즈를 기초로 각 샷에 대한 스코어링(Scoring)을 수행할 수 있다. 일 예로, 클라이맥스 구간 결정부(140)는 샷 사이즈가 감소하는 순서(예컨대, 익스트림 클로즈업(Extreme Close up), 클로즈업(Close up), 바스트샷(Bust Shot), 웨스트샷(West Shot), 미디엄샷(Midium Shot), 니샷(Knee Shot), 풀샷(Full Shot), 롱샷(Lone Shot), 및 익스트림 롱샷(Extreme Long Shot 순서)에 따라 그 점수가 증가하도록 각 샷들에 대한 스코어링을 수행할 수 있다.Specifically, the climax section determiner 140 calculates a shot size change pattern of consecutive shots constituting a scene or a sequence based on the shot size of each shot. To this end, the climax section determiner 140 may perform scoring for each shot based on the shot size. As an example, the climax section determining unit 140 may reduce the order of shot sizes (eg, extreme close up, close up, bust shot, bust shot, west shot, medium shot) Scoring for each shot may be performed to increase the score according to (Midium Shot), Knee Shot, Full Shot, Long Shot, and Extreme Long Shot order.

예컨대, 클라이맥스 구간 결정부(140)는 샷 사이즈가 익스트림 클로즈업인 경우 해당 샷에 1점을 할당하고, 샷 사이즈가 클로즈업인 경우 해당 샷에 2점을 할당하며, 샷 사이즈가 바스트샷인 경우 해당 샷에 3점을 할당하고, 샷 사이즈가 웨스트샷인 경우 해당 샷에 4점을 할당하며, 샷사이즈가 미디엄샷인 경우 해당 샷에 5점을 할당하고, 샷 사이즈가 니샷인 경우 해당 샷에 6점을 할당하며, 샷 사이즈가 풀샷인 경우 해당 샷에 7점을 할당하고, 샷 사이즈가 롱샷인 경우 해당 샷에 8점을 할당하며, 샷 사이즈가 익스트림 롱샷인 경우 해당 샷에 9점을 할당할 수 있다.For example, the climax section determiner 140 assigns 1 point to the shot when the shot size is extreme close-up, and 2 points to the shot when the shot size is close-up, and the shot when the shot size is the bust shot Assign 3 points to, if the shot size is West Shot, assign 4 points to the shot, if the shot size is medium shot, assign 5 points to the shot, and if the shot size is knee shot, 6 to the shot You can assign 7 points to the shot if the shot size is full shot, 8 points to the shot if the shot size is long shot, and 9 points to the shot if the shot size is extreme long shot. have.

클라이맥스 구간 결정부(140)는 각 샷 별로 할당된 점수를 기초로 샷 사이즈의 변화패턴을 생성할 수 있다. 클라이맥스 구간 결정부(140)에 의해 생성된 샷 사이즈의 변화패턴이 도 7에 도시되어 있다. 도 7에서는 샷 사이즈가 익스트림 클로즈업샷, 클로즈업샷, 미디엄샷, 풀샷, 롱샷, 익스트림 롱샷만을 포함하는 것으로 도시하였지만, 이는 하나의 예에 불과할 뿐 샷 사이즈는 바스트샷, 웨스트샷, 니샷 등을 추가로 포함할 수도 있을 것이다.The climax section determiner 140 may generate a change pattern of the shot size based on the score allocated for each shot. The change pattern of the shot size generated by the climax section determining unit 140 is illustrated in FIG. 7. In FIG. 7, the shot size is shown to include only extreme close-up shot, close-up shot, medium shot, full shot, long shot, and extreme long shot, but this is only one example, and the shot size is bust shot, west shot, knee shot, etc. It may include.

또한, 상술한 실시예에 있어서는 클라이맥스 구간 결정부(140)는 샷 사이즈가 감소하는 순서에 따라 그 점수가 증가하도록 각 샷들에 대한 스코어링을 수행하는 것으로 설명하였지만, 이와 달리 클라이맥스 구간 결정부(140)는 샷 사이즈가 증가하는 순서에 따라 그 점수가 증가하도록 각 샷들에 대한 스코어링을 수행할 수도 있을 것이다.In addition, in the above-described embodiment, the climax section determination unit 140 has been described as performing scoring for each shot so that the score increases in the order in which the shot size decreases, but unlike this, the climax section determination unit 140 Scoring may be performed for each shot so that the score increases in the order in which the shot sizes increase.

한편, 클라이맥스 구간 결정부(140)는 샷 사이즈 변화패턴이 산출되면, 이를 기초로 클라이맥스 구간을 결정한다. 구체적으로, 클라이맥스 구간 결정부(140)는 샷 사이즈 변화패턴을 기초로 각 샷 별로 할당된 점수의 변화가 가장 큰 지점을 클라이맥스의 시작지점으로 결정할 수 있다. 도 7에 도시된 예의 경우, 클라이맥스 구간 결정부(140)는 각 샷 별로 할당된 점수의 변화가 가장 큰 지점 A를 클라이맥스의 시작지점으로 결정할 수 있다.Meanwhile, when the shot size change pattern is calculated, the climax section determiner 140 determines the climax section based on the shot size change pattern. Specifically, the climax section determiner 140 may determine the point where the change in the score allocated for each shot is the largest as the starting point of the climax based on the shot size change pattern. In the example shown in FIG. 7, the climax section determiner 140 may determine a point A having the largest change in score assigned to each shot as a starting point of the climax.

한편, 각 샷 별로 할당된 점수의 변화가 가장 큰 지점이 복수개인 경우, 클라이맥스 구간 결정부(140)는 각 샷에서의 카메라 앵글 변화를 추가로 이용하여 클라이맥스의 시작시점을 선택할 수 있다.On the other hand, if there are a plurality of points having the largest change in the score allocated for each shot, the climax section determiner 140 may further select the starting point of the climax by additionally using a change in camera angle in each shot.

구체적으로, 클라이맥스 구간 결정부(140)는 샷 사이즈를 기초로 한 각 샷의 스코어링과 유사하게 카메라 앵글을 기초로 각 샷에 대한 스코어링을 수행할 수 있다. 이러한 실시예에 따르는 경우 클라이맥스 구간 결정부(140)는 각 샷 별로 할당된 점수의 변화가 가장 큰 복수개의 지점들 중 카메라 앵글을 기초로 결정된 각 샷의 점수의 변화가 가장 큰 지점을 클라이맥스의 시작시점으로 결정할 수 있다.Specifically, the climax section determiner 140 may perform scoring for each shot based on the camera angle, similar to scoring of each shot based on the shot size. In accordance with this embodiment, the climax section determining unit 140 starts the climax of the points having the largest change in the score of each shot determined based on the camera angle among the plurality of points having the largest change in the score allocated for each shot. You can decide by time.

한편, 클라이맥스 구간 결정부(140)는 장면전환이 발생되는 지점을 클라이맥스의 종료시점으로 결정할 수 있다. 일 예로, 클라이맥스 구간 결정부(140)는 샷 사이즈가 익스트림 롱샷으로 일정시간 동안 유지되는 지점을 장면전환이 발생된 지점으로 판단할 수 있다. 예컨대 도 7에서, 클라이맥스 구간 결정부(140)는 지점 B를 클라이맥스의 종료시점으로 결정할 수 있다. 이러한 예에 따르는 경우 지점 A와 지점 B사이의 구간이 클라이맥스 구간으로 결정된다.Meanwhile, the climax section determiner 140 may determine a point at which the scene change occurs as the end point of the climax. For example, the climax section determiner 140 may determine a point where the shot size is maintained for a predetermined time as an extreme long shot as a point where a scene change occurs. For example, in FIG. 7, the climax section determiner 140 may determine point B as the end point of the climax. According to this example, a section between points A and B is determined as a climax section.

데이터베이스 생성부(150)는 클라이맥스 구간 결정부(140)에 의해 결정된 클라이맥스 구간의 샷 사이즈 변화 패턴을 영상 데이터의 식별정보와 매핑시켜 저장함으로써 영상 분석용 데이터베이스(160)를 생성한다. 예컨대, 데이터베이스 생성부(150)는 클라이맥스 구간에 포함된 각 샷 별로 해당 샷의 식별정보와 샷 사이즈 정보를 매핑시켜 저장할 수 있다. 이때, 각 샷의 식별정보는 각 샷의 시간순서에 따라 순차적으로 할당될 수 있고, 샷 사이즈 정보는 해당 샷 사이즈를 기초로 해당 샷에 부여된 점수일 수 있다.The database generating unit 150 generates a database 160 for image analysis by mapping and storing the shot size change pattern of the climax section determined by the climax section determining unit 140 with identification information of the image data. For example, the database generating unit 150 may map and store identification information of the corresponding shot and shot size information for each shot included in the climax section. At this time, the identification information of each shot may be sequentially assigned according to the time sequence of each shot, and the shot size information may be a score given to the shot based on the shot size.

일 실시예에 있어서, 데이터베이스 생성부(150)는 클라이맥스 구간의 샷 사이즈 변화 패턴을 해당 영상 데이터의 장르정보와 함께 저장할 수 있다. 본 발명에서 데이터베이스 생성부(150)가 클라이맥스 구간의 샷 사이즈 변화 패턴을 해당 영상 데이터의 장르정보와 함께 저장하는 이유는, 영상 제작자가 생성하기 원하는 장르에 대한 영상 분석정보만을 영상 제작자에게 용이하게 제공하기 위한 것이다. In one embodiment, the database generating unit 150 may store the shot size change pattern of the climax section together with genre information of the corresponding image data. In the present invention, the reason why the database generation unit 150 stores the shot size change pattern of the climax section together with the genre information of the corresponding image data is that the image producer easily provides only the image analysis information for the genre that the image producer wants to generate. It is to do.

이와 같이, 본 발명에서는 데이터베이스 생성부(150)가 클라이맥스 구간의 샷 사이즈 변화 패턴을 해당 영상 데이터의 장르정보와 함께 매칭시켜 저장한 영상 분석용 데이터베이스(160)를 생성하기 때문에, 영상 제작자는 영상 분석용 데이터베이스(160)에 저장된 영상 데이터의 샷 사이즈 변화패턴을 특정 영상 데이터의 샷 사이즈 변화패턴과 비교함으로써 특정 영상 데이터의 흥행 성공 여부를 예측할 수 있게 된다.As described above, in the present invention, since the database generation unit 150 matches the shot size change pattern of the climax section with genre information of the corresponding image data and generates the stored image analysis database 160, the image producer analyzes the image. By comparing the shot size change pattern of the image data stored in the usage database 160 with the shot size change pattern of the specific image data, it is possible to predict whether or not the success of the specific image data is successful.

또한, 영상 제작자가 특정 장르의 영상 생성을 희망하는 경우, 영상 분석용 데이터베이스(160)에서 해당 장르의 영상 데이터의 샷 사이즈 변화패턴을 추출한 후 이를 그대로 이용함으로써 영상 데이터를 용이하게 생성할 수 있을 뿐만 아니라, 흥행에 성공한 영상 데이터의 샷 사이즈 변화패턴을 추출하여 이용하는 경우 영상 제작자가 생성하기 원하는 영상의 흥행 가능성을 높일 수 있게 된다. In addition, when an image producer wishes to generate a video of a specific genre, it is possible to easily generate video data by extracting a shot size change pattern of the video data of the genre from the video analysis database 160 and using the same as it is. In addition, when the shot size change pattern of the image data that has been successful is extracted and used, it is possible to increase the likelihood of the image that the image producer wants to generate.

이하, 도 8을 참조하여 본 발명에 따른 샷 사이즈 변화패턴을 이용한 영상 데이터 분석방법을 설명한다.Hereinafter, an image data analysis method using a shot size change pattern according to the present invention will be described with reference to FIG. 8.

도 8은 본 발명의 일 실시예에 따른 샷 사이즈 변화패턴을 이용한 영상 데이터 분석방법을 보여주는 플로우 차트이다. 도 8에 도시된 샷 사이즈 변화패턴을 이용한 영상 데이터 분석방법은 도 1에 도시된 샷 사이즈 변화패턴을 이용한 영상 데이터 분석 시스템(이하, '분석 시스템'이라 함)에 의해 수행될 수 있다.8 is a flowchart illustrating an image data analysis method using a shot size change pattern according to an embodiment of the present invention. The image data analysis method using the shot size change pattern illustrated in FIG. 8 may be performed by an image data analysis system (hereinafter referred to as an 'analysis system') using the shot size change pattern illustrated in FIG. 1.

먼저, 분석 시스템은 샷 경계 검출 알고리즘을 이용하여 영상 데이터로부터 샷들 간의 경계를 검출한다(S800). 이때 샷 경계 검출의 대상이 되는 영상 데이터는 영화, 드라마, 또는 애니메이션과 같이 시나리오 기반의 영상 데이터이거나 뮤직비디오 등과 같은 영상 데이터일 수도 있다. 이러한 영상 데이터는 인터넷과 같은 네트워크를 통해 외부로부터 수신되거나 분석 시스템에 포함된 영상 데이터베이스로부터 직접 획득되거나 모바일 디바이스, 하드디스크, 또는 USB(Universal Serial Bus)와 같은 별도의 저장매체로부터 수신할 수 있다.First, the analysis system detects a boundary between shots from image data using a shot boundary detection algorithm (S800). In this case, the image data subject to the detection of the shot boundary may be scenario-based image data such as a movie, drama, or animation, or image data such as a music video. The image data may be received from the outside through a network such as the Internet or directly from an image database included in the analysis system, or may be received from a separate storage medium such as a mobile device, hard disk, or USB (Universal Serial Bus).

일 실시예에 있어서 분석 시스템은 컬러 히스토그램 알고리즘, 에지 히스토그램 알고리즘, 및 옵티컬 플로우 트래킹 알고리즘을 이용하여 샷 경계를 검출할 수 있다. 분석 시스템이 복수개의 샷 경계 검출알고리즘을 혼합하여 샷 경계를 검출하는 이유는, 영화와 같은 영상 데이터의 경우 샷과 샷의 연결부분에 장면전환기법이 많이 사용되고, 조명의 다양한 사용으로 인해 동일한 장면임에도 불구하고 조명의 밝기가 극심하게 변화할 수 있으며, 익스트림 클로즈업 상태에 있을 때 객체가 이동하면 화면 전체의 영상이 변경되는 것으로 판단될 수 있는 특성이 있기 때문에, 하나의 샷 경계 알고리즘만을 이용하여 샷 경계를 검출하게 되면 정확한 샷 경계가 검출되지 않기 때문이다.In one embodiment, the analysis system may detect a shot boundary using a color histogram algorithm, an edge histogram algorithm, and an optical flow tracking algorithm. The reason why the analysis system detects a shot boundary by mixing a plurality of shot boundary detection algorithms is that in the case of image data such as a movie, a scene change technique is often used in the connection portion between a shot and a shot, and it is the same scene due to various uses of lighting. In spite of this, the brightness of the lighting can change dramatically, and when the object moves while in extreme close-up, it can be judged that the entire image of the screen is changed. This is because the correct shot boundary is not detected when is detected.

이후, 분석 시스템은 각 샷을 구성하는 이미지들 중 적어도 하나의 이미지로부터 사람에 해당하는 적어도 하나의 타겟영역을 추출한다(S810). 일 실시예에 있어서, 분석 시스템은 각 샷들 중 샷 경계에 위치하는 샷 경계 이미지로부터 타겟영역을 추출할 수 있다.Thereafter, the analysis system extracts at least one target area corresponding to the person from at least one of the images constituting each shot (S810). In one embodiment, the analysis system may extract a target area from a shot boundary image located at a shot boundary among each shot.

구체적으로, 분석 시스템은 샷 경계 이미지 내에서 미리 정해진 크기를 갖는 바운딩 박스(Bounding box) 이미지를 이용하여 샷 경계 이미지로부터 사람에 해당하는 적어도 하나의 타겟영역을 추출한다. 분석 시스템은 샷 경계 이미지 내에서 사람에 해당하는 타겟영역이 검출되지 않는 경우, 미리 정해진 객체가 포함된 영역을 타겟영역으로 검출할 수 있다.Specifically, the analysis system extracts at least one target area corresponding to a person from the shot boundary image using a bounding box image having a predetermined size in the shot boundary image. When the target area corresponding to the person is not detected in the shot boundary image, the analysis system may detect the area including the predetermined object as the target area.

이후, 분석 시스템은 S810에서 추출된 적어도 하나의 타겟영역들 중 크기가 가장 큰 타겟영역을 대표객체로 결정한 후(S820), 결정된 대표객체의 샷 사이즈를 해당 샷의 샷 사이즈로 결정한다(S830).Thereafter, the analysis system determines the target area having the largest size among the at least one target areas extracted in S810 as the representative object (S820), and then determines the determined shot size of the representative object as the shot size of the corresponding shot (S830). .

이후, 분석 시스템은 S830에서 결정된 샷 사이즈의 변화패턴을 이용하여 영상 데이터의 클라이맥스 구간을 결정한다(S840). 일 실시예에 있어서, 분석 시스템은 클라이맥스 구간을 결정하기 위해, 각 샷의 샷 사이즈를 기초로 해당 샷에 대해 점수를 할당할 수 있다. 이때, 분석 시스템은는 샷 사이즈가 감소하는 순서에 따라 그 점수가 증가하도록 각 샷들에 대해 점수를 할당하거나 샷 사이즈가 증가하는 순서에 따라 그 점수가 증가하도록 각 샷들에 대한 점수를 할당할 수 있다.Thereafter, the analysis system determines a climax section of the image data using the change pattern of the shot size determined in S830 (S840). In one embodiment, the analysis system may assign a score for the shot based on the shot size of each shot to determine the climax interval. At this time, the analysis system may allocate a score for each shot so that the score increases in the order in which the shot size decreases or a score for each shot so that the score increases in the order in which the shot size increases.

분석 시스템은 각 샷에 할당된 점수를 기초로 복수개의 샷 들 중 가장 큰 점수변화가 발생되는 지점을 상기 클라이맥스의 시작시점으로 결정하고, 장면전환이 발생되는 지점을 상기 클라이맥스의 종료시점으로 결정할 수 있다. 장면전환이 발생되는 지점 결정과 관련하여, 분석 시스템은 샷 사이즈가 익스트림 롱샷으로 일정시간 동안 유지되는 지점을 장면전환이 발생된 지점으로 판단할 수 있다.Based on the score assigned to each shot, the analysis system may determine the point at which the largest score change among the plurality of shots occurs as the start point of the climax, and determine the point at which the scene change occurs as the end point of the climax. have. With regard to determining the point where the scene change occurs, the analysis system may determine the point where the shot size is maintained as an extreme long shot for a predetermined time as the point where the scene change occurs.

한편, 각 샷 별로 할당된 점수의 변화가 가장 큰 지점이 복수개인 경우, 분석 시스템은 각 샷에서의 카메라 앵글 변화를 추가로 이용하여 클라이맥스의 시작시점을 선택할 수 있다. 한편, 각 샷 별로 할당된 점수의 변화가 가장 큰 지점이 복수개인 경우, 클라이맥스 구간 결정부(140)는 각 샷에서의 카메라 앵글 변화를 추가로 이용하여 클라이맥스의 시작시점을 선택할 수 있다.On the other hand, when there are a plurality of points having the largest change in the score allocated for each shot, the analysis system may further select the starting point of the climax by additionally using a change in camera angle in each shot. On the other hand, if there are a plurality of points having the largest change in the score allocated for each shot, the climax section determiner 140 may further select the starting point of the climax by additionally using a change in camera angle in each shot.

예컨대, 분석 시스템은 샷 사이즈를 기초로 한 각 샷의 스코어링과 유사하게 카메라 앵글을 기초로 각 샷에 대한 스코어링을 수행하고, 각 샷 별로 할당된 점수의 변화가 가장 큰 복수개의 지점들 중 카메라 앵글을 기초로 결정된 각 샷의 점수의 변화가 가장 큰 지점을 클라이맥스의 시작시점으로 결정할 수 있다.For example, the analysis system performs scoring for each shot based on the camera angle, similar to scoring of each shot based on the shot size, and the camera angle among a plurality of points having the largest change in score allocated for each shot The point where the change in the score of each shot determined based on is the largest may be determined as the starting point of the climax.

한편, 분석 시스템은 각 샷 별로 할당된 점수를 기초로 샷 사이즈의 변화패턴을 생성할 수 있다.Meanwhile, the analysis system may generate a change pattern of the shot size based on the score allocated for each shot.

이후, 분석 시스템은 결정된 클라이맥스 구간의 샷 사이즈 변화 패턴을 영상 데이터의 식별정보와 매핑시켜 저장한다(S850).Thereafter, the analysis system maps and stores the determined shot size change pattern of the climax section with the identification information of the image data (S850).

예컨대, 분석 시스템은 클라이맥스 구간에 포함된 각 샷 별로 해당 샷의 식별정보와 샷 사이즈 정보를 매핑시켜 저장할 수 있다. 이때, 각 샷의 식별정보는 각 샷의 시간순서에 따라 순차적으로 할당될 수 있고, 샷 사이즈 정보는 해당 샷 사이즈를 기초로 해당 샷에 부여된 점수일 수 있다.For example, the analysis system may map and store identification information of the corresponding shot and shot size information for each shot included in the climax section. At this time, the identification information of each shot may be sequentially assigned according to the time sequence of each shot, and the shot size information may be a score given to the shot based on the shot size.

한편, 도 8에 도시하지는 않았지만, 분석 시스템은 추출된 적어도 하나의 타겟영역들 중 정면에서 촬영된 사람이 포함되어 있는 후보 타겟영역들을 결정하는 단계를 더 포함할 수 있다. 본 발명에 따른 분석 시스템이 타겟영역들 중 후보 타겟영역들을 결정하는 이유는 영화의 흐름상 크기가 가장 큰 타겟영역이 대표객체에 해당하지 않을 수 있기 때문이다.Meanwhile, although not illustrated in FIG. 8, the analysis system may further include determining candidate target areas including a person photographed from the front among at least one extracted target areas. The reason the analysis system according to the present invention determines candidate target areas among the target areas is that the target area having the largest size in the flow of the movie may not correspond to the representative object.

이러한 실시예에 따르는 경우 분석 시스템은 S830을 통해 검출된 복수개의 타겟영역이 추출되면, 추출된 복수개의 타겟영역들 중 정면에서 촬영된 사람이 포함되어 있는 후보 타겟영역들을 결정하고, 결정된 후보 타겟영역들 중 크기가 가장 큰 타겟영역을 대표객체로 결정하게 된다. 이때, 분석 시스템은 색인식 기법을 이용하여 타겟영역들 중 정면에서 촬영된 사람이 포함되어 있는 타겟영역들을 선별할 수 있다.According to this embodiment, when a plurality of target areas detected through S830 are extracted, the analysis system determines candidate target areas including a person photographed from the front among the extracted plurality of target areas, and determines the determined candidate target areas The target area with the largest size is determined as the representative object. At this time, the analysis system may select target regions including a person photographed from the front of the target regions using an indexing technique.

본 발명이 속하는 기술분야의 당업자는 상술한 본 발명이 그 기술적 사상이나 필수적 특징을 변경하지 않고서 다른 구체적인 형태로 실시될 수 있다는 것을 이해할 수 있을 것이다.Those skilled in the art to which the present invention pertains will understand that the above-described present invention can be implemented in other specific forms without changing its technical spirit or essential features.

예컨대, 도 8에 도시된 샷 사이즈 변화패턴을 이용한 영상 데이터 분석방법은 프로그램 형태로 구현될 수도 있을 것이다.For example, the image data analysis method using the shot size change pattern illustrated in FIG. 8 may be implemented in a program form.

그러므로, 이상에서 기술한 실시예들은 모든 면에서 예시적인 것이며 한정적인 것이 아닌 것으로 이해해야만 한다. 본 발명의 범위는 상기 상세한 설명보다는 후술하는 특허청구범위에 의하여 나타내어지며, 특허청구범위의 의미 및 범위 그리고 그 등가 개념으로부터 도출되는 모든 변경 또는 변형된 형태가 본 발명의 범위에 포함되는 것으로 해석되어야 한다.Therefore, it should be understood that the embodiments described above are illustrative in all respects and not restrictive. The scope of the present invention is indicated by the following claims rather than the above detailed description, and it should be interpreted that all changes or modifications derived from the meaning and scope of the claims and equivalent concepts are included in the scope of the present invention. do.

100: 샷 사이즈 변화패턴을 이용한 영상 데이터 분석 시스템
110: 영상 데이터 수신부 120: 샷 경계 검출부
130: 샷 사이즈 결정부 140: 클라이맥스 구간 결정부
150: 데이터베이스 생성부 160: 영상 분석용 데이터베이스
310: 타겟영역 검출부 320: 대표객체 결정부
330: 샷 사이즈 판단부 340: 후보 타겟영역 결정부100: image data analysis system using a shot size change pattern
110: image data receiving unit 120: shot boundary detection unit
130: shot size determining unit 140: climax section determining unit
150: database generation unit 160: database for image analysis
310: target area detection unit 320: representative object determination unit
330: shot size determination unit 340: candidate target area determination unit

Claims

A shot boundary detection unit that detects a boundary between shots from image data using a shot boundary detection algorithm;
A shot size determination unit that determines a shot size of a corresponding shot based on a size of a representative object extracted from at least one of the images constituting each shot;
A climax section determination unit for determining a climax section of the image data using the change pattern of the shot size; And
And a database generator configured to map and store the determined shot size change pattern of the climax section with identification information of the image data,
The shot size determination unit,
Within the image, at least one target region corresponding to a person is extracted using a bounding box image, and candidate target regions including a person photographed in front of the extracted at least one target regions are determined and determined The target area having the largest size among candidate target areas is determined as the representative object, and the shot size of the representative object is determined as the shot size of the corresponding shot,
An image data analysis system using a shot size change pattern, characterized in that the shot size of the representative object is obtained by comparing an object in a reference image set for each shot size with the representative object based on a deep learning-based learning model.

According to claim 1,
The climax section determining unit,
Based on the shot size of each shot, a score is assigned to the shot, and the point at which the largest score change occurs among the plurality of shots is determined as the start point of the climax, and the point at which the scene change occurs is the end of the climax An image data analysis system using a shot size change pattern characterized by determining the viewpoint.

According to claim 2,
The climax section determining unit,
When there are a plurality of points where the largest score change occurs, an image data analysis system using a shot size change pattern is characterized in that the starting point of the climax is determined by additionally using a camera angle change in each shot.

According to claim 3,
The climax section determining unit,
A score is assigned to a corresponding shot based on the camera angle, and a point in which a change in score allocated based on the camera angle is the greatest among a plurality of points where the largest score change occurs is a starting point of the climax. Video data analysis system using a shot size change pattern, characterized in that to determine.

delete

According to claim 1,
The shot size determination unit,
An image data analysis system using a shot size change pattern, characterized in that the target target regions including a person photographed in front of the target regions are determined using the indexing technique, and the candidate target regions are determined.

According to claim 1,
The shot size determination unit,
If an area corresponding to a person is not detected in the image, at least one target area including a predetermined object is detected, and a target area having the largest size among the extracted at least one target areas is determined as a representative object. , The image data analysis system using the shot size change pattern characterized in that the determined shot size of the representative object is determined as the shot size of the corresponding shot.

According to claim 1,
The shot boundary detection unit,
An image data analysis system using a shot size change pattern, characterized in that the shot boundary is detected using a color histogram algorithm, an edge histogram algorithm, and an optical flow tracking algorithm.

Detecting a boundary between shots from image data using a shot boundary detection algorithm;
Determining a shot size of a corresponding shot based on a size of a representative object extracted from at least one image among images constituting each shot; And
And determining a climax section of the image data using the change pattern of the shot size,
Determining the shot size,
Extracting at least one target area corresponding to a person from the image using a bounding box image;
Determining candidate target areas including a person photographed from the front among the extracted at least one target areas;
Determining a target area having the largest size among the determined candidate target areas as the representative object; And
And determining a shot size of the determined representative object as a shot size of the corresponding shot,
The shot size of the representative object is obtained by comparing an object in a reference image set for each shot size with the representative object based on a deep learning-based learning model, and analyzing the image data using the shot size change pattern.

The method of claim 11,
In the step of determining the climax section,
Based on the shot size of each shot, a score is assigned to the shot, and the point at which the largest score change occurs among the plurality of shots is determined as the start point of the climax, and the point at which the scene change occurs is the end of the climax Video data analysis method using a shot size change pattern, characterized in that determined by the viewpoint.

delete

The method of claim 11,
In the step of detecting the shot boundary,
A method for analyzing image data using a shot size change pattern, characterized in that the shot boundary is detected using a color histogram algorithm, an edge histogram algorithm, and an optical flow tracking algorithm.

The method of claim 11,
And mapping and storing the determined shot size change pattern of the climax section with identification information of the image data.