KR101437286B1

KR101437286B1 - Method and apparatus for identifying digital contents

Info

Publication number: KR101437286B1
Application number: KR1020130079477A
Authority: KR
Inventors: 이상훈; 오희석; 김종유
Original assignee: 연세대학교 산학협력단
Priority date: 2013-07-08
Filing date: 2013-07-08
Publication date: 2014-09-25

Abstract

The present invention relates to a method and an apparatus for identifying digital content, capable of identifying the digital content using structural features and non-structural features of the digital content. The method for identifying the digital content according to one embodiment of the present invention includes the steps of: maintaining a database storing the structural features and non-structural features of reference digital content; extracting the structural features including a line segment of a frame in target digital content, which is formed with a frame structure; extracting the non-structural features of the target digital content; and comparing the extracted structural features and non-structural features with the structural features and non-structural features of the reference digital content stored in the database.

Description

[0001] METHOD AND APPARATUS FOR IDENTIFYING DIGITAL CONTENTS [0002]

본 발명은 디지털 컨텐츠 식별 방법 및 장치에 관한 것으로, 보다 상세하게는 디지털 컨텐츠의 구조적 특징과 비구조적 특징을 이용하여 컨텐츠를 식별하는 디지털 컨텐츠 식별 방법 및 장치에 관한 것이다.BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a digital content identification method and apparatus, and more particularly, to a digital content identification method and apparatus for identifying contents using structural and non-structural features of digital contents.

디지털 컨텐츠 식별 기술은 출판물, 특히 웹상에서 불법으로 스캔되어 유통 되는 컨텐츠에 대한 저작권을 보호하기 위해 기술로서 이용될 수 있다.Digital content identification technology can be used as a technology to protect copyrights on publications, particularly content that is illegally scanned and distributed on the Web.

이미지가 포함된 디지털 컨텐츠를 식별하는 기술의 하나로 OCR(optical character reader)를 이용하는 방법이 있다. One technique for identifying digital content containing images is to use an optical character reader (OCR).

OCR을 이용하여 컨텐츠를 식별하는 방법은 디지털 컨텐츠 내에 존재하는 텍스트를 추출한 후, 색인어 등의 유사도를 판별하여 유사성을 인식하는 기술이다. OCR을 이용하여 컨텐츠를 식별하는 방법은 로 평균적으로 장당 4~5초 정도의 OCR 처리시간 소요 및 40~60% 정도의 인식 정확도 성능을 보이고 있어 실질적으로 활용이 불가능한 상황이다. A method of identifying contents using OCR is a technique of extracting text existing in digital contents and then recognizing the similarity by determining the similarity degree of an index word or the like. As a method of identifying contents using OCR, it is impossible to practically use OCR processing time of about 4 ~ 5 seconds per page and recognition accuracy of about 40 ~ 60% on average.

예를 들어 100페이지로 구성된 한권의 만화책을 OCR 소프트웨어를 이용하여 텍스트로 변환하기 위해서는 5~10분 정도의 긴 시간이 요구된다.For example, it takes 5 to 10 minutes to convert a comic book composed of 100 pages into text using OCR software.

대한민국 공개특허 제10-2011-0060450호에는 DB구조가 변경되거나 업무 프로세스가 변경되는 경우 효율적으로 변경된 정보를 수정하는 디지털 콘텐츠 식별체계 웹서비스 장치 및 방법이 개시되어 있다.Korean Patent Laid-Open No. 10-2011-0060450 discloses a digital contents identification system web service apparatus and method for correcting information that has been changed efficiently when a DB structure is changed or a business process is changed.

하지만, 대한민국 공개특허 제10-2011-0060450호에는 불법적으로 유통되는 컨텐츠에 대한 저작권을 보호하기 위한 디지털 컨텐츠 식별 기술에 대해서는 개시되어 있지 않다.However, Korean Patent Laid-Open No. 10-2011-0060450 does not disclose a digital content identification technique for protecting the copyright of illegally distributed content.

따라서 정확하고 빠르게 불법 스캔 디지털 컨텐츠 검출을 위한 디지털 컨텐츠 식별 기술에 대한 연구가 필요한 실정이다.Therefore, it is necessary to study digital contents identification technology for detecting illegal scan digital contents accurately and quickly.

본 발명의 목적은 디지털 컨텐츠의 구조적 특징과 비구조적 특징을 이용하여 정확하고 빠르게 디지털 컨텐츠를 식별하는 방법 및 장치를 제공하는 데 있다.It is an object of the present invention to provide a method and apparatus for accurately and quickly identifying digital contents using structural and non-structural features of digital contents.

상기 목적을 달성하기 위해 본 발명의 일실시예에 의하면, 기준 디지털 컨텐츠에 구조적 특징 및 비구조적 특징이 저장된 데이터베이스를 유지하는 단계; 프레임 구조로 이루어진 대상 디지털 컨텐츠에서 프레임의 직성 성분을 포함하는 구조적 특징을 추출하는 단계; 상기 대상 디지털 컨텐츠의 비구조적 특징을 추출하는 단계; 및 상기 추출된 구조적 특징 및 비구조적 특징과 상기 데이터베이스에 저장된 기준 디지털 컨텐츠의 대한 구조적 특징 및 비구조적 특징을 비교하는 단계를 포함하는 디지털 컨텐츠 식별 방법이 제공된다. According to an embodiment of the present invention, there is provided a digital content management method comprising: maintaining a database in which structural and non-structural features are stored in a reference digital content; Extracting a structural feature including a linear component of a frame from an object digital content having a frame structure; Extracting non-structural features of the digital content; And comparing the extracted structural and non-structural features with structural and non-structural features of the reference digital content stored in the database.

상기 목적을 달성하기 위해 본 발명의 일실시예에 의하면, 디지털 컨텐츠의 입력을 수신하는 입력부; 기준 디지털 컨텐츠에 구조적 특징 및 비구조적 특징이 저장된 데이터베이스; 프레임 구조로 이루어진 대상 디지털 컨텐츠에서 프레임의 직성 성분을 포함하는 구조적 특징을 추출하는 제1특징 추출부; 상기 대상 디지털 컨텐츠의 비구조적 특징을 추출하는 제2특징 추출부; 상기 추출된 구조적 특징 및 비구조적 특징과 상기 데이터베이스에 저장된 기준 디지털 컨텐츠의 대한 구조적 특징 및 비구조적 특징을 비교하는 비교부; 상기 입력부, 상기 데이터베이스, 상기 제1특징 추출부, 상기 제2특징 추출부, 및 상기 비교부를 제어하는 제어부를 포함하는 디지털 컨텐츠 식별 장치가 제공된다.According to an aspect of the present invention, there is provided an information processing apparatus including an input unit for receiving an input of digital content; A database in which structural and non-structural features are stored in the reference digital content; A first feature extracting unit for extracting a structural feature including a linear component of a frame in an object digital content having a frame structure; A second feature extraction unit for extracting non-structural features of the target digital content; A comparison unit comparing the extracted structural and non-structural features with structural and non-structural features of the reference digital content stored in the database; And a control unit for controlling the input unit, the database, the first feature extraction unit, the second feature extraction unit, and the comparison unit.

본 발명의 일실시예에 의한 디지털 컨텐츠 식별 방법 및 장치는 디지털 컨텐츠의 구조적 특징과 비구조적 특징을 이용하여 정확하고 빠르게 디지털 컨텐츠를 식별할 수 있다. The digital content identification method and apparatus according to an embodiment of the present invention can accurately and quickly identify digital contents using structural features and non-structural features of digital contents.

도 1은 본 발명의 일실시예와 관련된 디지털 컨텐츠 식별 장치의 블록도이다.
도 2는 본 발명의 일실시예와 관련된 디지털 컨텐츠 식별 방법을 나타내는 흐름도이다.
도 3은 도 2에 도시된 디지털 컨텐츠 식별 방법 중 전처리 과정의 일례를 나타내는 도면이다.
도 4는 도 2에 도시된 디지털 컨텐츠 식별 방법 중 디지털 컨텐츠의 구조적 특징을 추출하는 과정의 일례를 나타내는 도면이다.
도 5는 도 4의 과정을 통해 추출된 직선 성분을 나타낸다.1 is a block diagram of a digital content identification apparatus according to an embodiment of the present invention.
2 is a flowchart illustrating a digital content identification method in accordance with an embodiment of the present invention.
3 is a diagram illustrating an example of a preprocessing process of the digital content identification method shown in FIG.
FIG. 4 is a diagram illustrating an example of a process of extracting structural features of digital contents in the digital content identification method shown in FIG. 2. Referring to FIG.
FIG. 5 shows a linear component extracted through the process of FIG.

이하, 본 발명의 일실시예와 관련된 디지털 컨텐츠 식별 방법 및 장치에 대해 도면을 참조하여 설명하도록 하겠다.Hereinafter, a digital content identification method and apparatus according to an embodiment of the present invention will be described with reference to the drawings.

본 명세서에서 사용되는 단수의 표현은 문맥상 명백하게 다르게 뜻하지 않는 한, 복수의 표현을 포함한다. 본 명세서에서, "구성된다" 또는 "포함한다" 등의 용어는 명세서상에 기재된 여러 구성 요소들, 또는 여러 단계들을 반드시 모두 포함하는 것으로 해석되지 않아야 하며, 그 중 일부 구성 요소들 또는 일부 단계들은 포함되지 않을 수도 있고, 또는 추가적인 구성 요소 또는 단계들을 더 포함할 수 있는 것으로 해석되어야 한다.As used herein, the singular forms "a", "an" and "the" include plural referents unless the context clearly dictates otherwise. In this specification, the terms "comprising ", or" comprising "and the like should not be construed as necessarily including the various elements or steps described in the specification, Or may be further comprised of additional components or steps.

본 명세서에 디지털 컨텐츠라 함은 오프라인 컨텐츠와 대비되는 개념으로 전자 파일 형태로 제작, 유통, 소비될 수 있는 컨텐츠를 의미한다. 예를 들어, 디지털 컨텐츠는 오프라인 출판물을 스캔한 스캔본을 포함할 수 있다. In this specification, digital content refers to content that can be produced, distributed and consumed in the form of electronic files in contrast with offline contents. For example, the digital content may include a scanned copy of an off-line publication.

본 발명의 일실시예에 의한 디지털 컨텐츠는 프레임 구조로 이루어진 디지털 컨텐츠를 포함할 수 있다.The digital contents according to an embodiment of the present invention may include digital contents having a frame structure.

본 명세서에서 기준 디지털 컨텐츠는 식별의 대상이 되는 디지털 컨텐츠(이하, '대상 디지털 컨텐츠'라 함)가 어떤 컨텐츠인지를 식별하기 위해 사용되는 컨텐츠를 말한다. 예를 들어, 원본 컨텐츠가 기준 디지털 컨텐츠가 될 수 있고, 웹 상에서 유통되고 있는 컨텐츠가 대상 디지털 컨텐츠가 될 수 있다.In this specification, the reference digital content refers to content used to identify which content is digital content (hereinafter referred to as 'target digital content') to be identified. For example, the original content may be the reference digital content, and the content circulated on the web may be the target digital content.

이하에서는 디지털 컨텐츠의 예로 오프라인 출판물인 만화책을 스캔한 스캔 만화책에 대해 설명하나, 본 발명이 이에 한정되는 것은 아니다. Hereinafter, an example of digital contents will be described as a scan comic book which is a scan of a comic book as an offline publication, but the present invention is not limited thereto.

도 1은 본 발명의 일실시예와 관련된 디지털 컨텐츠 식별 장치의 블록도이다.1 is a block diagram of a digital content identification apparatus according to an embodiment of the present invention.

도시된 바와 같이, 디지털 컨텐츠 식별 장치(100)는 입력부(110), 데이터베이스(120), 전처리부(130), 제1특징 추출부(140), 제2특징 추출부(150), 비교부(160), 후보 추출부(170), 및 제어부(180)를 포함할 수 있다.2, the digital content identification apparatus 100 includes an input unit 110, a database 120, a preprocessor 130, a first feature extraction unit 140, a second feature extraction unit 150, 160, a candidate extracting unit 170, and a control unit 180.

입력부(110)는 디지털 컨텐츠를 입력을 수신할 수 있다. 상기 디지털 컨텐츠 입력 수신은 사용자 인터페이스를 통해 이루어질 수 있다.The input unit 110 may receive input of digital contents. The digital content input and reception may be performed through a user interface.

데이터베이스(120)에는 기준 디지털 컨텐츠에 대한 구조적 특징 및 비구조적 특징이 저장될 수 있다. 기준 디지털 컨텐츠에 대한 구조적 특징 및 비구조적 특징에 대해서는 후술하도록 하겠다.The database 120 may store structural and non-structural features for the reference digital content. The structural and non-structural features of the reference digital content will be described later.

전처리부(130)는 대상 디지털 컨텐츠의 보다 정확한 식별을 위해 전처리 작업을 수행한다. 예를 들어, 전처리부(130)는 스캔 만화책의 식별을 위해 회전 보정, 노이즈 제거, 휘도 조절, 페이지 분할, 크기 일반화 등의 과정을 거쳐 기하학적/화질적 저하가 일어난 스캔물에 대한 영상 처리를 수행할 수 있다.The preprocessing unit 130 performs a preprocessing operation to more accurately identify the digital contents to be processed. For example, the preprocessing unit 130 performs image processing on scans in which geometric / image degradation has occurred through processes such as rotation correction, noise removal, brightness adjustment, page division, and size generalization to identify scan comics can do.

제1특징 추출부(140)는 디지털 컨텐츠의 구조적 특징을 추출할 수 있다. 상기 디지털 컨텐츠의 구조적 특징은 컨텐츠 식별을 위한 Coarse level의 핑거프린트(fingerprint)가 될 수 있다. 디지털 컨텐츠의 구조적 특징은 상기 디지털 컨텐츠를 이루고 있는 프레임의 구조적 특징을 포함할 수 있다.The first feature extraction unit 140 may extract the structural features of the digital content. The structural characteristic of the digital content may be a coarse level fingerprint for content identification. The structural characteristic of the digital contents may include a structural characteristic of the frame constituting the digital contents.

제2특징 추출부(150)는 디지털 컨텐츠의 비구조적 특징을 추출할 수 있다. 상기 디지털 컨텐츠의 비구조적 특징은 컨텐츠 식별을 위한 Fine level의 핑거프린트(fingerprint)가 될 수 있다. 디지털 컨텐츠의 비구조적 특징은 상기 디지털 컨텐츠의 프레임 내에 존재하는 영상의 구체적인 특성을 포함할 수 있다.The second feature extraction unit 150 can extract the non-structural features of the digital content. The non-structural feature of the digital content may be a fine level fingerprint for content identification. The unstructured characteristic of the digital content may include specific characteristics of the image existing in the frame of the digital content.

비교부(160)는 상기 제1특징 추출부(140)에서 추출된 구조적 특징 및 상기 제2특징 추출부(150)에서 추출된 비구조적 특징과 상기 데이터베이스(120)에 저장된 기준 디지털 컨텐츠의 대한 구조적 특징 및 비구조적 특징을 비교하여 유사도를 검색할 수 있다.The comparing unit 160 compares the structural feature extracted by the first feature extracting unit 140 and the unstructured feature extracted from the second feature extracting unit 150 with the structural data of the reference digital content stored in the database 120 The similarity can be searched by comparing features and non-structural features.

후보 추출부(170)는 상기 제1특징 추출부(140)에서 추출된 구조적 특징을 이용하여 상기 데이터베이스(120)에 저장된 기준 디지털 컨텐츠 중 후보 컨텐츠를 추출할 수 있다. 예를 들어, 후보 추출부(170)는 제1특징 추출부(140)에서 추출된 구조적 특징과 특정 임계 범위 이내의 유사도를 가지는 기준 디지털 컨텐츠를 추출하여 후보 컨텐츠로 결정할 수 있다. 상기 후보 컨텐츠 추출은 디지털 컨텐츠 식별을 위한 계산량 감소를 위해 Coarse level의 핑거프린트 검사 후 비교 대상이 되는 기준 컨텐츠의 범위를 줄이기 위해 수행될 수 있다.The candidate extracting unit 170 may extract the candidate content among the reference digital contents stored in the database 120 using the structural features extracted from the first feature extracting unit 140. [ For example, the candidate extracting unit 170 may extract the reference digital content having the similarity within the specific threshold range with the structural feature extracted from the first feature extracting unit 140, and determine the candidate digital content as the candidate content. The candidate content extraction may be performed to reduce the range of the reference content to be compared after the fingerprint check of the coarse level in order to reduce the calculation amount for digital content identification.

제어부(180)는 상기 입력부(110), 상기 데이터베이스(120), 상기 전처리부(130), 상기 제1특징 추출부(140), 상기 제2특징 추출부(150), 상기 비교부(160) 및 상기 후보 추출부(170)에서 수행하는 기능을 유기적으로 제어할 수 있다.The control unit 180 may include the input unit 110, the database 120, the preprocessor 130, the first feature extraction unit 140, the second feature extraction unit 150, the comparison unit 160, And the functions performed by the candidate extractor 170 can be organically controlled.

도 2는 본 발명의 일실시예와 관련된 디지털 컨텐츠 식별 방법을 나타내는 흐름도이다.2 is a flowchart illustrating a digital content identification method in accordance with an embodiment of the present invention.

제어부(180)는 기준 디지털 컨텐츠의 구조적 특징 및 비구조적 특징이 저장된 데이터베이스(120)를 유지할 수 있다(S210). 디지털 컨텐의 구조적 특징이라 함은 디지털 컨텐츠를 이루고 있는 프레임의 구조적 특징을 포함할 수 있다. 예를 들어, 상기 구조적 특징은 상기 프레임의 직선 성분을 포함할 수 있다. 상기 직선 성분은 상기 프레임을 이루는 직선의 개수, 상기 프레임을 이루는 직선의 길이, 및 상기 프레임을 이루는 직선의 방향 등을 포함할 수 있다.The control unit 180 may maintain the database 120 in which structural and non-structural features of the reference digital content are stored (S210). The structural characteristic of a digital content may include a structural characteristic of a frame constituting digital contents. For example, the structural feature may comprise a linear component of the frame. The straight line component may include a number of straight lines forming the frame, a length of a straight line forming the frame, and a direction of a straight line forming the frame.

또한, 디지털 컨텐츠의 비구조적 특징은 상기 디지털 컨텐츠의 프레임 내에 존재하는 영상의 구체적인 특성을 포함할 수 있다. 예를 들어, 디지털 컨텐츠의 비구조적 특징은 상기 영상의 특징 포인트(예: 좌표 정보)와 상기 특징 포인트에 대한 특징 정보를 포함하는 기술자를 포함할 수 있다.In addition, the unstructured characteristic of the digital content may include specific characteristics of the image existing in the frame of the digital content. For example, the non-structural feature of the digital content may include a descriptor including feature points (e.g., coordinate information) of the image and feature information about the feature points.

이하 실시예에서는 스캔 만화책 구조에 기반한 특징을 이용한 핑거프린트를 coarse level의 핑거프린트로 정의하며, 이러한 만화책 구조와는 별개로 보다 정확하고 구체적인 영상 특성을 이용한 핑거프린트를 fine-level의 핑거프린트로 정의한다.In the following embodiments, a fingerprint using a feature based on a scan comic book structure is defined as a coarse level fingerprint, and a fingerprint using a more accurate and specific image characteristic is defined as a fine-level fingerprint separately from the comic book structure do.

전처리부(130)는 입력된 대상 디지털 컨텐츠의 식별을 용이하게 하기 위해 전처리 작업을 수행할 수 있다(S220). 상기 전처리부(130)는 스캔 만화의 식별을 위해 회전 보정, 노이즈 제거, 휘도 조절, 페이지 분할, 크기 일반화 등의 과정을 거쳐 기하학적/화질적 저하가 일어난 대상 디지털 컨텐츠(예: 불법 스캔물)에 대한 영상 처리를 수행할 수 있다.The preprocessing unit 130 may perform a preprocessing operation to facilitate identification of the inputted digital content (S220). The preprocessing unit 130 performs a process of identifying the scan comics in the target digital contents (for example, illegal scanned images) in which the geometric / image degradation has occurred through processes such as rotation correction, noise removal, luminance adjustment, page division, size generalization, It is possible to perform image processing on the image.

도 3은 도 2에 도시된 디지털 컨텐츠 식별 방법 중 전처리 과정의 일례를 나타내는 도면이다.3 is a diagram illustrating an example of a preprocessing process of the digital content identification method shown in FIG.

도시된 바와 같이, 만화책 스캔본의 경우에는 정형화 되어있지 않은 불법 스캔 작업으로 인해 기하학적/화질적 저하가 발생할 수 있다. 본 발명의 일실시예에 의한 상기 전처리 과정에서는 시계방향 혹은 반시계방향으로 회전한 영상을 기하학적으로 올바르게 바로잡는 영상 처리를 수행될 수 있다. 또한, 상기 전처리 과정은 스캔본의 경우, 그 특성상 노이즈 및 대비도의 저하가 발생하기 마련인데, 보다 수월한 특징 추출(즉, 핑거프린트의 추출)을 위한 준비작업으로서, 해당 노이즈를 제거함과 동시에 밝은 영역과 어두운 영역의 히스토그램을 바탕으로 대비도를 향상시키는 과정을 포함할 수 있다.As shown, in the case of a comic book scan, geometric / image degradation may occur due to illegal scanning operations that are not formalized. In the preprocessing process according to an exemplary embodiment of the present invention, image processing may be performed to geometrically correct an image rotated clockwise or counterclockwise. In the preprocessing process, the noise and the degree of contrast deteriorate due to the nature of the scan pattern. In preparation for the feature extraction (i.e., extraction of the fingerprint), the noise is removed, And a process of improving the contrast based on the histogram of the area and the dark area.

더불어 대상 디지털 컨텐츠(예: 불법 스캔본)의 경우에는 하나의 영상에 한 면의 만화책 페이지가 아닌 양면의 만화책 페이지가 포함되는 경우가 존재할 수 있다. 따라서 상기 전처리 과정에서는 핑거프린트 추출에서의 오류를 최소화 하기 위해 양면의 페이지가 존재할 경우 페이지 분리를 수행될 수 있다. 이렇게 분리된 페이지는 가로/세로의 크기를 맞추기 위한 크기 일반화 과정을 거치게 된다.In addition, in the case of the target digital contents (for example, illegal scanned images), there may be a case where a single image includes a comic book page on both sides rather than a comic book page on one side. Therefore, in the preprocessing process, in order to minimize errors in the fingerprint extraction, page separation can be performed when there are pages on both sides. The separated pages are subjected to a size generalizing process to adjust the size of the horizontal and vertical images.

본 발명의 일실시예에 의한 디지털 컨텐츠 식별 방법에서는 효과적인 컨텐츠 식별을 위해 특징 추출이 단계별로 진행될 수 있다.In the digital content identification method according to an embodiment of the present invention, feature extraction may be performed step by step for effective content identification.

먼저, 제1특징 추출부(140)는 프레임 구조로 이루어진 스캔 만화책의 구조적 특징을 추출할 수 있다(S230). First, the first feature extraction unit 140 may extract a structural feature of a scan comic book having a frame structure (S230).

Coarse level의 핑거프린트 추출을 위해 본 발명의 일실시예에서는 직선으로 이루어진 만화책의 프레임 구조를 활용할 수 있다. 만화책은 그 특성상 한 페이지 내에 여러 사각의 프레임으로 나뉘어 있으며, 각각의 프레임은 가로 및 세로의 직선으로 구성되어 있다. 또한 0과 90의 직선 성분만이 아닌 다양한 방향으로의 직선을 포함할 수 있다.In order to extract the fingerprint of the coarse level, the frame structure of a comic book made up of a straight line can be utilized in an embodiment of the present invention. Comic books are divided into several rectangular frames within a page, and each frame is composed of a horizontal and vertical straight line. It may also include straight lines in various directions rather than only linear components of 0 and 90.

다만 이러한 프레임을 사각으로 정확히 추출하기 위해서는 효과적인 전처리에도 불구하고 영상 처리의 한계가 존재하므로, 본 실시예에서는 프레임의 직선 성분을 coarse-level의 핑거프린트로 추출한다. However, in order to accurately extract such a frame by a square, there is a limitation of image processing in spite of effective preprocessing. Therefore, in this embodiment, a straight line component of a frame is extracted as a coarse-level fingerprint.

도 4는 도 2에 도시된 디지털 컨텐츠 식별 방법 중 디지털 컨텐츠의 구조적 특징을 추출하는 과정의 일례를 나타내는 도면이다.FIG. 4 is a diagram illustrating an example of a process of extracting structural features of digital contents in the digital content identification method shown in FIG. 2. Referring to FIG.

도 4(a)는 입력된 대상 스캔 만화책의 원본 영상을 나타낸다. 도 4(b)는 Image Segmentation을 나타낸다. 원본 영상에 대하여 segmentation을 수행하여 프레임과 배경영역을 구분할 수 있다. 이때의 segmentation 기법은 region growing, k-mean clustering 및 thresholding 기법이 사용될 수 있다.4 (a) shows the original image of the inputted target scan comic book. 4 (b) shows an image segmentation. Segmentation is performed on the original image to distinguish the frame from the background area. In this case, the segmentation technique can be used for region growing, k-mean clustering and thresholding.

Segment로 나뉘어진 프레임에 대해서는 edge detection이 수행될 수 있다(도 4(c)). 이를 통해 프레임의 edge 성분을 추출할 수 있으며, 해당 edge 정보로부터 보다 정확한 직선 성분 검출을 위해 morphological filtering(도 4(d))을 수행함으로써 edge 성분을 보다 향상시킬 수 있다. 해당 edge로부터의 직선성분 추출은 Hough transform을 통해 이루어질 수 있다. 이로부터 각 프레임에서의 corner와 직선 성분을 구해낼 수 있다(도4(e)).Edge detection can be performed on a frame divided into segments (Fig. 4 (c)). In this way, the edge component of the frame can be extracted, and the edge component can be further improved by performing morphological filtering (FIG. 4 (d)) for more accurate linear component detection from the edge information. Linear component extraction from the edge can be done by Hough transform. From this, corner and straight line components in each frame can be obtained (Fig. 4 (e)).

도 5는 도 4의 과정을 통해 추출된 직선 성분을 나타낸다. 상기 직선 성분은 프레임을 이루는 직선의 개수, 프레임을 이루는 직선의 길이, 프레임을 이루는 직선의 방향 등을 포함할 수 있다.FIG. 5 shows a linear component extracted through the process of FIG. The straight line component may include the number of straight lines forming the frame, the length of the straight line forming the frame, the direction of the straight line forming the frame, and the like.

도시된 바와 같이, 추출된 직선 성분은 각 방향에 따라 나뉘어진다. 만화책 프레임 구조의 특성상 직사각형이 아닌 사각형, 혹은 다른 형태의 프레임이 존재하므로 이 때의 방향은 전 방향이 고려될 수 있다.As shown, the extracted linear components are divided according to each direction. Due to the nature of the comic book frame structure, rectangles other than rectangles or other types of frames exist, so that the direction can be considered as a forward direction.

이러한 직선은 해당 방향에 따라 label이 부여되며, 이에 따른 핑거프린트는 다음과 같다. These straight lines are labeled according to the direction, and the fingerprints accordingly are as follows.

1. 해당 방향에서의 label 수 (직선의 개수-도 5의 경우 0도: 10개, 90도 : 10개)1. Number of labels in the corresponding direction (number of straight lines - 0 degrees in FIG. 5: 10, 90 degrees: 10)

2. 각 label에 해당하는 직선의 길이2. The length of the straight line corresponding to each label

상기 직선 성분은 대상 스캔 만화책과 저장된 기준 스캔 만화책과의 유사도 검색을 위한 coarse-level의 핑거프린트로서 이용될 수 있다.The linear component may be used as a coarse-level fingerprint for similarity search between the target scan comic book and the stored reference scan comic book.

추출된 핑거프린트 정보가 데이터베이스(120)에 저장된 기준 스캔 만화책의 핑거프린트 정보와 같을 경우, 상기 대상 스캔 만화책이 저장된 기준 스캔 만화책의 스캔본(또는 복사본)으로 인식될 수 있다. If the extracted fingerprint information is the same as the fingerprint information of the reference scan comic book stored in the database 120, the target scan comic book may be recognized as a scanned copy (or a copy) of the stored reference scan comic book.

한편, 본 발명의 일실시예에 의하면, 영상 처리 과정에서의 오차가 발생할 수 있으므로 각각의 핑거프린트에 대해서는 적절한 threshold를 두어 유사도 검색의 효율과 정확도를 높일 수 있다.According to an embodiment of the present invention, an error may occur in an image processing process, so that an appropriate threshold is set for each fingerprint, thereby increasing the efficiency and accuracy of the similarity search.

Coarse-level의 핑거프린트 추출 후, 후보 추출부(170)는 상기 유사도 검색의 결과를 이용하여 상기 데이터베이스(120)에 저장된 기준 만화책 중 후보 만화책을 추출할 수 있다(S240). 즉, 후보 추출부(170)는 추출된 직성 성분과 특정 임계 범위 이내의 유사도를 가지는 기준 만화책을 추출하여 후보 만화책(이하, '후보군'이라 함)으로 결정할 수 있다. After extracting the coarse-level fingerprint, the candidate extracting unit 170 may extract a candidate comic book stored in the database 120 using the result of the similarity search (S240). That is, the candidate extracting unit 170 extracts a reference comic book having similarity to the extracted linear component and within a predetermined threshold range, and can determine a candidate comic book (hereinafter referred to as a candidate comic book).

Coarse level의 핑거프린트는 만화책에서의 세부 모습들이 배제된 개략적인 정보를 포함하고 있다. 따라서 프레임 구조의 복잡도에 따라서 비슷한 프레임구조를 갖는 후보군들이 많이 검출될 수 있다. Coarse level fingerprints contain rough information that excludes details in comics. Therefore, candidate groups having similar frame structures can be detected according to the complexity of the frame structure.

이는 coarse level에서는 속도의 향상을 위해 각각의 핑거프린트들이 고유성이 비교적 부족하기 때문이다. 이런 경우에 여러 후보군 중에서 다시 정확한 매칭 후보를 추출해야 한다. This is because the fingerprints of each fingerprint are relatively insufficient to improve the speed at the coarse level. In this case, accurate matching candidates should be extracted again among the candidates.

이를 위해 Coarse level 이후에 Fine level의 핑거피린트 검출 및 매칭이 수행될 수 있다. 즉, 상기 Coarse-level의 핑거프린트 추출 및 후보 추출 과정이 이루어지면, 상기 제2특징 추출부(150)는 대상 스캔 만화책의 비구조적 특징을 추출하는 Fine-level의 핑거프린트 추출을 수행할 수 있다(S250). 즉, 본 실시예에서는 대상 스캔 만화책의 비구조적 특징이 Fine-level의 핑거프린트가 될 수 있다.For this, Fine level fingerprint detection and matching can be performed after the coarse level. That is, when the coarse-level fingerprint extraction and the candidate extraction process are performed, the second feature extraction unit 150 can perform fine-level fingerprint extraction for extracting non-structural features of the target scan comic book (S250). That is, in this embodiment, the unstructured characteristic of the target scan comic book can be a fine-level fingerprint.

Coarse level의 핑거프린트가 만화책 페이지의 구조적인 특징을 추출하였던 것에 비하여 Fine level의 핑거프린트는 비구조적인 특징을 추출한다. Fine level의 핑거프린트는 각각의 만화책 페이지의 더욱 고유한 특징들을 포함하고 있다. 만화책의 프레임 외에 그림 정보로부터 Fine level의 핑거프린트가 획득될 수 있다. 따라서 Fine level의 핑거프린트는 더욱 많은 비트수를 필요로 하며 검색 및 유사도 검사에 많은 시간이 소모된다. 본 실시예서는 전체적인 계산량의 감소를 위해 Coarse level의 검사 후 좁혀진 후보군에 대하여 Fine level의 검사를 수행한다.Fine level fingerprints extract unstructured features, whereas coarse level fingerprints extract structural features of comic book pages. Fine level fingerprints contain more unique features of each comic book page. A fine level fingerprint can be obtained from the picture information in addition to the frame of the comic book. Therefore, a fine level fingerprint requires a larger number of bits and a lot of time is spent in search and similarity check. In this embodiment, a fine level test is performed on candidates narrowed after the inspection of the coarse level to reduce the overall calculation amount.

Fine level의 핑거프린트는 특징 포인트와 기술자로 구성될 수 있다. 특징 포인트는 이미지에서 좌표 정보를 나타내며, 해당 특징 포인트의 고유한 특징을 담고 있는 벡터가 기술자이다. 실질적으로 유사도 조사를 할 때는 각 포인트의 기술자의 유사도가 검사된다. Fine level fingerprints can consist of feature points and descriptors. Feature points represent coordinate information in an image, and a vector is a descriptor that contains a unique feature of the feature point. When the similarity degree is practically examined, the similarity degree of the descriptor of each point is checked.

특징 포인트의 검출은 프레임 외에 그림 부분에서 코너 성분이 강한 곳으로 정한다. 수행 시간의 단축 및 정확도 보장을 위해 Coarse 핑거프린트에 의한 유사도 검색 결과 획득한 후보군의 숫자에 따라 특징 포인트의 개수가 결정될 수 있다. 즉, 후보군의 수가 적을수록 적은 숫자의 특징 포인트가 추출되고, 후보군의 수가 많을수록 많은 숫자의 특징 포인트가 추출될 수 있다. Feature Point detection is determined to be stronger in the corner part than the frame part. In order to shorten the execution time and ensure accuracy, the number of feature points can be determined according to the number of candidate groups obtained by the similarity search result by the coarse fingerprint. That is, as the number of candidate groups is smaller, a smaller number of characteristic points are extracted, and as the number of candidate groups is larger, a larger number of characteristic points can be extracted.

다음으로 제2특징 추출부(150)는 특징 포인트의 특징을 설명할 수 있는 기술자를 생성해낼 수 있다. 기술자는 특징 포인트가 포함된 프레임의 사이즈, 특징 포인트를 기준으로 소정 영역 내에 인접한 주변 픽셀의 분포 정보를 포함할 수 있다. 본 실시예에서는 특징 포인트의 좌표가 기술자의 주요한 정보로서 활용된다. 같은 만화책임을 판단하는 문제에 있어서 전혀 다른 위치의 특징점 매칭은 의미가 없기 때문이다. Next, the second feature extracting unit 150 can generate a descriptor that can explain the feature of the feature point. The descriptor may include the size of the frame including the feature point, and distribution information of neighboring pixels adjacent to the predetermined region based on the feature point. In this embodiment, the coordinates of the feature point are utilized as the main information of the descriptor. This is because there is no point in matching the feature points of completely different positions in the problem of judging the same comic responsibility.

비교부(160)는 대상 스캔 만화책의 특징 포인트 및 기술자와 데이터베이스(120)에 저장된 후보군에 대하여 특징 포인트 및 기술자를 비교함으로써, 유사도 검색을 수행할 수 있다(S260).The comparing unit 160 may perform the similarity search by comparing feature points and descriptors with respect to feature points and descriptors of the target scan comic book and candidates stored in the database 120 (S260).

그리고 제어부(180)는 최종적으로 대상 스캔 만화책(즉, 테스트 만화책)이 데이터베이스(120)에 저장된 만화책과 같은 것인지를 식별할 수 있다(S270). 즉, 제어부(180)는 대상 스캔 만화책에서 추출된 Fine level 핑커프린트와 동일 또는 유사한 핑거프린트가 데이터베이스(120)에 저장되어 있는지를 확인함으로써, 대상 스캔 만화책이 데이터베이스(120)에 저장된 기준 만화책의 스캔본(또는 복사본)인지 여부를 판단할 수 있다. Finally, the control unit 180 can identify whether the target scan comic book (i.e., test comic book) is the same as the comic book stored in the database 120 (S270). That is, the control unit 180 checks whether the fingerprint identical or similar to the fine level fingerprint extracted from the target scan comic book is stored in the database 120, so that the target scan comic is scanned with the reference comic book stored in the database 120 It is possible to judge whether it is a copy (or a copy).

Fine level의 핑거프린트는 스캔된 만화책 이미지의 contrast, sharpness, brightness 등에 영향을 받아 변형될 수 있다. 따라서 본 발명의 일실시예에 의하면, Coarse level의 핑거프린트를 통한 구조적 유사도 검색이 선행된 후에 fine level의 핑거프린트를 통한 비구조적 특징의 유사도 검색이 이루어짐으로써 더욱 정확하고 잡음에 강인한 유사도 검색이 수행될 수 있다.Fine level fingerprints can be altered due to the contrast, sharpness, and brightness of scanned comic book images. Therefore, according to the embodiment of the present invention, after the structural similarity search through the fingerprint of the coarse level is preceded, the similarity search of the non-structural feature is performed through the fine level fingerprint, so that more accurate and robustness similarity search is performed .

상술한 디지털 컨텐츠 식별 방법은 다양한 컴퓨터 수단을 통하여 수행될 수 있는 프로그램 명령 형태로 구현되어 컴퓨터로 판독 가능한 기록 매체에 기록될 수 있다. 이때, 컴퓨터로 판독 가능한 기록매체는 프로그램 명령, 데이터 파일, 데이터 구조 등을 단독으로 또는 조합하여 포함할 수 있다. 한편, 기록매체에 기록되는 프로그램 명령은 본 발명을 위하여 특별히 설계되고 구성된 것들이거나 컴퓨터 소프트웨어 당업자에게 공지되어 사용 가능한 것일 수도 있다.The above-described digital content identification method may be implemented in the form of a program command that can be executed through various computer means and recorded in a computer-readable recording medium. At this time, the computer-readable recording medium may include program commands, data files, data structures, and the like, alone or in combination. On the other hand, the program instructions recorded on the recording medium may be those specially designed and configured for the present invention or may be available to those skilled in the art of computer software.

컴퓨터로 판독 가능한 기록매체에는 하드 디스크, 플로피 디스크 및 자기 테이프와 같은 자기 매체(Magnetic Media), CD-ROM, DVD와 같은 광기록 매체(Optical Media), 플롭티컬 디스크(Floptical Disk)와 같은 자기-광 매체(Magneto-Optical Media), 및 롬(ROM), 램(RAM), 플래시 메모리 등과 같은 프로그램 명령을 저장하고 수행하도록 특별히 구성된 하드웨어 장치가 포함된다. The computer-readable recording medium includes a magnetic recording medium such as a magnetic medium such as a hard disk, a floppy disk and a magnetic tape, an optical medium such as a CD-ROM and a DVD, a magnetic disk such as a floppy disk, A magneto-optical media, and a hardware device specifically configured to store and execute program instructions such as ROM, RAM, flash memory, and the like.

한편, 이러한 기록매체는 프로그램 명령, 데이터 구조 등을 지정하는 신호를 전송하는 반송파를 포함하는 광 또는 금속선, 도파관 등의 전송 매체일 수도 있다.The recording medium may be a transmission medium, such as a light or metal line, a wave guide, or the like, including a carrier wave for transmitting a signal designating a program command, a data structure, and the like.

또한, 프로그램 명령에는 컴파일러에 의해 만들어지는 것과 같은 기계어 코드뿐만 아니라 인터프리터 등을 사용해서 컴퓨터에 의해서 실행될 수 있는 고급 언어 코드를 포함한다. 상술한 하드웨어 장치는 본 발명의 동작을 수행하기 위해 하나 이상의 소프트웨어 모듈로서 작동하도록 구성될 수 있으며, 그 역도 마찬가지이다.The program instructions also include machine language code, such as those generated by the compiler, as well as high-level language code that can be executed by a computer using an interpreter or the like. The hardware devices described above may be configured to operate as one or more software modules to perform the operations of the present invention, and vice versa.

상기와 같이 설명된 디지털 컨텐츠 식별 방법 및 장치는 상기 설명된 실시예들의 구성과 방법이 한정되게 적용될 수 있는 것이 아니라, 상기 실시예들은 다양한 변형이 이루어질 수 있도록 각 실시예들의 전부 또는 일부가 선택적으로 조합되어 구성될 수도 있다.The above-described digital contents identification method and apparatus are not limited to the configuration and method of the above-described embodiments, but the embodiments may be modified such that all or some of the embodiments are selectively And may be configured in combination.

100: 디지털 컨텐츠 식별 장치
110: 입력부
120: 데이터베이스
130: 전처리부
140: 제1특징 추출부
150: 제2특징 추출부
160: 비교부
170: 후보 추출부
180: 제어부100: digital content identification device
110: input unit
120: Database
130:
140: First feature extracting unit
150: second feature extracting unit
160:
170: candidate extracting unit
180:

Claims

Maintaining a database in which structural and non-structural features are stored in the reference digital content;
Extracting a structural feature including a linear component of a frame in an object digital content composed of a frame structure;
Extracting candidate contents to be compared with unstructured features of the reference digital contents stored in the database using the extracted structural features;
Determining a number of unstructured features based on the number of extracted candidate contents and extracting the unstructured features of the target digital content according to the determined number of unstructured features, - the more the number of contents, the more; And
Comparing the extracted non-structural features of the target digital content with non-structural features of the candidate content.

The method of claim 1,
A number of straight lines forming the frame, a length of a straight line forming the frame, and a direction of a straight line forming the frame.

2. The method of claim 1, wherein the non-
And a descriptor including feature points including specific coordinate information of an image existing in a frame of the target digital content and feature information about the feature point.

4. The method of claim 3, wherein the technician
Wherein the digital content identification information includes at least one of size information of a frame including the feature point and information about neighboring pixels within a predetermined region based on the feature point.

delete

The method of claim 1, wherein the reference digital content and the target digital content
And a scanned version of the offline publication scanned.

An input unit for receiving input of digital contents;
A database in which structural and non-structural features are stored in the reference digital content;
A first feature extracting unit for extracting a structural feature including a linear component of a frame in an object digital content composed of a frame structure;
A candidate extractor for extracting candidate contents to be compared with unstructured features of the reference digital contents stored in the database using the extracted structural features;
A second feature extraction unit for determining the number of non-structural features based on the number of extracted candidate contents and extracting non-structural features of the target digital content according to the determined number of non-structural features; The number of candidate contents increases as the number of candidate contents increases;
A comparing unit comparing non-structural features of the extracted target digital contents with non-structural features of the candidate contents; And
And a control unit for controlling the input unit, the database, the first feature extraction unit, the second feature extraction unit, and the comparison unit.

9. The method of claim 8,
A number of straight lines forming the frame, a length of a straight line forming the frame, and a direction of a straight line forming the frame.

9. The method of claim 8, wherein the non-
And a descriptor including feature points including specific coordinate information of an image existing in a frame of the target digital content and feature information about the feature point.

11. The method of claim 10, wherein the technician
A size information of a frame including the feature point, and information about neighboring pixels within a predetermined region based on the feature point.

delete

The method of claim 8, wherein the reference digital content and the target digital content
And a scanned version of the offline publication scanned.