KR101507709B1

KR101507709B1 - fingerprinting methods for providing additional information service related in broadcasting

Info

Publication number: KR101507709B1
Application number: KR20130145708A
Authority: KR
Inventors: 신동환; 김경순; 정신영; 김소원; 이재호; 최대훈; 이동준
Original assignee: (주)마크애니; 한국방송공사
Priority date: 2013-11-27
Filing date: 2013-11-27
Publication date: 2015-04-07

Abstract

The present invention relates to audio fingerprinting. According to the present invention, characteristic values of audio are stored in a DB in advance; later, the audio is recognized as contents by searching a relevant portion of the audio in the DB if the audio to be recognized has been stored; and a replay time point can be identified. Two or more characteristics values (DNAs) are used to enable real-time broadcasting recognition service and large-scale VOD recognition service to be provided. It is difficult to apply the contents recognition services stated earlier on a practical system with one DNA. However, when two or more DNAs are used, the contents recognition services with increased recognition performance can be more accurately provided.

Description

BACKGROUND OF THE INVENTION 1. Field of the Invention [0001] The present invention relates to a fingerprinting method for providing additional information related to a broadcast program,

본 발명은 오디오의 특징점을 사전에 데이터 베이스 (DB) 에 저장하고 나중에 인식하고자 하는 오디오가 이미 DB 에 저장된 경우에 해당 부분을 DB 검색에 의해서 콘텐츠를 인식하고 더불어 재생시점도 확인할 수 있는 오디오 핑거프린팅에 관한 발명이다.The present invention relates to an audio fingerprinting method in which audio feature points are stored in a database (DB) in advance and audio portions to be recognized later are already stored in the DB, the contents are recognized by DB search, .

오디오 핑거프린팅은 오디오의 특징점을 사전에 데이터 베이스 (DB) 에 저장하고 나중에 인식하고자 하는 오디오가 이미 DB 에 저장된 경우에 해당 부분을 DB 검색에 의해서 콘텐츠를 인식하고 더불어 재생시점도 확인할 수 있는 방법이다. 오디오콘텐츠의 특징점 (DNA) 을 추출하여 방송신호를 인식하는 방법은 많이 개발되어져 사용되고 있다. 여기서 특징점 (DNA) 라고 하는 것은 콘텐츠를 다른 콘텐츠와 구분할 수 있게 하는 특징값 (characteristic value) 을 말한다.Audio fingerprinting is a method of storing audio feature points in a database (DB) in advance and recognizing the contents by DB search if the audio to be recognized later is already stored in the DB . A method of extracting a feature point (DNA) of an audio content and recognizing a broadcast signal has been developed and used. Herein, the term " DNA " refers to a characteristic value that enables a content to be distinguished from other contents.

핑거프린팅 알고리즘은 콘텐츠의 고유한 특징점 (DNA) 을 이용하여 구현되는데 하나의 특징점을 이용했을 때 만족할만한 성능을 내지 못하는 경우가 많이 발생한다. 즉 임의의 A 라는 DNA 는 잡음환경에서 적은 양을 DNA 를 이용해서 빠르게 대략적으로 구별하는 특징이 있는 반면 정확하게 콘텐츠를 판별하기 어려운 특징이 있을 수 있다. 또 다른 임의의 B라는 DNA는 정확하게 콘텐츠를 판별할 수는 있으나 많은 양의 DNA 를 이용해야 하는 단점이 있는 DNA 가 있을 수 있다. 이런 DNA 들은 어느 하나만을 갖고 대용량의 VOD 서비스와 같은 응용분야에 적용하기 어려운 문제점이 있다.The fingerprinting algorithm is implemented using a unique feature point (DNA) of the content. When a feature point is used, a satisfactory performance is often not obtained. In other words, arbitrary DNA A may be characterized by a small amount of DNA in a noisy environment, which quickly distinguishes it roughly, but it is difficult to accurately discriminate the contents. Another arbitrary DNA, B, can discriminate the content accurately, but there may be a DNA having a disadvantage that a large amount of DNA must be used. There is a problem that it is difficult to apply such DNA to any application such as a large-capacity VOD service.

본 발명은 두 개 이상의 DNA 를 이용하여 실방송 인식 및 대용량 VOD 인식 서비스를 가능하게 하는 방법에 대한 것이다. 각각 하나의 DNA 로서는 실제 실용시스템에 적용하기 어렵지만 2 개이상의 DNA를 이용하여 콘텐츠 인식 서비스를 보다 정확하고 인식성능 향상이 가능하게 하고자 한다.The present invention relates to a method for enabling actual broadcast recognition and large capacity VOD recognition service using two or more DNAs. Although it is difficult to apply each DNA as a practical system, it is intended to make the content recognition service more accurate and improve the recognition performance by using two or more DNAs.

이하에서는 본 발명의 이해를 돕기 위한 기본적인 양태를 간단한 요약 형태로 제공한다. 이 요약은 모든 고려되는 양태들의 광범위한 개관이 아니며, 모든 양태들의 중요하거나 중대한 엘리먼트들을 식별하려는 것도, 임의의 또는 모든 양태들의 범위를 정확하게 서술하려는 것도 아니다. 이의 유일한 목적은 이후 제시되는 보다 상세한 설명에 대한 서두로서, 양태들의 일부 개념들을 간략한 형태로 제시하는 것이다.BRIEF DESCRIPTION OF THE DRAWINGS The above and other features and advantages of the present invention will become more apparent from the following detailed description when taken in conjunction with the accompanying drawings. This summary is not an extensive overview of all contemplated aspects, nor is it intended to identify key or critical elements of all aspects nor to precisely describe the scope of any or all aspects. Its sole purpose is to present some concepts of the aspects in a simplified form as a prelude to a more detailed description presented later.

상술한 바와 같은 문제점을 해결하기 위해, 대용량 VOD 서비스에서 2 개의 특징점을 이용하여 콘텐츠를 인식 시스템으로 콘텐츠를 인식하는 방법으로서, 입력 콘텐츠의 제 1 특징점을 데이터 베이스 내에서 검색하는 단계; 상기 제 1 특징점을 이용하여 후보군을 선택하는 단계로서, 상기 후보군은 소정의 기준을 통과한 상기 데이터 베이스 내의 후보 콘텐츠 ID 들을 포함하는, 상기 후보군을 선택하는 단계; 상기 후보군에 상기 후보 콘텐츠의 ID 들이 존재하는지를 결정하는 단계; 상기 후보군에 상기 후보 콘텐츠의 ID 들이 존재하지 않는 경우, 인식 실패 신호를 출력하는 단계; 상기 후보군에 상기 후보 콘텐츠의 ID 들이 존재하는 경우, 상기 입력 콘텐츠의 제 2 특징점을 상기 후보군에서 검색하는 단계; 상기 후보 콘텐츠 내에서 상기 제 2 특징점과 가장 유사한 후보 콘텐츠를 상기 후보군 내에서 선정하는 단계; 상기 가장 유사한 후보 콘텐츠에 대한 FPE (False Positive Error) 검사를 수행하는 단계; 상기 FPE 검사 결과가 실패인 경우 인식 오류 신호를 출력하는 단계; 및 상기 FPE 검사 결과가 성공인 경우 인식 완료 신호를 출력하는 단계를 포함하는, 대용량 VOD 서비스에서 2 개의 특징점을 이용하여 콘텐츠를 인식하는 방법을 이용한다.In order to solve the above problems, there is provided a method of recognizing a content with a content recognition system using two minutiae in a large capacity VOD service, comprising: searching a first minutiae point of the input content in a database; Selecting a candidate group using the first feature point, the candidate group including candidate content IDs in the database that have passed a predetermined criterion; Determining whether the candidate group has IDs of the candidate content; Outputting a recognition failure signal when IDs of the candidate contents do not exist in the candidate group; Searching candidate candidates for a second characteristic point of the input content if the candidates for the candidate content are present in the candidate group; Selecting a candidate content most similar to the second feature point in the candidate content within the candidate group; Performing a False Positive Error (FPE) check on the most similar candidate content; Outputting a recognition error signal if the FPE test result is failure; And outputting a recognition completion signal when the FPE test result is successful. In the large capacity VOD service, a method of recognizing a content using two minutiae points is used.

전술한 양태는 이후 충분히 설명될 것이고 특히 청구범위에 지적된 특성들을 포함할 것이다. 다음의 설명 및 첨부된 도면은 하나 이상의 양태들의 예시적인 양태들을 좀더 자세하게 개시한 것이다. 그러나, 이들 양태들은 여러 양태들의 원리가 이용될 수 있는 여러 방법들 중 일부만을 나타낸 것으로, 설명한 양태들은 모든 그러한 양태들 및 그들의 균등물을 포함시키는 것으로 이해해야할 것이다.The foregoing aspects will be fully described hereinafter and will include in particular the features pointed out in the claims. The following description and the annexed drawings set forth in detail certain illustrative aspects of one or more aspects. It is to be understood, however, that these embodiments represent only some of the various ways in which the principles of the various aspects may be utilized, and that the described aspects include all such aspects and their equivalents.

본 발명은 두 개 이상의 DNA 를 이용하여 콘텐츠 인식 서비스를 보다 정확하게 제공하고, 인식성능을 향상시킬 수 있는 효과가 있다. 또한 콘텐츠의 데이터 베이스 또한 2 개 이상의 군으로 분류하여 적은 검색 시스템 비용으로 효율적인 인식 성능을 구현할 수 있다.The present invention provides a content recognition service more accurately using two or more DNAs and improves recognition performance. Also, the database of contents can be categorized into two or more groups, and efficient recognition performance can be realized with a low search system cost.

도 1 은 본 발명의 2 개의 특징점 (DNA) 을 이용하여 콘텐츠를 인식하는 과정을 나타낸 일 실시예의 블록도이다.
도 2 는 대용량 콘텐츠를 인식하기 위한 인식 알고리즘의 과정을 나타낸 일 실시예를 설명한다.
도 3 은 실시간 방송 콘텐츠를 인식하는 흐름을 나타내는 일 실시예를 설명한다.
도 4 는 DNA 검색시스템의 전체적인 구조의 예를 나타낸 일 실시예를 설명한다.
도 5 는 도 4 의 인덱싱 시스템에 대한 보다 자세한 동작을 나타내는 일실시예의 블록도이다.1 is a block diagram of an embodiment showing a process of recognizing contents using two feature points (DNA) of the present invention.
FIG. 2 illustrates an embodiment of a recognition algorithm for recognizing a large-capacity content.
FIG. 3 illustrates an embodiment of a flow for recognizing real-time broadcast contents.
Fig. 4 illustrates an embodiment showing an example of the overall structure of a DNA search system.
5 is a block diagram of one embodiment illustrating more detailed operation of the indexing system of FIG.

본 발명은 다양한 변환을 가할 수 있고 여러 가지 실시 예를 가질 수 있는 바, 특정 실시 예들을 도면에 예시하고 바람직한 실시 예를 통해 상세하게 설명하고자 한다. 본 발명을 설명함에 있어서 관련된 공지 기술에 대한 구체적인 설명이 본 발명의 요지를 흐릴 수 있다고 판단되는 경우 그 상세한 설명을 생략한다.BRIEF DESCRIPTION OF THE DRAWINGS The present invention is capable of various modifications and various embodiments, and specific embodiments are illustrated in the drawings and described in detail by way of preferred embodiments. DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS Hereinafter, the present invention will be described in detail with reference to the accompanying drawings.

이하, 본 발명에 따른 바람직한 실시 형태를 첨부된 도면을 참조하여 상세하게 설명한다. 첨부된 도면과 함께 이하에 개시될 상세한 설명은 본 발명의 예시적인 실시형태를 설명하고자 하는 것이며, 본 발명이 실시될 수 있는 유일한 실시형태를 나타내고자 하는 것이 아니다. 이하의 상세한 설명은 본 발명의 완전한 이해를 제공하기 위해서 구체적 세부사항을 포함한다. 그러나, 당업자는 본 발명이 이러한 구체적 세부사항 없이도 실시될 수 있음을 안다.Hereinafter, preferred embodiments according to the present invention will be described in detail with reference to the accompanying drawings. DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS The following detailed description, together with the accompanying drawings, is intended to illustrate exemplary embodiments of the invention and is not intended to represent the only embodiments in which the invention may be practiced. The following detailed description includes specific details in order to provide a thorough understanding of the present invention. However, those skilled in the art will appreciate that the present invention may be practiced without these specific details.

본 발명은 실방송 및 대용량 VOD (Video On Demand)서비스를 할 때 시청자의 스마트 단말(스마트패드, 스마트폰 등)을 이용하여 부가정보 연동서비스를 하기 위해서 콘텐츠 특징점 (DNA) 을 이용하는 방법에 관한 것이다. 2 가지 특성이 있는 DNA 를 조합하여 원하는 부가정보 연동서비스를 수행한다.The present invention relates to a method of using content feature points (DNA) to perform a supplementary information interlocking service using a smart terminal (smart pad, smart phone, etc.) of a viewer in real broadcast and large capacity VOD (Video On Demand) . Combining the DNAs with two characteristics to perform the desired additional information interworking service.

본원 도 1 은 2 개의 DNA 를 이용하여 콘텐츠를 인식하는 과정을 나타내는 일 예시이다. 도면부호 101 은 제 1 DNA 를 이용하여 입력된 콘텐츠를 데이터 베이스 (DB) 검색을 수행한다. 도면부호 102 번은 후보군 선택을 하는 부분이다. 여기서는 유사도 측정을 하여 DB 에 저장된 DNA 와 검색에 사용된 검색 DNA 간의 유사도가 큰 순으로 정렬하여 가장 근사하다고 판단할 수 있는 콘텐츠 ID 순으로 정렬한다. 이렇게 유사하다고 판단되는 기준을 통과한 후보 콘텐츠 ID 를 후보군으로 선정한다. 여기서 판단되는 기준은 알고리즘마다 다를 수 있으나 보통 유사도 정도를 수치화하여 일정한 설정값을 넘는 경우 후보군으로 선정된다. 만일 기존 구축된 DB 에 존재하지 않는 DNA 가 검색되어 진다면 보통 후보군으로 선정된 콘텐츠 ID 는 없게 된다. 도면부호 103 은 후보군에 콘텐츠 ID 가 존재하는지 판단하는 기능을 수행한다. 도면부호 104 후보군에 후보가 존재하지 않는 경우로 콘텐츠 인식에 실패한 경우이다. 도면부호 105는 후보군에 후보 콘텐츠가 존재하는 경우 후보군에 선택된 후보 콘텐츠에 해당하는 제 2 DNA 를 검색하여 가장 유사한 콘텐츠를 선정한다. 보통 제 2 DNA 의 특징은 콘텐츠를 정밀하고 정확하게 유사도를 측정할 수 있으나 연산량이 많아서 바로 DNA 검색에 사용할 수 없는 특징을 갖는 DNA 가 사용된다. 도면부호 106 은 도면부호 105 에서 선정된 콘텐츠 후보에 대한 FPE (False Positive Error) 검사를 수행한다. FPE 는 잘못된 콘텐츠ID 를 인식하는 경우로서 발생하지 말아야 하는 경우이다. 만일 FPE 가 발생하면 응용 용도에 따라서 인식서비스에서 매우 안 좋은 결과를 만들 수 있다. 도면부호 107 은 FPE 검사 결과 오류가 있는지 판단하는 기능을 한다. 만일 오류인 경우에는 인식오류를 리턴 (108) 한다. 만일 FPE검사 결과가 오류가 아닌 경우에는 정상적인 인식완료 (109) 가 된 경우이다.1 is an example showing a process of recognizing contents using two DNAs. Reference numeral 101 denotes a database (DB) search for contents inputted using the first DNA. Reference numeral 102 denotes a candidate group. In this case, the degree of similarity is measured and arranged in descending order of the degree of similarity between the DNA stored in the DB and the search DNA used for the search, and arranged in the order of the content IDs which can be judged as the closest. The candidate content ID that has passed the criterion that is determined to be similar as the candidate group is selected as the candidate group. The criterion that is judged here may be different for each algorithm, but usually the degree of similarity is numerically expressed and selected as a candidate group if it exceeds a certain set value. If a DNA that does not exist in the established DB is searched, the content ID selected as the candidate is usually not available. Reference numeral 103 denotes a function for judging whether or not a content ID exists in a candidate group. The case where the candidates do not exist in the candidate group 104 and the content recognition fails. Reference numeral 105 denotes a search unit for searching for a second DNA corresponding to the candidate content selected in the candidate group when candidate contents exist in the candidate group, and selects the most similar content. The characteristic of the second DNA is that the DNA can be used to accurately and accurately measure the similarity of the content but can not be used for DNA search because of the large amount of computation. Reference numeral 106 denotes a FPE (False Positive Error) test for the selected content candidate. FPE is a case where it should not occur as a case of recognizing an invalid content ID. If FPE occurs, it can produce very poor results in recognition services depending on application usage. Reference numeral 107 denotes a function for judging whether there is an error as a result of the FPE inspection. If it is an error, a recognition error is returned (108). If the FPE test result is not an error, the normal recognition is completed (109).

본원 도 2 은 대용량 콘텐츠를 인식하기 위한 인식알고리즘의 과정의 예를 나타내는 일 예시이다. 도면부호 130 은 청취자의 스마트단말에서 오디오를 녹음해서 콘텐츠 인식을 위해서 검색서버에 보내준 쿼리 제 1 DNA 를 나타낸다. 도면부호 131 은 쿼리 제 1 DNA 데이터를 이용한 제 1 DNA DB 검색 과정을 나타낸다. 룩업테이블을 이용하여 콘텐츠마다 생성되어져 있는 DNA 파일에 히스토그램 테이블을 생성한다. 히스토그램에서 카운트하는 값은 쿼리 제 1 DNA 와 DNA 파일에 있는 DNA 값이 일치하는 개수이다. 도면부호 132 은 도면부호 131 에서 만들어진 히스토그램 테이블을 이용하여 개수가 많은 순서대로 배열하여 인식 콘텐츠 예상 후보들을 선정한다. 카운트 값이 가장 큰 콘텐츠가 가장 확률이 높은 콘텐츠 예상 후보가 된다. 도면부호 133 은 FPE (False Positive Error) 판별기로서 도면부호 132 에서 선정된 후보에 대한 최종 검증을 수행한다. 여기서 제 2 DNA를 사용하여 측정한 유사도값이 임의의 설정값 (Th) 이상이 되면 해당 콘텐츠 예상 후보를 인식된 콘텐츠로 판단하여 인식결과로서 전달한다. 만일 설정값을 넘지 못한다면 콘텐츠가 인식된 것이 없다고 판단한다. 유사도값을 측정하는 방법은 여러가지 디스턴스 (hamming distance, 유클리디언 디스턴스 등) 를 계산하는 방법이 있지만 여기서는 코릴레이션값을 사용한다. 아래 식은 코릴레이션 값을 구하는 일반적인 식이다. 여기서 x, y는 각각 비교하고자 하는 DNA 벡터값들이 된다.
2 is an example showing a process of a recognition algorithm for recognizing a large-capacity content. Reference numeral 130 denotes a query first DNA which is recorded in the smart terminal of the listener and sent to the search server for content recognition. Reference numeral 131 denotes a first DNA DB search process using query first DNA data. A histogram table is created in a DNA file generated for each content using a lookup table. The value counted in the histogram is the number of matches between the query first DNA and the DNA values in the DNA file. Reference numeral 132 denotes a list of candidate recognition content candidates arranged in descending order of numbers by using a histogram table created by the reference numeral 131. The content with the largest count value becomes the candidate with the highest probability. Reference numeral 133 denotes a FPE (False Positive Error) discriminator, which performs a final verification of the candidate selected at 132. Here, if the similarity value measured using the second DNA is equal to or larger than the predetermined set value Th, the content candidate is judged as the recognized content and transmitted as the recognition result. If the set value is not exceeded, it is determined that the content is not recognized. The method of measuring the similarity value is a method of calculating various distances (hamming distance, euclidean distance, etc.), but the correlation value is used here. The equation below is a general formula for calculating the correlation value. Where x and y are the DNA vector values to be compared, respectively.

---- 수식 1

----

본원 도 3 은 실시간 방송 콘텐츠를 인식하는 흐름을 나타내는 일 예시이다. 실시간 방송은 콘텐츠에 대한 DNA 를 미리 DB 화할 수 없기 때문에 실시간으로 DB 를 구성하고 이를 바로 사용해야 하는 제약사항이 있다. 도면부호 140 은 방송국의 실방송 시스템을 나타낸다. 온-에어 중인 영상의 음원을 이용하여 실시간 DNA 를 도면부호 141 에서 추출한다. 도면부호 142 는 실시간으로 추출된 DNA 를 DB 화하여 인식하는 시스템이다. 여기서는 Live DB 라고 한다. 도면부호 1421 은 추출된 DNA 를 약 2 분 분량을 축적한다. 여기의 시간 분량은 실방송 신호 지연시간을 극복할 수 있을 만큼 설정해야 한다. 그리고 항상 2분 분량의 DNA 를 유지하기 위해서 최신 DNA 1블록 데이터가 입력되면 가장 오래된 1개의 DNA 블록을 버린다. 코릴레이터 (Correlator) (1422) 는 도면부호 1421 의 2분분량의 DNA DB 에서 검색용 쿼리 DNA (147) 의 크기만큼 씩 가져와서 코릴레이션값을 구한다. 도면부호 1423 은 판단기로서 임의의 설정값 (Threshold Value) 을 Th 라고 했을 때 코릴레이션 값이 이 설정값보다 크면 정확히 인식된 것이라고 보고 해당 채널정보 및 방송타임옵셋 정보를 리턴한다. 만일 설정값보다 작으면 Live DB 검색 실패로 판정한다. 즉 검색용 DNA 값과 일치하는 방송신호가 없다고 판단한다. 도면부호 143은 시청자가 방송신호를 인식하기 위해서 스마트기기를 이용하여 오디오를 녹음한다. 도면부호 144 는 방송신호 검색을 위한 웹서비스 서버이다. 도면부호 145 는 검색요청을 받아서 매칭을 수행하는 역할을 하는 매칭 시스템이다. 도면부호 146 은 매칭 검색을 위해서 도면부호 143 으로부터 보내져온 Live DB 검색용 DNA 를 추출하는 기능을 한다. 도면부호 147 은 도면부호 146 에서 보내져온 검색용 DNA를 Live DB 에 보내 Live DB 검색을 실시한다.3 is an example showing a flow of recognizing real-time broadcast contents. Since real-time broadcasting can not DB-format the DNA for contents, there is a restriction to construct a DB in real time and use it immediately. Reference numeral 140 denotes an actual broadcasting system of the broadcasting station. Real-time DNA is extracted from the reference numeral 141 using the sound source of the image being on-air. Reference numeral 142 denotes a system for converting a DNA extracted in real time into a DB. Here, Live DB is called. Reference numeral 1421 accumulates about two minutes of the extracted DNA. The amount of time here must be set so as to overcome the actual broadcast signal delay time. And, to keep the DNA of 2 minutes at all times, when the latest DNA 1 block data is inputted, the oldest one DNA block is discarded. The correlator 1422 obtains the correlation value by taking the size of the query DNA 147 for retrieval from the DNA DB for two minutes of the reference numeral 1421. Reference numeral 1423 denotes a determiner, which determines that a correlation value is greater than the set value when the threshold value is Th and returns corresponding channel information and broadcast time offset information. If it is smaller than the set value, Live DB search is judged as failed. That is, there is no broadcast signal coinciding with the search DNA value. Reference numeral 143 denotes a device for recording audio using a smart device so that a viewer can recognize a broadcast signal. Reference numeral 144 denotes a web service server for searching broadcast signals. Reference numeral 145 denotes a matching system that plays a role of receiving a search request and performing matching. Reference numeral 146 denotes a function for extracting the Live DB search DNA sent from the reference numeral 143 for the matching search. Reference numeral 147 denotes a Live DB for retrieving the search DNA sent from the reference numeral 146 to the Live DB.

본원 도 4 는 DNA 검색시스템의 전체적인 구조의 예를 보여준다. 도면부호 150 은 일반 청취자가 스마트단말을 이용하여 콘텐츠인식 요청을 하는 부분이다. 도면부호 151 은 도면부호 150 에서 보내져온 DNA 검색요청을 수행하는 부분이다. 도면부호 1510 은 웹서버로서 입력된 검색쿼리 요청을 처리한다. 도면부호 1511 은 도면부호 1510 에서 접수된 쿼리요청 각각을 DNA DB 매칭을 통해서 검색명령을 수행한다. 도면부호 1512 는 전체적인 DNA DB 구성을 나타낸다. 전체 DNA DB 는 Preceding DNA DB (1533) 와 VOD용 DNA DB (1544)로 구성된다. Preceding DNA DB (1533) 는 빠른 검색을 수행할 수 있는 구조를 되어져 있어서 주로 라이브채널 인식용 DNA DB, 가장 많은 응용분야가 될 수 있는 CF인식용 DNA DB, 케이블TV 방송사에서 지상파 재 전송과 같은 콘텐츠를 빠른 시간에 인식하기 위한 Hot Content 인식용 DNA DB 로 이루어 졌다. VOD용 DNA DB (1544) 는 대용량의 콘텐츠를 인식하기 위해 구성된 DNA DB 로서 주로 VOD 서비스에서 주로 사용된다. 이처럼 2개의 DNA DB 군으로 분류한 것은 검색시스템이 적은 시스템 비용으로 효율적인 인식성능을 구현하기 위함이다. 먼저 인식쿼리가 입력되면 먼저 인식가능성이 가장 많은 Preceding DNA DB (1533) 를 먼저 검색하여 인식하고 만일 인식되지 않는다면 VOD용 DNA DB (1544) 를 검색하여 콘텐츠 인식을 위한 검색 서비스를 수행한다. 이러한 대용량 쿼리를 발생하는 경우는 주로 생방송 콘텐츠, 케이블TV가 지상파 콘텐츠를 재전송하는 경우, 혹은 CF 를 이용한 서비스가 활성화 된 경우에 CF 콘텐츠가 될 수 있다.4 shows an example of the overall structure of a DNA search system. Reference numeral 150 denotes a portion in which a general listener makes a content recognition request using a smart terminal. Reference numeral 151 denotes a portion for performing a DNA search request sent from the reference numeral 150. Reference numeral 1510 denotes a search query request input as a web server. Reference numeral 1511 denotes a search command through a DNA DB matching for each query request received at a reference numeral 1510. Reference numeral 1512 denotes the overall DNA DB structure. The entire DNA DB is composed of a Preceding DNA DB 1533 and a DNA DB for VOD 1544. The preceding DNA DB 1533 is structured to perform fast searching, and is mainly used for a live-channel-recognition DNA database, a CF-recognition DNA database that can be applied to the most applications, and a terrestrial re- And a DNA database for hot content recognition to recognize at a high speed. The DNA DB for VOD (1544) is a DNA database configured to recognize a large amount of contents, and is mainly used mainly in VOD service. The classification into two DNA DB groups is to realize efficient recognition performance with a low system cost for a search system. First, when a recognition query is inputted, the preceding DNA DB 1533 having the highest recognition possibility is first searched for and recognized. If not, the VOD DNA DB 1544 is searched to perform a search service for content recognition. In case of generating such a large capacity query, the CF content can be mainly used when the live broadcast contents, the cable TV are retransmitted terrestrial contents, or the service using CF is activated.

본원 도 5 은 도 4 에서 인덱싱시스템에 대한 보다 자세한 동작 흐름을 설명하는 일 예시이다. 인덱싱 시스템은 크게 2개 DNA DB 구조로 이루어 진다. 먼저 Preceding DNA DB (1533) 와 VOD용 DNA DB (1544) 로 구성된다. Preceding DNA DB (1533) 는 빠른 검색을 위해서 마련된 DB 로서 주로 Live 방송 콘텐츠를 인식하기 위한 DNA DB 와 검색용도가 빈번할 것으로 생각되는 CF DNA DB 및 Hot Content DNA DB 로 구성된다. Hot Content DB 는 지상파 방송국에서 방송한 콘텐츠 중에서 시청률이 높은 콘텐츠를 대상으로 먼저 검색하게 하기 위해서 Preceding DNA DB (1533) 에 위치하는 콘텐츠이다. Hot Content DB 선정 방법은 보통 최근 몇 주사이의 시청률이 높은 콘텐츠를 대상으로 할 수 있다. CF DNA DB 는 광고 콘텐츠에 대한 부가정보 제공기능을 수행하기 위해서 마련한 DB로서 짧은 광고시간에 보여주지 못한 상품의 정보나 바로 구매할 수 있는 수단으로 접근 등이 가능하게 할 수 있다. 이렇게 사용자들이 검색할 가능성이 높은 콘텐츠에 대해서 먼저 검색을 수행하고 이 검색을 통과하지 못한 쿼리에 대해서는 대용량 VOD 용 DNA DB (1544) 검색을 수행하여 VOD 콘텐츠를 검색하기 위한 검색과정으로 진행한다. 2 번의 검색에서 모두 검색실패하면 해당 쿼리는 인식불가능한 콘텐츠로서 미인식 결과를 리턴한다. 만일 인식한다면 콘텐츠 인식 결과를 리턴한다.5 is an illustration of a more detailed operation flow for the indexing system in FIG. The indexing system consists of two DNA DB structures. First, a Preceding DNA DB 1533 and a VOD DNA DB 1544 are constructed. The Preceding DNA DB 1533 is a DB provided for fast searching, and it mainly consists of a DNA DB for recognizing live broadcast contents and a CF DNA DB and a Hot Content DNA DB, which are thought to be frequently used for searching. The Hot Content DB is a content located in the Preceding DNA DB 1533 so that a content having a high viewership among the contents broadcast by a terrestrial broadcasting station can be searched first. Hot Content DB selection methods can be targeted to content that has a high audience rating in recent times. The CF DNA DB is provided to perform additional information providing function for the advertisement contents, and it can make access to the information of the goods which can not be shown in the short advertisement time or the means to purchase directly. In this way, the user first performs a search for a content likely to be searched by the user, and searches for a large-capacity VOD DNA DB (1544) for a query that does not pass the search, thereby proceeding to a search process for searching VOD contents. If all the retrieval fails in the second retrieval, the query returns unrecognized results as unrecognizable contents. If it recognizes, it returns the content recognition result.

개시된 기술에 관한 설명은 구조적 내지 기능적 설명을 위한 실시 예에 불과하므로, 개시된 기술의 권리범위는 본문에 설명된 실시 예에 의하여 제한되는 것으로 해석되어서는 아니 된다. 즉, 실시 예는 다양한 변경이 가능하고 여러 가지 형태를 가질 수 있으므로 개시된 기술의 권리범위는 기술적 사상을 실현할 수 있는 균등물들을 포함하는 것으로 이해되어야 한다.The description of the disclosed technique is merely an example for structural or functional explanation and the scope of the disclosed technology should not be construed as being limited by the embodiments described in the text. That is, the embodiments are to be construed as being variously embodied and having various forms, so that the scope of the disclosed technology should be understood to include equivalents capable of realizing technical ideas.

한편, 본 출원에서 서술되는 용어의 의미는 다음과 같이 이해되어야 할 것이다.Meanwhile, the meaning of the terms described in the present application should be understood as follows.

단수의 표현은 문맥상 명백하게 다르게 뜻하지 않는 한 복수의 표현을 포함하는 것으로 이해되어야 하고, "포함하다" 또는 "가지다" 등의 용어는 설시된 특징, 숫자, 단계, 동작, 구성요소, 부분품 또는 이들을 조합한 것이 존재함을 지정하려는 것이지, 하나 또는 그 이상의 다른 특징들이나 숫자, 단계, 동작, 구성요소, 부분품 또는 이들을 조합한 것들의 존재 또는 부가 가능성을 미리 배제하지 않는 것으로 이해되어야 한다.It should be understood that the singular " include "or" have "are to be construed as including a stated feature, number, step, operation, component, It is to be understood that the combination is intended to specify that it is present and not to preclude the presence or addition of one or more other features, numbers, steps, operations, components, parts or combinations thereof.

여기서 사용되는 모든 용어들은 다르게 정의되지 않는 한, 개시된 기술이 속하는 분야에서 통상의 지식을 가진 자에 의해 일반적으로 이해되는 것과 동일한 의미를 가진다. 일반적으로 사용되는 사전에 정의되어 있는 용어들은 관련 기술의 문맥상 가지는 의미와 일치하는 것으로 해석되어야 하며, 본 출원에서 명백하게 정의하지 않는 한 이상적이거나 과도하게 형식적인 의미를 지니는 것으로 해석될 수 없다.All terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which the disclosed technology belongs, unless otherwise defined. Commonly used predefined terms should be interpreted to be consistent with the meanings in the context of the related art and can not be interpreted as having ideal or overly formal meaning unless explicitly defined in the present application.

상기에서는 본 발명의 바람직한 실시 예를 참조하여 설명하였지만, 해당 기술 분야의 숙련된 당업자는 하기의 특허 청구의 범위에 기재된 본 발명의 사상 및 영역으로부터 벗어나지 않는 범위 내에서 본 발명을 다양하게 수정 및 변경시킬 수 있음을 이해할 수 있을 것이다.It will be apparent to those skilled in the art that various modifications and variations can be made in the present invention without departing from the spirit or scope of the present invention as defined by the following claims It can be understood that

Claims

A method of recognizing contents with a contents recognition system using two minutiae points in a large-capacity VOD (Video On Demand) service,
Retrieving a first feature point of the input content in a database;
Selecting a candidate group using the first feature point,
Wherein the candidate group includes candidate content IDs in the database that have passed a predetermined criterion;
Determining whether the candidate group has IDs of the candidate content;
Outputting a recognition failure signal when IDs of the candidate contents do not exist in the candidate group;
Searching candidate candidates for a second characteristic point of the input content if the candidates for the candidate content are present in the candidate group;
Selecting a candidate content most similar to the second feature point in the candidate content within the candidate group;
Performing a False Positive Error (FPE) check on the most similar candidate content;
Outputting a recognition error signal if the FPE test result is failure; And
And outputting a recognition completion signal if the FPE test result is successful,
Wherein the second feature point can accurately measure the degree of similarity with respect to the content than the first feature point, but the amount of computation is larger than the first feature point.

The method according to claim 1,
Wherein searching the first feature point of the input content in the database comprises:
Generating a histogram table in a minutia file created for each content in a database using a lookup table;
Counting the number of coincidence of the first feature point and the feature point in the feature point file and displaying the counted number in the histogram table.

The method according to claim 1,
Wherein the step of selecting a candidate group using the first feature point comprises:
Measuring a degree of similarity between the first feature point and a feature point of the content in the database;
Arranging IDs of contents in the database in order of the measured similarity; And
Further comprising selecting candidate IDs of candidate contents in the database satisfying the predetermined criterion in which the degree of similarity is equal to or larger than a predetermined set value.

delete

The method according to claim 1,
Wherein performing FPE checking for the most similar candidate content comprises:
Further comprising determining that the FPE check is successful if the similarity value to the most similar candidate content measured using the second feature point is greater than or equal to a predetermined set value.

6. The method of claim 5,
Wherein the similarity value measurement is performed by at least one of Hamming distance, Euclidean distance, and correlation.

The method according to claim 1,
The method of claim 1, wherein if the input content is a real-time broadcast content, a step of converting the minutiae point of the real-time broadcast content into a database before searching the first minutia point in a database,
Extracting characteristic points of real-time broadcasting contents in real time;
Accumulating feature points of the real-time broadcast contents for a predetermined time; And
And discarding old feature points among the feature points accumulated in the predetermined time period, accumulating new feature points, and maintaining the accumulated feature points at the predetermined amount of time.

8. The method of claim 7,
Wherein the predetermined amount of time is two minutes.

The method according to claim 1,
Wherein,
A preceding feature point database provided for fast searching, and
A feature point database for a VOD for a content that is less likely to be searched than a content in a pre-syndication feature point database,
Wherein the VOD feature point database is used when the recognition failure or the recognition error signal is output as a result of performing the method of recognizing the contents using the pre-syndication feature point database.

10. The method of claim 9,
The pre-syndication feature point database includes a feature point database for live channel recognition, a feature point database for advertisement (CF) recognition, and hot content recognition for quickly recognizing contents such as terrestrial re-transmission in a cable TV broadcast company And a feature point database.