KR20100001064A

KR20100001064A - Method and system for searching contents using image recognition at internet protocol televition

Info

Publication number: KR20100001064A
Application number: KR1020080060816A
Authority: KR
Inventors: 변우섭; 김문식
Original assignee: 주식회사 케이티
Priority date: 2008-06-26
Filing date: 2008-06-26
Publication date: 2010-01-06

Abstract

PURPOSE: A method and a system for searching contents using image recognition in an internet protocol television are provided to analyze image data obtained from a set top box to generate a search tag, thereby generating contents using the generated tag. CONSTITUTION: A data transceiver(110) receives image data from a set top box. An image processor(120) extracts an image feature included in the image data. A tag for contents search is generated through the extracted feature. A contents database(140) matches one or more tags with the contents. A contents searcher(130) searches contents related to the image data.

Description

METHOD AND SYSTEM FOR SEARCHING CONTENTS USING IMAGE RECOGNITION AT INTERNET PROTOCOL TELEVITION}

본 발명은 아이피티브이에서 이미지 인식을 이용하여 컨텐츠를 검색하는 시스템 및 방법에 관한 것으로서, 보다 상세하게는, 이미지를 획득하고, 획득한 이미지를 분석하여 컨텐츠를 검색하는 시스템 및 방법에 관한 것이다.The present invention relates to a system and method for searching for content using image recognition in iPi Yi, and more particularly, to a system and method for acquiring an image and analyzing the acquired image to search for content.

최근 아이피티브이(IPTV) 서비스가 널리 보급되고, 특히 현재 아이피티브이에서 서비스하고 있는 VOD(vedio on demend) 중심의 다운로드 앤드 플레이(download & play, DnP) 방식의 서비스뿐만 아니라, 앞으로 서비스될 것으로 기대되는 스트리밍 방식의 실시간 채널 방송 서비스가 가입자에게 제공될 경우, 아이피티브이 사용자의 수가 증가할 것으로 예상된다.Recently, the IPTV service is widely spread, and in particular, the VOD (vedio on demend) download and play (DnP) type service currently being provided by IPTV is expected to be serviced in the future. When a streaming real-time channel broadcasting service is provided to a subscriber, it is expected that the number of users increases.

종래의 아이피티브이 컨텐츠 검색 기술은 메뉴 기반 또는 키워드(keyword) 기반의 검색 기술을 사용한다. 메뉴 기반의 검색 기술의 경우, 사용자는 컨텐츠 검색의 편리성을 제공받지만, 단계적으로 컨텐츠 검색이 수행되어 상대적으로 많은 시간이 소요된다. 특히 검색 대상인 컨텐츠가 아이피티브이 메뉴 내에 존재하지 않는 경우, 사용자는 검색을 위해 많은 시간을 소비하고도 원하는 컨텐츠를 획득하지 못하게 된다.Conventional IP content search technology uses a menu-based or keyword-based search technology. In the case of the menu-based search technology, the user is provided with the convenience of content search, but the content search is performed step by step, which takes a relatively long time. In particular, when the content to be searched for does not exist in the menu, the user may spend a lot of time for searching and may not acquire the desired content.

또한, 키워드 기반의 검색 기술의 경우, 사용자는 검색하고자 하는 키워드를 입력하여 빠르고 정확하게 컨텐츠를 검색할 수 있다. 그러나, 사용자가 키워드를 인식하고 있지 못한 경우, 컨텐츠 검색이 용이하지 않다. 또한, 키워드 입력 인터페이스가 불편한 경우, 키워드를 이용한 검색이 용이하지 않다.In addition, in the case of a keyword-based search technology, a user may search for content quickly and accurately by inputting a keyword to be searched. However, if the user does not recognize the keyword, content search is not easy. In addition, when the keyword input interface is inconvenient, the search using the keyword is not easy.

이로 인해, 메뉴 기반 또는 키워드 기반의 검색 이외에, 이미지를 입력하고 입력된 이미지를 기초로 하여 검색을 수행하는 검색 기술이 요구된다.For this reason, in addition to a menu-based or keyword-based search, a search technique for inputting an image and performing a search based on the input image is required.

종래의 한국공개특허 제2006-0118167호는 이동 단말의 카메라로 획득된 이미지에 대한 이미지 패턴 검색을 통해 이미지 컨텐츠를 검색하고 검색한 이미지 컨텐츠들에 대한 세부 정보로의 접속을 유도하는 발명을 개시하고 있으며, 한국공개특허 2008-0034248호는 이동 단말의 카메라로 획득된 얼굴을 분석하여, 이동 단말에 저장된 사진을 검색하는 발명을 개시하고 있다.Korean Patent Laid-Open Publication No. 2006-0118167 discloses an invention that searches for image contents through image pattern search for an image acquired by a camera of a mobile terminal and induces access to detailed information on the retrieved image contents. In addition, Korean Patent Laid-Open No. 2008-0034248 discloses an invention of searching for a photo stored in a mobile terminal by analyzing a face acquired by a camera of the mobile terminal.

이러한 종래 기술은 이동 단말의 카메라로 획득된 이미지와 검색 대상인 이미지를 비교하여 컨텐츠를 검색하는 발명을 개시하고 있으나, 이미지와 이미지를 비교하는 경우, 키워드를 생성하지 않아 재검색이 어려운 문제점이 존재하였다.The prior art discloses a method of searching contents by comparing an image acquired by a camera of a mobile terminal with an image to be searched. However, when comparing an image and an image, there is a problem in that re-search is difficult because a keyword is not generated.

더불어, 검색을 위한 이미지 전송에 있어, 무선 자원을 이용하기 때문에 사용자에게 많은 비용을 부담시키면서도, 원활한 고속 데이터 서비스를 제공할 수 없 다는 문제점이 존재하였다.In addition, in transmitting an image for retrieval, there is a problem in that a high-speed data service can not be provided smoothly even though a user can be charged a lot because of using radio resources.

본 발명의 일 실시예는 이동 단말 또는 셋탑 박스에서 획득한 이미지를 분석하여 검색을 위한 태그를 생성하고, 생성된 태그와 검색 대상인 컨텐츠가 포함하는 태그를 비교하여 컨텐츠를 검색하고 제공하고자 한다.An embodiment of the present invention is to generate a tag for searching by analyzing an image obtained from a mobile terminal or a set-top box, and to search for and provide content by comparing the generated tag with a tag included in the search target content.

상술한 기술적 과제를 달성하기 위한 기술적 수단으로서, 본 발명의 제 1 측면은 (a) 제공되는 컨텐츠에 대해 이미지로부터 추출될 수 있는 적어도 하나의 태그를 매칭시켜 저장하는 단계, (b) 셋탑 박스(set-top box, STB)로부터 이미지 데이터를 수신하는 단계, (c) 상기 이미지 데이터를 분석하여 컨텐츠 검색용 태그를 생성하는 단계 및 (d) 상기 컨텐츠 검색용 태그와 상기 컨텐츠에 매칭된 태그를 비교하여 상기 이미지 데이터에 관련된 컨텐츠를 검색하는 단계를 포함하는 아이피티브이(IPTV)에서 이미지 인식을 이용한 컨텐츠 검색 방법을 제공할 수 있다.As a technical means for achieving the above technical problem, the first aspect of the present invention (a) matching and storing at least one tag that can be extracted from the image for the provided content, (b) a set-top box ( receiving image data from a set-top box (STB); (c) analyzing the image data to generate a content search tag; and (d) comparing the content search tag with a tag matching the content. The method may provide a content retrieval method using image recognition in an IPTV including searching for content related to the image data.

또한, 본 발명의 제 2 측면은 셋탑 박스로부터 이미지 데이터를 수신하는 데이터 송수신부, 상기 이미지 데이터에 포함된 이미지의 특징을 추출하여 컨텐츠 검색용 태그를 생성하는 이미지 처리부, 각각의 컨텐츠에 대해 이미지로부터 추출될 수 있는 적어도 하나의 태그를 상기 컨텐츠에 매칭시켜 저장하는 컨텐츠 데이터베이스 및 상기 컨텐츠 검색용 태그를 이용하여 상기 컨텐츠 데이터베이스에서 상기 이미지 데이터에 관련된 컨텐츠를 검색하는 컨텐츠 검색부를 포함하고, 상기 컨텐 츠 검색부는 상기 컨텐츠 검색용 태그와 상기 컨텐츠에 포함된 태그를 비교하여 상기 컨텐츠를 검색하는 것인 아이피티브이(IPTV)에서 이미지 인식을 이용한 컨텐츠 검색 시스템을 제공할 수 있다.In addition, a second aspect of the present invention is a data transceiver for receiving image data from the set-top box, an image processing unit for extracting the features of the image included in the image data to generate a content search tag, from the image for each content And a content search unit for searching for content related to the image data in the content database using the content database and the content search tag, and matching and storing at least one tag that can be extracted with the content. The unit may provide a content retrieval system using image recognition in IPTV, which searches for the content by comparing the content search tag with a tag included in the content.

또한, 본 발명의 제 3 측면은 검색용 이미지의 입력을 제공하는 이미지 입력부, 리턴 채널을 통해 아이피티브이(IPTV) 시스템으로 상기 이미지를 송신하는 이미지 송신부 및 상기 아이피티브이(IPTV) 시스템으로부터 검색 결과에 대응하는 컨텐츠 리스트를 수신하여 컨텐츠의 선택을 제공하는 검색결과 출력부를 포함하는 이미지 인식을 이용하여 컨텐츠 검색을 제공하는 셋탑 박스(set-top box, STB)를 제공할 수 있다.In addition, a third aspect of the present invention provides an image input unit for providing an input of a search image, an image transmitter for transmitting the image to an IPTV system through a return channel, and a search result from the IPTV system. A set-top box (STB) for providing content search may be provided using image recognition including a search result output unit for receiving a corresponding content list and providing a selection of content.

전술한 본 발명의 과제 해결 수단에 의하면, 셋탑 박스로부터 획득된 이미지 데이터를 분석하여 검색용 태그를 생성하고, 생성된 태그를 이용하여 컨텐츠를 검색할 수 있다.According to the above-described problem solving means of the present invention, it is possible to generate a search tag by analyzing the image data obtained from the set-top box, it is possible to search the content using the generated tag.

또한, 전술한 본 발명의 과제 해결 수단에 의하면, 이미지 검색시 키워드를 포함하는 검색용 태그를 함께 제공하여 검색용 태그와 컨텐츠가 포함하는 태그를 비교함으로써 검색 속도를 향상시킬 수 있고 빠른 재검색이 가능하며, 컨텐츠 검색 시스템의 부하를 감소시킬 수 있다.In addition, according to the above-described problem solving means of the present invention, by providing a search tag containing a keyword when searching for an image to compare the search tag and the tag included in the content can improve the search speed and fast re-search is possible In addition, the load of the content retrieval system can be reduced.

도 1은 본 발명의 일 실시예가 적용될 수 있는 아이피티브이(IPTV) 방송 시스템의 구성을 도시한 블록도이다.1 is a block diagram showing the configuration of an IPTV broadcasting system to which an embodiment of the present invention can be applied.

종래의 기술에 따른 아이피티브이(IPTV) 방송 시스템은 방송 사업자(1000), 헤드엔드 시스템(2000), 네트워크 망(3000) 및 사용자 단말기(4000)를 포함한다.The IPTV broadcasting system according to the related art includes a broadcaster 1000, a headend system 2000, a network network 3000, and a user terminal 4000.

또한, 헤드엔드 시스템(2000)은 베이스 밴드 시스템(2010), 압축 다중화 시스템(2020), 수신 제한 시스템(CAS: Conditional Access System)(2030), 백 오피스 시스템(2040), 모니터링 시스템(2050), 미디어 관리 시스템(Media Operation Core: MOC)(2060), 가입자 관리 시스템(2070), 데이터 방송 시스템(2080), EPG(Electronic Program Guide) 시스템(2090) 및 리턴 패스 서버 시스템(2100)을 포함한다.In addition, the headend system 2000 includes a baseband system 2010, a compression multiplexing system 2020, a conditional access system (CAS) 2030, a back office system 2040, a monitoring system 2050, A media management core (MOC) 2060, a subscriber management system 2070, a data broadcasting system 2080, an electronic program guide (EPG) system 2090, and a return path server system 2100.

방송 사업자(1000)는 방송 컨텐츠를 제작, 편집 및 변경하여 헤드엔드 시스템(2000)으로 제공하는 역할을 한다. 방송 사업자(1000)는 프로그램 공급자(PP), 지상파 또는 컨텐츠 제공자(CP)를 포함할 수 있다. 또한, 방송 사업자(1000)의 의하여 제공되는 방송 컨텐츠는 기존 방송 컨텐츠와 인터넷 상의 풍부한 컨텐츠를 포함할 수 있다.The broadcaster 1000 serves to produce, edit, and change broadcast content to provide to the headend system 2000. The broadcaster 1000 may include a program provider (PP), a terrestrial wave, or a content provider (CP). In addition, the broadcast content provided by the broadcaster 1000 may include existing broadcast content and rich content on the Internet.

헤드엔드 시스템(2000)은 방송 사업자(1000)로부터 방송 컨텐츠를 수신하여 관리하며, 사용자 단말기(4000)로 컨텐츠를 분배하여 방송/녹화/재생 서비스를 제공하는 역할을 한다. 상기 수신한 방송 컨텐츠에는 관련 부가 정보 및 이러한 부가 정보에 대한 EPG가 포함되어 있을 수 있다.The headend system 2000 receives and manages broadcast content from the broadcaster 1000 and distributes content to the user terminal 4000 to provide broadcast / recording / playback services. The received broadcast content may include related additional information and an EPG for such additional information.

헤드엔드 시스템(2000)은 멀티캐스트 라우팅 프로토콜을 지원하는 라우터를 경유하여 가입자 집선 장치, 가입자 스위치를 통해 방송 영상 및 음성 신호, 데이터 방송용 데이터 및 프로그램 추천 서비스 메뉴를 포함하는 EPG 정보(PSIP/PSI/SI 정보)를 멀티캐스팅으로 다수의 가입자의 IP 셋탑 박스로 전송할 수 있다.The head-end system 2000 includes EPG information (PSIP / PSI /) including a subscriber concentrator, a broadcast video and audio signal, data broadcasting data, and a program recommendation service menu through a router supporting a multicast routing protocol. SI information) can be transmitted to IP set-top boxes of multiple subscribers by multicasting.

베이스 밴드 시스템(2010)은 외부 프로그램 공급자(PP)로부터 MPEG2 방송 신호, 또는 지상파로부터 아날로그 방송 신호를 수신하고, 수신한 소스(source) 방송 신호를 SDI(Serial Digital Interface) 신호로 변환하고, 프레임(Frame)을 동기화하며, 루틴 스위처(Routine Switcher)를 통해 여러 방송 채널(예를 들어, 100 채널)의 방송 영상 및 음성 신호들을 분배하며, 자막 생성기(CG) 및 자동 프로그램 제어기(Automatic Program Controller: APC)에 의해 상기 방송 영상 및 음성 신호에 광고, 로고, 또는 자막 중 적어도 어느 하나를 삽입하여(신호 편집 및 가공) 상기 압축 다중화 시스템(2020)으로 전송한다.The baseband system 2010 receives an MPEG2 broadcast signal from an external program provider (PP) or an analog broadcast signal from terrestrial waves, converts the received source broadcast signal into a SDI (Serial Digital Interface) signal, and converts a frame ( Frames), and distributes video and audio signals from multiple broadcast channels (e.g. 100 channels) via a routine switcher, subtitle generator (CG) and automatic program controller (APC). At least one of an advertisement, a logo, or a subtitle is inserted into the broadcast video and audio signal (signal editing and processing) and transmitted to the compression multiplexing system 2020.

상기 SDI(Serial Digital Interface) 신호는, 예를 들어, 270Mbps의 전송률을 가진 디지털 신호 표준안으로서, 복합 디지털 영상과 4채널의 디지털 오디오 신호가 혼합되어 있을 수 있다.The SDI (Serial Digital Interface) signal is, for example, a digital signal standard having a transmission rate of 270 Mbps, and a composite digital video and four channels of digital audio signals may be mixed.

상기 베이스 밴드 시스템(2010)은 기본적으로 프로그램 공급자(PP), 지상파 등의 방송 신호를 각각 수신하는 수신 장치(예: DS-3 단국, 야기(Yagi) 안테나, IRD로 아날로그 방송 신호를 수신하는 튜너(Tuner)), 수신 장치에서 수신된 소스(Source) 신호를 SDI 신호로 변환 및 보정하고 프레임을 동기화하기 위한 프레임 동기화기(Frame Synchronizer), 운용 관리를 위해 모든 방송 신호 채널을 연결/집중화하는 A/V 라우터 등의 신호 분배기, 상기 SDI 신호에 광고, 로고, 자막을 삽입 하여 신호를 편집하고 가공하는 자막 생성기(character generator)를 포함할 수 있다.The baseband system 2010 is basically a tuner for receiving analog broadcast signals through a receiving device (eg, a DS-3 station, a Yagi antenna, and an IRD) for receiving broadcast signals such as a program provider (PP) and terrestrial waves, respectively. (Tuner)), a frame synchronizer for converting and correcting a source signal received from a receiving device into an SDI signal, synchronizing frames, and A connecting / centralizing all broadcast signal channels for operation management. A signal splitter such as a / V router and a subtitle generator for inserting an advertisement, a logo, and a subtitle into the SDI signal to edit and process the signal.

압축 다중화 시스템(2020)은 상기 베이스 밴드 시스템(2010)으로부터 수신된 방송 영상 및 음성 신호(Video, Audio)를 방송 채널 별(예를 들어, 100 채널)로 각각 A/V 인코더(A/V Encoder)로 입력하여 SDI(Serial Digital Interface) 영상 신호를 H.264로 압축하고, 음성 신호를 MPEG-2 AAC로 압축하여 MPEG-2 TS(Transport Stream)을 생성하고, 압축된 방송 영상 및 음성인 MPEG-2 TS 신호와 함께 데이터 인코더(data encoder) 및 PSI/SI 발생기(PSI/SI Generator)에 의해 생성된 데이터 방송용 데이터 및 EPG 정보(PSIP/PSI/SI 정보)를 다중화(Multiplexing)한 후, 다중화된 MPEG-2 TS 신호를 수신 제한 기술을 사용하는 경우 스크램블러(Scrambler)에 입력하여 암호화하고 최종적으로 IP 패킷화하여 IP 패킷화한 TS(Transport Stream) 방송 신호를 송출할 수 있다.The compression multiplexing system 2020 uses an A / V encoder (A / V Encoder) for broadcasting video and audio signals (Video, Audio) received from the baseband system 2010 for each broadcasting channel (eg, 100 channels). ) To compress the SDI (Serial Digital Interface) video signal to H.264, and to compress the audio signal to MPEG-2 AAC to generate MPEG-2 TS (Transport Stream), and to compress compressed broadcast video and audio -2 after multiplexing data broadcasting data and EPG information (PSIP / PSI / SI information) generated by a data encoder and a PSI / SI generator together with the TS signal When the received MPEG-2 TS signal is used in the reception restriction technique, the TS-2 may be inputted to a scrambler, encrypted, and finally IP packetized to transmit an IP packetized TS (Transport Stream) broadcast signal.

또한, 선택적으로, 프로그램 추천 컨텐츠에 대한 불법 시청과 불법 복제를 방지하기 위해 수신 제한 시스템(2030)을 사용할 수 있다.Also, optionally, the reception restriction system 2030 may be used to prevent illegal viewing and illegal copying of program recommended content.

수신 제한 시스템(2030)은 실시간 채널에 대한 암호화 및 VOD 컨텐츠의 사전 암호화를 수행하며 시청 권한을 제어함으로써 인증된 사용자에 한해 채널 및 컨텐츠를 이용할 수 있도록 하는 역할을 한다. 아이피티브이(IPTV) 컨텐츠의 불법 복제를 방지하기 위해 수신 제한 시스템(2030) 대신에 디지털 저작권 관리(DRM: Digital Rights Management) 방식을 사용할 수도 있다.The reception restriction system 2030 performs encryption of the real-time channel and pre-encryption of the VOD content, and controls viewing authority so that only the authenticated user can use the channel and the content. In order to prevent illegal copying of IPTV contents, a digital rights management (DRM) scheme may be used instead of the reception restriction system 2030.

백 오피스 시스템(2040)은 프로비저닝(Provisioning) 시스템으로서 가입자 별로 아이피티브이(IPTV) 프로그램 서비스 사용에 대한 과금 처리 기능을 제공한다.The back office system 2040 is a provisioning system and provides a billing processing function for use of an IPTV program service for each subscriber.

모니터링 시스템(2050)은 관제 시스템으로, 아이피티브이(IPTV) 방송을 위한 A/V 방송 신호의 송출 장애, 아이피티브이(IPTV) 헤드엔드 시스템의 다운 링크를 모니터링하여 수신 장애, 및 자막 확인 등을 모니터링할 수 있다.The monitoring system 2050 is a control system that monitors transmission failures of A / V broadcast signals for IPTV broadcasting, reception failures by monitoring downlinks of IPTV headend systems, and confirmation of subtitles and captions. can do.

미디어 관리 시스템(2060)은 방송 업무를 운영하기 위한 각종 비즈니스 프로세스 정보(프로그램 편성 정보, 소재 정보, 계약 정보, 상품 정보 등)를 관리하는 시스템이다. 미디어 관리 시스템(2060)은 방송 센터의 중앙에서 각 시스템들과 유기적인 결합을 통해 정보 흐름을 통합 관리한다.The media management system 2060 is a system that manages various business process information (program organization information, location information, contract information, product information, etc.) for operating a broadcasting business. The media management system 2060 integrates and manages the information flow through organic coupling with each system in the center of the broadcasting center.

상기 미디어 관리 시스템(2060)은 방송 프로그램 편성 정보, 컨텐츠 및 미디어 관리 정보, 프로그램 제공자(PP)와 컨텐츠 제공자(CP)의 계약 정보, 상품 정보를 관리하고, 방송 센터의 중앙에서 각 시스템들과의 유기적인 결합을 통해 정보 흐름을 통합 관리하는 중재자(Coordinator) 역할을 수행할 수 있다.The media management system 2060 manages broadcast program organization information, content and media management information, contract information of a program provider (PP) and a content provider (CP), and product information, and manages each system in the center of a broadcasting center. Through organic integration, it can act as a coordinator to manage and manage the flow of information.

또한, 상기 미디어 관리 시스템(2060)은 획득(Acquisition) 측면에서 계약 관리, 미디어 및 컨텐츠 메타데이터(meta data) 관리, 방송 스케줄 정보인 EPG 정보 획득/관리, 운영(operation) 측면에서 실시간 방송 및 VOD 채널편성 관리, 각 서브시스템과 연동을 에이전트(Agent) 관리, VOD 카탈로그 생성 관리 및 각종 상품 관리를 제공하며, 분석 측면에서 CP/CA와의 정산, 가입자 시청 성향 등의 마케팅 분석 리포팅, 송출(Delivery) 측면에서 방송 송출 모니터링, 비디오 서버 송출 관리 및 VOD 가입자 인증, CP/CA와의 정산을 위한 송출 결과 기록/관리, 연동된 각 서브시스템과의 데이터 동기화를 제공할 수 있다.In addition, the media management system 2060 includes contract management, media and content metadata management, EPG information acquisition / management as broadcast schedule information, and real time broadcasting and VOD in terms of operation. It provides channel formation management, agent management, interworking with each subsystem, VOD catalog creation management, and various product management.In terms of analysis, marketing analysis reporting, delivery such as settlement of CP / CA, subscriber viewing propensity, etc. In terms of broadcasting transmission monitoring, video server transmission management and VOD subscriber authentication, transmission result recording / management for settlement with CP / CA, and data synchronization with each subsystem connected.

가입자 관리 시스템(2070)은 아이피티브이(IPTV) 서비스를 위한 회원 가입 및 해지, 회원 정보 관리 기능을 제공한다.The subscriber management system 2070 provides a member subscription and termination for IPTV service and member information management.

데이터 방송 시스템(2080)은 상기 데이터 방송용 데이터의 저작 및 검증, 편성 및 송출한다.The data broadcasting system 2080 authors, verifies, organizes, and transmits the data broadcasting data.

상기 데이터 방송 시스템(2080)은 데이터 인코딩을 관리하기 위한 데이터 에이전트 관리자(Data Agent Manager), 프로그램 관련 정보(Program Specific Information)/서비스 정보(SI: Service Information)를 발생하기 위한 PSI/SI 생성기(PSI/SI Generator), 방송 영상 및 음성 신호에 데이터 방송용 데이터를 인코딩하기 위한 데이터 서버/데이터 인코더(Data Server/Data Encoder), 상기 방송 영상 및 음성 신호에 데이터의 멀티플렉싱 기능을 관리하기 위한 멀티플렉서 관리자(Multiplexer Manager), 및 스케줄러 사용자 인터페이스(Scheduler UI)를 포함할 수 있다.The data broadcasting system 2080 includes a data agent manager (PSI) for managing data encoding, a PSI / SI generator (PSI) for generating program specific information / service information (SI). / SI Generator), Data Server / Data Encoder for encoding data broadcasting data into broadcast video and audio signals, and Multiplexer Manager for managing multiplexing functions of data into the broadcast video and audio signals Manager) and a scheduler user interface (Scheduler UI).

또한, 상기 데이터 방송 시스템(2080)은 지상파 ACAP(Application Configuration Access Protocol) 데이터 방송 표준에 따라 A/V 서버(A/V Server)로부터 제공된 A/V 데이터를 A/V 인코더(A/V Encoder)에 의해 방송 영상 및 음성 신호로 압축하고, 압축된 영상 및 음성 신호를 저작 도구(Authoring Tool)에 의해 애플리케이션(Application)으로부터 제공된 데이터를 데이터 서버/데이터 인코더 및 PSI/SI(Program Specific Information/Service Information) 발생기에 의해 생성된 데이터 방송용 데이터 및 EPG 정보(PSIP/SI 정보)와 함께 멀티플렉서(Multiplexer) 에 의해 멀티플렉싱되어 데이터 방송 프로그램의 수집, 저장에서부터 방송 프로그램 데이터 및 관련 정보의 부호화 및 송출을 할 수 있다.In addition, the data broadcasting system 2080 may use the A / V encoder to provide the A / V data provided from the A / V server according to the terrestrial Application Configuration Access Protocol (ACAP) data broadcasting standard. Compresses the broadcast video and audio signals by using a data server / data encoder and PSI / SI (Program Specific Information / Service Information), and compresses the compressed video and audio signals from an application by the authoring tool. ) Is multiplexed by a multiplexer together with data broadcasting data and EPG information (PSIP / SI information) generated by the generator to encode and transmit broadcast program data and related information from the collection and storage of data broadcasting programs. .

EPG 시스템(2090)은 EPG 서버를 포함하고, 사용자 단말기(4000)로 전자프로그램 가이드(EPG) 서비스를 제공한다.The EPG system 2090 includes an EPG server and provides an electronic program guide (EPG) service to the user terminal 4000.

리턴 패스 서버 시스템(2100)은 데이터 제공자(DP: Data Provider)에 의해 양방향 데이터를 처리하며, 사용자 단말기(4000)로부터 온라인 청구서 전달, 양방향 데이터의 이용 내역/과금 연동 처리를 제공하고, 개인화 인증 처리, 및 프로그램 추천 서비스를 위한 양방향 데이터를 수신하여 이에 대응하는 응답 데이터를 사용자 단말기(4000)로 유니캐스팅으로 전송할 수 있다.The return path server system 2100 processes bidirectional data by a data provider (DP), provides online bill transfer from the user terminal 4000, usage history / billing interworking processing of bidirectional data, and personalization authentication processing. , And receive bidirectional data for the program recommendation service and transmit the corresponding response data to the user terminal 4000 in unicasting.

네트워크 망(3000)은 헤드엔드 시스템(2000)으로부터 방송 컨텐츠를 수신하여 사용자 단말기(4000)에게 상기 수신한 방송 컨텐츠를 전달하는 역할을 한다. 네트워크 망(3000)은 백본(Backbone)망 및 액서스(Access)망을 포함하며, 상기 액서스망은 이더넷(Ethernet), xDSL(ADSL, VDSL), HFC(Hybrid Fiber Coaxial Ca), FTTC(Fiber To The Curb), FTTH(Fiber To The Home) 구조 중 어느 하나의 토폴로지로 구성될 수 있다.The network 3000 receives the broadcast content from the headend system 2000 and delivers the received broadcast content to the user terminal 4000. The network network 3000 includes a backbone network and an access network, and the access network includes Ethernet, xDSL (ADSL, VDSL), Hybrid Fiber Coaxial Ca (HFC), and Fiber To The Curb), and may be configured in any one topology of a fiber to the home (FTTH) structure.

사용자 단말기(4000)는 인터넷 방송 서비스를 이용하기 위한 장치로서, 일반적으로는 아이피티브이(IPTV), 셋탑 박스(STB) 및 리모콘을 포함한다. 아이피티브이(IPTV)는 헤드엔드 시스템(2000)으로부터 수신한 방송 컨텐츠를 출력하고, 리모콘을 통하여 입력받은 사용자 응답 신호를 셋탑 박스의 리턴 채널을 통하여 헤드엔드 시스템(2000)으로 전달한다.The user terminal 4000 is an apparatus for using an Internet broadcasting service, and generally includes an IPTV, a set top box, and a remote controller. IPTV outputs the broadcast content received from the headend system 2000 and transmits a user response signal received through the remote controller to the headend system 2000 through the return channel of the set top box.

사용자 단말기는 IP STB가 내장된 TV, 또는 사용자의 TV와 연결된 IP 셋탑 박스(IP STB), 컴퓨터, 노트북, 또는 개인 휴대용 단말기 중 어느 하나의 단말을 사용할 수 있다.The user terminal may use any one of a TV with an IP STB or an IP set-top box (IP STB), a computer, a notebook, or a personal portable terminal connected to the user's TV.

상기 IP 셋탑 박스는 CPU, 미디어 프로세서, 플래시 램, 이더넷 모듈 등의 STB 하드웨어로 구성된 하드웨어 계층, 디바이스 드라이버와 운영체제 등의 시스템 소프트웨어를 포함하는 시스템 소프트웨어 계층, 자바 가상 머신(Java Virtual Machine: JVM), 수신 제한 시스템(Conditional Access System: CAS) 모듈 및 디지털 저작권 관리(Digital Rights Management: DRM) 인터페이스 모듈, 스트리밍 프로토콜(RTP, RTSP)을 탑재한 미들웨어 계층, MPEG2, MPEG4, MPEG7, H.264, WMV-9 등의 다양한 멀티 코덱(Multi CODEC)을 지원하며 아이피티브이(IPTV) 서비스 채널을 선택하기 위한 전자프로그램 가이드(Electronic Program Guide: EPG)를 포함하는 애플리케이션 계층의 4계층을 포함할 수 있다. 이때, 가입자의 IP 셋탑 박스는 IPv4 주소 또는 IPv6 주소가 할당될 수 있다.The IP set-top box includes a hardware layer composed of STB hardware such as a CPU, a media processor, flash RAM, and an Ethernet module, a system software layer including a system driver such as a device driver and an operating system, a Java Virtual Machine (JVM), Middleware layer with Conditional Access System (CAS) module and Digital Rights Management (DRM) interface module, streaming protocol (RTP, RTSP), MPEG2, MPEG4, MPEG7, H.264, WMV- It may include four layers of an application layer that supports various multi codecs such as 9 and includes an electronic program guide (EPG) for selecting an IPTV service channel. In this case, the subscriber's IP set-top box may be assigned an IPv4 address or an IPv6 address.

도 2는 본 발명의 일 실시예에 따른 아이피티브이에서 이미지 인식을 이용한 컨텐츠 검색 시스템의 구성을 도시한 블록도이다.2 is a block diagram illustrating a configuration of a content retrieval system using image recognition in an IP according to an embodiment of the present invention.

본 발명의 일 실시예에 따른 아이피티브이에서 이미지 인식을 이용한 컨텐츠 검색 시스템은 이동 단말(10), 셋탑 박스(set-top box, STB)(20) 및 검색 서버(100)를 포함한다.A content retrieval system using image recognition in an IP according to an embodiment of the present invention includes a mobile terminal 10, a set-top box (STB) 20, and a search server 100.

이동 단말(10)은 장착된 카메라를 이용하여 이미지 데이터를 생성할 수 있으며, 셋탑 박스(20)로 제어 명령 및 생성된 이미지 데이터를 전송할 수 있다. 이동 단말(10)은 블루투스(Bluetooth), 적외선 통신 등의 근거리 무선 통신 등의 원거리 통신을 이용하여 셋탑 박스(20)와의 데이터 송수신을 수행할 수 있다. 따라서, 이동 단말(10)은 디지털 카메라를 장착한 셀룰러 폰, 리모컨 등 이미지 촬영이 가능하고 근거리 무선 통신을 수행할 수 있는 단말을 포함한다.The mobile terminal 10 may generate image data using a mounted camera, and transmit a control command and the generated image data to the set-top box 20. The mobile terminal 10 may perform data transmission / reception with the set top box 20 using long distance communication such as short range wireless communication such as Bluetooth and infrared communication. Therefore, the mobile terminal 10 includes a terminal capable of capturing images such as a cellular phone and a remote controller equipped with a digital camera and performing short-range wireless communication.

또한, 이동 단말(10)은 무선 통신을 이용하여 인터넷 등으로부터 이미지 데이터를 다운로드 받아 저장한 후, 저장된 이미지 데이터를 셋탑 박스(20)로 전송할 수 있다. 이처럼 미리 저장된 이미지 데이터를 이용하는 경우, 컨텐츠 검색 시간을 단축시킬 수 있다.In addition, the mobile terminal 10 may download and store image data from the Internet through wireless communication, and then transmit the stored image data to the set-top box 20. When using the pre-stored image data in this way, it is possible to shorten the content search time.

이동 단말(10)은 무선 통신뿐만 아니라 유선 통신을 통하여 이미지 데이터를 셋탑 박스(20)로 전송할 수 있다. 예를 들어, 이동 단말(10)은 유에스비(universal serial bus, USB) 통신 인터페이스를 이용하여, 셋탑 박스(20)와 유선으로 연결되어 미리 저장된 이미지 데이터를 셋탑 박스로 전송할 수 있다.The mobile terminal 10 may transmit image data to the set-top box 20 through wired communication as well as wireless communication. For example, the mobile terminal 10 may be connected to the set-top box 20 in a wired manner by using a universal serial bus (USB) communication interface to transmit pre-stored image data to the set-top box.

셋탑 박스(set-top box, STB)(20)는 유선 또는 무선 통신을 이용하여 이동 단말(10)로부터 이미지 데이터를 획득하거나, 또는 화면 캡처 등을 이용하여 자체적으로 이미지 데이터를 획득하고, 획득한 이미지 데이터를 IP(internet protocol) 네트워크를 통하여 검색 서버(100)로 전송한다.The set-top box (STB) 20 acquires image data from the mobile terminal 10 by using wired or wireless communication, or acquires image data by itself using screen capture, and the like. The image data is transmitted to the search server 100 through an IP (internet protocol) network.

또한, 셋탑 박스(20)는 검색 서버(100)로부터 관련 컨텐츠를 수신하여, 텔레비전 등의 가입자 단말(도시 생략)로 제공한다.In addition, the set-top box 20 receives related content from the search server 100 and provides the content to a subscriber terminal (not shown) such as a television.

검색 서버(100)는 IP 네트워크를 통해 셋탑 박스(20)로부터 이미지 데이터를 수신하고, 수신한 이미지로부터 추출된 태그를 이용하여 관련 컨텐츠를 검색한다. 검색 서버(100)는 검색한 관련 컨텐츠를 셋탑 박스(20)로 전송한다.The search server 100 receives image data from the set-top box 20 through an IP network, and searches for related content using a tag extracted from the received image. The search server 100 transmits the searched related content to the set top box 20.

도 3은 본 발명의 일 실시예에 따른 아이피티브이에서 이미지 인식을 이용한 컨텐츠 검색 방법의 흐름을 도시한 신호 흐름도이다.3 is a signal flow diagram illustrating a flow of a content retrieval method using image recognition in an IP according to an embodiment of the present invention.

단계(S110)에서, 이동 단말(10) 또는 셋탑 박스(20)는 이미지를 획득하여 이미지 데이터를 생성한다. 이동 단말(10)은 내장된 디지털 카메라를 이용하여 이미지를 획득하거나, 유선 또는 무선 통신을 이용하여 이미지를 획득할 수 있다.In step S110, the mobile terminal 10 or the set top box 20 acquires an image and generates image data. The mobile terminal 10 may acquire an image using a built-in digital camera, or may acquire an image using wired or wireless communication.

또한, 셋탑 박스(20)는 화면 캡처 등을 통해 이미지를 획득하거나, 유선 또는 근거리 무선 통신을 이용하여 이미지를 획득할 수 있다.In addition, the set-top box 20 may acquire an image through screen capture or the like, or may acquire an image through wired or short-range wireless communication.

단계(S120)에서, 이동 단말(10)은 생성한 이미지 데이터를 셋탑 박스(20)로 전송한다. 이동 단말(10)은 근거리 무선 통신을 이용하거나, 유선 통신을 이용하여 이미지 데이터를 셋탑 박스(20)로 전송할 수 있다.In step S120, the mobile terminal 10 transmits the generated image data to the set top box 20. The mobile terminal 10 may transmit image data to the set-top box 20 using short-range wireless communication or wired communication.

단계(S130)에서, 셋탑 박스(20)는 단계(S110)에서 생성한 이미지 데이터 또는 단계(S120)에서 수신한 이미지 데이터를 컨텐츠 서버(100)로 전송한다. 셋탑 박스(20)는 아이피(internet protocol, IP) 네트워크를 통하여 이미지 데이터를 컨텐츠 서버(100)로 전송할 수 있다.In operation S130, the set-top box 20 transmits the image data generated in operation S110 or the image data received in operation S120 to the content server 100. The set top box 20 may transmit image data to the content server 100 through an internet protocol (IP) network.

단계(S140)에서, 컨텐츠 서버(100)는 단계(S130)에서 수신한 이미지 데이터를 분석하여 특징을 추출하고 인식 파라미터를 생성하여, 컨텐츠 검색을 위한 태그를 생성한다. 이러한 컨텐츠 검색용 태그는 인식 파라미터를 이용하여 생성되며, 예를 들어 배우 이름, 건축물의 명칭 등을 포함할 수 있다. 컨텐츠 검색용 태그는 이미지 패턴의 분석, 얼굴 인식, 미리 저장된 얼굴 이미지와의 비교를 통해 생성될 수 있다.In operation S140, the content server 100 analyzes the image data received in operation S130, extracts a feature, generates a recognition parameter, and generates a tag for content search. The content search tag is generated using a recognition parameter and may include, for example, an actor's name, a building's name, and the like. The content search tag may be generated through analysis of an image pattern, face recognition, and comparison with a pre-stored face image.

이미지 데이터 분석 중 인물에 대한 분석은 이미지 데이터에 포함된 얼굴을 검출하고 인식하는 알고리즘에 의해 수행될 수 있다.Analysis of the person during image data analysis may be performed by an algorithm for detecting and recognizing a face included in the image data.

얼굴 검출은 입력된 이미지 데이터에 존재하는 얼굴의 영역을 추출하고, 추출된 얼굴에 존재하는 눈의 위치를 찾음으로써 수행될 수 있다. 얼굴 검출은 얼굴 인식을 위한 전처리 과정으로 볼 수 있으며, 또한 그 자체만으로도 유용한 어플리케이션이 될 수 있다.Face detection may be performed by extracting an area of a face existing in the input image data and finding a location of an eye present in the extracted face. Face detection can be seen as a preprocessing process for face recognition and can be a useful application on its own.

얼굴 검출은 얼굴의 질감 특성을 이용하여 얼굴의 영역을 추출하고, 얼굴의 피부색(skin color) 정보를 이용하여 잘못 검출된 영역(false-positive)을 제거하는 과정을 통해 수행될 수 있다.The face detection may be performed by extracting an area of the face by using the texture characteristic of the face and removing a false-positive area by using skin color information of the face.

얼굴의 질감 특성을 이용한 얼굴 영역 추출은 객체 검출 알고리즘을 이용하여 수행될 수 있으며, 얼굴의 피부색 정보는 RGB 색 공간에서 정의되어, 얼굴 영역에 존재하는 피부색 픽셀들의 비율에 따라 잘못 검출된 영역이 제거될 수 있다.The face region extraction using the texture characteristics of the face may be performed using an object detection algorithm, and the skin color information of the face is defined in the RGB color space, so that an incorrectly detected region is removed according to the ratio of skin color pixels present in the face region. Can be.

일단 얼굴 영역이 검출되면, 얼굴 영역에 존재하는 눈 위치를 찾기 위하여 아이맵(eyemap)이 결정된다. 아이맵은 눈의 밝기(luma) 및 색상(chorma) 정보를 이용하여 결정될 수 있다. 결정된 아이맵은 적응적 임계치(adaptive threshold)를 이용하여 이진 영상으로 만들어진다.Once the face area is detected, an eyemap is determined to find the eye position present in the face area. The eye map may be determined using luma and color information of the eye. The determined eye map is made into a binary image using an adaptive threshold.

만들어진 이진 영상으로부터 눈에 해당되는 후보 영역들이 분류되고, 좌/우 영역에서 가장 높은 평균 아이맵 값을 가지는 후보 영역이 최종 눈의 영역으로 간주된다. 이렇게 좌/우 눈의 영역이 결정되면, 각 영역의 중심점이 눈의 위치로 설 정된다.Candidate regions corresponding to the eyes are classified from the created binary image, and the candidate region having the highest average eyemap value in the left and right regions is regarded as the final eye region. When the area of the left and right eyes is determined in this way, the center point of each area is set to the position of the eye.

얼굴 인식은, 입력된 이미지 데이터로부터 추출되어 특정한 규격으로 정규화된 얼굴 이미지로부터 얼굴 특징을 추출하고, 추출된 얼굴 특징을 각 인물의 데이터베이스의 학습된 모델들과 비교하여, 가장 유사도가 높은 인물로 얼굴을 식별하는, 일련의 과정을 통해 수행될 수 있다.Face recognition extracts face features from face images extracted from the input image data and normalized to a specific standard, and compares the extracted face features with trained models in each person's database, thereby making the face the person with the highest similarity. This can be done through a series of processes to identify.

얼굴 인식을 위해 PCA(principal component analysis), FLDA(Fisher linear discriminant analysis) 또는 RLDA(Regualized linear discriminant analysis) 등의 알고리즘이 사용될 수 있다.Algorithms such as principal component analysis (PCA), Fisher linear discriminant analysis (FLDA) or Regularized linear discriminant analysis (RLDA) may be used for face recognition.

단계(S150)에서, 컨텐츠 서버(100)는 컨텐츠 검색용 태그와 동일 또는 유사한 태그 또는 관련된 것으로 판단되는 태그를 포함하는 컨텐츠를 컨텐츠 서버(100)에 포함된 데이터베이스 또는 외부의 데이터베이스에서 검색한다.In operation S150, the content server 100 searches for a content including a tag that is the same as or similar to the content search tag or a tag determined to be related to the database included in the content server 100 or an external database.

단계(S160)에서, 컨텐츠 서버(100)는 검색된 컨텐츠를 셋탑 박스(20)로 전송한다.In operation S160, the content server 100 transmits the retrieved content to the set top box 20.

단계(S170)에서, 셋탑 박스(20)는 컨텐츠 서버(100)로부터 수신한 컨텐츠를 사용자에게 제공한다.In step S170, the set-top box 20 provides the user with the content received from the content server 100.

도 4는 본 발명의 일 실시예에 따라 이미지 데이터를 분석하여 컨텐츠를 검색하는 방법의 흐름을 도시한 순서도이다.4 is a flowchart illustrating a method of searching for content by analyzing image data according to an embodiment of the present invention.

단계(S210)에서, 셋탑 박스(210)로부터 이미지 데이터를 수신한다. 이미지 데이터는 전술한 바와 같이 인물 이미지 또는 객체 이미지를 포함할 수 있다.In operation S210, image data is received from the set-top box 210. The image data may include a person image or an object image as described above.

단계(S220)에서, 단계(S210)에서 수신한 이미지 데이터가 인물 이미지를 포 함하는지 여부를 판단한다. 즉, 이미지 데이터가 인물, 예를 들어 영화 배우 등을 포함하는지 여부를 판단하며, 인물을 포함하는 것은 인물의 전신뿐만 아니라 얼굴만을 포함하는 경우도 해당한다. 따라서, 이미지 데이터가 얼굴을 비롯한 인물 이미지를 포함하는지 여부를 판단한다.In step S220, it is determined whether the image data received in step S210 includes a person image. That is, it is determined whether the image data includes a person, for example, a movie star, and the like, and the case of including a person corresponds to a case in which only the face of the person is included as well as the face. Therefore, it is determined whether the image data includes a person image including a face.

단계(S230)에서는, 단계(S220)에서 이미지 데이터가 인물 이미지를 포함하는 것으로 판단되는 경우, 이미지 데이터로부터 인물 이미지의 특징을 추출하여 인물 인식 파라미터를 생성한다. 인물 이미지의 특징은 얼굴 인식(face recognition)을 이용하여 추출될 수 있으며, 인물 인식 파라미터는 인물 이미지의 특징으로부터 검출된 인물의 이름, 인물을 대표하는 키워드 등을 포함할 수 있다.In step S230, when it is determined in step S220 that the image data includes a person image, a feature of the person image is extracted from the image data to generate a person recognition parameter. The feature of the person image may be extracted using face recognition, and the person recognition parameter may include a name of the person detected from the feature of the person image, a keyword representing the person, and the like.

예를 들어, 이미지 데이터가 영화 배우 "홍길동"의 얼굴을 포함하는 경우, 얼굴 인식을 통해 인물 이미지의 특징으로서 "홍길동"의 얼굴이 검출될 수 있으며, 이로 인해 인물 인식 파라미터는 단어로써 "홍길동"을 키워드로 포함할 수 있다. 또한, 인물 인식 파라미터는 영화 배우 "홍길동"의 대표 영화 제목을 포함할 수 있다.For example, when the image data includes the face of the movie actor "Hong Gil Dong", the face recognition may detect the face of "Hong Gil Dong" as a feature of the human image, so that the person recognition parameter is "Hong Gil Dong" as the word. Can be included as a keyword. In addition, the person recognition parameter may include a representative movie title of the movie actor "Hong Gil Dong".

단계(S240)에서는, 단계(S230)에서 생성된 인물 인식 파라미터를 이용하여 컨텐츠 검색용 태그인 인물 이미지 태그를 생성한다. 컨텐츠 검색용 태그는 각 컨텐츠에 대응되는 태그와 비교하기 위하여 생성된다.In step S240, a person image tag, which is a tag for content search, is generated using the person recognition parameter generated in step S230. The content search tag is generated to compare with a tag corresponding to each content.

단계(S250)에서, 단계(S210)에서 수신한 이미지 데이터가 객체 이미지를 포함하는지 여부를 판단한다. 즉, 이미지 데이터가 건축물, 산, 강, 동물, 자동차 등의 객체를 포함하는지 여부를 판단한다.In step S250, it is determined whether the image data received in step S210 includes an object image. That is, it is determined whether the image data includes objects such as buildings, mountains, rivers, animals, and cars.

단계(S260)에서는, 단계(S250)에서 이미지 데이터가 객체 이미지를 포함하는 것으로 판단되는 경우, 이미지 데이터로부터 객체 이미지의 특징을 추출하여 객체 인식 파라미터를 생성한다. 객체 이미지의 특징은 객체 인식(object recognition)을 이용하여 추출될 수 있으며, 객체 인식 파라미터는 객체 이미지의 특징으로부터 검출된 명칭, 예를 들어 건축물의 명칭, 동물의 이름, 자동차의 모델명 등을 포함할 수 있다.In operation S260, when it is determined in operation S250 that the image data includes an object image, an object recognition parameter is generated by extracting a feature of the object image from the image data. The feature of the object image may be extracted using object recognition, and the object recognition parameter may include a name detected from the feature of the object image, for example, a building name, an animal name, a model name of a vehicle, and the like. Can be.

예를 들어, 이미지 데이터가 건축물 "동대문"을 포함하는 경우, 객체 인식을 통해 객체 이미지의 특징으로서 "동대문"의 형상이 검출될 수 있으며, 이로 인해 객체 인식 파라미터는 단어로써 "동대문"을 키워드로 포함할 수 있다.For example, when the image data includes the building "Dongdaemun", the object recognition may detect the shape of "Dongdaemun" as a feature of the object image, so that the object recognition parameter is a word "Dongdaemun" as a keyword. It may include.

단계(S270)에서, 단계(S260)에서 생성된 객체 인식 파라미터를 이용하여 컨텐츠 검색용 태그인 객체 이미지 태그를 생성한다. 전술한 바와 같이, 컨텐츠 검색용 태그는 각 컨텐츠에 대응되는 태그와 비교하기 위하여 생성된다.In operation S270, an object image tag, which is a tag for content search, is generated using the object recognition parameter generated in operation S260. As described above, the content search tag is generated for comparison with a tag corresponding to each content.

단계(S280)에서, 단계(S240)에서 생성된 인물 이미지 태그 또는 단계(S270)에서 생성된 객체 이미지 태그를 이용하여 단계(S210)에서 수신한 이미지 데이터에 관련된 컨텐츠를 검색하고, 검색된 컨텐츠를 셋탑 박스(도시 생략)에 제공한다.In step S280, the content related to the image data received in step S210 is searched using the person image tag generated in step S240 or the object image tag generated in step S270, and the retrieved content is set-top. It is provided in a box (not shown).

이미지 데이터에 관련된 컨텐츠는 인물 이미지 태그, 객체 이미지 태그 또는 각각의 태그를 포함하는 태그와 검색 대상이 되는 컨텐츠에 첨부된 태그를 비교하여 검색될 수 있다.Content related to the image data may be searched by comparing a person image tag, an object image tag, or a tag including each tag with a tag attached to the content to be searched.

즉, 검색 대상이 되는 컨텐츠는 각각 요약 정보를 제공하는 태그를 포함한다. 따라서, 컨텐츠에 포함된 태그와 이미지 데이터로부터 생성된 태그를 비교하 여 컨텐츠가 검색될 수 있다. 이처럼 검색되어 셋탑 박스에 제공되는 컨텐츠는 이미지, 동영상 또는 텍스트 파일을 포함할 수 있다.That is, the content to be searched for includes tags that provide summary information. Therefore, the content may be searched by comparing a tag included in the content with a tag generated from image data. The content retrieved and provided to the set-top box may include an image, a video, or a text file.

아이피티브이(IPTV) 관련 영화, 드라마 등의 모든 컨텐츠는 컨텐츠를 대표하는 포스터를 포함할 수 있다. 이러한 포스터 속에는 대부분 주인공이 등장하며, 컨텐츠의 특징을 시각화할 수 있는 이미지가 포함되어 있다. 이러한 포스터를 대상으로 이미지 분석을 이용하여 컨텐츠가 검색될 수 있다.All content such as IPTV related movies, dramas, etc. may include a poster representing the content. Most of these posters have main characters and images that can visualize the characteristics of the contents. Content may be searched for such a poster using image analysis.

예를 들어, 포스터에 포함된 주연 또는 조연 배우에 대한 인물 이미지와 배경이 되는 하늘, 산, 건축물, 나무, 동물, 물고기 또는 자동차 등에 대한 객체 이미지를 이용하면, 시청자가 원하는 컨텐츠가 포스터 이미지 정보를 이용하여 간단히 검색될 수 있다.For example, if you use a character image of a lead or supporting actor included in a poster and an object image of a sky, a mountain, a building, a tree, an animal, a fish, a car, etc., the content desired by a viewer Can be simply retrieved.

또한, 아이피티브이(IPTV) 시청 중 화면에 디스플레이된 배우 또는 자동차 등에 대한 정보가 필요한 경우, 아이피티브이(IPTV) 사용자는 화면 캡처를 이용하여 배우 또는 자동차에 대한 이미지 데이터를 생성하여 관련 컨텐츠를 검색할 수 있다.In addition, if information about an actor or a car displayed on the screen is required while watching an IPTV, the IPTV user may use screen capture to generate image data about the actor or the car and search for related content. Can be.

도 5는 본 발명의 일 실시예에 따른 검색 서버의 구성을 도시한 블록도이다.5 is a block diagram showing the configuration of a search server according to an embodiment of the present invention.

본 발명의 일 실시예에 따른 검색 서버(100)는 데이터 송수신부(110), 이미지 처리부(120) 및 컨텐츠 검색부(130)를 포함한다.Search server 100 according to an embodiment of the present invention includes a data transmission and reception unit 110, an image processing unit 120 and a content search unit 130.

데이터 송수신부(110)는 셋탑 박스(도시 생략)로부터 이미지 데이터를 수신하고, 컨텐츠 검색부(130)에 의해 검색된 컨텐츠를 셋탑 박스로 전송한다.The data transmission / reception unit 110 receives image data from a set-top box (not shown), and transmits the content searched by the content search unit 130 to the set-top box.

이미지 처리부(120)는 데이터 송수신부(110)가 수신한 이미지 데이터를 분석 하여 특징을 추출하여 인식 모델 파라미터를 생성하고, 생성된 인식 모델 파라미터로부터 관련된 태그를 추출한다. 추출되는 태그는 컨텐츠 검색용 태그로서, 이미지 데이터와 관련된 키워드로 구성될 수 있다.The image processor 120 analyzes the image data received by the data transceiver 110, extracts a feature, generates a recognition model parameter, and extracts a related tag from the generated recognition model parameter. The extracted tag is a content search tag, and may be composed of keywords related to image data.

이미지 처리부(120)는 이미지 데이터에 포함된 얼굴 이미지 및 객체 이미지 중 적어도 하나를 분석하여 태그를 추출할 수 있다. 또한, 얼굴 이미지 및 객체 이미지로부터 각각 태그가 추출된 경우, 이미지 처리부(120)는 각각의 태그를 결합하여 태그를 생성할 수도 있다.The image processor 120 may extract a tag by analyzing at least one of a face image and an object image included in the image data. In addition, when a tag is extracted from each of the face image and the object image, the image processor 120 may combine the respective tags to generate a tag.

컨텐츠 검색부(130)는 이미지 처리부(120)에 의해 생성된 태그와 동일 또는 유사하거나 관련된 것으로 판단되는 태그를 포함하는 컨텐츠를 컨텐츠 데이터베이스(140)에서 검색한다.The content search unit 130 searches the content database 140 for content including a tag determined to be the same, similar, or related to the tag generated by the image processing unit 120.

컨텐츠 데이터베이스(140)는 다양한 컨텐츠를 저장하며, 컨텐츠 데이터베이스(140)에 저장된 컨텐츠는 각각 관련된 태그를 포함한다. 이러한 태그는 하나 이상일 수 있다.The content database 140 stores various contents, and the contents stored in the content database 140 each include an associated tag. There may be more than one such tag.

또한, 컨텐츠 검색부(130)는 컨텐츠 데이터베이스(140)뿐만 아니라 컨텐츠를 포함하는 외부의 데이터베이스 서버(도시 생략)로부터 컨텐츠 검색용 태그를 이용하여 컨텐츠를 검색할 수도 있다.In addition, the content search unit 130 may search for content using a content search tag from an external database server (not shown) including the content as well as the content database 140.

도 6은 본 발명의 일 실시예에 따른 이미지 처리부의 구성을 도시한 블록도이다.6 is a block diagram illustrating a configuration of an image processor according to an exemplary embodiment of the present invention.

본 발명의 일 실시예에 따른 이미지 처리부(120)는 인물 이미지 추출 모듈(121), 객체 이미지 추출 모듈(122) 및 태그 생성부(123)를 포함한다.The image processing unit 120 according to an embodiment of the present invention includes a person image extraction module 121, an object image extraction module 122, and a tag generator 123.

인물 이미지 추출 모듈(121)은 이미지 데이터에 포함된 인물 이미지로부터 특징을 추출하여 인물 인식 파라미터를 생성한다. 인물 이미지로부터 생성된 인물 인식 파라미터는 사람 이름일 수 있다.The person image extraction module 121 extracts a feature from the person image included in the image data and generates a person recognition parameter. The person recognition parameter generated from the person image may be a person name.

예를 들어, 이미지 데이터가 영화 배우의 얼굴을 포함하는 경우, 인물 이미지 추출 모듈(121)은 영화 배우의 이름을 이용하여 인물 인식 파라미터를 생성할 수 있다.For example, when the image data includes a face of a movie star, the person image extraction module 121 may generate a person recognition parameter using the name of the movie star.

객체 이미지 추출 모듈(122)은 이미지 데이터에 포함된 이미지 중에서 인물 이미지를 제외한 이미지인 객체 이미지로부터 특징을 추출하여 객체 인식 파라미터를 생성한다. 객체 이미지는, 예를 들어 주변 배경 이미지인, 건축물, 산, 하늘, 강, 바다 등의 이미지를 포함할 수 있다. 따라서, 객체 이미지로부터 생성된 객체 인식 파라미터는 건축물, 산, 강 등의 명칭이 될 수 있다.The object image extraction module 122 generates an object recognition parameter by extracting a feature from an object image that is an image excluding a person image among images included in the image data. The object image may include, for example, an image of a building, a mountain, a sky, a river, and the sea, which are surrounding background images. Accordingly, the object recognition parameter generated from the object image may be a name of a building, a mountain, a river, or the like.

예를 들어, 이미지 데이터가 유명 건축물의 이미지를 포함하는 경우, 객체 이미지 추출 모듈(122)은 유명 건축물의 명칭을 이용하여 객체 인식 파라미터를 생성할 수 있다.For example, when the image data includes an image of a famous building, the object image extraction module 122 may generate an object recognition parameter using the name of the famous building.

태그 생성부(123)은 인물 이미지 추출 모듈(121) 또는 객체 이미지 추출 모듈(122)로부터 생성된 인물 인식 파라미터 또는 객체 인식 파라미터를 이용하여 컨텐츠 검색을 위한 태그(tag)를 생성한다.The tag generator 123 generates a tag for content search using the person recognition parameter or the object recognition parameter generated from the person image extraction module 121 or the object image extraction module 122.

도 7은 본 발명의 일 실시예에 따른 셋탑 박스(STB)의 구성을 도시한 블록도이다.7 is a block diagram showing the configuration of a set-top box (STB) according to an embodiment of the present invention.

본 발명의 일 실시예에 따른 셋탑 박스(STB)(20)는 이미지 입력부(21), 이미 지 송신부(22), 검색결과 출력부(23), 통신 인터페이스(24) 및 캡처부(25)를 포함한다.Set-top box (STB) 20 according to an embodiment of the present invention is the image input unit 21, the image transmitter 22, the search result output unit 23, the communication interface 24 and the capture unit 25 Include.

이미지 입력부(21)는 이동 단말(도시 생략)로부터 이미지 데이터의 입력을 수신하거나, 캡처부(25)로부터 가입자 단말의 화면에 대한 캡처 이미지 데이터를 수신한다.The image input unit 21 receives an input of image data from a mobile terminal (not shown) or receives captured image data of a screen of the subscriber terminal from the capture unit 25.

이미지 송신부(22)는 이미지 입력부(21)가 수신한 이미지 데이터를 아이피(internet protocol, IP) 네트워크를 통하여 아이피티브이(IPTV) 시스템(도시 생략) 또는 컨텐츠 서버(도시 생략)로 전송한다. 특히, 이미지 송신부(22)는 리턴 채널을 통해 이미지 데이터를 아이피티브이(IPTV) 시스템 또는 컨텐츠 서버로 전송할 수 있다.The image transmitter 22 transmits the image data received by the image input unit 21 to an IPTV system (not shown) or a content server (not shown) through an internet protocol (IP) network. In particular, the image transmitter 22 may transmit image data to an IPTV system or a content server through a return channel.

검색결과 출력부(23)는 아이피티브이(IPTV) 시스템 또는 컨텐츠 서버가 수행한 검색 결과를 출력한다. 보다 상세하게는, 검색결과 출력부(23)는 아이피티브이(IPTV) 시스템 또는 컨텐츠 서버에 의한 검색 결과에 대응하는 컨텐츠 리스트를 수신하고, 수신한 컨텐츠 리스트를 출력하여 아이피티브이(IPTV)의 사용자에게 컨텐츠에 대한 선택을 제공한다.The search result output unit 23 outputs a search result performed by an IPTV system or a content server. More specifically, the search result output unit 23 receives a content list corresponding to a search result by an IPTV system or a content server, and outputs the received content list to a user of the IPTV. Provide a selection of content.

따라서, 아이피티브이의 사용자는 출력된 검색 결과를 이용하여 원하는 컨텐츠를 선택할 수 있다.Accordingly, the user of the iPyiyi can select the desired content by using the output search result.

통신 인터페이스(24)는 이동 단말(도시 생략)과의 유선 또는 근거리 무선 통신을 위한 인터페이스를 제공한다. 에를 들어, 통신 인터페이스(24)는 유에스비(USB), 적외선 또는 블루투스(Bluetooth) 통신을 위한 통신 인터페이스를 제공할 수 있다.The communication interface 24 provides an interface for wired or short-range wireless communication with a mobile terminal (not shown). For example, the communication interface 24 may provide a communication interface for USB, infrared or Bluetooth communication.

또한, 통신 인터페이스(24)는 이미지 입력부(21)와 연결되어 이동 단말로부터 수신한 이미지 데이터를 이미지 입력부로 전달한다.In addition, the communication interface 24 is connected to the image input unit 21 to transfer the image data received from the mobile terminal to the image input unit.

캡처부(25)는 셋탑 박스(20)와 연결된 가입자 단말의 화면을 캡처(capture)하여 이미지 데이터를 생성한다. 즉, 캡처부(25)는 가입자 단말의 화면에 방송중인 컨텐츠의 이미지를 캡처하고, 캡처한 이미지를 이용하여 이미지 데이터를 생성한다. 캡처부(25)에 의해 생성된 이미지 데이터는 이미지 입력부(21)로 전달된다.The capturer 25 captures a screen of the subscriber station connected to the set top box 20 to generate image data. That is, the capture unit 25 captures an image of the content being broadcast on the screen of the subscriber terminal, and generates image data using the captured image. The image data generated by the capture unit 25 is transferred to the image input unit 21.

본 발명의 일 실시예는 컴퓨터에 의해 실행되는 프로그램 모듈과 같은 컴퓨터에 의해 실행가능한 명령어를 포함하는 기록 매체의 형태로도 구현될 수 있다. 컴퓨터 판독 가능 매체는 컴퓨터에 의해 액세스될 수 있는 임의의 가용 매체일 수 있고, 휘발성 및 비휘발성 매체, 분리형 및 비분리형 매체를 모두 포함한다. 또한, 컴퓨터 판독가능 매체는 컴퓨터 저장 매체 및 통신 매체를 모두 포함할 수 있다. 컴퓨터 저장 매체는 컴퓨터 판독가능 명령어, 데이터 구조, 프로그램 모듈 또는 기타 데이터와 같은 정보의 저장을 위한 임의의 방법 또는 기술로 구현된 휘발성 및 비휘발성, 분리형 및 비분리형 매체를 모두 포함한다. 통신 매체는 전형적으로 컴퓨터 판독가능 명령어, 데이터 구조, 프로그램 모듈, 또는 반송파와 같은 변조된 데이터 신호의 기타 데이터, 또는 기타 전송 메커니즘을 포함하며, 임의의 정보 전달 매체를 포함한다.One embodiment of the present invention can also be implemented in the form of a recording medium containing instructions executable by a computer, such as a program module executed by the computer. Computer readable media can be any available media that can be accessed by a computer and includes both volatile and nonvolatile media, removable and non-removable media. In addition, computer readable media may include both computer storage media and communication media. Computer storage media includes both volatile and nonvolatile, removable and non-removable media implemented in any method or technology for storage of information such as computer readable instructions, data structures, program modules or other data. Communication media typically includes computer readable instructions, data structures, program modules, or other data in a modulated data signal such as a carrier wave, or other transmission mechanism, and includes any information delivery media.

본 발명의 방법 및 시스템은 특정 실시예와 관련하여 설명되었지만, 그것들의 구성 요소 또는 동작의 일부 또는 전부는 범용 하드웨어 아키텍쳐를 갖는 컴퓨 터 시스템을 사용하여 구현될 수 있다.Although the methods and systems of the present invention have been described in connection with specific embodiments, some or all of their components or operations may be implemented using a computer system having a general purpose hardware architecture.

전술한 본 발명의 설명은 예시를 위한 것이며, 본 발명이 속하는 기술분야의 통상의 지식을 가진 자는 본 발명의 기술적 사상이나 필수적인 특징을 변경하지 않고서 다른 구체적인 형태로 쉽게 변형이 가능하다는 것을 이해할 수 있을 것이다. 그러므로 이상에서 기술한 실시예들은 모든 면에서 예시적인 것이며 한정적이 아닌 것으로 이해해야만 한다. 예를 들어, 단일형으로 설명되어 있는 각 구성 요소는 분산되어 실시될 수도 있으며, 마찬가지로 분산된 것으로 설명되어 있는 구성 요소들도 결합된 형태로 실시될 수 있다.The foregoing description of the present invention is intended for illustration, and it will be understood by those skilled in the art that the present invention may be easily modified in other specific forms without changing the technical spirit or essential features of the present invention. will be. Therefore, it should be understood that the embodiments described above are exemplary in all respects and not restrictive. For example, each component described as a single type may be implemented in a distributed manner, and similarly, components described as distributed may be implemented in a combined form.

본 발명의 범위는 상기 상세한 설명보다는 후술하는 특허청구범위에 의하여 나타내어지며, 특허청구범위의 의미 및 범위 그리고 그 균등 개념으로부터 도출되는 모든 변경 또는 변형된 형태가 본 발명의 범위에 포함되는 것으로 해석되어야 한다.The scope of the present invention is shown by the following claims rather than the above description, and all changes or modifications derived from the meaning and scope of the claims and their equivalents should be construed as being included in the scope of the present invention. do.

도 1은 본 발명의 일 실시예가 적용될 수 있는 아이피티브이(IPTV) 방송 시스템의 구성을 도시한 블록도,1 is a block diagram showing the configuration of an IPTV broadcasting system to which an embodiment of the present invention can be applied;

도 2는 본 발명의 일 실시예에 따른 아이피티브이에서 이미지 인식을 이용한 컨텐츠 검색 시스템의 구성을 도시한 블록도,FIG. 2 is a block diagram illustrating a configuration of a content retrieval system using image recognition in an IP according to an embodiment of the present invention; FIG.

도 3은 본 발명의 일 실시예에 따른 아이피티브이에서 이미지 인식을 이용한 컨텐츠 검색 방법의 흐름을 도시한 신호 흐름도,3 is a signal flow diagram illustrating a flow of a content retrieval method using image recognition in an IP according to an embodiment of the present invention;

도 4는 본 발명의 일 실시예에 따라 이미지 데이터를 분석하여 컨텐츠를 검색하는 방법의 흐름을 도시한 순서도,4 is a flowchart illustrating a method of searching for content by analyzing image data according to an embodiment of the present invention;

도 5는 본 발명의 일 실시예에 따른 검색 서버의 구성을 도시한 블록도,5 is a block diagram showing the configuration of a search server according to an embodiment of the present invention;

도 6은 본 발명의 일 실시예에 따른 이미지 처리부의 구성을 도시한 블록도,6 is a block diagram showing a configuration of an image processing unit according to an embodiment of the present invention;

도 7은 본 발명의 일 실시예에 따른 셋탑 박스(STB)의 구성을 도시한 블록도.7 is a block diagram showing the configuration of a set-top box (STB) according to an embodiment of the present invention.

Claims

In the content retrieval method using image recognition in IPTV,

(a) matching and storing at least one tag that can be extracted from an image with respect to provided content,

(b) receiving image data from a set-top box (STB),

(c) analyzing the image data to generate a tag for content search; and

(d) searching for content related to the image data by comparing the content search tag with a tag matching the content

Content search method comprising a.

The method of claim 1,

And the image data is stored in the set top box (STB) through short-range communication from an external device.

The method of claim 2,

And the external device is a mobile terminal having a camera.

The method of claim 1,

And the image data is generated by capturing a screen of a subscriber station connected to the set top box (STB).

The method according to claim 2 or 4,

(e) providing a content list for viewing the retrieved content

Content search method further comprising.

The method according to claim 2 or 4,

In step (b),

(b1) generating a person image tag by extracting a feature of the person image when the image data includes a person image

Including,

The content search tag includes the person image tag.

The method according to claim 2 or 4,

(b2) if the image data includes an object image, generating an object image tag by extracting features of the object image

Including,

The content search tag comprises the object image tag.

In a content retrieval system using image recognition in IPTV,

A data transceiver for receiving image data from the set-top box,

An image processor extracting a feature of an image included in the image data and generating a content search tag;

A content database that matches and stores at least one tag that can be extracted from an image for each content;

A content searching unit that searches for contents related to the image data in the contents database using the contents searching tag;

Including,

And the content search unit searches for the content by comparing the content search tag with the tag included in the content.

The method of claim 8,

The image processing unit,

A person image extraction module for generating a person recognition parameter from the person image included in the image data;

An object image extraction module for generating an object recognition parameter from the object image included in the image data;

Tag generation unit for generating a content search tag using the person recognition parameter or the object recognition parameter.

Content retrieval system comprising a.

In a set-top box (STB) that provides content retrieval using image recognition,

An image input unit providing an input of an image for search,

An image transmitter for transmitting the image to an IPTV system through a return channel;

Search result output unit for receiving a list of content corresponding to the search results from the IPTV system to provide a selection of content

Set top box providing a content search comprising a.

The method of claim 10,

The search result is a set-top box for providing a content search that is generated by comparing the tag extracted from the input image and the tag matching each content.

The method of claim 10,

Short-range communication interface connected to the image input unit

Set top box to provide a content search further comprising.

The method of claim 10,

Capture unit for capturing an image of the content being broadcast

More,

And the image captured by the capture unit is delivered to the image input unit.