KR20190069695A

KR20190069695A - Video content creation device with subtitle

Info

Publication number: KR20190069695A
Application number: KR1020170169862A
Authority: KR
Inventors: 김태준
Original assignee: 주식회사 유니큐
Priority date: 2017-12-12
Filing date: 2017-12-12
Publication date: 2019-06-20

Abstract

The present invention relates to a video content creating device with a subtitle. According to the present invention, a subtitle character is inserted into a speech bubble part to be displayed, thereby clearly confirming a relationship between a subtitle and a talker. This is even more helpful in a place where a speaker or the like cannot be utilized. In addition, an important part on a screen can be prevented from being hidden by enabling the speech bubble to be provided in an even tone part of an image or by changing a position of the speech bubble according to an instruction of a user. Furthermore, a form of the speech bubble can be varied according to a comment of the talker or characteristics of lines.

Description

Title: Video content creation device with subtitle}

본 발명은 영상 컨텐츠의 작성 장치에 관한 것으로, 보다 구체적으로는 자막이 부착된 영상 컨텐츠를 제작할 수 있도록 하는 기술에 대한 것이다.BACKGROUND OF THE INVENTION 1. Field of the Invention [0002] The present invention relates to a video content creating apparatus, and more particularly, to a technique for creating video content having a caption attached thereto.

종래, 외국어의 영화 내용을 이해하기 위해서, 등장 인물의 대사를 영상 화면 일부에 모국어로 문자 표시를 해주곤 하였다. 이를 통해 시청자는 등장 인물이 외국어로 대화를 하더라도 모국어 문자를 통해 대화 내용을 이해할 수 있었다.Conventionally, in order to understand the movie contents of a foreign language, the characters of the characters were displayed on the part of the video screen in the native language. Through this, the viewer could understand the conversation contents through the mother tongue characters even if the character talks in a foreign language.

또한 최근에는 등장 인물이 모국어로 대화를 하고 있더라도 연출상의 이유로 대화 내용을 자막으로 표시해주는 방송 프로그램이 유행하고 있다. 물론 등장 인물의 대화 내용 외에도 영상 내용에 대한 보조 설명으로 자막이 다수 활용되고 있다.Recently, even if the characters are conversing in their own language, a broadcast program for displaying the contents of the conversation as subtitles is becoming prevalent for reasons of presentation. Of course, in addition to the contents of the characters, subtitles are being used for the explanation of the video contents.

통상적으로 시청자는 화면과 함께 출력되는 음성을 통해 발화자가 누구인지 인식한다. 따라서 종래와 같이 단순한 자막 출력만을 이용하였을 경우, 음소거 상태로 화면을 시청할 때에는 누가 발화하고 있는지 알 수 없게 된다.Typically, the viewer recognizes who is speaking through the voice output along with the screen. Therefore, when only the subtitle output is used as in the conventional art, it is impossible to know who is speaking when watching the screen in the mute state.

본 발명은 상술한 바와 같은 종래 기술의 문제점을 해결하기 위해 안출된 것으로, 발화자와 자막과의 관계를 쉽게 알 수 있도록 하는 기술을 제공하는 데 그 목적이 있다.SUMMARY OF THE INVENTION It is an object of the present invention to provide a technique for easily knowing a relationship between a speaker and a caption.

상기 목적을 달성하기 위한 본 발명에 따른 자막이 있는 영상 컨텐츠 작성 장치는, 말풍선에 의한 자막이 부착된 영상 컨텐츠를 제공하기 위해 필요한 데이터를 작성하기 위한 컨텐츠 작성 장치로서, 원 데이터가 되는 영상 컨텐츠 데이터에 기반한 영상 내에서 말풍선을 표시해야 할 시간을 추출하는 말풍선 시간 추출 수단과, 상기 말풍선 시간 추출 수단이 추출한 시간의 영상 내에서 말풍선의 표시에 적합한 말풍선 영역을 결정하는 말풍선 영역 결정 수단과, 상기 말풍선 영역 결정 수단이 결정된 상기 말풍선 영역에 합성하는 말풍선 화상을 결정하는 말풍선 화상 결정 수단과, 상기 말풍선 화상 결정 수단이 결정된 상기 말풍선 화상에 합성하는 자막 문자를 결정하는 자막 문자 결정 수단과 상기 말풍선을 표시해야 할 시간에 관한 정보, 상기 말풍선 영역에 관한 정보, 상기 말풍선 화상에 관한 정보 및 상기 자막 문자에 관한 정보 중, 적어도 하나의 정보를 데이터화함으로써 말풍선 데이터를 작성하는 말풍선 데이터 작성 수단을 구비하고, 상기 말풍선 데이터 작성 수단에 의해 작성된 말풍선 데이터는 상기 영상 컨텐츠 데이터와 함께 재생됨으로써 말풍선에 의한 자막이 부착된 영상 컨텐츠를 제공할 수 있다.According to another aspect of the present invention, there is provided an apparatus for generating a video content having a caption according to the present invention, the apparatus comprising: a content creator for creating data necessary for providing caption- A speech bubble region determining means for determining a speech bubble region suitable for display of a speech balloon in an image of a time extracted by the speech balloon time extracting means; A balloon image determining means for determining a balloon image to be synthesized in the balloon region determined by the region determining means; a caption character determining means for determining a caption character to be synthesized with the balloon image determined by the balloon image determining means; Information on time to be performed, And bubble data creating means for creating bubble data by converting at least one piece of information among the information on the region, the information on the region, the information on the region, the information on the region, the information on the balloon image, and the information on the caption character, Can be reproduced together with the image content data, thereby providing image contents with captions by the speech balloon.

본 발명에 따르면 자막 문자를 말풍선부분에 삽입해 표시토록 함으로써, 자막과 발화자의 관계를 명확하게 확인할 수가 있다. 이는 스피커 등을 활용할 수 없는 장소에서 더욱 도움이 된다.According to the present invention, the relationship between the caption and the speaker can be clearly confirmed by inserting and displaying the caption character in the speech bubble portion. This is even more helpful in places where you can not use speakers, for example.

또한 영상에서 색조가 평탄한 부분에 말풍선이 마련되도록 하거나 사용자의 지시에 따라 말풍선 위치가 변경될 수 있도록 함으로써, 화면상 중요한 부분이 가려질 우려가 없다. 또한 발화자의 발언에 따라 또는 대사의 성격에 따라 말풍선 형태를 다양하게 변경할 수가 있다.Also, it is possible to prevent the important part on the screen from being hidden by allowing the speech balloon to be provided in the flat part of the image in the image or by changing the position of the speech balloon according to the instruction of the user. Also, depending on the utterance of the speaker or the nature of the dialogue, the form of speech balloon can be variously changed.

도1은 본 발명의 실시예에 따른 자막이 부착된 영상 컨텐츠를 제작하는 장치를 설명하기 위한 도면.BRIEF DESCRIPTION OF THE DRAWINGS Fig. 1 is a diagram for explaining an apparatus for producing image contents with captions according to an embodiment of the present invention; Fig.

이하에서는 본 발명의 바람직한 실시예를 첨부된 도면을 참조하여 설명한다. 다만 발명의 요지와 무관한 일부 구성은 생략 또는 압축할 것이나, 생략된 구성이라고 하여 반드시 본 발명에서 필요가 없는 구성은 아니며, 본 발명이 속하는 기술 분야에서 통상의 지식을 가진 자에 의해 결합되어 사용될 수 있다.Hereinafter, preferred embodiments of the present invention will be described with reference to the accompanying drawings. However, some configurations which are not related to the gist of the present invention may be omitted or compressed, but the configurations omitted are not necessarily those not necessary in the present invention, and they may be combined by a person having ordinary skill in the art to which the present invention belongs. .

도1은 본 발명의 실시예에 따른 자막이 있는 영상 컨텐츠의 작성 장치를 설명하기 위한 도면이다. 도1에 도시된 바와 같이, 방송 시스템은 컨텐츠 작성 장치(1), 컨텐츠 송신 장치(2), 방송 장치(3), 컨텐츠 재생 장치(4), 컨텐츠 표시 장치(5)를 포함한다.1 is a diagram for explaining an apparatus for generating an image content having a caption according to an embodiment of the present invention. 1, the broadcasting system includes a content creating apparatus 1, a content transmitting apparatus 2, a broadcasting apparatus 3, a content reproducing apparatus 4, and a content display apparatus 5. [

컨텐츠 작성 장치(1)는 미리 저장되어 있는 컨텐츠 데이터에 말풍선을 이용한 자막을 합성하여 말풍선 데이터를 작성하기 위해 마련된다.The content creating apparatus 1 is provided for composing the caption using the speech balloon to the content data stored in advance to create speech balloon data.

컨텐츠 송신 장치(2)는 컨텐츠 작성 장치(1)에서 컨텐츠 데이터 및 말풍선 데이터를 취득하여 이들을 다중화하고, 다중화 데이터를 구내 회선이나 공중망, 인터넷, 전파망 등의 네트워크를 통해 방송 장치(3)로 송신한다.The content transmitting apparatus 2 acquires the content data and speech balloon data from the content creating apparatus 1 and multiplexes them and transmits the multiplexed data to the broadcasting apparatus 3 via a network such as an in-premise line, a public network, the Internet, do.

방송 장치(3)는 컨텐츠 송신 장치(2)에서 보내져 오는 다중화 데이터를 수신하고 안테나를 통해 방송한다.The broadcast apparatus 3 receives the multiplexed data sent from the content transmitting apparatus 2 and broadcasts the multiplexed data through the antenna.

컨텐츠 재생 장치(4)는 방송 장치(3)로부터 송신되어 오는 다중화 데이터를 수신해 해석하고 말풍선을 이용한 자막이 부착된 영상을 컨텐츠 표시 장치(5)에 표시한다. 컨텐츠 표시 장치(5)는 컨텐츠 재생 장치(4)에서 보내져 오는 신호에 따라 말풍선을 이용한 자막이 부착된 영상을 표시한다.The content reproduction apparatus 4 receives and interprets the multiplexed data transmitted from the broadcasting apparatus 3 and displays an image having the caption using the speech balloon on the content display apparatus 5. [ The content display apparatus 5 displays an image with a caption using speech balloon according to a signal sent from the content reproduction apparatus 4. [

컨텐츠 작성 장치(1)는 데이터 작성 제어부(11)과 입력부(12)와 표시 출력부(13)과 타임 카운트부(14)와 저장부(15)를 포함한다.The content creating apparatus 1 includes a data creation control unit 11, an input unit 12, a display output unit 13, a time count unit 14, and a storage unit 15.

입력부(12)는 마우스나 키보드, 터치패널, 조이스틱 등의 입력 디바이스로서, 이들을 조작함으로써 사용자가 입력한 조작 정보를 데이터 작성 제어부(11)에 입력할 수 있다.The input unit 12 is an input device such as a mouse, a keyboard, a touch panel, or a joystick, and can input operation information input by a user to the data creation control unit 11 by operating these devices.

저장부(15)는 하드 디스크 등의 기록 장치이다. 저장부(15)에는 컨텐츠 데이터와 자막 리스트 데이터와 형상 데이터가 저장되어 있다.The storage unit 15 is a recording device such as a hard disk. The storage unit 15 stores content data, caption list data, and shape data.

컨텐츠 데이터는 영상 및 음성이 MPEG(Moving Picture Experts Group) 등의 방식으로 부호화된 스트림 데이터일 수 있다.The content data may be stream data in which video and audio are encoded by a method such as MPEG (Moving Picture Experts Group).

자막 리스트 데이터는 자막 문자와 그것을 표시하는 시간에 관한 정보를 포함할 수 있다. 자막 리스트 데이터에는 예를 들어 자막 개시 시간과 자막 계속 시간과 자막 문자가 등록되어 있을 수 있다. 여기서 자막 개시 시간은 컨텐츠의 선두에서 얼마나 경과한 시점에서 대응하는 자막 문자를 표시하는지를 나타내는 정보이다. 자막 계속 시간은 대응하는 자막 문자가 계속하여 표시되는 시간에 대한 정보이다.The caption list data may include information on a caption character and a time for displaying it. For example, the caption start time, the caption continuation time, and the caption character may be registered in the caption list data. Here, the caption start time is information indicating how long a caption character is displayed at the beginning of the content. The caption continuation time is information on the time when the corresponding caption character is continuously displayed.

말풍선 형상 데이터는 말풍선의 형상을 정의한 데이터이다. 예를 들어 말풍선 형상 데이터에는 말풍선 형상의 명칭, 출력용 데이터 등이 저장되어 있을 수 있다.The balloon shape data is data defining the shape of the balloon. For example, in the speech ball shape data, the name of the speech ball shape, data for output, and the like may be stored.

도4는 말풍선 데이터의 데이터 구성의 예를 나타내는 도이다. 도4에 나타난 바와 같이, 출력용 데이터에는 예를 들어 컨텐츠 데이터명에 대응하여 자막 개시 시간, 자막 계속 시간, 자막 문자 전개 스피드, 자막 문자 속성, 말풍선 범위, 말풍선 개시점, 말풍선 형상 및 자막 문자가 기술되어 있다. 자막 개시 시간 및 자막 계속 시간은 말풍선을 표시해야 할 시간에 관한 정보이다. 자막 문자 전개 스피드, 자막 문자 속성 및 자막 문자는 자막 문자에 관한 정보이다. 말풍선 범위 및 말풍선 개시점은 말풍선의 표시에 적합한 영상 내의 말풍선 영역에 관한 정보이다. 말풍선 형상은 말풍선 영역에 합성하는 말풍선 화상에 관한 정보이다. 말풍선 데이터는 말풍선을 표시해야 할 시간에 관한 정보, 말풍선 영역에 관한 정보, 말풍선 화상에 관한 정보 및 자막 문자에 관한 정보 중, 적어도 하나의 정보가 데이터화 된 것일 수 있다. 예를 들어 말풍선 데이터는 메타 랭귀지로 기술되어 있다. 여기서 자막 개시 시간, 자막 계속 시간 및 자막 문자는 자막 리스트 데이터의 것과 동일하다.4 is a diagram showing an example of the data structure of speech balloon data. As shown in Fig. 4, the output data includes, for example, a caption start time, a caption continuation time, a caption character expansion speed, a caption character attribute, a speech bubble range start point, a speech bubble start point, . The caption start time and the caption continuation time are information on the time to display a speech balloon. The subtitle character development speed, the subtitle character attribute, and the subtitle character are information on the subtitle character. The speech balloon range and the speech balloon start point are information about the speech balloon region in the image suitable for display of the speech balloon. The speech balloon shape is information about speech balloons to be synthesized in the speech balloon region. The speech balloon data may be data in which at least one piece of information among the information on the time to display the speech balloon, information on the speech balloon region, information on the speech balloon image, and information on the caption character is data. For example, speech balloon data is described as meta-language. Here, the caption start time, caption continuation time, and caption character are the same as those of the caption list data.

자막 문자 전개 스피드는 자막 계속 시간내에 자막 문자를 문장의 선두에서 순차적으로 표시하는 속도를 나타내는 정보이다. 자막 문자 속성은 자막 문자의 폰트 종류나 자막 문자의 색상, 자막 문자의 배경과 투과율, 자막 문자의 틀 종류 등을 나타내는 정보이다. 말풍선 범위는 말풍선을 합성하는 화면상의 위치를 나타내는 정보이다. 말풍선 개시점은 말풍선을 시작하는 화면상의 위치를 나타내는 정보이다. 말풍선 형상은 말풍선 데이터에 등록되어 있는 말풍선의 명칭을 나타내는 정보이다.The caption character development speed is information indicating the speed at which caption characters are sequentially displayed at the beginning of the caption within the caption continuation time. The subtitle character attribute is information indicating the font type of the subtitle character, the color of the subtitle character, the background and transparency of the subtitle character, and the frame type of the subtitle character. The speech balloon range is information indicating a position on the screen for synthesizing speech balloons. The speech balloon start point is information indicating the position on the screen where the speech balloon starts. The speech ball shape is information indicating the name of the speech ball registered in the speech ball data.

타임 카운트부(14)는 시간을 계측한다. 표시 출력부(13)는 데이터 작성 제어부(11)로부터의 신호에 따라 영상 및 말풍선 작성을 위한 화상을 표시하고 음성을 출력한다.The time counting section 14 measures the time. The display output unit 13 displays an image for creating a video image and a speech balloon in accordance with a signal from the data creation control unit 11 and outputs audio.

데이터 작성 제어부(11)는 컨텐츠 데이터를 재생하여 음성의 시작 시간 및 하강 시간을 검출해 자막 개시 시간 및 자막 계속 시간을 인식한다. 데이터 작성 제어부(11)는 인식한 자막 개시 시간 및 자막 계속 시간과 입력부(12)를 통해 사용자가 입력한 자막 문자와 대응시켜 자막 리스트 데이터를 작성하고 저장부(15)에 저장한다. 데이터 작성 제어부(11)는 자막 리스트 데이터를 참조하여 음성의 시작 시간을 검출하고, 해당 표시 시간의 영상 및 음성을 표시 출력부(13)에 표시 출력시킨다. 데이터 작성 제어부(11)는 표시하고 있는 영상에 형상을 합성시키고 해당 말풍선 형상 안에 자막 문자를 합성한다. 데이터 작성 제어부(11)는 상기 합성 결과, 사용자에 의한 최종결정이 이루어지면, 해당 자막 개시 시간의 말풍선 데이터를 작성한다. 데이터 작성 제어부(11)는 자막 개시 시간마다 작성된 말풍선 데이터를 종합하여 최종적인 말풍선 데이터를 생성한다. 데이터 작성 제어부(11)는 생성한 최종적인 말풍선 데이터를 저장부(15)에 저장한다.The data creation control unit 11 detects the start time and the fall time of the voice by reproducing the content data, and recognizes the caption start time and the caption continuation time. The data creation control unit 11 creates the caption list data in association with the recognized caption start time and the caption continuation time and the caption character input by the user through the input unit 12 and stores the created caption list data in the storage unit 15. [ The data creation control unit 11 detects the start time of the voice with reference to the caption list data, and displays and outputs the video and audio of the display time on the display output unit 13. [ The data creation control unit 11 composites the shape of the displayed image and synthesizes the caption characters in the balloon shape. When the final determination by the user is made as a result of the composition, the data creation control unit 11 creates speech balloon data of the subtitle start time. The data creation control unit 11 synthesizes the speech ball data created at each caption start time to generate final speech balloon data. The data creation control unit 11 stores the final speech balloon data generated in the storage unit 15. [

컨텐츠 송신 장치(2)는 다중화 제어부(21)과 조작부(22)와 오류 정정 부호 부가부(23)와 디지털 변조부(24)와 송신부(25)를 포함한다.The contents transmitting apparatus 2 includes a multiplexing control section 21, an operation section 22, an error correction code adding section 23, a digital modulation section 24 and a transmission section 25.

컨텐츠 재생 장치(4)는 재생 제어부(41)와 조작부(42)와 합성 전송부(43)와 타임 카운트부(44)와 말풍선 형상 기억부(45)와 수신부(46)와 복조부(47)와 오류 정정부(48)를 포함한다.The content reproduction apparatus 4 includes a reproduction control section 41, an operation section 42, a synthesis transfer section 43, a time count section 44, a speech balloon shape storage section 45, a reception section 46, And an error correction unit 48.

컨텐츠 표시 장치(5)는 표시 출력 디바이스부(51)와 구동 회로부(52)를 포함한다.The content display device 5 includes a display output device unit 51 and a drive circuit unit 52. [

이하 컨텐츠 작성 장치(1)의 동작에 대해서 설명한다.Hereinafter, the operation of the content creating apparatus 1 will be described.

우선 컨텐츠 작성 장치(1)의 데이터 작성 제어부(11)는 입력부(12)를 통해 입력되는 사용자로부터의 지시에 따라 저장부(15)에 저장되어 있는 원하는 컨텐츠 데이터를 읽어내고, 표시 출력부(13)에 영상을 표시시키고 음성을 출력시킨다.The data creation control unit 11 of the content creation apparatus 1 reads desired content data stored in the storage unit 15 in accordance with an instruction input by the user through the input unit 12, ) And outputs a voice.

이후 데이터 작성 제어부(11)는 음성 인식을 통해 음성의 시동 타이밍이 도래하는지를 판단한다. 음성의 시동 타이밍이 도래했을 경우, 데이터 작성 제어부(11)는 음성의 시동 타이밍을 자막 개시 시간으로 하고, 음성이 서서히 줄어드는 시간까지의 경과 간격을 자막 계속 시간으로 하여 이전의 음성에 대응하는 자막 문자를 사용자에게 입력시킨다. 데이터 작성 제어부(11)는 해당 자막 개시 시간, 해당 자막 계속 시간 및 그 자막 문자를 자막 리스트 데이터의 일부로서 저장부(15)에 저장한다.Thereafter, the data creation control unit 11 determines whether or not the voice start timing comes through voice recognition. When the start timing of the voice is reached, the data creation control section 11 sets the start timing of the voice as the caption start time and the elapsed time until the time of the voice gradually decreases as the caption continuation time, To the user. The data creation control unit 11 stores the caption start time, the caption continuation time and the caption character in the storage unit 15 as part of the caption list data.

데이터 작성 제어부(11)는 그 컨텐츠 데이터의 재생이 전체 종료했는지 판단한다. 전체 종료했을 경우, 데이터 작성 제어부(11)는 수시로 작성된 자막 리스트 데이터를 하나로 정리하고, 해당 컨텐츠에 대한 자막 리스트 데이터를 최종적으로 작성하고 저장부(15)에 저장하다.The data creation control unit 11 determines whether or not the reproduction of the content data is completely finished. The data creation control unit 11 arranges the closed caption list data in one time and finally creates the caption list data for the content and stores it in the storage unit 15. [

데이터 작성 제어부(11)는 자막 리스트 데이터를 참조하여 선두에서 순서대로 자막 개시 시간 및 자막 계속 시간을 인식한다. 이어서 데이터 작성 제어부(11)는 컨텐츠 데이터를 참조하여 인식한 자막 개시 시간부터 자막 계속 시간까지의 영상 및 음성을 표시 출력부(13)에 재생시킨다.The data creation control unit 11 recognizes the caption start time and the caption continuation time in order from the head with reference to the caption list data. Subsequently, the data creation control section (11) refers to the content data and reproduces the video and audio from the recognized caption start time to the caption continuation time on the display output section (13).

이어서 데이터 작성 제어부(11)는 자막 개시 시간부터 자막 계속 시간까지의 영상 내의 색채의 평탄도를 계산하여 같은 색조인 부분(이하, 평탄 부분이라고 한다)을 추출한다. 이어서 데이터 작성 제어부(11)는 추출한 평탄 부분에 들어갈 정도의 크기의 사각형을 인식한다. 이어서 데이터 작성 제어부(11)는 인식한 사각형을 점선의 틀(이하, 4각 테두리라고 한다)로 나타내도록, 표시 출력부(13)에 자막 시작 시점의 영상과 합성시켜 표시시킨다. 이 때 데이터 작성 제어부(11)는 4각 테두리의 네 귀퉁이에 특정 표식을 가한다.Subsequently, the data creation control unit 11 calculates the flatness of the color in the video from the caption start time to the caption continuation time, and extracts a portion having the same color tone (hereinafter, referred to as a flat portion). Then, the data creation control unit 11 recognizes a rectangle having a size enough to fit into the extracted flat portion. Subsequently, the data creation control unit 11 displays the synthesized image on the display output unit 13 with the image at the caption start time, so that the recognized rectangle is represented by a dotted line frame (hereinafter referred to as quadrangle frame). At this time, the data creation control unit 11 adds specific markings to the four corners of the quadrangular frame.

이어서 데이터 작성 제어부(11)는 4각 테두리를 말풍선을 표시하는 범위로서 좋은지 사용자에게 묻도록 표시 출력부(13)에 표시시키고, 사용자로부터의 수정 지시가 있으면, 지시에 따른 4각 테두리를 말풍선을 표시하는 범위로 한다. 이 때, 데이터 작성 제어부(11)는 4각 테두리의 네 귀퉁이 좌표를 메모리에 일시적으로 저장한다. 또한 사용자에 의한 수정은 입력부(12)에 의해 수행된다. 예를 들어 사용자는 마우스의 포인터를 4각 테두리의 사방 또는 네 귀퉁이에 위치시키고 드래그함으로써 4각 테두리의 크기 및/또는 위치를 수정할 수 있다.Then, the data creation control unit 11 causes the display output unit 13 to display the quadrangular frame on the display output unit 13 so as to ask the user whether it is good as a range for displaying speech bubbles. If there is a correction instruction from the user, Display range. At this time, the data creation control unit 11 temporarily stores four corner coordinates of the quadrangular frame in the memory. In addition, modification by the user is performed by the input unit 12. For example, the user can modify the size and / or position of the quadrangle by positioning and dragging the mouse pointer at four or four corners of the quadrangle.

이어서 데이터 작성 제어부(11)는 영상에서 인물의 얼굴 부분을 인식한다. 이 때의 인식 기법으로서는 피부의 색상이나 얼굴의 형상 등에 기초하여 인물의 얼굴 부분을 인식할 수 있다.Then, the data creation control unit 11 recognizes the face portion of the person in the image. As a recognition method at this time, the face portion of the person can be recognized based on the color of the skin or the shape of the face.

이어서 데이터 작성 제어부(11)는 인식한 얼굴 부분의 면적을 구해 해당 면적이 소정의 임계값을 초과하는지를 판단한다. 초과할 경우, 데이터 작성 제어부(11)는 입가 부분을 검출하여 해당 입가에서 4각 테두리의 대각선 교점(이하, 4각 테두리 중심이라고 한다)을 향한 기준선을 표시 출력부(13)에 표시시키고, 해당 기준선상에 말풍선의 개시점 후보를 표시 시킨다.Subsequently, the data creation control unit 11 obtains the area of the recognized face portion and determines whether the area exceeds a predetermined threshold value. The data creation control unit 11 detects the mouth portion and displays a reference line toward the diagonal intersection of the quadrangular frame at the mouth (hereinafter, referred to as quadrangular frame center) on the display output unit 13, Display the candidate point of the speech balloon on the reference line.

한편, 초과하지 않는 경우, 데이터 작성 제어부(11)는 얼굴의 중심 부분을 인식하여 해당 중심 부분에서 4각 테두리중심으로 향하여 기준선을 표시 출력부(13)에 표시시키고 해당 기준선상에 말풍선 개시점의 후보를 표시시킨다.On the other hand, if not exceeding, the data creation control unit 11 recognizes the center portion of the face, displays the reference line on the display output unit 13 from the center portion toward the center of the quadrangle frame, Mark candidates.

데이터 작성 제어부(11)는 입력부(12)를 통한 사용자로부터의 지시에 따라 말풍선 개시점을 수정하고 최종적인 말풍선 개시점의 좌표를 메모리에 저장한다. 더불어 사용자에 의한 수정이 없는 경우, 데이터 작성 제어부(11)는 말풍선 개시점의 후보로 한 좌표를 저장한다.The data creation control unit 11 corrects the speech balloon starting point according to an instruction from the user through the input unit 12 and stores the coordinates of the final speech balloon starting point in the memory. In addition, in the case where there is no modification by the user, the data creation control unit 11 stores coordinates that are candidates for the balloon start point.

데이터 작성 제어부(11)는 표준의 말풍선 형상이라고 미리 설정되어 있는 말풍선 형상에 관한 데이터를 저장부(15)에서 읽어내고, 필요가 있으면, 앞서 결정한 4각 테두리 안에 최대한 들어가도록, 말풍선 형상의 사이즈를 변경하여 4각 테두리 내로 사이즈 변경 후의 말풍선 화상을 표시 출력부(13)에 표시시킨다.The data creation control unit 11 reads the data relating to the balloon shape that is preset in the standard balloon shape in the storage unit 15 and sets the size of the balloon shape so as to be within the determined four- And causes the display unit 13 to display the speech balloon image after the resizing in the four corners.

이어서 데이터 작성 제어부(11)는 사용자로부터의 지시에 따라 말풍선 화상을 수정한다. 구체적으로는 말풍선의 형상이나 크기, 방향 등이 수정된다. 이러한 수정은 말풍선의 형상을 나타내는 다이얼로그 박스에서 사용자가 원하는 형상을 선택하도록 해서 수행된다. 또한 사이즈 수정은 표시 중의 말풍선을 드래그 함으로써 수행될 수도 있다.Subsequently, the data creation control unit 11 corrects the speech balloon image in accordance with an instruction from the user. Specifically, the shape, size, and direction of speech balloon are modified. This modification is performed by allowing the user to select a desired shape in a dialog box representing the shape of the balloon. The size modification may also be performed by dragging the speech balloon in the display.

사용자에 의한 수정이 종료, 또는 수정이 없었으면 데이터 작성 제어부(11)는 최종적인 말풍선 화상을 결정한다. 이 때 데이터 작성 제어부(11)는 말풍선 화상의 형상을 나타내는 명칭을 메모리에 일시 저장한다. 또한 말풍선 화상의 사이즈가 변경이 되어 있을 경우에는 데이터 작성 제어부(11)는 사이즈 변경이 있던 말풍선을 둘러싸는 최소의 크기 4각 테두리 네 귀퉁이 좌표를 말풍선을 표시하는 범위로서 메모리에 저장되어 있는 네 귀퉁이의 좌표를 변경한다.If the modification by the user is not ended or corrected, the data creation control unit 11 determines the final speech balloon image. At this time, the data creation control unit 11 temporarily stores the name indicating the shape of the speech balloon image in the memory. When the size of the speech balloon image has been changed, the data creation control unit 11 sets the minimum size of the four-corner frame surrounding the speech balloon in which the size change has been made, and the four corner coordinates as a range for displaying speech balloon, .

이어서 데이터 작성 제어부(11)는 해당 자막 개시 시간의 자막 문자를 자막 리스트 데이터에서 읽어내, 결정한 말풍선 안에 자막 문자를 삽입한다. 이 때 데이터 작성 제어부(11)는 자막 개시 시간부터 자막 계속 시간이 경과할 때까지의 동안, 자막 문자가 선두에서 1 프레임마다에 표시하도록, 표시 출력부(13)에 지시한다. 이 때, 데이터 작성 제어부(11)는 자막 문자 전개 스피드를 결정한다. 자막 문자 전개 스피드는 한 프레임 중에 무슨 문자의 자막을 새로 표시하는지를 단계적으로 나타냄으로써 정의된다. 예를 들어 노멀 스피드이면 한 프레임 중에 새로 6 문자를 표시하는 등과 같게 정의된다. 데이터 작성 제어부(11)는 자막 문자 전개 스피드도 메모리에 일시 저장해 둔다.Subsequently, the data creation control unit 11 reads the caption character of the caption start time from the caption list data, and inserts the caption character into the determined speech balloon. At this time, the data creation control unit 11 instructs the display output unit 13 to display the caption characters at the beginning of each frame, from the caption start time until the caption continuation time elapses. At this time, the data creation control section 11 determines the subtitle character development speed. The subtitle speed is defined by stepping through the display of the subtitle of a character in a frame. For example, a normal speed is defined as displaying six new characters per frame. The data creation control unit 11 also temporarily stores the subtitle character development speed in the memory.

이어서 데이터 작성 제어부(11)는 사용자로부터의 지시에 따라 자막 문자를 수정한다. 여기에서는 자막 문자 속성으로서 문자의 종류, 문자의 색상, 자막의 배경, 자막의 투과율, 자막의 틀 종류, 자막 문자의 강조 등을 수정할 수 있다. 데이터 작성 제어부(11)는 이들 자막 문자 속성도 메모리에 일시 저장해 둔다. 덧붙여 컨텐츠 작성 장치(1)는 영상 컨텐츠 데이터의 재생 중 음성 음량을 판단하는 음량 판단부를 포함할 수 있다. 이 때, 컨텐츠 작성 장치(1)는 음량 판단부가 판단한 음량에 따라 자막 문자의 속성을 변화시킬 수 있다. 예를 들어 음량이 큰 경우, 컨텐츠 작성 장치(1)는 자막 문자를 크게 하거나 변색시킬 수 있다.Subsequently, the data creation control unit 11 corrects the caption character according to an instruction from the user. Here, as the caption character attribute, the type of the character, the color of the character, the background of the caption, the transmittance of the caption, the type of the caption frame, and the emphasis of the caption character can be modified. The data creation control section 11 temporarily stores these subtitle character attributes in the memory. In addition, the content creating apparatus 1 may include a volume determining unit for determining the volume of audio during reproduction of the video content data. At this time, the content creating apparatus 1 can change the attribute of the caption character according to the volume determined by the volume determination unit. For example, when the volume is large, the content creating apparatus 1 may enlarge or discolor the caption characters.

이어서 데이터 작성 제어부(11)는 메모리에 일시 저장한 정보를 읽어내고, 자막 개시 시간에 대응하도록, 자막 계속 시간, 자막 문자 전개 스피드, 자막 문자 속성, 말풍선 범위(4각 테두리의 네 귀퉁이 좌표), 말풍선 개시점의 좌표, 말풍선 형상 및 자막 문자를 저장부(15)에 저장한다.Subsequently, the data creation control unit 11 reads the information temporarily stored in the memory, and reads the information of the subtitle duration time, the subtitle character development speed, the subtitle character attribute, the speech balloon range (four corner coordinates of the quadrangular frame) The coordinates of the speech balloon start point, the speech balloon shape, and the caption character are stored in the storage unit 15.

이어서 데이터 작성 제어부(11)는 컨텐츠가 전체 종료했는지를 판단한다. 종료하였을 경우, 데이터 작성 제어부(11)는 지금까지 작성한 자막 개시 시간 별 말풍선 데이터를 하나에 통합하여 원하는 컨텐츠 데이터에 대응하는 최종적인 말풍선 데이터를 작성하여 저장부(15)에 저장한다.Subsequently, the data creation control unit 11 determines whether or not the content has been completely terminated. The data creation control unit 11 creates the final speech balloon data corresponding to the desired content data by integrating the speech balloon data for each caption start time created so far into one and stores it in the storage unit 15. [

본 발명에 따른 말풍선 자막 부착 영상 컨텐츠 작성 장치에 의하면 발화자와 자막과의 관계를 쉽게 알 수 있고, 전체 화면을 편하게 감상할 수 있다.According to the apparatus for creating a picture content with a speech balloon caption according to the present invention, the relationship between a speaking person and a caption can be easily recognized and a whole screen can be easily viewed.

상기한 본 발명의 바람직한 실시예는 예시의 목적을 위해 개시된 것이고, 본 발명에 대해 통상의 지식을 가진 당업자라면, 본 발명의 사상과 범위 안에서 다양한 수정, 변경 및 부가가 가능할 것이며, 이러한 수정, 변경 및 부가는 본 발명의 특허청구 범위에 속하는 것으로 보아야 할 것이다.While the present invention has been particularly shown and described with reference to exemplary embodiments thereof, it is to be understood that the invention is not limited to the disclosed embodiments, but, on the contrary, And additions should be considered as falling within the scope of the claims of the present invention.

1 : 컨텐츠 작성 장치
2,2a : 컨텐츠 송신 장치
2b : 패키지 미디어화 장치
3 : 방송 장치
3a : 인터넷
3b : 유통 시스템
4,4a : 컨텐츠 재생 장치
4b : 패키지 미디어 재생 장치
5 : 컨텐츠 표시 장치
11 : 데이터 작성 제어부
12 : 입력부
13 : 표시 출력부
14,44 : 타임 카운트부
15 : 저장부
21 : 다중화 제어부
22,42 : 조작부
23 : 오류 정정 부호 부가부
24 : 디지털 변조부
25 : 송신부
41 : 재생 제어부
43 : 합성 전송부
45 : 말풍선 형상 기억부
46 : 수신부
47 : 복조부
48 : 오류 정정부
51 : 표시 출력 디바이스부
52 : 구동 회로부
1: Content creation device
2.2a: Content transmitting device
2b: Package mediaization device
3: Broadcasting device
3a: Internet
3b: Distribution system
4a: Content playback device
4b: Package media playback device
5: Content display device
11: Data creation control section
12:
13:
14,44: Time count section
15:
21: multiplexing control section
22,
23: Error correcting code adding unit
24: Digital Modulation section
25:
41:
43:
45: speech ball shape memory unit
46:
47: Demodulator
48:
51: Display output device section
52:

Claims

A content creating apparatus for creating data necessary for providing image content with caption by a balloon, comprising: a speech balloon time extracting means for extracting a time to display a speech balloon in an image based on image content data to be original data; A bubble area determining means for determining a bubble area suitable for display of a speech balloon in an image of the time extracted by the speech balloon time extracting means; a speech balloon image determining means for determining a speech balloon image to be synthesized in the speech balloon region determined by the speech balloon region determining means; A caption character determination means for determining a caption character to be synthesized with the speech balloon image determined by the speech balloon image determination means, information on a time to display the speech balloon, information on the speech balloon region, information on the speech balloon image And information on the caption character The speech balloon data created by the speech balloon data creation means is reproduced together with the image content data, so that the speech balloon data generated by the speech balloon data creation means is reproduced together with the image content data, The content providing apparatus comprising: