KR100992509B1

KR100992509B1 - Method and apparatus for realistically providing iptv service based on voice modulation with character/avatar of study/education contents in iptv system

Info

Publication number: KR100992509B1
Application number: KR1020090036078A
Authority: KR
Inventors: 정태경
Original assignee: 명지대학교 산학협력단
Priority date: 2009-04-24
Filing date: 2009-04-24
Publication date: 2010-11-08
Also published as: KR100992509B9; KR20100117368A

Abstract

PURPOSE: A method and apparatus for providing a realistic IPTV service based on voice modulation are provided, in which the voice or motion/facial expression of a character or avatar appearing in content on an IPTV system is modulated into that of an acquaintance, such as a parent or sibling, with whom the IPTV viewer feels comfortable and warm, thereby improving learning ability. CONSTITUTION: A voice modulation unit performs modulation by replacing each frequency band unit with the corresponding band of the patterned voice. A pattern storage unit stores a patterned voice for each of a plurality of users. A user selection unit selects the patterned voice of one user from the pattern storage unit. A frequency analysis unit receives the voice of that user at a different point in time and updates the patterned voice for each frequency band unit.

Description

Method and Apparatus for Realistically Providing IPTV Service Based on Voice Modulation with Character/Avatar of Study/Education Contents in IPTV System

The present invention relates to a system for providing an Internet Protocol Television (IPTV) service. More particularly, it relates to a voice-modulation-based IPTV service method and apparatus that can modulate the voice or motion/facial expression of a character or avatar (a third-party medium) in content featuring characters, such as learning or educational content provided by the IPTV service, into the voice or motion/facial expression of an acquaintance, such as a parent or sibling, with whom the IPTV viewer (for example, a child) feels comfortable and warm. Using the same modulation, a remote user can also guide a child's IPTV viewing warmly through the avatar, for example with health-related instructions such as keeping a certain distance from the screen, and can improve learning ability by delivering other instructions or controlling content playback based on, for example, time measurement.

Recently, users with an IPTV terminal have become able to use video-related and interactive services provided by an IPTV server over a network such as the Internet. A legitimate client authorized for the IPTV service, that is, a viewer or user, can use the various multimedia content provided by the IPTV server at home.

Through the IPTV terminal, a user can watch or listen to currently airing TV or radio broadcasts in real time, and can also request content managed on the server after its broadcast time has passed and enjoy it at any desired time by download or streaming. In addition, the user can request and enjoy movies, music, games, cable TV broadcasts, DMB broadcasts, overseas broadcasts, and the like managed by the IPTV server.

Such an IPTV service can be provided through a set-top box that relays the service between the IPTV terminal and the IPTV server. However, a conventional set-top box, connected to the IPTV terminal, merely relays user authentication and communication with the IPTV server, performing only a simple exchange of bidirectional data.

Consequently, when a child viewing IPTV watches content featuring characters, such as learning or educational content (for example, a fairy tale), the IPTV terminal merely receives and plays back, unchanged, the video and audio of content provided one-way by the IPTV server. A parent watching with the child can explain the story directly, but when the parent cannot watch together, the child can only watch the content's video and audio as they are. A child watching IPTV alone does not feel the parent's warmth, and so may become bored with the content or lose interest in its one-way audio, which can reduce learning ability. It is also not easy for a remote parent to explain the story while monitoring the child's IPTV viewing.

Accordingly, the present invention is intended to solve the above problems. An object of the present invention is to provide a voice-modulation-based realistic IPTV service method and apparatus that can improve learning ability by modulating the voice or motion/facial expression of a character or avatar in content featuring characters, such as learning or educational content provided by an IPTV service, into the voice or motion/facial expression of an acquaintance, such as a parent or sibling, with whom the IPTV viewer (for example, a child) feels comfortable and warm, and providing the result realistically.

Another object of the present invention is to provide a voice-modulation-based realistic IPTV service method and apparatus that, using this modulation, can stimulate interest in learning by guiding a child's IPTV viewing warmly from a remote location through an avatar, for example with health-related instructions such as keeping a certain distance from the screen, and that can control content playback and the like from a remote location in order to improve learning ability.

To summarize the features of the present invention first, a method for providing a voice-modulation-based realistic IPTV service according to one aspect of the present invention, for achieving the above objects, is characterized by receiving a user's voice, analyzing its frequency, patterning the user's pronunciation habits in units of frequency bands and storing the patterned voice, and modulating the original voice of a character or avatar (a third-party medium) in content played on a terminal into the patterned voice and providing it to the terminal.

The method may further store information obtained by analyzing the user's motion or facial expression from a camera image of the user and, according to the analyzed information, modulate the original motion or facial expression of the character or avatar in the content played on the terminal into the user's motion or facial expression and provide it to the terminal.

The patterned voice includes a voice pattern for each band obtained by dividing the audible frequency band (for example, 20 Hz to 20 kHz) into predetermined units.
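The band-wise pattern described above can be sketched as follows. This is a minimal illustration only: the source does not specify how the audible band is divided or what a "voice pattern" contains, so the equal-width bands, the average-magnitude summary, and the names `split_bands` and `band_pattern` are all assumptions.

```python
def split_bands(low_hz=20.0, high_hz=20_000.0, n_bands=10):
    """Divide [low_hz, high_hz] into n_bands equal-width (start, end) bands.

    Equal-width division is an assumption; the source only says
    "predetermined units".
    """
    width = (high_hz - low_hz) / n_bands
    edges = [low_hz + i * width for i in range(n_bands + 1)]
    return list(zip(edges[:-1], edges[1:]))


def band_pattern(spectrum, bands):
    """Summarize a spectrum as one average magnitude per band.

    spectrum: iterable of (frequency_hz, magnitude) pairs, e.g. from an FFT.
    Bands with no samples get 0.0.
    """
    pattern = []
    for start, end in bands:
        mags = [m for f, m in spectrum if start <= f < end]
        pattern.append(sum(mags) / len(mags) if mags else 0.0)
    return pattern
```

For instance, with two bands, `band_pattern([(100, 1.0), (200, 3.0), (15000, 4.0)], split_bands(n_bands=2))` averages the two low-frequency samples into the first band and places the third sample in the second band.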

A stored patterned voice may be selected, and the patterned voice may be updated for each frequency band unit by receiving the corresponding user's voice at a different point in time.
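A per-band update of a stored pattern might look like the sketch below. The source does not say how the old and new patterns are combined; the weighted blend, the `weight` parameter, and the name `update_pattern` are assumptions chosen for illustration.

```python
def update_pattern(stored, new, weight=0.3):
    """Blend a newly measured per-band pattern into the stored one.

    stored, new: lists of per-band values of equal length.
    weight: share given to the new measurement (hypothetical parameter).
    """
    if len(stored) != len(new):
        raise ValueError("patterns must cover the same frequency bands")
    return [(1 - weight) * s + weight * n for s, n in zip(stored, new)]
```

A small weight keeps the stored pattern stable against a single noisy recording, while a weight of 1.0 simply overwrites it.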

When the content played on the terminal includes a plurality of characters or avatars, one of them may be selected through the terminal, and the modulation may then be performed on the corresponding sound source.

Patterned voices for a plurality of users may be stored through the storing process; one of the users may be selected through the terminal, and the modulation may then be performed on the sound source using that user's patterned voice.

The modulation may use a scheme in which the original standard sound source of the character or avatar in the content is separated into the frequency band units and shifted to the frequency distribution band of the patterned voice, and each frequency band unit of the shifted standard sound source is then replaced with the corresponding frequency band unit of the patterned voice.
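One loose reading of this separate, shift, and replace scheme is sketched below on per-band magnitude lists. It is not the patent's actual signal processing: aligning the strongest bands to decide the shift, replacing only bands where the shifted source has energy, and the name `modulate` are all assumptions.

```python
def modulate(source_bands, voice_bands):
    """Shift the source's bands onto the voice's range, then substitute.

    source_bands: per-band magnitudes of the character's original voice.
    voice_bands:  per-band magnitudes of the stored patterned voice.
    Returns per-band magnitudes of the modulated voice.
    """
    if len(source_bands) != len(voice_bands):
        raise ValueError("both inputs must use the same band division")
    n = len(source_bands)

    # Step 1 (assumed): shift so the source's strongest band lines up
    # with the patterned voice's strongest band.
    shift = voice_bands.index(max(voice_bands)) - source_bands.index(max(source_bands))
    shifted = [0.0] * n
    for i, mag in enumerate(source_bands):
        j = i + shift
        if 0 <= j < n:
            shifted[j] = mag

    # Step 2: wherever the shifted source has energy, replace that band
    # with the corresponding band of the patterned voice.
    return [voice_bands[j] if shifted[j] > 0 else 0.0 for j in range(n)]
```

A real system would operate on short time frames of audio and would need to preserve timing and prosody; this sketch only shows the band bookkeeping.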

While the terminal plays the content, a camera may be used to generate a monitoring image of the person watching the content in front of the terminal and transmit it to a pre-designated terminal, and an instruction entered through the pre-designated terminal may be output in the patterned voice through an avatar in the content.

The time for which the terminal plays the content may be measured, and after a predetermined time has elapsed, an instruction may be output in the patterned voice through an avatar in the content, or playback of the content may be stopped.

According to another aspect of the present invention, an apparatus for providing a realistic service for digital multimedia content may include: a pronunciation patterning unit that receives a user's voice, analyzes its frequency, patterns the user's pronunciation habits in units of predetermined frequency bands, and stores the patterned voice; and a voice modulation unit that modulates the original voice of a character or avatar in content played on a terminal into the patterned voice and provides it to the terminal.

The realistic service providing apparatus may further include a motion/expression analysis unit that stores information obtained by analyzing the user's motion or facial expression from a camera image of the user and, according to the analyzed information, modulates the original motion or facial expression of the character or avatar in the content played on the terminal into the user's motion or facial expression and provides it to the terminal.

The realistic service providing apparatus may be used to provide an IPTV service.

The pronunciation patterning unit includes: a pattern storage unit that stores a patterned voice for each of a plurality of users; a user selection unit that selects the patterned voice of one user from the pattern storage unit; and a frequency analysis unit that, for the selected patterned voice, receives the corresponding user's voice at a different point in time and updates the patterned voice for each frequency band unit.

The voice modulation unit may include a character selection unit that, when the content played on the terminal includes a plurality of characters or avatars, selects one of them through the terminal, and may perform the modulation on the sound source of the selected character or avatar.

The voice modulation unit may include a pronunciation pattern selection unit that, when patterned voices for a plurality of users have been stored through the pronunciation patterning unit, selects the patterned voice of one of those users through the terminal, and may perform the modulation on the sound source using the selected patterned voice.

The voice modulation unit may include: a character sound source extraction unit that extracts the original voice of the character or avatar; and a character sound source modulation unit that modulates the extracted voice into the patterned voice.

The voice modulation unit may use a scheme in which the original standard sound source of the character or avatar in the content is separated into the frequency band units and shifted to the frequency distribution band of the patterned voice, and each frequency band unit of the shifted standard sound source is then replaced with the corresponding frequency band unit of the patterned voice.

According to the method and apparatus for a realistic IPTV service of the present invention, learning ability can be improved by modulating the voice or motion/facial expression of a character or avatar in content featuring characters, such as learning or educational content, into the voice or motion/facial expression of an acquaintance, such as a parent or sibling, with whom an IPTV viewer such as a child feels comfortable and warm, and providing the result realistically.

Further, according to the method and apparatus for a realistic IPTV service of the present invention, modulating the voice or motion/facial expression of a character or avatar in the content makes it possible to guide a child's IPTV viewing warmly through the avatar even from a remote location, stimulating interest in learning, and to improve learning ability through remote control of content playback and the like.

Hereinafter, preferred embodiments of the present invention are described in detail with reference to the accompanying drawings and the content shown in them, but the present invention is not limited or restricted by the embodiments. The same reference numerals in the drawings denote the same elements.

FIG. 1 is a diagram illustrating the usage environment of an IPTV system 100 according to an embodiment of the present invention.

Referring to FIG. 1, user information is read, through a set-top box (STB) relay device of the IPTV terminal 110, from a user information input means such as a rechargeable electronic payment card, for example an IC card, smart card, RFID card, or USIM card. Once the user is authenticated by the content server 120, which is connected to a network such as the Internet or a mobile carrier network, the user can watch IPTV by receiving multimedia data such as videos, e-books, music, and broadcast programs, and in particular learning/educational content. In particular, the present invention allows the original voice or motion/facial expression of a character, avatar, or other third-party medium in the content played on the IPTV terminal 110 to be modulated into a patterned voice, motion, or facial expression that reflects the pronunciation habits of a user, such as a parent, trained in advance, and to be provided realistically to the IPTV terminal 110. A user such as a parent can train and store the parent's voice or motion/facial expression using a realistic service providing device (see 330 in FIG. 3 or 440 in FIG. 4) operating in the STB or the content server 120, and can use it at any time.

The IPTV content that can be viewed on the IPTV terminal 110 may be learning/educational content, such as a children's fairy tale featuring characters or avatars, as shown in FIG. 2. A parent could read or explain the program content, for example the text of a storybook, to the child in person. In the present invention, however, the voice of a user (a parent, sibling, relative, or the like) is received through a microphone connected to the STB, its frequency is analyzed, the user's pronunciation habits are patterned in units of frequency bands, and the patterned voice is stored in the STB or the content server 120. The original voice of a character or avatar in the content played on the IPTV terminal 110 can then be modulated into this patterned voice and provided to the IPTV terminal 110. In addition, information obtained by analyzing the user's motion or facial expression from a camera image of the user may be stored, and according to that information the original motion or facial expression of the character or avatar in the content played on the IPTV terminal 110 may be modulated into the user's motion or facial expression and provided to the IPTV terminal 110.

In addition, in the present invention, while the IPTV terminal 110 plays content, a camera (for example, a web camera) is used to generate a monitoring image of the person (for example, a child) watching the content in front of the IPTV terminal 110, and the image is transmitted to a pre-designated terminal (for example, the parent's IPTV terminal or mobile communication terminal). The parent can thus monitor the child's viewing and enter an instruction (something the parent wants to say, such as a reminder to keep a distance from the TV) through the parent's own terminal. The entered instruction may be output through an avatar in the content being played on the IPTV terminal 110, in the parent's patterned voice (optionally also reflecting the parent's stored motion/facial expression).

On the other hand, in the present invention, the time for which the IPTV terminal 110 plays content may be measured, and after a predetermined time has elapsed, an instruction (something the parent wants to say, such as a suggestion to take a short break before continuing to watch) may be output through an avatar in the content in the parent's patterned voice (optionally also reflecting the parent's stored motion/facial expression), or playback of the content may be stopped. This can improve learning ability, and such a function is also needed to protect the child's eyesight.

FIG. 3 is a diagram illustrating an IPTV relay device 300 according to an embodiment of the present invention. The IPTV relay device 300 corresponds to the STB and relays the authentication described above and the communication of digital multimedia data between the IPTV terminal 110 and the content server 120.

Referring to FIG. 3, the IPTV relay device 300 includes a control unit 310, a reader port 311, a TV port 312, an Ethernet port 313, a camera 314, a microphone 315, an authentication unit 320, a realistic service providing unit 330, and an image transmission unit 340. The control unit 310 is responsible for overall control of these components of the IPTV relay device 300.

The reader port 311 is connected to a card reader that reads user information from a rechargeable electronic payment card, and the authentication unit 320 relays user authentication by transmitting the read user information to the content server 120. The TV port 312 is connected to the IPTV terminal 110 and transmits content data provided by the content server 120 to the IPTV terminal 110. The Ethernet port 313 is connected to a communication line such as the Internet to provide the connection to the content server 120, and exchanges the data needed for communication with the content server 120, such as content data.

The realistic service providing unit 330 can receive the user's voice through the microphone 315, analyze the frequency of the input voice, pattern the user's pronunciation habits in units of frequency bands, and store the patterned voice. The realistic service providing unit 330 can then modulate the original voice of a character or avatar in the content played on the IPTV terminal 110 into this patterned voice and provide it to the IPTV terminal 110. The realistic service providing unit 330 can also receive an image of the user through the camera 314, store information obtained by analyzing the user's motion or facial expression from that image, and, according to the analyzed information, modulate the original motion or facial expression of the character or avatar in the content played on the IPTV terminal 110 into the user's motion or facial expression and provide it to the IPTV terminal 110.

While the IPTV terminal 110 plays content, the image transmission unit 340 can use the camera 314 to generate a monitoring image of the person (for example, a child) watching the content in front of the IPTV terminal 110 and transmit it to a pre-designated terminal (for example, the parent's IPTV terminal or mobile communication terminal). The parent can thus monitor the child's viewing and enter an instruction (something the parent wants to say, such as a reminder to keep a distance from the TV) through the parent's own terminal. An instruction entered remotely in this way can be delivered to the realistic service providing unit 330 over the Internet or a mobile communication network, and the realistic service providing unit 330 can have the received instruction output through an avatar in the content being played on the IPTV terminal 110, in the parent's patterned voice (optionally also reflecting the parent's stored motion/facial expression).

The control unit 310 can also record the time at which the IPTV terminal 110 starts playing content, measure the playback time from that start time, and generate an alarm signal after a predetermined time has elapsed. In response to the alarm signal, the realistic service providing unit 330 can output a pre-stored instruction (something the parent wants to say, such as a suggestion to take a short break before continuing to watch) through an avatar in the content, in the parent's patterned voice (optionally also reflecting the parent's stored motion/facial expression). Alternatively, the control unit 310 may stop playback of the content in response to the alarm signal.
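The timing behavior of the control unit can be sketched as below. The class name `PlaybackTimer`, its method names, and the injectable `clock` parameter are hypothetical; the source only states that the start time is recorded and an alarm signal is generated after a predetermined time.

```python
import time


class PlaybackTimer:
    """Records when playback starts and reports when the limit is reached."""

    def __init__(self, limit_seconds, clock=time.monotonic):
        self.limit_seconds = limit_seconds
        self.clock = clock          # injectable so the timer can be tested
        self.start_time = None      # None until playback begins

    def start_playback(self):
        """Remember the playback start time."""
        self.start_time = self.clock()

    def alarm_due(self):
        """True once the elapsed playback time reaches the limit."""
        if self.start_time is None:
            return False
        return self.clock() - self.start_time >= self.limit_seconds
```

A caller would poll `alarm_due()` during playback and, when it returns true, either trigger the avatar's pre-stored instruction or stop playback, matching the two alternatives described above.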

FIG. 4 is a diagram illustrating a content server 400 according to an embodiment of the present invention. Referring to FIG. 4, the content server 400 includes a control unit 410, a member information DB 411, a content DB 412, an authentication unit 420, a content providing unit 430, and a realistic service providing unit 440. The control unit 410 is responsible for overall control of these components of the content server 400.

First, in FIG. 3, when a card number read from the user's card by the card reader connected to the reader port 311 is received through the reader port 311 and a password is entered, the authentication unit 320 can request user authentication by transmitting user information including the card number, password, and the like to the content server 400 through the Ethernet port 313.

The authentication unit 420 of the content server 400 can then perform an authentication process that compares the received user information with the corresponding subscriber information stored and managed in the member information DB 411 and checks whether they match. Once the user is authenticated as a subscriber through this process, the IPTV relay device 300 of FIG. 3, on receiving the user's content request, requests that content from the content server 400. The content providing unit 430 then searches the many items of content stored and managed in the content DB 412 for the requested content and can transmit the matching content to the IPTV relay device 300. For the content provided, the content providing unit 430 may also process payment against the corresponding card based on the user information.
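The matching step of the authentication process might be sketched as follows. The source says only that the received card number and password are compared with stored subscriber information; the dictionary-based `member_db`, the salted password hashing, and the function names are assumptions added for illustration.

```python
import hashlib
import hmac


def hash_password(password, salt):
    """Derive a password hash (storing hashes is an assumption, not from the source)."""
    return hashlib.pbkdf2_hmac("sha256", password.encode("utf-8"), salt, 100_000)


def authenticate(member_db, card_number, password):
    """Return True if the card number is registered and the password matches.

    member_db: hypothetical mapping of card_number -> (salt, password_hash).
    """
    record = member_db.get(card_number)
    if record is None:
        return False
    salt, stored_hash = record
    # Constant-time comparison avoids leaking how many bytes matched.
    return hmac.compare_digest(hash_password(password, salt), stored_hash)
```

Only after `authenticate` succeeds would the relay device forward the user's content request to the content providing unit.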

The control unit 310 of the IPTV relay device 300 can then output the content received from the content server through the Ethernet port 313 to the IPTV terminal 110 through the TV port 312 for viewing.

Meanwhile, the realistic service providing unit 330 may be provided in the IPTV relay device 300 of FIG. 3 to perform the functions described above, but the realistic service providing unit 440 of FIG. 4 may perform similar functions in its place.

For example, in FIG. 4, the realistic service providing unit 440 can receive the user's voice through the microphone 315 of FIG. 3, analyze the frequency of the input voice, pattern the user's pronunciation habits in units of frequency bands, and store the patterned voice. The realistic service providing unit 440 can then modulate the original voice of a character or avatar in the content that is provided to and played on the IPTV terminal 110 through the content providing unit 430 into this patterned voice and provide it to the IPTV terminal 110. The realistic service providing unit 440 can also receive an image of the user through the camera 314, store information obtained by analyzing the user's motion or facial expression from that image, and, according to the analyzed information, modulate the original motion or facial expression of the character or avatar in the content played on the IPTV terminal 110 into the user's motion or facial expression and provide it to the IPTV terminal 110.

While content is played on the IPTV terminal 110, the image transmitter 340 of FIG. 3 may use the camera 314 to generate a monitoring image of the person (for example, a child) watching the content in front of the IPTV terminal 110 and transmit it to a predetermined terminal (for example, a parent's IPTV terminal or mobile communication terminal). The parent can then monitor the child's viewing situation and enter an instruction (whatever the parent wants to say, such as keeping a distance from the TV) through his or her own terminal (an IPTV terminal, a mobile communication terminal, or the like). In this case, the instruction entered by the parent at the remote location may be delivered to the realistic service provider 440 through the Internet or a mobile communication network, and the realistic service provider 440 may cause the received instruction to be output in the parent's patterned voice through the avatar on the content being played on the IPTV terminal 110. At this time, the realistic service provider 440 may also reflect the parent's previously stored motions and facial expressions in the motions and facial expressions of the avatar.

In addition, the controller 410 may store the time at which the IPTV terminal 110 starts playing content, measure the playback time from that start time, and generate an alarm signal after a predetermined time has elapsed. In response to the alarm signal, the realistic service provider 440 may output a pre-stored instruction (whatever the parent wants to say, such as taking a short break before watching again) in the parent's patterned voice through the avatar on the content. Here too, the realistic service provider 440 may reflect the parent's previously stored motions and facial expressions in the motions and facial expressions of the avatar. Alternatively, the controller 410 may stop playback of the content in response to the alarm signal.
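The timing behavior described above can be illustrated with a minimal sketch. It is not taken from the patent: the names `PlaybackTimer` and `on_tick`, the use of seconds as the time unit, and the returned action labels are all assumptions made for illustration only.

```python
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass
class PlaybackTimer:
    """Hypothetical stand-in for the controller's playback timer."""
    start_s: float   # stored playback start time, in seconds
    limit_s: float   # predetermined viewing time before the alarm

    def alarm_due(self, now_s: float) -> bool:
        # The alarm fires once the measured playback time reaches the limit.
        return now_s - self.start_s >= self.limit_s


def on_tick(timer: PlaybackTimer, now_s: float, stored_instruction: str,
            stop_playback: bool = False) -> Tuple[str, Optional[str]]:
    """Decide what to do at time now_s (action labels are illustrative)."""
    if not timer.alarm_due(now_s):
        return ("continue", None)
    if stop_playback:
        # Alternative described in the text: halt playback on the alarm.
        return ("stop", None)
    # Otherwise the avatar speaks the pre-stored instruction.
    return ("speak", stored_instruction)
```

For instance, with a 30-minute limit, `on_tick(PlaybackTimer(0, 1800), 1800, "Take a short break")` selects the "speak" action, which in the described system would be rendered in the parent's patterned voice.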

FIG. 5 shows the configuration of an apparatus 500 for describing in more detail the realistic service provider 330 of FIG. 3 or the realistic service provider 440 of FIG. 4.

FIG. 5 is a diagram for describing a realistic service providing apparatus 500 according to an embodiment of the present invention. The realistic service providing apparatus 500 is an apparatus capable of implementing the functions of the realistic service provider 330 of FIG. 3 or the realistic service provider 440 of FIG. 4, and its operation is described in more detail below.

Referring to FIG. 5, the realistic service providing apparatus 500 according to an embodiment of the present invention includes a pronunciation patterning unit 510, a voice modulator 520, and a motion/expression analyzer 530. The pronunciation patterning unit 510 includes a user selector 511, a pattern storage 512, and a frequency analyzer 513, and the voice modulator 520 includes a character selector 521, a pronunciation pattern selector 522, a character sound source extractor 523, and a character sound source modulator 524.

The pronunciation patterning unit 510 may receive a user's voice, analyze its frequency, pattern the user's language pronunciation habits in units of predetermined frequency bands, and store the patterned voice in the pattern storage 512. The voice modulator 520 may modulate the original voice of a character or avatar of the content played on the terminal into the patterned voice and provide it to the terminal.

The operation of the realistic service providing apparatus 500 according to an embodiment of the present invention is described in more detail with reference to the flowchart of FIG. 6.

First, when a person watching the IPTV terminal 110 wants to train and store his or her language pronunciation habits, the user selector 511 may, through a predetermined menu provided on the IPTV terminal 110, select or newly register the user whose pronunciation is to be patterned (S610). Once the user is selected, that user speaks into the microphone 315, reading a prepared document, repeating speech from the content, or pronouncing other material from which his or her pronunciation habits can be learned (S620), and the voice input through the microphone 315 is analyzed by the frequency analyzer 513. The frequency analyzer 513 may receive the user's voice, analyze its frequency, pattern the user's language pronunciation habits in units of predetermined frequency bands, and store the patterned voice in the pattern storage 512 (S630).

The frequency analyzer 513 may divide the audible frequency band (for example, 20 Hz to 20 kHz) into predetermined units, for example, 10 Hz units or 100 Hz units, distinguish the voice pattern for each resulting band, and store the corresponding patterned voice. For example, as shown in FIG. 7, the frequency analyzer 513 may separate the user's voice, input as words or sentences, into consonants and vowels and combine them to pattern the voice by phoneme (such as "나" (na), "는" (neun), and "학" (hak), including letters of the English alphabet) or by sentence (such as "나는 간다" ("I go"), including English sentences). Each patterned phoneme or sentence is stored in correspondence with its frequency band, thereby generating vectors that correspond to the patterned voice of each frequency-band unit. Phonemes or sentences that the user has not yet trained (pronounced) may also be generated; these may be stored in correspondence with the average frequency of the patterned voices already obtained.
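The band assignment and the average-frequency fallback described above can be sketched as follows. This is a simplified illustration, not the patent's implementation: it assumes each phoneme or sentence is summarized by a single representative frequency, and the names `band_index`, `learn`, and `lookup` are hypothetical.

```python
AUDIBLE_LOW_HZ = 20.0
AUDIBLE_HIGH_HZ = 20_000.0


def band_index(freq_hz, band_width_hz=100.0):
    """Index of the fixed-width band that contains freq_hz."""
    if not AUDIBLE_LOW_HZ <= freq_hz <= AUDIBLE_HIGH_HZ:
        raise ValueError("frequency outside the audible band")
    return int(freq_hz // band_width_hz)


def learn(samples, band_width_hz=100.0):
    """Map each spoken unit (phoneme or sentence) to (frequency, band).

    samples is an iterable of (unit, representative_frequency_hz) pairs;
    a single representative frequency per unit is an assumption.
    """
    return {unit: (freq_hz, band_index(freq_hz, band_width_hz))
            for unit, freq_hz in samples}


def lookup(learned, unit, band_width_hz=100.0):
    """Return the stored pattern, or fall back to the average frequency
    of the already-learned patterns for a unit never pronounced."""
    if unit in learned:
        return learned[unit]
    mean_hz = sum(freq for freq, _ in learned.values()) / len(learned)
    return (mean_hz, band_index(mean_hz, band_width_hz))
```

With two learned phonemes at 210 Hz and 390 Hz and 100 Hz bands, an unlearned phoneme is placed at their 300 Hz average, that is, in band 3.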

The pattern storage 512 may store the patterned voices separately for each person, in correspondence with the respective user, and the user selector 511 may select the patterned voice of any one of the plurality of stored users. A user who wants to retrain his or her language pronunciation habits may select his or her previously stored patterned voice through the user selector 511 and then retrain the pronunciation habits as described above. In that case, when the frequency analyzer 513 receives the user's voice at a different point in time, it may update the previously stored patterned voice in units of frequency bands and store it in the pattern storage 512. In this way, new patterned voices for phrases, intonations, phonemes (including the English alphabet), and the like that the user had not previously pronounced may be added.
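The per-user storage and retraining behavior above can be pictured with a small sketch. The class name `PatternStore`, its methods, and the representation of a patterned voice as a mapping from spoken unit to band index are assumptions for illustration, not details given in the patent.

```python
class PatternStore:
    """Hypothetical per-user store of patterned voices."""

    def __init__(self):
        self._by_user = {}   # user name -> {unit: band index}

    def users(self):
        # Users that can be offered for selection in a menu.
        return sorted(self._by_user)

    def select(self, user):
        # Return a copy of the stored patterned voice for one user.
        return dict(self._by_user.get(user, {}))

    def update(self, user, new_units):
        """Merge a later training session into the stored voice.

        Existing units are overwritten with the newer band, units never
        pronounced before are added; the list of added units is returned.
        """
        voice = self._by_user.setdefault(user, {})
        added = [unit for unit in new_units if unit not in voice]
        voice.update(new_units)
        return added
```

A second training session for the same user then both refreshes units already stored and reports which units are new.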

Meanwhile, the voice modulator 520 may modulate the original voice of a character or avatar of the content played on the terminal into the patterned voice that reflects the user's language pronunciation habits, as described above, and provide it to the terminal.

For example, when the content played on the IPTV terminal 110 includes a plurality of characters or avatars, the character selector 521 may support selection of any one of them through the IPTV terminal 110 (S640). In a fairy tale such as "Snow White" shown in FIG. 2, several characters appear, and, given the nature of learning or educational content, an avatar may additionally be inserted to explain the current situation or to give necessary instructions by voice or text. The voices of such characters or avatars are already provided in the content as a standard sound source. In the present invention, however, when one of the plurality of characters or avatars is selected through a menu or the like that the character selector 521 provides on the IPTV terminal 110, the character sound source modulator 524 may modulate the sound source of the selected character or avatar into the patterned voice and output it.

The patterned voice to be output can be chosen as a voice, such as that of a parent, sibling, or relative, that gives a viewer such as a child a sense of familiarity or warmth. For example, when patterned voices for a plurality of users have been stored in the pattern storage 512 through the pronunciation patterning unit 510, the pronunciation pattern selector 522 may offer, through the IPTV terminal 110, a choice among the patterned voices of the plurality of users and receive the selection (S650). Accordingly, the character sound source modulator 524 may modulate the sound source of the character or avatar selected by the character selector 521 into the patterned voice selected by the pronunciation pattern selector 522 and output it.

The character sound source extractor 523 may extract the original voice of the character or avatar selected by the character selector 521 from the sound source of the content played on the IPTV terminal 110 (S660).

Accordingly, the character sound source modulator 524 may modulate the voice extracted by the character sound source extractor 523 into the patterned voice selected by the pronunciation pattern selector 522 (S670). As shown in FIG. 7, the character sound source modulator 524 may separate the original standard sound source (the sound source of each phoneme or sentence) that is already provided to match the behavior of the character or avatar into frequency-band units, shift it to the frequency distribution band of the patterned voice, and, using a multiplexer (MUX), replace each frequency-band unit of the shifted standard sound source with the patterned voice (the patterned voice vector) to generate a modulated voice, which is provided to the IPTV terminal 110. That is, each person's voice is output in a distinctive tone according to the frequency distribution of the patterned voice in the pattern storage 512, where the user's distinctive tone is stored for each frequency band. The overall frequency distribution band of the standard sound source is therefore shifted to the frequency distribution band of the patterned voice, and each frequency-band unit is replaced with the corresponding frequency-band unit of the patterned voice, yielding a voice output similar to the patterned voice. For example, when the frequency distribution of the patterned voice is concentrated around a certain center frequency and the rest is spread on either side of it, the center of the standard sound source's frequency distribution is aligned with the center frequency of the patterned voice, and the remaining frequency bands of the standard sound source are shifted by the same amount, so that the frequency distribution of the standard sound source resembles that of the patterned voice. After the center frequencies are aligned in this way, each frequency-band unit of the standard sound source (the sound source of each phoneme or sentence) is replaced with the corresponding frequency-band unit of the patterned voice to obtain a voice output similar to the patterned voice. For example, if the phoneme '가' (ga) in the patterned voice belongs to the 0 to 100 Hz band, the corresponding phoneme '가' of the standard sound source may be replaced with the sound of the 0 to 100 Hz band.
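The shift-then-replace procedure described above can be sketched in a few lines. This is a toy model under stated assumptions, not the patented signal processing: each sound source is reduced to a mapping from band index to a single energy value, the center frequency is taken as the energy-weighted mean band, and the function names `center_band` and `modulate` are hypothetical.

```python
def center_band(energy):
    """Energy-weighted center of a {band index: energy} distribution."""
    total = sum(energy.values())
    if total <= 0:
        raise ValueError("distribution has no energy")
    return round(sum(band * e for band, e in energy.items()) / total)


def modulate(standard, pattern):
    """Shift the standard source onto the pattern's center, then replace.

    Both arguments map band index -> energy. Every band of the shifted
    standard source is replaced with the patterned voice's value for that
    band when one exists (the selection a MUX would perform); otherwise
    the shifted standard value is kept.
    """
    shift = center_band(pattern) - center_band(standard)
    shifted = {band + shift: e for band, e in standard.items()}
    return {band: pattern.get(band, e) for band, e in shifted.items()}
```

A standard source centered on band 2 and a patterned voice centered on band 5 give a shift of three bands, after which the overlapping bands take the patterned voice's values.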

With this voice modulation scheme, the original voice of a character or avatar of the content played on the terminal is output as the patterned voice. A viewer such as a child therefore watches the content in the voice of a parent, sibling, or relative and can feel familiarity and warmth, which stimulates interest in learning and improves the learning effect.

Meanwhile, the realistic service provider 330 may be provided in the IPTV relay device 300 of FIG. 3 to perform the functions described above. When the realistic service providing apparatus 500 is instead applied to the realistic service provider 440 of FIG. 4, the modulated voice may, as shown in FIG. 7, be encoded according to a predetermined encoding scheme, and the encoded data may be placed, according to the transmission scheme of the network, in the payload that follows a tag indicating a patterned voice and a transmission header, together with the vector values of the patterned voice vector, and transmitted over the network. The modulated voice received by the set-top box (STB) may then be decoded and played on the IPTV terminal 110.
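The packet layout described above (transmission header, a tag marking patterned voice, then a payload carrying the vector values and the encoded audio) can be illustrated as follows. The patent does not specify field sizes or values, so everything concrete here is hypothetical: the tag value `0x50`, the 16-bit vector count, the 32-bit float encoding, and the function names.

```python
import struct

PATTERNED_VOICE_TAG = 0x50  # hypothetical tag value, not from the patent


def build_packet(header: bytes, vector, encoded_audio: bytes) -> bytes:
    """Assumed layout: header | tag (1 byte) | count (2 bytes) |
    vector values (32-bit floats, network byte order) | encoded audio."""
    tag_and_count = struct.pack("!BH", PATTERNED_VOICE_TAG, len(vector))
    values = struct.pack(f"!{len(vector)}f", *vector)
    return header + tag_and_count + values + encoded_audio


def parse_packet(packet: bytes, header_len: int):
    """Inverse of build_packet, as a receiver such as an STB might do."""
    tag, count = struct.unpack_from("!BH", packet, header_len)
    offset = header_len + 3
    vector = list(struct.unpack_from(f"!{count}f", packet, offset))
    audio = packet[offset + 4 * count:]
    return tag, vector, audio
```

A receiver would check the tag before treating the payload as patterned voice; a real system would follow whatever framing the chosen transport and codec define.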

Likewise, the same modulation scheme applies, through relaying by the IPTV relay device 300 of FIG. 3 or the content server 400 of FIG. 4, when an instruction (whatever the parent wants to say, such as keeping a distance from the TV) is received from a parent who is monitoring the child's viewing situation through his or her own terminal (an IPTV terminal, a mobile communication terminal, or the like), or when the playback time of the content is measured and an alarm signal is generated after a predetermined time has elapsed. In these cases, the character sound source modulator 524 may cause the instruction received from the parent's terminal to be output in the parent's patterned voice through the avatar on the content being played on the IPTV terminal 110, and may also output a pre-stored instruction (whatever the parent wants to say, such as taking a short break before watching again) in the parent's patterned voice through the avatar on the content.

Meanwhile, the motion/expression analyzer 530 may receive an image of the user through the camera 314, store information obtained by analyzing the user's motions or facial expressions from the camera image in a predetermined storage means, and, according to the analyzed information, modulate the original motion or facial expression of the character or avatar of the content played on the IPTV terminal 110 into the motion or facial expression of that user and provide it to the IPTV terminal 110. The motion/expression analyzer 530 may analyze the camera image of a user who wants to train and store motions or facial expressions, pattern several repeated motions, and store the corresponding information; it may also analyze the user's face, covering not only its appearance but also its expressions, classify the result into several patterns, and store the corresponding information. Using the stored information on the user's motions and facial expressions, the motion/expression analyzer 530 may cause them to be output in place of the original motions or facial expressions of the character or avatar of the content. Accordingly, a viewer such as a child watches the content while seeing not only the voice but also the motions and facial expressions of a parent, sibling, or relative on the character or avatar, and can feel even greater familiarity and warmth, which further stimulates interest in learning.

Although the present invention has been described above with reference to limited embodiments and drawings, it is not limited to those embodiments, and a person of ordinary skill in the art to which the invention pertains can make various modifications and variations from this description. Therefore, the scope of the present invention should not be limited to the described embodiments, but should be determined by the claims below and their equivalents.

FIG. 1 is a diagram for explaining the usage environment of an IPTV system according to an embodiment of the present invention.

FIG. 2 is a diagram for explaining an example of IPTV content according to an embodiment of the present invention.

FIG. 3 is a diagram for explaining an IPTV relay device according to an embodiment of the present invention.

FIG. 4 is a diagram for explaining a content server according to an embodiment of the present invention.

FIG. 5 is a diagram for explaining a realistic service providing apparatus according to an embodiment of the present invention.

FIG. 6 is a flowchart for explaining the operation of the realistic service providing apparatus of FIG. 5.

FIG. 7 is a diagram for explaining the modulation process of a character sound source according to an embodiment of the present invention.

Claims

An IPTV service method, comprising:

receiving a user's voice, analyzing its frequency, and patterning the user's language pronunciation habits in units of frequency bands to store a patterned voice, wherein the user's voice is patterned by phoneme or by sentence, each patterned phoneme or sentence is stored in correspondence with a frequency band, and the stored patterned voice is relearned so that a new patterned voice for a phrase, an intonation, or a phoneme (including the English alphabet) is added;

while content is played on a terminal, modulating the original voice of a character or third medium of the played content into the patterned voice and providing it to the terminal, wherein the modulation is performed by separating the original standard sound source of the character or third medium of the content into frequency-band units, shifting it to the frequency distribution band of the patterned voice, and replacing each frequency-band unit of the shifted original standard sound source with the corresponding frequency-band unit of the patterned voice; and

generating, using a camera, a monitoring image of a person watching the content in front of the terminal, transmitting the monitoring image to a predetermined terminal, receiving an instruction for the person watching the content that is input through the predetermined terminal, and outputting the instruction in the patterned voice through the third medium on the content.

The method of claim 1, further comprising storing information obtained by analyzing the user's motions or facial expressions from a camera image of the user,

wherein the original motion or facial expression of the character or third medium of the content played on the terminal is modulated, according to the analyzed information, into the motion or facial expression of the user.

The method of claim 1,

wherein the patterned voice includes a voice pattern for each band obtained by dividing an audible frequency band into predetermined units.

The method of claim 1,

further comprising selecting the stored patterned voice and receiving the user's voice at a different point in time to update the patterned voice in units of frequency bands.

The method of claim 1, wherein, when the content played on the terminal includes a plurality of characters or third media, one of the plurality of characters or third media is selected through the terminal and the modulation is performed on the sound source of the selected character or third medium.

6. The method of claim 5, wherein the storing stores patterned voices for a plurality of users, one of the plurality of users is selected through the terminal, and the modulation is performed on the sound source with the patterned voice of the selected user.

(Deleted)

The method of claim 1,

further comprising measuring the time for which the content is played on the terminal and, after a predetermined time has elapsed, outputting an instruction for the person watching the content in the patterned voice through the third medium on the content.

A realistic service providing apparatus for digital multimedia content, comprising:

a pronunciation patterning unit that receives a user's voice, analyzes its frequency, and patterns the user's language pronunciation habits in units of predetermined frequency bands to store a patterned voice, wherein the user's voice is patterned by phoneme or by sentence, each patterned phoneme or sentence is stored in correspondence with a frequency band, and the stored patterned voice is relearned and updated so that a new patterned voice for a phrase, an intonation, or a phoneme (including the English alphabet) is added; and

a voice modulator that, while content is played on a terminal, modulates the original voice of a character or third medium of the played content into the patterned voice and provides it to the terminal, wherein the modulation is performed by separating the original standard sound source of the character or third medium of the content into frequency-band units, shifting it to the frequency distribution band of the patterned voice, and replacing each frequency-band unit of the shifted original standard sound source with the corresponding frequency-band unit of the patterned voice,

wherein, when a monitoring image of a person watching the content in front of the terminal is generated through a camera and transmitted to a predetermined terminal, the voice modulator receives an instruction for the person watching the content that is input through the predetermined terminal and outputs the instruction in the patterned voice through the third medium on the content.

The apparatus of claim 10, further comprising a motion/expression analyzer that stores information obtained by analyzing the user's motions or facial expressions from a camera image of the user and, according to the analyzed information, modulates the original motion or facial expression of a character or third medium of the content played on the terminal into the motion or facial expression of the user and provides it to the terminal.

The apparatus of claim 10, wherein the realistic service providing apparatus is for providing an IPTV service.

The apparatus of claim 10, wherein the pronunciation patterning unit comprises:

a pattern storage that stores a patterned voice for each of a plurality of users;

a user selector that selects the patterned voice of one user in the pattern storage; and

a frequency analyzer that, for the selected patterned voice, receives the user's voice at a different point in time and updates the patterned voice in units of frequency bands.

The apparatus of claim 10, wherein the voice modulator comprises

a character selector that, when the content played on the terminal includes a plurality of characters or third media, selects one of the plurality of characters or third media through the terminal,

and the modulation is performed on the sound source of the selected character or third medium.

The apparatus of claim 10, wherein the voice modulator comprises

a pronunciation pattern selector that, when patterned voices for a plurality of users have been stored through the pronunciation patterning unit, selects one of the plurality of patterned voices through the terminal,

and the modulation is performed on the corresponding sound source with the selected patterned voice.

The apparatus of claim 10, wherein the voice modulator comprises:

a character sound source extractor that extracts the original voice of the character or third medium; and

a character sound source modulator that modulates the extracted voice into the patterned voice.

(Deleted)