KR102593763B1

KR102593763B1 - Method and server for guiding action of internet broadcasting host based on reaction of listeners

Info

Publication number: KR102593763B1
Application number: KR1020230066264A
Authority: KR
Inventors: 우성한; 안시성
Original assignee: 주식회사 스푼라디오
Priority date: 2023-05-23
Filing date: 2023-05-23
Publication date: 2023-10-26

Abstract

청취자 반응에 기반하여 방송 호스트의 방송 진행 방향을 유도하는 서버가 개시된다. 본 서버는, 보이스 기반의 라이브 인터넷 방송의 호스트 단말 및 청취자 단말과 통신하는 서버 통신부, 방송 데이터가 저장된 메모리; 및 하나 이상의 코어를 포함하는 서버 프로세서를 포함할 수 있다. 서버 프로세서는, 청취자의 반응 정보를 수집하는 수집 모듈, 수집 모듈에 의해 수집된 청취자의 반응 정보에 기초하여 청취자의 감정 및 참여도를 분석하는 분석 모듈 및 분석된 청취자의 감정 및 참여도에 따라 호스트의 방송 진행 방향을 가이드하는 진행 유도 모듈을 포함할 수 있다. 본 서버가 제공됨으로써, 호스트의 방송 진행이 보다 원활하게 수행되어 사용자 편의가 제고될 수 있다.A server that guides the broadcasting direction of a broadcasting host based on listener responses is disclosed. This server includes a server communication unit that communicates with a host terminal and a listener terminal of a voice-based live Internet broadcast, a memory storing broadcast data; and a server processor including one or more cores. The server processor includes a collection module that collects the listener's reaction information, an analysis module that analyzes the listener's emotion and participation based on the listener's reaction information collected by the collection module, and a host according to the analyzed listener's emotion and participation. It may include a progress guidance module that guides the direction of the broadcast. By providing this server, the host's broadcast can be performed more smoothly and user convenience can be improved.

Description

Method for inducing the broadcasting direction of a broadcasting host based on listener response and a server for the same {METHOD AND SERVER FOR GUIDING ACTION OF INTERNET BROADCASTING HOST BASED ON REACTION OF LISTENERS}

본 개시는 보이스 기반의 라이브 인터넷 방송 서비스에 관한 것이다. 보다 상세하게는, 본 개시는 청취자 반응에 기반하여 방송 호스트의 방송 진행 방향을 유도하는 방법 및 이를 위한 서버에 관한 것이다.This disclosure relates to a voice-based live Internet broadcasting service. More specifically, the present disclosure relates to a method for inducing a broadcasting host's broadcasting direction based on listener responses and a server for the same.

이하에 기술되는 내용은 단순히 본 실시예와 관련되는 배경 정보만을 제공할 뿐 종래기술을 구성하는 것이 아니다.The content described below simply provides background information related to this embodiment and does not constitute prior art.

과거 지상파 중심의 방송 시스템에서 최근 인터넷이 급격하게 보급됨으로써, 인터넷을 통한 방송 서비스를 지원하는 플랫폼에 대한 시청자들의 수요가 폭발적으로 증가함에 따라, 인터넷 기반의 스트리밍 서비스가 활발하게 제공되고 있으며, 영상 기반의 스트리밍 서비스뿐만 아니라 오디오 기반의 스트리밍 서비스도 꾸준하게 성장하고 있다.With the recent rapid spread of the Internet in the past terrestrial-centered broadcasting system, viewer demand for platforms that support broadcasting services through the Internet has increased explosively, and Internet-based streaming services are being actively provided, and video-based streaming services are being actively provided. In addition to streaming services, audio-based streaming services are also steadily growing.

또한, 오디오 기반의 스트리밍 서비스에서 녹화된 방송뿐만 아니라 호스트의 음성을 직접 듣고 실시간 소통을 원하는 청취자의 니즈에 따라 보이스 기반의 라이브 인터넷 방송도 활발하게 진행되고 있다. 보이스 기반의 라이브 인터넷 방송에서 방송 호스트가 오디오 기반으로 방송을 진행하고, 메신저 채팅 및 다자간 라이브 통화(call) 등을 통해 실시간 소통이 이루어지고 있다.In addition, voice-based live Internet broadcasting is being actively conducted in response to the needs of listeners who want to hear the host's voice directly and communicate in real time, as well as broadcasts recorded in audio-based streaming services. In voice-based live Internet broadcasting, the broadcast host broadcasts based on audio, and real-time communication takes place through messenger chatting and multi-party live calls.

보이스 기반의 라이브 인터넷 방송을 진행하는 호스트는 단독 또는 소규모로 구성되고 실제로 음성을 발하면서 방송을 진행하다보니, 청취자의 반응을 민감하게 살피거나 방송 중간중간에 챙겨야 할 내용을 누락시키거나 청취자가 원하는 방송 흐름을 놓칠 수 있는 문제점이 있다. Hosts who conduct voice-based live Internet broadcasts are single or small in number and broadcast while actually speaking, so they are sensitive to listeners' reactions, omit content that needs to be taken care of during the broadcast, or do not listen to what the listener wants. There is a problem where the broadcast flow can be missed.

이에, 방송 호스트를 보조하여 방송 진행을 보다 원활하게 하는 방법이 필요하다.Accordingly, there is a need for a method to assist the broadcast host to make broadcasting proceed more smoothly.

등록특허공보 제10-244905호(등록일: 2022.09.14)Registered Patent Publication No. 10-244905 (Registration date: 2022.09.14)

본 개시에 개시된 실시예는 보이스 기반의 라이브 인터넷 방송에서 청취자 반응에 기반하여 방송 호스트의 방송 진행 방향을 유도하는 방법을 제공하는데 그 목적이 있다.The purpose of the embodiment disclosed in the present disclosure is to provide a method of inducing a broadcasting host's direction of broadcasting based on listener responses in voice-based live Internet broadcasting.

본 개시가 해결하고자 하는 과제들은 이상에서 언급된 과제로 제한되지 않으며, 언급되지 않은 또 다른 과제들은 아래의 기재로부터 통상의 기술자에게 명확하게 이해될 수 있을 것이다.The problems to be solved by the present disclosure are not limited to the problems mentioned above, and other problems not mentioned can be clearly understood by those skilled in the art from the description below.

상술한 기술적 과제를 달성하기 위한 본 개시에 따른 청취자 반응에 기반하여 방송 호스트의 방송 진행 방향을 유도하는 서버는, 보이스 기반의 라이브 인터넷 방송의 호스트 단말 및 청취자 단말과 통신하는 서버 통신부; 방송 데이터가 저장된 메모리; 및 하나 이상의 코어를 포함하는 서버 프로세서를 포함할 수 있다.The server that guides the broadcasting direction of the broadcasting host based on the listener response according to the present disclosure to achieve the above-described technical problem includes: a server communication unit that communicates with the host terminal and the listener terminal of the voice-based live Internet broadcast; A memory in which broadcast data is stored; and a server processor including one or more cores.

상기 서버 프로세서는, 청취자의 반응 정보를 수집하는 수집 모듈; 상기 수집 모듈에 의해 수집된 상기 청취자의 반응 정보에 기초하여 청취자의 감정 및 참여도를 분석하는 분석 모듈; 및 상기 분석된 청취자의 감정 및 참여도에 따라 호스트의 방송 진행 방향을 가이드하는 진행 유도 모듈을 포함할 수 있다.The server processor includes a collection module that collects listener response information; an analysis module that analyzes the listener's emotions and participation based on the listener's reaction information collected by the collection module; and a progress guidance module that guides the host's broadcast direction according to the analyzed listener's emotions and participation.

상기 서버 프로세서는, 상기 분석된 청취자의 감정 및 참여도에 기초하여, 상기 청취자를 그룹화하고 그룹 별로 레이블을 할당하는 그룹화 모듈을 더 포함할 수 있다.The server processor may further include a grouping module that groups the listeners and assigns labels to each group based on the analyzed emotions and participation levels of the listeners.

상기 진행 유도 모듈은, 상기 호스트의 방송 진행 방향을 상기 그룹마다 맞춤형으로 가이드하도록 구성될 수 있다.The progress guidance module may be configured to guide the host's broadcast progress direction in a customized manner for each group.

상기 서버 프로세서는, 가이드된 방송 진행 방향에 대한 호스트의 순응 여부를 모니터링하는 모니터링 모듈을 더 포함할 수 있다.The server processor may further include a monitoring module that monitors the host's compliance with the guided broadcast direction.

상기 서버 프로세서는, 상기 호스트가 순응하여 행동한 경우, 가이드된 방송 진행 방향의 적정성을 청취자 피드백을 통해 결정하도록 구성될 수 있다.The server processor may be configured to determine the appropriateness of the guided broadcast direction through listener feedback when the host acts compliantly.

상기 서버 프로세서는, 상기 라이브 인터넷 방송 중에, 호스트가 포함된 인터넷 콜이나 채팅창에서 청취자의 문의사항이 있는 경우, 문의사항을 리스트업하여 상기 서버 통신부를 통해 호스트 단말에 제공하며, 문의사항 리스트에서 답변이 완료된 문의사항을 제외한 문의사항을 소정 주기로 상기 호스트 단말에 제공하도록 구성될 수 있다. During the live Internet broadcast, if there is an inquiry from a listener in an Internet call or chat window involving the host, the server processor lists the inquiry and provides it to the host terminal through the server communication unit, and selects the inquiry from the inquiry list. It may be configured to provide inquiries excluding inquiries that have been answered to the host terminal at a predetermined period.

상기 서버 프로세서는, 청취자의 문의사항 중에서 중복되거나 소정의 유사도 범위를 갖는 문의사항을 병합하고, 문의 빈도수가 높은 문의사항부터 정렬하여 상기 서버 통신부를 통해 호스트 단말에 제공하도록 구성될 수 있다.The server processor may be configured to merge inquiries that are duplicated or have a predetermined similarity range among listener inquiries, sort inquiries with the highest frequency of inquiries, and provide the inquiries to the host terminal through the server communication unit.

상기 서버 프로세서는, 상기 분석된 청취자의 감정 및 참여도에 대한 데이터가 상기 호스트 단말에서 시각적으로 디스플레이되도록, 상기 분석된 청취자의 감정 및 참여도에 대한 데이터를 상기 서버 통신부를 통해 상기 호스트 단말에 제공하도록 구성될 수 있다.The server processor provides data on the analyzed emotions and participation of the listener to the host terminal through the server communication unit so that the data on the analyzed emotions and participation of the listener are visually displayed on the host terminal. It can be configured to do so.

또한, 상술한 기술적 과제를 달성하기 위한 본 개시에 따른 컴퓨팅 장치에 의해 수행되는 청취자 반응에 기반하여 방송 호스트의 방송 진행 방향을 유도하는 방법은, 청취자의 반응 정보를 수집하는 단계; 상기 수집된 상기 청취자의 반응 정보에 기초하여 청취자의 감정 및 참여도를 분석하는 단계; 및 상기 분석된 청취자의 감정 및 참여도에 따라 호스트의 방송 진행 방향을 가이드하는 단계를 포함할 수 있다.In addition, a method of inducing a broadcasting direction of a broadcasting host based on a listener response performed by a computing device according to the present disclosure for achieving the above-described technical problem includes collecting response information of the listener; Analyzing the listener's emotions and participation based on the collected response information of the listener; And it may include guiding the host's direction of broadcasting according to the analyzed listener's emotions and participation.

상기 유도 방법은, 상기 분석하는 단계 이후에, 상기 분석된 청취자의 감정 및 참여도에 기초하여, 상기 청취자를 그룹화하고 그룹 별로 레이블을 할당하는 단계를 더 포함할 수 있다.The derivation method may further include, after the analyzing step, grouping the listeners and assigning labels to each group based on the analyzed emotions and participation of the listeners.

상기 가이드하는 단계는, 상기 호스트의 방송 진행 방향을 상기 그룹마다 맞춤형으로 가이드하는 단계를 포함할 수 있다.The guiding step may include guiding the host's broadcast direction in a customized manner for each group.

상기 유도 방법은, 상기 가이드하는 단계 이후에, 가이드된 방송 진행 방향에 대한 호스트의 순응 여부를 모니터링하는 단계; 및 상기 호스트가 순응하여 행동한 경우, 가이드된 방송 진행 방향의 적정성을 청취자 피드백을 통해 결정하는 단계를 더 포함할 수 있다.The guidance method includes, after the guiding step, monitoring whether the host is compliant with the guided broadcast direction; And when the host behaves compliantly, the method may further include determining the appropriateness of the guided broadcasting direction through listener feedback.

상기 유도 방법은, 보이스 기반의 라이브 인터넷 방송 중에, 호스트가 포함된 인터넷 콜이나 채팅창에서 청취자의 문의사항이 있는 경우, 문의사항을 리스트업하여 호스트 단말에 제공하는 단계; 및 문의사항 리스트에서 답변이 완료된 문의사항을 제외한 문의사항을 소정 주기로 상기 호스트 단말에 제공하는 단계를 더 포함할 수 있다.The induction method includes the following steps: during a voice-based live Internet broadcast, if a listener has an inquiry in an Internet call or chat window involving the host, listing the inquiry and providing the inquiry to the host terminal; and providing inquiries excluding answers that have been answered from the inquiry list to the host terminal at a predetermined period.

이 외에도, 본 개시를 구현하기 위한 방법을 실행하기 위해 컴퓨터 판독 가능한 기록 매체에 저장된 컴퓨터 프로그램이 더 제공될 수 있다.In addition to this, a computer program stored in a computer-readable recording medium may be further provided to execute the method for implementing the present disclosure.

이 외에도, 본 개시를 구현하기 위한 방법을 실행하기 위한 컴퓨터 프로그램을 기록하는 컴퓨터 판독 가능한 기록 매체가 더 제공될 수 있다.In addition, a computer-readable recording medium recording a computer program for executing a method for implementing the present disclosure may be further provided.

본 개시의 전술한 과제 해결 수단에 의하면, 보이스 기반의 라이브 인터넷 방송 중에 호스트의 방송 진행 방향이 가이드될 수 있으므로, 호스트의 방송 진행 부담을 다소 완화할 수 있으며, 청취자도 방송 참여가 독려될 수 있다.According to the means for solving the above-described problem of the present disclosure, the direction of the host's broadcast can be guided during voice-based live Internet broadcasting, so the burden of the host's broadcast can be somewhat alleviated, and listeners can also be encouraged to participate in the broadcast. .

본 개시의 효과들은 이상에서 언급된 효과로 제한되지 않으며, 언급되지 않은 또 다른 효과들은 아래의 기재로부터 통상의 기술자에게 명확하게 이해될 수 있을 것이다.The effects of the present disclosure are not limited to the effects mentioned above, and other effects not mentioned may be clearly understood by those skilled in the art from the description below.

도 1은 본 개시에 따른 보이스 기반의 라이브 인터넷 방송 시스템을 개략적으로 설명하기 위한 도면이다.
도 2는 본 개시에 따른 청취자 반응에 기반하여 방송 호스트의 방송 진행 방향을 유도하는 서버 및 호스트 단말의 구성들을 나타내는 상대 블록도이다.
도 3은 본 개시에 따른 방송 호스트의 방송 진행 방향을 유도하는 서버의 프로세싱 과정을 나타내는 시퀀스도이다.
도 4는 본 개시에 따른 감정 분석 모델의 학습 및 적용 시의 프로세싱 과정을 설명하기 위한 도면이다.
도 5는 본 개시에 따른 방송 호스트의 방송 진행 방향을 유도하는 서버가 청취자 반응에 기반하여 방송 진행을 가이드하는 방법을 설명하기 위한 도면이다.1 is a diagram schematically illustrating a voice-based live Internet broadcasting system according to the present disclosure.
FIG. 2 is a relative block diagram showing the configurations of a server and a host terminal that guide the broadcasting host's broadcasting direction based on listener responses according to the present disclosure.
Figure 3 is a sequence diagram showing the processing process of the server that guides the broadcasting direction of the broadcasting host according to the present disclosure.
FIG. 4 is a diagram illustrating a processing process when learning and applying an emotion analysis model according to the present disclosure.
FIG. 5 is a diagram illustrating a method in which a server that guides the broadcasting direction of a broadcasting host according to the present disclosure guides the broadcasting progress based on listener responses.

본 개시 전체에 걸쳐 동일 참조 부호는 동일 구성요소를 지칭한다. 본 개시가 실시예들의 모든 요소들을 설명하는 것은 아니며, 본 개시가 속하는 기술분야에서 일반적인 내용 또는 실시예들 간에 중복되는 내용은 생략한다. 명세서에서 사용되는 '부, 모듈, 부재, 블록'이라는 용어는 소프트웨어 또는 하드웨어로 구현될 수 있으며, 실시예들에 따라 복수의 '부, 모듈, 부재, 블록'이 하나의 구성요소로 구현되거나, 하나의 '부, 모듈, 부재, 블록'이 복수의 구성요소들을 포함하는 것도 가능하다. Like reference numerals refer to like elements throughout this disclosure. The present disclosure does not describe all elements of the embodiments, and general content or overlapping content between the embodiments in the technical field to which the present disclosure pertains is omitted. The term 'unit, module, member, block' used in the specification may be implemented as software or hardware, and depending on the embodiment, a plurality of 'unit, module, member, block' may be implemented as a single component, or It is also possible for one 'part, module, member, or block' to include multiple components.

명세서 전체에서, 어떤 부분이 다른 부분과 "연결"되어 있다고 할 때, 이는 직접적으로 연결되어 있는 경우뿐 아니라, 간접적으로 연결되어 있는 경우를 포함하고, 간접적인 연결은 무선 통신망을 통해 연결되는 것을 포함한다.Throughout the specification, when a part is said to be “connected” to another part, this includes not only direct connection but also indirect connection, and indirect connection includes connection through a wireless communication network. do.

또한 어떤 부분이 어떤 구성요소를 "포함"한다고 할 때, 이는 특별히 반대되는 기재가 없는 한 다른 구성요소를 제외하는 것이 아니라 다른 구성요소를 더 포함할 수 있는 것을 의미한다.Additionally, when a part "includes" a certain component, this means that it may further include other components rather than excluding other components, unless specifically stated to the contrary.

명세서 전체에서, 어떤 부재가 다른 부재 "상에" 위치하고 있다고 할 때, 이는 어떤 부재가 다른 부재에 접해 있는 경우뿐 아니라 두 부재 사이에 또 다른 부재가 존재하는 경우도 포함한다.Throughout the specification, when a member is said to be located “on” another member, this includes not only cases where a member is in contact with another member, but also cases where another member exists between the two members.

제 1, 제 2 등의 용어는 하나의 구성요소를 다른 구성요소로부터 구별하기 위해 사용되는 것으로, 구성요소가 전술된 용어들에 의해 제한되는 것은 아니다. Terms such as first and second are used to distinguish one component from another component, and the components are not limited by the above-mentioned terms.

단수의 표현은 문맥상 명백하게 예외가 있지 않는 한, 복수의 표현을 포함한다.Singular expressions include plural expressions unless the context clearly makes an exception.

각 단계들에 있어 식별부호는 설명의 편의를 위하여 사용되는 것으로 식별부호는 각 단계들의 순서를 설명하는 것이 아니며, 각 단계들은 문맥상 명백하게 특정 순서를 기재하지 않는 이상 명기된 순서와 다르게 실시될 수 있다. The identification code for each step is used for convenience of explanation. The identification code does not explain the order of each step, and each step may be performed differently from the specified order unless a specific order is clearly stated in the context. there is.

이하 첨부된 도면들을 참고하여 본 개시의 작용 원리 및 실시예들에 대해 설명한다.Hereinafter, the operating principle and embodiments of the present disclosure will be described with reference to the attached drawings.

본 명세서에서 '본 개시에 따른 방송 호스트의 방송 진행 방향을 유도하는 서버'는 클라우드 시스템뿐만 아니라 연산처리를 수행하여 사용자에게 결과를 제공할 수 있는 다양한 장치들로 구현될 수 있다. 예를 들어, 본 개시에 따른 방송 호스트의 방송 진행 방향을 유도하는 서버는, 컴퓨터, 서버 장치 및 휴대용 단말기를 모두 포함하거나, 또는 어느 하나의 형태가 될 수 있다.In this specification, the 'server that guides the broadcasting direction of the broadcasting host according to the present disclosure' can be implemented not only by a cloud system but also by various devices that can perform computational processing and provide results to users. For example, the server that guides the broadcasting direction of the broadcasting host according to the present disclosure may include all of a computer, a server device, and a portable terminal, or may be in any one form.

여기에서, 상기 컴퓨터는 예를 들어, 웹 브라우저(WEB Browser)가 탑재된 노트북, 데스크톱(desktop), 랩톱(laptop), 태블릿 PC, 슬레이트 PC 등을 포함할 수 있다.Here, the computer may include, for example, a laptop, desktop, laptop, tablet PC, slate PC, etc. equipped with a web browser.

상기 서버 장치는 외부 장치와 통신을 수행하여 정보를 처리하는 서버로써, 애플리케이션 서버, 컴퓨팅 서버, 데이터베이스 서버, 파일 서버, 게임 서버, 메일 서버, 프록시 서버 및 웹 서버 등을 포함할 수 있다.The server device is a server that processes information by communicating with external devices, and may include an application server, computing server, database server, file server, game server, mail server, proxy server, and web server.

상기 휴대용 단말기는 예를 들어, 휴대성과 이동성이 보장되는 무선 통신 장치로서, PCS(Personal Communication System), GSM(Global System for Mobile communications), PDC(Personal Digital Cellular), PHS(Personal Handyphone System), PDA(Personal Digital Assistant), IMT(International Mobile Telecommunication)-2000, CDMA(Code Division Multiple Access)-2000, W-CDMA(W-Code Division Multiple Access), WiBro(Wireless Broadband Internet) 단말, 스마트 폰(Smart Phone) 등과 같은 모든 종류의 핸드헬드(Handheld) 기반의 무선 통신 장치와 시계, 반지, 팔찌, 발찌, 목걸이, 안경, 콘택트 렌즈, 또는 머리 착용형 장치(head-mounted-device(HMD) 등과 같은 웨어러블 장치를 포함할 수 있다.The portable terminal is, for example, a wireless communication device that guarantees portability and mobility, such as PCS (Personal Communication System), GSM (Global System for Mobile communications), PDC (Personal Digital Cellular), PHS (Personal Handyphone System), and PDA. (Personal Digital Assistant), IMT (International Mobile Telecommunication)-2000, CDMA (Code Division Multiple Access)-2000, W-CDMA (W-Code Division Multiple Access), WiBro (Wireless Broadband Internet) terminal, smart phone ), all types of handheld wireless communication devices, and wearable devices such as watches, rings, bracelets, anklets, necklaces, glasses, contact lenses, or head-mounted-device (HMD). may include.

본 개시에 따른 인공 지능과 관련된 기능은 프로세서와 메모리를 통해 동작된다. 프로세서는 하나 또는 복수의 프로세서로 구성될 수 있다. 이때, 하나 또는 복수의 프로세서는 CPU, AP, DSP(Digital Signal Processor) 등과 같은 범용 프로세서, GPU, VPU(Vision Processing Unit)와 같은 그래픽 전용 프로세서 또는 NPU와 같은 인공 지능 전용 프로세서일 수 있다. 하나 또는 복수의 프로세서는, 메모리에 저장된 기 정의된 동작 규칙 또는 인공 지능 모델에 따라, 입력 데이터를 처리하도록 제어한다. 또는, 하나 또는 복수의 프로세서가 인공 지능 전용 프로세서인 경우, 인공 지능 전용 프로세서는, 특정 인공 지능 모델의 처리에 특화된 하드웨어 구조로 설계될 수 있다.Functions related to artificial intelligence according to the present disclosure are operated through a processor and memory. The processor may consist of one or multiple processors. At this time, one or more processors may be a general-purpose processor such as a CPU, AP, or DSP (Digital Signal Processor), a graphics-specific processor such as a GPU or VPU (Vision Processing Unit), or an artificial intelligence-specific processor such as an NPU. One or more processors control input data to be processed according to predefined operation rules or artificial intelligence models stored in memory. Alternatively, when one or more processors are dedicated artificial intelligence processors, the artificial intelligence dedicated processors may be designed with a hardware structure specialized for processing a specific artificial intelligence model.

기 정의된 동작 규칙 또는 인공 지능 모델은 학습을 통해 만들어진 것을 특징으로 한다. 여기서, 학습을 통해 만들어진다는 것은, 기본 인공 지능 모델이 학습 알고리즘에 의하여 다수의 학습 데이터들을 이용하여 학습됨으로써, 원하는 특성(또는, 목적)을 수행하도록 설정된 기 정의된 동작 규칙 또는 인공 지능 모델이 만들어짐을 의미한다. 이러한 학습은 본 개시에 따른 인공 지능이 수행되는 기기 자체에서 이루어질 수도 있고, 별도의 서버 및/또는 시스템을 통해 이루어 질 수도 있다. 학습 알고리즘의 예로는, 지도형 학습(supervised learning), 비지도형 학습(unsupervised learning), 준지도형 학습(semi-supervised learning) 또는 강화 학습(reinforcement learning)이 있으나, 전술한 예에 한정되지 않는다.Predefined operation rules or artificial intelligence models are characterized by being created through learning. Here, created through learning means that a basic artificial intelligence model is learned using a large number of learning data by a learning algorithm, thereby creating a predefined operation rule or artificial intelligence model set to perform the desired characteristics (or purpose). It means burden. This learning may be accomplished in the device itself that performs artificial intelligence according to the present disclosure, or may be accomplished through a separate server and/or system. Examples of learning algorithms include supervised learning, unsupervised learning, semi-supervised learning, or reinforcement learning, but are not limited to the examples described above.

인공 지능 모델은, 복수의 신경망 레이어들로 구성될 수 있다. 복수의 신경망 레이어들 각각은 복수의 가중치들 (weight values)을 갖고 있으며, 이전(previous) 레이어의 연산 결과와 복수의 가중치들 간의 연산을 통해 신경 망 연산을 수행한다. 복수의 신경망 레이어들이 갖고 있는 복수의 가중치들은 인공 지능 모델의 학습 결과에 의해 최적화될 수 있다. 예를 들어, 학습 과정 동안 인공 지능 모델에서 획득한 로스(loss) 값 또는 코스트(cost) 값이 감소 또는 최소화되도록 복수의 가중치들이 갱신될 수 있다. 인공 신경망은 심층 신경망(DNN:Deep Neural Network)를 포함할 수 있으며, 예를 들어, CNN (Convolutional Neural Network), DNN (Deep Neural Network), RNN (Recurrent Neural Network), RBM (Restricted Boltzmann Machine), DBN (Deep Belief Network), BRDNN(Bidirectional Recurrent Deep Neural Network) 또는 심층 Q-네트워크 (Deep Q-Networks) 등이 있으나, 전술한 예에 한정되지 않는다.An artificial intelligence model may be composed of multiple neural network layers. Each of the plurality of neural network layers has a plurality of weight values, and neural network calculation is performed through calculation between the calculation result of the previous layer and the plurality of weights. Multiple weights of multiple neural network layers can be optimized by the learning results of the artificial intelligence model. For example, a plurality of weights may be updated so that loss or cost values obtained from the artificial intelligence model are reduced or minimized during the learning process. Artificial neural networks may include deep neural networks (DNN), for example, Convolutional Neural Network (CNN), Deep Neural Network (DNN), Recurrent Neural Network (RNN), Restricted Boltzmann Machine (RBM), Deep Belief Network (DBN), Bidirectional Recurrent Deep Neural Network (BRDNN), or Deep Q-Networks, etc., but are not limited to the examples described above.

프로세서는 뉴럴 네트워크를 생성하거나, 뉴럴 네트워크를 훈련(train, 또는 학습(learn)하거나, 수신되는 입력 데이터를 기초로 연산을 수행하고, 수행 결과를 기초로 정보 신호(information signal)를 생성하거나, 뉴럴 네트워크를 재훈련(retrain)할 수 있다. The processor creates a neural network, trains or learns a neural network, performs calculations based on received input data, generates an information signal based on the results, or generates a neural network. The network can be retrained.

뉴럴 네트워크는 CNN(Convolutional Neural Network), RNN(Recurrent Neural Network), 퍼셉트론(perceptron), 다층 퍼셉트론(multilayer perceptron), FF(Feed Forward), RBF(Radial Basis Network), DFF(Deep Feed Forward), LSTM(Long Short Term Memory), GRU(Gated Recurrent Unit), AE(Auto Encoder), VAE(Variational Auto Encoder), DAE(Denoising Auto Encoder), SAE(Sparse Auto Encoder), MC(Markov Chain), HN(Hopfield Network), BM(Boltzmann Machine), RBM(Restricted Boltzmann Machine), DBN(Depp Belief Network), DCN(Deep Convolutional Network), DN(Deconvolutional Network), DCIGN(Deep Convolutional Inverse Graphics Network), GAN(Generative Adversarial Network), LSM(Liquid State Machine), ELM(Extreme Learning Machine), ESN(Echo State Network), DRN(Deep Residual Network), DNC(Differentiable Neural Computer), NTM(Neural Turning Machine), CN(Capsule Network), KN(Kohonen Network) 및 AN(Attention Network)를 포함할 수 있으나 이에 한정되는 것이 아닌 임의의 뉴럴 네트워크를 포함할 수 있음은 통상의 기술자가 이해할 것이다.Neural networks include CNN (Convolutional Neural Network), RNN (Recurrent Neural Network), perceptron, multilayer perceptron, FF (Feed Forward), RBF (Radial Basis Network), DFF (Deep Feed Forward), and LSTM. (Long Short Term Memory), GRU (Gated Recurrent Unit), AE (Auto Encoder), VAE (Variational Auto Encoder), DAE (Denoising Auto Encoder), SAE (Sparse Auto Encoder), MC (Markov Chain), HN (Hopfield) Network), BM (Boltzmann Machine), RBM (Restricted Boltzmann Machine), DBN (Depp Belief Network), DCN (Deep Convolutional Network), DN (Deconvolutional Network), DCIGN (Deep Convolutional Inverse Graphics Network), GAN (Generative Adversarial Network) ), Liquid State Machine (LSM), Extreme Learning Machine (ELM), Echo State Network (ESN), Deep Residual Network (DRN), Differential Neural Computer (DNC), Neural Turning Machine (NTM), Capsule Network (CN), Those skilled in the art will understand that it may include any neural network, including, but not limited to, KN (Kohonen Network) and AN (Attention Network).

본 개시의 예시적인 실시예에 따르면, 프로세서는 GoogleNet, AlexNet, VGG Network 등과 같은 CNN(Convolution Neural Network), R-CNN(Region with Convolution Neural Network), RPN(Region Proposal Network), RNN(Recurrent Neural Network), S-DNN(Stacking-based deep Neural Network), S-SDNN(State-Space Dynamic Neural Network), Deconvolution Network, DBN(Deep Belief Network), RBM(Restrcted Boltzman Machine), Fully Convolutional Network, LSTM(Long Short-Term Memory) Network, Classification Network, Generative Modeling, eXplainable AI, Continual AI, Representation Learning, AI for Material Design, 자연어 처리를 위한 BERT, SP-BERT, MRC/QA, Text Analysis, Dialog System, GPT-3, GPT-4, 비전 처리를 위한 Visual Analytics, Visual Understanding, Video Synthesis, ResNet 데이터 지능을 위한 Anomaly Detection, Prediction, Time-Series Forecasting, Optimization, Recommendation, Data Creation 등 다양한 인공 지능 구조 및 알고리즘을 이용할 수 있으며, 이에 제한되지 않는다. 이하, 첨부된 도면을 참조하여 본 개시의 실시예를 상세하게 설명한다.According to an exemplary embodiment of the present disclosure, the processor may support a Convolution Neural Network (CNN), a Region with Convolution Neural Network (R-CNN), a Region Proposal Network (RPN), a Recurrent Neural Network (RNN), such as GoogleNet, AlexNet, VGG Network, etc. ), S-DNN (Stacking-based deep Neural Network), S-SDNN (State-Space Dynamic Neural Network), Deconvolution Network, DBN (Deep Belief Network), RBM (Restrcted Boltzman Machine), Fully Convolutional Network, LSTM (Long Short-Term Memory) Network, Classification Network, Generative Modeling, eXplainable AI, Continual AI, Representation Learning, AI for Material Design, BERT for natural language processing, SP-BERT, MRC/QA, Text Analysis, Dialog System, GPT-3 , GPT-4, Visual Analytics for vision processing, Visual Understanding, Video Synthesis, and Anomaly Detection, Prediction, Time-Series Forecasting, Optimization, Recommendation, and Data Creation for ResNet data intelligence. , but is not limited to this. Hereinafter, embodiments of the present disclosure will be described in detail with reference to the attached drawings.

도 1은 본 개시에 따른 보이스 기반의 라이브 인터넷 방송 시스템(1000)을 개략적으로 설명하기 위한 도면이다.1 is a diagram schematically illustrating a voice-based live Internet broadcasting system 1000 according to the present disclosure.

라이브 인터넷 방송 시스템(1000)은 보이스(voice) 기반의 라이브 인터넷 방송을 제공하기 위해 필요한 구성들을 포함할 수 있는데, 라이브 인터넷 방송 시스템(1000)은 청취자 반응에 기반하여 방송 호스트의 방송 진행 방향을 유도하는 서버(100, 이하 “방송 진행 방향 유도 서버”라고 칭함), 라이브 인터넷 방송의 호스트(H)에 대응하는 호스트 단말(200) 및 라이브 인터넷 방송의 청취자(L) 각각에 대응하는 청취자 단말(300)을 포함할 수 있다.The live Internet broadcasting system 1000 may include components necessary to provide voice-based live Internet broadcasting. The live Internet broadcasting system 1000 guides the broadcasting host's broadcasting direction based on listener responses. a server (100, hereinafter referred to as “broadcast direction guidance server”), a host terminal (200) corresponding to the host (H) of the live Internet broadcast, and a listener terminal (300) corresponding to the listener (L) of the live Internet broadcast. ) may include.

방송 진행 방향 유도 서버(100)는 보이스 기반의 라이브 인터넷 방송을 위한 방송 데이터를 호스트 단말(200)로부터 수신하여, 청취자 단말(300)에 제공함으로써, 라이브 인터넷 방송이 청취자(L)에게 제공되게 할 수 있다(S11).The broadcast direction guidance server 100 receives broadcast data for voice-based live Internet broadcasting from the host terminal 200 and provides it to the listener terminal 300, so that the live Internet broadcast is provided to the listener (L). (S11).

실시예에서, 방송 진행 방향 유도 서버(100)는 서버/클라이언트의 서버뿐만 아니라, 클라우드로도 구현될 수 있다. 이 경우, 방송 진행 방향 유도 서버(100)는 스토리지, 서버, 가상화 모듈/시스템, 운영체제, 미들웨어, 런타임 모듈, 데이터 및 애플리케이션 등을 모두 구비할 수 있다. 이에, 클라이언트 단의 호스트 단말(200) 및 청취자 단말(300)에서는 서비스 애플리케이션만 구비하면, 클라우드 시스템이 제공하는 다양한 기능을 이용할 수 있으며, 사용량에 따라 과금될 수 있다. 가령, 방송 진행 방향 유도 서버(100)는 SaaS(Server as a software) 기반으로 구현될 수 있으나, 실시예가 이에 한정되는 것은 아니다.In an embodiment, the broadcast direction guidance server 100 may be implemented not only as a server/client server, but also as a cloud. In this case, the broadcast direction guidance server 100 may be equipped with storage, servers, virtualization modules/systems, operating systems, middleware, runtime modules, data, and applications. Accordingly, the host terminal 200 and the listener terminal 300 of the client side can use various functions provided by the cloud system as long as they are equipped with a service application, and can be charged according to the amount of usage. For example, the broadcast direction guidance server 100 may be implemented based on SaaS (Server as a software), but the embodiment is not limited to this.

방송 진행 방향 유도 서버(100)는 호스트(H)와 청취자(L)가 소통하도록 메신저 및 다자 간 인터넷 콜(call) 기능을 호스트 단말(H) 및 청취자 단말(L)에 제공할 수 있다(S13, S15).The broadcast direction guidance server 100 can provide messenger and multi-party Internet call functions to the host terminal (H) and the listener terminal (L) so that the host (H) and the listener (L) can communicate (S13) , S15).

방송 진행 방향 유도 서버(100)는 청취자의 반응 정보를 수집하고, 수집 모듈에 의해 수집된 청취자의 반응 정보에 기초하여 청취자의 감정 및 참여도를 분석하며, 분석된 청취자의 감정 및 참여도에 따라 호스트의 방송 진행 방향을 가이드할 수 있다(S17).The broadcast direction guidance server 100 collects the listener's reaction information, analyzes the listener's emotion and participation level based on the listener's reaction information collected by the collection module, and analyzes the listener's emotion and participation level according to the analyzed listener's emotion and participation level. The direction of the host's broadcast can be guided (S17).

도 2는 본 개시에 따른 방송 진행 방향 유도 서버(100) 및 호스트 단말(200)의 구성들을 나타내는 상대 블록도이다. FIG. 2 is a relative block diagram showing the configurations of the broadcast direction guidance server 100 and the host terminal 200 according to the present disclosure.

먼저, 본 개시에 따른 방송 진행 방향 유도 서버(100)는 서버 통신부(110), 서버 입력부(120), 서버 메모리(150) 및 적어도 하나의 서버 프로세서(190)를 포함할 수 있다. 도 2에 도시된 방송 진행 방향 유도 서버(100)의 구성요소들은 본 개시에 따른 방송 진행 방향 유도 서버(100)를 구현하는데 있어서 필수적인 것은 아니어서, 본 명세서 상에서 설명되는 방송 진행 방향 유도 서버(100)는 위에서 열거된 구성요소들 보다 많거나, 또는 적은 구성요소들을 가질 수 있다.First, the broadcast direction guidance server 100 according to the present disclosure may include a server communication unit 110, a server input unit 120, a server memory 150, and at least one server processor 190. The components of the broadcast direction guidance server 100 shown in FIG. 2 are not essential for implementing the broadcast direction guidance server 100 according to the present disclosure, so the broadcast direction guidance server 100 described in this specification ) may have more or fewer components than those listed above.

상기 구성요소들 중 서버 통신부(110)는 외부 장치(가령, 보이스 기반의 라이브 인터넷 방송의 호스트 단말(200)및 청취자 단말(300))와 통신을 가능하게 하는 하나 이상의 구성 요소를 포함할 수 있으며, 예를 들어, 방송 수신 모듈, 유선통신 모듈, 무선통신 모듈, 근거리 통신 모듈, 위치정보 모듈 중 적어도 하나를 포함할 수 있다. 서버 통신부(110)는 보이스(voice) 기반의 라이브 인터넷 방송의 호스트 단말(200) 및 청취자 단말(300)과 통신할 수 있다.Among the above components, the server communication unit 110 may include one or more components that enable communication with external devices (e.g., the host terminal 200 and the listener terminal 300 of voice-based live Internet broadcasting). , For example, it may include at least one of a broadcast reception module, a wired communication module, a wireless communication module, a short-range communication module, and a location information module. The server communication unit 110 can communicate with the host terminal 200 and the listener terminal 300 of a voice-based live Internet broadcast.

서버 입력부(120)는 영상 정보(또는 신호), 오디오 정보(또는 신호), 데이터, 또는 사용자로부터 입력되는 정보의 입력을 위한 것으로서, 적어도 하나의 카메라, 적어도 하나의 마이크로폰 및 사용자 입력부 중 적어도 하나를 포함할 수 있다. 서버 입력부(120)에서 수집한 음성 데이터나 이미지 데이터는 분석되어 사용자의 제어명령으로 처리될 수 있다.The server input unit 120 is for inputting image information (or signal), audio information (or signal), data, or information input from a user, and includes at least one of at least one camera, at least one microphone, and a user input unit. It can be included. Voice data or image data collected from the server input unit 120 may be analyzed and processed as a user's control command.

서버 메모리(150)는 방송 진행 방향 유도 서버(100)의 다양한 기능을 지원하는 데이터와, 제어부의 동작을 위한 프로그램을 저장할 수 있고, 입/출력되는 데이터들(예를 들어, 음악 파일, 정지영상, 동영상 등)을 저장할 있고, 방송 진행 방향 유도 서버(100)에서 구동되는 다수의 응용 프로그램(application program 또는 애플리케이션(application)), 본 서버의 동작을 위한 데이터들, 명령어들을 저장할 수 있다. 이러한 응용 프로그램 중 적어도 일부는, 무선 통신을 통해 외부 서버로부터 다운로드 될 수 있다. The server memory 150 can store data supporting various functions of the broadcasting direction guidance server 100 and a program for the operation of the control unit, and can store input/output data (e.g., music files, still images, etc.). , video, etc.), a plurality of application programs (application programs or applications) running on the broadcasting direction guidance server 100, data for operation of the server, and commands can be stored. At least some of these applications may be downloaded from an external server via wireless communication.

이러한, 서버 메모리(150)는 플래시 메모리 타입(flash memory type), 하드디스크 타입(hard disk type), SSD 타입(Solid State Disk type), SDD 타입(Silicon Disk Drive type), 멀티미디어 카드 마이크로 타입(multimedia card micro type), 카드 타입의 메모리(예를 들어 SD 또는 XD 메모리 등), 램(random access memory; RAM), SRAM(static random access memory), 롬(read-only memory; ROM), EEPROM(electrically erasable programmable read-only memory), PROM(programmable read-only memory), 자기 메모리, 자기 디스크 및 광디스크 중 적어도 하나의 타입의 저장매체를 포함할 수 있다. 또한, 서버 메모리(150)는 방송 진행 방향 유도 서버(100)와는 분리되어 있으나 유선 또는 무선으로 연결된 데이터베이스가 될 수도 있으며, 데이터 베이스 시스템으로 구현될 수도 있다. The server memory 150 is of flash memory type, hard disk type, solid state disk type, SDD type (Silicon Disk Drive type), and multimedia card micro type. card micro type), card type memory (e.g. SD or XD memory, etc.), random access memory (RAM), static random access memory (SRAM), read-only memory (ROM), EEPROM (electrically It may include at least one type of storage medium among erasable programmable read-only memory (PROM), programmable read-only memory (PROM), magnetic memory, magnetic disk, and optical disk. In addition, the server memory 150 is separate from the broadcast direction guidance server 100, but may be a database connected by wire or wirelessly, and may be implemented as a database system.

서버 메모리(150)는 방송 데이터를 저장할 수 있으며, 감정 분석 모델(151)을 저장할 수 있다. 감정 분석 모델(151)은 학습이 완료된 모델일 수 있으며, 학습 중간에도 수시로 저장될 수 있다.The server memory 150 can store broadcast data and can store an emotion analysis model 151. The emotion analysis model 151 may be a model whose learning has been completed, and may be saved at any time even during training.

서버 프로세서(190)는 적어도 하나의 코어를 포함하며, 방송 진행 방향 유도 서버(100 내의 구성요소들의 동작을 제어하기 위한 알고리즘 또는 알고리즘을 재현한 프로그램에 대한 데이터를 저장하는 메모리, 및 메모리에 저장된 데이터를 이용하여 전술한 동작을 수행하는 적어도 하나의 프로세서(미도시)로 구현될 수 있다. 이때, 메모리와 프로세서는 각각 별개의 칩으로 구현될 수 있다. 또는, 메모리와 프로세서는 단일 칩으로 구현될 수도 있다.The server processor 190 includes at least one core, a memory for storing data about an algorithm for controlling the operation of components within the broadcast direction guidance server 100 or a program that reproduces the algorithm, and data stored in the memory. It can be implemented with at least one processor (not shown) that performs the above-described operations using. At this time, the memory and the processor can each be implemented as separate chips. Alternatively, the memory and the processor can be implemented as a single chip. It may be possible.

서버 프로세서(190)는 청취자의 반응 정보를 수집하는 수집 모듈(191), 수집 모듈에 의해 수집된 상기 청취자의 반응 정보에 기초하여 청취자의 감정 및 참여도를 분석하는 분석 모듈(193), 분석된 청취자의 감정 및 참여도에 따라 호스트의 방송 진행 방향을 가이드하는 진행 유도 모듈(195)을 포함할 수 있다.The server processor 190 includes a collection module 191 that collects the listener's reaction information, an analysis module 193 that analyzes the listener's emotions and participation based on the listener's reaction information collected by the collection module, and an analysis module 193 that analyzes the listener's reaction information. It may include a progress guidance module 195 that guides the direction of the host's broadcast according to the listener's emotions and participation level.

또한, 서버 프로세서(190)는 분석된 청취자의 감정 및 참여도에 기초하여, 청취자를 그룹화하고 그룹 별로 레이블을 할당하는 그룹화 모듈(197) 및 가이드된 방송 진행 방향에 대한 호스트의 순응 여부를 모니터링하는 모니터링 모듈(199)을 더 포함할 수 있다.In addition, the server processor 190 includes a grouping module 197 that groups listeners and assigns labels to each group based on the analyzed emotions and participation of the listeners, and monitors the host's compliance with the guided broadcast direction. It may further include a monitoring module 199.

또한, 서버 프로세서(190)는 이하의 도 3 내지 도 5에서 설명되는 본 개시에 따른 다양한 실시 예들을 방송 진행 방향 유도 서버(100) 상에서 구현하기 위하여, 위에서 살펴본 구성요소들을 중 어느 하나 또는 복수를 조합하여 제어할 수 있다. In addition, the server processor 190 includes one or more of the components described above in order to implement various embodiments according to the present disclosure described in FIGS. 3 to 5 below on the broadcast direction guidance server 100. They can be controlled by combining them.

도 2에 도시된 호스트 단말(200)은 단말 통신부(210), 단말 입력부(220), 단말 디스플레이(230), 단말 메모리(250) 및 단말 프로세서(290)를 포함할 수 있다. 도 2에 도시된 호스트 단말(200)의 구성요소들은 본 개시에 따른 호스트 단말(200)를 구현하는데 있어서 필수적인 것은 아니어서, 본 명세서 상에서 설명되는 호스트 단말(200)은 위에서 열거된 구성요소들 보다 많거나, 또는 적은 구성요소들을 가질 수 있다.The host terminal 200 shown in FIG. 2 may include a terminal communication unit 210, a terminal input unit 220, a terminal display 230, a terminal memory 250, and a terminal processor 290. The components of the host terminal 200 shown in FIG. 2 are not essential for implementing the host terminal 200 according to the present disclosure, so the host terminal 200 described in this specification has more components than the components listed above. It may have many or few components.

방송 진행 방향 유도 서버(100)의 구성들과 차이점을 중심으로 설명하면, 단말 디스플레이(230)는 보이스 기반의 라이브 인터넷 방송의 메신저 채팅을 출력할 수 있으며, 라이브 인터넷 방송을 위한 다양한 인터페이스를 출력할 수 있다.To explain the configurations and differences of the broadcasting direction guidance server 100, the terminal display 230 can output messenger chat for voice-based live Internet broadcasting and can output various interfaces for live Internet broadcasting. You can.

실시예에서, 단말 프로세서(290)는 호스트(H)의 발화 내용을 인식하고 필터링하는 필터링 모듈을 포함할 수 있는데, 필터링 모듈은 호스트(H) 및 청취자(L)의 발화 내용을 STT(speech to text) 기법을 이용하여 음성(voice)을 텍스트화하고 필터링 대상 문구(비속어 또는 욕설 등)를 하이라이트하여 표시할 수 있으며, 수신되는 호스트(H)의 보이스 신호에서도 해당 필터링 대상 문구를 가공(가령, 특정 사운드로 대체하거나 출력을 낮추는 방법이 있으나, 실시예가 이에 한정되는 것은 아님)하여 출력하거나 방송 진행 방향 유도 서버(100)로 제공할 수 있다.In an embodiment, the terminal processor 290 may include a filtering module that recognizes and filters the content of the speech of the host (H), and the filtering module can select the content of the speech of the host (H) and the listener (L) through speech to speech (STT). Using the text) technique, the voice can be converted into text and the filtering target phrase (slang or swear words, etc.) can be highlighted and displayed, and the filtering target phrase can be processed (e.g., in the voice signal of the host (H) received). There are ways to replace it with a specific sound or lower the output, but the embodiment is not limited to this) and output it or provide it to the broadcasting direction guidance server 100.

도 2에 도시된 구성 요소들의 성능에 대응하여 적어도 하나의 구성요소가 추가되거나 삭제될 수 있다. 또한, 구성 요소들의 상호 위치는 시스템의 성능 또는 구조에 대응하여 변경될 수 있다는 것은 당해 기술 분야에서 통상의 지식을 가진 자에게 용이하게 이해될 것이다.At least one component may be added or deleted in response to the performance of the components shown in FIG. 2. Additionally, it will be easily understood by those skilled in the art that the mutual positions of the components may be changed in response to the performance or structure of the system.

한편, 도 2에서 도시된 각각의 구성요소는 소프트웨어 및/또는 Field Programmable Gate Array(FPGA) 및 주문형 반도체(ASIC, Application Specific Integrated Circuit)와 같은 하드웨어 구성요소를 의미한다. Meanwhile, each component shown in FIG. 2 refers to software and/or hardware components such as Field Programmable Gate Array (FPGA) and Application Specific Integrated Circuit (ASIC).

도 3은 본 개시에 따른 방송 진행 방향 유도 서버(100)의 프로세싱 과정을 나타내는 시퀀스도이다. 도 4는 본 개시에 따른 감정 분석 모델(153)의 학습/적용 시의 프로세싱 과정을 설명하기 위한 도면이고, 도 5는 본 개시에 따른 방송 진행 방향 유도 서버(100)가 청취자 반응에 기반하여 방송 진행 "??*을 가이드하는 방법을 설명하기 위한 도면이다. 도 3을 설명하면서, 필요한 부분에서 도 4 및 도 5를 함께 참조하여 설명하기로 한다.Figure 3 is a sequence diagram showing the processing process of the broadcast direction guidance server 100 according to the present disclosure. FIG. 4 is a diagram for explaining the processing process when learning/applying the emotion analysis model 153 according to the present disclosure, and FIG. 5 shows the broadcasting direction guidance server 100 according to the present disclosure performing broadcasting based on listener responses. This is a drawing to explain a method of guiding progress "??*. While explaining FIG. 3, necessary parts will be explained with reference to FIGS. 4 and 5 together.

서버 프로세서(190)의 수집 모듈(191)은 청취자의 반응 정보를 수집할 수 있다(S310).The collection module 191 of the server processor 190 may collect listener reaction information (S310).

여기서, 청취자의 반응 정보는, 호스트(H)의 보이스에 대한 청취자(L)의 반응, 호스트가 제공하는 뮤직에 대한 청취자의 반응, 기록된 대화 기록(채팅 기록 또는 텍스트화된 대화 기록 등)으로부터 추출된 청취자의 반응에 관련된 정보를 포함할 수 있으며, 서버 프로세서(190)는 청취자 단말(300)로부터 직접 수집할 수 있다. Here, the listener's reaction information is derived from the listener's (L) reaction to the host's (H) voice, the listener's reaction to the music provided by the host, and recorded conversation records (chat records, text conversation records, etc.). It may include information related to the extracted listener's response, and the server processor 190 can collect it directly from the listener terminal 300.

이때, 서버 프로세서(190)는 청취자 단말(300)의 단말 센싱부(330)를 통해 수집된 생체 데이터를 분석하여, 청취자(L)의 반응을 판단할 수 있다.At this time, the server processor 190 may analyze the biometric data collected through the terminal sensing unit 330 of the listener terminal 300 and determine the response of the listener (L).

또한, 서버 프로세서(190)는 청취자의 채팅 내용 및 인터넷 콜의 발화 내용을 자연어 처리(NLP)하거나, STT(speech to text) 알고리즘을 적용하여 텍스트화할 수 있는데, 자연어 처리 알고리즘의 경우 시퀀스 투 시퀀스 모델(sequence to sequence model), 딥러닝 모델 등을 포함할 수 있으며, 단어 주머니 모델, 불용어 제거 모델, 어간 추출 모델 및 표제어 추출 모델 등이 적용될 수 있으나, 실시예가 이에 한정되는 것은 아니다.In addition, the server processor 190 can convert the listener's chat content and Internet call utterance into text by applying natural language processing (NLP) or a speech to text (STT) algorithm. In the case of the natural language processing algorithm, a sequence-to-sequence model It may include a sequence to sequence model, a deep learning model, etc., and a bag of words model, a stopword removal model, a stemming model, and a lemma extraction model may be applied, but the embodiment is not limited thereto.

S310 단계에서 청취자의 반응 정보가 수집된 후, 서버 프로세서(190)의 분석 모듈(193)은 상기 수집 모듈에 의해 수집된 상기 청취자의 반응 정보에 기초하여 청취자의 감정 및 참여도를 분석할 수 있다(S320).After the listener's reaction information is collected in step S310, the analysis module 193 of the server processor 190 may analyze the listener's emotions and participation based on the listener's reaction information collected by the collection module. (S320).

도 4를 참고하면, 분석 모듈(193)은 채팅창이나 인터넷 콜에서 청취자의 반응 정보를 텍스트화하여 감정 분석 모델(151)에 입력하면, 감정 분석 모델(151)은 청취자의 감정 상태를 출력할 수 있다. 상기 감정 분석 모델(151)은 신경망 기반으로 미리 학습된 모델일 수 있다.Referring to FIG. 4, the analysis module 193 converts the listener's reaction information into text in a chat window or Internet call and inputs it into the emotion analysis model 151, and the emotion analysis model 151 outputs the listener's emotional state. You can. The emotion analysis model 151 may be a pre-trained model based on a neural network.

실시예에서, 감정 분석 모델(151)은 텍스트화된 키워드, 청취자 단말(300)로부터 수집된 청취자의 생체 데이터(가령, 심박수, 심전도, 혈압, 맥박, 당수치 등)에 기반하여, 청취자의 특정 감정 상태를 분석할 수 있다. 감정 분석 모델(151)은 기쁨, 슬픔, 지루함, 재밌음, 화남 등의 감정 상태를 분석하여 출력할 수 있다. 서버 프로세서(190)는 분석된 특정 감정 상태를 서버 메모리(150)에 구분하여 저장할 수 있다.In an embodiment, the emotion analysis model 151 is based on textual keywords and the listener's biometric data collected from the listener terminal 300 (e.g., heart rate, electrocardiogram, blood pressure, pulse, sugar level, etc.) to identify the listener. Emotional states can be analyzed. The emotion analysis model 151 can analyze and output emotional states such as joy, sadness, boredom, fun, and anger. The server processor 190 may separately store the analyzed specific emotional state in the server memory 150.

또한, 분석 모듈(193)은 청취자 참여도를 측정하기 위한 다양한 지표를 활용할 수 있는데, 분석 모듈(193)은 채팅 활동량, 후원 횟수, 방송 시청 시간, 공유 횟수 및 호감 표시 횟수 등에 기초하여, 청취자의 참여도를 결정할 수 있으나, 실시예가 이에 한정되는 것은 아니다.In addition, the analysis module 193 can utilize various indicators to measure listener participation. The analysis module 193 analyzes listeners based on the amount of chat activity, number of sponsorships, broadcast viewing time, number of shares, number of likes, etc. The degree of participation can be determined, but the embodiment is not limited to this.

S320 단계 이후, 그룹화 모듈(195)은 분석된 청취자의 감정 및 참여도에 기초하여, 청취자를 그룹화하고 그룹 별로 레이블을 할당할 수 있다.After step S320, the grouping module 195 may group listeners and assign labels to each group based on the analyzed emotions and participation of the listeners.

그룹화 모듈(195)은 K-평균 알고리즘, DBSCAN(density based spatial clustering of application with noise) 알고리즘, 계층적 클러스터링 알고리즘 등을 이용하여 청취자를 그룹화할 수 있다.The grouping module 195 may group listeners using a K-means algorithm, DBSCAN (density based spatial clustering of application with noise) algorithm, hierarchical clustering algorithm, etc.

여기서, K-평균 알고리즘은 주어진 데이터를 K개의 클러스터로 묶는 알고리즘으로, 각 클러스터와 거리 차이의 분산을 최소화하는 방식으로 동작하는 알고리즘이며, DBSCAN 알고리즘은 클러스터링 알고리즘 중 밀도 방식의 클러스터링 방식이고, 계측적 클러스터링 알고리즘은 여러개의 군집 중에서 가장 유사도가 높은 혹은 거리가 가까운 군집 두 개를 선택하여 하나로 합치면서 군집 개수를 줄여 가는 알고리즘이라 할 수 있다.Here, the K-means algorithm is an algorithm that groups given data into K clusters, and operates by minimizing the variance of the distance difference between each cluster. The DBSCAN algorithm is a density-based clustering method among clustering algorithms, and an instrumental algorithm. The clustering algorithm can be said to be an algorithm that reduces the number of clusters by selecting the two clusters with the highest similarity or the closest distance among several clusters and combining them into one.

가령, 그룹화 모듈(195)은 참여도가 높은 그룹, 참여도가 높은 그룹 중에서도 후원 금액이 소정 금액 이상인 소그룹, 참여도가 낮은 그룹, 참여도가 낮지만 후원 금액이 소정 금액 이상이 소그룹, 흥분을 잘하는 그룹, 냉소적인 그룹 등으로 청취자를 그룹화할 수 있으나, 감정/참여도의 그룹화는 상술한 실시예에 한정되지 않는다.For example, the grouping module 195 creates excitement among groups with high participation, small groups with a sponsorship amount of more than a certain amount among groups with high participation, a group with low participation, and small groups with low participation but with a sponsorship amount of more than a certain amount. Listeners can be grouped into good groups, cynical groups, etc., but grouping of emotions/participation is not limited to the above-described embodiments.

이때, 그룹화 모듈(195)은 참여도의 정도, 흥분의 정도, 냉소의 정도를 판단할 때, 분야 별 참여도 및 감정에 대한 기준 레벨을 설정할 수 있다. 즉, 그룹화 모듈(195)은 방송 채널 종류(가령, 게임, 소개팅, 뮤직, 스포츠 등)에 따라 서로 다른 기준으로 참여도 및 감정을 판단할 수 있다. 이를 위해, 그룹화 모듈(195)은 방송 채널 종류에 따른 감정 및 참여도를 판단하기 위해 기준 레벨을 설정하는 인공 지능 기반의 기준 레벨 결정 모델을 이용할 수 있다. 그룹화 모듈(195)은 참여도가 낮은 그룹의 기준 레벨, 감정이 기쁜 상태의 기준 레벨, 슬픈 상태의 기준 레벨, 흥분의 기준 레벨, 냉소의 기준 레벨 등을 기준 레벨 결정 모델을 이용하여 결정할 수 있다.At this time, when determining the degree of participation, degree of excitement, and degree of cynicism, the grouping module 195 may set standard levels for participation and emotion for each field. That is, the grouping module 195 can determine participation and emotions based on different criteria depending on the type of broadcast channel (eg, game, blind date, music, sports, etc.). To this end, the grouping module 195 may use an artificial intelligence-based reference level determination model that sets a reference level to determine emotion and participation according to the type of broadcast channel. The grouping module 195 can determine the standard level of a group with low participation, the standard level of a happy state, the standard level of a sad state, the standard level of excitement, and the standard level of cynicism using a standard level determination model. .

여기서, 기준 레벨 결정 모델은 방송 채널 종류 정보, 채팅 기록 정보, 인터넷 콜의 대화 기록 정보, 청취자의 생체 정보 등을 입력받아, 각 방송 채녈 별로 감정 및 참여도의 기준 레벨을 결정할 수 있다.Here, the standard level determination model receives broadcast channel type information, chat record information, Internet call conversation record information, listener biometric information, etc., and can determine the standard level of emotion and participation for each broadcast channel.

또한, 서버 프로세서(190)는 청취자를 감정 및 참여도에 기초화여, 그룹화할 때, 상술한 그룹화 알고리즘을 모두 수행한 후, 명확한 결과를 도출하는 그룹화 알고리즘을 선택할 수 있다. 가령, 서버 프로세서(190)는 그룹의 개수, 그룹 간의 거리가 멀리 배치된 상황 등에 기초하여, 그룹화 정도를 평가하여, 그룹화 알고리즘을 선택할 수 있으며, 선택된 그룹화 알고리즘이 그룹화한 데이터를 이용할 수 있다.Additionally, when grouping listeners based on emotion and participation, the server processor 190 may perform all of the above-described grouping algorithms and then select a grouping algorithm that produces clear results. For example, the server processor 190 may select a grouping algorithm by evaluating the degree of grouping based on the number of groups, a situation where the groups are far apart, etc., and use data grouped by the selected grouping algorithm.

S330 단계 이후, 서버 프로세서(190)의 진행 유도 모듈(195)은 분석된 청취자의 감정 및 참여도에 따라 호스트의 방송 진행 방향을 가이드할 수 있다(S340).After step S330, the progress guidance module 195 of the server processor 190 may guide the host's broadcast direction according to the analyzed listener's emotions and participation (S340).

이때, 진행 유도 모듈(195)은 호스트의 방송 진행 방향을 그룹마다 맞춤형으로 가이드할 수 있다.At this time, the progress guidance module 195 can guide the host's broadcast progress direction in a customized manner for each group.

가령, 진행 유도 모듈(195)은 기쁨을 느끼면서 참여도가 높은 그룹에게, 감사 인사를 하라고 호스트에게 가이드할 수 있으며, 우울함을 느끼면서 참여도가 높은 그룹에게 즐거운 뮤직을 재생하라고 가이드할 수 있다.For example, the progress guidance module 195 may guide the host to say thank you to a group that feels happy and has high participation, and may guide the group that feels depressed and has high participation to play pleasant music. .

S340 단계 이후, 서버 프로세서(190)의 모니터링 모듈(199)은 가이드된 방송 진행 방향에 대한 호스트의 순응 여부를 모니터링할 수 있다(S350).After step S340, the monitoring module 199 of the server processor 190 may monitor the host's compliance with the guided broadcast direction (S350).

서버 프로세서(190)는 호스트가 순응하여 행동한 경우(S360), 가이드된 방송 진행 방향의 적정성을 청취자 피드백을 통해 결정할 수 있다(S370).If the host behaves compliantly (S360), the server processor 190 may determine the appropriateness of the guided broadcasting direction through listener feedback (S370).

서버 프로세서(190)는 호스트가 순응하여 행동한 경우에는, 라이브 인터넷 방송의 청취자 증감 변동수, 후원자 증감 변동 추세, 후원 금액의 증감 변동 추세, 채팅창의 활성도, 청취자의 참여 증감 변동 추세 등을 분석하여 호스트 단말(200)에 제공할 수 있다. 서버 프로세서(190)는 방송 진행 가이드에 대해 과금되는 것에 부정적 견해를 가진 호스트에게, 먼저 방송 진행 가이드를 수행한 후, 그에 따른 효과를 보여줌으로써, 과금을 방송 진행 가이드 이후에 수행할 수도 있으며, 방송 진행 가이드하기 전에 후불에 대한 계약서를 호스트 단말(200)에 제공할 수 있다.If the host behaves compliantly, the server processor 190 analyzes the number of changes in the number of listeners of the live Internet broadcast, the trend of changes in the increase or decrease of sponsors, the trend of increase or decrease in sponsorship amount, the activity of the chat window, and the trend of changes in increase or decrease in listener participation. It can be provided to the host terminal 200. The server processor 190 may first perform the broadcasting guide and then show the resulting effect to a host who has a negative opinion about charging for the broadcasting guide, so that charging can be performed after the broadcasting guide. Before proceeding with the guide, a contract for deferred payment may be provided to the host terminal 200.

또한, 서버 프로세서(190)는 호스트가 순응하여 행동한 경우에는, 청취자 단말(300)에 방송 진행 유도에 따른 만족도, 방송 진행에 적절한 조치인지 등에 대한 데이터를 수집하여, 호스트 단말(200)에 제공할 수 있다.In addition, if the host behaves compliantly, the server processor 190 collects data on the listener terminal 300's satisfaction with inducing the broadcast to proceed and whether the action is appropriate for the broadcast, and provides the data to the host terminal 200. can do.

실시예에서, 서버 프로세서(190)는 분석된 청취자의 감정 및 참여도에 대한 데이터가 호스트 단말(200)에서 시각적으로 디스플레이되도록, 분석된 청취자의 감정 및 참여도에 대한 데이터를 서버 통신부(110)를 통해 상기 호스트 단말(200)에 제공할 수 있다.In an embodiment, the server processor 190 sends data about the analyzed listener's emotion and participation to the server communication unit 110 so that the data about the analyzed listener's emotion and participation is visually displayed on the host terminal 200. It can be provided to the host terminal 200 through .

도 5를 참고하면, 서버 프로세서(190)는 라이브 인터넷 방송 중에, 호스트가 포함된 인터넷 콜이나 채팅창에서 청취자의 문의사항이 있는 경우, 문의사항을 리스트업하여 서버 통신부(110)를 통해 호스트 단말(200)에 제공하며(S410), 문의사항 리스트에서 답변이 완료된 문의사항을 제외한 문의사항을 소정 주기로 호스트 단말(200)에 제공할 수 있다(S420). 이때, 서버 프로세서(190)는 인터넷 콜에서 발생된 문의사항을 텍스트화하여 호스트 단말(200)에 제공할 수 있다. Referring to FIG. 5, during a live Internet broadcast, if there is an inquiry from a listener in an Internet call or chat window involving the host, the server processor 190 lists the inquiry and sends it to the host terminal through the server communication unit 110. It is provided to (200) (S410), and inquiries excluding inquiries that have been answered in the inquiry list can be provided to the host terminal 200 at a predetermined period (S420). At this time, the server processor 190 may convert the inquiry generated from the Internet call into text and provide it to the host terminal 200.

이 경우, 서버 프로세서(190)는 청취자의 문의사항 중에서 중복되거나 소정의 유사도 범위를 갖는 문의사항을 병합하고, 문의 빈도수가 높은 문의사항부터 정렬하여 서버 통신부(110)를 통해 호스트 단말(200)에 제공할 수 있다.In this case, the server processor 190 merges inquiries that are duplicated or have a predetermined similarity range among the listener's inquiries, sorts the inquiries with the highest frequency of inquiries, and sends them to the host terminal 200 through the server communication unit 110. can be provided.

서버 프로세서(190)는 문의사항이 중복되거나 소정 유사도를 갖는지 판단하기 위해, 두 문의사항이 입력되면, 유사도를 퍼센트로 출력하는 유사도 검출 알고리즘을 사용할 수 있다. 이때, 서버 프로세서(190)는 각 문의사항을 공간 상의 벡터로 대응시킬 수 있으며, 두 벡터 간 코사인 유사도 값이 0.8 이상 1 이하인 경우 문의사항이 유사한 것으로 판단할 수 있다. 서버 프로세서(190)는 각 문의사항의 명사를 중심으로 문의사항 간의 유사도를 판단할 수 있다. 코사인 유사도 값은 백터 간 스칼라곱과 크기에 의해 산출될 수 있다.The server processor 190 may use a similarity detection algorithm that outputs the similarity as a percentage when two inquiries are input in order to determine whether the inquiries overlap or have a certain degree of similarity. At this time, the server processor 190 may correspond to each inquiry with a vector in space, and if the cosine similarity value between the two vectors is 0.8 or more and 1 or less, it may be determined that the inquiries are similar. The server processor 190 may determine the degree of similarity between inquiries based on the noun of each inquiry. The cosine similarity value can be calculated by the scalar product and size between vectors.

실시예에서, 호스트 단말(200)은 답변이 안된 문의사항에 대해, 답변하거나 거부할 수 있으며, 거부할 경우, 서버 프로세서(190)는 거부된 문의사항을 호스트 단말(200)만 알 수 있도록 표시하여 호스트 단말(200)에 제공할 수 있다.In an embodiment, the host terminal 200 can respond or reject unanswered inquiries, and if rejected, the server processor 190 displays the rejected inquiries so that only the host terminal 200 can see them. This can be provided to the host terminal 200.

실시예에서, 서버 프로세서(190)는 곤란한 문의사항을 소정 수준 이상하는 청취자에게 경고 메시지를 서버 통신부(110)를 통해 청취자 단말에 제공할 수 있다.In an embodiment, the server processor 190 may provide a warning message to the listener terminal through the server communication unit 110 to listeners who ask difficult questions beyond a predetermined level.

다른 실시예에서, 서버 프로세서(190)는 다양한 기준에 따라 하이라이트 영상을 생성할 수 있는데, 이하에서 구체적으로 설명하기로 한다.In another embodiment, the server processor 190 may generate a highlight image according to various criteria, which will be described in detail below.

하이라이트 영상은 라이브 방송의 실제 사용된 콘텐츠뿐만 아니라, 라이브 방송에서 사용되지 않은 데이터, 하이라이트 영상의 가치를 높여주거나 흥미롭고 매력적으로 만들거나 조회수를 높이기 위한 오디오 데이터, 영상 데이터, 세대 별 또는 상황 별 사용하는 언어 데이터, 라이브 인터넷 방송의 핫키워드, 문자, 부호 등을 다양하게 포함하는 콘텐츠로 구현될 수 있다. 이에, 하이라이트 영상은 보이스 기반으로 생성될 수 있으며, 동영상 기반으로도 생성될 수 있다.Highlight videos are not only the content actually used in the live broadcast, but also data not used in the live broadcast, audio data, video data, used by generation or situation to increase the value of the highlight video, make it more interesting and attractive, or increase the number of views. It can be implemented as content containing a variety of language data, hot keywords from live Internet broadcasts, letters, and symbols. Accordingly, the highlight video can be generated based on voice and can also be generated based on video.

서버 프로세서(190)는 라이브 인터넷 방송에 대한 하나 이상의 하이라이트 영상을 생성하기 위한 트리거 조건을 설정할 수 있는데, 서버 프로세서(190)는 라이브 인터넷 방송에서 채팅 진행 속도, 후원 청취자의 수 및 후원 청취자의 후원 금액, 특정 키워드의 사용 빈도수, 키워드에 함의된 특정 감정 상태, 청취자의 참여도, 호스트의 보이스 강도, 호스트의 발화 빈도 및 발화 시간 중 적어도 하나가 소정 수준 이상인 조건을 상기 트리거 조건으로 설정할 수 있다.The server processor 190 may set trigger conditions for generating one or more highlight videos for a live Internet broadcast, where the server processor 190 determines the chat progress speed, the number of sponsored listeners, and the sponsorship amount of the sponsored listeners in the live Internet broadcast. , a condition in which at least one of the frequency of use of a specific keyword, a specific emotional state implied by the keyword, the listener's participation, the intensity of the host's voice, the frequency of the host's speech, and the speech time is above a predetermined level can be set as the trigger condition.

서버 프로세서(190)는 트리거 조건을 만족하는 라이브 인터넷 방송의 구간마다 하이라이트 영상을 생성할 수 있다. 서버 프로세서(190)는 트리거 조건이 만족된 방송 구간마다 보강 데이터를 조합할 수 있다. The server processor 190 may generate a highlight video for each section of a live Internet broadcast that satisfies trigger conditions. The server processor 190 may combine reinforcement data for each broadcast section in which the trigger condition is satisfied.

여기서, 보강 데이터는 라이브 인터넷 방송에서 실제 방송되거나 사용되지 않은 데이터이며, 하이라이트 영상의 조회수를 높이기 위한 오디오 데이터, 영상 데이터, 세대 별 또는 상황 별 사용하는 언어 데이터, 라이브 인터넷 방송의 핫키워드 중 적어도 하나를 포함할 수 있다.Here, the reinforcement data is data that is not actually broadcast or used in the live Internet broadcast, and is at least one of audio data to increase the number of views of the highlight video, video data, language data used by generation or situation, and hot keywords in the live Internet broadcast. may include.

서버 프로세서(190)는 라이브 인터넷 방송에서 실제 방송되거나 사용되지 않은 데이터를 방송 구간에 적절하게 조합하여 하이라이트 영상을 재탄생시킬 수 있으며, 하이라이트 영상의 조회수를 높이기 위한 오디오 데이터(뮤직, 특정 보이스, 핫키워드에 대응하는 사운드 등)를 추가할 수 있으며, 호스트에 대응하는 아바타와 청취자에 대응하는 아바타 콘텐츠를 추가하고, 채팅이나 인터넷 콜에서 대화를 재구성할 수 있다. 이때, 서버 프로세서(190)는 대화를 재구성하면서, 가상의 제3자를 추가하여 대화를 평가하는 멘트를 추가할 수 있다. 서버 프로세서(190)는 호스트나 청취자의 대화에서 호스트나 청취자가 감정을 드러내는 상황에서는 아바타도 감정을 나타내도록 하이라이트 영상을 생성할 수 있다. 이때, 서버 프로세서(190)는 키워드에 함의된 감정을 파악하는 상술한 인공 지능 기반의 분석 모델을 함께 이용할 수 있다.The server processor 190 can recreate a highlight video by appropriately combining data that is actually broadcast or not used in a live Internet broadcast in the broadcast section, and audio data (music, specific voice, hot content) to increase the number of views of the highlight video. (sounds corresponding to keywords, etc.) can be added, avatars corresponding to the host and avatar content corresponding to the listeners can be added, and conversations can be reconstructed in chat or Internet calls. At this time, the server processor 190 may reconstruct the conversation and add a comment evaluating the conversation by adding a virtual third party. The server processor 190 may generate a highlight video so that the avatar also shows emotions in situations where the host or listener shows emotions in a conversation between the host and listeners. At this time, the server processor 190 may use the above-described artificial intelligence-based analysis model that identifies the emotion implied in the keyword.

또한, 서버 프로세서(190)는 세대 별 또는 상황 별 사용하는 언어 데이터, 라이브 인터넷 방송의 핫키워드 등을 결정된 방송 구간에 함께 조합하여 하이라이트 영상을 생성할 수 있다. 서버 프로세서(190)는 핫키워드 중에서도 우선순위가 높은 키워드를 하이라이트 영상의 적절한 지점에 영상 기반으로 또는 오디오 기반으로 삽입할 수 있다.Additionally, the server processor 190 may generate a highlight image by combining language data used by generation or situation, hot keywords of live Internet broadcasting, etc., together in the determined broadcast section. The server processor 190 can insert keywords with high priority among hot keywords at appropriate points in the highlight video based on video or audio.

또한, 서버 프로세서(190)는 트리거 조건을 만족하는 구간이 복수인 경우, 복수의 구간 중 일부를 하나의 하이라이트 영상에 통합하되, 통합된 영상에서 트리거 조건 간의 가중치에 기반하여 영상 내 재생 순서를 구간 별로 다르게 배치할 수 있다. In addition, when there are multiple sections that satisfy the trigger conditions, the server processor 190 integrates some of the multiple sections into one highlight video and adjusts the playback order within the video based on the weight between the trigger conditions in the integrated image. It can be arranged very differently.

실시예에서, 서버 프로세서(190)는 후원 금액이 많은 구간에 대응하는 하이라이트 영상을 보강 데이터를 추가하여 별도로 생성하고, 별도의 하이라이트 영상을 추가적으로 더 생성할 수 있다. 서버 프로세서(190)는 별도의 하이라이트 영상의 경우, 채팅 진행 속도가 소정 수준 이상인 구간, 특정 키워드의 사용 빈도수 또는 키워드에 함의된 특정 감정 상태가 소정 수준 이상인 구간, 청취자의 참여율이 소정 수준 이상인 구간, 호스트의 보이스 강도/발화 빈도/발화 시간이 소정 수준 이상인 구간으로 구분하여, 보강데이터를 구간별로 추가하여 하이라이트 영상을 생성할 수 있다.In an embodiment, the server processor 190 may separately generate a highlight image corresponding to a section with a large sponsorship amount by adding reinforcement data, and may additionally generate a separate highlight image. In the case of a separate highlight video, the server processor 190 may include a section where the chat progress speed is above a predetermined level, a section where the frequency of use of a specific keyword or a specific emotional state implied by the keyword is above a predetermined level, a section where the listener's participation rate is above a predetermined level, A highlight video can be created by dividing the host into sections where the voice intensity/utterance frequency/utterance time is above a certain level and adding reinforcement data for each section.

실시예에서, 서버 프로세서(190)는 통합 영상을 생성할 때, 복수의 트리거 조건을 동시에 만족하는 지점을 하이라이트 영상에서 전반부에 배치하고, 트리거 조건을 하나만 만족하는 지점을 하이라이트 영상의 후반부에 순차적으로 배치할 수 있다. 또한, 서버 프로세서(190)는 가중치가 높은 트리거 조건을 통합 하이라이트 영상의 전반부에 배치할 수 있다. 상기 가중치는 호스트의 선택에 따라 결정될 수 있으나, 실시예가 이에 한정되는 것은 아니다. 실시예에서, 서버 프로세서(190)는 통합 영상의 재생 시간이 소정 시간 이상보다 긴 경우, 통합 영상을 복수의 부분 통합 영상으로 생성할 수도 있다.In an embodiment, when generating an integrated image, the server processor 190 places points that simultaneously satisfy a plurality of trigger conditions in the first half of the highlight image, and sequentially places points that satisfy only one trigger condition in the second half of the highlight image. It can be placed. Additionally, the server processor 190 may place a trigger condition with a high weight in the first half of the integrated highlight image. The weight may be determined according to the selection of the host, but the embodiment is not limited thereto. In an embodiment, if the playback time of the combined image is longer than a predetermined time, the server processor 190 may generate the combined image into a plurality of partial combined images.

한편, 개시된 실시예들은 컴퓨터에 의해 실행 가능한 명령어를 저장하는 기록매체의 형태로 구현될 수 있다. 명령어는 프로그램 코드의 형태로 저장될 수 있으며, 프로세서에 의해 실행되었을 때, 프로그램 모듈을 생성하여 개시된 실시예들의 동작을 수행할 수 있다. 기록매체는 컴퓨터로 읽을 수 있는 기록매체로 구현될 수 있다.Meanwhile, the disclosed embodiments may be implemented in the form of a recording medium that stores instructions executable by a computer. Instructions may be stored in the form of program code, and when executed by a processor, may create program modules to perform operations of the disclosed embodiments. The recording medium may be implemented as a computer-readable recording medium.

컴퓨터가 읽을 수 있는 기록매체로는 컴퓨터에 의하여 해독될 수 있는 명령어가 저장된 모든 종류의 기록 매체를 포함한다. 예를 들어, ROM(Read Only Memory), RAM(Random Access Memory), 자기 테이프, 자기 디스크, 플래쉬 메모리, 광 데이터 저장장치 등이 있을 수 있다. Computer-readable recording media include all types of recording media storing instructions that can be decoded by a computer. For example, there may be Read Only Memory (ROM), Random Access Memory (RAM), magnetic tape, magnetic disk, flash memory, optical data storage device, etc.

이상에서와 같이 첨부된 도면을 참조하여 개시된 실시예들을 설명하였다. 본 개시가 속하는 기술분야에서 통상의 지식을 가진 자는 본 개시의 기술적 사상이나 필수적인 특징을 변경하지 않고도, 개시된 실시예들과 다른 형태로 본 개시가 실시될 수 있음을 이해할 것이다. 개시된 실시예들은 예시적인 것이며, 한정적으로 해석되어서는 안 된다.As described above, the disclosed embodiments have been described with reference to the attached drawings. A person skilled in the art to which this disclosure pertains will understand that the present disclosure may be practiced in forms different from the disclosed embodiments without changing the technical idea or essential features of the present disclosure. The disclosed embodiments are illustrative and should not be construed as limiting.

100 : 방송 진행 방향 유도 서버, 200: 호스트 단말, 300: 청취자 단말.100: Broadcast direction guidance server, 200: Host terminal, 300: Listener terminal.

Claims

As a server that guides the direction of the broadcast host's broadcast based on listener responses,
a server communication unit that communicates with a host terminal and a listener terminal of a voice-based live Internet broadcast; A memory in which broadcast data is stored; and a server processor including one or more cores,
The server processor is,
a collection module that collects listener reaction information;
an analysis module that analyzes the listener's emotions and participation based on the listener's reaction information collected by the collection module; and
It includes a progression guidance module that guides the direction of the host's broadcast according to the analyzed listener's emotions and participation,
The server processor is,
During the live Internet broadcast, in order to guide the host's broadcast progress, if there is an inquiry from a listener in an Internet call or chat window involving the host, the inquiry is listed and provided to the host terminal through the server communication unit. , Inquiries excluding inquiries that have been answered in the inquiry list are provided to the host terminal at a predetermined period,
The server processor is,
If an unanswered inquiry among the listed inquiries is rejected by the host terminal, information on the rejected inquiry is provided to the host terminal through the server communication unit,
The server processor monitors the host's compliance with the guided broadcast direction through a monitoring module,
The server processor is,
If the host complies and acts, the appropriateness of the guided broadcast direction is determined through listener feedback,
If the host behaves compliantly, the server processor analyzes the number of changes in the number of listeners of the live Internet broadcast, the trend of changes in the increase or decrease of sponsors, the trend of increase or decrease in sponsorship amount, the activity of the chat window, and the trend of changes in increase or decrease in listener participation, and the server Provided to the host terminal through the communication department,
The server processor is configured to, when the host behaves compliantly, provide a satisfaction level according to the listener's guidance in broadcasting to the host terminal through the server communication unit.

According to paragraph 1,
The server processor is,
Based on the analyzed emotions and participation of the listeners, it further includes a grouping module that groups the listeners and assigns labels to each group,
The progress guidance module is,
A broadcasting direction guidance server configured to guide the broadcasting direction of the host in a customized manner for each group.

delete

According to paragraph 2,
The server processor is,
A broadcasting direction guidance server configured to merge inquiries that overlap or have a predetermined similarity range among listener inquiries, sort inquiries with the highest frequency of inquiries, and provide the information to the host terminal through the server communication unit.

According to clause 5,
The server processor is,
Broadcasting configured to provide data on the analyzed emotion and participation of the listener to the host terminal through the server communication unit, so that the data on the analyzed emotion and participation of the listener are visually displayed in the host terminal. Direction guidance server.

A method of deriving the broadcasting direction of a broadcasting host based on listener responses performed by a server processor of a broadcasting direction inducing server,
collecting listener reaction information;
Analyzing the listener's emotions and participation based on the collected response information of the listener; and
It includes guiding the direction of the host's broadcast according to the analyzed listener's emotions and participation,
The server processor is,
During the live Internet broadcast, in order to guide the host's broadcast progress, if there are inquiries from listeners in an Internet call or chat window involving the host, the inquiries are listed and provided to the host terminal through the server communication unit, Inquiries excluding answers that have been answered from the inquiry list are provided to the host terminal at a predetermined interval,
The server processor is,
If an unanswered inquiry among the listed inquiries is rejected by the host terminal, information about the rejected inquiry is provided to the host terminal.
The server processor monitors the host's compliance with the guided broadcast direction through a monitoring module,
The server processor is,
If the host complies and acts, the appropriateness of the guided broadcast direction is determined through listener feedback,
If the host behaves compliantly, the server processor analyzes the number of changes in the number of listeners of the live Internet broadcast, the trend of increase or decrease in sponsors, the trend of increase or decrease in sponsorship amount, the activity of the chat window, and the trend of increase or decrease in listener participation, and Provided to,
The server processor is configured to provide the host terminal with satisfaction based on the listener's guidance in broadcasting when the host behaves compliantly.

In clause 7,
After the above analysis step,
Based on the analyzed emotions and participation of the listeners, further comprising grouping the listeners and assigning labels to each group,
The guiding steps are:
A broadcasting direction guidance method comprising guiding the host's broadcasting direction in a customized manner for each group.

delete