KR102593762B1

KR102593762B1 - Method and server for generating highlight contents for live internet broadcasting based on voice recognition

Info

Publication number: KR102593762B1
Application number: KR1020230066263A
Authority: KR
Inventors: 우성한; 안시성
Original assignee: 주식회사 스푼라디오
Priority date: 2023-05-23
Filing date: 2023-05-23
Publication date: 2023-10-26

Abstract

보이스 기반의 라이브 인터넷 방송에 대한 하이라이트 콘텐츠를 생성하는 서버가 개시된다. 본 서버는, 보이스 기반의 라이브 인터넷 방송의 호스트 단말 및 청취자 단말과 통신하는 서버 통신부, 하이라이트 콘텐츠에서 사용되는 보강 데이터를 저장하는 서버 메모리 및 하나 이상의 코어를 포함하는 서버 프로세서를 포함할 수 있다. 서버 프로세서는, 방송 데이터 관리 모듈, 설정 모듈, 구간 결정 모듈 및 생성 모듈 등을 포함할 수 있다. 본 서버가 제공됨으로써, 사용자 편의가 제고될 수 있으며, 라이브 인터넷 방송의 수익이 증가하는 효과가 기대될 수 있다.A server that generates highlight content for voice-based live Internet broadcasting is disclosed. This server may include a server communication unit that communicates with a host terminal and a listener terminal of a voice-based live Internet broadcast, a server memory that stores reinforcement data used in highlight content, and a server processor including one or more cores. The server processor may include a broadcast data management module, a setting module, a section determination module, and a generation module. By providing this server, user convenience can be improved and the effect of increasing profits from live Internet broadcasting can be expected.

Description

Method for generating highlight content for live Internet broadcasting based on voice recognition and server for the same {METHOD AND SERVER FOR GENERATING HIGHLIGHT CONTENTS FOR LIVE INTERNET BROADCASTING BASED ON VOICE RECOGNITION}

본 개시는 보이스 기반의 라이브 인터넷 방송 서비스에 관한 것이다. 보다 상세하게는, 본 개시는 보이스 인식 기반의 라이브 인터넷 방송에 대한 하이라이트 콘텐츠를 생성하는 방법 및 이를 위한 서버에 관한 것이다.This disclosure relates to a voice-based live Internet broadcasting service. More specifically, the present disclosure relates to a method for generating highlight content for a live Internet broadcast based on voice recognition and a server for the same.

이하에 기술되는 내용은 단순히 본 실시예와 관련되는 배경 정보만을 제공할 뿐 종래기술을 구성하는 것이 아니다.The content described below simply provides background information related to this embodiment and does not constitute prior art.

과거 지상파 중심의 방송 시스템에서 최근 인터넷이 급격하게 보급됨으로써, 인터넷을 통한 방송 서비스를 지원하는 플랫폼에 대한 시청자들의 수요가 폭발적으로 증가함에 따라, 인터넷 기반의 스트리밍 서비스가 활발하게 제공되고 있으며, 영상 기반의 스트리밍 서비스뿐만 아니라 오디오 기반의 스트리밍 서비스도 꾸준하게 성장하고 있다.With the recent rapid spread of the Internet in the past terrestrial-centered broadcasting system, viewer demand for platforms that support broadcasting services through the Internet has increased explosively, and Internet-based streaming services are being actively provided, and video-based streaming services are being actively provided. In addition to streaming services, audio-based streaming services are also steadily growing.

또한, 오디오 기반의 스트리밍 서비스에서 녹화된 방송뿐만 아니라 호스트의 음성을 직접 듣고 실시간 소통을 원하는 청취자의 니즈에 따라 보이스 기반의 라이브 인터넷 방송도 활발하게 진행되고 있다. 보이스 기반의 라이브 인터넷 방송에서 방송 호스트가 오디오 기반으로 방송을 진행하고, 메신저 채팅 및 다자간 라이브 통화(call) 등을 통해 실시간 소통이 이루어지고 있다.In addition, voice-based live Internet broadcasting is being actively conducted in response to the needs of listeners who want to hear the host's voice directly and communicate in real time, as well as broadcasts recorded in audio-based streaming services. In voice-based live Internet broadcasting, the broadcast host broadcasts based on audio, and real-time communication takes place through messenger chatting and multi-party live calls.

방송 호스트의 경우 단독 혹은 소규모로 구성된 경우가 다수이므로, 방송 진행뿐만 아니라 방송과 관련된 콘텐츠를 별도로 제작하기에는 어려움이 따른다Since many broadcast hosts are single or small-scale, it is difficult not only to conduct broadcasts but also to separately produce broadcast-related content.

등록특허공보 제10-2388670호(등록일: 2022.04.15)Registered Patent Publication No. 10-2388670 (Registration date: 2022.04.15)

본 개시에 개시된 실시예는 보이스 기반의 라이브 인터넷 방송과 관련된 하이라이트 콘텐츠를 생성하는 방법을 제공하는데 그 목적이 있다.The purpose of the embodiments disclosed in this disclosure is to provide a method for generating highlight content related to voice-based live Internet broadcasting.

또한, 본 개시에 개시된 실시예는 보이스 기반의 라이브 인터넷 방송과 관련된 복수의 하이라이트 콘텐츠를 생성하여 방송 호스트에게 제공하고 방송 호스트의 취사선택에 따라 하이라이트 콘텐츠를 선정하는 방법을 제공하는데 그 목적이 있다.In addition, the purpose of the embodiment disclosed in the present disclosure is to provide a method of generating a plurality of highlight contents related to voice-based live Internet broadcasting, providing them to the broadcast host, and selecting the highlight content according to the broadcast host's selection.

또한, 본 개시에 개시된 실시예는 청취자가 선호하는 라이브 인터넷 방송과 관련된 하이라이트 콘텐츠를 제작하여 라이브 방송 참여를 독려하는 방법을 제공하는데 그 목적이 있다.Additionally, the purpose of the embodiment disclosed in the present disclosure is to provide a method of encouraging listeners to participate in live broadcasting by producing highlight content related to their preferred live Internet broadcasting.

본 개시가 해결하고자 하는 과제들은 이상에서 언급된 과제로 제한되지 않으며, 언급되지 않은 또 다른 과제들은 아래의 기재로부터 통상의 기술자에게 명확하게 이해될 수 있을 것이다.The problems to be solved by the present disclosure are not limited to the problems mentioned above, and other problems not mentioned can be clearly understood by those skilled in the art from the description below.

상술한 기술적 과제를 달성하기 위한 본 개시에 따른 보이스 기반의 라이브 인터넷 방송에 대한 하이라이트 콘텐츠를 생성하는 서버는, 보이스(voice) 기반의 라이브 인터넷 방송의 호스트 단말 및 청취자 단말과 통신하는 서버 통신부; 하이라이트 콘텐츠에서 사용되는 보강 데이터를 저장하는 서버 메모리; 및 하나 이상의 코어를 포함하는 서버 프로세서를 포함할 수 있다.A server that generates highlight content for voice-based live Internet broadcasting according to the present disclosure to achieve the above-described technical problem includes a server communication unit that communicates with a host terminal and a listener terminal of the voice-based live Internet broadcasting; Server memory to store enrichment data used in highlight content; and a server processor including one or more cores.

상기 서버 프로세서는, 상기 라이브 인터넷 방송의 방송 데이터를 소정의 카테고리로 구분하여 상기 서버 메모리에 저장하는 방송 데이터 관리 모듈; 상기 라이브 인터넷 방송에 대한 하나 이상의 하이라이트 콘텐츠를 생성하기 위한 트리거 조건을 설정하는 설정 모듈; 상기 설정된 트리거 조건을 만족하는 상기 라이브 인터넷 방송의 방송 구간을 결정하는 구간 결정 모듈; 및 상기 결정된 방송 구간의 방송 데이터 및 상기 보강 데이터를 조합하여 상기 라이브 인터넷 방송에 대한 상기 하이라이트 콘텐츠를 생성하는 생성 모듈을 포함할 수 있다.The server processor includes a broadcast data management module that divides broadcast data of the live Internet broadcast into predetermined categories and stores them in the server memory; a setting module that sets trigger conditions for generating one or more highlight contents for the live Internet broadcast; a section determination module that determines a broadcast section of the live Internet broadcast that satisfies the set trigger condition; and a creation module that generates the highlight content for the live Internet broadcast by combining the broadcast data of the determined broadcast section and the reinforcement data.

상기 서버 프로세서는, 상기 하이라이트 콘텐츠를 생성하여 상기 서버 통신부를 통해 상기 호스트 단말에 제공하며, 상기 하이라이트 콘텐츠 중 적어도 하나가 상기 호스트 단말에 의해 선택되는 경우, 상기 선택된 하이라이트 콘텐츠를 청취자 단말에 상기 서버 통신부를 통해 제공하도록 구성될 수 있다.The server processor generates the highlight content and provides it to the host terminal through the server communication unit, and when at least one of the highlight contents is selected by the host terminal, the selected highlight content is transmitted to the listener terminal through the server communication unit. It can be configured to provide through.

상기 방송 데이터 관리 모듈은, 상기 라이브 인터넷 방송의 방송 데이터를 상기 호스트의 보이스 데이터, 채팅 데이터, 방송 중 라이브 통화 데이터 및 후원 관련 데이터 중 적어도 하나의 카테고리로 구분하여 상기 서버 메모리에 저장하도록 구성될 수 있다.The broadcast data management module may be configured to divide broadcast data of the live Internet broadcast into at least one category of the host's voice data, chat data, live call data during broadcast, and sponsorship-related data and store them in the server memory. there is.

상기 설정 모듈은, 상기 라이브 인터넷 방송에서 채팅 진행 속도, 후원 청취자의 수 및 후원 청취자의 후원 금액, 특정 키워드의 사용 빈도수, 키워드에 함의된 특정 감정 상태, 청취자의 참여도, 호스트의 보이스 강도, 호스트의 발화 빈도 및 발화 시간 중 적어도 하나가 소정 수준 이상인 조건을 상기 트리거 조건으로 설정하도록 구성될 수 있다.The setting module includes the chat progress speed in the live Internet broadcast, the number of sponsored listeners and the sponsorship amount of sponsored listeners, the frequency of use of a specific keyword, a specific emotional state implied by the keyword, the listener's participation level, the host's voice intensity, and the host's voice intensity. It may be configured to set a condition in which at least one of the utterance frequency and ignition time is above a predetermined level as the trigger condition.

상기 설정 모듈은, 상기 호스트 단말에서 요청한 조건을 상기 하이라이트 콘텐츠를 생성하기 위한 트리거 조건으로 설정하도록 구성될 수 있다.The setting module may be configured to set a condition requested by the host terminal as a trigger condition for generating the highlight content.

상기 생성 모듈은, 상기 트리거 조건을 만족하는 구간이 복수인 경우, 복수의 구간 중 일부를 하나의 하이라이트 콘텐츠에 통합하되, 통합된 콘텐츠에서 상기 트리거 조건 간의 가중치에 기반하여 콘텐츠 내 재생 순서를 구간 별로 다르게 배치하도록 구성될 수 있다.If there are a plurality of sections satisfying the trigger conditions, the generation module integrates some of the multiple sections into one highlight content, and determines the playback order within the content for each section based on the weight between the trigger conditions in the integrated content. It can be configured to be deployed differently.

상기 보강 데이터는, 상기 라이브 인터넷 방송에서 실제 방송되거나 사용되지 않은 데이터이며, 상기 하이라이트 콘텐츠의 조회수를 높이기 위한 오디오 데이터, 영상 데이터, 세대 별 또는 상황 별 사용하는 언어 데이터, 라이브 인터넷 방송의 핫키워드 중 적어도 하나를 포함할 수 있다.The reinforcement data is data that is not actually broadcast or used in the live Internet broadcast, and includes audio data to increase the number of views of the highlight content, video data, language data used by generation or situation, and hot keywords of the live Internet broadcast. It can contain at least one.

또한, 상술한 기술적 과제를 달성하기 위한 본 개시에 따른 컴퓨팅 장치에 의해 수행되는 보이스 기반의 라이브 인터넷 방송에 대한 하이라이트 콘텐츠를 생성하는 방법은, 상기 라이브 인터넷 방송의 방송 데이터를 소정의 카테고리로 구분하여 저장하는 단계; 상기 라이브 인터넷 방송에 대한 하나 이상의 하이라이트 콘텐츠를 생성하기 위한 트리거 조건을 설정하는 단계; 상기 설정된 트리거 조건을 만족하는 상기 라이브 인터넷 방송의 방송 구간을 결정하는 단계; 및 상기 결정된 방송 구간의 방송 데이터 및 하이라이트 콘텐츠에서 사용하기 위해 미리 저장된 보강 데이터를 조합하여 상기 라이브 인터넷 방송에 대한 상기 하이라이트 콘텐츠를 생성하는 단계를 포함할 수 있다.In addition, a method of generating highlight content for a voice-based live Internet broadcast performed by a computing device according to the present disclosure to achieve the above-described technical problem includes dividing broadcast data of the live Internet broadcast into predetermined categories. saving; setting trigger conditions for generating one or more highlight contents for the live Internet broadcast; determining a broadcast section of the live Internet broadcast that satisfies the set trigger condition; and generating the highlight content for the live Internet broadcast by combining broadcast data of the determined broadcast section and reinforcement data pre-stored for use in the highlight content.

상기 생성 방법은, 상기 생성하는 단계 이후에, 상기 하이라이트 콘텐츠를 생성하여 호스트 단말에 제공하는 단계; 및 상기 하이라이트 콘텐츠 중 적어도 하나가 상기 호스트 단말에 의해 선택되는 경우, 상기 선택된 하이라이트 콘텐츠를 청취자 단말에 제공하는 단계를 더 포함할 수 있다.The generating method includes, after the generating step, generating the highlight content and providing it to a host terminal; and when at least one of the highlight contents is selected by the host terminal, providing the selected highlight contents to the listener terminal.

상기 설정하는 단계는, 상기 라이브 인터넷 방송에서 채팅 진행 속도, 후원 청취자의 수 및 후원 청취자의 후원 금액, 특정 키워드의 사용 빈도수, 키워드에 함의된 특정 감정 상태, 청취자의 참여도, 호스트의 보이스 강도, 호스트의 발화 빈도 및 발화 시간 중 적어도 하나가 소정 수준 이상인 조건을 상기 트리거 조건으로 설정하는 단계를 포함할 수 있다.The setting step includes the chat progress speed in the live Internet broadcast, the number of sponsored listeners and the sponsorship amount of sponsored listeners, the frequency of use of a specific keyword, a specific emotional state implied by the keyword, the listener's participation, the host's voice strength, It may include setting a condition in which at least one of the host's speech frequency and speech time is above a predetermined level as the trigger condition.

이 외에도, 본 개시를 구현하기 위한 방법을 실행하기 위해 컴퓨터 판독 가능한 기록 매체에 저장된 컴퓨터 프로그램이 더 제공될 수 있다.In addition to this, a computer program stored in a computer-readable recording medium may be further provided to execute the method for implementing the present disclosure.

이 외에도, 본 개시를 구현하기 위한 방법을 실행하기 위한 컴퓨터 프로그램을 기록하는 컴퓨터 판독 가능한 기록 매체가 더 제공될 수 있다.In addition, a computer-readable recording medium recording a computer program for executing a method for implementing the present disclosure may be further provided.

본 개시의 전술한 과제 해결 수단에 의하면, 보이스 기반의 라이브 인터넷 방송과 관련된 하이라이트 콘텐츠 제작에 대한 호스트의 부담이 줄어들 수 있으며, 하이라이트 콘텐츠를 시청하여 해당 라이브 인터넷 방송에 흥미를 느낀 청취자의 방송 참여 및 후원이 증가될 수 있어서, 사용자 편의가 제고될 수 있으며 라이브 인터넷 방송을 통한 수익이 증가하는 효과가 기대될 수 있다.According to the means for solving the above-described problem of the present disclosure, the burden on the host for producing highlight content related to voice-based live Internet broadcasting can be reduced, and listeners who are interested in the live Internet broadcast by watching the highlight content can participate in the broadcast and Sponsorship can increase, user convenience can be improved, and profits can be expected to increase through live Internet broadcasting.

본 개시의 효과들은 이상에서 언급된 효과로 제한되지 않으며, 언급되지 않은 또 다른 효과들은 아래의 기재로부터 통상의 기술자에게 명확하게 이해될 수 있을 것이다.The effects of the present disclosure are not limited to the effects mentioned above, and other effects not mentioned may be clearly understood by those skilled in the art from the description below.

도 1은 본 개시에 따른 보이스 기반의 라이브 인터넷 방송 시스템을 개략적으로 설명하기 위한 도면이다.
도 2는 본 개시에 따른 하이라이트 콘텐츠를 생성하는 서버 및 호스트 단말의 구성들을 나타내는 상대 블록도이다.
도 3은 본 개시에 따른 하이라이트 콘텐츠를 생성하는 서버의 프로세싱 과정을 나타내는 시퀀스도이다.
도 4는 본 개시에 따른 데이터를 구분하여 저장하는 하이라이트 콘텐츠를 생성하는 서버를 설명하기 위한 도면이다.
도 5는 본 개시에 따른 하이라이트 콘텐츠를 생성하기 위한 트리거 조건을 설명하기 위한 도면이다.
도 6은 본 개시에 따른 하이라이트 콘텐츠를 생성하는 서버의 호스트 단말에 의해 선택된 하이라이트 콘텐츠를 공유하는 과정을 설명하기 위한 도면이다.1 is a diagram schematically illustrating a voice-based live Internet broadcasting system according to the present disclosure.
Figure 2 is a relative block diagram showing the configurations of a server and a host terminal that generate highlight content according to the present disclosure.
Figure 3 is a sequence diagram showing the processing process of a server that generates highlight content according to the present disclosure.
FIG. 4 is a diagram illustrating a server that generates highlight content that separates and stores data according to the present disclosure.
FIG. 5 is a diagram illustrating trigger conditions for generating highlight content according to the present disclosure.
FIG. 6 is a diagram illustrating a process for sharing highlight content selected by a host terminal of a server generating highlight content according to the present disclosure.

본 개시 전체에 걸쳐 동일 참조 부호는 동일 구성요소를 지칭한다. 본 개시가 실시예들의 모든 요소들을 설명하는 것은 아니며, 본 개시가 속하는 기술분야에서 일반적인 내용 또는 실시예들 간에 중복되는 내용은 생략한다. 명세서에서 사용되는 '부, 모듈, 부재, 블록'이라는 용어는 소프트웨어 또는 하드웨어로 구현될 수 있으며, 실시예들에 따라 복수의 '부, 모듈, 부재, 블록'이 하나의 구성요소로 구현되거나, 하나의 '부, 모듈, 부재, 블록'이 복수의 구성요소들을 포함하는 것도 가능하다. Like reference numerals refer to like elements throughout this disclosure. The present disclosure does not describe all elements of the embodiments, and general content or overlapping content between the embodiments in the technical field to which the present disclosure pertains is omitted. The term 'unit, module, member, block' used in the specification may be implemented as software or hardware, and depending on the embodiment, a plurality of 'unit, module, member, block' may be implemented as a single component, or It is also possible for one 'part, module, member, or block' to include multiple components.

명세서 전체에서, 어떤 부분이 다른 부분과 "연결"되어 있다고 할 때, 이는 직접적으로 연결되어 있는 경우뿐 아니라, 간접적으로 연결되어 있는 경우를 포함하고, 간접적인 연결은 무선 통신망을 통해 연결되는 것을 포함한다.Throughout the specification, when a part is said to be “connected” to another part, this includes not only direct connection but also indirect connection, and indirect connection includes connection through a wireless communication network. do.

또한 어떤 부분이 어떤 구성요소를 "포함"한다고 할 때, 이는 특별히 반대되는 기재가 없는 한 다른 구성요소를 제외하는 것이 아니라 다른 구성요소를 더 포함할 수 있는 것을 의미한다.Additionally, when a part "includes" a certain component, this means that it may further include other components rather than excluding other components, unless specifically stated to the contrary.

명세서 전체에서, 어떤 부재가 다른 부재 "상에" 위치하고 있다고 할 때, 이는 어떤 부재가 다른 부재에 접해 있는 경우뿐 아니라 두 부재 사이에 또 다른 부재가 존재하는 경우도 포함한다.Throughout the specification, when a member is said to be located “on” another member, this includes not only cases where a member is in contact with another member, but also cases where another member exists between the two members.

제 1, 제 2 등의 용어는 하나의 구성요소를 다른 구성요소로부터 구별하기 위해 사용되는 것으로, 구성요소가 전술된 용어들에 의해 제한되는 것은 아니다. Terms such as first and second are used to distinguish one component from another component, and the components are not limited by the above-mentioned terms.

단수의 표현은 문맥상 명백하게 예외가 있지 않는 한, 복수의 표현을 포함한다.Singular expressions include plural expressions unless the context clearly makes an exception.

각 단계들에 있어 식별부호는 설명의 편의를 위하여 사용되는 것으로 식별부호는 각 단계들의 순서를 설명하는 것이 아니며, 각 단계들은 문맥상 명백하게 특정 순서를 기재하지 않는 이상 명기된 순서와 다르게 실시될 수 있다. The identification code for each step is used for convenience of explanation. The identification code does not explain the order of each step, and each step may be performed differently from the specified order unless a specific order is clearly stated in the context. there is.

이하 첨부된 도면들을 참고하여 본 개시의 작용 원리 및 실시예들에 대해 설명한다.Hereinafter, the operating principle and embodiments of the present disclosure will be described with reference to the attached drawings.

본 명세서에서 '본 개시에 따른 하이라이트 콘텐츠를 생성하는 서버'는 클라우드 시스템뿐만 아니라 연산처리를 수행하여 사용자에게 결과를 제공할 수 있는 다양한 장치들로 구현될 수 있다. 예를 들어, 본 개시에 따른 하이라이트 콘텐츠를 생성하는 서버는, 컴퓨터, 서버 장치 및 휴대용 단말기를 모두 포함하거나, 또는 어느 하나의 형태가 될 수 있다.In this specification, the 'server that generates highlight content according to the present disclosure' may be implemented not only by a cloud system but also by various devices that can perform computational processing and provide results to the user. For example, a server that generates highlight content according to the present disclosure may include all of a computer, a server device, and a portable terminal, or may be in any one form.

여기에서, 상기 컴퓨터는 예를 들어, 웹 브라우저(WEB Browser)가 탑재된 노트북, 데스크톱(desktop), 랩톱(laptop), 태블릿 PC, 슬레이트 PC 등을 포함할 수 있다.Here, the computer may include, for example, a laptop, desktop, laptop, tablet PC, slate PC, etc. equipped with a web browser.

상기 서버 장치는 외부 장치와 통신을 수행하여 정보를 처리하는 서버로써, 애플리케이션 서버, 컴퓨팅 서버, 데이터베이스 서버, 파일 서버, 게임 서버, 메일 서버, 프록시 서버 및 웹 서버 등을 포함할 수 있다.The server device is a server that processes information by communicating with external devices, and may include an application server, computing server, database server, file server, game server, mail server, proxy server, and web server.

상기 휴대용 단말기는 예를 들어, 휴대성과 이동성이 보장되는 무선 통신 장치로서, PCS(Personal Communication System), GSM(Global System for Mobile communications), PDC(Personal Digital Cellular), PHS(Personal Handyphone System), PDA(Personal Digital Assistant), IMT(International Mobile Telecommunication)-2000, CDMA(Code Division Multiple Access)-2000, W-CDMA(W-Code Division Multiple Access), WiBro(Wireless Broadband Internet) 단말, 스마트 폰(Smart Phone) 등과 같은 모든 종류의 핸드헬드(Handheld) 기반의 무선 통신 장치와 시계, 반지, 팔찌, 발찌, 목걸이, 안경, 콘택트 렌즈, 또는 머리 착용형 장치(head-mounted-device(HMD) 등과 같은 웨어러블 장치를 포함할 수 있다.The portable terminal is, for example, a wireless communication device that guarantees portability and mobility, such as PCS (Personal Communication System), GSM (Global System for Mobile communications), PDC (Personal Digital Cellular), PHS (Personal Handyphone System), and PDA. (Personal Digital Assistant), IMT (International Mobile Telecommunication)-2000, CDMA (Code Division Multiple Access)-2000, W-CDMA (W-Code Division Multiple Access), WiBro (Wireless Broadband Internet) terminal, smart phone ), all types of handheld wireless communication devices, and wearable devices such as watches, rings, bracelets, anklets, necklaces, glasses, contact lenses, or head-mounted-device (HMD). may include.

본 개시에 따른 인공 지능과 관련된 기능은 프로세서와 메모리를 통해 동작된다. 프로세서는 하나 또는 복수의 프로세서로 구성될 수 있다. 이때, 하나 또는 복수의 프로세서는 CPU, AP, DSP(Digital Signal Processor) 등과 같은 범용 프로세서, GPU, VPU(Vision Processing Unit)와 같은 그래픽 전용 프로세서 또는 NPU와 같은 인공 지능 전용 프로세서일 수 있다. 하나 또는 복수의 프로세서는, 메모리에 저장된 기 정의된 동작 규칙 또는 인공 지능 모델에 따라, 입력 데이터를 처리하도록 제어한다. 또는, 하나 또는 복수의 프로세서가 인공 지능 전용 프로세서인 경우, 인공 지능 전용 프로세서는, 특정 인공 지능 모델의 처리에 특화된 하드웨어 구조로 설계될 수 있다.Functions related to artificial intelligence according to the present disclosure are operated through a processor and memory. The processor may consist of one or multiple processors. At this time, one or more processors may be a general-purpose processor such as a CPU, AP, or DSP (Digital Signal Processor), a graphics-specific processor such as a GPU or VPU (Vision Processing Unit), or an artificial intelligence-specific processor such as an NPU. One or more processors control input data to be processed according to predefined operation rules or artificial intelligence models stored in memory. Alternatively, when one or more processors are dedicated artificial intelligence processors, the artificial intelligence dedicated processors may be designed with a hardware structure specialized for processing a specific artificial intelligence model.

기 정의된 동작 규칙 또는 인공 지능 모델은 학습을 통해 만들어진 것을 특징으로 한다. 여기서, 학습을 통해 만들어진다는 것은, 기본 인공 지능 모델이 학습 알고리즘에 의하여 다수의 학습 데이터들을 이용하여 학습됨으로써, 원하는 특성(또는, 목적)을 수행하도록 설정된 기 정의된 동작 규칙 또는 인공 지능 모델이 만들어짐을 의미한다. 이러한 학습은 본 개시에 따른 인공 지능이 수행되는 기기 자체에서 이루어질 수도 있고, 별도의 서버 및/또는 시스템을 통해 이루어 질 수도 있다. 학습 알고리즘의 예로는, 지도형 학습(supervised learning), 비지도형 학습(unsupervised learning), 준지도형 학습(semi-supervised learning) 또는 강화 학습(reinforcement learning)이 있으나, 전술한 예에 한정되지 않는다.Predefined operation rules or artificial intelligence models are characterized by being created through learning. Here, created through learning means that a basic artificial intelligence model is learned using a large number of learning data by a learning algorithm, thereby creating a predefined operation rule or artificial intelligence model set to perform the desired characteristics (or purpose). It means burden. This learning may be accomplished in the device itself that performs artificial intelligence according to the present disclosure, or may be accomplished through a separate server and/or system. Examples of learning algorithms include supervised learning, unsupervised learning, semi-supervised learning, or reinforcement learning, but are not limited to the examples described above.

인공 지능 모델은, 복수의 신경망 레이어들로 구성될 수 있다. 복수의 신경망 레이어들 각각은 복수의 가중치들 (weight values)을 갖고 있으며, 이전(previous) 레이어의 연산 결과와 복수의 가중치들 간의 연산을 통해 신경 망 연산을 수행한다. 복수의 신경망 레이어들이 갖고 있는 복수의 가중치들은 인공 지능 모델의 학습 결과에 의해 최적화될 수 있다. 예를 들어, 학습 과정 동안 인공 지능 모델에서 획득한 로스(loss) 값 또는 코스트(cost) 값이 감소 또는 최소화되도록 복수의 가중치들이 갱신될 수 있다. 인공 신경망은 심층 신경망(DNN:Deep Neural Network)를 포함할 수 있으며, 예를 들어, CNN (Convolutional Neural Network), DNN (Deep Neural Network), RNN (Recurrent Neural Network), RBM (Restricted Boltzmann Machine), DBN (Deep Belief Network), BRDNN(Bidirectional Recurrent Deep Neural Network) 또는 심층 Q-네트워크 (Deep Q-Networks) 등이 있으나, 전술한 예에 한정되지 않는다.An artificial intelligence model may be composed of multiple neural network layers. Each of the plurality of neural network layers has a plurality of weight values, and neural network calculation is performed through calculation between the calculation result of the previous layer and the plurality of weights. Multiple weights of multiple neural network layers can be optimized by the learning results of the artificial intelligence model. For example, a plurality of weights may be updated so that loss or cost values obtained from the artificial intelligence model are reduced or minimized during the learning process. Artificial neural networks may include deep neural networks (DNN), for example, Convolutional Neural Network (CNN), Deep Neural Network (DNN), Recurrent Neural Network (RNN), Restricted Boltzmann Machine (RBM), Deep Belief Network (DBN), Bidirectional Recurrent Deep Neural Network (BRDNN), or Deep Q-Networks, etc., but are not limited to the examples described above.

프로세서는 뉴럴 네트워크를 생성하거나, 뉴럴 네트워크를 훈련(train, 또는 학습(learn)하거나, 수신되는 입력 데이터를 기초로 연산을 수행하고, 수행 결과를 기초로 정보 신호(information signal)를 생성하거나, 뉴럴 네트워크를 재훈련(retrain)할 수 있다. The processor creates a neural network, trains or learns a neural network, performs calculations based on received input data, generates an information signal based on the results, or generates a neural network. The network can be retrained.

뉴럴 네트워크는 CNN(Convolutional Neural Network), RNN(Recurrent Neural Network), 퍼셉트론(perceptron), 다층 퍼셉트론(multilayer perceptron), FF(Feed Forward), RBF(Radial Basis Network), DFF(Deep Feed Forward), LSTM(Long Short Term Memory), GRU(Gated Recurrent Unit), AE(Auto Encoder), VAE(Variational Auto Encoder), DAE(Denoising Auto Encoder), SAE(Sparse Auto Encoder), MC(Markov Chain), HN(Hopfield Network), BM(Boltzmann Machine), RBM(Restricted Boltzmann Machine), DBN(Depp Belief Network), DCN(Deep Convolutional Network), DN(Deconvolutional Network), DCIGN(Deep Convolutional Inverse Graphics Network), GAN(Generative Adversarial Network), LSM(Liquid State Machine), ELM(Extreme Learning Machine), ESN(Echo State Network), DRN(Deep Residual Network), DNC(Differentiable Neural Computer), NTM(Neural Turning Machine), CN(Capsule Network), KN(Kohonen Network) 및 AN(Attention Network)를 포함할 수 있으나 이에 한정되는 것이 아닌 임의의 뉴럴 네트워크를 포함할 수 있음은 통상의 기술자가 이해할 것이다.Neural networks include CNN (Convolutional Neural Network), RNN (Recurrent Neural Network), perceptron, multilayer perceptron, FF (Feed Forward), RBF (Radial Basis Network), DFF (Deep Feed Forward), and LSTM. (Long Short Term Memory), GRU (Gated Recurrent Unit), AE (Auto Encoder), VAE (Variational Auto Encoder), DAE (Denoising Auto Encoder), SAE (Sparse Auto Encoder), MC (Markov Chain), HN (Hopfield) Network), BM (Boltzmann Machine), RBM (Restricted Boltzmann Machine), DBN (Depp Belief Network), DCN (Deep Convolutional Network), DN (Deconvolutional Network), DCIGN (Deep Convolutional Inverse Graphics Network), GAN (Generative Adversarial Network) ), Liquid State Machine (LSM), Extreme Learning Machine (ELM), Echo State Network (ESN), Deep Residual Network (DRN), Differential Neural Computer (DNC), Neural Turning Machine (NTM), Capsule Network (CN), Those skilled in the art will understand that it may include any neural network, including, but not limited to, KN (Kohonen Network) and AN (Attention Network).

본 개시의 예시적인 실시예에 따르면, 프로세서는 GoogleNet, AlexNet, VGG Network 등과 같은 CNN(Convolution Neural Network), R-CNN(Region with Convolution Neural Network), RPN(Region Proposal Network), RNN(Recurrent Neural Network), S-DNN(Stacking-based deep Neural Network), S-SDNN(State-Space Dynamic Neural Network), Deconvolution Network, DBN(Deep Belief Network), RBM(Restrcted Boltzman Machine), Fully Convolutional Network, LSTM(Long Short-Term Memory) Network, Classification Network, Generative Modeling, eXplainable AI, Continual AI, Representation Learning, AI for Material Design, 자연어 처리를 위한 BERT, SP-BERT, MRC/QA, Text Analysis, Dialog System, GPT-3, GPT-4, 비전 처리를 위한 Visual Analytics, Visual Understanding, Video Synthesis, ResNet 데이터 지능을 위한 Anomaly Detection, Prediction, Time-Series Forecasting, Optimization, Recommendation, Data Creation 등 다양한 인공 지능 구조 및 알고리즘을 이용할 수 있으며, 이에 제한되지 않는다. 이하, 첨부된 도면을 참조하여 본 개시의 실시예를 상세하게 설명한다.According to an exemplary embodiment of the present disclosure, the processor may support a Convolution Neural Network (CNN), a Region with Convolution Neural Network (R-CNN), a Region Proposal Network (RPN), a Recurrent Neural Network (RNN), such as GoogleNet, AlexNet, VGG Network, etc. ), S-DNN (Stacking-based deep Neural Network), S-SDNN (State-Space Dynamic Neural Network), Deconvolution Network, DBN (Deep Belief Network), RBM (Restrcted Boltzman Machine), Fully Convolutional Network, LSTM (Long Short-Term Memory) Network, Classification Network, Generative Modeling, eXplainable AI, Continual AI, Representation Learning, AI for Material Design, BERT for natural language processing, SP-BERT, MRC/QA, Text Analysis, Dialog System, GPT-3 , GPT-4, Visual Analytics for vision processing, Visual Understanding, Video Synthesis, and Anomaly Detection, Prediction, Time-Series Forecasting, Optimization, Recommendation, and Data Creation for ResNet data intelligence. , but is not limited to this. Hereinafter, embodiments of the present disclosure will be described in detail with reference to the attached drawings.

도 1은 본 개시에 따른 보이스 기반의 라이브 인터넷 방송 시스템(1000)을 개략적으로 설명하기 위한 도면이다.1 is a diagram schematically illustrating a voice-based live Internet broadcasting system 1000 according to the present disclosure.

라이브 인터넷 방송 시스템(1000)은 보이스(voice) 기반의 라이브 인터넷 방송을 제공하기 위해 필요한 구성들을 포함할 수 있는데, 라이브 인터넷 방송 시스템(1000)은 보이스 기반의 라이브 인터넷 방송에 대한 하이라이트 콘텐츠를 생성하는 서버(100, 이하 '하이라이트 콘텐츠 생성 서버'라 칭함), 라이브 인터넷 방송의 호스트(H)에 대응하는 호스트 단말(200) 및 라이브 인터넷 방송의 청취자(L) 각각에 대응하는 청취자 단말(300)을 포함할 수 있다.The live Internet broadcasting system 1000 may include components necessary to provide voice-based live Internet broadcasting. The live Internet broadcasting system 1000 generates highlight content for voice-based live Internet broadcasting. A server (100, hereinafter referred to as 'highlight content creation server'), a host terminal (200) corresponding to the host (H) of the live Internet broadcast, and a listener terminal (300) corresponding to the listener (L) of the live Internet broadcast. It can be included.

하이라이트 콘텐츠 생성 서버(100)는 보이스 기반의 라이브 인터넷 방송을 위한 방송 데이터를 호스트 단말(200)로부터 수신하여, 청취자 단말(300)에 제공함으로써, 라이브 인터넷 방송이 청취자(L)에게 제공되게 할 수 있다(S11). The highlight content creation server 100 receives broadcast data for voice-based live Internet broadcasting from the host terminal 200 and provides it to the listener terminal 300, so that the live Internet broadcast can be provided to the listener (L). There is (S11).

실시예에서, 하이라이트 콘텐츠 생성 서버(100)는 서버 및 클라이언트 구조의 서버 기능뿐만 아니라, 클라우드로도 구현될 수 있다. 이 경우, 하이라이트 콘텐츠 생성 서버(100)는 스토리지, 서버, 가상화 모듈/시스템, 운영체제, 미들웨어, 런타임 모듈, 데이터 및 애플리케이션 등을 모두 구비할 수 있다. 이에, 클라이언트 단의 호스트 단말(200) 및 청취자 단말(300)은 서비스 애플리케이션만 구비하면, 클라우드 시스템이 제공하는 다양한 기능을 이용할 수 있으며, 사용량에 따라 과금될 수 있다. 가령, 하이라이트 콘텐츠 생성 서버(100)는 SaaS(Server as a software) 기반으로 구현될 수 있으나, 실시예가 이에 한정되는 것은 아니다.In an embodiment, the highlight content creation server 100 may be implemented not only as a server function in a server and client structure, but also as a cloud. In this case, the highlight content creation server 100 may be equipped with storage, servers, virtualization modules/systems, operating systems, middleware, runtime modules, data, and applications. Accordingly, the host terminal 200 and the listener terminal 300 at the client end can use various functions provided by the cloud system as long as they are equipped with a service application, and can be charged according to the amount of usage. For example, the highlight content creation server 100 may be implemented based on SaaS (Server as a software), but the embodiment is not limited to this.

하이라이트 콘텐츠 생성 서버(100)는 호스트(H)와 청취자(L)가 소통하도록 메신저 및 다자 간 인터넷 콜(call) 기능을 호스트 단말(200) 및 청취자 단말(300)에 제공할 수 있다(S13, S15).The highlight content creation server 100 may provide a messenger and multi-party Internet call function to the host terminal 200 and the listener terminal 300 so that the host (H) and the listener (L) can communicate (S13, S15).

하이라이트 콘텐츠 생성 서버(100)는 호스트(H)가 보이스 기반의 라이브 인터넷 방송을 진행하는 중에 또는 방송이 완료된 이후에, 라이브 인터넷 방송과 관련된 하이라이트 콘텐츠를 제작(생성)하여, 호스트 단말(200) 및 청취자 단말(300)에 제공할 수 있다(S17, S19).The highlight content creation server 100 produces (generates) highlight content related to the live Internet broadcast while the host (H) is conducting a voice-based live Internet broadcast or after the broadcast is completed, and provides the host terminal 200 and It can be provided to the listener terminal 300 (S17, S19).

여기서, 하이라이트 콘텐츠는 라이브 방송에서 실제 사용된 콘텐츠뿐만 아니라, 라이브 방송에서 사용되지 않은 데이터, 하이라이트 콘텐츠의 가치를 높여주거나 흥미롭고 매력적으로 만들거나 조회수를 높이기 위한 오디오 데이터, 영상 데이터, 세대 별 또는 상황 별 사용하는 언어 데이터, 라이브 인터넷 방송의 핫키워드, 문자, 부호 등을 다양하게 포함하는 콘텐츠로 구현될 수 있다. 이에, 하이라이트 콘텐츠는 보이스 기반으로 생성될 수 있으며, 동영상 기반으로도 생성될 수 있다.Here, highlight content is not only content actually used in the live broadcast, but also data not used in the live broadcast, audio data, video data, generational or situational data to increase the value of the highlight content, make it interesting and attractive, or increase the number of views. It can be implemented as content that includes a variety of language data, hot keywords from live Internet broadcasts, letters, and symbols. Accordingly, highlight content can be generated based on voice and can also be generated based on video.

도 2는 본 개시에 따른 하이라이트 콘텐츠 생성 서버(100) 및 호스트 단말(200)의 구성들을 나타내는 상대 블록도이다.FIG. 2 is a relative block diagram showing the configurations of the highlight content creation server 100 and the host terminal 200 according to the present disclosure.

먼저, 본 개시에 따른 하이라이트 콘텐츠 생성 서버(100)는 서버 통신부(110), 서버 입력부(120), 서버 디스플레이(130), 서버 메모리(150) 및 적어도 하나의 프로세서(190)를 포함할 수 있다. 도 2에 도시된 하이라이트 콘텐츠 생성 서버(100)의 구성요소들은 본 개시에 따른 하이라이트 콘텐츠 생성 서버(100)를 구현하는데 있어서 필수적인 것은 아니어서, 본 명세서 상에서 설명되는 하이라이트 콘텐츠 생성 서버(100)는 위에서 열거된 구성요소들 보다 많거나, 또는 적은 구성요소들을 가질 수 있다.First, the highlight content creation server 100 according to the present disclosure may include a server communication unit 110, a server input unit 120, a server display 130, a server memory 150, and at least one processor 190. . The components of the highlight content creation server 100 shown in FIG. 2 are not essential for implementing the highlight content creation server 100 according to the present disclosure, so the highlight content creation server 100 described above is It may have more or fewer components than those listed.

상기 구성요소들 중 서버 통신부(110)는 외부 장치(호스트 단말 및 청취자 단말 등을 포함)와 통신을 가능하게 하는 하나 이상의 구성 요소를 포함할 수 있으며, 예를 들어, 방송 수신 모듈, 유선통신 모듈, 무선통신 모듈, 근거리 통신 모듈, 위치정보 모듈 중 적어도 하나를 포함할 수 있다. 서버 통신부(110)는 보이스(voice) 기반의 라이브 인터넷 방송의 호스트 단말(200) 및 청취자 단말(300)과 통신할 수 있다.Among the above components, the server communication unit 110 may include one or more components that enable communication with external devices (including host terminals and listener terminals, etc.), for example, a broadcast reception module and a wired communication module. , it may include at least one of a wireless communication module, a short-range communication module, and a location information module. The server communication unit 110 can communicate with the host terminal 200 and the listener terminal 300 of a voice-based live Internet broadcast.

서버 입력부(120)는 영상 정보(또는 신호), 오디오 정보(또는 신호), 데이터, 또는 사용자로부터 입력되는 정보의 입력을 위한 것으로서, 적어도 하나의 카메라, 적어도 하나의 마이크로폰 및 사용자 입력부 중 적어도 하나를 포함할 수 있다. 서버 입력부(120)에서 수집한 음성 데이터나 이미지 데이터는 분석되어 사용자의 제어명령으로 처리될 수 있다.The server input unit 120 is for inputting image information (or signal), audio information (or signal), data, or information input from a user, and includes at least one of at least one camera, at least one microphone, and a user input unit. It can be included. Voice data or image data collected from the server input unit 120 may be analyzed and processed as a user's control command.

서버 디스플레이(130)는 하이라이트 콘텐츠 생성 서버(100)에서 처리되는 정보를 표시(출력)한다. 예를 들어, 서버 디스플레이(130)는 하이라이트 콘텐츠 생성 서버(100)에서 구동되는 응용 프로그램(일 예로, 어플리케이션)의 실행화면 정보, 또는 이러한 실행화면 정보에 따른 UI(User Interface), GUI(Graphic User Interface) 정보를 표시할 수 있다. The server display 130 displays (outputs) information processed by the highlight content creation server 100. For example, the server display 130 displays execution screen information of an application (eg, an application) running on the highlight content creation server 100, or a user interface (UI) or graphic user interface (GUI) according to such execution screen information. Interface) information can be displayed.

서버 메모리(150)는 하이라이트 콘텐츠 생성 서버(100)의 다양한 기능을 지원하는 데이터와, 제어부의 동작을 위한 프로그램을 저장할 수 있고, 입/출력되는 데이터들(예를 들어, 음악 파일, 정지영상, 동영상 등)을 저장할 있고, 하이라이트 콘텐츠 생성 서버(100)에서 구동되는 다수의 응용 프로그램(application program 또는 애플리케이션(application)), 본 장치의 동작을 위한 데이터들, 명령어들을 저장할 수 있다. 이러한 응용 프로그램 중 적어도 일부는, 무선 통신을 통해 외부 서버로부터 다운로드 될 수 있다. 특히, 서버 메모리(150)는 하이라이트 콘텐츠 제작에 사용되는 보강 데이터를 저장할 수 있다.The server memory 150 can store data supporting various functions of the highlight content creation server 100 and a program for the operation of the control unit, and can store input/output data (e.g., music files, still images, video, etc.), a plurality of application programs (application programs or applications) running on the highlight content creation server 100, data for operation of the device, and commands can be stored. At least some of these applications may be downloaded from an external server via wireless communication. In particular, the server memory 150 may store reinforcement data used to create highlight content.

이러한, 서버 메모리(150)는 플래시 메모리 타입(flash memory type), 하드디스크 타입(hard disk type), SSD 타입(Solid State Disk type), SDD 타입(Silicon Disk Drive type), 멀티미디어 카드 마이크로 타입(multimedia card micro type), 카드 타입의 메모리(예를 들어 SD 또는 XD 메모리 등), 램(random access memory; RAM), SRAM(static random access memory), 롬(read-only memory; ROM), EEPROM(electrically erasable programmable read-only memory), PROM(programmable read-only memory), 자기 메모리, 자기 디스크 및 광디스크 중 적어도 하나의 타입의 저장매체를 포함할 수 있다. 또한, 서버 메모리(150)는 하이라이트 콘텐츠 생성 서버(100)와는 분리되어 있으나, 유선 또는 무선으로 연결된 데이터베이스가 될 수도 있으며, 데이터 베이스 시스템으로 구현될 수도 있다. 서버 메모리(150)는 하이라이트 콘텐츠에서 사용되는 상술한 보강 데이터를 저장할 수 있다. The server memory 150 is of flash memory type, hard disk type, solid state disk type, SDD type (Silicon Disk Drive type), and multimedia card micro type. card micro type), card type memory (e.g. SD or XD memory, etc.), random access memory (RAM), static random access memory (SRAM), read-only memory (ROM), EEPROM (electrically It may include at least one type of storage medium among erasable programmable read-only memory (PROM), programmable read-only memory (PROM), magnetic memory, magnetic disk, and optical disk. In addition, the server memory 150 is separate from the highlight content creation server 100, but may be a database connected by wire or wirelessly, and may be implemented as a database system. The server memory 150 may store the above-described reinforcement data used in highlight content.

서버 프로세서(190)는 적어도 하나의 코어를 포함하며, 하이라이트 콘텐츠 생성 서버(100) 내의 구성요소들의 동작을 제어하기 위한 알고리즘 또는 알고리즘을 재현한 프로그램에 대한 데이터를 저장하는 메모리, 및 메모리에 저장된 데이터를 이용하여 전술한 동작을 수행하는 적어도 하나의 프로세서(미도시)로 구현될 수 있다. 이때, 메모리와 프로세서는 각각 별개의 칩으로 구현될 수 있다. 또는, 메모리와 프로세서는 단일 칩으로 구현될 수도 있다.The server processor 190 includes at least one core, a memory that stores data for an algorithm for controlling the operation of components in the highlight content creation server 100 or a program that reproduces the algorithm, and data stored in the memory. It can be implemented with at least one processor (not shown) that performs the above-described operations using . At this time, the memory and processor may each be implemented as separate chips. Alternatively, the memory and processor may be implemented as a single chip.

서버 프로세서(190)는 라이브 인터넷 방송의 방송 데이터를 소정의 카테고리로 구분하여 서버 메모리(150)에 저장하는 방송 데이터 관리 모듈(191), 라이브 인터넷 방송에 대한 하나 이상의 하이라이트 콘텐츠를 생성하기 위한 트리거 조건을 설정하는 설정 모듈(193), 설정된 트리거 조건을 만족하는 라이브 인터넷 방송의 방송 구간을 결정하는 구간 결정 모듈(195) 및 결정된 방송 구간의 방송 데이터 및 보강 데이터를 조합하여 라이브 인터넷 방송에 대한 하이라이트 콘텐츠를 생성하는 생성 모듈(197)을 포함할 수 있다.The server processor 190 includes a broadcast data management module 191 that divides broadcast data of live Internet broadcasts into predetermined categories and stores them in the server memory 150, and trigger conditions for generating one or more highlight contents for live Internet broadcasts. A setting module 193 for setting, a section determination module 195 for determining the broadcast section of the live Internet broadcast that satisfies the set trigger conditions, and a highlight content for the live Internet broadcast by combining the broadcast data and reinforcement data of the determined broadcast section. It may include a generation module 197 that generates.

또한, 서버 프로세서(190)는 이하의 도 3 내지 도 6에서 설명되는 본 개시에 따른 다양한 실시 예들을 하이라이트 콘텐츠 생성 서버(100) 상에서 구현하기 위하여, 위에서 살펴본 구성요소들을 중 어느 하나 또는 복수를 조합하여 제어할 수 있다. In addition, the server processor 190 combines any one or a plurality of the components described above in order to implement various embodiments according to the present disclosure described in FIGS. 3 to 6 below on the highlight content creation server 100. You can control it.

도 2에 도시된 호스트 단말(200)은 단말 통신부(210), 단말 입력부(220), 단말 디스플레이(230), 단말 메모리(250) 및 단말 프로세서(290)를 포함할 수 있다. 도 2에 도시된 호스트 단말(200)의 구성요소들은 본 개시에 따른 호스트 단말(200)를 구현하는데 있어서 필수적인 것은 아니어서, 본 명세서 상에서 설명되는 호스트 단말(200)은 위에서 열거된 구성요소들 보다 많거나, 또는 적은 구성요소들을 가질 수 있다.The host terminal 200 shown in FIG. 2 may include a terminal communication unit 210, a terminal input unit 220, a terminal display 230, a terminal memory 250, and a terminal processor 290. The components of the host terminal 200 shown in FIG. 2 are not essential for implementing the host terminal 200 according to the present disclosure, so the host terminal 200 described in this specification has more components than the components listed above. It may have many or few components.

상기 하이라이트 콘텐츠 생성 서버(100)의 구성들과 차이점을 중심으로 설명하면, 단말 입력부(220)는 호스트(H)의 보이스(voice)를 입력받기 위한 마이크로폰을 포함할 수 있으나, 실시예가 이에 한정되는 것은 아니다.When explaining the configurations and differences of the highlight content creation server 100, the terminal input unit 220 may include a microphone for receiving the voice of the host H, but the embodiment is limited to this. That is not the case.

단말 디스플레이(230)는 보이스 기반의 라이브 인터넷 방송의 메신저 채팅을 출력할 수 있으며, 라이브 인터넷 방송을 위한 다양한 인터페이스를 출력할 수 있다.The terminal display 230 can output voice-based messenger chatting for live Internet broadcasting and various interfaces for live Internet broadcasting.

실시예에서, 단말 프로세서(290)는 호스트(H)의 발화 내용을 인식하고 필터링하는 필터링 모듈을 포함할 수 있는데, 필터링 모듈은 호스트(H)의 발화 내용을 STT(speech to text) 기법에 기초하여 음성(voice)를 텍스트화하고 필터링 대상 문구(가령, 비속어 또는 욕설 등을 포함)를 단말 디스플레이(230) 상에 하이라이트하여 표시할 수 있으며, 송출되는 호스트(H)의 보이스 신호에서도 해당 필터링 대상 문구를 가공(가령, 특정 사운드로 대체하거나 출력을 낮추는 방법이 있으나, 실시예가 이에 한정되는 것은 아님)하여 하이라이트 콘텐츠 생성 서버(100)로 제공할 수 있다. 이때, 단말 프로세서(290)는 호스트(H)가 필터링 대상 문구 사용한 것을 인식할 수 있도록 단말 디스플레이(230)나 스피커를 통해 경고 알림을 제공할 수 있다.In an embodiment, the terminal processor 290 may include a filtering module that recognizes and filters the content of the host H's speech, and the filtering module recognizes the content of the host H's speech based on a speech to text (STT) technique. By converting the voice into text, filtering target phrases (including, for example, profanity or swear words, etc.) can be highlighted and displayed on the terminal display 230, and the filtering target can also be displayed in the transmitted voice signal of the host (H). The phrase may be processed (for example, there is a method of replacing it with a specific sound or lowering the output, but the embodiment is not limited thereto) and provided to the highlight content creation server 100. At this time, the terminal processor 290 may provide a warning notification through the terminal display 230 or speaker so that the host H can recognize that the filtering target phrase has been used.

도 2에 도시된 구성 요소들의 성능에 대응하여 적어도 하나의 구성요소가 추가되거나 삭제될 수 있다. 또한, 구성 요소들의 상호 위치는 시스템의 성능 또는 구조에 대응하여 변경될 수 있다는 것은 당해 기술 분야에서 통상의 지식을 가진 자에게 용이하게 이해될 것이다.At least one component may be added or deleted in response to the performance of the components shown in FIG. 2. Additionally, it will be easily understood by those skilled in the art that the mutual positions of the components may be changed in response to the performance or structure of the system.

한편, 도 2에서 도시된 각각의 구성요소는 소프트웨어 및/또는 Field Programmable Gate Array(FPGA) 및 주문형 반도체(ASIC, Application Specific Integrated Circuit)와 같은 하드웨어 구성요소를 의미한다. Meanwhile, each component shown in FIG. 2 refers to software and/or hardware components such as Field Programmable Gate Array (FPGA) and Application Specific Integrated Circuit (ASIC).

도 3은 본 개시에 따른 하이라이트 콘텐츠 생성 서버(100)의 프로세싱 과정을 나타내는 시퀀스도이다. 도 4는 본 개시에 따른 데이터를 구분하여 저장하는 하이라이트 콘텐츠 생성 서버(100)를 설명하기 위한 도면이고, 도 5는 본 개시에 따른 하이라이트 콘텐츠를 생성하기 위한 트리거 조건을 설명하기 위한 도면이다. 도 3을 설명하면서 도 4 및 도 5를 필요한 부분에서 함께 참조하여 설명하기로 한다.Figure 3 is a sequence diagram showing the processing process of the highlight content creation server 100 according to the present disclosure. FIG. 4 is a diagram for explaining the highlight content creation server 100 that separates and stores data according to the present disclosure, and FIG. 5 is a diagram for explaining trigger conditions for generating highlight content according to the present disclosure. While explaining FIG. 3, FIGS. 4 and 5 will be described with reference to FIGS. 4 and 5 where necessary.

우선, 서버 프로세서(190)의 방송 데이터 관리 모듈(191)은 보이스 기반 라이브 인터넷 방송의 방송 데이터를 소정의 카테고리로 구분하여 저장할 수 있다(S310).First, the broadcast data management module 191 of the server processor 190 can store broadcast data of voice-based live Internet broadcasting by dividing it into predetermined categories (S310).

도 4를 참고하면, 방송 데이터 관리 모듈(191)은 방송 데이터(BD) 및 보강 데이터(VD)를 구분하여 저장할 수 있다. Referring to FIG. 4, the broadcast data management module 191 can separately store broadcast data (BD) and reinforcement data (VD).

방송 데이터 관리 모듈(191)은 라이브 인터넷 방송의 방송 데이터(BD)를, 호스트(H)의 보이스 데이터(BD1), 채팅 데이터(BD2), 방송 중 라이브 통화 데이터(BD3) 및 후원 관련 데이터(BD4) 중 적어도 하나의 카테고리로 구분하여 서버 메모리(150)에 저장할 수 있으며, 방송 데이터 관리 모듈(191)은 다양한 분석 데이터(BD5)를 서버 메모리(150)에 별도로 저장할 수 있다.The broadcast data management module 191 includes broadcast data (BD) of a live Internet broadcast, voice data (BD1) of the host (H), chat data (BD2), live call data during broadcast (BD3), and sponsorship-related data (BD4). ) can be divided into at least one category and stored in the server memory 150, and the broadcast data management module 191 can separately store various analysis data (BD5) in the server memory 150.

방송 데이터 관리 모듈(191)은 보이스 데이터(BD1)를 소정의 타임 구간 별로 별도로 저장할 수 있다. 방송 데이터 관리 모듈(191)은 채팅 데이터(BD2)를 호스트(H) 및 청취자(L)로 구분하여 저장할 수 있으며, 채팅이 아닌 기본 템플릿(호스트(H)나 청취자(L)가 입력하지 않은 문구, 이미지 등)을 구분하여 저장할 수 있다.The broadcast data management module 191 may separately store voice data BD1 for each predetermined time section. The broadcast data management module 191 can store chat data (BD2) by dividing it into host (H) and listener (L), and can store non-chat basic templates (phrases not entered by the host (H) or listener (L)). , images, etc.) can be stored separately.

방송 데이터 관리 모듈(191)은 방송 중 라이브 통화 데이터(BD3) 중에서 호스트(H)와 청취자(L)의 발화 내용을 별도로 저장할 수 있으며, STT 알고리즘에 기반하여, 텍스트로 변환하여 서버 메모리(150)에 저장할 수 있다.The broadcast data management module 191 can separately store the speech contents of the host (H) and the listener (L) among the live call data (BD3) during the broadcast, and convert them into text based on the STT algorithm to store them in the server memory 150. It can be saved in .

방송 데이터 관리 모듈(191)은 후원 관련 데이터(BD4)를 저장할 수 있는데, 후원 청취자의 수 및 후원 청취자의 후원 금액 등에 대한 정보를 저장할 수 있다.The broadcast data management module 191 can store sponsorship-related data (BD4), including information on the number of sponsored listeners and the sponsorship amount of sponsored listeners.

방송 데이터 관리 모듈(191)은 다양한 분석 데이터(BD5)를 서버 메모리(150)에 저장할 수 있는데, 분석된 데이터(BD5)는 특정 키워드의 사용 빈도수, 키워드에 함의된 특정 감정 상태, 청취자의 참여도, 호스트의 보이스 강도, 호스트의 발화 빈도 및 발화 시간 등을 포함할 수 있으나, 실시예가 이에 한정되는 것은 아니다.The broadcast data management module 191 can store various analysis data (BD5) in the server memory 150. The analyzed data (BD5) includes the frequency of use of specific keywords, specific emotional states implied by keywords, and listener participation. , the intensity of the host's voice, the host's speaking frequency and speaking time, etc., but the embodiment is not limited thereto.

실시예에서, 서버 프로세서(190)는 인공 지능 기반의 분석 모델을 이용하여, 특정 키워드에 함의된 특정 감정 상태를 분석할 수 있다. 분석 모델은 호스트(H)의 스피치를 텍스트화하고, 채팅창의 문구들을 입력으로 받아, 키워드에 함의된 특정 감정 상태를 분석할 수 있다. 분석 모델은 기쁨, 슬픔, 지루함, 재밌음, 화남 등의 감정 상태를 분석하여 출력할 수 있다. 서버 프로세서(190)는 분석된 특정 감정 상태를 서버 메모리(150)에 구분하여 저장할 수 있다.In an embodiment, the server processor 190 may analyze a specific emotional state implied by a specific keyword using an artificial intelligence-based analysis model. The analysis model can convert the host (H)'s speech into text, receive phrases from the chat window as input, and analyze the specific emotional state implied by the keyword. The analysis model can analyze and output emotional states such as joy, sadness, boredom, fun, and anger. The server processor 190 may separately store the analyzed specific emotional state in the server memory 150.

또한, 방송 데이터 관리 모듈(191)은 보강 데이터(VD)를 방송 데이터(BD)와 구분하여 저장할 수 있는데, 라이브 인터넷 방송에서 실제 방송되거나 사용되지 않은 데이터일 수 있다. 다만, 보강 데이터(VD)로 저장했지만, 실제 방송에서 중복적으로 사용되는 경우, 실제 방송 데이터(BD)에도 중복적으로 저장할 수 있다.Additionally, the broadcast data management module 191 can store augmentation data (VD) separately from broadcast data (BD), which may be data that is not actually broadcast or used in a live Internet broadcast. However, although it is saved as reinforcement data (VD), if it is used redundantly in actual broadcasting, it can be stored redundantly in actual broadcast data (BD).

보강 데이터는 하이라이트 콘텐츠의 조회수를 높이기 위한 오디오 데이터, 영상 데이터, 세대 별 또는 상황 별 사용하는 언어 데이터, 라이브 인터넷 방송의 핫키워드 등을 포함할 수 있으나, 실시예가 이에 한정되는 것은 아니다. Reinforcement data may include audio data to increase the number of views of highlight content, video data, language data used by generation or situation, hot keywords of live Internet broadcasts, etc., but the embodiment is not limited thereto.

실시예에서, 서버 프로세서(190)는 하이라이트 콘텐츠 생성을 위해, 청취자의 가장 참여가 많은 시간대에 사용하는 핫키워드를 청취자 인기순으로 정렬하여 저장하고 있다가, 하이라이트 콘텐츠 제작 시에 우선순위가 높은 핫키워드를 사용할 수 있다. 이때, 서버 프로세서(190)는 이미 사용한 핫키워드의 사용수에 따라서 핫키워드의 인기순위를 하향 조정할 수도 있다.In an embodiment, in order to create highlight content, the server processor 190 sorts and stores hot keywords used during times of greatest listener participation in order of listener popularity, and selects the hot keywords with high priority when creating highlight content. can be used. At this time, the server processor 190 may adjust the popularity ranking of the hot keyword downward according to the number of uses of the hot keyword that has already been used.

S310 단계 이후, 서버 프로세서(190)의 설정 모듈(193)은 라이브 인터넷 방송에 대한 하나 이상의 하이라이트 콘텐츠를 생성하기 위한 트리거 조건(TC)을 설정할 수 있다(S320).After step S310, the setting module 193 of the server processor 190 may set a trigger condition (TC) for generating one or more highlight contents for a live Internet broadcast (S320).

도 5를 참고하면, 서버 프로세서(190)는 해당 트리거 조건이 만족되면, 하이라이트 콘텐츠를 생성할 수 있다. 트리거 조건(TC)은, 라이브 인터넷 방송에서 채팅 진행 속도가 소정 수준 이상인 경우(TC1), 후원 청취자의 수가 소정 수준 이상인 경우, 후원 청취자의 후원 금액이 소정 수준 이상인 경우(TC2), 특정 시간대인 경우(TC3), 특정 키워드의 사용 빈도수가 소정 수준 이상인 경우, 키워드에 함의된 특정 감정 상태가 소정 수준 이상인 경우(TC4), 호스트의 보이스 강도가 소정 수준 이상인 경우, 호스트의 발화 빈도가 소정 수준 이상인 경우 및 호스트의 발화 시간이 소정 수준 이상인 경우(TC5), 청취자의 참여도가 소정 수준 이상인 경우(TC6) 중 적어도 하나를 포함할 수 있다. 또한, 트리거 조건(TC)은 호스트(H)에 의해 요청된 설정 조건(TC7)도 포함할 수 있다.Referring to FIG. 5, the server processor 190 can generate highlight content when the corresponding trigger condition is satisfied. Trigger conditions (TC) are when the chat progress speed in a live Internet broadcast is above a certain level (TC1), when the number of sponsored listeners is above a certain level, when the sponsorship amount of sponsored listeners is above a certain level (TC2), and when it is a specific time zone. (TC3), when the frequency of use of a specific keyword is above a predetermined level, when the specific emotional state implied by the keyword is above a predetermined level (TC4), when the host's voice intensity is above a predetermined level, when the host's utterance frequency is above a predetermined level It may include at least one of a case where the host's speech time is above a predetermined level (TC5) and a case where the listener's participation level is above a predetermined level (TC6). Additionally, the trigger condition (TC) may also include a setting condition (TC7) requested by the host (H).

구체적으로, 서버 프로세서(190)는 라이브 인터넷 방송에서 채팅 진행 속도가 소정 수준 이상일 때(TC1), 가령, 초당 채팅수가 5개 이상인 경우, 채팅 속도의 변화(가속도)가 소정 가속도 이상인 경우, 트리거 조건이 만족된 것으로 판단하여, 하이라이트 콘텐츠를 생성할 수 있다.Specifically, the server processor 190 sets a trigger condition when the chat progress speed in a live Internet broadcast is above a predetermined level (TC1), for example, when the number of chats per second is 5 or more, and when the change in chat speed (acceleration) is above a predetermined acceleration. It is determined that this is satisfied, and highlight content can be created.

또한, 서버 프로세서(190)는 후원 청취자의 수가 소정 수준 이상일 때 또는 후원 청취자의 후원 금액이 소정 수준 이상일 때(TC2), 트리거 조건이 만족된 것으로 판단하여, 하이라이트 콘텐츠를 생성할 수 있다. 실시예에서, 서버 프로세서(190)는 인기 호스트가 진행하는 라이브 인터넷 방송의 경우, 트리거 조건이 만족된 것으로 판단하여, 하이라이트 콘텐츠를 생성할 수도 있다. 인기 호스트의 기준은 청취자의 참여인원수, 청취자의 후원인원수 및 후원금액 등에 의해 결정될 수 있으나, 실시예가 이에 한정되는 것은 아니다.Additionally, the server processor 190 may determine that the trigger condition is satisfied when the number of sponsored listeners is greater than a predetermined level or when the sponsorship amount of the sponsored listeners is greater than a predetermined level (TC2) and generate highlight content. In an embodiment, the server processor 190 may determine that a trigger condition is satisfied in the case of a live Internet broadcast conducted by a popular host and generate highlight content. The standard for a popular host may be determined by the number of listeners participating, the number of listeners sponsoring, and the sponsorship amount, but the embodiment is not limited to this.

또한, 서버 프로세서(190)는 특정 시간 대의 방송인 경우(TC3), 트리거 조건이 만족된 것으로 판단하여, 하이라이트 콘텐츠를 생성할 수 있다. 실시예에서, 서버 프로세서(190)는 호스트(H)가 설정한 시간 또는 청취자(L)가 설정한 시간의 라이브 인터넷 방송에 대한 하이라이트 콘텐츠를 생성할 수 있다.Additionally, in the case of a broadcast in a specific time zone (TC3), the server processor 190 may determine that the trigger condition is satisfied and generate highlight content. In an embodiment, the server processor 190 may generate highlight content for a live Internet broadcast at a time set by the host (H) or at a time set by the listener (L).

실시예에서, 서버 프로세서(190)는 소정 후원 금액 이상일 경우, 해당 후원 금액을 지불한 후원 청취자의 메신저 채팅 내용 또는 인터넷 콜 내용을 포함하는 하이라이트 콘텐츠를 해당 후원자만을 위해 생성할 수도 있다. 이를 위해, 서버 프로세서(190)는 후원 금액 순으로 후원자를 리스트업하고 후원 시점과 후원자의 참여 정보(메신저 채팅 내용, 인터넷 콜 내용 등)를 동기화하여 서버 메모리(150)에 저장할 수 있다.In an embodiment, if the sponsorship amount is more than a predetermined amount, the server processor 190 may generate highlight content including messenger chat content or Internet call content of the sponsored listener who paid the sponsorship amount only for the sponsor. To this end, the server processor 190 may list sponsors in order of sponsorship amount, synchronize the sponsorship timing and sponsor participation information (messenger chat content, Internet call content, etc.) and store them in the server memory 150.

또한, 서버 프로세서(190)는 특정 키워드의 사용 빈도수가 소정 수준 이상일 때(TC4), 트리거 조건이 만족된 것으로 판단하여, 하이라이트 콘텐츠를 생성할 수 있다. 가령, 서버 프로세서(190)는 청취자(L)나 호스트(H)가 입력한 감정을 나타내는 이모티콘('ㅋ', 'ㅠ' 등도 포함)의 개수가 소정 개수 이상인 경우(TC4), 하이라이트 콘텐츠를 생성할 수 있다. 이때, 서버 프로세서(190)는 단순히 이모티콘의 개수를 카운팅할 수도 있으나, 채팅 하나당 하나의 이모티콘만 반영할 수 있다. Additionally, the server processor 190 may determine that the trigger condition is satisfied when the frequency of use of a specific keyword is above a predetermined level (TC4) and generate highlight content. For example, the server processor 190 generates highlight content when the number of emoticons (including 'ㅋ', 'ㅠ', etc.) representing emotions input by the listener (L) or host (H) is more than a predetermined number (TC4). can do. At this time, the server processor 190 may simply count the number of emoticons, but only one emoticon can be reflected per chat.

또한, 서버 프로세서(190)는 호스트(H)의 보이스 강도가 소정 수준 이상이거나 호스트의 발화 빈도가 소정 수준 이상이거나 호스트의 발화 시간이 소정 수준 이상일 때(TC5), 가령, 서버 프로세서(190)는 호스트(H)의 보이스 강도가 소정 데시벨 이상으로 측정되거나 발화 시간/빈도가 소정 수준 이상인 경우, 트리거 조건이 만족된 것으로 판단하여, 하이라이트 콘텐츠를 생성할 수 있다.In addition, the server processor 190 operates when the voice intensity of the host (H) is higher than a predetermined level, the speech frequency of the host is higher than a predetermined level, or the speech time of the host is higher than a predetermined level (TC5), for example, the server processor 190 When the voice intensity of the host H is measured to be above a predetermined decibel or the utterance time/frequency is above a predetermined level, it is determined that the trigger condition is satisfied, and highlight content can be generated.

또한, 서버 프로세서(190)는 청취자의 참여도가 소정 수준 이상일 때(TC6), 가령, 서버 프로세서(190)는 특정 라이브 인터넷 방송에 소정 인원 이상이 참여한 경우, 각 청취자의 채팅 참여 개수가 소정 개수 이상인 경우, 트리거 조건이 만족된 것으로 판단하여, 하이라이트 콘텐츠를 생성할 수 있다.In addition, when the listener's participation degree is above a predetermined level (TC6), for example, when more than a predetermined number of people participate in a specific live Internet broadcast, the server processor 190 sets the number of chat participation of each listener to a predetermined number. In the case above, it is determined that the trigger condition is satisfied, and highlight content can be generated.

실시예에서, 서버 프로세서(190)는 특정 청취자만 과도한 채팅 내용을 입력하는 경우, 특정 청취자의 입력 키워드를 축소하여 카운팅하거나, 경우에 따라서는, 무시할 수 있다. In an embodiment, when only a specific listener inputs excessive chat content, the server processor 190 may reduce and count the input keywords of the specific listener or, in some cases, ignore them.

또한, 서버 프로세서(190)는 호스트 단말(200)에서 하이라이트 콘텐츠 생성에 대해 요청한 경우(TC7), 트리거 조건이 만족된 것으로 판단하여, 하이라이트 콘텐츠를 생성할 수 있다.Additionally, when the host terminal 200 requests creation of highlight content (TC7), the server processor 190 may determine that the trigger condition is satisfied and generate highlight content.

S320 단계 이후, 서버 프로세서(190)의 구간 결정 모듈(195)은 설정된 트리거 조건을 만족하는 라이브 인터넷 방송의 방송 구간을 결정할 수 있다(S330).After step S320, the section determination module 195 of the server processor 190 may determine a broadcast section of a live Internet broadcast that satisfies a set trigger condition (S330).

구간 결정 모듈(195)은 라이브 인터넷 방송에서 트리거 조건 별로 트리거 조건을 만족하는 방송 구간을 결정할 수 있다.The section determination module 195 can determine a broadcast section that satisfies the trigger condition for each trigger condition in a live Internet broadcast.

S330 단계 이후, 서버 프로세서(190)의 생성 모듈(197)은 결정된 방송 구간의 방송 데이터 및 보강 데이터를 조합하여 라이브 인터넷 방송에 대한 하이라이트 콘텐츠를 생성할 수 있다(S340).After step S330, the generation module 197 of the server processor 190 may generate highlight content for the live Internet broadcast by combining the broadcast data and reinforcement data of the determined broadcast section (S340).

생성 모듈(197)은 다양한 기준에 따라 하이라이트 콘텐츠를 생성할 수 있는데, 이하에서 구체적으로 설명하기로 한다.The creation module 197 can generate highlight content according to various criteria, which will be described in detail below.

생성 모듈(197)은 트리거 조건을 만족하는 구간마다 하이라이트 콘텐츠를 생성할 수 있다. 생성 모듈(197)은 트리거 조건이 만족된 방송 구간마다 보강 데이터를 조합할 수 있다. The creation module 197 may generate highlight content for each section that satisfies the trigger condition. The generation module 197 may combine reinforcement data for each broadcast section in which the trigger condition is satisfied.

생성 모듈(197)은 라이브 인터넷 방송에서 실제 방송되거나 사용되지 않은 데이터를 방송 구간에 조합하여 하이라이트 콘텐츠를 재탄생시킬 수 있으며, 하이라이트 콘텐츠의 조회수를 높이기 위한 오디오 데이터(뮤직, 특정 보이스, 핫키워드에 대응하는 사운드 등)를 추가할 수 있으며, 호스트에 대응하는 아바타와 청취자에 대응하는 아바타 콘텐츠를 추가하고, 채팅이나 인터넷 콜에서 대화를 재구성할 수 있다. 이때, 서버 프로세서(190)는 대화를 재구성하면서, 가상의 제3자를 추가하여 대화를 평가하는 멘트를 추가할 수 있다. 생성 모듈(197)은 호스트나 청취자의 대화에서 호스트나 청취자가 감정을 드러내는 상황에서는 아바타도 감정을 나타내도록 하이라이트 콘텐츠를 생성할 수 있다. 이때, 생성 모듈(197)은 키워드에 함의된 감정을 파악하는 상술한 인공 지능 기반의 분석 모델을 함께 이용할 수 있다.The creation module 197 can recreate highlight content by combining data that is actually broadcast or not used in a live Internet broadcast into a broadcast section, and can generate audio data (music, specific voice, and hot keywords) to increase the number of views of highlight content. (corresponding sound, etc.) can be added, avatars corresponding to the host and avatar content corresponding to the listener can be added, and conversations can be reconstructed in chat or Internet calls. At this time, the server processor 190 may reconstruct the conversation and add a comment evaluating the conversation by adding a virtual third party. The creation module 197 can generate highlight content so that the avatar also shows emotions in situations where the host or listener shows emotions in a conversation between the hosts and listeners. At this time, the creation module 197 may use the above-described artificial intelligence-based analysis model that identifies the emotion implied in the keyword.

또한, 생성 모듈(197)은 세대 별 또는 상황 별 사용하는 언어 데이터, 라이브 인터넷 방송의 핫키워드 등을 결정된 방송 구간에 함께 조합하여 하이라이트 콘텐츠를 생성할 수 있다. 생성 모듈(197)은 핫키워드 중에서도 우선순위가 높은 키워드를 하이라이트 콘텐츠의 적절한 지점에 영상 기반으로 또는 오디오 기반으로 삽입할 수 있다.Additionally, the creation module 197 may generate highlight content by combining language data used by generation or situation, hot keywords of live Internet broadcasting, etc., together in the determined broadcast section. The creation module 197 can insert keywords with high priority among hot keywords at appropriate points in the highlight content based on video or audio.

또한, 생성 모듈(197)은 트리거 조건을 만족하는 구간이 복수인 경우, 복수의 구간 중 일부를 하나의 하이라이트 콘텐츠에 통합하되, 통합된 콘텐츠에서 트리거 조건 간의 가중치에 기반하여 콘텐츠 내 재생 순서를 구간 별로 다르게 배치할 수 있다. In addition, when there are multiple sections that satisfy the trigger conditions, the creation module 197 integrates some of the multiple sections into one highlight content, and determines the playback order within the content based on the weight between the trigger conditions in the integrated content. It can be arranged very differently.

실시예에서, 생성 모듈(197)은 후원 금액이 많은 구간에 대응하는 하이라이트 콘텐츠를 보강 데이터를 추가하여 별도로 생성하고, 별도의 하이라이트 콘텐츠를 추가적으로 더 생성할 수 있다. 생성 모듈(197)은 별도의 하이라이트 콘텐츠의 경우, 채팅 진행 속도가 소정 수준 이상인 구간, 특정 키워드의 사용 빈도수 또는 키워드에 함의된 특정 감정 상태가 소정 수준 이상인 구간, 청취자의 참여율이 소정 수준 이상인 구간, 호스트의 보이스 강도/발화 빈도/발화 시간이 소정 수준 이상인 구간으로 구분하여, 보강데이터를 구간별로 추가하여 하이라이트 콘텐츠를 생성할 수 있다.In an embodiment, the creation module 197 may separately generate highlight content corresponding to a section with a large sponsorship amount by adding reinforcement data, and may additionally generate separate highlight content. In the case of separate highlight content, the creation module 197 includes a section where the chat progress speed is above a predetermined level, a section where the frequency of use of a specific keyword or a specific emotional state implied by the keyword is above a predetermined level, a section where the listener's participation rate is above a predetermined level, Highlight content can be created by dividing the host into sections where the voice intensity/utterance frequency/utterance time is above a certain level and adding reinforcement data for each section.

실시예에서, 생성 모듈(197)은 통합 콘텐츠를 생성할 때, 복수의 트리거 조건을 동시에 만족하는 지점을 하이라이트 콘텐츠에서 전반부에 배치하고, 트리거 조건을 하나만 만족하는 지점을 하이라이트 콘텐츠의 후반부에 순차적으로 배치할 수 있다. 또한, 생성 모듈(197)은 가중치가 높은 트리거 조건을 통합 하이라이트 콘텐츠의 전반부에 배치할 수 있다. 상기 가중치는 호스트의 선택에 따라 결정될 수 있으나, 실시예가 이에 한정되는 것은 아니다. In an embodiment, when generating integrated content, the creation module 197 places points that simultaneously satisfy a plurality of trigger conditions in the first half of the highlight content, and sequentially places points that satisfy only one trigger condition in the second half of the highlight content. It can be placed. Additionally, the creation module 197 may place a trigger condition with a high weight in the first half of the integrated highlight content. The weight may be determined according to the selection of the host, but the embodiment is not limited thereto.

실시예에서, 생성 모듈(197)은 통합 콘텐츠의 재생 시간이 소정 시간 이상보다 긴 경우, 통합 콘텐츠를 복수의 부분 통합 콘텐츠로 생성할 수도 있다.In an embodiment, the creation module 197 may generate the integrated content into a plurality of partially integrated contents when the playback time of the integrated content is longer than a predetermined time.

도 6은 본 개시에 따른 하이라이트 콘텐츠 생성 서버(100)의 호스트 단말에 의해 선택된 하이라이트 콘텐츠를 공유하는 과정을 설명하기 위한 도면이다.FIG. 6 is a diagram illustrating a process of sharing highlight content selected by the host terminal of the highlight content creation server 100 according to the present disclosure.

우선, 서버 프로세서(190)는 복수의 하이라이트 콘텐츠를 생성하여(S410), 호스트 단말(200)에 제공할 수 있다(S420). First, the server processor 190 may generate a plurality of highlight contents (S410) and provide them to the host terminal 200 (S420).

호스트 단말(200)은 하이라이트 콘텐츠를 선택하여(S430), 선택 알림을 하이라이트 콘텐츠 생성 서버(100)에 제공할 수 있다(S440).The host terminal 200 may select highlight content (S430) and provide a selection notification to the highlight content creation server 100 (S440).

실시예에서, 서버 프로세서(190)는 호스트에게 하이라이트 콘텐츠를 제공하면서, 과금 프로세스를 수행할 수 있다.In an embodiment, server processor 190 may perform a billing process while providing highlight content to the host.

서버 프로세서(190)는 선택된 하이라이트 콘텐츠를 청취자 단말(300)에 제공할 수 있으며(S450), 청취자 단말(300)은 해당 하이라이트 콘텐츠를 재생할 수 있다(S460).The server processor 190 can provide the selected highlight content to the listener terminal 300 (S450), and the listener terminal 300 can play the corresponding highlight content (S460).

이때, 청취자 단말(300)은 해당 방송의 청취자일 필요는 없으며, 하이라이트 콘텐츠를 다운로드하지 않더라도 실시간으로 제공받을 수도 있다. 이에, 방송 청취 독려의 효과가 발생될 수 있다.At this time, the listener terminal 300 does not need to be a listener of the broadcast, and can receive highlight content in real time even without downloading it. Accordingly, the effect of encouraging broadcast listening may occur.

다른 실시예에서, 서버 프로세서(190)는 라이브 인터넷 방송 중에, 호스트가 포함된 인터넷 콜이나 채팅창에서 청취자의 문의사항이 있는 경우, 문의사항을 리스트업하여 서버 통신부(110)를 통해 호스트 단말(200)에 제공할 수 있으며, 문의사항 리스트에서 답변이 완료된 문의사항을 제외한 문의사항을 소정 주기로 호스트 단말(200)에 제공할 수 있다. 이때, 서버 프로세서(190)는 인터넷 콜에서 발생된 문의사항을 텍스트화하여 호스트 단말(200)에 제공할 수 있다. In another embodiment, during a live Internet broadcast, if there is an inquiry from a listener in an Internet call or chat window involving the host, the server processor 190 lists the inquiry and sends it to the host terminal ( 200), and inquiries excluding inquiries that have been answered from the inquiry list may be provided to the host terminal 200 at a predetermined period. At this time, the server processor 190 may convert the inquiry generated from the Internet call into text and provide it to the host terminal 200.

이 경우, 서버 프로세서(190)는 청취자의 문의사항 중에서 중복되거나 소정의 유사도 범위를 갖는 문의사항을 병합하고, 문의 빈도수가 높은 문의사항부터 정렬하여 서버 통신부(110)를 통해 호스트 단말(200)에 제공할 수 있다.In this case, the server processor 190 merges inquiries that are duplicated or have a predetermined similarity range among the listener's inquiries, sorts the inquiries with the highest frequency of inquiries, and sends them to the host terminal 200 through the server communication unit 110. can be provided.

서버 프로세서(190)는 문의사항이 중복되거나 소정 유사도를 갖는지 판단하기 위해, 두 문의사항이 입력되면, 유사도를 퍼센트로 출력하는 유사도 검출 알고리즘을 사용할 수 있다. 이때, 서버 프로세서(190)는 각 문의사항을 공간 상의 벡터로 대응시킬 수 있으며, 두 벡터 간 코사인 유사도 값이 0.8 이상 1 이하인 경우 문의사항이 유사한 것으로 판단할 수 있다. 서버 프로세서(190)는 각 문의사항의 명사를 중심으로 문의사항 간의 유사도를 판단할 수 있으며, 코사인 유사도 값은 백터 간 스칼라곱과 크기에 의해 산출될 수 있다.The server processor 190 may use a similarity detection algorithm that outputs the similarity as a percentage when two inquiries are input in order to determine whether the inquiries overlap or have a certain degree of similarity. At this time, the server processor 190 may correspond to each inquiry with a vector in space, and if the cosine similarity value between the two vectors is 0.8 or more and 1 or less, it may be determined that the inquiries are similar. The server processor 190 can determine the similarity between inquiries based on the noun of each inquiry, and the cosine similarity value can be calculated by the scalar product and size between vectors.

실시예에서, 호스트 단말(200)은 답변이 안된 문의사항에 대해, 답변하거나 거부할 수 있으며, 거부할 경우, 서버 프로세서(190)는 거부된 문의사항을 호스트 단말(200)만 알 수 있도록 표시하여 호스트 단말(200)에 제공할 수 있다.In an embodiment, the host terminal 200 can respond or reject unanswered inquiries, and if rejected, the server processor 190 displays the rejected inquiries so that only the host terminal 200 can see them. This can be provided to the host terminal 200.

실시예에서, 서버 프로세서(190)는 곤란한 문의사항을 소정 수준 이상하는 청취자에게 경고 메시지를 서버 통신부(110)를 통해 청취자 단말에 제공할 수 있다.In an embodiment, the server processor 190 may provide a warning message to the listener terminal through the server communication unit 110 to listeners who ask difficult questions beyond a predetermined level.

한편, 개시된 실시예들은 컴퓨터에 의해 실행 가능한 명령어를 저장하는 기록매체의 형태로 구현될 수 있다. 명령어는 프로그램 코드의 형태로 저장될 수 있으며, 프로세서에 의해 실행되었을 때, 프로그램 모듈을 생성하여 개시된 실시예들의 동작을 수행할 수 있다. 기록매체는 컴퓨터로 읽을 수 있는 기록매체로 구현될 수 있다.Meanwhile, the disclosed embodiments may be implemented in the form of a recording medium that stores instructions executable by a computer. Instructions may be stored in the form of program code, and when executed by a processor, may create program modules to perform operations of the disclosed embodiments. The recording medium may be implemented as a computer-readable recording medium.

컴퓨터가 읽을 수 있는 기록매체로는 컴퓨터에 의하여 해독될 수 있는 명령어가 저장된 모든 종류의 기록 매체를 포함한다. 예를 들어, ROM(Read Only Memory), RAM(Random Access Memory), 자기 테이프, 자기 디스크, 플래쉬 메모리, 광 데이터 저장장치 등이 있을 수 있다. Computer-readable recording media include all types of recording media storing instructions that can be decoded by a computer. For example, there may be Read Only Memory (ROM), Random Access Memory (RAM), magnetic tape, magnetic disk, flash memory, optical data storage device, etc.

이상에서와 같이 첨부된 도면을 참조하여 개시된 실시예들을 설명하였다. 본 개시가 속하는 기술분야에서 통상의 지식을 가진 자는 본 개시의 기술적 사상이나 필수적인 특징을 변경하지 않고도, 개시된 실시예들과 다른 형태로 본 개시가 실시될 수 있음을 이해할 것이다. 개시된 실시예들은 예시적인 것이며, 한정적으로 해석되어서는 안 된다.As described above, the disclosed embodiments have been described with reference to the attached drawings. A person skilled in the art to which this disclosure pertains will understand that the present disclosure may be practiced in forms different from the disclosed embodiments without changing the technical idea or essential features of the present disclosure. The disclosed embodiments are illustrative and should not be construed as limiting.

100: 하이라이트 콘텐츠 생성 서버, 200: 호스트 단말, 300: 청취자 단말.100: highlight content creation server, 200: host terminal, 300: listener terminal.

Claims

As a server that generates highlight content for voice-based live Internet broadcasting,
a server communication unit that communicates with a host terminal and a listener terminal of a voice-based live Internet broadcast; Server memory to store enrichment data used in highlight content; and
A server processor including one or more cores,
The server processor is,
a broadcast data management module that divides broadcast data of the live Internet broadcast into predetermined categories and stores them in the server memory;
a setting module that sets trigger conditions for generating one or more highlight contents for the live Internet broadcast;
a section determination module that determines a broadcast section of the live Internet broadcast that satisfies the set trigger condition; and
A generation module that generates the highlight content for the live Internet broadcast by combining the broadcast data of the determined broadcast section and the reinforcement data,
The reinforcement data is,
It is data that is not actually broadcast or used in the live Internet broadcast, and includes at least one of audio data to increase the number of views of the highlight content, video data, language data used by generation or situation, and hot keywords of the live Internet broadcast; ,
The creation module is,
If there are multiple sections that satisfy the trigger conditions, some of the multiple sections are integrated into one highlight content, and the playback order within the content is arranged differently for each section based on the weight between the trigger conditions in the integrated content. ,
The creation module is,
A section that satisfies the trigger condition with a high weight is placed in the first half of the integrated content, a section that simultaneously satisfies a plurality of trigger conditions is placed in the first half of the integrated content, and a section that satisfies only one trigger condition is placed in the first half of the integrated content. A server that is sequentially placed in the second half of the integrated content.

According to paragraph 1,
The server processor is,
The highlight content is generated and provided to the host terminal through the server communication unit, and when at least one of the highlight contents is selected by the host terminal, the selected highlight content is provided to the listener terminal through the server communication unit. Being a server.

According to paragraph 2,
The broadcast data management module,
A server configured to classify broadcast data of the live Internet broadcast into at least one category of the host's voice data, chat data, live call data during the broadcast, and sponsorship-related data and store them in the server memory.

According to paragraph 3,
The setting module is,
In the live Internet broadcast, the speed of chatting, the number of sponsored listeners and the sponsorship amount of sponsored listeners, the frequency of use of specific keywords, the specific emotional state implied by the keyword, listener participation, the intensity of the host's voice, and the frequency and speech of the host. A server configured to set a condition in which at least one of the times is above a predetermined level as the trigger condition.

According to paragraph 4,
The setting module is,
A server configured to set a condition requested by the host terminal as a trigger condition for generating the highlight content.

delete

A method of generating highlight content for a voice-based live Internet broadcast performed by a highlight content creation server,
Classifying and storing broadcast data of the live Internet broadcast into predetermined categories;
setting trigger conditions for generating one or more highlight contents for the live Internet broadcast;
determining a broadcast section of the live Internet broadcast that satisfies the set trigger condition; and
Generating, through a generation module, the highlight content for the live Internet broadcast by combining broadcast data of the determined broadcast section and reinforcement data pre-stored for use in the highlight content,
The reinforcement data is,
It is data that is not actually broadcast or used in the live Internet broadcast, and includes at least one of audio data to increase the number of views of the highlight content, video data, language data used by generation or situation, and hot keywords of the live Internet broadcast; ,
The creation module is,
If there are multiple sections that satisfy the trigger conditions, some of the multiple sections are integrated into one highlight content, and the playback order within the content is arranged differently for each section based on the weight between the trigger conditions in the integrated content. ,
The creation module is,
A section that satisfies the trigger condition with a high weight is placed in the first half of the integrated content, a section that simultaneously satisfies a plurality of trigger conditions is placed in the first half of the integrated content, and a section that satisfies only one trigger condition is placed in the first half of the integrated content. A creation method that places sequentially in the second half of integrated content.

According to clause 8,
After the above generating step,
generating the highlight content and providing it to a host terminal; and
When at least one of the highlight contents is selected by the host terminal, the method further comprising providing the selected highlight content to a listener terminal.

According to clause 9,
The setting steps are:
In the live Internet broadcast, the speed of chatting, the number of sponsored listeners and the sponsorship amount of sponsored listeners, the frequency of use of specific keywords, the specific emotional state implied by the keyword, listener participation, the intensity of the host's voice, and the frequency and speech of the host. A generating method comprising setting a condition in which at least one of the times is above a predetermined level as the trigger condition.