KR102578169B1

KR102578169B1 - System for Providing Auto Complete Service YouTube Thumbnail

Info

Publication number: KR102578169B1
Application number: KR1020230033173A
Authority: KR
Inventors: 석대건
Original assignee: 주식회사 웨이크유어마인드
Priority date: 2023-03-14
Filing date: 2023-03-14
Publication date: 2023-09-14

Abstract

유튜브 썸네일 자동 완성 서비스 시스템은 유튜브 영상 주소를 입력하면, 시청자 관점으로 노출 클릭률이 높은 썸네일 이미지를 자동으로 완성하여 업로드한 영상이 도달 가능한 최고 조회수를 달성할 수 있다.
본 발명은 쉽고 편리한 유튜브 썸네일 제작 전용 서비스를 제공하고, 차별화된 썸네일 이미지를 자동 완성하여 채널 유입 시청자의 지속적 클릭으로 조회수 상승 효과를 기대할 수 있다.The YouTube thumbnail auto-complete service system automatically completes a thumbnail image with a high click-through rate from the viewer's perspective when a YouTube video address is entered, allowing the uploaded video to achieve the highest number of views possible.
The present invention provides an easy and convenient service dedicated to creating YouTube thumbnails, and automatically completes differentiated thumbnail images, which can be expected to increase the number of views through continuous clicking by viewers entering the channel.

Description

YouTube Thumbnail Auto Complete Service System {System for Providing Auto Complete Service YouTube Thumbnail}

본 발명은 유튜브 썸네일 자동 완성 서비스 시스템에 관한 것으로서, 더욱 상세하게는 유튜브 영상 주소를 입력하면, 시청자 관점으로 노출 클릭률이 높은 썸네일 이미지를 자동으로 완성하여 업로드한 영상이 도달 가능한 최고 조회수를 달성할 수 있는 유튜브 썸네일 자동 완성 서비스 시스템에 관한 것이다.The present invention relates to a YouTube thumbnail auto-completion service system. More specifically, when a YouTube video address is entered, a thumbnail image with a high click-through rate is automatically completed from the viewer's perspective, allowing the uploaded video to achieve the highest number of views achievable. This is about the YouTube thumbnail auto-complete service system.

우리나라 인구 529명당 1개 채널을 보유하고, 소규모 일반 유튜버 수를 포함하면, 유튜버 채널이 10배 이상의 규모로 추정된다.There is one channel per 529 people in Korea, and if the number of small-scale general YouTubers is included, the number of YouTube channels is estimated to be more than 10 times the size.

2022년 기준으로 유튜버를 포함한 크리에이터 이코노미 시장 규모가 약 120조 5000억원으로 유튜버 크리에이터의 경제 성장이 지속적으로 예상된다.As of 2022, the size of the creator economy market, including YouTubers, is approximately 120.5 trillion won, and the economic growth of YouTube creators is expected to continue.

2023년 국내 총 유튜브 채널 수는 약 170만개로 조사되었으며, 2021년 수익 창출 유튜버 수가 2021년 110,654명이고, 2022년 125,043명이고, 구독자 1만명 이상의 보휴 유튜브 채널의 수가 2021년 22,056명이고, 2022년 24,924명이다.The total number of YouTube channels in Korea in 2023 was found to be approximately 1.7 million, the number of YouTubers making money in 2021 is 110,654 in 2021, 125,043 in 2022, and the number of YouTube channels with more than 10,000 subscribers is 22,056 in 2021 and 2022. There are 24,924 people per year.

유튜브 영상의 클릭을 결정하는 요소는 유튜브 시청자 및 유튜버를 조사하면, 썸네일 영상이 유튜브 시청 결정에 86.6%로 영향을 미친다.When we survey YouTube viewers and YouTubers about the factors that determine clicks on YouTube videos, thumbnail videos influence the decision to watch YouTube by 86.6%.

이와 같이, 유튜브 썸네일은 영상 조회수를 결정하는 핵심적인 요인이다.Likewise, YouTube thumbnails are a key factor in determining the number of video views.

하지만 많은 유튜버는 내용 설명이 반복되는 뻔한 카피, 가독성 떨어지는 디자인 템플릿 사용과, 시청자 관점을 고려하지 않은 잡다한 편집으로 클릭하고 싶은 썸네일을 만들지 못하는 문제점이 있다.However, many YouTubers have the problem of not being able to create clickable thumbnails due to obvious copy that repeats content descriptions, use of design templates that are difficult to read, and miscellaneous editing that does not take the viewer's perspective into consideration.

포토샵, 미리캔버스, 망고보드 등 디자인 툴로 썸네일을 제작하는데, 디자인 툴의 경우, 유튜버에게 창작 작업을 요구하는 단점이 있다. 특히, 고령층 등 디지털 취약 유튜버가 디자인 툴을 활용하기 복잡하고, 어려움이 있었다.Thumbnails are created using design tools such as Photoshop, Precanvas, and Mangoboard, but design tools have the disadvantage of requiring creative work from YouTubers. In particular, it was complicated and difficult for digitally vulnerable YouTubers, such as the elderly, to use design tools.

현재 시장 내에서는 유튜버 맞춤형 썸네일 제작 서비스가 거의 전무한 실정이다.Currently, there is almost no customized thumbnail creation service for YouTubers in the market.

한국 특허 등록번호 제10-1990089호Korean Patent Registration No. 10-1990089

이와 같은 문제점을 해결하기 위하여, 본 발명은 유튜브 영상 주소를 입력하면, 시청자 관점으로 노출 클릭률이 높은 썸네일 이미지를 자동으로 완성하여 업로드한 영상이 도달 가능한 최고 조회수를 달성할 수 있는 유튜브 썸네일 자동 완성 서비스 시스템을 제공하는데 그 목적이 있다.In order to solve this problem, the present invention provides a YouTube thumbnail auto-completion service that, when a YouTube video address is entered, automatically completes a thumbnail image with a high click-through rate from the viewer's perspective, thereby achieving the highest number of views that the uploaded video can reach. The purpose is to provide a system.

상기 목적을 달성하기 위한 본 발명의 특징에 따른 유튜브 썸네일 자동 완성 서비스 시스템은,The YouTube thumbnail automatic completion service system according to the characteristics of the present invention to achieve the above object is,

구독자수가 기설정된 기준 구독자수 이상인 유튜브 채널을 외부 서버 또는 인터넷상에서 검색하고, 검색된 유튜브 채널의 썸네일 이미지를 유튜브 영상 스크린샷 프레임으로 캡쳐하여 추출하는 썸네일 이미지 추출부;A thumbnail image extraction unit that searches an external server or the Internet for a YouTube channel with a number of subscribers equal to or greater than a preset standard, and captures and extracts the thumbnail image of the searched YouTube channel as a YouTube video screenshot frame;

상기 썸네일 이미지 추출부로부터 수신한 복수의 유튜브 영상 스크린샷 프레임을 썸네일 유형별로 분류하고, 상기 썸네일 유형별로 분류한 유튜브 영상 스크린샷 프레임을 데이터베이스화하여 저장하고, 상기 썸네일 유형별로 유튜브 영상 스크린샷 프레임 내에서 객체를 추출하는 객체 이미지 추출부와, 상기 객체 이미지 추출부에서 감지한 객체를 분석하여 상기 영상 스크린샷 프레임에 포함된 썸네일 배경 이미지를 생성하는 객체 이미지 분석부와, 상기 썸네일 유형별로 유튜브 영상 스크린샷 프레임 내에서 텍스트 영역을 추출하는 텍스트 검출부와, 상기 추출한 텍스트 영역을 분석하여 상기 유튜브 영상 스크린샷 프레임에 포함된 텍스트 정보를 생성하는 텍스트 분석부로 이루어진 썸네일 이미지 데이터베이스부; 및Classifies a plurality of YouTube video screenshot frames received from the thumbnail image extraction unit by thumbnail type, stores the YouTube video screenshot frames classified by thumbnail type in a database, and stores them in a YouTube video screenshot frame by thumbnail type. an object image extraction unit that extracts an object from the object image extraction unit, an object image analysis unit that analyzes the object detected by the object image extraction unit to generate a thumbnail background image included in the video screenshot frame, and a YouTube video screen for each thumbnail type. A thumbnail image database unit comprising a text detection unit that extracts a text area within a shot frame, and a text analysis unit that analyzes the extracted text area to generate text information included in the YouTube video screenshot frame; and

상기 객체 이미지 분석부에서 생성한 썸네일 배경 이미지와, 상기 텍스트 분석부에서 생성한 텍스트 정보를 결합한 최종 썸네일 이미지를 생성하는 제어부를 포함한다.It includes a control unit that generates a final thumbnail image by combining the thumbnail background image generated by the object image analysis unit and the text information generated by the text analysis unit.

상기 제어부로부터 썸네일 유형과 텍스트 내용(유튜브 영상 키워드)을 수신하고, 상기 텍스트 내용을 이용하여 상기 썸네일 유형에 대응하는 썸네일 텍스트(썸네일 카피)를 출력하는 카피라이팅 분석 생성부;A copywriting analysis generation unit that receives a thumbnail type and text content (YouTube video keyword) from the control unit and outputs a thumbnail text (thumbnail copy) corresponding to the thumbnail type using the text content;

상기 제어부로부터 썸네일 유형과 복수의 유튜브 영상 스크린샷 프레임을 수신하고, 상기 썸네일 유형에 대응하는 최적 썸네일 배경 이미지를 출력하는 백그라운드 이미징 분석 생성부; 및a background imaging analysis generation unit that receives a thumbnail type and a plurality of YouTube video screenshot frames from the control unit and outputs an optimal thumbnail background image corresponding to the thumbnail type; and

상기 카피라이팅 분석 생성부에서 생성한 썸네일 텍스트와, 상기 백그라운드 이미징 분석 생성부에서 생성한 최적 썸네일 배경 이미지를 결합하여 최종 썸네일 이미지를 완성하는 썸네일 결합부를 포함한다.and a thumbnail combination unit that combines the thumbnail text generated by the copywriting analysis generation unit and the optimal thumbnail background image generated by the background imaging analysis generation unit to complete a final thumbnail image.

썸네일 이미지 데이터베이스부는 상기 썸네일 유형별로 상기 유튜브 영상 스크린샷 프레임 내의 객체의 위치, 크기, 개수, 구도로 이루어진 기준 객체 정보를 설정하고, 상기 썸네일 유형별로 상기 유튜브 영상 스크린샷 프레임 내의 텍스트 내용, 크기, 위치로 이루어진 기준 텍스트 정보를 설정하며, 상기 기준 객체 정보와 상기 기준 텍스트 정보를 이용하여 각각의 유튜브 영상 스크린샷 프레임에 대응하는 썸네일 유형을 분류하는 썸네일 유형 분류부를 더 포함한다.The thumbnail image database unit sets reference object information consisting of the location, size, number, and composition of objects within the YouTube video screenshot frame for each thumbnail type, and sets standard object information consisting of the location, size, number, and composition of objects within the YouTube video screenshot frame for each thumbnail type, and provides text content, size, and location within the YouTube video screenshot frame for each thumbnail type. It sets the standard text information and further includes a thumbnail type classification unit that classifies the thumbnail type corresponding to each YouTube video screenshot frame using the reference object information and the reference text information.

유튜브 영상 주소(웹주소)를 입력받는 경우, 영상 프레임을 복수개 획득하는 영상 입력부; 및A video input unit that obtains a plurality of video frames when a YouTube video address (web address) is input; and

상기 각 영상 프레임을 N×N 픽셀(Pixel) 크기로 구성되는 매크로 블록 간의 차분값을 계산하여 상기 영상 프레임의 장면 전환을 판단하는 장면 전환 감지부를 더 포함하며,It further includes a scene change detection unit that determines a scene change of the image frame by calculating a difference value between macro blocks composed of N × N pixel size for each image frame,

상기 썸네일 이미지 추출부는 상기 장면 전환 감지부에서 장면 전환 지점을 찾아낸 후, 기설정된 장면별 지속 시간을 이용하여 장면 전환이 이루어진 유튜브 영상 스크린샷 프레임을 캡쳐하여 추출하고,The thumbnail image extractor finds the scene change point in the scene change detection unit, then captures and extracts a YouTube video screenshot frame in which the scene change occurred using a preset duration for each scene,

상기 장면 전환이 이루어진 유튜브 영상 스크린샷 프레임은 상기 제어부의 제어에 따라 상기 썸네일 이미지 데이터베이스부에서 썸네일 유형을 분류하고, 상기 카피라이팅 분석 생성부에서 상기 썸네일 유형에 대응하는 썸네일 텍스트(썸네일 카피)를 출력하고, 상기 백그라운드 이미징 분석 생성부에서 상기 썸네일 유형에 대응하는 최적 썸네일 배경 이미지를 출력하고, 상기 썸네일 결합부에서 상기 썸네일 유형별로 상기 썸네일 텍스트를 상기 최적 썸네일 배경 이미지에 결합하여 상기 최종 썸네일 이미지를 자동 완성한다.The YouTube video screenshot frame in which the scene change occurred is classified into a thumbnail type in the thumbnail image database unit under the control of the control unit, and the copywriting analysis generation unit outputs a thumbnail text (thumbnail copy) corresponding to the thumbnail type. The background imaging analysis generation unit outputs an optimal thumbnail background image corresponding to the thumbnail type, and the thumbnail combination unit automatically generates the final thumbnail image by combining the thumbnail text with the optimal thumbnail background image for each thumbnail type. Complete.

전술한 구성에 의하여, 본 발명은 유튜브 영상 주소를 입력하면, 시청자 관점으로 노출 클릭률이 높은 썸네일 이미지를 자동으로 완성하여 업로드한 영상이 도달 가능한 최고 조회수를 달성할 수 있는 효과가 있다.With the above-described configuration, the present invention has the effect of automatically completing a thumbnail image with a high click-through rate from the viewer's perspective when a YouTube video address is entered, thereby achieving the highest number of views that the uploaded video can reach.

본 발명은 쉽고 편리한 유튜브 썸네일 제작 전용 서비스를 제공하고, 차별화된 썸네일 이미지를 자동 완성하여 채널 유입 시청자의 지속적 클릭으로 조회수 상승 효과를 기대할 수 있다.The present invention provides an easy and convenient service dedicated to creating YouTube thumbnails, and automatically completes differentiated thumbnail images, which can be expected to increase the number of views through continuous clicking by viewers entering the channel.

도 1은 본 발명의 실시예에 따른 유튜브 썸네일 자동 완성 서비스 시스템의 구성을 나타낸 도면이다.
도 2는 본 발명의 실시예에 따른 썸네일 이미지 데이터베이스부의 내부 구성을 간략하게 나타낸 블록도이다.
도 3은 본 발명의 실시예에 따른 유튜브 썸네일 이미지의 객체 분석과 텍스트 분석의 일례를 나타낸 도면이다.
도 4는 본 발명의 실시예에 따른 썸네일 텍스트를 최적 썸네일 배경 이미지에 결합하여 최종 썸네일 이미지를 자동 완성하는 모습을 나타낸 도면이다.
도 5는 본 발명의 실시예에 따른 인공 신경망 엔진부의 내부 구성을 간략하게 나타낸 블록도이다.
도 6은 본 발명의 실시예에 따른 최적 썸네일 배경 이미지를 추천하는 개념을 나타낸 도면이다.Figure 1 is a diagram showing the configuration of a YouTube thumbnail automatic completion service system according to an embodiment of the present invention.
Figure 2 is a block diagram briefly showing the internal configuration of a thumbnail image database unit according to an embodiment of the present invention.
Figure 3 is a diagram showing an example of object analysis and text analysis of a YouTube thumbnail image according to an embodiment of the present invention.
Figure 4 is a diagram showing automatic completion of the final thumbnail image by combining thumbnail text with an optimal thumbnail background image according to an embodiment of the present invention.
Figure 5 is a block diagram briefly showing the internal configuration of the artificial neural network engine unit according to an embodiment of the present invention.
Figure 6 is a diagram illustrating the concept of recommending an optimal thumbnail background image according to an embodiment of the present invention.

본 발명은 다양한 변경을 가할 수 있고 여러 가지 실시예를 가질 수 있는 바, 특정 실시예들을 도면에 예시하고 상세한 설명에 상세하게 설명하고자 한다. 그러나, 이는 본 발명을 특정한 실시 형태에 대해 한정하려는 것이 아니며, 본 발명의 사상 및 기술 범위에 포함되는 모든 변경, 균등물 내지 대체물을 포함하는 것으로 이해되어야 한다. 각 도면을 설명하면서 유사한 참조부호를 유사한 구성요소에 대해 사용하였다.Since the present invention can make various changes and have various embodiments, specific embodiments will be illustrated in the drawings and described in detail in the detailed description. However, this is not intended to limit the present invention to specific embodiments, and should be understood to include all changes, equivalents, and substitutes included in the spirit and technical scope of the present invention. While describing each drawing, similar reference numerals are used for similar components.

제1, 제2, A, B 등의 용어는 다양한 구성요소들을 설명하는 데 사용될 수 있지만, 상기 구성요소들은 상기 용어들에 의해 한정되어서는 안 된다. 상기 용어들은 하나의 구성요소를 다른 구성요소로부터 구별하는 목적으로만 사용된다. 예를 들어, 본 발명의 권리 범위를 벗어나지 않으면서 제1 구성요소는 제2 구성요소로 명명될 수 있고, 유사하게 제2 구성요소도 제1 구성요소로 명명될 수 있다. "및/또는"이라는 용어는 복수의 관련된 기재된 항목들의 조합 또는 복수의 관련된 기재된 항목들 중의 어느 항목을 포함한다.Terms such as first, second, A, B, etc. may be used to describe various components, but the components should not be limited by the terms. The above terms are used only for the purpose of distinguishing one component from another. For example, a first component may be named a second component, and similarly, the second component may also be named a first component without departing from the scope of the present invention. The term “and/or” includes any of a plurality of related stated items or a combination of a plurality of related stated items.

어떤 구성요소가 다른 구성요소에 "연결되어" 있다거나 "접속되어" 있다고 언급된 때에는, 그 다른 구성요소에 직접적으로 연결되어 있거나 또는 접속되어 있을 수도 있지만, 중간에 다른 구성요소가 존재할 수도 있다고 이해되어야 할 것이다. 반면에, 어떤 구성요소가 다른 구성요소에 "직접 연결되어" 있다거나 "직접 접속되어" 있다고 언급된 때에는, 중간에 다른 구성요소가 존재하지 않는 것으로 이해되어야 할 것이다.When a component is said to be "connected" or "connected" to another component, it is understood that it may be directly connected to or connected to the other component, but that other components may exist in between. It should be. On the other hand, when it is mentioned that a component is “directly connected” or “directly connected” to another component, it should be understood that there are no other components in between.

본 출원에서 사용한 용어는 단지 특정한 실시예를 설명하기 위해 사용된 것으로, 본 발명을 한정하려는 의도가 아니다. 단수의 표현은 문맥상 명백하게 다르게 뜻하지 않는 한, 복수의 표현을 포함한다. 본 출원에서, "포함하다" 또는 "가지다" 등의 용어는 명세서상에 기재된 특징, 숫자, 단계, 동작, 구성요소, 부품 또는 이들을 조합한 것이 존재함을 지정하려는 것이지, 하나 또는 그 이상의 다른 특징들이나 숫자, 단계, 동작, 구성요소, 부품 또는 이들을 조합한 것들의 존재 또는 부가 가능성을 미리 배제하지 않는 것으로 이해되어야 한다.The terms used in this application are only used to describe specific embodiments and are not intended to limit the invention. Singular expressions include plural expressions unless the context clearly dictates otherwise. In this application, terms such as “comprise” or “have” are intended to designate the presence of features, numbers, steps, operations, components, parts, or combinations thereof described in the specification, but are not intended to indicate the presence of one or more other features. It should be understood that this does not exclude in advance the possibility of the existence or addition of elements, numbers, steps, operations, components, parts, or combinations thereof.

다르게 정의되지 않는 한, 기술적이거나 과학적인 용어를 포함해서 여기서 사용되는 모든 용어들은 본 발명이 속하는 기술 분야에서 통상의 지식을 가진 자에 의해 일반적으로 이해되는 것과 동일한 의미를 가지고 있다. 일반적으로 사용되는 사전에 정의되어 있는 것과 같은 용어들은 관련 기술의 문맥 상 가지는 의미와 일치하는 의미를 가지는 것으로 해석되어야 하며, 본 출원에서 명백하게 정의하지 않는 한, 이상적이거나 과도하게 형식적인 의미로 해석되지 않는다.Unless otherwise defined, all terms used herein, including technical or scientific terms, have the same meaning as commonly understood by a person of ordinary skill in the technical field to which the present invention pertains. Terms defined in commonly used dictionaries should be interpreted as having a meaning consistent with the meaning in the context of the related technology, and unless explicitly defined in the present application, should not be interpreted in an ideal or excessively formal sense. No.

이하, 첨부한 도면들을 참조하여, 본 발명의 바람직한 실시예를 보다 상세하게 설명하고자 한다. 본 발명을 설명함에 있어 전체적인 이해를 용이하게 하기 위하여 도면상의 동일한 구성요소에 대해서는 동일한 참조부호를 사용하고 동일한 구성요소에 대해서 중복된 설명은 생략한다.Hereinafter, preferred embodiments of the present invention will be described in more detail with reference to the attached drawings. In order to facilitate overall understanding when describing the present invention, the same reference numerals are used for the same components in the drawings, and duplicate descriptions for the same components are omitted.

도 1은 본 발명의 실시예에 따른 유튜브 썸네일 자동 완성 서비스 시스템의 구성을 나타낸 도면이고, 도 2는 본 발명의 실시예에 따른 썸네일 이미지 데이터베이스부의 내부 구성을 간략하게 나타낸 블록도이고, 도 3은 본 발명의 실시예에 따른 유튜브 썸네일 이미지의 객체 분석과 텍스트 분석의 일례를 나타낸 도면이고, 도 4는 본 발명의 실시예에 따른 썸네일 텍스트를 최적 썸네일 배경 이미지에 결합하여 최종 썸네일 이미지를 자동 완성하는 모습을 나타낸 도면이고, 도 5는 본 발명의 실시예에 따른 인공 신경망 엔진부의 내부 구성을 간략하게 나타낸 블록도이고, 도 6은 본 발명의 실시예에 따른 최적 썸네일 배경 이미지를 추천하는 개념을 나타낸 도면이다.
본 발명의 실시예에 따른 유튜브 썸네일 자동 완성 서비스 시스템(10)은 외부 서버(20) 및 유튜브 썸네일 자동 완성 서비스 장치(100)를 포함한다.Figure 1 is a diagram showing the configuration of a YouTube thumbnail automatic completion service system according to an embodiment of the present invention, Figure 2 is a block diagram briefly showing the internal structure of a thumbnail image database unit according to an embodiment of the present invention, and Figure 3 is A diagram showing an example of object analysis and text analysis of a YouTube thumbnail image according to an embodiment of the present invention, and Figure 4 shows a diagram showing an example of automatically completing the final thumbnail image by combining the thumbnail text with an optimal thumbnail background image according to an embodiment of the present invention. It is a diagram showing the appearance, Figure 5 is a block diagram briefly showing the internal structure of the artificial neural network engine unit according to an embodiment of the present invention, and Figure 6 shows the concept of recommending the optimal thumbnail background image according to an embodiment of the present invention. It is a drawing.
The YouTube thumbnail auto-complete service system 10 according to an embodiment of the present invention includes an external server 20 and a YouTube thumbnail auto-complete service device 100.

본 발명의 실시예에 따른 유튜브 썸네일 자동 완성 서비스 장치(100)는 영상 수신부(110), 영상 입력부(111), 장면 전환 감지부(112), 썸네일 이미지 추출부(120), 썸네일 선호도 조사부(130), 썸네일 이미지 선호도 분석부(140), 썸네일 이미지 데이터베이스부(150), 제어부(160), 카피라이팅 분석 생성부(170), 백그라운드 이미징 분석 생성부(180) 및 썸네일 결합부(190)를 포함한다.The YouTube thumbnail automatic completion service device 100 according to an embodiment of the present invention includes a video receiving unit 110, a video input unit 111, a scene change detection unit 112, a thumbnail image extracting unit 120, and a thumbnail preference research unit 130. ), a thumbnail image preference analysis unit 140, a thumbnail image database unit 150, a control unit 160, a copywriting analysis generation unit 170, a background imaging analysis generation unit 180, and a thumbnail combination unit 190. do.

영상 수신부(110)는 외부 서버(20)와 인터넷과 연결하여 접속하고, 외부 서버(20)와 인터넷으로부터 영상 정보를 수신하는 통신 인터페이스를 제공한다.The video receiver 110 connects to the external server 20 and the Internet and provides a communication interface for receiving video information from the external server 20 and the Internet.

썸네일 이미지 추출부(120)는 구독자수가 기설정된 기준 구독자수 이상인 유튜브 채널을 외부 서버 또는 인터넷상에서 검색하고, 검색된 유튜브 채널의 썸네일 이미지를 유튜브 영상 스크린샷 프레임으로 캡쳐하여 추출하며, 추출한 유튜브 영상 스크린샷 프레임을 썸네일 이미지 데이터베이스부(150)에 저장한다. 구독자수를 기준으로 썸네일 이미지를 추출하는 이유는 사람들의 선호도가 높은 썸네일 이미지를 분석하기 위한 것이다.The thumbnail image extraction unit 120 searches an external server or the Internet for YouTube channels whose number of subscribers is greater than a preset standard number of subscribers, captures and extracts the thumbnail image of the searched YouTube channel as a YouTube video screenshot frame, and extracts the extracted YouTube video screenshot. The frame is stored in the thumbnail image database unit 150. The reason for extracting thumbnail images based on the number of subscribers is to analyze thumbnail images that people prefer.

썸네일 선호도 조사부(130)는 구독자수가 기설정된 기준 구독자수 이상인 유튜브 채널 중에서 해당 유튜브 채널이 다루는 콘텐츠 종류가 일치하는 유튜브 채널을 분류한다.The thumbnail preference research unit 130 classifies YouTube channels that match the type of content handled by the YouTube channel among YouTube channels with the number of subscribers exceeding a preset standard.

썸네일 선호도 조사부(130)는 분류한 유튜브 채널 정보에 대하여 사용자가 직접 입력하여 선택하는 형태의 설문 조사 요청 정보를 생성하여 인터넷상으로 전송한다. 여기서, 설문 조사 요청 정보는 여러 개의 유튜브 채널 정보를 포함하고, 유튜브 채널에 대한 선호 또는 비선호를 선택하는 입력창을 포함한다.The thumbnail preference research unit 130 generates survey request information in a form in which the user directly inputs and selects the classified YouTube channel information and transmits it over the Internet. Here, the survey request information includes information on multiple YouTube channels, and includes an input window for selecting preference or disfavor for the YouTube channel.

썸네일 선호도 조사부(130)는 인터넷으로부터 설문 조사 요청 정보에 대응하는 응답 신호를 수신하여 저장하고, 응답 신호에 따라 유튜브 채널의 선호 개수와 비선호 개수를 유튜브 채널별로 누적 계산하여 저장한다.The thumbnail preference research unit 130 receives and stores a response signal corresponding to the survey request information from the Internet, and cumulatively calculates and stores the number of preferred and non-preferred YouTube channels for each YouTube channel according to the response signal.

썸네일 이미지 선호도 분석부(140)는 분류한 유튜브 채널 정보에 속하는 각 콘텐츠의 조회수(Views)를 수신하고, 각 콘텐츠의 조회수, 유튜브 채널의 선호 개수와 비선호 개수를 이용하여 수학식 1를 통해 선호도 지수를 계산하고, 계산한 선호도 지수를 기설정된 기준 선호도 지수 이상인지 판단한다.The thumbnail image preference analysis unit 140 receives the number of views of each content belonging to the classified YouTube channel information, and calculates the preference index through Equation 1 using the number of views of each content and the number of preferences and dislikes of the YouTube channel. Calculate and determine whether the calculated preference index is greater than or equal to the preset standard preference index.

썸네일 이미지 선호도 분석부(140)는 선호도 지수가 기설정된 기준 선호도 지수 이상인 콘텐츠의 썸네일 이미지(유튜브 영상 스크린샷 프레임)를 추출하여 썸네일 이미지 데이터베이스부(150)로 전송한다.The thumbnail image preference analysis unit 140 extracts a thumbnail image (YouTube video screenshot frame) of content whose preference index is greater than or equal to a preset standard preference index and transmits it to the thumbnail image database unit 150.

여기서, N은 콘텐츠 종류가 일치하는 유튜브 채널의 총 개수이고, i는 콘텐츠 종류가 일치하는 유튜브 채널들의 순번을 나타내고, PI는 특정 유튜브 채널에 대한 선호도를 나타내는 선호도 지수(Preference Index)이고, W는 유튜브 채널이 다루는 콘텐츠 종류에 따라 다르게 설정된 가중치이고, V는 특정 유튜브 채널의 콘텐츠에 대한 조회수(Views)이고, P는 해당 유튜브 채널의 선호 개수(Preference Number)를 나타내며, NP는 해당 유튜브 채널의 비선호 개수(Non-Preference Number)를 나타낸다. 콘텐츠 종류에 따라 사람들이 많이 관심을 가지는 주제가 있고, 관심을 가지지 않는 주제가 있으므로 가중치를 다르게 설정한다.Here, N is the total number of YouTube channels with matching content types, i represents the order of YouTube channels with matching content types, PI is the preference index indicating preference for a specific YouTube channel, and W is It is a weight set differently depending on the type of content handled by the YouTube channel, V is the number of views for the content of a specific YouTube channel, P represents the preference number of the YouTube channel, and NP is the non-preference of the YouTube channel. Indicates the number (Non-Preference Number). Depending on the type of content, there are topics that people are interested in and topics that people are not interested in, so the weights are set differently.

썸네일 이미지 데이터베이스부(150)는 썸네일 이미지 추출부(120) 또는 썸네일 이미지 선호도 분석부(140)로부터 15,000장 이상의 썸네일 이미지(유튜브 영상 스크린샷 프레임)를 수신하여 저장하고, 데이터 저장 시, 썸네일 이미지를 썸네일 유형별로 분류하고, 썸네일 유형별로 분류한 유튜브 영상 스크린샷 프레임을 데이터베이스화하여 저장되며, 유튜브 영상 스크린샷 프레임에서 최적 썸네일 배경 이미지와 텍스트 정보를 추출한다.The thumbnail image database unit 150 receives and stores more than 15,000 thumbnail images (YouTube video screenshot frames) from the thumbnail image extraction unit 120 or the thumbnail image preference analysis unit 140, and when saving data, the thumbnail images are The YouTube video screenshot frames classified by thumbnail type are stored in a database, and the optimal thumbnail background image and text information are extracted from the YouTube video screenshot frames.

썸네일 이미지 데이터베이스부(150)는 썸네일 유형 분류부(151), 객체 이미지 추출부(152), 객체 이미지 분석부(153), 객체 이미지 저장부(154), 텍스트 검출부(155), 텍스트 분석부(156) 및 텍스트 정보 저장부(157)를 포함한다.The thumbnail image database unit 150 includes a thumbnail type classification unit 151, an object image extraction unit 152, an object image analysis unit 153, an object image storage unit 154, a text detection unit 155, and a text analysis unit ( 156) and a text information storage unit 157.

썸네일 유형 분류부(151)는 썸네일 유형을 사용자가 편하게 요청할 수 있는 제1 썸네일, 영상의 조회수를 높여주는 제2 썸네일, 영상 내용을 효과적으로 표현하는 제3 썸네일, 시청자가 클릭하고 싶은 제4 썸네일, 같은 주제의 다른 영상 썸네일보다 차별성 있는 제5 썸네일, 사용자가 빠르게 결과를 받을 수 있는 제6 썸네일로 구분한다. 물론, 썸네일 유형은 추가하거나 변경할 수도 있다.The thumbnail type classification unit 151 divides the thumbnail type into a first thumbnail that the user can conveniently request, a second thumbnail that increases the number of views of the video, a third thumbnail that effectively expresses the video content, a fourth thumbnail that the viewer wants to click, and It is divided into the 5th thumbnail, which is more differentiated than other video thumbnails of the same topic, and the 6th thumbnail, which allows users to quickly receive results. Of course, thumbnail types can be added or changed.

썸네일 유형 분류부(151)는 썸네일 유형별로 유튜브 영상 스크린샷 프레임 내의 객체의 위치, 크기, 개수, 구도 템플릿으로 이루어진 기준 객체 정보를 설정하고, 썸네일 유형별로 유튜브 영상 스크린샷 프레임 내의 텍스트 내용, 크기, 위치로 이루어진 기준 텍스트 정보를 설정하고 있다.The thumbnail type classification unit 151 sets standard object information consisting of the location, size, number, and composition template of objects within the YouTube video screenshot frame for each thumbnail type, and sets the text content, size, and content within the YouTube video screenshot frame for each thumbnail type. Setting reference text information consisting of location.

썸네일 유형 분류부(151)는 기준 객체 정보와 기준 텍스트 정보를 이용하여 각각의 유튜브 영상 스크린샷 프레임에 대응하는 썸네일 유형을 분류한다.The thumbnail type classification unit 151 classifies the thumbnail type corresponding to each YouTube video screenshot frame using reference object information and reference text information.

객체 이미지 추출부(152)는 썸네일 유형별로 유튜브 영상 스크린샷 프레임(썸네일 이미지) 내에서 객체를 추출할 수 있다.The object image extraction unit 152 can extract objects within YouTube video screenshot frames (thumbnail images) for each thumbnail type.

객체 이미지 추출부(152)는 각각의 썸네일 이미지에서 N×N 픽셀(Pixel) 크기로 구성되는 매크로 블록으로 분할하고, 분할한 매크로 블록에 대해서 경계선을 표시하는 흑백 영상으로 변환한다.The object image extraction unit 152 divides each thumbnail image into macroblocks of N×N pixel size and converts the divided macroblocks into a black-and-white image indicating a boundary line.

객체 이미지 추출부(152)는 복수의 매크로 블록에 대하여 에지 검출 알고리즘을 이용하여 각각의 썸네일 이미지 프레임을 에지 영상으로 변환한다.The object image extractor 152 converts each thumbnail image frame into an edge image using an edge detection algorithm for a plurality of macro blocks.

에지 검출 알고리즘은 경계선의 픽셀값이 기설정된 기준 픽셀값 이상으로 큰 차이로 발견되는 구간만 에지로 판별하는 것을 의미한다. 다른 실시예로서 에지 검출 알고리즘은 소벨 연산 방법을 이용하여 픽셀 사이의 밝기값의 변화율을 이용하여 에지를 검출할 수 있다.The edge detection algorithm means that only the section in which the pixel value of the boundary line is found to have a larger difference than the preset reference pixel value is identified as an edge. As another embodiment, the edge detection algorithm may detect edges using the rate of change of brightness values between pixels using the Sobel operation method.

에지 검출 알고리즘은 썸네일 이미지에 대하여 영상 속 인물의 경계선을 추출할 수 있는 에지 영상이면 어떠한 방법도 가능하다.The edge detection algorithm can use any method for the thumbnail image as long as it is an edge image that can extract the boundary line of the person in the video.

객체 이미지 추출부(152)는 심층 신경망(Deep Neural Networks, DNN), 컨볼루션 신경망 (Convolutional deep Neural Networks, CNN), 순환 신경망(Reccurent Neural Network, RNN) 및 심층 신뢰 신경망(Deep Belief Networks, DBN) 중 어느 하나의 신경망을 이용하여 썸네일 이미지 프레임의 특징맵을 추출한다.The object image extraction unit 152 uses deep neural networks (DNN), convolutional deep neural networks (CNN), recurrent neural networks (RNN), and deep belief networks (DBN). Extract the feature map of the thumbnail image frame using one of the neural networks.

객체 이미지 추출부(152)는 딥러닝(Deep learning)을 기반으로 학습부에 의하여 이미 학습이 완료된 모델을 이용하여서 특징맵을 생성할 수 있다. 딥러닝은 여러 비선형 변환기법의 조합을 통해 높은 수준의 추상화(Abstractions, 다량의 데이터나 복잡한 자료들 속에서 핵심적인 내용 또는 기능을 요약하는 작업)를 시도하는 기계학습(Machine Learning) 알고리즘의 집합으로 정의된다.The object image extraction unit 152 can generate a feature map using a model that has already been trained by the learning unit based on deep learning. Deep learning is a set of machine learning algorithms that attempt high-level abstraction (summarizing key content or functions in large amounts of data or complex data) through a combination of various non-linear transformation techniques. is defined.

객체 이미지 추출부(152)는 유튜브 영상 스크린샷 프레임에서 객체가 존재할 것으로 추정되는 영역을 추출하고, 추출된 영역으로부터 특징을 나타내는 특징맵을 추출한다.The object image extraction unit 152 extracts an area where an object is estimated to exist from a YouTube video screenshot frame and extracts a feature map representing features from the extracted area.

여기서, 객체는 인물, 동물, 무생물(건물, 자전거 등)을 모두 포함하는 개념이다.Here, an object is a concept that includes people, animals, and inanimate objects (buildings, bicycles, etc.).

객체 이미지 추출부(152)는 추출한 특징맵을 기초로 썸네일 이미지에서 객체의 존재가 추정되는 적어도 하나의 영역을 추출한다. 영역을 추출하는 방법은 예를 들어 faster RCNN, SSD(Single Shot MultiBox Detector), YOLO(You Only Look Once) 등이 있을 수 있다.The object image extraction unit 152 extracts at least one area in which the presence of an object is estimated from the thumbnail image based on the extracted feature map. Methods for extracting the region may include, for example, faster RCNN, SSD (Single Shot MultiBox Detector), YOLO (You Only Look Once), etc.

객체 이미지 추출부(152)는 유튜브 영상 스크린샷 프레임에서 객체를 분할하는 딥러닝 모델을 이용하여 객체를 감지한다.The object image extraction unit 152 detects objects using a deep learning model that segments objects in YouTube video screenshot frames.

도 2에 도시된 바와 같이, 객체 이미지 분석부(153)는 객체 이미지 추출부(152)에서 감지한 객체를 분석하여 유튜브 영상 스크린샷 프레임(썸네일 이미지) 내에서의 객체 크기, 위치, 구도 템플릿을 판단하고, 객체 크기, 위치, 구도 템플릿을 포함한 영상 스크린샷 프레임을 썸네일 배경 이미지로 객체 이미지 저장부(154)에 저장한다.As shown in FIG. 2, the object image analysis unit 153 analyzes the object detected by the object image extraction unit 152 and determines the object size, position, and composition template within the YouTube video screenshot frame (thumbnail image). After making the determination, the video screenshot frame including the object size, position, and composition template is stored in the object image storage unit 154 as a thumbnail background image.

텍스트 검출부(155)는 썸네일 유형별로 유튜브 영상 스크린샷 프레임(썸네일 이미지) 내에서 텍스트 영역을 추출할 수 있다.The text detection unit 155 can extract a text area within a YouTube video screenshot frame (thumbnail image) for each thumbnail type.

텍스트 검출부(155)는 썸네일 이미지 데이터베이스부(150)에 저장된 썸네일 이미지 내에 텍스트에 해당되는 영역(텍스트 검출 영역)을 검출할 수 있다.The text detection unit 155 may detect an area (text detection area) corresponding to text within the thumbnail image stored in the thumbnail image database unit 150.

텍스트 검출부(155)는 이미지 기반 텍스트 번역 기능을 제공하는 것으로, OCR(Optical Character Reader)과 같은 문자 인식 기술을 바탕으로 이미지 내에서 번역 대상이 되는 텍스트 부분을 검출할 수 있다.The text detection unit 155 provides an image-based text translation function and can detect text parts to be translated within the image based on character recognition technology such as OCR (Optical Character Reader).

텍스트 검출부(155)는 유튜브 영상 스크린샷 프레임 내에서 텍스트 부분을 사각형 박스 영역으로 지정할 수 있으며, 이때 텍스트 검출 영역에 대한 이미지 상의 좌표값을 포함하는 텍스트 박스 정보를 출력할 수 있다. 예를 들어, 텍스트 박스 정보는 좌측 상단 좌표, 우측 상단 좌표, 좌측 하단 좌표, 우측 하단 좌표를 포함할 수 있다.The text detection unit 155 can designate a text portion within a YouTube video screenshot frame as a rectangular box area, and at this time, can output text box information including coordinate values on the image for the text detection area. For example, text box information may include upper left coordinates, upper right coordinates, lower left coordinates, and lower right coordinates.

도 3에 도시된 바와 같이, 텍스트 분석부(156)는 유튜브 영상 스크린샷 프레임 내에서 추출한 텍스트 영역을 분석하여 텍스트 형태, 크기, 위치를 판단하고, 유튜브 영상 스크린샷 프레임에 포함된 텍스트 정보를 생성하여 텍스트 정보 저장부(157)에 저장한다.As shown in FIG. 3, the text analysis unit 156 analyzes the text area extracted from the YouTube video screenshot frame, determines the text shape, size, and location, and generates text information included in the YouTube video screenshot frame. and stored in the text information storage unit 157.

제어부(160)는 썸네일 유형에 대응하는 객체 이미지 저장부(154)에 저장된 썸네일 배경 이미지와, 텍스트 정보 저장부(157)에 저장된 텍스트 정보를 결합한 최종 썸네일 이미지를 생성한다.The control unit 160 generates a final thumbnail image by combining the thumbnail background image stored in the object image storage unit 154 corresponding to the thumbnail type and the text information stored in the text information storage unit 157.

제어부(160)는 썸네일 유형에 대응하는 최종 썸네일 이미지 내의 객체의 위치, 크기, 개수, 구도와, 텍스트 내용, 크기, 위치를 최종 썸네일 이미지와 함께 썸네일 이미지 데이터베이스부(150)에 저장한다.The control unit 160 stores the location, size, number, composition, text content, size, and position of objects in the final thumbnail image corresponding to the thumbnail type together with the final thumbnail image in the thumbnail image database unit 150.

여기서, 텍스트 내용은 유튜브 채널의 콘텐츠에 포함된 유튜브 영상 키워드, 커머셜 카피, 트위터 트렌드 토픽을 나타낸다.
도 5에 도시된 바와 같이, 인공 신경망 엔진부(200)는 카피라이팅 분석 생성부(170)와 백그라운드 이미징 분석 생성부(180)를 포함한다.Here, the text content represents YouTube video keywords, commercial copy, and Twitter trend topics included in the YouTube channel content.
As shown in FIG. 5, the artificial neural network engine unit 200 includes a copywriting analysis generation unit 170 and a background imaging analysis generation unit 180.

삭제delete

제어부(160)는 텍스트 정보 저장부(157)에 저장된 썸네일 유형과 텍스트 내용(유튜브 영상 키워드)을 카피라이팅 분석 생성부(170)로 입력하고, 카피라이팅 분석 생성부(170)로부터 텍스트 내용을 이용하여 썸네일 유형에 대응하는 썸네일 텍스트(썸네일 카피)를 출력한다.The control unit 160 inputs the thumbnail type and text content (YouTube video keywords) stored in the text information storage unit 157 to the copywriting analysis generation unit 170, and uses the text content from the copywriting analysis generation unit 170. to output thumbnail text (thumbnail copy) corresponding to the thumbnail type.

카피라이팅 분석 생성부(170)는 유튜브 영상 키워드에 따른 썸네일 텍스트를 생성하기 위하여 썸네일 텍스트 생성 인공지능 알고리즘을 인공 신경망 엔진부에 적용하며, 제1 인공 신경 처리망(210) 및 제1 학습부(211)를 더 포함한다.The copywriting analysis generation unit 170 applies a thumbnail text generation artificial intelligence algorithm to the artificial neural network engine unit to generate thumbnail text according to YouTube video keywords, and the first artificial neural processing network 210 and the first learning unit ( 211) is further included.

카피라이팅 분석 생성부(170)는 제어부(160)로부터 썸네일 유형과 텍스트 내용(유튜브 영상 키워드)을 수신하고, 텍스트 내용을 이용하여 상기 썸네일 유형에 대응하는 썸네일 텍스트(썸네일 카피)를 출력한다.The copywriting analysis generation unit 170 receives the thumbnail type and text content (YouTube video keyword) from the control unit 160, and outputs thumbnail text (thumbnail copy) corresponding to the thumbnail type using the text content.

제어부(160)는 객체 이미지 저장부(154)에 저장된 썸네일 유형과 복수의 유튜브 영상 스크린샷 프레임(썸네일 이미지)을 백그라운드 이미징 분석 생성부(180)로 입력하고, 백그라운드 이미징 분석 생성부(180)로부터 썸네일 유형에 대응하는 최적 썸네일 배경 이미지를 출력한다.The control unit 160 inputs the thumbnail type and a plurality of YouTube video screenshot frames (thumbnail images) stored in the object image storage unit 154 to the background imaging analysis generation unit 180, and Outputs the optimal thumbnail background image corresponding to the thumbnail type.

백그라운드 이미징 분석 생성부(180)는 썸네일 유형에 따른 최적 썸네일 배경 이미지를 생성하기 위하여 썸네일 배경 이미지 추천 인공지능 알고리즘을 인공 신경망 엔진부에 적용하며, 제2 인공 신경 처리망(220) 및 제2 학습부(221)를 더 포함한다.The background imaging analysis generation unit 180 applies a thumbnail background image recommendation artificial intelligence algorithm to the artificial neural network engine unit to generate an optimal thumbnail background image according to the thumbnail type, and the second artificial neural processing network 220 and the second learning unit. It further includes part 221.

백그라운드 이미징 분석 생성부(180)는 제어부(160)로부터 썸네일 유형과 복수의 유튜브 영상 스크린샷 프레임을 수신하고, 썸네일 유형에 대응하는 최적 썸네일 배경 이미지를 출력한다.The background imaging analysis generation unit 180 receives a thumbnail type and a plurality of YouTube video screenshot frames from the control unit 160, and outputs an optimal thumbnail background image corresponding to the thumbnail type.

도 4에 도시된 바와 같이, 썸네일 결합부(190)는 카피라이팅 분석 생성부(170)에서 생성한 썸네일 텍스트와 백그라운드 이미징 분석 생성부(180)에서 생성한 최적 썸네일 배경 이미지를 결합하여 최종 썸네일 이미지를 완성한다.As shown in FIG. 4, the thumbnail combining unit 190 combines the thumbnail text generated by the copywriting analysis generating unit 170 and the optimal thumbnail background image generated by the background imaging analysis generating unit 180 to create a final thumbnail image. Complete .

도 5에 도시된 바와 같이, 본 발명의 실시예에 따른 인공 신경망 엔진부(200)는 제1 인공 신경 처리망(210)을 학습시키는 제1 학습부(211)를 더 포함하며, 제2 인공 신경 처리망(220)을 학습시키는 제2 학습부(221)를 더 포함한다.As shown in FIG. 5, the artificial neural network engine unit 200 according to an embodiment of the present invention further includes a first learning unit 211 that trains the first artificial neural processing network 210, and a second artificial neural processing network 210. It further includes a second learning unit 221 that trains the neural processing network 220.

삭제delete

인공 신경망 엔진부(200)는 입력으로 썸네일 유형과 텍스트 내용(유튜브 영상 키워드)을 입력하고, 출력으로 텍스트 내용을 이용하여 썸네일 유형에 대응하는 썸네일 텍스트를 출력할 수 있다.The artificial neural network engine unit 200 can input a thumbnail type and text content (YouTube video keyword) as input, and output thumbnail text corresponding to the thumbnail type using the text content as an output.

인공 신경망 엔진부(200)는 입력으로 썸네일 유형과 복수의 유튜브 영상 스크린샷 프레임(썸네일 이미지)을 입력하고, 출력으로 썸네일 유형에 대응하는 최적 썸네일 배경 이미지를 출력할 수 있다.The artificial neural network engine unit 200 can input a thumbnail type and a plurality of YouTube video screenshot frames (thumbnail images) as input, and output an optimal thumbnail background image corresponding to the thumbnail type as an output.

제1 학습부(211)는 텍스트 정보 저장부(157)에 저장된 학습 데이터를 이용하여 제1 인공 신경 처리망(210)을 통해 학습시킬 수 있다.The first learning unit 211 can learn through the first artificial neural processing network 210 using learning data stored in the text information storage unit 157.

제2 학습부(221)는 객체 이미지 저장부(154)에 저장된 학습 데이터를 이용하여 제2 인공 신경 처리망(220)을 통해 학습시킬 수 있다.The second learning unit 221 can learn through the second artificial neural processing network 220 using learning data stored in the object image storage unit 154.

제1, 2 학습부(211, 221)는 학습 데이터로부터 특징점을 검출할 수 있으며, 검출한 특징점의 학습 결과에 기초하여 제1, 2 인공 신경 처리망(210, 220)에 적용되는 연결 가중치를 조정할 수 있다.The first and second learning units (211, 221) can detect feature points from the learning data, and determine connection weights applied to the first and second artificial neural processing networks (210, 220) based on the learning results of the detected feature points. It can be adjusted.

제1, 2 학습부(211, 221)는 신경망의 출력층에서 생성된 출력값과 학습 데이터에 대한 원하는 기대값을 비교하여 오류를 계산하고, 오류를 줄이는 방향으로 신경망의 인식 모델에 적용되는 연결 가중치를 조정할 수 있다.The first and second learning units 211 and 221 calculate the error by comparing the output value generated from the output layer of the neural network with the desired expected value for the learning data, and adjust the connection weight applied to the recognition model of the neural network in the direction of reducing the error. It can be adjusted.

제1 학습부(211)는 입력층, 히든층 및 출력층으로 구성되어 텍스트 내용을 이용하여 썸네일 유형에 대응하는 썸네일 텍스트(썸네일 카피)를 출력층으로 출력되도록 제1 인공 신경 처리망(210)을 학습시킬 수 있다.The first learning unit 211 is composed of an input layer, a hidden layer, and an output layer, and uses text content to learn the first artificial neural processing network 210 to output thumbnail text (thumbnail copy) corresponding to the thumbnail type to the output layer. You can do it.

제2 학습부(221)는 입력층, 히든층 및 출력층으로 구성되어 복수의 유튜브 영상 스크린샷 프레임(썸네일 이미지)을 이용하여 썸네일 유형에 대응하는 최적 썸네일 배경 이미지를 출력층으로 출력되도록 제2 인공 신경 처리망(220)을 학습시킬 수 있다.The second learning unit 221 consists of an input layer, a hidden layer, and an output layer, and uses a plurality of YouTube video screenshot frames (thumbnail images) to output the optimal thumbnail background image corresponding to the thumbnail type to the output layer. The processing network 220 can be trained.

제1 학습부(211)는 제어부(160)로부터 수신한 학습 데이터인 썸네일 유형과 텍스트 내용(유튜브 영상 키워드)에서 특징점을 검출하고, 특징점을 이용하여 제1 인공 신경 처리망(210)을 학습시킬 수 있다.The first learning unit 211 detects feature points from the thumbnail type and text content (YouTube video keywords), which are learning data received from the control unit 160, and trains the first artificial neural processing network 210 using the feature points. You can.

다시 말해, 제1 학습부(211)는 썸네일 유형과 텍스트 내용(유튜브 영상 키워드)으로부터 특징 벡터들을 검출하고, 검출된 특징 벡터들을 이용하여 제1 인공 신경 처리망(210)을 학습시킬 수 있다.In other words, the first learning unit 211 can detect feature vectors from the thumbnail type and text content (YouTube video keywords) and train the first artificial neural processing network 210 using the detected feature vectors.

제2 학습부(221)는 제어부(160)로부터 수신한 학습 데이터인 썸네일 유형과 복수의 유튜브 영상 스크린샷 프레임(썸네일 이미지)에서 특징점을 검출하고, 특징점을 이용하여 제2 인공 신경 처리망(220)을 학습시킬 수 있다.The second learning unit 221 detects feature points from the thumbnail type and a plurality of YouTube video screenshot frames (thumbnail images), which are learning data received from the control unit 160, and uses the feature points to generate a second artificial neural processing network (220). ) can be learned.

다시 말해, 제2 학습부(221)는 썸네일 유형과 복수의 유튜브 영상 스크린샷 프레임(썸네일 이미지)으로부터 특징 벡터들을 검출하고, 검출된 특징 벡터들을 이용하여 제2 인공 신경 처리망(220)을 학습시킬 수 있다.In other words, the second learning unit 221 detects feature vectors from the thumbnail type and a plurality of YouTube video screenshot frames (thumbnail images), and learns the second artificial neural processing network 220 using the detected feature vectors. You can do it.

제1, 2 학습부(211, 221)는 제1, 2 인공 신경 처리망(210, 220)을 학습시켜 제1, 2 인공 신경 처리망(210, 220)의 가중치가 조절되어 학습이 완료되면, 제1, 2 인공 신경 처리망(210, 220)이 인공 신경망 엔진부(200)가 될 수 있다.The first and second learning units (211, 221) train the first and second artificial neural processing networks (210, 220), and when the weights of the first and second artificial neural processing networks (210, 220) are adjusted and learning is completed, , the first and second artificial neural processing networks 210 and 220 may become the artificial neural network engine unit 200.

제1 학습부(211)는 제어부(160)로부터 썸네일 유형과 텍스트 내용(유튜브 영상 키워드)으로 이루어진 훈련 세트(Train Set)를 제1 인공 신경 처리망(210)의 학습을 수행하고, 유형별 딥러닝 특징값과 텍스트 내용을 이용하여 썸네일 유형에 대응하는 썸네일 텍스트를 학습한다.The first learning unit 211 performs learning of the first artificial neural processing network 210 on a training set consisting of thumbnail types and text contents (YouTube video keywords) from the control unit 160, and performs deep learning for each type. Learn thumbnail text corresponding to the thumbnail type using feature values and text content.

제1 학습부(211)는 텍스트 내용을 이용하여 썸네일 유형에 대응하는 썸네일 텍스트(썸네일 카피)를 제1 인공 신경 처리망(210)에서 학습한다.The first learning unit 211 learns the thumbnail text (thumbnail copy) corresponding to the thumbnail type from the first artificial neural processing network 210 using text content.

제2 학습부(221)는 복수의 유튜브 영상 스크린샷 프레임(썸네일 이미지)을 이용하여 썸네일 유형에 대응하는 최적 썸네일 배경 이미지를 제2 인공 신경 처리망(220)에서 학습한다.The second learning unit 221 uses a plurality of YouTube video screenshot frames (thumbnail images) to learn the optimal thumbnail background image corresponding to the thumbnail type in the second artificial neural processing network 220.

제1 학습부(211)는 썸네일 유형과 텍스트 내용(유튜브 영상 키워드)으로 이루어진 훈련 세트(Train Set)를 입력 벡터가 되며, 입력층, 은닉층, 출력층을 통과하면, 텍스트 내용을 이용하여 썸네일 유형에 대응하는 썸네일 텍스트(썸네일 카피)를 출력 벡터로 생성하도록 지도 학습을 통해 학습된다.The first learning unit 211 uses a training set consisting of a thumbnail type and text content (YouTube video keywords) as an input vector, and when it passes through the input layer, hidden layer, and output layer, it uses the text content to determine the thumbnail type. It is trained through supervised learning to generate the corresponding thumbnail text (thumbnail copy) as an output vector.

제2 학습부(221)는 썸네일 유형과 복수의 유튜브 영상 스크린샷 프레임(썸네일 이미지)으로 이루어진 훈련 세트(Train Set)를 입력 벡터가 되며, 입력층, 은닉층, 출력층을 통과하면, 복수의 유튜브 영상 스크린샷 프레임(썸네일 이미지)을 이용하여 썸네일 유형에 대응하는 최적 썸네일 배경 이미지를 출력 벡터로 생성하도록 지도 학습을 통해 학습된다.The second learning unit 221 uses a training set consisting of a thumbnail type and a plurality of YouTube video screenshot frames (thumbnail images) as an input vector, and when it passes through the input layer, hidden layer, and output layer, a plurality of YouTube videos are generated. It is learned through supervised learning to use a screenshot frame (thumbnail image) to generate the optimal thumbnail background image corresponding to the thumbnail type as an output vector.

제1, 2 학습부(211, 221)는 객체 이미지 저장부(154)와 텍스트 정보 저장부(157)에 저장된 학습 데이터를 제1, 2 인공 신경 처리망(210, 220)에 입력시켜 정답 데이터가 출력될 수 있게, 즉 손실 함수가 최소값이 되게 제1, 2 인공 신경 처리망(210, 220)의 연결 가중치를 조절하는 역할을 수행하며, 이를 통해 생성된 것이 학습이 완료된 제1, 2 인공 신경 처리망(210, 220)이 된다.The first and second learning units 211 and 221 input the learning data stored in the object image storage unit 154 and the text information storage unit 157 into the first and second artificial neural processing networks 210 and 220 to produce correct answer data. It plays the role of adjusting the connection weights of the first and second artificial neural processing networks (210, 220) so that the loss function can be output, that is, the loss function becomes the minimum value, and the first and second artificial neural processing networks for which learning has been completed are generated through this. It becomes a neural processing network (210, 220).

제1, 2 학습부(211, 221)는 업데이트된 연결 가중치를 가지는 제1, 2 인공 신경 처리망(210, 220)에 학습 데이터, 테스트 데이터 또는 학습에 사용되지 않은 새 데이터가 입력되면, 입력층 - 은닉층 - 폴링 커넥티드 레이어부 - 출력층을 통해 출력된 결과를 획득하여 응답 데이터로 출력할 수 있다.When training data, test data, or new data not used for learning are input to the first and second artificial neural processing networks 210, 220 having updated connection weights, the first and second learning units (211, 221) input Layer - Hidden layer - Polling connected layer - The results output through the output layer can be obtained and output as response data.

제1 학습부(211)는 입력된 텍스트 내용을 이용하여 썸네일 유형에 대응하는 썸네일 텍스트의 예측 결과 여부를 기반으로 최적화를 통해 딥러닝 기반 인식 모델(썸네일 카피 생성 인공지능 알고리즘)을 생성한다.The first learning unit 211 uses the input text content to create a deep learning-based recognition model (thumbnail copy creation artificial intelligence algorithm) through optimization based on whether the thumbnail text corresponding to the thumbnail type is predicted.

학습이 완료된 제1 인공 신경 처리망(210)은 딥러닝 기반 인식 모델을 이용하여 입력된 텍스트 내용을 이용하여 썸네일 유형에 대응하는 썸네일 텍스트를 예측할 수 있다.The first artificial neural processing network 210, which has completed learning, can predict the thumbnail text corresponding to the thumbnail type using the input text content using a deep learning-based recognition model.

제2 학습부(221)는 입력된 썸네일 유형과 복수의 유튜브 영상 스크린샷 프레임(썸네일 이미지)을 이용하여 썸네일 유형에 대응하는 최적 썸네일 배경 이미지의 예측 결과 여부를 기반으로 최적화를 통해 딥러닝 기반 인식 모델(썸네일 배경 이미지 추천 인공지능 알고리즘)을 생성한다.The second learning unit 221 performs deep learning-based recognition through optimization based on the prediction result of the optimal thumbnail background image corresponding to the thumbnail type using the input thumbnail type and a plurality of YouTube video screenshot frames (thumbnail images). Create a model (thumbnail background image recommendation artificial intelligence algorithm).

학습이 완료된 제2 인공 신경 처리망(220)은 딥러닝 기반 인식 모델을 이용하여 입력된 썸네일 유형과 복수의 유튜브 영상 스크린샷 프레임(썸네일 이미지)을 이용하여 썸네일 유형에 대응하는 최적 썸네일 배경 이미지를 예측할 수 있다.The second artificial neural processing network 220, which has completed learning, uses a deep learning-based recognition model to create an optimal thumbnail background image corresponding to the thumbnail type using the input thumbnail type and a plurality of YouTube video screenshot frames (thumbnail images). It is predictable.

제1 인공 신경 처리망(210)은 제1 학습부(211)로부터 신경망의 인식 모델을 생성하고, 제1 학습부(211)로부터 썸네일 유형과 텍스트 내용(유튜브 영상 키워드)을 수신하며, 신경망의 인식 모델을 이용하여 썸네일 유형에 대응하는 썸네일 텍스트를 예측한다.The first artificial neural processing network 210 generates a recognition model of the neural network from the first learning unit 211, receives the thumbnail type and text content (YouTube video keywords) from the first learning unit 211, and Predict thumbnail text corresponding to the thumbnail type using a recognition model.

제2 인공 신경 처리망(220)은 제2 학습부(221)로부터 신경망의 인식 모델을 생성하고, 제2 학습부(221)로부터 썸네일 유형과 복수의 유튜브 영상 스크린샷 프레임(썸네일 이미지)을 수신하며, 신경망의 인식 모델을 이용하여 썸네일 유형에 대응하는 최적 썸네일 배경 이미지를 예측한다.The second artificial neural processing network 220 generates a recognition model of the neural network from the second learning unit 221, and receives a thumbnail type and a plurality of YouTube video screenshot frames (thumbnail images) from the second learning unit 221. And, using a neural network recognition model, the optimal thumbnail background image corresponding to the thumbnail type is predicted.

제1 인공 신경 처리망(210)은 수신한 썸네일 유형과 텍스트 내용(유튜브 영상 키워드)을 양자화하여 특징점들을 추출하고, 추출된 특징점들을 제1 학습부(211)에서 학습된 학습 데이터의 특징 벡터들과 비교하여 썸네일 유형에 대응하는 썸네일 텍스트를 제어부(160)로 전송한다.The first artificial neural processing network 210 extracts feature points by quantizing the received thumbnail type and text content (YouTube video keyword), and converts the extracted feature points into feature vectors of the training data learned in the first learning unit 211. and transmits the thumbnail text corresponding to the thumbnail type to the control unit 160.

제1 인공 신경 처리망(210)은 테스트 데이터인 썸네일 유형과 텍스트 내용(유튜브 영상 키워드)을 딥러닝 기반 인식 모델을 이용하여 응답 데이터의 결과값(썸네일 유형에 대응하는 썸네일 텍스트의 결과)으로 출력한다.The first artificial neural processing network 210 outputs the thumbnail type and text content (YouTube video keyword), which are test data, as the result of response data (the result of the thumbnail text corresponding to the thumbnail type) using a deep learning-based recognition model. do.

제2 인공 신경 처리망(220)은 수신한 썸네일 유형과 복수의 유튜브 영상 스크린샷 프레임(썸네일 이미지)을 양자화하여 특징점들을 추출하고, 추출된 특징점들을 제2 학습부(221)에서 학습된 학습 데이터의 특징 벡터들과 비교하여 썸네일 유형에 대응하는 최적 썸네일 배경 이미지를 제어부(160)로 전송한다.The second artificial neural processing network 220 extracts feature points by quantizing the received thumbnail type and a plurality of YouTube video screenshot frames (thumbnail images), and uses the extracted feature points as learning data learned in the second learning unit 221. The optimal thumbnail background image corresponding to the thumbnail type is transmitted to the control unit 160 by comparing the feature vectors of .

제2 인공 신경 처리망(220)은 테스트 데이터인 썸네일 유형과 복수의 유튜브 영상 스크린샷 프레임(썸네일 이미지)을 딥러닝 기반 인식 모델을 이용하여 응답 데이터의 결과값(썸네일 유형에 대응하는 최적 썸네일 배경 이미지의 결과)으로 출력한다.The second artificial neural processing network 220 uses the thumbnail type, which is test data, and a plurality of YouTube video screenshot frames (thumbnail images) using a deep learning-based recognition model to determine the result of the response data (optimal thumbnail background corresponding to the thumbnail type). output as an image result).

백그라운드 이미징 분석 생성부(180)는 학습 데이터 부분에 영화 스틸샷 이미지를 추가할 수 있다. 여기서, 영화 스틸컷은 시청자 주목도가 높은 이미지일 수 있다.The background imaging analysis generation unit 180 may add a movie still shot image to the learning data portion. Here, a movie still cut may be an image that attracts high viewer attention.

백그라운드 이미징 분석 생성부(180)는 영화 스틸컷 이미지의 라벨링을 통해 인공지능 알고리즘을 개발하여 최적 썸네일 배경 이미지를 추천할 수 있다.The background imaging analysis generation unit 180 can recommend the optimal thumbnail background image by developing an artificial intelligence algorithm through labeling of movie still cut images.

도 6에 도시된 바와 같이, 백그라운드 이미징 분석 생성부(180)는 예를 들어, 영화 스틸컷 이미지를 학습한 상태에서 다량의 유튜브 영상 스크린샷 이미지 프레임의 입력 시 최적 썸네일 배경 이미지를 추천하여 썸네일 텍스트(썸네일 카피)와 결합 시 적용될 수 있다.As shown in FIG. 6, the background imaging analysis generation unit 180 recommends the optimal thumbnail background image when inputting a large amount of YouTube video screenshot image frames, for example, while learning movie still cut images, and generates thumbnail text. Can be applied when combined with (thumbnail copy).

영상 입력부(111)는 유튜브 영상 주소(웹주소)를 입력받는 경우, 영상 프레임을 복수개 획득한다. 여기서, 유튜브 영상 주소는 구독자수가 기설정된 기준 구독자수 이상인 유튜브 채널의 주소를 의미한다.When a YouTube video address (web address) is input, the video input unit 111 obtains a plurality of video frames. Here, the YouTube video address refers to the address of the YouTube channel whose number of subscribers is greater than or equal to the preset standard number of subscribers.

장면 전환 감지부(112)는 각 영상 프레임을 N×N 픽셀(Pixel) 크기로 구성되는 매크로 블록으로 분할하고, 분할한 매크로 블록에 대해서 경계선을 표시하는 흑백 영상으로 변환한다.The scene change detection unit 112 divides each image frame into macro blocks composed of N × N pixels (Pixel) in size, and converts the divided macro blocks into a black and white image that indicates a boundary line.

장면 전환 감지부(112)는 복수의 매크로 블록에 대하여 에지 검출 알고리즘을 이용하여 각각의 영상 프레임을 에지 영상으로 변환한다.The scene change detection unit 112 converts each image frame into an edge image using an edge detection algorithm for a plurality of macro blocks.

장면 전환 감지부(112)는 각각의 영상 프레임을 16×16 픽셀(Pixel) 크기로 구성되는 매크로 블록으로 분할하고, 분할한 매크로 블록에 대해서 경계선을 표시하는 흑백 영상으로 변환하고, 에지 검출 알고리즘을 이용하여 장면 전환을 검출하기 위해서 1/24초 전에 N-1번째 영상 프레임의 에지 영상과 N번째 영상 프레임의 에지 영상을 생성한다.The scene change detection unit 112 divides each image frame into macro blocks with a size of 16 × 16 pixels, converts the divided macro blocks into a black and white image indicating a border, and uses an edge detection algorithm. To detect a scene change, an edge image of the N-1th video frame and an edge image of the Nth video frame are generated 1/24 second ago.

장면 전환 감지부(112)는 N-1번째 영상 프레임과 N번째 영상 프레임의 같은 위치의 매크로 블록 간의 차분값을 계산하고, 매크로 블록 간의 차분값이 기설정된 제1 임계값보다 작은 경우, 유사 매크로 블록으로 판단하고, 매크로 블록 간의 차분값이 기설정된 제1 임계값(T1)보다 큰 경우, 장면이 다른 매크로 블록으로 판단한다.The scene change detection unit 112 calculates the difference value between the macro blocks at the same position in the N-1th image frame and the N-th image frame, and when the difference value between macro blocks is smaller than the preset first threshold, the similar macro It is determined to be a block, and if the difference value between macro blocks is greater than the preset first threshold (T1), the scene is judged to be a different macro block.

장면 전환 감지부(112)는 영상 프레임 내 모든 유사 매크로 블록의 개수를 합산하고, 합산한 유사 매크로 블록의 개수를 기설정된 제2 임계값(T2)보다 크다면, 장면 전환이 이루어진 것으로 판단한다. 물론, 장면 전환 감지부(112)는 장면 전환을 판단하는 알고리즘을 위의 방법 이외에 다양하게 구성할 수 있다.The scene change detection unit 112 adds up the number of all similar macro blocks in the image frame, and if the summed number of similar macro blocks is greater than the preset second threshold T2, it determines that a scene change has occurred. Of course, the scene change detection unit 112 may configure various algorithms for determining scene change other than the above method.

썸네일 이미지 추출부(120)는 장면 전환 감지부(112)에서 장면 전환 지점을 찾아낸 후, 기설정된 장면별 지속 시간을 이용하여 장면 전환이 이루어진 유튜브 영상 스크린샷 프레임만을 캡쳐하여 추출한다.The thumbnail image extraction unit 120 finds the scene change point in the scene change detection unit 112, and then captures and extracts only the YouTube video screenshot frames in which the scene change occurred using the preset duration for each scene.

썸네일 이미지 추출부(120)는 추출한 유튜브 영상 스크린샷 프레임을 썸네일 이미지 데이터베이스부(150)에 저장한다.The thumbnail image extraction unit 120 stores the extracted YouTube video screenshot frame in the thumbnail image database unit 150.

다른 실시예로서, 썸네일 이미지 추출부(120)는 추출한 유튜브 영상 스크린샷 프레임에서 '인기장면'을 포함한 유튜브 영상 스크린샷 프레임을 별도로 추출할 수 있다. 여기서, 인기장면은 유튜브 채널의 동영상에서 이탈한 사용자가 없는 구간의 장면, 즉, 해당 동영상에서 등장하는 사용자들이 모두 포함된 장면을 의미한다.As another embodiment, the thumbnail image extractor 120 may separately extract YouTube video screenshot frames including 'popular scenes' from the extracted YouTube video screenshot frames. Here, a popular scene refers to a scene in a section where no users have left the YouTube channel video, that is, a scene that includes all users appearing in the video.

객체 이미지 분석부(153)는 썸네일 이미지 추출부(120)에서 인기장면을 포함한 유튜브 영상 스크린샷 프레임의 구도 템플릿을 분석하고, 유튜브 영상 스크린샷 프레임 내의 사각형 박스로 최적 구도를 생성하거나 디스플레이부(미도시)에 표시할 수 있다.The object image analysis unit 153 analyzes the composition template of the YouTube video screenshot frame including popular scenes in the thumbnail image extraction unit 120, and generates the optimal composition with a square box within the YouTube video screenshot frame or displays the image on the display unit (Mido). (hour) can be displayed.

객체 이미지 분석부(153)는 구도 템플릿을 생성하는 경우, 생성한 구도 템플릿에서 텍스트 정보가 삽입될 범위(기준점)를 미리 지정할 수 있다.When creating a composition template, the object image analysis unit 153 may specify in advance a range (reference point) into which text information will be inserted in the created composition template.

장면 전환이 이루어진 유튜브 영상 스크린샷 프레임은 제어부(160)의 제어에 따라 썸네일 이미지 데이터베이스부(150)에서 썸네일 유형을 분류하고, 카피라이팅 분석 생성부(170)에서 썸네일 유형에 대응하는 썸네일 텍스트(썸네일 카피)를 출력하고, 백그라운드 이미징 분석 생성부(180)에서 썸네일 유형에 대응하는 최적 썸네일 배경 이미지를 출력하고, 썸네일 결합부(190)에서 썸네일 유형별로 썸네일 텍스트를 최적 썸네일 배경 이미지에 결합하여 최종 썸네일 이미지를 자동 완성한다.The YouTube video screenshot frame with the scene change is classified into thumbnail types in the thumbnail image database unit 150 under the control of the control unit 160, and the copywriting analysis generation unit 170 generates thumbnail text (thumbnail text) corresponding to the thumbnail type. copy), the background imaging analysis generation unit 180 outputs an optimal thumbnail background image corresponding to the thumbnail type, and the thumbnail combination unit 190 combines the thumbnail text for each thumbnail type with the optimal thumbnail background image to create a final thumbnail. Autocomplete the image.

다시 말해, 유튜브 썸네일 자동 완성 서비스 시스템(100)은 유튜브 영상 주소를 입력하면, 썸네일 이미지를 자동 완성하여 표시해주는 서비스를 제공한다.In other words, the YouTube thumbnail auto-complete service system 100 provides a service that automatically completes and displays a thumbnail image when a YouTube video address is entered.

본 명세서의 실시예에 따른 동작은 컴퓨터로 읽을 수 있는 기록매체에 컴퓨터가 읽을 수 있는 프로그램 또는 코드로서 구현하는 것이 가능하다. 컴퓨터가 읽을 수 있는 기록매체는 컴퓨터 시스템에 의해 읽혀질 수 있는 데이터가 저장되는 모든 종류의 기록장치를 포함한다. 또한 컴퓨터가 읽을 수 있는 기록매체는 네트워크로 연결된 컴퓨터 시스템에 분산되어 분산 방식으로 컴퓨터로 읽을 수 있는 프로그램 또는 코드가 저장되고 실행될 수 있다.Operations according to embodiments of the present specification can be implemented as a computer-readable program or code on a computer-readable recording medium. Computer-readable recording media include all types of recording devices that store data that can be read by a computer system. Additionally, computer-readable recording media can be distributed across networked computer systems so that computer-readable programs or codes can be stored and executed in a distributed manner.

실시예가 소프트웨어로 구현될 때, 상술한 기법은 상술한 기능을 수행하는 모듈(과정, 기능 등)로 구현될 수 있다. 모듈은 메모리에 저장되고, 프로세서에 의해 실행될 수 있다. 메모리는 프로세서 내부 또는 외부에 있을 수 있고, 잘 알려진 다양한 수단으로 프로세서와 연결될 수 있다.When the embodiment is implemented in software, the above-described techniques may be implemented as modules (processes, functions, etc.) that perform the above-described functions. Modules are stored in memory and can be executed by a processor. Memory may be internal or external to the processor and may be connected to the processor by a variety of well-known means.

또한, 컴퓨터가 읽을 수 있는 기록매체는 롬(rom), 램(ram), 플래시 메모리(flash memory) 등과 같이 프로그램 명령을 저장하고 수행하도록 특별히 구성된 하드웨어 장치를 포함할 수 있다. 프로그램 명령은 컴파일러(compiler)에 의해 만들어지는 것과 같은 기계어 코드뿐만 아니라 인터프리터(interpreter) 등을 사용해서 컴퓨터에 의해 실행될 수 있는 고급 언어 코드를 포함할 수 있다.Additionally, computer-readable recording media may include hardware devices specially configured to store and execute program instructions, such as ROM, RAM, flash memory, etc. Program instructions may include not only machine language code such as that created by a compiler, but also high-level language code that can be executed by a computer using an interpreter, etc.

본 발명의 일부 측면들은 장치의 문맥에서 설명되었으나, 그것은 상응하는 방법에 따른 설명 또한 나타낼 수 있고, 여기서 블록 또는 장치는 방법 단계 또는 방법 단계의 특징에 상응한다. 유사하게, 방법의 문맥에서 설명된 측면들은 또한 상응하는 블록 또는 아이템 또는 상응하는 장치의 특징으로 나타낼 수 있다. 방법 단계들의 몇몇 또는 전부는 예를 들어, 마이크로프로세서, 프로그램 가능한 컴퓨터 또는 전자 회로와 같은 하드웨어 장치에 의해(또는 이용하여) 수행될 수 있다. 몇몇의 실시예에서, 가장 중요한 방법 단계들의 하나 이상은 이와 같은 장치에 의해 수행될 수 있다.Although some aspects of the invention have been described in the context of an apparatus, it may also refer to a corresponding method description, where a block or device corresponds to a method step or feature of a method step. Similarly, aspects described in the context of a method may also be represented by corresponding blocks or items or features of a corresponding device. Some or all of the method steps may be performed by (or using) a hardware device, such as a microprocessor, programmable computer, or electronic circuit, for example. In some embodiments, one or more of the most important method steps may be performed by such an apparatus.

실시예들에서, 프로그램 가능한 로직 장치(예를 들어, 필드 프로그래머블 게이트 어레이)가 여기서 설명된 방법들의 기능의 일부 또는 전부를 수행하기 위해 사용될 수 있다. 실시예들에서, 필드 프로그래머블 게이트 어레이는 여기서 설명된 방법들 중 하나를 수행하기 위한 마이크로프로세서와 함께 작동할 수 있다. 일반적으로, 방법들은 어떤 하드웨어 장치에 의해 수행되는 것이 바람직하다.In embodiments, a programmable logic device (e.g., a field programmable gate array) may be used to perform some or all of the functionality of the methods described herein. In embodiments, a field programmable gate array may operate in conjunction with a microprocessor to perform one of the methods described herein. In general, the methods are preferably performed by some hardware device.

상기에서는 본 발명의 바람직한 실시예를 참조하여 설명하였지만, 해당 기술 분야의 숙련된 당업자는 하기의 특허 청구의 범위에 기재된 본 발명의 사상 및 영역으로부터 벗어나지 않는 범위 내에서 본 발명을 다양하게 수정 및 변경시킬 수 있음을 이해할 수 있을 것이다.Although the present invention has been described above with reference to preferred embodiments, those skilled in the art may make various modifications and changes to the present invention without departing from the spirit and scope of the present invention as set forth in the claims below. You will understand that you can do it.

100: 유튜브 썸네일 자동 완성 서비스 시스템 110: 영상 수신부
111: 영상 입력부 112: 장면 전환 감지부
120: 썸네일 이미지 추출부 130: 썸네일 선호도 조사부
140: 썸네일 이미지 선호도 분석부 150: 썸네일 이미지 데이터베이스부
151: 썸네일 유형 분류부 152: 객체 이미지 추출부
153: 객체 이미지 분석부 154: 객체 이미지 저장부
155: 텍스트 검출부 156: 텍스트 분석부
157: 텍스트 정보 저장부 160: 제어부
170: 카피라이팅 분석 생성부 180: 백그라운드 이미징 분석 생성부
190: 썸네일 결합부 200: 인공 신경망 엔진부
210: 제1 인공 신경 처리망 211: 제1 학습부
220: 제2 인공 신경 처리망 221: 제2 학습부100: YouTube thumbnail auto-complete service system 110: Video receiver
111: video input unit 112: scene change detection unit
120: Thumbnail image extraction unit 130: Thumbnail preference research unit
140: Thumbnail image preference analysis unit 150: Thumbnail image database unit
151: Thumbnail type classification unit 152: Object image extraction unit
153: object image analysis unit 154: object image storage unit
155: text detection unit 156: text analysis unit
157: text information storage unit 160: control unit
170: Copywriting analysis generation unit 180: Background imaging analysis generation unit
190: Thumbnail coupling part 200: Artificial neural network engine part
210: first artificial neural processing network 211: first learning unit
220: Second artificial neural processing network 221: Second learning unit

Claims

A thumbnail image extraction unit that searches an external server or the Internet for a YouTube channel with a number of subscribers equal to or greater than a preset standard, and captures and extracts the thumbnail image of the searched YouTube channel as a YouTube video screenshot frame;
Classifies a plurality of YouTube video screenshot frames received from the thumbnail image extraction unit by thumbnail type, converts and stores the YouTube video screenshot frames classified by thumbnail type into a database, and stores them in a YouTube video screenshot frame for each thumbnail type. an object image extraction unit that extracts an object from the object image extraction unit, an object image analysis unit that analyzes the object detected by the object image extraction unit and generates a thumbnail background image included in the video screenshot frame, and a YouTube video screen for each thumbnail type. A thumbnail image database unit consisting of a text detection unit that extracts a text area within the shot frame and a text analysis unit that analyzes the extracted text area to generate text information included in the YouTube video screenshot frame;
A control unit that generates a final thumbnail image by combining the thumbnail background image generated by the object image analysis unit and the text information generated by the text analysis unit;
A copywriting analysis generation unit that receives a thumbnail type and text content (YouTube video keyword) from the control unit and outputs a thumbnail text (thumbnail copy) corresponding to the thumbnail type using the text content;
a background imaging analysis generation unit that receives a thumbnail type and a plurality of YouTube video screenshot frames from the control unit and outputs an optimal thumbnail background image corresponding to the thumbnail type; and
YouTube including a YouTube thumbnail auto-completion service device having a thumbnail combination unit that combines the thumbnail text generated by the copywriting analysis generation unit and the optimal thumbnail background image generated by the background imaging analysis generation unit to complete a final thumbnail image. Thumbnail auto-complete service system.

delete

In claim 1,
The thumbnail image database unit sets reference object information consisting of the location, size, number, and composition of objects within the YouTube video screenshot frame for each thumbnail type, and sets text content, size, and position within the YouTube video screenshot frame for each thumbnail type. A YouTube thumbnail auto-complete service system that sets reference text information consisting of and further includes a thumbnail type classification unit that classifies thumbnail types corresponding to each YouTube video screenshot frame using the reference object information and the reference text information.

In claim 1,
The YouTube thumbnail auto-complete service device,
A video input unit that obtains a plurality of video frames when a YouTube video address (web address) is input; and
It further includes a scene change detection unit that determines a scene change of the image frame by calculating a difference value between macro blocks composed of N × N pixel size for each image frame,
The thumbnail image extractor finds the scene change point in the scene change detection unit, then captures and extracts a YouTube video screenshot frame in which the scene change occurred using a preset duration for each scene,
The YouTube video screenshot frame in which the scene change occurred is classified into a thumbnail type in the thumbnail image database unit under the control of the control unit, and the copywriting analysis generation unit outputs a thumbnail text (thumbnail copy) corresponding to the thumbnail type. The background imaging analysis generation unit outputs an optimal thumbnail background image corresponding to the thumbnail type, and the thumbnail combination unit automatically generates the final thumbnail image by combining the thumbnail text with the optimal thumbnail background image for each thumbnail type. YouTube thumbnail auto-complete service system.