KR20210023690A

KR20210023690A - Apparatus for measuring a contribution of content and method thereof

Info

Publication number: KR20210023690A
Application number: KR1020200093578A
Authority: KR
Inventors: 박현준
Original assignee: 박현준
Priority date: 2019-08-23
Filing date: 2020-07-28
Publication date: 2021-03-04
Also published as: KR20210023916A; KR102480117B1

Abstract

The present invention relates to a method for measuring contribution of content to identify a bad user and an apparatus thereof. According to one embodiment of the present invention, the method executed by a computing device comprises the following steps: determining, by a banned content identification module, whether the content is banned from a bulletin board, wherein the prohibited content identification module is a module for determining whether the content is banned on the bulletin board by using data extracted from the content; determining, by a bad user identification module, whether a user who created the content is a bad user when it is determined that the content is banned on the bulletin board, and determining, by a contribution measurement module, determining contribution of the content when it is determined that the content is allowed on the bulletin board, wherein the bad user identification module is a module for determining whether the user is a bad user by using content creation user log data and the contribution measurement module is a module for measuring the contribution of the content by the content page log data; and giving a penalty given to the user when the user is determined as a bad user by the bad user identification module, and measuring the contribution of the content by the contribution measurement module when the user is determined as a normal user by the bad user identification module.

Description

Method and device for measuring content contribution {APPARATUS FOR MEASURING A CONTRIBUTION OF CONTENT AND METHOD THEREOF}

본 발명은 컨텐츠의 기여도를 측정하는 방법 및 그 장치에 관한 것이다. 보다 구체적으로, 머신 러닝을 이용하여 컨텐츠에 관한 로그 데이터에 대한 정량적 평가를 도출하고, 이를 기초로 기여도를 측정하는 방법 및 그 장치에 관한 것이다.The present invention relates to a method and an apparatus for measuring the contribution of content. More specifically, it relates to a method and an apparatus for deriving a quantitative evaluation of log data on content using machine learning, and measuring a contribution degree based on this.

온라인 커뮤니티란, 온라인 상에서 공통의 관심사나 환경을 가진 이들이 소통하는 공간을 의미한다. 온라인 커뮤니티를 이용하는 유저들은 커뮤니티에 마련된 게시판에 컨텐츠를 업로드하거나 업로드된 컨텐츠를 열람한다.An online community is a space where people with common interests or environments communicate online. Users who use the online community upload content to the bulletin board provided in the community or browse the uploaded content.

이러한 온라인 커뮤니티에서, 온라인 커뮤니티 관리자가 온라인 커뮤니티를 관리하여 발생한 수익을 컨텐츠를 업로드한 유저에게 분배하는 방법이 요구된다. 종래의 수익 분배 방법은, 단순히 조회수에 기반하거나 고정 수수료를 이용한 방법에 한정되어 있다.In such an online community, there is a need for a method in which the online community manager distributes the profits generated by managing the online community to users who upload the content. The conventional profit distribution method is limited to a method based on simply number of views or using a fixed fee.

조회수를 이용한 수익 분배 방법은, 부정 클릭, 매크로 및 트래픽 공격에 의한 오차를 배제할 수 없으며, 고정 수수료를 이용한 방법은, 각각의 유저마다 수수료를 환산하기 위한 별도의 인적 감정 절차를 요구하여 상당한 인건비가 소모된다.The profit distribution method using views cannot exclude errors due to fraudulent clicks, macros, and traffic attacks, and the method using a fixed fee requires a separate personal evaluation procedure to convert the fee for each user, resulting in considerable labor costs. Is consumed.

본 발명이 해결하고자 하는 기술적 과제는, 컨텐츠의 기여도를 측정하는 방법 및 그 방법이 적용된 장치를 제공하는 것이다.The technical problem to be solved by the present invention is to provide a method of measuring the contribution of content and an apparatus to which the method is applied.

본 발명이 해결하고자 하는 다른 기술적 과제는, 금지 컨텐츠를 식별하는 방법 및 그 방법이 적용된 장치를 제공하는 것이다.Another technical problem to be solved by the present invention is to provide a method for identifying prohibited content and an apparatus to which the method is applied.

본 발명이 해결하고자 하는 또 다른 기술적 과제는, 불량 유저를 식별하는 방법 및 그 방법이 적용된 장치를 제공하는 것이다.Another technical problem to be solved by the present invention is to provide a method for identifying a bad user and an apparatus to which the method is applied.

본 발명이 해결하고자 하는 또 다른 기술적 과제는, 컨텐츠의 기여도를 측정하는 인공 지능 기반의 모델을 생성하는 방법 및 그 장치를 제공하는 것이다.Another technical problem to be solved by the present invention is to provide a method and apparatus for generating an artificial intelligence-based model for measuring the contribution of content.

본 발명이 해결하고자 하는 다른 기술적 과제는, 금지 컨텐츠를 식별하는 인공 지능 기반의 모델을 생성하는 방법 및 그 장치를 제공하는 것이다.Another technical problem to be solved by the present invention is to provide a method and apparatus for generating an artificial intelligence-based model for identifying prohibited content.

본 발명이 해결하고자 하는 또 다른 기술적 과제는, 불량 유저를 식별하는 인공 지능 기반의 모델을 생성하는 방법 및 그 방법이 적용된 장치를 제공하는 것이다.Another technical problem to be solved by the present invention is to provide a method of generating an artificial intelligence-based model for identifying a bad user and an apparatus to which the method is applied.

본 발명이 해결하고자 하는 또 다른 기술적 과제는, 컨텐츠에 관한 로그 데이터를 기초로 결정된 기여도를 이용하여, 웹서비스에서 발생한 수익에 대한 공정한 분배 방법 및 그 방법이 적용된 장치를 제공하는 것이다.Another technical problem to be solved by the present invention is to provide a fair distribution method for profits generated from a web service and an apparatus to which the method is applied, using a contribution degree determined based on log data related to contents.

본 발명의 기술적 과제들은 이상에서 언급한 기술적 과제들로 제한되지 않으며, 언급되지 않은 또 다른 기술적 과제들은 아래의 기재로부터 통상의 기술자에게 명확하게 이해될 수 있을 것이다.The technical problems of the present invention are not limited to the technical problems mentioned above, and other technical problems that are not mentioned will be clearly understood by those skilled in the art from the following description.

상기 기술적 과제를 해결하기 위한 본 발명의 일 실시예에 따른 기여도 측정 방법은, 컴퓨팅 장치에 의해 수행되는 방법에 있어서, 금지 컨텐츠 식별 모듈에 의해 컨텐츠가 게시판에서 금지되는지 여부를 결정하되, 상기 금지 컨텐츠 식별 모듈은 상기 컨텐츠에서 추출된 데이터를 이용하여 상기 컨텐츠가 게시판에서 금지되는지 여부를 결정하는 모듈인, 단계, 상기 컨텐츠가 상기 게시판에서 금지되는 것으로 결정된 경우, 불량 유저 식별 모듈에 의해 상기 컨텐츠를 작성한 유저가 불량 유저인지 여부를 결정하고, 상기 컨텐츠가 상기 게시판에서 허용되는 것으로 결정된 경우, 기여도 측정 모듈에 의해 상기 컨텐츠의 기여도를 결정하되, 상기 불량 유저 식별 모듈은 컨텐츠 작성 유저 로그 데이터를 이용하여 상기 유저가 불량 유저인지 여부를 결정하는 모듈이고, 상기 기여도 측정 모듈은 컨텐츠 페이지 로그 데이터를 이용하여 상기 컨텐츠의 기여도를 측정하는 모듈인, 단계 및 상기 불량 유저 식별 모듈에 의해 사용자가 불량 유저로 결정된 경우, 상기 사용자에게 페널티를 부여하고, 상기 불량 유저 식별 모듈에 의해 상기 사용자가 정상 유저로 결정된 경우, 상기 기여도 측정 모듈에 의해 상기 컨텐츠의 기여도를 결정하는 단계를 포함할 수 있다.In the method of measuring contribution according to an embodiment of the present invention for solving the technical problem, in a method performed by a computing device, determining whether or not content is prohibited on a bulletin board by a prohibited content identification module, the prohibited content The identification module is a module that determines whether the content is prohibited on the bulletin board by using the data extracted from the content. When it is determined that the content is prohibited on the bulletin board, the content is created by a bad user identification module. Determines whether the user is a bad user, and when it is determined that the content is allowed in the bulletin board, the contribution level of the content is determined by a contribution measurement module, and the bad user identification module uses the content creation user log data A module that determines whether the user is a bad user, and the contribution measurement module is a module that measures the contribution of the content using content page log data, and when the user is determined as a bad user by the bad user identification module , When the user is determined to be a normal user by giving a penalty to the user and by the bad user identification module, determining a contribution degree of the content by the contribution measurement module.

일 실시예에서, 상기 금지 컨텐츠 식별 모듈은 상기 컨텐츠에 포함된 이미지를 CNN(Convolutional Neural Network) 기반의 NSFW(Not Safe For Work) 디텍터에 입력하여 출력된 데이터에 의해, 상기 컨텐츠가 상기 게시판에서 금지되는지 여부가 결정되는 것일 수 있다. 여기서, 상기 출력된 데이터는 상기 이미지가 금지 항목을 포함할 확률을 포함하고, 상기 컨텐츠가 상기 게시판에서 금지되는지 여부를 결정하는 단계는 상기 확률이 기준치 이상인 경우, 상기 컨텐츠가 상기 게시판에서 금지되는 것으로 결정하는 단계를 포함할 수 있다. 이때, 상기 기준치는 상기 불량 유저 식별 모듈에 의해 불량 유저로 결정된 유저에 의해 작성된 컨텐츠의 경우, 정상 유저에 의해 작성된 컨텐츠보다 낮게 형성되는 것일 수 있다. In one embodiment, the forbidden content identification module inputs the image included in the content to a Convolutional Neural Network (CNN)-based Not Safe For Work (NSFW) detector, and the content is prohibited in the bulletin board by the output data. It may be determined whether or not. Here, the output data includes a probability that the image contains a prohibited item, and the step of determining whether the content is prohibited on the bulletin board is that when the probability is greater than or equal to a reference value, the content is prohibited on the bulletin board. It may include the step of determining. In this case, in the case of content created by a user who is determined to be a bad user by the bad user identification module, the reference value may be formed to be lower than the content written by a normal user.

일 실시예에서, 상기 컨텐츠는 텍스트 및 이미지를 포함하고, 상기 금지 컨텐츠 식별 모듈은 상기 컨텐츠에 포함된 텍스트에 의해 상기 컨텐츠가 상기 게시판에서 금지되는지 여부가 결정되는 것인 제1 모듈 및 상기 컨텐츠에 포함된 이미지에 의해 상기 컨텐츠가 상기 게시판에서 금지되는지 여부가 결정되는 것인 제2 모듈을 포함할 수 있다. 여기서, 상기 금지 컨텐츠 식별 모듈은 상기 제1 모듈에 의해 상기 컨텐츠가 상기 게시판에서 금지되는지 여부가 결정된 이후에, 상기 제2 모듈에 의해 상기 컨텐츠가 상기 게시판에서 금지되는지 여부가 결정되는 것일 수 있다.In an embodiment, the content includes text and an image, and the prohibited content identification module determines whether or not the content is prohibited on the bulletin board by text included in the content. It may include a second module that determines whether the content is prohibited on the bulletin board by the included image. Here, the prohibited content identification module may determine whether the content is prohibited on the bulletin board by the second module after determining whether the content is prohibited on the bulletin board by the first module.

일 실시예에서, 상기 불량 유저 식별 모듈은 상기 유저 로그 데이터에서 추출된 복수의 특징들을 얻어, 상기 복수의 특징들로 구성된 학습 데이터셋을 인공 신경망에 입력하여 상기 인공 신경망을 학습시켜 생성된 것일 수 있다. 여기서, 상기 복수의 특징은 상기 유저의 가입일, 접속일수, 글수, 댓글수, 컨텐츠 조회수, 부여한 좋아요수, 부여한 싫어요수, 부여한 신고수, 받은 좋아요수, 받은 싫어요수, 받은 신고수, 받은 신고 종류, 삭제 글수, 삭제 댓글수, IP, 총 체류시간, 평균 체류시간, 총 클릭수 및 디바이스 종류를 포함할 수 있다. 또한, 상기 인공 신경망은 앙상블 학습에 의한 것일 수 있다.In one embodiment, the bad user identification module may be generated by learning the artificial neural network by obtaining a plurality of features extracted from the user log data and inputting a training dataset composed of the plurality of features into an artificial neural network. have. Here, the plurality of features are the user's subscription, number of days of connection, number of posts, number of comments, number of content views, number of likes, number of likes, number of dislikes, number of reports, number of likes, number of likes received, number of reports received, type of report received. , Number of deleted posts, number of deleted comments, IP, total stay time, average stay time, total click count and device type. In addition, the artificial neural network may be based on ensemble learning.

일 실시예에서, 상기 유저 로그데이터는 상기 유저의 가입일, 접속일수, 글수, 댓글수, 컨텐츠 조회수, 부여한 좋아요수, 부여한 싫어요수, 부여한 신고수, 받은 좋아요수, 받은 싫어요수, 받은 신고수, 받은 신고 종류, 삭제 글수, 삭제 댓글수, IP, 총 체류시간, 평균 체류시간, 총 클릭수 및 디바이스 종류를 포함할 수 있다.In one embodiment, the user log data includes the user's membership date, access days, number of posts, number of comments, number of content views, number of likes, number of dislikes, number of reports given, number of likes received, number of dislikes received, number of reports received, It can include the type of report received, the number of deleted posts, the number of deleted comments, IP, total stay time, average stay time, total click count, and device type.

일 실시예에서, 인공 신경망을 가지는 컴퓨팅 장치에 의해 수행되는 방법에 있어서 컨텐츠 페이지 로그 데이터에서 추출된 복수의 특징들을 얻는 단계 및 상기 복수의 특징들을 상기 인공 신경망에 입력하고, 상기 인공 신경망으로부터 출력된 데이터를 이용하여 컨텐츠의 기여도를 결정하는 단계를 포함하되, 상기 복수의 특징들은 상기 컨텐츠 페이지의 특정 부분을 클릭하는 유저의 클릭 위치, 상기 클릭 위치의 클릭 횟수, 상기 컨텐츠 페이지의 이전 페이지, 상기 컨텐츠 페이지의 이후 페이지, 상기 컨텐츠 페이지의 열람 디바이스 종류, 상기 컨텐츠 페이지의 댓글 작성 유저 정보 및 상기 컨텐츠 페이지의 열람 유저 정보를 포함할 수 있다. 여기서, 상기 인공 신경망은 상기 복수의 특징들로 구성된 학습 데이터셋을 입력하여, 지도 학습(supervised learning)기반의 앙상블 학습에 의한 것일 수 있다. 또한, 상기 복수의 특징들은 상기 컨텐츠에 포함된 텍스트 길이, 상기 컨텐츠에 이미지 포함 여부, 상기 컨텐츠에 포함된 이미지 개수, 상기 컨텐츠에 포함된 이미지 파일타입, 상기 컨텐츠에 포함된 이미지 용량, 상기 컨텐츠에 포함된 이미지 다운 횟수, 상기 컨텐츠의 작성 시간, 상기 컨텐츠 페이지의 조회수, 상기 컨텐츠 페이지의 좋아요수, 상기 컨텐츠 페이지의 싫어요수, 상키 컨텐츠 페이지의 신고수, 상기 컨텐츠 페이지의 신고 타입, 상기 컨텐츠 페이지의 댓글수 및 상기 컨텐츠 페이지의 체류시간을 더 포함할 수도 있다.In one embodiment, in a method performed by a computing device having an artificial neural network, obtaining a plurality of features extracted from content page log data, inputting the plurality of features into the artificial neural network, and outputting from the artificial neural network Determining a content contribution by using data, wherein the plurality of features include a click position of a user who clicks a specific part of the content page, a click count of the click position, a previous page of the content page, and the content A page after the page, a type of a viewing device for the content page, user information for writing comments on the content page, and user information for viewing the content page may be included. Here, the artificial neural network may be performed by ensemble learning based on supervised learning by inputting a learning dataset composed of the plurality of features. In addition, the plurality of features include a text length included in the content, whether an image is included in the content, the number of images included in the content, an image file type included in the content, an image capacity included in the content, and The number of included image downloads, the creation time of the content, the number of views of the content page, the number of likes of the content page, the number of dislikes of the content page, the number of reports of the content page, the report type of the content page, the content page The number of comments and the residence time of the content page may be further included.

본 발명의 일 실시예에 따른 금지 컨텐츠 식별 방법은, 인공 신경망을 가지는 컴퓨팅 장치에 의해 수행되는 방법에 있어서, 컨텐츠에 포함된 텍스트에서 추출된 특징을 얻는 단계 및 상기 특징을 상기 인공 신경망에 입력하고, 상기 인공 신경망으로부터 출력된 데이터를 이용하여 상기 컨텐츠가 게시판에서 금지되는지 여부를 결정하는 단계를 포함하되, 상기 특징을 얻는 단계는 상기 텍스트의 외형 유사도에 의해 한글 자모, 숫자 및 특수 문자가 유사한 벡터로 추출되는 단계를 포함할 수 있다. 여기서, 상기 출력된 데이터는 상기 텍스트가 금지 항목을 포함할 확률을 포함하고, 상기 컨텐츠가 게시판에서 금지되는지 여부를 결정하는 단계는 상기 확률이 기준치 이상인 경우, 상기 컨텐츠가 상기 게시판에서 금지되는 것으로 결정하는 단계를 포함하거나 상기 텍스트가 필수 항목을 포함할 확률을 포함하고, 상기 컨텐츠가 게시판에서 금지되는지 여부를 결정하는 단계는, 상기 확률이 기준치 이하인 경우, 상기 컨텐츠가 상기 게시판에서 금지되는 것으로 결정하는 단계를 포함할 수 있다.In the method of identifying prohibited content according to an embodiment of the present invention, in a method performed by a computing device having an artificial neural network, obtaining a feature extracted from a text included in the content and inputting the feature into the artificial neural network And determining whether the content is prohibited on the bulletin board by using the data output from the artificial neural network, wherein the obtaining of the characteristic includes a vector in which Korean alphabets, numbers, and special characters are similar according to the appearance similarity of the text. It may include the step of extracting. Here, the output data includes a probability that the text contains a prohibited item, and the step of determining whether the content is prohibited on the bulletin board is determined that the content is prohibited on the bulletin board when the probability is greater than or equal to a reference value. Including the step of or including a probability that the text includes a required item, and determining whether the content is prohibited on the bulletin board, when the probability is less than a reference value, determining that the content is prohibited in the bulletin board It may include steps.

일 실시예에서, 상기 금지 항목은 욕설물 그룹, 음란물 그룹, 폭력물 그룹 및 저작권 위반 그룹 중 어느 하나에 속하는 것일 수 있다. 또한, 제1 항목을 금지 항목으로 추가하는 단계를 더 포함하되, 상기 제1 항목은 제1 금지 항목을 기준으로 기준 거리 이내에 위치하는 것이고, 상기 제1 금지 항목을 포함할 확률이 기준치 이상인 복수의 컨텐츠가 상기 제1 항목을 기준 횟수 이상 포함하는 것일 수 있다. In one embodiment, the prohibited item may belong to any one of abusive material group, pornographic material group, violent material group, and copyright violation group. In addition, it further comprises the step of adding the first item as a prohibited item, wherein the first item is located within a reference distance based on the first prohibited item, and the probability of including the first prohibited item is greater than or equal to a reference value. The content may include the first item more than a reference number of times.

일 실시예에서, 상기 필수 항목은 상기 게시판의 카테고리에 의해 결정되는 것일 수 있다.In an embodiment, the required item may be determined by a category of the bulletin board.

도 1은 본 발명의 일 실시예에 따른 기여도 측정 시스템을 설명하기 위한 도면이다.
도 2는 본 발명의 다른 실시예에 따른 기여도 측정 방법을 설명하기 위한 순서도이다.
도 3은 도 2를 참조하여 설명된 방법의 일부 동작을 구체적으로 설명하기 위한 순서도이다.
도 4는 도 3을 참조하여 설명된 방법의 일부 동작을 구체적으로 설명하기 위한 순서도이다.
도 5 내지 도 6은 도 2를 참조하여 설명된 방법의 일부 동작을 구체적으로 설명하기 위한 순서도이다.
도 7은 도 6을 참조하여 설명된 방법의 일부 동작을 구체적으로 설명하기 위한 순서도이다.
도 8은 본 발명의 또 다른 실시예에 따른 기여도 측정 장치를 구현할 수 있는 컴퓨팅 장치를 설명하기 위한 도면이다.1 is a diagram illustrating a system for measuring a contribution degree according to an exemplary embodiment of the present invention.
2 is a flowchart illustrating a method of measuring a contribution degree according to another embodiment of the present invention.
3 is a flow chart for specifically explaining some operations of the method described with reference to FIG. 2.
4 is a flow chart for specifically explaining some operations of the method described with reference to FIG. 3.
5 to 6 are flow charts for specifically explaining some operations of the method described with reference to FIG. 2.
7 is a flow chart for specifically explaining some operations of the method described with reference to FIG. 6.
8 is a diagram for describing a computing device capable of implementing an apparatus for measuring a contribution degree according to another exemplary embodiment of the present invention.

이하, 첨부된 도면을 참조하여 본 발명의 바람직한 실시예들을 상세히 설명한다. 본 발명의 이점 및 특징, 그리고 그것들을 달성하는 방법은 첨부되는 도면과 함께 상세하게 후술되어 있는 실시예들을 참조하면 명확해질 것이다. 그러나 본 발명의 기술적 사상은 이하의 실시예들에 한정되는 것이 아니라 서로 다른 다양한 형태로 구현될 수 있으며, 단지 이하의 실시예들은 본 발명의 기술적 사상을 완전하도록 하고, 본 발명이 속하는 기술분야에서 통상의 지식을 가진 자에게 본 발명의 범주를 완전하게 알려주기 위해 제공되는 것이며, 본 발명의 기술적 사상은 청구항의 범주에 의해 정의될 뿐이다.Hereinafter, exemplary embodiments of the present invention will be described in detail with reference to the accompanying drawings. Advantages and features of the present invention, and a method of achieving them will become apparent with reference to the embodiments described below in detail together with the accompanying drawings. However, the technical idea of the present invention is not limited to the following embodiments, but may be implemented in various different forms, and only the following embodiments complete the technical idea of the present invention, and in the technical field to which the present invention pertains. It is provided to completely inform the scope of the present invention to those of ordinary skill in the art, and the technical idea of the present invention is only defined by the scope of the claims.

각 도면의 구성요소들에 참조부호를 부가함에 있어서, 동일한 구성요소들에 대해서는 비록 다른 도면상에 표시되더라도 가능한 한 동일한 부호를 가지도록 하고 있음에 유의해야 한다. 또한, 본 발명을 설명함에 있어, 관련된 공지 구성 또는 기능에 대한 구체적인 설명이 본 발명의 요지를 흐릴 수 있다고 판단되는 경우에는 그 상세한 설명은 생략한다.In adding reference numerals to elements of each drawing, it should be noted that the same elements are assigned the same numerals as possible, even if they are indicated on different drawings. In addition, in describing the present invention, when it is determined that a detailed description of a related known configuration or function may obscure the subject matter of the present invention, a detailed description thereof will be omitted.

다른 정의가 없다면, 본 명세서에서 사용되는 모든 용어(기술 및 과학적 용어를 포함)는 본 발명이 속하는 기술분야에서 통상의 지식을 가진 자에게 공통적으로 이해될 수 있는 의미로 사용될 수 있다. 또 일반적으로 사용되는 사전에 정의되어 있는 용어들은 명백하게 특별히 정의되어 있지 않는 한 이상적으로 또는 과도하게 해석되지 않는다. 본 명세서에서 사용된 용어는 실시예들을 설명하기 위한 것이며 본 발명을 제한하고자 하는 것은 아니다. 본 명세서에서, 단수형은 문구에서 특별히 언급하지 않는 한 복수형도 포함한다.Unless otherwise defined, all terms (including technical and scientific terms) used in the present specification may be used with meanings that can be commonly understood by those of ordinary skill in the art to which the present invention belongs. In addition, terms defined in a commonly used dictionary are not interpreted ideally or excessively unless explicitly defined specifically. The terms used in the present specification are for describing exemplary embodiments and are not intended to limit the present invention. In this specification, the singular form also includes the plural form unless specifically stated in the phrase.

또한, 본 발명의 구성 요소를 설명하는 데 있어서, 제1, 제2, A, B, (a), (b) 등의 용어를 사용할 수 있다. 이러한 용어는 그 구성 요소를 다른 구성 요소와 구별하기 위한 것일 뿐, 그 용어에 의해 해당 구성 요소의 본질이나 차례 또는 순서 등이 한정되지 않는다. 어떤 구성 요소가 다른 구성요소에 "연결", "결합" 또는 "접속"된다고 기재된 경우, 그 구성 요소는 그 다른 구성요소에 직접적으로 연결되거나 또는 접속될 수 있지만, 각 구성 요소 사이에 또 다른 구성 요소가 "연결", "결합" 또는 "접속"될 수도 있다고 이해되어야 할 것이다.In addition, in describing the constituent elements of the present invention, terms such as first, second, A, B, (a) and (b) may be used. These terms are for distinguishing the constituent element from other constituent elements, and the nature, order, or order of the constituent element is not limited by the term. When a component is described as being "connected", "coupled" or "connected" to another component, the component may be directly connected or connected to that other component, but another component between each component It will be understood that elements may be “connected”, “coupled” or “connected”.

명세서에서 사용되는 "포함한다 (comprises)" 및/또는 "포함하는 (comprising)"은 언급된 구성 요소, 단계, 동작 및/또는 소자는 하나 이상의 다른 구성 요소, 단계, 동작 및/또는 소자의 존재 또는 추가를 배제하지 않는다.As used in the specification, "comprises" and/or "comprising" refers to the presence of one or more other components, steps, actions and/or elements in which the recited component, step, operation and/or element is Or does not preclude additions.

이하, 본 발명의 몇몇 실시예들에 대하여 첨부된 도면에 따라 상세하게 설명한다.Hereinafter, some embodiments of the present invention will be described in detail with reference to the accompanying drawings.

이하, 본 발명의 일 실시예에 따른 대상 객체 탐지 시스템의 구성 및 동작을 도 1을 참조하여 설명한다. 도 1은 본 발명의 일 실시예에 따른 기여도 측정 시스템을 설명하기 위한 도면이다.Hereinafter, a configuration and operation of a target object detection system according to an embodiment of the present invention will be described with reference to FIG. 1. 1 is a diagram illustrating a system for measuring a contribution degree according to an exemplary embodiment of the present invention.

본 실시예에 따른 기여도 측정 시스템은 유저가 작성한 컨텐츠의 기여도를 측정하는 시스템으로 이해될 수 있을 것이다. 여기서, 컨텐츠는 온라인 커뮤니티에 업로드된 것으로 텍스트 또는 이미지를 포함하는 것으로 이해될 수 있다. 여기서, 기여도는 관리자가 컨텐츠를 작성한 유저에게 분배하는 수익의 기준이 될 수 있다.The contribution measurement system according to the present embodiment may be understood as a system for measuring the contribution of content created by a user. Here, the content is uploaded to the online community and may be understood as including text or images. Here, the degree of contribution may be a criterion of the revenue distributed by the administrator to the users who created the content.

도 1에 도시된 기여도 측정 시스템의 각각의 구성 요소들은 기능적으로 구분되는 기능 요소들을 나타낸 것으로서, 어느 하나 이상의 구성 요소가 실제 물리적 환경에서는 서로 통합되어 구현될 수 있다.Each of the constituent elements of the contribution measurement system shown in FIG. 1 represents functional elements that are functionally divided, and any one or more constituent elements may be integrated and implemented with each other in an actual physical environment.

도 1을 참조하면, 기여도 측정 시스템은 데이터 베이스(100), 유저 단말(200), 금지 컨텐츠 식별 장치(300), 불량 유저 식별 장치(400), 기여도 측정 장치(500) 및 관리자 단말(600)을 포함할 수 있다. 이하 각각의 구성 요소에 대하여 구체적으로 설명한다.Referring to FIG. 1, the contribution measurement system includes a database 100, a user terminal 200, a prohibited content identification device 300, a bad user identification device 400, a contribution measurement device 500, and a manager terminal 600. It may include. Hereinafter, each component will be described in detail.

데이터 베이스(100)는 컨텐츠, 컨텐츠 페이지 로그 데이터 및 유저 로그 데이터를 저장할 수 있다. The database 100 may store content, content page log data, and user log data.

데이터 베이스(100)는 컨텐츠를 유저 단말(200)에 전송할 수 있다. 또한, 데이터 베이스(100)는 관리자 단말(600)에 컨텐츠, 컨텐츠 페이지 로그 데이터 및 유저 로그 데이터를 전송할 수 있다. 또한, 데이터 베이스(100)는 금지 컨텐츠 식별 장치(300)에 컨텐츠를 전송할 수 있다. 또한, 데이터 베이스(100)는 불량 유저 식별 장치(400)에 유저 로그 데이터를 전송할 수 있다. 나아가, 데이터 베이스(100)는 컨텐츠 페이지 로그 데이터를 기여도 측정 장치(500)에 전송할 수 있다.The database 100 may transmit content to the user terminal 200. In addition, the database 100 may transmit content, content page log data, and user log data to the manager terminal 600. In addition, the database 100 may transmit content to the prohibited content identification device 300. In addition, the database 100 may transmit user log data to the device 400 for identifying a bad user. Furthermore, the database 100 may transmit content page log data to the apparatus 500 for measuring the degree of contribution.

데이터 베이스(100)는 유저 단말(200)로부터 유저가 작성한 컨텐츠, 유저 로그 데이터 및 컨텐츠 페이지 로그 데이터를 수신할 수 있다.The database 100 may receive content, user log data, and content page log data created by a user from the user terminal 200.

데이터 베이스(100)는 데이터 베이스 서버, 네트워크 파일 스토리지 등과 같은 외부 저장소일 수 있다. 또한, HDD, SSD 등과 같은 내부 저장 매체일 수 있다. 다만, 이에 한정되는 것은 아니고, 클라우드 서비스일 수 있다. 클라우드 서비스는 서버, 스토리지, SW 등 다양한 IT 자원을 사용자에게 제공하는 주문형 아웃소싱 서비스를 말한다. 예를 들어, 스토리지를 제공하는 클라우드 서비스는, 사용자에게 각종 데이터를 네트워크에 연결된 서비스 제공 서버의 스토리지에 저장하고, 필요 시 네트워크를 통해 다운로드할 수 있도록 제공하는 서비스이다. 상기 클라우드 서비스는 사용자에게 가상화 된 인프라 환경을 제공할 수 있는 IaaS(Infrastructure as a Service) 또는 인터넷을 통해 애플리케이션 소프트웨어를 제공하는 SaaS(Software as a Service)가 될 수 있으나, 이에 한정되는 것은 아니다. The database 100 may be an external storage such as a database server or network file storage. In addition, it may be an internal storage medium such as an HDD or SSD. However, it is not limited thereto, and may be a cloud service. Cloud service refers to an on-demand outsourcing service that provides various IT resources such as servers, storage, and software to users. For example, a cloud service that provides storage is a service that provides users with various data to be stored in storage of a service providing server connected to a network and downloaded through a network when necessary. The cloud service may be IaaS (Infrastructure as a Service) that can provide a virtualized infrastructure environment to users or SaaS (Software as a Service) that provides application software through the Internet, but is not limited thereto.

다음으로, 유저 단말(200)은 데이터 베이스(100)로부터 컨텐츠를 수신하여 디스플레이할 수 있다. 보다 구체적으로, 유저 단말(200)은 유저의 입력에 따라, 데이터 베이스(100)에 컨텐츠 열람 요청을 전송한다. 그리고, 유저 단말(200)은 데이터 베이스(100)로부터 컨텐츠를 수신한다. 그리고, 유저 단말(200)은 수신된 컨텐츠를 화면에 출력한다. 유저 단말(200)은 컨텐츠를 출력하기 위하여 웹 브라우저 또는 전용 애플리케이션이 설치되어 있을 수 있다.Next, the user terminal 200 may receive and display content from the database 100. More specifically, the user terminal 200 transmits a content browsing request to the database 100 according to a user's input. Then, the user terminal 200 receives content from the database 100. Then, the user terminal 200 outputs the received content on the screen. The user terminal 200 may have a web browser or a dedicated application installed in order to output content.

본 발명의 일 실시예에서 참조될 수 있는 유저 단말(200)은 네트워크를 통하여 데이터 베이스(100)로부터 수신된 컨텐츠를 출력할 수 있는 장치라면, 어떠한 장치라도 허용될 수 있다. 예를 들어, 본 발명의 일 실시예에서 참조될 수 있는 사용자 장치(200)는 데스크탑(Desktop), 워크스테이션(Workstation), 서버(Server), 랩탑(Laptop), 태블릿(Tablet), 스마트폰(Smart Phone) 또는 패블릿(Phablet) 중 어느 하나가 될 수 있으나, 이에 한정되지 않고, 휴대용 멀티미디어 플레이어(Portable Multimedia Player, PMP), 개인용 휴대 단말기(Personal Digital Assistants, PDA) 또는 전자책 단말기(E-Book Reader) 등과 같은 형태의 장치가 될 수도 있다.The user terminal 200 that can be referred to in an embodiment of the present invention may be any device as long as it is a device capable of outputting the content received from the database 100 through a network. For example, the user device 200 that can be referred to in an embodiment of the present invention is a desktop, a workstation, a server, a laptop, a tablet, a smartphone ( Smart Phone) or a phablet (Phablet), but is not limited thereto, and a portable multimedia player (PMP), personal digital assistants (PDA), or an e-book terminal (E- It can also be a type of device such as Book Reader).

또한, 유저 단말(200)은 유저가 작성한 컨텐츠, 유저 로그 데이터 및 컨텐츠 페이지 로그 데이터를 데이터 베이스(100)에 전송할 수 있다. 본 발명의 일 실시예에서 참조될 수 있는 유저 단말(200)은 유저로부터 작성된 컨텐츠를 입력 받을 수 있는 장치라면, 어떠한 장치라도 허용될 수 있다. In addition, the user terminal 200 may transmit content, user log data, and content page log data created by the user to the database 100. The user terminal 200 that can be referred to in an embodiment of the present invention may be any device as long as it is a device capable of receiving an input of content created by a user.

다음으로, 금지 컨텐츠 식별 장치(300)는 컨텐츠를 분석하여 컨텐츠가 온라인 커뮤니티의 게시판에서 금지되는지 여부를 결정하는 장치일 수 있다. 금지 컨텐츠 식별 장치(300)는 유저가 작성한 컨텐츠를 분석하여, 분석된 결과를 데이터 베이스(100)에 전송할 수 있다. 또한, 금지 컨텐츠 식별 장치(300)는 분석된 결과를 불량 유저 식별 장치(400)에 전송할 수 있다. 또한, 금지 컨텐츠 식별 장치(300)는 분석된 결과를 기여도 측정 장치(500)에 전송할 수 있다. 나아가, 금지 컨텐츠 식별 장치(300)는 분석된 결과를 관리자 단말(600)에 전송할 수도 있다.Next, the prohibited content identification device 300 may be a device that analyzes the content and determines whether the content is prohibited on the bulletin board of the online community. The prohibited content identification device 300 may analyze the content created by the user and transmit the analyzed result to the database 100. In addition, the prohibited content identification device 300 may transmit the analyzed result to the bad user identification device 400. In addition, the prohibited content identification device 300 may transmit the analyzed result to the contribution measurement device 500. Furthermore, the prohibited content identification device 300 may transmit the analyzed result to the manager terminal 600.

일 실시예에서, 금지 컨텐츠 식별 장치(300)는 유저가 작성한 컨텐츠 분석 결과 게시판에서 금지되는 컨텐츠로 결정된 경우, 데이터 베이스(100)에 저장된 컨텐츠를 삭제하는 요청을 전송할 수 있다.In one embodiment, the prohibited content identification device 300 may transmit a request to delete the content stored in the database 100 when it is determined that the content is prohibited in the bulletin board as a result of analyzing the content created by the user.

일 실시예에서, 금지 컨텐츠 식별 장치(300)는 유저가 작성한 컨텐츠 분석 결과 게시판에서 금지되는 컨텐츠로 결정된 경우, 관리자 단말(600)에 업로드된 컨텐츠를 삭제하는 요청을 전송할 수 있다. 이와 같은, 본 발명의 일 실시예에 따른, 금지 컨텐츠 식별 장치(300)에 대해서는 추후 도 2 내지 도 8을 참조하여 구체적으로 설명하기로 한다.In an embodiment, the prohibited content identification device 300 may transmit a request to delete the uploaded content to the manager terminal 600 when it is determined that the content is prohibited in the bulletin board as a result of analyzing the content created by the user. Such a prohibited content identification device 300 according to an embodiment of the present invention will be described in detail later with reference to FIGS. 2 to 8.

다음으로, 불량 유저 식별 장치(400)는 유저 로그 데이터를 분석하여 유저가 불량 유저인지 여부를 결정하는 장치일 수 있다. 불량 유저 식별 장치(400)는 유저 로그 데이터를 분석하여, 분석된 결과를 데이터 베이스(100)에 전송할 수 있다. 또한, 불량 유저 식별 장치(400)는 분석된 결과를 금지 컨텐츠 식별 장치(300)에 전송할 수 있다. 또한, 불량 유저 식별 장치(400)는 분석된 결과를 기여도 측정 장치(500)에 전송할 수 있다. 나아가, 불량 유저 식별 장치(400)는 분석된 결과를 관리자 단말(600)에 전송할 수도 있다.Next, the bad user identification device 400 may be a device that determines whether the user is a bad user by analyzing user log data. The bad user identification device 400 may analyze user log data and transmit the analyzed result to the database 100. In addition, the bad user identification device 400 may transmit the analyzed result to the prohibited content identification device 300. In addition, the bad user identification device 400 may transmit the analyzed result to the contribution measurement device 500. Furthermore, the bad user identification device 400 may transmit the analyzed result to the manager terminal 600.

일 실시예에서, 불량 유저 식별 장치(400)는 유저 로그 데이터 분석 결과 불량 유저로 결정된 경우, 데이터 베이스(100)에 유저를 특정할 수 있는 속성과 매칭하여 불량 유저 데이터를 저장할 수 있다. 이와 같은, 본 발명의 일 실시예에 따른, 불량 유저 식별 장치(400)에 대해서는 추후 도 2 내지 도 8을 참조하여 구체적으로 설명하기로 한다.In an embodiment, when a user log data analysis result determines that a bad user is a bad user, the bad user identification device 400 may store bad user data by matching an attribute capable of specifying the user in the database 100. Such, a bad user identification device 400 according to an embodiment of the present invention will be described in detail later with reference to FIGS. 2 to 8.

다음으로, 기여도 측정 장치(500)는 컨텐츠 페이지 로그 데이터를 분석하여 컨텐츠의 기여도를 결정하는 장치일 수 있다. 기여도 측정 장치(500)는 컨텐츠 페이지 로그 데이터를 분석하여, 분석된 결과를 데이터 베이스(100)에 전송할 수 있다. 또한, 기여도 측정 장치(500)는 분석된 결과를 유저 단말(200)에 전송할 수 있다. 나아가, 기여도 측정 장치(500)는 분석된 결과를 관리자 단말(600)에 전송할 수 있다.Next, the contribution measurement device 500 may be a device that determines a content contribution by analyzing content page log data. The contribution measurement device 500 may analyze content page log data and transmit the analyzed result to the database 100. In addition, the contribution measurement device 500 may transmit the analyzed result to the user terminal 200. Further, the contribution measurement device 500 may transmit the analyzed result to the manager terminal 600.

일 실시예에서, 기여도 측정 장치(500)는 컨텐츠 페이지 로그 데이터 분석 결과 컨텐츠에 결정된 기여도를 컨텐츠와 매칭하여 데이터 베이스(100)에 저장할 수 있다.In an embodiment, the contribution measurement device 500 may match the determined contribution to the content as a result of analyzing the content page log data with the content and store it in the database 100.

일 실시예에서, 기여도 측정 장치(500)는 컨텐츠 페이지 로그 데이터 분석 결과 컨텐츠에 결정된 기여도를 유저 단말(200)에 전송하여, 컨텐츠를 작성한 유저가 결정된 기여도를 확인할 수 있다. 이와 같은, 본 발명의 일 실시예에 따른, 기여도 측정 장치(500)에 대해서는 추후 도 2 내지 도 8을 참조하여 구체적으로 설명하기로 한다.In an embodiment, the contribution measurement device 500 may transmit the determined contribution to the content as a result of analyzing the content page log data to the user terminal 200, and the user who created the content may check the determined contribution. As described above, an apparatus 500 for measuring a contribution degree according to an embodiment of the present invention will be described in detail later with reference to FIGS. 2 to 8.

다음으로, 관리자 단말(600)은 데이터 베이스(100)로부터 컨텐츠, 유저 로그 데이터 및 컨텐츠 페이지 로그 데이터를 수신할 수 있다. 또한, 관리자 단말(600)은 유저 단말(200)로부터 수익 분배를 요청하는 유저의 입력을 수신할 수 있다. 또한, 관리자 단말(600)은 금지 컨텐츠 식별 장치로부터, 유저에 의해 작성된 컨텐츠를 분석한 결과를 수신할 수 있다. 또한, 관리자 단말(600)은 불량 유저 식별 장치(400)로부터 유저 로그 데이터를 분석한 결과를 수신할 수 있다. 나아가, 관리자 단말(600)은 기여도 측정 장치(500)로부터 컨텐츠 페이지 로그 데이터를 분석한 결과를 수신할 수 있다. Next, the manager terminal 600 may receive content, user log data, and content page log data from the database 100. In addition, the manager terminal 600 may receive an input of a user requesting a profit distribution from the user terminal 200. In addition, the manager terminal 600 may receive a result of analyzing the content created by the user from the prohibited content identification device. In addition, the manager terminal 600 may receive a result of analyzing user log data from the bad user identification device 400. Furthermore, the manager terminal 600 may receive a result of analyzing the content page log data from the contribution measurement device 500.

관리자 단말(600)은 본 발명의 일 실시예에 따른 기여도 측정 시스템을 관리할 수 있다. 상기 기여도 측정 시스템을 관리하기위한 적절한 요청을 상기 기여도 측정 시스템 개별 구성 요소에게 전송할 수 있다. The manager terminal 600 may manage the contribution measurement system according to an embodiment of the present invention. An appropriate request for managing the contribution measurement system may be transmitted to individual components of the contribution measurement system.

본 발명의 일 실시예에서 참조될 수 있는 관리자 단말(600)은 앞서 설명된 동작을 수행할 수 있는 장치라면, 어떠한 장치라도 허용될 수 있다.The manager terminal 600 that can be referred to in an embodiment of the present invention may be any device as long as it is a device capable of performing the above-described operation.

이하, 도 2를 참조하여 본 발명의 다른 실시예에 따른 기여도 측정 방법을 설명한다. 본 실시예에 따른 기여도 측정 방법은 컴퓨팅 장치에 의하여 수행될 수 있다. 또한, 본 실시예에 따른 방법은 제1 컴퓨팅 장치와 제2 컴퓨팅 장치에 의하여 나뉘어 수행될 수 있다. 이하, 본 실시예에 따른 방법의 각 동작을 수행함에 있어서, 그 주체의 기재가 생략되면, 그 주체는 상기 컴퓨팅 장치인 것으로 해석될 수 있을 것이다.Hereinafter, a method of measuring a contribution degree according to another embodiment of the present invention will be described with reference to FIG. 2. The method of measuring the degree of contribution according to the present embodiment may be performed by a computing device. Also, the method according to the present embodiment may be performed by being divided by the first computing device and the second computing device. Hereinafter, in performing each operation of the method according to the present embodiment, if the description of the subject is omitted, the subject may be interpreted as being the computing device.

도 2를 참조하면, 금지 컨텐츠 식별 모듈에 의해 컨텐츠가 게시판에서 금지되는지 여부가 결정된다(S100). 여기서, 금지 컨텐츠 식별 모듈은 컨텐츠에서 추출된 데이터를 이용하여 컨텐츠가 게시판에서 금지되는지 여부를 결정하는 모듈이다. 다음으로, 금지 컨텐츠 식별 모듈에 의해 작성된 컨텐츠가 금지되면(S200), 불량 유저 식별 모듈에 의해 컨텐츠 작성 유저가 불량 유저인지 여부가 결정된다(S300). 여기서, 불량 유저 식별 모듈은 컨텐츠 작성 유저 로그 데이터를 이용하여 유저가 불량 유저인지 여부를 결정하는 모듈이다. 다음으로, 금지 컨텐츠 식별 모듈에 의해 작성된 컨텐츠가 허용되면(S200), 기여도 측정 모듈에 의해 컨텐츠 기여도가 결정된다(S500). 여기서, 기여도 측정 모듈은 컨텐츠 페이지 로그 데이터를 이용하여 컨텐츠의 기여도를 측정하는 모듈이다. 다음으로, 불량 유저 식별 모듈에 의해 컨텐츠를 작성한 유저가 불량 유저로 결정되면(S400), 불량 유저로 결정된 유저에게 페널티가 부여된다(S600). 불량 유저 식별 모듈에 의해 컨텐츠를 작성한 유저가 정상 유저로 결정되면(S500) 기여도 측정 모듈에 의해 컨텐츠 기여도가 결정된다.Referring to FIG. 2, it is determined whether or not the content is prohibited on the bulletin board by the prohibited content identification module (S100). Here, the prohibited content identification module is a module that determines whether or not the content is prohibited on the bulletin board by using data extracted from the content. Next, if the content created by the prohibited content identification module is prohibited (S200), it is determined whether the content creation user is a bad user by the bad user identification module (S300). Here, the bad user identification module is a module that determines whether the user is a bad user by using the content creation user log data. Next, if the content created by the prohibited content identification module is allowed (S200), the content contribution is determined by the contribution measurement module (S500). Here, the contribution measurement module is a module that measures the contribution of the content by using log data of the content page. Next, when the user who created the content by the bad user identification module is determined as a bad user (S400), a penalty is given to the user determined as a bad user (S600). When the user who created the content by the bad user identification module is determined as a normal user (S500), the content contribution is determined by the contribution measurement module.

본 실시예에 따르면, 컨텐츠의 기여도 측정을 위해 컨텐츠 내용의 적절성 및 컨텐츠를 작성한 유저의 적절성이 선행적으로 판단될 수 있다. 컨텐츠 내용의 적절성 및 컨텐츠를 작성한 유저의 적절성이 사전에 판단됨으로써, 금지 컨텐츠 또는 불량 유저가 자동적으로 검출될 수 있다.According to the present embodiment, in order to measure the contribution of the content, the appropriateness of the content and the appropriateness of the user who created the content may be determined in advance. As the appropriateness of the content content and the appropriateness of the user who created the content are determined in advance, prohibited content or bad users can be automatically detected.

본 실시예에 따르면, 컨텐츠와 관련된 다양한 속성이 고려되어 컨텐츠의 기여도가 공정하게 측정될 수 있다. 이러한 기여도를 기초로, 온라인 커뮤니티 관리자가 온라인 커뮤니티를 관리하며 발생한 수익을 공정하게 분재할 수 있다. 발생한 수익이 공정하게 분배됨으로써, 온라인 커뮤니티에 다양한 유저가 유입될 수 있으므로, 관리자 및 유저 모두에게 win-win이 될 수 있다.According to the present embodiment, various attributes related to the content are considered, so that the contribution of the content can be measured fairly. Based on this contribution, the online community manager can manage the online community and distribute the profits generated fairly. As the generated profits are distributed fairly, various users can flow into the online community, so it can be a win-win for both administrators and users.

일 실시예에서, 불량 유저로 결정된 경우 페널티가 부여될 수 있다. 여기서, 페널티는 불량 유저에게 적용되는 일종의 벌칙이다. 예를 들어, 온라인 커뮤니티의 유저 경험치를 몰수할 수 있다. 다른 예를 들어, 기간을 정하여 컨텐츠 작성을 금지할 수 있다. 또 다른 예를 들어, 기간을 정하여 온라인 커뮤니티의 입장을 제한할 수도 있다. 다만, 본 예시에 본 발명이 한정되는 것은 아니고, 불량 유저의 온라인 커뮤니티 이용과 관련된 각종 제한 사항이 적용될 수 있다. 따라서, 이러한 제한 사항을 전부 포함한 페널티가 본 발명에 포함될 수 있다. In an embodiment, if the user is determined to be a bad user, a penalty may be given. Here, penalty is a kind of penalty applied to bad users. For example, it is possible to confiscate user experience from online communities. For another example, content creation may be prohibited by setting a period. As another example, you can limit the entry of the online community by setting a period of time. However, the present invention is not limited to this example, and various restrictions related to the use of the online community by bad users may be applied. Accordingly, penalties including all of these limitations may be included in the present invention.

이하, 금지 컨텐츠 식별 모듈에 관한 구체적인 설명을 위해 도 3 내지 도 4를 참조하여 설명하기로 한다. 도 3은 도 2를 참조하여 설명된 방법의 일부 동작을 구체적으로 설명하기 위한 순서도이다.Hereinafter, a detailed description of the prohibited content identification module will be described with reference to FIGS. 3 to 4. 3 is a flow chart for specifically explaining some operations of the method described with reference to FIG. 2.

도 3을 참조하면, 도 2를 참조하여 설명된 단계 S100이 구체화된다.Referring to FIG. 3, step S100 described with reference to FIG. 2 is embodied.

단계 S110에서, 제1 모듈에 의해 컨텐츠에 포함된 텍스트에 의해 컨텐츠가 게시판에서 금지되는지 여부가 결정된다. 단계 S120에서, 제2 모듈에 의해 컨텐츠에 포함된 이미지에 의해 컨텐츠가 게시판에서 금지되는지 여부가 결정된다.In step S110, it is determined whether the content is prohibited on the bulletin board by the text included in the content by the first module. In step S120, it is determined whether or not the content is prohibited on the bulletin board by the image included in the content by the second module.

일 실시예에서, 제1 모듈 및 제2 모듈은 병렬적으로 진행될 수 있다. 제1 모듈과 제2 모듈이 병렬적으로 진행됨으로써, 금지 컨텐츠 식별 시간이 단축될 수 있다.In one embodiment, the first module and the second module may run in parallel. As the first module and the second module proceed in parallel, a time for identifying prohibited content may be shortened.

일 실시예에서, 제1 모듈에 의해 결정된 이후 제2 모듈에 의해 금지 컨텐츠인지 여부가 결정될 수 있다. 텍스트를 기초로 상기 텍스트를 포함한 컨텐츠가 금지 컨텐츠인지 여부가 선행적으로 결정됨으로써, 제2 모듈에 의한 이미지 분석 효율이 높아질 수 있다. 보다 구체적인 예를 들어, 텍스트가 음란물을 포함하는 경우, 제2 모듈에 의한 이미지 분석 시 음란물을 디텍하는 모델이 정해질 수 있다. 다른 예를 들어, 텍스트가 폭력물을 포함하는 경우, 제2 모듈에 의한 이미지 분석시 폭력물을 디텍하는 모델이 정해질 수 있다.In an embodiment, after being determined by the first module, whether or not the content is prohibited may be determined by the second module. By determining whether the content including the text is prohibited content based on the text, the efficiency of image analysis by the second module may be increased. For a more specific example, when the text includes pornographic material, a model for detecting the pornographic material may be determined during image analysis by the second module. For another example, when the text includes a violent material, a model for detecting the violent material may be determined when an image is analyzed by the second module.

도 2 내지 도 3을 참조하여 설명된 금지 컨텐츠 식별 모듈에 관한 보다 구체적인 설명을 위해 도 4를 참조하기로 한다. 도 4는 도 3을 참조하여 설명된 방법의 일부 동작을 구체적으로 설명하기 위한 순서도이다.Referring to FIG. 4 for a more detailed description of the prohibited content identification module described with reference to FIGS. 2 to 3. 4 is a flow chart for specifically explaining some operations of the method described with reference to FIG. 3.

도 4를 참조하면, 도 3을 참조하여 설명된 단계 S110 및 S120이 구체화된다.Referring to FIG. 4, steps S110 and S120 described with reference to FIG. 3 are embodied.

단계 S111에서, 컨텐츠에 포함된 텍스트에서 추출된 특징이 얻어지고, 단계 S112에서, 텍스트에서 추출된 특징이 인공 신경망에 입력되고, 인공 신경망에 의해 출력된 데이터를 이용하여 컨텐츠가 게시판에서 금지되는지 여부가 판단된다. 단계 S121에서, 컨텐츠에 포함된 이미지에서 추출된 특징이 얻어지고, 단계 S122에서, 이미지에서 추출된 특징이 인공 신경망에 입력되고, 인공 신경망에 의해 출력된 데이터를 이용하여 컨텐츠가 게시판에서 금지되는지 여부가 판단된다.In step S111, the feature extracted from the text included in the content is obtained, and in step S112, the feature extracted from the text is input to the artificial neural network, and whether the content is prohibited on the bulletin board using the data output by the artificial neural network. Is judged. In step S121, the feature extracted from the image included in the content is obtained, and in step S122, the feature extracted from the image is input to the artificial neural network, and whether the content is prohibited on the bulletin board using the data output by the artificial neural network. Is judged.

일 실시예에서, 컨텐츠에 포함된 이미지를 CNN(Convolutional Neural Network) 기반의 NSFW(Not Safe For Work) 디텍터에 입력하여 출력된 데이터에 의해 컨텐츠가 게시판에서 금지되는지 여부가 결정될 수 있다. 여기서, 출력된 데이터는 이미지가 금지 항목을 포함할 확률을 포함할 수 있고, 이때 확률이 기준치 이상인 경우, 컨텐츠가 게시판에서 금지되는 것으로 결정될 수 있다. 본 실시예에 따르면, 금지 컨텐츠를 식별하는 구체적인 방법이 제공될 수 있다. In one embodiment, whether or not the content is prohibited on the bulletin board may be determined by data output by inputting an image included in the content to a Convolutional Neural Network (CNN)-based Not Safe For Work (NSFW) detector. Here, the output data may include a probability that the image includes a prohibited item. In this case, if the probability is greater than or equal to the reference value, it may be determined that the content is prohibited on the bulletin board. According to the present embodiment, a specific method of identifying prohibited content may be provided.

구체적인 예를 들어, 텐서 플로우를 이용해 컨텐츠에 포함된 이미지 분석 후 음란물이 학습되어 있는 NSFW 모델에 의해 금지 항목인, 음란물을 포함하고 있는지 여부가 판단될 수 있다. 여기서, NSFW 점수 중 Neutral이 30% 이하일 경우 금지 컨텐츠로 판단될 수 있다. 다만, 본 예시에 본 발명이 한정되는 것은 아니고, 머신 러닝에 의해 생성된 모델을 이용하여 이미지를 분석하고, 분석된 이미지가 금지 항목을 포함하는지 여부를 식별하는 모든 공지된 방법이 본 발명에 포함될 수 있다. 본 예시에서, 금지 항목을 음란물에 한정하고 있으나, 금지 항목은 음란물에 한정되는 것은 아니다. 몇몇 실시예에서 참조될 수 있는 금지 항목은 추후 구체화될 것이다.For a specific example, after analyzing an image included in the content using a tensor flow, it may be determined whether or not pornography, which is a prohibited item, is included by an NSFW model in which pornography is learned. Here, if Neutral among the NSFW scores is 30% or less, it may be determined as prohibited content. However, the present invention is not limited to this example, and all known methods for analyzing an image using a model generated by machine learning and identifying whether the analyzed image contains prohibited items are included in the present invention. I can. In this example, the prohibited item is limited to pornography, but the prohibited item is not limited to pornography. Prohibited items that may be referenced in some embodiments will be specified later.

일 실시예에서, 컨텐츠에 포함된 텍스트에서 추출된 특징을 얻어, 인공 신경망에 입력하고, 인공 신경망으로부터 출력된 데이터를 이용하여 컨텐츠가 게시판에서 금지되는지 여부가 결정될 수 있다. 여기서, 출력된 데이터는 컨텐츠에 포함된 텍스트가 금지 항목을 포함할 확률을 포함하고, 상기 확률이 기준치 이상인 경우, 컨텐츠가 게시판에서 금지되는 것으로 결정될 수 있다. 또한, 출력된 데이터는 컨텐츠에 포함된 텍스트가 필수 항목을 포함할 확률을 포함하고, 상기 확률이 기준치 이하인 경우, 컨텐츠가 게시판에서 금지되는 것으로 결정될 수 있다. In an embodiment, whether or not the content is prohibited on the bulletin board may be determined by obtaining features extracted from text included in the content, inputting it into an artificial neural network, and using data output from the artificial neural network. Here, the output data includes a probability that the text included in the content includes a prohibited item, and when the probability is greater than or equal to a reference value, it may be determined that the content is prohibited on the bulletin board. In addition, the output data includes a probability that the text included in the content includes an essential item, and when the probability is less than or equal to the reference value, it may be determined that the content is prohibited on the bulletin board.

본 실시예에 따르면, 금지 항목을 포함하거나 필수 항목을 포함하지 않는 컨텐츠가 게시판에서 금지되는 것으로 결정될 수 있다. 금지 컨텐츠의 식별에 있어서, 금지 항목 또는 필수 항목의 포함 여부를 기준으로 판단함으로써, 온라인 커뮤니티 게시판에 적합한 컨텐츠가 업로드될 수 있다. 컨텐츠 내용의 적절성이 사전에 판단됨으로써, 공정한 기여도 산정에 도움이 될 수 있다.According to the present embodiment, it may be determined that content including the prohibited item or not including the required item is prohibited on the bulletin board. In the identification of prohibited content, appropriate content may be uploaded to the online community bulletin board by determining whether a prohibited item or a required item is included. As the appropriateness of the content content is determined in advance, it can be helpful in calculating the fair contribution.

일 실시예에서, 금지 항목은 욕설물 그룹, 폭력물 그룹 및 저작권 위반 그룹 중 어느 하나에 속할 수 있다. 예를 들어, 금지 항목이 욕설물 그룹에 포함되는 경우, 욕설물과 관련된 텍스트가 복수의 금지 항목으로 사전에 정의될 수 있다. 다른 예를 들어, 금지 항목이 폭력물 그룹에 포함되는 경우, 폭력물과 관련된 텍스트가 복수의 금지 항목으로 사전에 정의될 수 있다. 다만, 본 발명이 이에 한정되는 것은 아니고, 금지 항목에 의해 텍스트의 적절성을 판단하는 모든 방법이 본 발명에 포함될 수 있다.In one embodiment, the prohibited item may belong to any one of an abusive group, a violent group, and a copyright violation group. For example, when the prohibited item is included in the profanity group, the text related to the profanity may be defined in advance as a plurality of prohibited items. For another example, when the prohibited item is included in the violence group, text related to the violence may be defined in advance as a plurality of prohibited items. However, the present invention is not limited thereto, and all methods of determining the appropriateness of text by prohibited items may be included in the present invention.

일 실시예에서, 필수 항목은 게시판의 카테고리에 의해 결정되는 것일 수 있다. 본 실시예에 따르면, 게시판의 주제와 관련 없는 글은 금지 컨텐츠로 결정될 수 있다. 예를 들어, '자전거'와 관련된 게시판에 '자전거'와 무관한 컨텐츠가 업로드 되는 경우, 금지 컨텐츠로 결정될 수 있다. 여기서, '자전거'와 관련된 텍스트가 복수의 필수 항목으로 사전에 정의될 수 있다.In an embodiment, the required item may be determined by a category of the bulletin board. According to the present embodiment, articles that are not related to the subject of the bulletin board may be determined as prohibited content. For example, when content irrelevant to'bicycle' is uploaded to a bulletin board related to'bicycle', it may be determined as prohibited content. Here, text related to'bicycle' may be defined in advance as a plurality of essential items.

인공 신경망을 이용하여 컨텐츠에 포함된 텍스트에 의해, 컨텐츠가 게시판에서 금지되는지 여부가 결정되는 본 발명의 일 실시예에서 참조될 수 있는 금지 컨텐츠 식별 방법은, 머신 러닝에 의해 생성된 모델을 이용하여 텍스트를 분석하고, 분석된 텍스트가 금지 항목을 포함하는지 여부를 식별하는 모든 공지된 방법이 본 발명에 포함될 수 있다. 상기 인공 신경망은, CNN 기반의 레이어(layer)를 통해 오타 등의 노이즈가 최소화된 후 금지 항목을 포함할 확률이 출력될 수 있다. CNN 기반의 텍스트 분류 방법은, "Character-level Convolutional Networks for Text Classification”, Xiang Zhang 외 2인, 2015, 등 다양한 문헌이 참조될 수 있다. 여기서, 한글 자모 40개, 영어 알파벳 26개, 스페이스를 포함한 특수 문자 34개 및 숫자 10개가 이용되어 한글 텍스트에 적합하도록 양자화(quantization)될 수 있다.A method for identifying prohibited content that can be referred to in an embodiment of the present invention in which it is determined whether or not the content is prohibited on a bulletin board is determined by text included in the content using an artificial neural network, using a model generated by machine learning. All known methods of analyzing text and identifying whether the analyzed text contains prohibited items may be included in the present invention. The artificial neural network may output a probability of including a prohibited item after noise such as a typo is minimized through a CNN-based layer. For the CNN-based text classification method, various documents such as "Character-level Convolutional Networks for Text Classification", Xiang Zhang et al., 2015, etc. may be referenced. Here, 40 Korean alphabets, 26 English alphabets, and spaces Including 34 special characters and 10 numbers can be used to be quantized to suit Korean text.

인공 신경망을 이용하여 컨텐츠에 포함된 텍스트에 의해, 컨텐츠가 게시판에서 금지되는지 여부가 결정되는 본 발명의 일 실시예에서 참조될 수 있는 금지 컨텐츠 식별 방법은, 구체적인 예를 들어, 금지 항목 중 욕설물 그룹 분석의 경우, 상기 금지 항목에 은어, 비속어, 된소리, 거센 소리 등이 포함될 수 있고, 이를 레이블링(labeling)하여 지도 학습(supervised learning) 기반의 앙상블 모델을 통해 학습이 수행될 수 있다.A method for identifying prohibited content that can be referred to in an embodiment of the present invention in which it is determined whether or not the content is prohibited on a bulletin board by text included in the content using an artificial neural network is, for example, a profanity among prohibited items. In the case of group analysis, the prohibited items may include sweet words, profanity, vulgar sounds, harsh sounds, etc., and learning may be performed through an ensemble model based on supervised learning by labeling them.

일 실시예에서, 인공 신경망에 입력될 추출된 특징은, 텍스트의 외형 유사도에 의해 한글 자모, 숫자 및 특수 문자가 유사한 벡터로 추출될 수 있다. 한글의 특성상 구체적인 예를 들어, 알파벳 'O', 숫자 '0' 및 한글 자음 'ㅇ'는 의미가 상이함에도 유사한 외형을 갖는다. 다른 예를 들어, 알파벳 'l', 숫자 '1' 및 한글 모음 'ㅣ'도 의미가 상이함에도 유사한 외형을 갖는다. 예시된 것과 같은 외형 유사도가 기준치 이상인 개별 문자를 유사한 벡터로 추출하여 학습시킴으로써, 한글 자음 또는 모음을 일부 변경하여 금지 항목을 포함하여 작성하는 컨텐츠도 식별될 수 있다. In an embodiment, the extracted features to be input to the artificial neural network may be extracted as a vector in which Korean alphabets, numbers, and special characters are similar according to the appearance similarity of the text. For specific examples of the characteristics of Hangul, the alphabet'O', the number '0', and the Korean consonant'ㅇ' have similar appearances even though their meanings are different. For another example, the alphabet'l', the number '1', and the Korean vowel'ㅣ' have similar appearances even though they have different meanings. Contents created including prohibited items by partially changing Hangul consonants or vowels can be identified by extracting and learning individual characters having an appearance similarity greater than or equal to the reference value as illustrated in the example.

일 실시예에서, 금지 항목의 추가에 온라인 커뮤니티 유저의 신고 데이터가 이용될 수 있다. 신고 데이터를 이용함으로써, 부적절한 텍스트를 금지 항목에 추가할 수 있다. 일 실시예에서, 신고 종류 데이터를 이용해, 금지 항목의 그룹이 정해져 추가될 수도 있다. 예를 들어, 신고 데이터가 기준치 이상인 텍스트가 존재하고, 상기 텍스트의 신고 종류가 음란물인 경우, 상기 텍스트를 음란물 그룹에 추가할 수 있다.In one embodiment, report data of an online community user may be used for the addition of prohibited items. By using the report data, inappropriate text can be added to prohibited items. In one embodiment, using the report type data, a group of prohibited items may be determined and added. For example, if there is a text whose report data is more than a standard value, and the type of report of the text is pornography, the text may be added to the pornography group.

일 실시예에서, 제1 항목이 제1 금지 항목을 기준으로 기준 거리 이내에 위치한 텍스트이고, 제1 금지 항목을 포함할 확률이 기준치 이상인 복수의 컨텐츠가 제1 항목을 기준 횟수 이상 포함하는 경우, 제1 항목을 금지항목에 포함할 수 있다.In one embodiment, when the first item is text located within a reference distance based on the first prohibited item, and a plurality of contents whose probability of including the first prohibited item is greater than or equal to the reference value includes the first item more than the reference number of times, the first item 1 item can be included in prohibited items.

본 실시예에 따르면, 금지 항목에 포함되어 있는 제1 금지 항목과 기준 횟수 이상 함께 기준 거리 이내에서 사용된 제1 항목이 금지 항목으로 포함될 수 있다. 추가되는 제1 항목은 제1 금지 항목과 기준 횟수 이상 사용된 텍스트이므로, 금지 항목에 해당할 확률이 높을 수 있다. 또한, 유저의 신고 데이터가 누적되지 않은 경우에도, 제1 금지 항목 포함 여부에 따라 금지 컨텐츠 식별 모듈을 실행하면서, 자동적으로 금지 항목이 추가될 수 있다.According to the present embodiment, the first prohibited item included in the prohibited item and the first item used within the reference distance together with the reference number or more may be included as the prohibited item. Since the added first item is the first prohibited item and the text used more than the reference number of times, the probability of corresponding to the prohibited item may be high. In addition, even when the user's report data is not accumulated, the prohibited item may be automatically added while executing the prohibited content identification module according to whether the first prohibited item is included.

일 실시예에서, 인공 신경망에 의해 출력된 데이터는 금지 항목을 포함할 확률을 포함하고, 상기 확률이 기준치 이상인 경우, 컨텐츠가 게시판에서 금지되는 것으로 결정될 수 있다. 여기서, 불량 유저에 의해 작성된 컨텐츠의 경우, 정상 유저에 의해 작성된 컨텐츠보다 상기 기준치가 낮게 형성될 수 있다. 본 실시예에 따르면, 불량 유저에 의해 작성된 글의 경우 기준치가 낮게 형성됨으로써, 불량 유저의 글 작성을 제재하는 동시에, 금지 컨텐츠가 포함될 확률이 높은 컨텐츠를 쉽게 금지시킬 수 있다. 본 실시예에서 참조된 불량 유저 식별은 후술될 불량 유저 식별 모듈에 의해서 구체화될 수 있다.In an embodiment, the data output by the artificial neural network includes a probability of including a prohibited item, and when the probability is greater than or equal to a reference value, it may be determined that the content is prohibited on the bulletin board. Here, in the case of content created by a bad user, the reference value may be lower than that of content created by a normal user. According to the present embodiment, since the reference value is formed to be low in the case of writing by a bad user, it is possible to restrict writing by a bad user and to easily prohibit contents with a high probability of including prohibited contents. The bad user identification referenced in this embodiment may be embodied by a bad user identification module to be described later.

이하 도 5를 참조하여, 불량 유저 식별 모듈에 관해 구체적으로 설명하기로 한다. 도 5는 도 2를 참조하여 설명된 방법의 일부 동작을 구체적으로 설명하기 위한 순서도이다.Hereinafter, a defective user identification module will be described in detail with reference to FIG. 5. 5 is a flow chart for specifically explaining some operations of the method described with reference to FIG. 2.

도 5를 참조하면, 단계 S300이 구체화된다. 5, step S300 is embodied.

단계 S310에서, 유저 로그 데이터에서 추출된 특징이 얻어지고, 단계 S320에서, 상기 특징을 인공 신경망에 입력하고, 출력된 데이터를 이용하여 불량 유저인지 여부가 결정된다.In step S310, a feature extracted from the user log data is obtained, and in step S320, the feature is input to an artificial neural network, and whether or not the user is a bad user is determined using the output data.

일 실시예에서, 불량 유저 식별 모듈은 유저 로그 데이터에서 추출된 복수의 특징들을 얻고, 상기 복수의 특징들로 구성된 학습 데이터셋을 인공 신경망에 입력하여 인공 신경망을 학습시켜 생성된 것일 수 있다. 여기서, 인공 신경망은 앙상블 학습에 의한 것일 수 있다. 불량 유저 식별 모듈에서 이용되는 인공 신경망은 불량 유저인지 또는 정상 유저인지를 판단하는 이진 분류 문제를 해결하는 공지된 모든 방법이 본 발명에 이용될 수 있다.In an embodiment, the bad user identification module may be generated by obtaining a plurality of features extracted from user log data, and inputting a training dataset composed of the plurality of features into an artificial neural network to train the artificial neural network. Here, the artificial neural network may be based on ensemble learning. All known methods for solving the binary classification problem of determining whether the artificial neural network used in the bad user identification module is a bad user or a normal user can be used in the present invention.

일 실시예에서, 유저 로그 데이터는, 유저의 가입일, 접속일수, 글수, 댓글수, 컨텐츠 조회수, 부여한 좋아요수, 부여한 싫어요수, 부여한 신고수, 받은 좋아요수, 받은 싫어요수, 받은 신고수, 받은 신고 종류, 삭제 글수, 삭제 댓글수, IP, 총 체류시간, 평균 체류시간, 총 클릭수 및 디바이스 종류를 포함할 수 있다.In one embodiment, the user log data is the user's membership date, number of days accessed, number of posts, number of comments, number of content views, number of likes, number of dislikes, number of reports, number of likes, number of likes received, number of reports received, received Report type, number of deleted posts, number of deleted comments, IP, total stay time, average stay time, total number of clicks and device type can be included.

여기서, 유저의 IP는 유저의 위치 정보를 통해 불량 유저에 해당하는지 여부에 대한 기준으로 삼을 수 있고, 유저의 디바이스 종류에 의해 너무 오래된 기종의 디바이스 등 의심되는 디바이스를 불량 유저를 판단하는 기준으로 삼을 수 있다.Here, the user's IP can be used as a criterion for whether it corresponds to a bad user through the user's location information, and a suspected device such as a device of a type that is too old according to the user's device type is used as a criterion for determining a bad user I can take it.

일 실시예에서, 수집되는 로그 데이터 중 적어도 둘 이상의 데이터가 선택되고, 상기 데이터의 비율에 의해서 결정되는 데이터를 불량 유저 판단하는 기준으로 삼을 수도 있다. 예를 들어, 유저의 글수를 유저의 가입일로 나눈 값을 통해, 가입일 대비 글수가 과도하게 많은 유저를 불량 유저로 판단할 수 있다. 다른 예를 들어, 좋아요수를 컨텐츠 조회수로 나눈 값을 통해, 조회수 대비 좋아요수가 과도하게 많은 유저를 불량 유저로 판단할 수 있다. 다만, 본 예시에 본 발명이 한정되는 것은 아니고, 앞서 수집되는 로그 데이터가 상호 연계되어 불량 유저를 판단하는 기준이 될 수 있는 모든 방법이 본 발명에 포함될 수 있다.In an embodiment, at least two or more of the collected log data may be selected, and data determined by a ratio of the data may be used as a criterion for determining a bad user. For example, through a value obtained by dividing the user's number of posts by the user's subscription, a user having an excessively large number of posts compared to the subscription period may be determined as a bad user. For another example, through a value obtained by dividing the number of likes by the number of content views, a user having an excessively large number of likes relative to the number of views may be determined as a bad user. However, the present invention is not limited to this example, and all methods that can be a criterion for determining a bad user by linking previously collected log data may be included in the present invention.

본 실시예에 따르면, 유저 로그 데이터에서 추출될 수 있는 다양한 유저 속성이 반영되어 유저가 불량 유저인지 여부가 판단될 수 있다.According to the present embodiment, various user attributes that may be extracted from user log data may be reflected to determine whether the user is a bad user.

이하 도 6 내지 도 7을 참조하여, 기여도 측정 모듈에 관해 구체적으로 설명하기로 한다. 도 6은 도 2를 참조하여 설명된 방법의 일부 동작을 구체적으로 설명하기 위한 순서도이다.Hereinafter, with reference to FIGS. 6 to 7, the contribution measurement module will be described in detail. 6 is a flow chart for specifically explaining some operations of the method described with reference to FIG. 2.

도 6을 참조하면, 단계 S400이 구체화된다.Referring to FIG. 6, step S400 is embodied.

단계 S410에서, 컨텐츠 페이지 로그 데이터에서 추출된 특징이 얻어지고, 단계 S420에서, 상기 특징을 인공 신경망에 입력하고, 출력된 데이터를 이용하여 기여도가 결정된다.In step S410, a feature extracted from the content page log data is obtained, and in step S420, the feature is input to an artificial neural network, and a contribution degree is determined using the output data.

일 실시예에서, 컨텐츠 페이지 로그 데이터는, 상기 컨텐츠 페이지의 특정 부분을 클릭하는 유저의 클릭 위치, 상기 클릭 위치의 클릭 횟수, 상기 컨텐츠 페이지의 이전 페이지, 상기 컨텐츠 페이지의 이후 페이지, 상기 컨텐츠 페이지의 열람 디바이스 종류, 상기 컨텐츠 페이지의 댓글 작성 유저 정보 및 상기 컨텐츠 페이지의 열람 유저 정보를 포함할 수 있다.In one embodiment, the content page log data includes a click position of a user who clicks a specific part of the content page, the number of clicks at the click position, a previous page of the content page, a page after the content page, and It may include a browsing device type, comment writing user information of the content page, and browsing user information of the content page.

여기서 컨텐츠 페이지의 특정 부분을 클릭하는 유저의 클릭 위치는, 컨텐츠를 열람하는 유저가 컨텐츠의 특정 부분을 클릭하거나 확대하는 등의 동작을 기여도를 측정하는 하나의 기준으로 삼을 수 있다. 예를 들어, 클릭 위치가 컨텐츠 전체에 걸쳐 고르게 퍼져 있는 컨텐츠의 경우, 기여도가 높게 평가될 수 있는 근거가 될 수 있다. 클릭 횟수도 이와 유사하게 적용될 수 있다.Here, the click position of a user who clicks on a specific part of the content page may use an operation of a user browsing the content to click or enlarge a specific part of the content as a criterion for measuring the degree of contribution. For example, in the case of content in which the click location is evenly spread throughout the content, it may be a basis for a high contribution rating. The number of clicks can be similarly applied.

여기서 컨텐츠 페이지의 이전 또는 이후 페이지는, 컨텐츠에 접속된 경위를 판단하여 기여도를 측정하는 하나의 기준으로 삼을 수 있다. 예를 들어, URL을 통해 바로 접속된 횟수가 많은 컨텐츠는 매크로에 의한 부정 조회수로 평가되어, 기여도가 낮게 평가될 수 있는 근거가 될 수 있다. Here, the page before or after the content page may be used as a criterion for measuring the degree of contribution by determining how the content is accessed. For example, content with a large number of direct access through a URL may be evaluated as an illegal number of views by a macro, which may serve as a basis for a low contribution rate.

여기서 댓글 작성 유저 정보 및 열람 유저 정보는, 댓글을 작성하거나 열람한 유저의 정보를 토대로 컨텐츠의 기여도를 측정하는 하나의 기준으로 삼을 수 있다. 예를 들어, 기여도가 높은 글을 다수 작성한 유저들의 댓글이 많은 경우 또는 정상 유저로 판단된 유저의 댓글이 많은 경우 등 기여도가 높게 평가될 수 있는 근거가 될 수 있다. Here, the comment writing user information and the browsing user information may be used as a criterion for measuring the degree of contribution of the content based on the information of the user who wrote or viewed the comment. For example, if there are many comments from users who have written a large number of posts with high contribution, or if there are many comments from users who are judged as normal users, it may be a basis for the high contribution rating.

일 실시예에서, 컨텐츠 페이지 로그 데이터는, 상기 컨텐츠에 포함된 텍스트 길이, 상기 컨텐츠에 이미지 포함 여부, 상기 컨텐츠에 포함된 이미지 개수, 상기 컨텐츠에 포함된 이미지 파일타입, 상기 컨텐츠에 포함된 이미지 용량, 상기 컨텐츠에 포함된 이미지 다운 횟수, 상기 컨텐츠의 작성 시간, 상기 컨텐츠 페이지의 조회수, 상기 컨텐츠 페이지의 좋아요수, 상기 컨텐츠 페이지의 싫어요수, 상키 컨텐츠 페이지의 신고수, 상기 컨텐츠 페이지의 신고 타입, 상기 컨텐츠 페이지의 댓글수 및 상기 컨텐츠 페이지의 체류시간을 더 포함할 수도 있다. In one embodiment, the content page log data includes: a text length included in the content, whether an image is included in the content, the number of images included in the content, an image file type included in the content, and an image capacity included in the content , The number of image downloads included in the content, the creation time of the content, the number of views of the content page, the number of likes of the content page, the number of dislikes of the content page, the number of reports of the content page, the report type of the content page, It may further include the number of comments on the content page and a residence time of the content page.

일 실시예에서, 수집되는 로그 데이터 중 적어도 둘 이상의 데이터가 선택되고, 상기 데이터의 비율에 의해서 결정되는 데이터를 기여도 측정하는 기준으로 삼을 수도 있다. 예를 들어, 좋아요수를 컨텐츠 조회수로 나눈 값을 통해, 조회수 대비 좋아요수가 과도하게 많은 컨텐츠의 기여도를 낮게 평가할 수 있다. 다만, 본 예시에 본 발명이 한정되는 것은 아니고, 앞서 수집되는 로그 데이터가 상호 연계되어 컨텐츠의 기여도를 판단하는 기준이 될 수 있는 모든 방법이 본 발명에 포함될 수 있다.In an embodiment, at least two or more of the collected log data are selected, and data determined by the ratio of the data may be used as a criterion for measuring the degree of contribution. For example, through a value obtained by dividing the number of likes by the number of content views, it is possible to evaluate the contribution of content that has an excessively large number of likes compared to the number of views. However, the present invention is not limited to this example, and all methods that can be used as a criterion for determining a content contribution level by linking previously collected log data may be included in the present invention.

본 실시예에 따르면, 컨텐츠 페이지 로그 데이터에서 추출될 수 있는 다양한 속성이 반영되어 기여도가 측정될 수 있다.According to the present embodiment, various attributes that can be extracted from the content page log data may be reflected to measure the contribution.

이하 도 7을 참조하여, 보다 구체적으로 설명하기로 한다. 도 7은 도 6을 참조하여 설명된 방법의 일부 동작을 구체적으로 설명하기 위한 순서도이다.Hereinafter, with reference to FIG. 7, it will be described in more detail. 7 is a flow chart for specifically explaining some operations of the method described with reference to FIG. 6.

도 7을 참조하면 단계 S420이 구체화된다.Referring to FIG. 7, step S420 is embodied.

단계 S421에서 앙상블 모델에 포함된 복수의 모델에 의해 기여도가 산출되고, 단계 S422에서 산출된 기여도의 평균값이 얻어지고, 단계 S423에서 검증 결과 결정 계수가 기준 범위에 해당하는 경우, 단계 S424에서 컨텐츠의 기여도가 결정된다. In step S421, the contribution is calculated by a plurality of models included in the ensemble model, the average value of the contribution calculated in step S422 is obtained, and in step S423, if the determination coefficient falls within the reference range, the content is determined in step S424. Contribution is determined.

일 실시예에서, 컨텐츠 페이지 로그 데이터를 이용하여 추출되는 특징이 벡터로 추출될 수 있고, 상기 특징 벡터는 학습 및 검증 데이터로 이용될 수 있다. 학습 및 검증 데이터의 비율은 일반적인 머신 러닝 과정과 같이 8:2 또는 7:3 등으로 선택될 수 있다. 다만, 본 발명이 이에 한정되는 것은 아니다.In an embodiment, a feature extracted using content page log data may be extracted as a vector, and the feature vector may be used as training and verification data. The ratio of training and verification data can be selected as 8:2 or 7:3, as in a general machine learning process. However, the present invention is not limited thereto.

일 실시예에서, 기여도 측정 모델을 학습할 경우 신경망을 구성하는 뉴런은 다음과 같이 표현될 수 있다.In an embodiment, when learning the contribution measurement model, neurons constituting the neural network may be expressed as follows.

,

y는 활성화 함수이며, sigmoid, tanh, ReLU 등 다양한 함수가 적용될 수도 있다. β는 각 특징 별 가중치이고 인공지능 학습 과정을 통해 최적화된다. ε는 학습을 위해 초기화하여 설정되는 바이어스일 수 있다. ε는 노이즈로 예컨대, 데이터 저장 에러, 서버렉 등의 노이즈를 포함할 수 있다. x는 컨텐츠 페이지 로그 데이터에서 추출되는 특징이다. n은 유저 수를 의미한다. 지도 학습 과정 동안에는 각 모델 별 파라미터를 튜닝하고, 엔지니어링 작업을 통해 모델 성능을 검증하고 평가하여 객체 유형 분류 모델을 생성할 수 있다.y is an activation function, and various functions such as sigmoid, tanh, and ReLU may be applied. β is the weight for each feature and is optimized through the artificial intelligence learning process. ε may be a bias that is initialized and set for learning. ε is noise, and may include, for example, noise such as a data storage error and server rack. x is a feature extracted from the log data of the content page. n means the number of users. During the supervised learning process, parameters for each model can be tuned, and model performance can be verified and evaluated through engineering work to generate an object type classification model.

일 실시예에서, 초기 개별 y값 설정을 위해 아래와 같은 수식이 이용될 수 있다. 예를 들어, 유저 1의 y1 초기값은 아래와 같다. 다만, 이에 한정되는 것은 아니다.In one embodiment, the following equation may be used to initially set an individual y value. For example, the initial value of y1 of user 1 is as follows. However, it is not limited thereto.

일 실시예에서, 인공 신경망에 의해 학습된 모델의 기여도가 유의미한 값인지 여부가 검증될 수 있다. 이러한 검증에는 앞서 설명된 검증 데이터가 이용될 수 있다. 검증 데이터를 모델에 입력하여 얻은 기여도와 실제 값의 차이가 결정 계수로 구해질 수 있다. 여기서, 결정 계수가 기준 범위에 해당하는 경우, 유의미하다고 판단되어, 기여도가 결정될 수 있다. 예컨대, 결정 계수의 기준 범위는 0.5 이상 1이하의 범위로 설정될 수 있다. 다만, 이에 한정되는 것은 아니다.In an embodiment, it may be verified whether the contribution of the model learned by the artificial neural network is a significant value. For this verification, the verification data described above may be used. The difference between the contribution obtained by inputting the verification data into the model and the actual value can be obtained as a coefficient of determination. Here, when the determination coefficient falls within the reference range, it is determined to be significant, and the contribution degree may be determined. For example, the reference range of the determination coefficient may be set to a range of 0.5 or more and 1 or less. However, it is not limited thereto.

일 실시예에서, 기여도 측정 성능 개선을 위해 여러 개의 모델을 결합한 앙상블 모델이 이용될 수 있다. 여기서, 지도 학습 기반의 앙상블 모델이 이용되어 기여도 측정을 위한 학습이 진행되거나 기여도가 측정될 수 있다. 여기서, 앙상블 모델은 Deep Learning, AdaBoost, Neural Network 및 Ridge Penalized Regression 모델을 포함할 수 있다. 다만, 이에 한정되는 것은 아니다.In an embodiment, an ensemble model combining several models may be used to improve the performance of measuring contribution. Here, an ensemble model based on supervised learning may be used to perform learning to measure the degree of contribution or measure the degree of contribution. Here, the ensemble model may include Deep Learning, AdaBoost, Neural Network, and Ridge Penalized Regression model. However, it is not limited thereto.

일 실시예에서, 여러 개의 모델이 측정한 기여도의 평균이 얻어져 기여도 측정 성능이 개선될 수도 있다.In an embodiment, the average of the contributions measured by several models may be obtained, so that the performance of the contribution measurement may be improved.

본 발명의 일 실시예에서 참조될 수 있는 복수의 모듈은 인공 신경망에 의해 생성된 모듈일 수 있다. 모듈의 생성을 위한 공지된 머신 러닝 기술이 적용될 수 있다. 즉, 특정 모듈의 생성을 설명하기 위해 구체적으로 예시한 머신 러닝 기술에 본 발명이 한정되는 것은 아니다.A plurality of modules that may be referred to in an embodiment of the present invention may be a module generated by an artificial neural network. Known machine learning techniques for generating modules can be applied. That is, the present invention is not limited to the machine learning technology specifically illustrated to explain the generation of a specific module.

본 발명의 일 실시예에 따른 기여도 측정 방법은, 컨텐츠의 기여도 측정에 있어서 다양한 속성을 고려함으로써, 공정한 기여도가 결정될 수 있다. 결정된 기여도를 통해서, 관리자가 온라인 커뮤니티를 관리하면서 발생한 수익을 유저에게 공정하게 분배할 수 있다. 공정한 수익의 분배를 통해 유저의 유입이 증가되어 유저 및 관리자 모두 win-win 할 수 있다. In the method of measuring contribution according to an embodiment of the present invention, a fair contribution may be determined by considering various attributes in measuring the contribution of content. Through the determined contribution, the manager can fairly distribute the profits generated by managing the online community to users. The inflow of users increases through fair distribution of profits, so both users and managers can win-win.

지금까지 도 1 내지 도 7을 참조하여, 본 발명의 일 실시예에 따른 기여도 측정 방법 및 그 응용분야에 대해서 설명하였다. 이하에서는, 본 발명의 일 실시예에 따른 기여도 측정 장치를 구현할 수 있는 예시적인 컴퓨팅 장치(1500)에 대하여 설명하도록 한다.So far, with reference to FIGS. 1 to 7, a method of measuring a contribution degree according to an embodiment of the present invention and its application field have been described. Hereinafter, an exemplary computing device 1500 capable of implementing an apparatus for measuring a contribution degree according to an embodiment of the present invention will be described.

도 8은 본 발명의 일 실시예에 따른 기여도 측정 장치를 구현할 수 있는 컴퓨팅 장치(1500)를 설명하기 위한 도면이다.8 is a diagram illustrating a computing device 1500 capable of implementing an apparatus for measuring contribution according to an embodiment of the present invention.

도 8에 도시된 바와 같이, 컴퓨팅 장치(1500)는 하나 이상의 프로세서(1510), 버스(1550), 통신 인터페이스(1570), 프로세서(1510)에 의하여 수행되는 컴퓨터 프로그램(1591)을 로드(load)하는 메모리(1530)와, 컴퓨터 프로그램(1591)을 저장하는 스토리지(1590)를 포함할 수 있다. 다만, 도 8에는 본 발명의 실시예와 관련 있는 구성 요소들 만이 도시되어 있다. 따라서, 본 발명이 속한 기술분야의 통상의 기술자라면 도 8에 도시된 구성요소들 외에 다른 범용적인 구성 요소들이 더 포함될 수 있음을 알 수 있다.As shown in FIG. 8, the computing device 1500 loads one or more processors 1510, a bus 1550, a communication interface 1570, and a computer program 1591 executed by the processor 1510. The memory 1530 may include a storage 1590 for storing the computer program 1591. However, only the constituent elements related to the embodiment of the present invention are shown in FIG. 8. Accordingly, those of ordinary skill in the art to which the present invention pertains can recognize that other general-purpose components may be further included in addition to the components illustrated in FIG. 8.

프로세서(1510)는 컴퓨팅 장치(1500)의 각 구성의 전반적인 동작을 제어한다. 프로세서(1510)는 CPU(Central Processing Unit), MPU(Micro Processor Unit), MCU(Micro Controller Unit), GPU(Graphic Processing Unit) 또는 본 발명의 기술 분야에 잘 알려진 임의의 형태의 프로세서를 포함하여 구성될 수 있다. 또한, 프로세서(1510)는 본 발명의 실시예들에 따른 방법을 실행하기 위한 적어도 하나의 애플리케이션 또는 프로그램에 대한 연산을 수행할 수 있다. 컴퓨팅 장치(1500)는 하나 이상의 프로세서를 구비할 수 있다.The processor 1510 controls the overall operation of each component of the computing device 1500. The processor 1510 includes a CPU (Central Processing Unit), MPU (Micro Processor Unit), MCU (Micro Controller Unit), GPU (Graphic Processing Unit), or any type of processor well known in the technical field of the present invention. Can be. Also, the processor 1510 may perform an operation on at least one application or program for executing the method according to the embodiments of the present invention. The computing device 1500 may include one or more processors.

메모리(1530)는 각종 데이터, 명령 및/또는 정보를 저장한다. 메모리(1530)는 본 발명의 실시예들에 따른 방법을 실행하기 위하여 스토리지(1590)로부터 하나 이상의 프로그램(1591)을 로드 할 수 있다. 메모리(1530)는 RAM과 같은 휘발성 메모리로 구현될 수 있을 것이나, 본 발명의 기술적 범위가 이에 한정되는 것은 아니다.The memory 1530 stores various types of data, commands, and/or information. The memory 1530 may load one or more programs 1591 from the storage 1590 to execute the method according to embodiments of the present invention. The memory 1530 may be implemented as a volatile memory such as RAM, but the technical scope of the present invention is not limited thereto.

버스(1550)는 컴퓨팅 장치(1500)의 구성 요소 간 통신 기능을 제공한다. 버스(1550)는 주소 버스(Address Bus), 데이터 버스(Data Bus) 및 제어 버스(Control Bus) 등 다양한 형태의 버스로 구현될 수 있다.The bus 1550 provides communication functions between components of the computing device 1500. The bus 1550 may be implemented as various types of buses such as an address bus, a data bus, and a control bus.

통신 인터페이스(1570)는 컴퓨팅 장치(1500)의 유무선 인터넷 통신을 지원한다. 또한, 통신 인터페이스(1570)는 인터넷 통신 외의 다양한 통신 방식을 지원할 수도 있다. 이를 위해, 통신 인터페이스(1570)는 본 발명의 기술 분야에 잘 알려진 통신 모듈을 포함하여 구성될 수 있다.The communication interface 1570 supports wired/wireless Internet communication of the computing device 1500. In addition, the communication interface 1570 may support various communication methods other than Internet communication. To this end, the communication interface 1570 may be configured to include a communication module well known in the art.

몇몇 실시예들에 따르면, 통신 인터페이스(1570)는 생략될 수도 있다.According to some embodiments, the communication interface 1570 may be omitted.

스토리지(1590)는 상기 하나 이상의 프로그램(1591)과 각종 데이터를 비임시적으로 저장할 수 있다. The storage 1590 may non-temporarily store the one or more programs 1591 and various data.

스토리지(1590)는 ROM(Read Only Memory), EPROM(Erasable Programmable ROM), EEPROM(Electrically Erasable Programmable ROM), 플래시 메모리 등과 같은 비휘발성 메모리, 하드 디스크, 착탈형 디스크, 또는 본 발명이 속하는 기술 분야에서 잘 알려진 임의의 형태의 컴퓨터로 읽을 수 있는 기록 매체를 포함하여 구성될 수 있다.The storage 1590 is a nonvolatile memory such as a ROM (Read Only Memory), EPROM (Erasable Programmable ROM), EEPROM (Electrically Erasable Programmable ROM), flash memory, etc., a hard disk, a removable disk, or well in the technical field to which the present invention pertains. It may be configured to include any known computer-readable recording medium.

컴퓨터 프로그램(1591)은 메모리(1530)에 로드 될 때 프로세서(1510)로 하여금 본 발명의 다양한 실시예에 따른 방법/동작을 수행하도록 하는 하나 이상의 인스트럭션들을 포함할 수 있다. 즉, 프로세서(1510)는 상기 하나 이상의 인스트럭션들을 실행함으로써, 본 발명의 다양한 실시예에 따른 방법/동작들을 수행할 수 있다.The computer program 1591 may include one or more instructions that when loaded into the memory 1530 cause the processor 1510 to perform a method/operation according to various embodiments of the present invention. That is, the processor 1510 may perform methods/operations according to various embodiments of the present disclosure by executing the one or more instructions.

위와 같은 경우, 컴퓨팅 장치(1500)를 통해 본 발명의 일 실시예에 따른 기여도 측정 장치가 구현될 수 있다.In the above case, the apparatus for measuring a contribution degree according to an embodiment of the present invention may be implemented through the computing device 1500.

지금까지 도 1 내지 도 8을 참조하여 본 발명의 다양한 실시예들 및 그 실시예들에 따른 효과들을 언급하였다. 본 발명의 기술적 사상에 따른 효과들은 이상에서 언급한 효과들로 제한되지 않으며, 언급되지 않은 또 다른 효과들은 아래의 기재로부터 통상의 기술자에게 명확하게 이해될 수 있을 것이다.So far, various embodiments of the present invention and effects according to the embodiments have been mentioned with reference to FIGS. 1 to 8. The effects according to the technical idea of the present invention are not limited to the above-mentioned effects, and other effects not mentioned will be clearly understood by those of ordinary skill in the art from the following description.

지금까지 도 1 내지 도 8을 참조하여 설명된 본 발명의 기술적 사상은 컴퓨터가 읽을 수 있는 매체 상에 컴퓨터가 읽을 수 있는 코드로 구현될 수 있다. 상기 컴퓨터로 읽을 수 있는 기록 매체는, 예를 들어 이동형 기록 매체(CD, DVD, 블루레이 디스크, USB 저장 장치, 이동식 하드 디스크)이거나, 고정식 기록 매체(ROM, RAM, 컴퓨터 구비 형 하드 디스크)일 수 있다. 상기 컴퓨터로 읽을 수 있는 기록 매체에 기록된 상기 컴퓨터 프로그램은 인터넷 등의 네트워크를 통하여 다른 컴퓨팅 장치에 전송되어 상기 다른 컴퓨팅 장치에 설치될 수 있고, 이로써 상기 다른 컴퓨팅 장치에서 사용될 수 있다.The technical idea of the present invention described with reference to FIGS. 1 to 8 so far may be implemented as a computer-readable code on a computer-readable medium. The computer-readable recording medium is, for example, a removable recording medium (CD, DVD, Blu-ray disk, USB storage device, removable hard disk) or a fixed recording medium (ROM, RAM, computer-equipped hard disk). I can. The computer program recorded on the computer-readable recording medium may be transmitted to another computing device through a network such as the Internet and installed in the other computing device, thereby being used in the other computing device.

이상에서, 본 발명의 실시예를 구성하는 모든 구성 요소들이 하나로 결합되거나 결합되어 동작하는 것으로 설명되었다고 해서, 본 발명의 기술적 사상이 반드시 이러한 실시예에 한정되는 것은 아니다. 즉, 본 발명의 목적 범위 안에서라면, 그 모든 구성요소들이 하나 이상으로 선택적으로 결합하여 동작할 수도 있다.In the above, even if all the constituent elements constituting an embodiment of the present invention have been described as being combined into one or operating in combination, the technical idea of the present invention is not necessarily limited to this embodiment. That is, as long as it is within the scope of the object of the present invention, one or more of the components may be selectively combined and operated.

도면에서 동작들이 특정한 순서로 도시되어 있지만, 반드시 동작들이 도시된 특정한 순서로 또는 순차적 순서로 실행 되어야만 하거나 또는 모든 도시 된 동작들이 실행 되어야만 원하는 결과를 얻을 수 있는 것으로 이해되어서는 안 된다. 특정 상황에서는, 멀티태스킹 및 병렬 처리가 유리할 수도 있다. 더욱이, 위에 설명한 실시예들에서 다양한 구성들의 분리는 그러한 분리가 반드시 필요한 것으로 이해되어서는 안 되고, 설명된 프로그램 컴포넌트들 및 시스템들은 일반적으로 단일 소프트웨어 제품으로 함께 통합되거나 다수의 소프트웨어 제품으로 패키지 될 수 있음을 이해하여야 한다.Although the operations are illustrated in a specific order in the drawings, it should not be understood that the operations must be executed in the specific order shown or in a sequential order, or all illustrated operations must be executed to obtain a desired result. In certain situations, multitasking and parallel processing may be advantageous. Moreover, the separation of the various components in the above-described embodiments should not be understood as necessarily requiring such separation, and the described program components and systems may generally be integrated together into a single software product or packaged into multiple software products. It should be understood that there is.

이상 첨부된 도면을 참조하여 본 발명의 실시예들을 설명하였지만, 본 발명이 속하는 기술분야에서 통상의 지식을 가진 자는 그 기술적 사상이나 필수적인 특징을 변경하지 않고서 본 발명이 다른 구체적인 형태로도 실시될 수 있다는 것을 이해할 수 있다. 그러므로 이상에서 기술한 실시예들은 모든 면에서 예시적인 것이며 한정적인 것이 아닌 것으로 이해해야만 한다. 본 발명의 보호 범위는 아래의 청구범위에 의하여 해석되어야 하며, 그와 동등한 범위 내에 있는 모든 기술 사상은 본 발명에 의해 정의되는 기술적 사상의 권리범위에 포함되는 것으로 해석되어야 할 것이다.Although the embodiments of the present invention have been described with reference to the accompanying drawings above, those of ordinary skill in the art to which the present invention pertains can also implement the present invention in other specific forms without changing the technical idea or essential features. I can understand that there is. Therefore, it should be understood that the embodiments described above are illustrative in all respects and not limiting. The scope of protection of the present invention should be interpreted by the following claims, and all technical ideas within the scope equivalent thereto should be construed as being included in the scope of the technical ideas defined by the present invention.

Claims

In the method performed by the computing device,
Determining whether the content is prohibited on the bulletin board by the prohibited content identification module, wherein the prohibited content identification module is a module for determining whether the content is prohibited on the bulletin board using data extracted from the content;
When it is determined that the content is prohibited in the bulletin board, it is determined by the bad user identification module whether the user who created the content is a bad user, and when it is determined that the content is allowed in the bulletin board, the contribution measurement module Determining the degree of contribution of the content, wherein the bad user identification module is a module that determines whether the user is a bad user by using the user log data who created the content, and the contribution measurement module uses the log data of the content page A module for measuring the degree of contribution, the step; And
When the user is determined to be a bad user by the bad user identification module, a penalty is given to the user, and when the user is determined to be a normal user by the bad user identification module, the contribution of the content is determined by the contribution measurement module. Comprising the step of determining,
How to measure content contribution.

The method of claim 1,
The prohibited content identification module,
Whether the content is prohibited on the bulletin board is determined by the data output by inputting the image included in the content into a Convolutional Neural Network (CNN)-based NSFW (Not Safe For Work) detector,
How to measure content contribution.

The method of claim 2,
The output data is,
Contains the probability that the image contains prohibited items,
Determining whether the content is prohibited on the bulletin board,
In case the probability is greater than or equal to a reference value, determining that the content is prohibited in the bulletin board,
How to measure content contribution.

The method of claim 3,
The reference value,
In the case of content created by a user determined as a bad user by the bad user identification module, it is formed lower than the content created by a normal user,
How to measure content contribution.

The method of claim 1,
The content is,
Contains text and images,
The prohibited content identification module,
A first module in which it is determined whether the content is prohibited on the bulletin board by text included in the content, and a second module in which it is determined whether or not the content is prohibited in the bulletin board by an image included in the content. Including modules,
How to measure content contribution.

The method of claim 5,
The prohibited content identification module,
After determining whether the content is prohibited on the bulletin board by the first module, it is determined whether or not the content is prohibited on the bulletin board by the second module,
How to measure content contribution.

The method of claim 1,
The bad user identification module,
It is generated by obtaining a plurality of features extracted from the user log data, inputting a training dataset composed of the plurality of features into an artificial neural network, and training the artificial neural network,
How to measure content contribution.

The method of claim 7,
The plurality of features,
Member since, access days, number of posts, number of comments, number of content views, number of likes, number of dislikes, number of reports, number of likes received, number of dislikes received, number of reports received, type of report received, number of deleted posts, number of comments deleted , IP, total dwell time, average dwell time, total clicks and device type,
How to measure content contribution.

The method of claim 7,
The artificial neural network,
By ensemble learning,
How to measure content contribution.

The method of claim 1,
The user log data,
Member since, access days, number of posts, number of comments, number of content views, number of likes, number of dislikes, number of reports, number of likes received, number of dislikes received, number of reports received, type of report received, number of deleted posts, number of comments deleted , IP, total dwell time, average dwell time, total clicks and device type,
How to measure content contribution.

In the method performed by a computing device having an artificial neural network,
Obtaining a plurality of features extracted from the content page log data; And
Including the step of inputting the plurality of features to the artificial neural network, and determining a content contribution degree using data output from the artificial neural network,
The plurality of features,
The click position of a user who clicks a specific part of the content page, the number of clicks of the click position, the previous page of the content page, the page after the content page, the type of viewing device of the content page, the user who writes a comment on the content page Including information and browsing user information of the content page,
How to measure content contribution.

The method of claim 11,
The artificial neural network,
By inputting a learning dataset composed of the plurality of features, supervised learning-based ensemble learning,
How to measure content contribution.

The method of claim 11,
The plurality of features,
The length of the text included in the content, whether the content contains an image, the number of images included in the content, the image file type included in the content, the image size included in the content, the number of images downloaded in the content, the Content creation time, number of views of the content page, number of likes of the content page, number of dislikes of the content page, number of reports of the content page, report type of the content page, number of comments on the content page, and number of comments on the content page Further comprising a residence time,
How to measure content contribution.

In the method performed by a computing device having an artificial neural network,
Obtaining features extracted from text included in the content; And
Inputting the feature into the artificial neural network, and determining whether the content is prohibited on the bulletin board using data output from the artificial neural network,
The step of obtaining the feature,
Including the step of extracting the Korean alphabet, alphabet, number, and special characters as similar vectors according to the appearance similarity of the text,
How to identify prohibited content.

The method of claim 14,
The output data is,
Contains the probability that the text contains a prohibited item,
Determining whether the content is prohibited on the bulletin board,
In case the probability is greater than or equal to a reference value, determining that the content is prohibited in the bulletin board,
How to identify prohibited content.

The method of claim 15,
The prohibited items above are:
Belonging to any one of abusive group, pornographic group, violent group, and copyright violation group,
How to identify prohibited content.

The method of claim 15,
Further comprising the step of adding the first item as a prohibited item,
The first item is located within a reference distance based on the first prohibited item, and a plurality of contents having a probability of including the first prohibited item being equal to or greater than a reference value include the first item at least a reference number of times,
How to identify prohibited content.

The method of claim 14,
The output data is,
Contains the probability that the text contains a required item,
Determining whether the content is prohibited on the bulletin board,
If the probability is less than the reference value, comprising the step of determining that the content is prohibited in the bulletin board,
How to identify prohibited content.

The method of claim 18,
The required items above are:
That is determined by the category of the bulletin board,
How to identify prohibited content.