KR20120106667A

KR20120106667A - Method and apparatus for producing user reputation of online network

Info

Publication number: KR20120106667A
Application number: KR1020120082833A
Authority: KR
Inventors: 신효섭
Original assignee: 건국대학교 산학협력단
Priority date: 2012-07-27
Filing date: 2012-07-27
Publication date: 2012-09-26

Abstract

PURPOSE: A method for calculating a reputation of an online network user and a device thereof are provided to calculate reputations of online network users by considering the number of user-centric feedbacks regardless of the number of postings written by each user. CONSTITUTION: A receiving unit(800) receives feedback information about online network postings. A first information obtaining unit(810) obtains first information for providing a feedback to a first user. An initial reputation calculating unit(820) calculates an initial reputation score of the first user by using the first information. A second information obtaining unit(830) obtains second information for exchanging feedbacks between the first user and other users. A sociality calculating unit(840) calculates a sociality score of the first user by using the second information. [Reference numerals] (800) Receiving unit; (810) First information obtaining unit; (820) Initial reputation calculating unit; (830) Second information obtaining unit; (840) Sociality calculating unit; (850) Pure reputation calculating unit; (860) Reducing unit; (870) Final reputation calculating unit

Description

Method and device for calculating reputation of online network user {Method and Apparatus for Producing User Reputation of Online Network}

본 발명은 온라인 네트워크 사용자의 평판도 산출 방법 및 장치에 관한 것이다. 구체적으로 본 발명은, 온라인 네트워크 사용자가 작성한 온라인 네트워크 포스트에 대한 피드백 정보를 수신하고, 온라인 네트워크 사용자간의 피드백에 기반한 사회성을 고려하여 온라인 네트워크 사용자의 평판도를 산출하는 방법 및 장치에 관한 것이다.The present invention relates to a method and apparatus for calculating reputation of an online network user. Specifically, the present invention relates to a method and apparatus for receiving feedback information on an online network post written by an online network user and calculating a reputation of an online network user in consideration of sociality based on feedback between online network users.

인터넷의 발달로 온라인 커뮤니티(online community), 블로그 공간(blogosphere), 소셜 네트워크(social network) 등 다양한 형태의 온라인 네트워크가 활발히 운영되고 있다. 인터넷 사용자들은 사용자 생성 컨텐츠(User-Generated Contents, UGC) 내지 사용자 제작 컨텐츠(User-Created Contents, UCC) 등 다양한 형태의 포스트(post)를 작성하여 다른 사용자들에게 공개할 수 있다. Due to the development of the Internet, various types of online networks such as an online community, a blogosphere, and a social network are actively operated. Internet users can create various types of posts such as user-generated contents (UGC) to user-created contents (UCC) and publish them to other users.

이러한 다양한 포스트들을 일정한 기준에 따라 랭킹으로 정렬하여 그 중에서 원하는 포스트를 검색하는 기술 역시 이들 포스트 내지 컨텐츠 기술과 함께 발전해 왔다. 이러한 포스트를 랭킹에 기반하여 검색하는 데에는 문서 랭킹 기술이 사용되는데, 종래의 문서 랭킹 기술은 크게 키워드 기반 문서 랭킹 알고리즘과 웹 문서 간 링크 분석에 의한 문서 랭킹 알고리즘으로 나눌 수 있다.The technique of sorting these various posts by ranking according to a predetermined criterion and searching for a desired post among them has also developed along with these posts or content technologies. The document ranking technique is used to search for such posts based on the ranking. The conventional document ranking technique can be broadly divided into a keyword ranking algorithm based on a keyword-based document ranking algorithm and a link analysis between web documents.

키워드 기반 문서 랭킹 알고리즘에 속하는 것 중 전통적인 정보 검색 이론에 기반한 것으로 TF-IDF(Term Frequency - Inverse Document Frequency)가 있는데, 이는 문서 안에서의 단어의 빈도 수(term frequency) 및 단어를 포함하는 문서의 수(document frequency)의 역수 값(inverse document frequency)를 이용한다. One of the keyword-based document ranking algorithms is based on the traditional theory of information retrieval, TF-IDF (Term Frequency-Inverse Document Frequency), which is the number of words in a document and the number of documents that contain words. An inverse document frequency of document frequency is used.

한편, 웹 문서 간 링크 분석에 의한 문서 랭킹 알고리즘에 속하는 것 중 페이지랭크(Page Rank) 알고리즘은 랜덤 워크(random walk) 모델에 기초하여 웹 문서 간의 상호 링크 관계를 분석하여 각 웹 문서의 전역적인 랭크를 산출하는 알고리즘이다. HITS(Hyperlink-Induced Topic Search) 알고리즘은 웹 문서 검색 결과에 대하여 상호 링크 관계를 분석하여 각 문서의 권위성(authority) 및 허브성(hub)에 대한 두 가지 랭크를 산출한다. 페이지랭크 알고리즘 및 HITS 알고리즘과 관련된 종래의 연구를 간략히 소개하면 아래와 같다. On the other hand, among the document ranking algorithms by link analysis between web documents, the Page Rank algorithm analyzes the cross-link relationship between web documents based on a random walk model, thereby ranking the global rank of each web document. It is an algorithm that calculates. The Hyperlink-Induced Topic Search (HITS) algorithm analyzes the link relations of the web document search results and calculates two ranks for the authority and the hub of each document. Briefly, the conventional researches related to the page rank algorithm and the HITS algorithm are as follows.

P.Jurczyk와 E.Agichtein의 논문(P.Jurczyk and E.Agichtein. Discovering authorities in question answer communities by using link analysis. CIKM 2007)에서는 야후(Yahoo!)의 답변란(Answers)에서 권위성 및 허브성을 찾기 위하여 HITS 알고리즘을 사용하였으며 권위있는 사용자는 우수한 질문에 많은 수의 우수한 답변을 하고, 허브 사용자는 많은 수의 우수한 질문을 함을 밝혀내었다. Z.Gyogyi와 G.Koutrika의 논문(Z.Gyogyiand G.Koutrika, J.Pedersen and H.G.-Molina. Questioning Yahoo! Answers. WWW2008Workshop-QAWeb2008, 2008)에서도 야후 답변란에서의 사용자 평판도를 측정하기 위해 HITS 알고리즘을 사용하는 기술에 대해 다루고 있다.P.Jurczyk and E.Agichtein.Discovering authorities in question answer communities by using link analysis.CIKM 2007 finds authority and herb in Yahoo! 'S Answers The HITS algorithm was used to identify the authoritative users who answered a lot of good questions and the hub users who asked a lot of good questions. In Z.Gyogyi and G.Koutrika's papers (Z.Gyogyiand G.Koutrika, J.Pedersen and HG-Molina. Questioning Yahoo! Answers.WWW2008Workshop-QAWeb2008, 2008) The techniques used are discussed.

한편, Ko Fujimura 등의 논문(Ko Fujimura, Takafumi Inoue and Masayuki Sugisaki. The EigenRumor Algorithm for Ranking Blogs. WWW 2005)에서는 페이지랭크 및 HITS 알고리즘을 이용하여 블로거와 블로그 포스트를 동시에 랭크하는 기술에 대해 다루고 있다. A. Kristikopoulos 등의 논문(A. Kristikopoulos, M.Sideri, and I.Varlamis.Blgrank:ranking weblogs based on connectivity and similarity features. Proc. Of AAA-IDEA, 2006)에서는 암시적(implicit) 하이퍼링크를 이용한 블로그 랭킹 기술을 제안하였다. 예를 들어, 동일한 토픽을 가진 두 개의 블로그 포스트는 암시적으로 하이퍼링크로 연결되어 있다고 가정한다. 이러한 암시적 하이퍼링크는 블로그 포스트 간에 희박하게(sparse) 존재하는 링크 관계를 보완할 수 있게 된다. 암시적 하이퍼링크는 동일 토픽 이외에도 동일 소속의 블로거, 커멘터(commenter)에도 적용된다. 또한, J.Zhang 등의 논문(J.Zhang, M.S. Ackerman and L. Adamic. Expertise Networks in Online Communities:Structure and Algorithms.WWW 2007)에서는 온라인 네트워크에서의 사용자의 전문성을 추정하기 위하여 페이지랭크 및 HITS 알고리즘을 사용하였다.Meanwhile, Ko Fujimura et al. (Ko Fujimura, Takafumi Inoue and Masayuki Sugisaki.The EigenRumor Algorithm for Ranking Blogs. A. Kristikopoulos et al. (A. Kristikopoulos, M. Sideri, and I. Varlamis. Blgrank: ranking weblogs based on connectivity and similarity features.Proc. Of AAA-IDEA, 2006) use implicit hyperlinks. We proposed a blog ranking technique. For example, assume that two blog posts with the same topic are implicitly linked by hyperlinks. These implicit hyperlinks can compensate for sparse link relationships between blog posts. Implicit hyperlinks apply to bloggers and commenters on the same topic in addition to the same topic. In addition, J.Zhang et al. (J.Zhang, MS Ackerman and L. Adamic. Expertise Networks in Online Communities: Structure and Algorithms.WWW 2007) also suggest a pagerank and HITS algorithm to estimate user expertise in online networks. Was used.

이러한 기존의 페이지랭크 알고리즘 및 HITS 알고리즘을 이용한 랭킹 알고리즘은 웹 문서 간 링크 분석에 의한 문서 랭킹 알고리즘으로서, 포스트의 랭킹을 산출하는데 주로 이용되어 사용자의 랭킹을 산출하는데 적용될 수도 있다는 점을 간과하고 있는 한계가 있었다.The limitation that the existing page ranking algorithm and the ranking algorithm using the HITS algorithm is a document ranking algorithm by link analysis between web documents, which is mainly used to calculate the ranking of posts and may be applied to calculate the user's ranking. There was.

한편, 온라인 네트워크의 사용자들은 공개된 포스트에 대하여 피드백(feedback)을 남길 수 있다. 포스트에 대한 사용자의 피드백은 댓글(reply), 코멘트(comment), 공감, 추천, 트랙백, 스크랩 및 링크 등 다양한 형태로 이루어질 수 있다. 즉, 온라인 네트워크에서 사용자들은 포스트를 작성하고 다른 사용자의 포스트에 피드백을 주며 상호 작용을 한다. 온라인 네트워크에서 사용자는 비슷한 관심사를 가진 다른 사용자의 포스트에 피드백을 제공할 수도 있다. Meanwhile, users of the online network can leave feedback on published posts. The user's feedback on the post may take the form of comments, comments, empathy, recommendations, trackbacks, scraps and links. In other words, in an online network, users write posts, give feedback to and interact with posts from other users. In an online network, a user may provide feedback to posts of other users with similar interests.

또한, 온라인 네트워크 상의 사용자 사이에 상호 작용을 교환함에 따라서, 혹은 오프라인 상에서의 친분 관계에 의하여, 온라인 네트워크 상의 사용자는 다른 사용자 중 일부를 친구로 등록하는 등 사용자 사이에서 사회적 관계를 형성할 수 있다. 이 경우, 제 1 사용자의 친구로 등록된 제 2 사용자는 제 1 사용자가 포스트를 생성하거나 업데이트하는 경우 온라인 네트워크에 의해서 이를 통지 받고 제 1 사용자의 공간(예를 들어, 블로그 또는 홈페이지)을 방문할 수 있게 된다. In addition, by exchanging interactions between users on the online network, or by intimate relationships on-line, the users on the online network can form social relationships among users, such as registering some of the other users as friends. In this case, the second user who is registered as a friend of the first user is notified by the online network when the first user creates or updates a post, and visits the first user's space (e.g., blog or homepage). It becomes possible.

온라인 네트워크 상의 제 1 사용자의 공간을 방문하는 제 2 사용자는 제 1 사용자와의 사회 관계를 이미 형성한 사용자 및 제 1 사용자와 사회 관계가 전혀 형성되지 않은 사용자를 포함하게 된다. 제 1 사용자와 사회 관계를 이미 형성한 제 2 사용자는 제 1 사용자의 포스트의 콘텐츠의 유익성, 전문성 등을 고려하지 않고 단순히 제 1 사용자와 사회 관계를 형성하고 있음에 의하여 제 1 사용자의 공간을 방문할 수도 있다. 즉, 제 1 사용자의 사회성이 높은 경우(예를 들어, 제 1 사용자의 등록된 친구가 많을 경우) 제 1 사용자는 제 1 사용자의 포스트와 관계없이, 즉, 포스트의 내용이 부실함에도 불구하고, 높은 평판도를 받을 수 있는 문제점이 있을 수 있다.The second user who visits the space of the first user on the online network includes a user who has already formed a social relationship with the first user and a user who has never formed a social relationship with the first user. The second user who has already formed a social relationship with the first user does not consider the benefits, professionalism, etc. of the content of the first user's post, but simply forms a social relationship with the first user. You can also visit. That is, when the sociality of the first user is high (for example, when there are many registered friends of the first user), the first user is independent of the post of the first user, that is, despite the poor contents of the post, There may be a problem that can receive a high reputation.

이와 관련하여, 앞서 설명한 종래의 문서 랭킹 기술인 페이지랭크 알고리즘 및 HITS 알고리즘은 사용자의 사회성이 각 사용자의 평판도에 영향을 미친다는 사실을 간과하고 있는 한계가 있다. 그러므로, 온라인 네트워크 상에서 사용자들이 주고 받는 피드백의 속성을 고려하지 않고 페이지랭크 알고리즘 및 HITS 알고리즘을 사용하여 사용자들간의 랭킹을 구하는 경우 공정한 사용자 랭킹을 산출하지 못하게 되는 문제점이 있다.In this regard, the page rank algorithm and the HITS algorithm, which are the conventional document ranking techniques described above, have a limitation in overlooking the fact that the sociality of the user affects the reputation of each user. Therefore, there is a problem in that a fair user ranking cannot be calculated when a ranking is calculated between users using a page rank algorithm and an HITS algorithm without considering the nature of feedback sent and received on the online network.

상기 문제점을 해결하기 위하여 본 발명의 일 측면은, 각 사용자가 작성한 포스트의 개수에 상관없이 사용자 중심으로 제공된 피드백의 개수를 고려하여 온라인 네트워크 사용자의 평판도를 산출하는 방법 및 장치를 제공하는 것을 목적으로 한다.In order to solve the above problems, an aspect of the present invention is to provide a method and apparatus for calculating the reputation of an online network user in consideration of the number of feedback provided by the user, regardless of the number of posts written by each user. do.

상기 문제점을 해결하기 위하여 본 발명의 다른 측면은, 온라인 네트워크 포스트에 대한 피드백 정보를 수신하고, 온라인 네트워크 사용자간의 피드백에 기반한 사회성을 고려하여 온라인 네트워크 사용자의 평판도를 산출하는 방법 및 장치를 제공하는 것을 목적으로 한다. In order to solve the above problems, another aspect of the present invention is to provide a method and apparatus for receiving feedback information for an online network post and calculating reputation of an online network user in consideration of sociality based on feedback between online network users. The purpose.

또한, 본 발명에서는 온라인 네트워크 사용자의 평판도(reputation)와 사회성(sociability)을 분리하여 각각을 정의하고 효과적인 계산 방법을 제공하는 것을 목적으로 한다. In addition, an object of the present invention is to separate the reputation and sociality of online network users to define each and provide an effective calculation method.

또한, 온라인 네트워크 사용자의 평판도 및 사회성의 분리에 의해 보다 공정하고 효율적인 랭킹 산출 방법을 제공하는 것을 목적으로 한다.In addition, an object of the present invention is to provide a fair and efficient ranking calculation method by separating reputation and sociality of online network users.

상기 목적을 달성하기 위하여 본 발명의 제 1 측면은, 평판도(reputation) 산출 장치에서 구현되는 온라인 네트워크 사용자의 평판도 산출 방법으로서, 상기 평판도 산출 장치는, 수신부에서 복수의 온라인 네트워크 사용자가 작성한 온라인 네트워크 포스트에 대한 피드백 정보를 수신하는 단계, 제 1 정보 획득부에서 사용자 각각이 사용자 중 제 1 사용자에게 피드백을 제공할 확률인 제 1 정보를 획득하는 단계, 평판도 점수 산출부에서 제 1 정보를 이용하여 제 1 사용자의 평판도 점수를 산출하는 단계를 수행하는 온라인 네트워크 사용자의 평판도 산출 방법을 제공한다.In order to achieve the above object, a first aspect of the present invention is a reputation calculation method of an online network user implemented in a reputation calculation apparatus, wherein the reputation calculation apparatus is an online network post created by a plurality of online network users at a reception unit. Receiving the feedback information for the first step, the first information obtaining unit obtains the first information which is the probability that each user to provide feedback to the first user of the user, the reputation score calculator using the first information A method of calculating a reputation of an online network user performing the step of calculating a reputation score of a user is provided.

또한, 상기 제 1 정보를 획득하는 단계는, 사용자 중 제 2 사용자가 제 1 사용자에게 제공한 피드백의 개수를 제 2 사용자가 사용자 전체에게 제공한 피드백의 개수로 나눠서 획득하는 단계를 포함하는 온라인 네트워크 사용자의 평판도 산출 방법을 제공한다.The acquiring of the first information may include acquiring the number of feedbacks provided by the second user to the first user by dividing the number of feedbacks provided by the second user to the entire user. Provides a method for calculating a user's reputation.

또한, 상기 제 1 사용자의 평판도 점수를 산출하는 단계는, 제 1 정보 및 사용자 중 제 2 사용자의 평판도 점수의 곱셈을 수행하는 단계를 포함하는 온라인 네트워크 사용자의 평판도 산출 방법을 제공한다.The calculating of the reputation score of the first user may include performing a multiplication of the reputation information score of the second user among the first information and the user.

또한, 상기 제 1 사용자의 평판도 점수를 산출하는 단계는 수학식 1을 이용하여 산출되는 단계를 포함하는 온라인 네트워크 사용자의 평판도 산출 방법을 제공한다.In addition, the calculating of the reputation score of the first user provides a method for calculating the reputation of an online network user comprising the step of calculating using Equation 1.

[수학식 1][Equation 1]

(단, u 는 제 1 사용자, v 는 제 2 사용자, RR_rw(u) 는 제 1 사용자의 평판도 점수, RR_rw(v)는 제 2 사용자의 평판도 점수, b(u)는 제 1 사용자에게 피드백을 제공한 사용자의 집합, f(v)는 제 2 사용자가 작성한 피드백을 받은 사용자의 집합, F_v _->u는 제 1 사용자에게 제 2 사용자가 제공한 피드백의 개수,

는 제 2 사용자가 작성한 피드백의 전체 개수, d는 감쇄 인자(damping factor))Where u is the first user, v is the second user, RR _rw (u) is the reputation score of the first user, RR _rw (v) is the reputation score of the second user, and b (u) is the first user. The set of users who provided the feedback, f (v) is the set of users who received the feedback written by the second user, F _v- _{> u} is the number of feedbacks provided by the second user to the first user,

Is the total number of feedbacks written by the second user, d is the damping factor

상기 목적을 달성하기 위하여 본 발명의 제 1 측면은, 평판도(reputation) 산출 장치에서 구현되는 온라인 네트워크 사용자의 평판도 산출 방법으로서, 상기 평판도 산출 장치는, 수신부에서 복수의 온라인 네트워크 사용자가 작성한 온라인 네트워크 포스트에 대한 피드백 정보를 수신하는 단계, 제 1 정보 획득부에서 사용자 각각이 사용자 중 제 1 사용자에게 피드백을 제공할 확률인 제 1 정보를 획득하는 단계, 최초 평판도 점수 산출부에서 제 1 정보를 이용하여 제 1 사용자의 최초 평판도 점수를 산출하는 단계, 제 2 정보 획득부에서 제 1 사용자와 사용자 각각이 피드백을 교환할 확률인 제 2 정보를 획득하는 단계, 사회성 점수 산출부에서 제 2 정보를 이용하여 제 1 사용자의 사회성(sociability) 점수를 산출하는 단계, 및 순수 평판도 점수 산출부에서 제 1 사용자의 최초 평판도 점수 및 사회성 점수를 이용하여 제 1 사용자의 순수 평판도 점수를 산출하는 단계를 수행하는 온라인 네트워크 사용자의 평판도 산출 방법을 제공한다.In order to achieve the above object, a first aspect of the present invention is a reputation calculation method of an online network user implemented in a reputation calculation apparatus, wherein the reputation calculation apparatus is an online network post created by a plurality of online network users at a reception unit. Receiving the feedback information for the step, obtaining the first information which is the probability that each user in the first information providing feedback to the first user of the user, the first reputation score calculator using the first information Calculating an initial reputation score of the first user, acquiring second information which is a probability that the first user and each user exchange feedback in the second information obtaining unit, and using the second information in the social score calculating unit Calculating a sociability score of the first user and a first reputation score calculation unit. A method of calculating a reputation of an online network user performing a step of calculating a pure reputation score of a first user using an initial reputation score and a sociality score of a user.

또한, 상기 피드백은 댓글, 코멘트, 공감, 추천, 트랙백, 스크랩 및 링크 중 하나 이상을 포함하는 온라인 네트워크 사용자의 평판도 산출 방법을 제공한다.The feedback also provides a method for calculating the reputation of an online network user including one or more of comments, comments, empathy, recommendations, trackbacks, scraps and links.

또한, 상기 제 1 사용자의 최초 평판도 점수를 산출하는 단계는, 제 1 정보 및 사용자 중 제 2 사용자의 최초 평판도 점수의 곱셈을 수행하는 단계를 포함하는 온라인 네트워크 사용자의 평판도 산출 방법을 제공한다.The calculating of the first reputation score of the first user may include performing a multiplication of the first information and the first reputation score of the second user among the users.

또한, 상기 제 1 사용자의 최초 평판도 점수를 산출하는 단계는 수학식 1을 이용하여 산출되는 단계를 포함하는 온라인 네트워크 사용자의 평판도 산출 방법을 제공한다.In addition, the step of calculating the first reputation score of the first user provides a method for calculating the reputation of an online network user comprising the step of using the equation (1).

[수학식 1][Equation 1]

(단, u 는 제 1 사용자, v 는 제 2 사용자, RR_rw(u) 는 제 1 사용자의 최초 평판도 점수, RR_rw(v)는 제 2 사용자의 최초 평판도 점수, b(u)는 제 1 사용자에게 피드백을 제공한 사용자의 집합, f(v)는 제 2 사용자가 작성한 피드백을 받은 사용자의 집합, F_v _->u는 제 1 사용자에게 제 2 사용자가 제공한 피드백의 개수,

는 제 2 사용자가 작성한 피드백의 전체 개수, d는 감쇄 인자(damping factor))Where u is the first user, v is the second user, RR _rw (u) is the first reputation score of the first user, RR _rw (v) is the first reputation score of the second user, and b (u) is the first The set of users that provided feedback to the user, f (v) is the set of users that received feedback generated by the second user, F _v- _{> u} is the number of feedbacks provided by the second user to the first user,

또한, 상기 제 1 사용자의 최초 평판도 점수를 산출하는 단계는, 사용자 중 제 2 사용자를 허브 노드로, 제 1 사용자를 권위 노드로 정의하여 제 1 사용자의 권위 점수를 산출하는 단계, 및 제 1 사용자의 권위 점수에 따라 제 1 사용자의 최초 평판도 점수를 산출하는 단계를 포함하는 온라인 네트워크 사용자의 평판도 산출 방법을 제공한다.The calculating of an initial reputation score of the first user may include calculating an authority score of the first user by defining a second user as a hub node and a first user as an authority node, and a first user. Comprising a step of calculating the first reputation score of the first user according to the authority score of the online network user provides a method for calculating the reputation.

또한, 상기 제 1 사용자의 권위 점수를 산출하는 단계는, 제 1 사용자의 초기 권위 점수에 따라 제 2 사용자 허브 점수를 산출하는 단계, 및 제 1 정보 및 제 2 사용자 허브 점수의 곱을 산출하여 제 1 사용자의 권위 점수를 산출하는 단계를 포함하는 온라인 네트워크 사용자의 평판도 산출 방법을 제공한다.The calculating of the authority score of the first user may include calculating a second user hub score according to the initial authority score of the first user, and calculating a product of the first information and the second user hub score. It provides a method for calculating the reputation of an online network user comprising the step of calculating the authority score of the user.

또한, 상기 제 2 정보를 획득하는 단계는, 제 1 사용자가 사용자 중 제 2 사용자와 교환한 피드백의 개수를 제 2 사용자가 사용자 전체와 교환한 피드백의 개수로 나눠서 획득하는 단계를 포함하는 온라인 네트워크 사용자의 평판도 산출 방법을 제공한다.The acquiring of the second information may include obtaining the second information by dividing the number of feedbacks exchanged with the second user among the users by the number of feedbacks exchanged with the entire user by the second user. Provides a method for calculating a user's reputation.

또한, 상기 제 1 사용자가 상기 제 2 사용자와 교환한 피드백의 개수는, 제 1 사용자가 제 2 사용자에게 제공한 피드백의 개수를 이용하는 제 1 피드백 개수 및 제 2 사용자가 제 1 사용자에게 제공한 피드백의 개수를 이용하는 제 2 피드백 개수를 이용하여 계산되는 온라인 네트워크 사용자의 평판도 산출 방법을 제공한다.The number of feedbacks exchanged with the second user by the first user may include a number of first feedbacks using the number of feedbacks provided by the first user to the second user and feedbacks provided by the second user to the first user. It provides a method for calculating the reputation of an online network user calculated by using the second feedback number using the number of.

또한, 상기 제 1 사용자가 상기 제 2 사용자와 교환한 피드백의 개수는, 제 1 피드백 개수 및 제 2 피드백 개수 중 작은 값, 제 2 피드백 개수 및 제 2 피드백 개수의 산술 평균, 기하 평균 및 조화 평균 중 하나로 계산되는 온라인 네트워크 사용자의 평판도 산출 방법을 제공한다.The number of feedbacks exchanged with the second user by the first user may include arithmetic mean, geometric mean, and harmonic mean of the smaller one of the first feedback number and the second feedback number, the second feedback number, and the second feedback number. It provides a method for calculating the reputation of an online network user calculated by one of the methods.

또한, 상기 제 1 피드백 개수 및 상기 제 2 피드백 개수는 각각 상기 제 1 사용자가 작성한 포스트의 수 및 제 2 사용자가 작성한 포스트의 수를 고려하여 산출되는 온라인 네트워크 사용자의 평판도 산출 방법을 제공한다.The first feedback count and the second feedback count are provided in consideration of the number of posts written by the first user and the number of posts written by the second user, respectively.

또한, 상기 제 1 피드백 개수는 제 1 사용자가 제 2 사용자에게 제공한 피드백의 개수와 포스트당 제 1 사용자가 제 2 사용자에게 제공한 피드백의 평균 개수를 곱하여 산출되고, 상기 제 2 피드백 개수는 제 2 사용자가 제 1 사용자에게 제공한 피드백의 개수와 포스트당 제 2 사용자가 제 1 사용자에게 제공한 피드백의 평균 개수를 곱하여 산출되는 온라인 네트워크 사용자의 평판도 산출 방법을 제공한다.In addition, the first feedback number is calculated by multiplying the number of feedbacks provided by the first user to the second user and the average number of feedbacks provided by the first user to the second user per post, and the second feedback number 2 provides a method for calculating an online network user's reputation calculated by multiplying the number of feedbacks provided by a user to a first user and an average number of feedbacks provided to a first user by a second user per post.

또한, 상기 제 1 피드백 개수는 제 1 사용자가 제 2 사용자에게 제공한 피드백의 개수와 포스트당 제 1 사용자가 제 2 사용자에게 제공한 피드백의 평균 개수의 로그 값을 곱하여 산출되고, 상기 제 2 피드백 개수는 제 2 사용자가 제 1 사용자에게 제공한 피드백의 개수와 포스트당 제 2 사용자가 제 1 사용자에게 제공한 피드백의 평균 개수의 로그 값을 곱하여 산출되는 온라인 네트워크 사용자의 평판도 산출 방법을 제공한다.The first feedback number is calculated by multiplying a log value of the number of feedbacks provided by the first user to the second user and the average number of feedbacks provided by the first user to the second user per post, and the second feedback. The number provides a method for calculating an online network user's reputation calculated by multiplying a log value of the number of feedbacks provided to the first user by the second user and the average number of feedbacks provided to the first user by the second user per post.

또한, 상기 제 1 사용자의 사회성 점수를 산출하는 단계는, 제 2 정보 및 사용자 중 제 2 사용자의 사회성 점수의 곱셈을 포함하는 단계를 포함하는 온라인 네트워크 사용자의 평판도 산출 방법을 제공한다.The calculating of the social score of the first user may include providing a multiplicity of the second information and the social score of the second user among the users.

또한, 상기 제 1 사용자의 사회성 점수를 산출하는 단계는 수학식 2를 이용하여 산출되는 단계를 포함하는 온라인 네트워크 사용자의 평판도 산출 방법을 제공한다.In addition, the calculating of the social score of the first user provides a method for calculating the reputation of an online network user comprising calculating using Equation 2.

[수학식 2]&Quot; (2) "

(단, u 는 제 1 사용자, v 는 제 2 사용자, SR(u) 는 제 1 사용자의 사회성 점수, SR(v)는 제 2 사용자의 사회성 점수, bd(u)는 제 1 사용자와 피드백을 교환한 사용자의 집합, bd(v)는 제 2 사용자와 피드백을 교환한 사용자의 집합, F^(s) _v->u는 제 1 사용자와 제 2 사용자가 교환한 피드백의 개수,

는 제 2 사용자가 교환한 피드백의 전체 개수, d는 감쇄 인자(damping factor))(Where u is the first user, v is the second user, SR (u) is the social score of the first user, SR (v) is the social score of the second user, and bd (u) provides feedback with the first user) The set of users exchanged, bd (v) is the set of users who exchanged feedback with the second user, F ^(s) _{v-> u} is the number of feedbacks exchanged between the first user and the second user,

Is the total number of feedbacks exchanged by the second user, d is the damping factor

또한, 상기 제 1 사용자의 순수 평판도 점수를 산출하는 단계는, 제 1 사용자의 최초 평판도 점수에서 제 1 사용자의 사회성 점수를 감산(subtract)하는 단계를 포함하는온라인 네트워크 사용자의 평판도 산출 방법을 제공한다.The calculating of the net reputation score of the first user may include subtracting the social score of the first user from the initial reputation score of the first user. .

또한, 상기 제 1 사용자의 순수 평판도 점수를 산출하는 단계는, 제 1 사용자가 사용자 중 제 2 사용자에게 제공한 피드백의 개수에서 제 1 사용자가 제 2 사용자와 교환한 피드백의 개수를 감산한 값 및 소정의 임계값 중 큰 값을 산출하고, 산출된 값을 제 1 사용자로부터 제 2 사용자로의 순수 평판도 강도(strength)로 정의하는 단계를 포함하는 온라인 네트워크 사용자의 평판도 산출 방법을 제공한다.The calculating of the pure reputation score of the first user may include: subtracting the number of feedbacks exchanged with the second user by the first user from the number of feedbacks provided by the first user to the second user among the users; A method of calculating a reputation of an online network user, comprising: calculating a large value among predetermined thresholds and defining the calculated value as a pure reputation strength from a first user to a second user.

또한, 상기 제 1 사용자의 순수 평판도 점수를 산출하는 단계는 수학식 3을 이용하여 산출되는 단계를 포함하는 온라인 네트워크 사용자의 평판도 산출 방법을 제공한다.In addition, the calculating of the pure reputation score of the first user provides a method for calculating the reputation of an online network user comprising the step of using the equation (3).

[수학식 3]&Quot; (3) "

(단, u 는 제 1 사용자, v 는 사용자 중 제 2 사용자, PRrw(u) 는 제 1 사용자의 순수 평판도 점수, PRrw(v)는 제 2 사용자의 순수 평판도 점수, b(u)는 제 1 사용자에게 피드백을 제공한 사용자의 집합, f(v)는 제 2 사용자가 피드백을 제공한 사용자의 집합,

는 제 1 사용자로부터 제 2 사용자로의 순수 평판도 강도,

는 제 2 사용자의 순수 평판도 강도의 합, d는 감쇄 인자(damping factor))(Where u is the first user, v is the second user among users, PRrw (u) is the pure reputation score of the first user, PRrw (v) is the pure reputation score of the second user, and b (u) is the first user) The set of users that provided feedback to the user, f (v) is the set of users that provided feedback by the second user,

Is the pure reputation strength from the first user to the second user,

Is the sum of the pure reputation strengths of the second user, d is the damping factor

또한, 상기 평판도 산출 장치는, 최종 평판도 점수 산출부에서 제 1 사용자의 순수 평판도 점수 및 제 1 사용자의 사회성 점수를 이용하여 제 1 사용자의 최종 평판도 점수를 산출하는 단계를 더 수행하는 온라인 네트워크 사용자의 평판도 산출 방법을 제공한다.In addition, the reputation calculator calculates the final reputation score of the first user by using the net reputation score of the first user and the social score of the first user in the final reputation score calculation unit of the online network user Provide a method for calculating reputation.

또한, 상기 최종 평판도 점수를 산출하는 단계는 수학식 4를 이용하여 산출되는 단계를 포함하는 온라인 네트워크 사용자의 평판도 산출 방법을 제공한다.In addition, calculating the final reputation score provides an online network user reputation calculation method comprising the step of calculating using the equation (4).

[수학식 4]&Quot; (4) "

(단, R_comb(u) 는 사용자 u 의 최종 평판도 점수, PR(u)는 상기 사용자 u의 순수 평판도 점수, SR(u)는 상기 사용자 u의 사회성 점수, a 는 01 사이의 상수)(Where R _comb (u) is the final reputation score of the user u, PR (u) is the pure reputation score of the user u, SR (u) is the sociality score of the user u, a is a constant between 01)

상기 목적을 달성하기 위하여 본 발명의 제 3 측면은, 복수의 온라인 네트워크 사용자가 작성한 온라인 네트워크 포스트에 대한 피드백 정보를 수신하는 수신부, 사용자 각각이 사용자 중 제 1 사용자에게 피드백을 제공할 확률인 제 1 정보를 획득하는 제 1 정보 획득부, 제 1 정보를 이용하여 제 1 사용자의 평판도 점수를 산출하는 평판도 점수 산출부를 포함하는 온라인 네트워크 사용자의 평판도 산출 장치를 제공한다.In order to achieve the above object, a third aspect of the present invention provides a receiver for receiving feedback information on an online network post written by a plurality of online network users, and a first probability in which each user provides feedback to a first user among the users. A reputation information calculating device of an online network user includes a first information obtaining unit obtaining information and a reputation score calculating unit calculating a reputation score of a first user using the first information.

상기 목적을 달성하기 위하여 본 발명의 제 4 측면은, 복수의 온라인 네트워크 사용자가 작성한 온라인 네트워크 포스트에 대한 피드백 정보를 수신하는 수신부, 사용자 각각이 사용자 중 제 1 사용자에게 피드백을 제공할 확률인 제 1 정보를 획득하는 제 1 정보 획득부, 제 1 정보를 이용하여 제 1 사용자의 최초 평판도 점수를 산출하는 최초 평판도 점수 산출부, 제 1 사용자와 사용자 각각이 피드백을 교환할 확률인 제 2 정보를 획득하는 제 2 정보 획득부, 제 2 정보를 이용하여 제 1 사용자의 사회성 점수를 산출하는 사회성 점수 산출부, 및 제 1 사용자의 최초 평판도 점수 및 사회성 점수를 이용하여 제 1 사용자의 순수 평판도 점수를 산출하는 순수 평판도 점수 산출부를 포함하는 온라인 네트워크 사용자의 평판도 산출 장치를 제공한다.In order to achieve the above object, a fourth aspect of the present invention provides a receiver for receiving feedback information on an online network post written by a plurality of online network users, and a first probability in which each user provides feedback to a first user among the users. A first information acquisition unit for acquiring information, an initial reputation score calculator for calculating an initial reputation score of the first user by using the first information, and second information which is a probability that each of the first user and the user exchanges feedback A second information obtaining unit configured to calculate a social score of the first user using the second information, a social score score unit, and a first reputation using the initial reputation score and the social score of the first user to calculate the pure reputation score of the first user An on-line network user's reputation calculation apparatus including a pure reputation score calculation unit is provided.

또한, 상기 피드백은 댓글, 코멘트, 공감, 추천, 트랙백, 스크랩 및 링크 중 하나 이상을 포함하는 온라인 네트워크 사용자의 평판도 산출 장치를 제공한다.In addition, the feedback provides an apparatus for calculating a reputation of an online network user including at least one of a comment, a comment, a sympathy, a recommendation, a trackback, a scrap, and a link.

또한, 상기 순수 평판도 점수 산출부는, 제 1 사용자의 최초 평판도 점수에서 제 1 사용자의 사회성 점수를 감산하는 감산부를 포함하는 온라인 네트워크 사용자의 평판도 산출 장치를 제공한다.The pure reputation score calculator may further include a reputation calculator for an online network user including a subtractor configured to subtract the social score of the first user from the first reputation score of the first user.

또한, 상기 제 1 사용자의 순수 평판도 점수 및 상기 제 1 사용자의 사회성 점수를 이용하여 제 1 사용자의 최종 평판도 점수를 산출하는 최종 평판도 점수 산출부를 더 포함하는 온라인 네트워크 사용자의 평판도 산출 장치를 제공한다.The present invention also provides an apparatus for calculating a reputation of an online network user further comprising a final reputation score calculator configured to calculate a final reputation score of the first user by using the pure reputation score of the first user and the sociality score of the first user.

본 발명의 일 측면에 따르면, 각 사용자가 작성한 포스트의 개수에 상관없이 사용자 중심으로 제공된 피드백의 개수를 고려하여 온라인 네트워크 사용자의 평판도를 산출할 수 있는 효과가 있다.According to an aspect of the present invention, regardless of the number of posts written by each user, the reputation of an online network user may be calculated in consideration of the number of feedbacks provided by the user.

본 발명의 일 측면에 따르면, 온라인 네트워크에서 어떤 사용자에 대한 다른 사용자들의 피드백을 고려함으로써, 사용자간의 사회성을 고려하여 사용자의 평판도를 제공할 수 있는 효과가 있다.According to an aspect of the present invention, by considering the feedback of other users to a user in the online network, there is an effect that can provide a reputation of the user in consideration of sociality between users.

또한, 온라인 네트워크 사용자의 평판도 및 사회성의 분리에 의해 보다 공정하고 효율적인 랭킹 산출 방법을 제공할 수 있는 효과가 있다.In addition, it is possible to provide a fair and efficient ranking calculation method by separation of reputation and sociality of online network users.

도 1 은 사용자의 포스트에 대한 피드백 활동을 도시한 그래프 모델이다.
도 2 는 본 발명에 따라 도 1 의 피드백 활동을 사용자 중심으로 재구성한 그래프 모델이다.
도 3 은 본 발명에 따라 사용자간의 상호작용 양을 도시한 그래프 모델이다.
도 4 는 본 발명에 따라 온라인 네트워크 사용자 사이의 관계를 사회성 관계 및 평판도 관계로 분류한 그래프 모델이다.
도 5 는 본 발명에 따라 사용자간의 사회성 강도를 도시한 그래프 모델이다.
도 6 은 본 발명에 따라 사용자간의 순수 평판도 강도를 도시한 그래프 모델이다.
도 7 은 본 발명에 따라 평판도 산출 장치에서 구현되는 온라인 네트워크 사용자의 평판도 산출 방법의 개략적인 흐름을 도시한 흐름도이다.
도 8 은 본 발명에 따라 네트워크 사용자의 평판도 산출 장치의 개략적인 구성을 도시한 블록도이다.
도 9 는 본 발명에 따라 SLRCLUB 사용자 점수 분포를 나타낸 그래프이다.
도 10 은 본 발명에 따라 SLRCLUB 사용자의 랭크간 관계를 나타낸 그래프이다.
도 11 은 본 발명에 따라 SLRCLUB 사용자 중 높은 랭크를 갖는 사용자를 나타낸 그래프이다.
도 12 는 본 발명에 따라 SLRCLUB 사용자의 랭크 상관 관계를 나타낸 그래프이다.
도 13 은 본 발명에 따라 본 발명에 따라 NAVER 사용자의 랭크간 관계를 나타낸 그래프이다.
도 14 는 본 발명에 따라 NAVER 사용자의 랭크 상관 관계를 나타낸 그래프이다.
도 15 는 본 발명에 따라 SLRCLUB과 NAVER 사용자의 클러스터(cluster) 분포를 나타낸 그래프이다.
도 16 은 본 발명에 따라 NAVER 레스토랑 카테고리 및 여행 카테고리 내의 사용자의 랭크 상관 관계를 나타낸 그래프이다.
도 17 은 본 발명에 따라 SLRCLUB, NAVER 여행 카테고리, NAVER 레스토랑 카테고리 내의 사용자의 랭크에 대한 NDCG(Normalized Discounted Cumulative Gain)을 나타낸 그래프이다.1 is a graph model depicting feedback activity for a user's post.
FIG. 2 is a graph model reconstructing the feedback activity of FIG. 1 in accordance with the present invention.
3 is a graphical model illustrating the amount of interaction between users in accordance with the present invention.
4 is a graph model classifying relationships between online network users into sociality and reputational relationships according to the present invention.
5 is a graph model showing the social strength between users in accordance with the present invention.
6 is a graph model showing the pure plate strength between users in accordance with the present invention.
7 is a flowchart illustrating a schematic flow of a reputation calculation method of an online network user implemented in a reputation calculation apparatus according to the present invention.
8 is a block diagram illustrating a schematic configuration of an apparatus for calculating a reputation of a network user according to the present invention.
9 is a graph showing a SLRCLUB user score distribution according to the present invention.
10 is a graph illustrating a relationship between ranks of SLRCLUB users according to the present invention.
11 is a graph illustrating a user having a high rank among SLRCLUB users according to the present invention.
12 is a graph showing rank correlation of SLRCLUB users according to the present invention.
13 is a graph showing the relationship between the rank of NAVER users according to the present invention in accordance with the present invention.
14 is a graph showing rank correlation of NAVER users according to the present invention.
15 is a graph illustrating cluster distribution of SLRCLUB and NAVER users according to the present invention.
16 is a graph showing rank correlation of users in a NAVER restaurant category and a travel category according to the present invention.
FIG. 17 is a graph illustrating a normalized discounted cumulative gain (NDCG) for a user's rank in SLRCLUB, NAVER travel category, and NAVER restaurant category according to the present invention.

이하 첨부된 도면을 참조하여 본 발명의 실시 예를 상세히 설명한다. 하기에서 본 발명을 설명함에 있어서 공지 기능 또는 구성에 대한 구체적인 설명이 본 발명의 요지를 불필요하게 흐릴 수 있다고 판단되는 경우에는 그 상세한 설명을 생략할 것이다. 그리고 후술되는 용어들은 본 발명에서의 기능을 고려하여 정의된 용어들로서 이는 사용자, 운용자의 의도 또는 관례 등에 따라 달라질 수 있다. 그러므로 그 정의는 본 명세서 전반에 걸친 내용을 토대로 내려져야 할 것이다.Hereinafter, exemplary embodiments of the present invention will be described in detail with reference to the accompanying drawings. In the following description of the present invention, if it is determined that a detailed description of a known function or configuration may unnecessarily obscure the subject matter of the present invention, the detailed description thereof will be omitted. The following terms are defined in consideration of the functions of the present invention, and may be changed according to the intentions or customs of the user, the operator, and the like. Therefore, the definition should be based on the contents throughout this specification.

온라인 네트워크는 인터넷 상에서 의사를 소통하는 공간으로서, 온라인 포럼(forum), 온라인 커뮤니티(online community), 블로그 공간(blogosphere), 소셜 네트워크(social network) 등 다양한 형태를 포함한다.An online network is a space for communicating on the Internet, and includes various forms such as an online forum, an online community, a blogosphere, and a social network.

온라인 커뮤니티, 블로그, 소셜 네트워크 등 온라인 네트워크 상에서 사용자의 평판(reputation) 또는 전문성(expertise)은 그 사용자가 온라인 상에 작성한 포스트(post) 및 피드백(feedback) 등의 온라인 활동에 의해서 결정된다. 예를 들어, 어떤 사용자가 작성한 포스트에 대하여 다른 사용자들이 피드백을 제공하였다면, 그 사용자는 그 포스트와 다른 사용자의 피드백들에 의하여 자신의 평판(reputation)이 더욱 높아질 것이고, 평판에 기반한 그 사용자의 스코어(score) 및 랭크(rank)도 오르게 될 것이다.The reputation or expertise of a user on an online network, such as an online community, a blog, or a social network, is determined by online activities such as posts and feedback that the user has written online. For example, if another user provided feedback on a post written by a user, that user would have a higher reputation for that post and other users' feedback, and that user's score based on the reputation The score and rank will also rise.

본 발명의 일 실시 예는, 어떤 사용자가 평판이 낮은 사용자로부터 피드백을 얻을 때보다 평판이 높은 사용자로부터 피드백을 획득할 때 그 사용자의 평판이 더 많이 올라 한다는 점을 고려하여, 사용자 랭킹에 페이지랭크(Page Rank) 알고리즘 또는 HITS(Hyperlink-Induced Topic Search) 알고리즘을 적용한다.According to an embodiment of the present invention, when a user obtains feedback from a user with a high reputation than a user receives feedback from a user with a low reputation, the user's reputation is increased. (Page Rank) algorithm or Hyperlink-Induced Topic Search (HITS) algorithm is applied.

도 1은 사용자의 포스트에 대한 피드백 활동을 도시한 그래프 모델이다. 본 그래프 모델은 사용자(110, 120, 130)와 포스트(210, 220, 230, 240, 250)를 각각 노드(node)로 표현하고, 사용자의 포스트 작성활동 및 피드백 제공활동을 사용자 노드와 포스트 노드를 연결하는 에지(edge)로 표현한다. 구체적으로 실선 에지는 사용자의 포스트 작성활동을, 점선 에지는 사용자의 피드백 제공활동을 의미한다. 예를 들어, 사용자 1(110)은 실선으로 연결된 포스트 1(210)과 포스트 2(220)를 작성하였고, 점선으로 연결된 포스트 3(230)과 포스트 4(240)에 피드백을 제공하였음을 알 수 있다.1 is a graph model depicting feedback activity for a user's post. This graph model represents users 110, 120, 130 and posts 210, 220, 230, 240, 250 as nodes, respectively, and the user's post creation activity and feedback providing activity are user nodes and post nodes. Expressed as an edge connecting the. Specifically, the solid edge means the user's post writing activity, and the dashed edge means the user's feedback providing activity. For example, User 1 110 may have written Post 1 210 and Post 2 220 connected in solid lines and provided feedback to Post 3 230 and Post 4 240 connected in dotted lines. have.

도 2는 본 발명에 따라 도 1의 피드백 활동을 사용자 중심으로 재구성한 그래프 모델이다. 본 그래프 모델은 사용자(110, 120, 130)만을 노드(node)로 표현하고, 사용자 간에 교환한 피드백의 횟수(112, 121, 131, 132)를 사용자 노드 간을 연결하는 에지(edge)로 표현한다. 예를 들어, 사용자 1(110)은 사용자 2(120)에게 2개의 피드백(112)을 제공하고 사용자 2(120)로부터 1개의 피드백(121)을 받았음을 알 수 있다.2 is a graph model reconstructing the feedback activity of FIG. 1 in accordance with the present invention. In this graph model, only the users 110, 120, and 130 are represented as nodes, and the number of feedbacks 112, 121, 131, and 132 exchanged between users is represented as an edge connecting the user nodes. do. For example, user 1 110 may provide two feedbacks 112 to user 2 120 and receive one feedback 121 from user 2 120.

도 2와 같이 사용자 중심으로 재구성된 그래프의 각 에지(112, 121, 131, 132)는, 해당 사용자 간의 피드백의 교환을 나타내고 있으므로, 평판에 기반한 사용자 랭킹의 계산 및 그에 따른 검색결과의 산출에 적합한 정보를 포함하고 있다. 따라서, 본 발명의 일 실시 예는 이러한 사용자 중심의 그래프에 페이지랭크 알고리즘 및 HITS 기반 알고리즘을 적용하여 사용자 랭킹 결과를 얻어낸다.Each edge 112, 121, 131, and 132 of the graph reconstructed by the user center as shown in FIG. 2 represents an exchange of feedback between the corresponding users, and thus is suitable for calculating a user ranking based on reputation and calculating a search result accordingly. Contains information. Therefore, an embodiment of the present invention obtains a user ranking result by applying a page rank algorithm and an HITS-based algorithm to such a user-centered graph.

앞서 설명한 것과 같이, 페이지랭크 알고리즘 및 HITS알고리즘은 웹 페이지의 랭킹을 얻기 위해서 사용되어 왔으며, 상업적인 검색 엔진에서 채택되고 있다. 이러한 알고리즘은 네트워크에서의 사용자들의 랭킹을 산출하기 위하여 적용될 수도 있다.As described above, pagerank algorithms and HITS algorithms have been used to obtain the ranking of web pages and are being employed in commercial search engines. This algorithm may be applied to calculate the ranking of users in the network.

그러나, 사용자 중심의 그래프에서의 링크의 특성은 사용자와 포스트간의 링크의 특성과는 상당히 다름을 알 수 있다. 우선 하나의 차이점은, 사용자 중심의 그래프에서는 한 쌍의 웹 페이지 사이에서의 에지가 양분화(dichotomous)되어 있는 것이다. 온라인 네트워크에서의 사용자는 다른 사용자에게 하나 이상의 피드백을 제공할 수 있으므로 에지는 가중치를 갖게 된다. 기존의 페이지랭크 모델에서의 랜덤 워크(random walk) 모델에서는 랜덤 서퍼(또는 피드백을 제공하는 사용자)가 하나의 노드에서 근접한 이웃 노드들 각각으로 이동할 확률이 동일하였다. 그러나, 피드백을 고려하는 경우 피드백을 제공하는 사용자는 다음에 피드백을 제공할 사용자를 그 사용자가 다른 사용자에게 이전에 제공한 피드백의 빈도에 기초하여 결정할 수 있다. However, it can be seen that the characteristics of the links in the user-centered graph are quite different from those of the link between the user and the post. One difference is that in a user-centric graph, the edges between a pair of web pages are dichotomous. Users in an online network can provide one or more feedbacks to other users, so edges have weights. In the random walk model of the existing page rank model, the probability that a random surfer (or a user providing feedback) moves from one node to each of neighboring neighbor nodes is the same. However, when considering feedback, the user providing the feedback may then determine the user to provide feedback based on the frequency of feedback that the user previously provided to other users.

또 다른 차이점은, 사용자간의 상호작용이다. 사용자간의 상호작용은 온라인 네트워크에서의 중요한 특성이다. 온라인 네트워크는 일반적으로 다수의 친분(intimacy)(또는 우정(friend) 또는 면식(acquaintance)) 그룹을 포함하고 있다. 사용자는 같은 그룹에 속해 있지 않은 사용자 보다는 같은 그룹에 속해 있는 사용자에게 많은 피드백을 제공할 것이다. 이는 특정 그룹 내에서 매우 많은 개수의 피드백을 교환하는 것을 관측함으로써 추론할 수 있다. 이러한 경향은 랜덤 워크 모델을 기초로 하는 랭킹 방법의 공정성을 약화시킨다. 예를 들어, 어떤 사용자는 그 사용자가 속해있는 그룹에서 매우 인기가 많기 때문에 실제 받아야 할 랭킹보다 높은 랭킹을 갖고 있을 수도 있다. 다른 경우에, 어떤 사용자는 어떠한 친분 그룹에도 속해있지 않기 때문에 실제 받아야 할 랭킹보다 낮은 랭킹을 갖고 있을 수도 있다. 그러므로, 기존의 랭킹 방법을 사용하는 경우, 높은 사회성 랭크를 갖고 있는 사용자가 불공평하게 높은 스코어를 가짐으로써, 낮은 사회성 랭크를 갖고 있는 사용자가 불공평하게 낮은 스코어를 가지는 문제점이 발생할 수도 있다.Another difference is the interaction between users. User interaction is an important characteristic in online networks. Online networks generally include a large number of intimacy (or friendship or acquaintance) groups. Users will give more feedback to users who belong to the same group than users who do not belong to the same group. This can be inferred by observing a very large number of feedback exchanges within a particular group. This tendency weakens the fairness of the ranking method based on the random walk model. For example, a user may have a higher ranking than the actual one that the user should receive because the user is very popular in the group to which the user belongs. In other cases, some users may have lower rankings than those that actually need to be because they are not in any familiar group. Therefore, in the case of using the existing ranking method, a user having a high social rank may have an unfairly high score, and thus a problem may occur in which a user having a low social rank has an unfairly low score.

따라서, 사용자 랭킹을 산출함에 있어서 사용자들간의 사회성을 고려하는 것은 매우 중요하다. 각각의 사용자의 랭크를 정확하게 측정하기 위해서는 사용자 각각의 사회성 및 평판도를 측정하는 것이 중요하다. 그러나, 사용자의 사회성이 언제나 사용자의 랭크에 부정적인 영향을 미치는 것은 아니다. 종래의 연구 결과에 따르면 서로 교환된 피드백의 수는 사용자의 랭크와 깊이 연관되어 있어서, 파워(power) 사용자간에 교환된 피드백의 수 > 파워 사용자와 비파워(non-power) 사용자간에 교환된 피드백의 수 > 비파워 사용자간에 교환된 피드백의 수라고 정의하고 있다. 이러한 종래의 연구 결과를 도 3 에 도시하였다.Therefore, it is very important to consider sociality among users in calculating user ranking. In order to accurately measure the rank of each user, it is important to measure the sociality and reputation of each user. However, the sociality of the user does not always negatively affect the user's rank. Previous research has shown that the number of feedbacks exchanged with each other is deeply related to the rank of the user, such that the number of feedbacks exchanged between power users is greater than that of power users and non-power users. It is defined as the number of feedback exchanged between number> non-power users. The results of this conventional research are shown in FIG. 3.

도 3 은 본 발명에 따라 사용자간의 상호작용 양을 도시한 그래프 모델이다. 도 3 에서는 사용자간의 피드백의 양에 따라 사용자를 레벨 A로부터 레벨E로 분류하였다. 파워 사용자에는 레벨A 의 사용자가 포함되며, 비파워 사용자에는 레벨E의 사용자가 포함된다. 대부분의 상호작용은 레벨A의 사용자간에서 일어나며, 낮은 레벨의 사용자간에는 상호작용이 거의 없음을 관측할 수 있다. 그러므로, 앞서 설명한 것과 같이 사용자간의 사회성이 사용자의 랭크를 산출함에 있어서 중요함을 인식할 수 있다. 3 is a graphical model illustrating the amount of interaction between users in accordance with the present invention. In FIG. 3, users are classified from level A to level E according to the amount of feedback between users. Power users include users at level A, and non-power users include users at level E. Most of the interactions occur between Level A users, and we can observe that there is little interaction between lower level users. Therefore, as described above, it can be recognized that sociality between users is important in calculating a user's rank.

다음에서는 온라인 네트워크에서의 사용자의 랭킹을 산출하기 위하여 최초 평판도 점수를 산출하는 방법, 사회성 점수를 산출하는 방법 및 순수 평판도 점수를 산출하는 방법에 대해서 설명하기로 한다.
Next, a method of calculating an initial reputation score, a method of calculating a social score, and a method of calculating a pure reputation score will be described in order to calculate a ranking of users in an online network.

* 최초 평판도 랭크( Reputation Rank ; RR ) * Initial Reputation Rank Rank ; RR )

페이지랭크 알고리즘 및 HITS 알고리즘을 적용하여 사용자의 평판도 랭크를 산출할 수 있다. 앞서 설명한 것과 같이, 온라인 네트워크에서의 사용자는 다른 사용자에게 하나 이상의 피드백을 제공할 수 있으므로 에지는 가중치를 갖게 된다. 그러므로, 사용자 중심의 그래프에서는 한 쌍의 웹 페이지 사이에서의 에지가 양분화(dichotomous)되게 된다. 따라서, 기존의 페이지랭크 모델에서 랜덤 서퍼(또는 피드백을 제공하는 사용자)가 하나의 노드에서 근접한 이웃 노드들 각각으로 이동할 확률이 동일하다는 가정을 사용할 수 없게 되었다. 그 대신, 피드백을 제공하는 사용자는 다음에 피드백을 제공할 사용자를 그 사용자가 다른 사용자에게 이전에 제공한 피드백의 빈도에 기초하여 결정할 수 있다. 피드백을 제공하는 사용자 v가 사용자u_i에게 다음의 피드백을 제공할 확률은 v가 u_i에게 제공한 피드백의 개수와 v 가 사용자 전체에게 제공한 피드백 개수의 비율로 계산된다.The user's reputation rank may be calculated by applying the page rank algorithm and the HITS algorithm. As discussed above, edges are weighted because a user in an online network may provide one or more feedback to other users. Thus, in a user-centric graph, the edges between a pair of web pages are dichotomous. Therefore, in the existing page rank model, it is impossible to use the assumption that random surfers (or users providing feedback) have the same probability of moving from one node to each of neighboring neighbor nodes. Instead, the user providing the feedback may determine which user to provide feedback next based on the frequency of feedback that the user has previously provided to other users. The probability that the user v providing the feedback provides the next feedback to the user u _i is calculated as the ratio of the number of feedbacks that v provides to u _i and the number of feedbacks that v provides to the entire user.

따라서, 랜덤 워크 모델을 기초로 하여 사용자의 최초 평판도 점수(RR)는 아래의 수식과 같이 계산될 수 있다.Therefore, based on the random walk model, the user's initial reputation score RR may be calculated as in the following equation.

[수식 1][Equation 1]

여기서, RR_rw(u) 는 u의 최초 평판도 점수, RR_rw(v)는 v의 최초 평판도 점수, b(u)는 u에게 피드백을 제공한 사용자의 집합, f(v)는 v가 작성한 피드백을 받은 사용자의 집합, F_v _->u는 u에게 v가 제공한 피드백의 개수,

는 v가 작성한 피드백의 전체 개수, d는 감쇄 인자(damping factor)를 나타낸다. 즉, 최초 평판도 점수는 u에게 다른 사용자들이 피드백을 제공할 확률과 다른 사용자들의 최초 평판도 점수의 곱셈에 의하여 계산된다.Where RR _rw (u) is u's initial reputation score, RR _rw (v) is v's initial reputation score, b (u) is the set of users who provided feedback to u, and f (v) is feedback written by v The set of users who received F _v- _{> u} is the number of feedback v has given to u,

Is the total number of feedbacks written by v, and d is the damping factor. That is, the initial reputation score is calculated by multiplying the probability that other users give feedback to u and the initial reputation score of other users.

한편, HITS 알고리즘을 기초로 하여 사용자의 최초 평판도 점수(RR)는 아래의 수식과 같이 계산될 수 있다.On the other hand, based on the HITS algorithm, the user's initial reputation score (RR) can be calculated as shown below.

[수식 2][Equation 2]

상기 수식 2 에서 RR_hits _- _authority는 권위성(authority)에 기반한 사용자의 점수를 나타내며, RR_hits _- _hub는 허브성(but)에 기반한 사용자의 점수를 나타낸다. HITS 모델을 적용하는 경우에는 사용자 v를 허브 노드로 사용자 u를 권위 노드로 정의하여 u의 권위 점수를 산출하고, u의 권위 점수에 따라 u의 최초 평판도 점수를 산출하게 된다. 이 경우 u 의 초기 권위 점수에 따라 v 의 허브 점수를 산출하고 v 가 u 에게 피드백을 제공할 확률과 v의 허브 점수의 곱을 산출하여 u의 권위 점수를 산출할 수 있다. HITS 알고리즘을 기초로 한 사용자의 최초 평판도 점수 RR_hits는 RR_{hits-authority}와 유사한 값을 가진다.In Equation 2, RR _hits _- _authority represents a user's score based on authority, and RR _hits _- _hub represents a user's score based on butt. In the case of applying the HITS model, the user v is defined as the hub node, the user u is defined as the authority node, and the authority score of u is calculated, and the initial reputation score of u is calculated according to the authority score of u. In this case, the authority score of u may be calculated by calculating the hub score of v according to the initial authority score of u, and calculating the product of the probability that v provides feedback to u and the hub score of v. The user's initial reputation score, RR _hits , based on the HITS algorithm, has a value similar to RR _{hits-authority} .

한편, 앞서 설명한 것과 같이 보다 정확한 사용자 랭킹을 산출하기 위해서는 최초 평판도 점수 외에도 온라인 네트워크의 사용자간의 사회성 점수를 더 고려할 수 있다. 이하에서는 온라인 네트워크 사용자간의 사회성 랭크 산출 방법에 대하여 설명하도록 한다.Meanwhile, in order to calculate a more accurate user ranking as described above, in addition to the initial reputation score, the social score between users of the online network may be further considered. Hereinafter, a method of calculating social rank between online network users will be described.

사회성 랭크(Sociability Rank; SR)Sociability Rank (SR)

온라인 네트워크에서 사용자간의 관계는 크게 2 가지로 구분될 수 있다. 하나는 친분 관계에 의해 생성되는 관계이고 다른 하나는 평판에 의해 생성되는 관계이다. 따라서, 효율적으로 사용자 랭킹 결과를 얻기 위해서는 온라인 네트워크상의 사용자 사이의 관계를 사회성 관계 및 평판도 관계로 분류하는 것이 필요하다. In the online network, the relationship between users can be classified into two types. One is a relationship created by friendship and the other is a relationship created by reputation. Therefore, in order to efficiently obtain user ranking results, it is necessary to classify the relationship between users on the online network into a social relationship and a reputation relationship.

도 4 는 본 발명에 따라 온라인 네트워크 사용자 사이의 관계를 사회성 관계 및 평판도 관계로 분류한 그래프 모델이다. 도 4(a)에서는 사용자 사이의 사회성 관계 및 평판도 관계가 모두 도시되어 있어, 정확한 사용자 랭킹을 산출하기 어려운 문제점이 있다. 한편, 도 4(b)는 사용자간의 친분 관계에 의해 형성된 사회성 관계를 도시한 것이다. 또한, 도4(c)는 사용자간의 평판도에 의해서만 형성된 평판도 관계를 도시한 것이다. 따라서, 온라인 네트워크 사용자 사이에는 사회성 관계 및 평판도 관계가 있음을 인지하고 아래에서는 사회성 관계에 의해 형성된 사회성 랭크를 산출하는 방법에 대하여 설명하도록 한다.4 is a graph model classifying relationships between online network users into sociality and reputational relationships according to the present invention. In FIG. 4 (a), both social and reputation relationships between users are illustrated, which makes it difficult to calculate accurate user rankings. On the other hand, Figure 4 (b) shows the social relationship formed by the friendship between users. 4 (c) shows the flatness relationship formed only by the flatness between users. Therefore, it is recognized that there is a social relationship and a reputation relationship among online network users, and a method of calculating a social rank formed by the social relationship will be described below.

사회성은 온라인 네트워크에서 사용자의 랭킹에 영향을 미치는 중요한 요소이다. 만약 두 사용자가 서로 친밀한 친분 관계를 형성하고 있는 경우, 서로 친분 관계를 형성하고 있지 않은 경우에 비하여 피드백을 교환할 가능성이 높다. 또한, 영향력이 높은 사용자 사이가 영향력이 낮은 사용자 사이보다 피드백을 교환할 확률이 크다. 사회성은 사용자 사이의 피드백의 양방향성을 고려하여 산출된다. Sociality is an important factor that affects the ranking of users in online networks. If the two users form an intimate relationship with each other, there is a high possibility of exchanging feedback compared to the case where the two users do not form an intimate relationship with each other. In addition, it is more likely that users with high impact exchange feedback than users with low impact. Sociality is calculated taking into account the bidirectionality of feedback between users.

도 5 는 본 발명에 따라 사용자간의 사회성 강도(strength)를 도시한 그래프 모델이다. 온라인 네트워크 상의 2 명의 사용자를 사용자 a 및 사용자 b로 지칭되는 경우 사용자 a 가 작성한 포스트는 P_a(510) 사용자 b 가 작성한 포스트는 P_b(520)로 지칭할 수 있다. 한편, b 에 의해 작성된 포스트에 a 가 제공한 피드백의 개수는 F_a _->b (530)로 지칭되며, a 에 의해 작성된 포스트에 b 가 제공한 피드백의 개수는 F_b _->a (540)로 지칭된다. FIG. 5 is a graphical model illustrating social strength between users in accordance with the present invention. FIG. When two users on the online network are referred to as user a and user b, a post written by user a may be referred to as P _a 510 and a post written by user b may be referred to as P _b 520. On the other hand, the number of feedbacks provided by a to a post written by _b is referred to as F _a- _{> b} 530, and the number of feedbacks provided by b to a post written by a is F _b- _{> a} 540. Is referred to.

한편, 사용자 a 와 b 사이의 사회성 강도는 F^(s) _a<->b (550)로 지칭된다. F^(s) _a<->b 는 F'_a _->b 와 F'_b _->a의 값에 기초하여 결정된다. 여기서, F'_a _->b 및 F'_b _->a는 각각 F_a _->b (530) 및 F_b _->a (540) 일 수도 있다. 또한, F'_a _->b 및 F'_b _->a는 다음 수식 3과 같이 계산될 수도 있다.On the other hand, the social strength between users a and b is referred to as F ^(s) _{a <-> b} 550. ^{_{F (s) a <->}} b is F is determined on the basis of the value _{_of> _a _- 'a ->} _b, and F' _b. Where, F _'a _{-> b,} and F' _b _{-> a} are each F _a _- may _{_{be> a (540) -> b}} (530) and F _b. In addition, F ' _a- _{> b} and F' _b- _{> a} may be calculated as in Equation 3 below.

[수식 3][Equation 3]

즉, F'_a _->b 및 F'_b _->a는 a 와 b 가 작성한 포스트의 개수를 고려하여 계산될 수 있다. 또한, F'_a _->b 및 F'_b _->a는 다음 수식 4와 같이 계산될 수도 있다.That is, F ' _a- _{> b} and F' _b- _{> a} may be calculated in consideration of the number of posts created by a and b. In addition, F ' _a- _{> b} and F' _b- _{> a} may be calculated as in Equation 4 below.

[수식 4][Equation 4]

F'_a _->b 와 F'_b _->a가 생성된 이후 F^(s) _a<->b가 계산된다. F^(s) _a<->b는 F'_a _->b 와 F'_b _-->a 중 작은 값, F'_a _->b 와 F'_b _->a의 산술 평균, 기하 평균 및 조화 평균 중 하나로 계산될 수 있다. F^(s) _a<->b 가 F'_a _->b 와 F'_b _-> _a의 조화 평균으로 계산되는 경우 수식은 다음 수식 5 와 같다.After F ' _a- _{> b} and F' _b- _{> a} are generated, F ^(s) _{a <-> b} is calculated. ^{_{F (s) a <->}} b is F _'a _{-> b,} and F' _b _{- ->} smaller of a, F _'a _{-> b,} and F' _b _{-> a} of the arithmetic mean, the geometric mean and the harmonic mean Can be calculated to either. ^{_{F (s) a <->}} b a F _'a _{-> b,} and F' _b _- when calculating the harmonic mean _of> _a formula shown in the following formula 5.

[수식 5][Equation 5]

만약 a 와 b 가 서로에게 피드백을 주지 않은 경우 사회성 강도 F^(s) _a<->b (550)는 0이 된다. 앞서 계산된 사회성 강도 F^(s) _a<->b (550)를 이용하여 사용자의 사회성 점수(SR)를 계산할 수 있다. 사회성 점수는 다음 수식 6과 같이 계산된다.If a and b do not give each other feedback, the social strength F ^(s) _{a <-> b} (550) is zero. The social score (SR) of the user may be calculated using the social strength F ^(s) _{a <-> b} 550 calculated above. The social score is calculated as shown in Equation 6 below.

[수식 6][Equation 6]

여기서, u 는 제 1 사용자, v 는 제 2 사용자, SR(u) 는 u의 사회성 점수, SR(v)는 v의 사회성 점수, bd(u)는 u 피드백을 교환한 사용자의 집합, bd(v)는 v와 피드백을 교환한 사용자의 집합, F^(s) _v->u는 u와 v의 사회성 점수,

는 v와 다른 사용자간의 사회성 점수의 합, d는 감쇄 인자를 나타낸다. 수식 6은 랜덤 워크 모델을 적용하여 계산한 것이나, 사회성 강도를 HITS 모델에 적용하여 사회성 점수를 계산할 수도 있다. 수식 6에서 계산된 사회성 점수는 후에 순수 평판도 점수를 계산함에 있어서 고려될 수 있다. Where u is the first user, v is the second user, SR (u) is the social score of u, SR (v) is the social score of v, bd (u) is the set of users who exchanged u feedback, bd ( v) is the set of users who exchanged feedback with v, F ^(s) _{v-> u} is the social score of u and v,

Is the sum of the social scores between v and other users, and d is the decay factor. Equation 6 is calculated by applying a random walk model, but may also calculate a social score by applying the social strength to the HITS model. The social score calculated in Equation 6 may be considered later in calculating the pure reputation score.

이하에서는 앞서 설명한 최초 평판도 점수 및 사회성 점수를 이용하여 온라인 네트워크 사용자간의 순수 평판도 점수를 계산하는 방법에 대하여 설명하도록 한다.Hereinafter, a method of calculating a pure reputation score between online network users by using the aforementioned initial reputation score and social score will be described.

순수 평판도 Pure reputation 랭크Rank ( ( PurePure ReputationReputation RankRank ; ; PRPR ))

순수 평판도 점수는 최초 평판도 점수에서 사회성 점수를 감산함으로서 얻어질 수 있다. 예를 들어, 어떤 사용자의 평판도가 매우 높다고 했을 때, 만약 이 유저의 점수 중 대부분이 사회성에 의한 점수라고 한다면 이러한 랭킹을 정확한 랭킹으로 보기는 어렵다. 따라서, 사용자의 랭킹 점수를 정확하게 계산하기 위해서는 사용자의 사회성과 사용자의 순수 평판도 점수를 분리하는 것이 매우 중요하다. The pure reputation score can be obtained by subtracting the sociality score from the original reputation score. For example, if a user's reputation is very high, if the majority of the user's scores are social scores, it is difficult to see this ranking as an accurate ranking. Therefore, in order to accurately calculate the user's ranking score, it is very important to separate the user's social score and the user's pure reputation score.

도 6 은 본 발명에 따라 사용자간의 순수 평판도 강도를 도시한 그래프 모델이다. 온라인 네트워크 상에서 사용자 a 와 사용자 b 가 피드백을 교환하고 있으며, 사용자 a 가 작성한 포스트는 P_a(610) 사용자 b 가 작성한 포스트는 P_b(620)로 지칭된다. 또한, b 에 의해 작성된 포스트에 a 가 제공한 피드백의 개수는 F_a _->b (630)로 지칭되며, a 에 의해 작성된 포스트에 b 가 제공한 피드백의 개수는 F_b _->a (640)로 지칭된다. 또한, 사용자 a 와 b 사이의 사회성 강도는 F^(s) _a<-> _b 로 지칭된다.6 is a graph model showing the pure plate strength between users in accordance with the present invention. User a and b are exchanging feedback on the online network, and the post written by user a is P _a 610 and the post written by user b is referred to as P _b 620. In addition, the number of feedbacks provided by a to the post written by _b is referred to as F _a- _{> b} 630, and the number of feedbacks provided by b to the post written by a is F _b- _{> a} (640). Is referred to. Also, the social strength between users a and _b is referred to as F ^(s) _{a <->} _b .

이 경우, 사용자 a로부터 b로의 순수 평판도 강도인 F^(p) _a->b (650) 는 다음 수식 7 과 같이 계산될 수 있다. In this case, F ^(p) _{a-> b} 650, which is the pure ^plate strength from user a to b, may be calculated as in Equation 7 below.

[수식 7][Formula 7]

기본적으로, 사용자 a 로부터 b로의 순수 평판도 강도는 사용자 a로부터 b로의 피드백의 개수(F_a _->b) 에서 사용자 a 와 b 간의 사회성 강도(F^(s) _a<->b)를 빼서 계산할 수 있다. 그러나, F_a _->b에서 F^(s) _a<->b를 빼는 경우 0 보다 작은 값이 나올 수도 있기 때문에 F^(p) _min 이란 값을 생성하고, F^(p) _min 과 F_a _->b- F^(s) _a<->b 중 큰 값을 a로부터 b 로의 순수 평판도 강도로 계산하게 된다. F^(p) _min 는 미리 정해진 상수인 f_min과 F_a _->b중 작은 값으로 설정될 수 있다. Basically, pure flatness strength to b from a user and a is the number of feedback to b from a user _{_{a (F a -> b)}} - can be subtracted to calculate the user social between a and b, the intensity ^{_{(> b F (s) a}} <) in have. However, F _a _-> in _b F ^(s) _{a <- because>} if subtracting the _b may come out a value less than 0 F ^(p) _min generate is value, F ^(p) _min and F _a _{-> b} -F ^{(s) The} larger of _{a <-> b} is calculated as the pure plate strength from a to b. F ^(p) _min may be set to a smaller value of _a predetermined constant f _min and F _a- _{> b} .

예를 들어, F_a _->b = 1, F_b _->a= 3, F^(s) _a<->b= 1.5, f_min=0.5 인 경우, F^(p) _a->b =0.5 의 값을 가지고, F^(p) _b->a =1.5의 값을 가지게 된다. a 와 b의 순수 평판도 강도를 계산함에 있어서, a 와 b 사이의 사회성을 고려하기 때문에 a 와 b 의 순수 평판도 강도는 최초 평판도보다 낮은 값을 가지게 된다. For _{_{example, F a -> b = 1}} , F b -> a = 3, F (s) a <-> b = 1.5, when the _{^{f min = 0.5, F (p}} ) a-> b = 0.5 of With a value of F ^(p) _{b-> a} = 1.5. In calculating the net reputation strengths of a and b, the net reputation strengths of a and b are lower than the original reputation because the sociality between a and b is taken into account.

앞서 설명한 순수 평판도 강도를 랜덤 워크 모델에 적용하여 사용자의 순수 평판도 점수를 계산할 수 있다. 사용자의 순수 평판도 점수(PR)를 아래 수식8과 같이 표현된다.The pure reputation score of the user may be calculated by applying the aforementioned pure reputation intensity to a random walk model. The pure reputation score (PR) of the user is expressed as shown in Equation 8 below.

[수식 8][Equation 8]

여기서, u 는 제 1 사용자, v 는 제 2 사용자, PRrw(u) 는 u의 순수 평판도 점수, PRrw(v)는 v의 순수 평판도 점수, b(u)는 u에게 피드백을 제공한 사용자의 집합, f(v)는 v가 피드백을 제공한 사용자의 집합,

는 u로부터 v로의 순수 평판도 강도,

는 v의 순수 평판도 강도의 합, d는 감쇄 인자(damping factor)를 나타낸다. 한편, HITS 모델을 적용하여 사용자의 순수 평판도 점수를 계산할 수도 있다. Where u is the first user, v is the second user, PRrw (u) is the pure reputation score of u, PRrw (v) is the pure reputation score of v, and b (u) is the set of users who provided feedback to u , f (v) is the set of users for whom v provided feedback,

Is the pure platen strength from u to v,

Is the sum of the pure platelet strengths of v, d is the damping factor. Meanwhile, the pure reputation score of the user may be calculated by applying the HITS model.

도 7 은 본 발명에 따라 평판도 산출 장치에서 구현되는 온라인 네트워크 사용자의 평판도 산출 방법의 개략적인 흐름을 도시한 흐름도이다. 평판도 산출 장치는, 수신부, 제 1 정보 획득부, 최초 평판도 점수 산출부, 제 2 정보 획득부, 사회성 점수 산출부, 순수 평판도 점수 산출부를 포함한다. 단계 S710 에서 수신부에서 복수의 온라인 네트워크 사용자가 작성한 복수의 온라인 네트워크 포스트에 대한 피드백 정보를 수신한다. 피드백은 댓글, 코멘트, 공감, 추천, 트랙백, 스크랩 및 링크를 포함하는 그룹 중 하나 이상을 포함할 수 있다.7 is a flowchart illustrating a schematic flow of a reputation calculation method of an online network user implemented in a reputation calculation apparatus according to the present invention. The reputation calculator includes a receiver, a first information acquisition unit, an initial reputation score calculation unit, a second information acquisition unit, a sociality score calculator, and a pure reputation score calculator. In step S710, the receiving unit receives feedback information on a plurality of online network posts written by a plurality of online network users. Feedback may include one or more of a group that includes comments, comments, empathy, recommendations, trackbacks, scraps, and links.

단계 S720에서 제 1 정보 획득부에서 사용자 중 제 1 사용자가 아닌 사용자 각각이 사용자 중 제 1 사용자에게 피드백을 제공할 확률인 제 1 정보를 획득한다. 상기 제 1 정보는 상기 사용자 중 제 2 사용자가 제 1 사용자에게 제공한 피드백의 개수를 제 2 사용자가 다른 사용자에게 제공한 피드백의 전체 개수로 나눠서 획득될 수 있다. In operation S720, the first information acquisition unit acquires first information that is a probability that each of the users who are not the first user among the users provides feedback to the first user. The first information may be obtained by dividing the number of feedbacks provided by the second user among the users to the first user by the total number of feedbacks provided by the second user to other users.

단계 S730 에서 최초 평판도 점수 산출부에서 제 1 정보를 이용하여 제 1 사용자의 최초 평판도 점수를 산출한다. 온라인 네트워크에서 어떤 사용자의 포스트가 높은 평판도를 가지는 사용자에게 피드백을 받을수록, (낮은 평판도를 가지는 사용자에게 피드백을 받는 경우에 비하여) 높은 평판도를 가지게 된다. 따라서, 제 1 사용자의 최초 평판도 점수를 산출하기 위해서는 제 1 정보 및 제 2 사용자의 최초 평판도의 곱셈을 합산하여 계산하게 된다. 또한, 제 1 사용자의 최초 평판도 점수는 랜덤 워크 모델을 적용한 수식 1 및 HITS 모델을 적용한 수식 2 에 의해서 산출될 수도 있다. In operation S730, the first reputation score calculator calculates an initial reputation score of the first user using the first information. In an online network, the more a user's posts receive feedback from a user with a higher reputation, the higher the reputation (compared to a feedback from a user with a lower reputation). Therefore, in order to calculate the first reputation score of the first user, the multiplication of the first information and the first reputation degree of the second user is added and calculated. In addition, the first reputation score of the first user may be calculated by Equation 1 applying the random walk model and Equation 2 applying the HITS model.

단계 S740 에서 제 2 정보 획득부에서 제 1 사용자와 전체 사용자 중 사용자 각각이 피드백을 교환할 확률인 제 2 정보를 획득한다. 제 2 정보는 사회성 강도에 기초하여 계산되는 것으로서, 제 1 사용자가 제 2 사용자와 교환한 피드백의 개수(사회성 강도)를 제 2 사용자가 다른 사용자와 교환한 피드백의 전체 개수로 나눠서 획득하게 된다. 이 경우, 제 1 사용자가 제 2 사용자와 교환한 피드백의 개수는 제 1 사용자가 제 2 사용자에게 제공한 피드백의 개수를 이용하는 제 1 피드백 개수 및 제 2 사용자가 제 1 사용자에게 제공한 피드백의 개수를 이용하는 제 2 피드백 개수를 이용하여 계산된다. 여기서, 제 1 사용자가 제 2 사용자와 교환한 피드백의 개수는 제 1 피드백 개수 및 제 2 피드백 개수 중 작은 값, 제 1 피드백 개수 및 제 2 피드백 개수의 산술 평균, 기하 평균 및 조화 평균 중 하나로 계산될 수 있다. 또한, 제 1 피드백 개수 및 제 2 피드백 개수는 제 1 사용자가 작성한 포스트의 수 및 제 2 사용자가 작성한 포스트의 수를 고려하여 산출될 수 있다. In operation S740, the second information acquisition unit acquires second information, which is a probability that each of the first user and the entire user exchanges feedback. The second information is calculated based on social strength, and is obtained by dividing the number of feedbacks (social strength) exchanged with the second user by the first user by the total number of feedbacks exchanged with other users by the second user. In this case, the number of feedbacks exchanged with the second user by the first user is the number of first feedbacks using the number of feedbacks provided by the first user to the second user and the number of feedbacks provided by the second user to the first user. It is calculated using the second feedback number using. Here, the number of feedbacks exchanged with the second user by the first user is calculated as one of a smaller value of the first and second feedback numbers, an arithmetic mean, a geometric mean, and a harmonic mean of the first and second feedback numbers. Can be. The first feedback number and the second feedback number may be calculated in consideration of the number of posts written by the first user and the number of posts written by the second user.

단계 S750 에서 사회성 점수 산출부에서 제 2 정보를 이용하여 제 1 사용자의 사회성 점수를 산출한다. 이 경우 제 1 사용자의 최초 평판도 점수를 계산할 때와 마찬가지로 랜덤 워크 모델을 적용하여 제 2 정보 및 제 2 사용자의 사회성 점수의 곱셈을 포함하여 제 1 사용자의 사회성 점수를 계산할 수 있다. 제 1 사용자의 사회성 점수는 수식 6에 의해서 계산될 수 있다.In operation S750, the social score calculator calculates the social score of the first user using the second information. In this case, as in the case of calculating the first reputation score of the first user, the social walk score of the first user may be calculated by applying a random walk model, including the multiplication of the second information and the social score of the second user. The social score of the first user may be calculated by Equation 6.

단계 S760 에서 순수 평판도 점수 산출부에서 제 1 사용자의 최초 평판도 점수 및 사회성 점수를 이용하여 제 1 사용자의 순수 평판도 점수를 산출한다. 이 경우, 기본적으로 제 1 사용자의 최초 평판도 점수에서 제 1 사용자의 사회성 점수를 감산함으로서, 제 1 사용자의 순수 평판도 점수를 계산할 수 있다. 또한, 제 1 사용자의 최초 평판도 점수에서 제 1 사용자의 사회성 점수를 감산한 값이 0 이하인 경우를 대비하여 소정의 임계값과 상기 사용자의 최초 평판도 점수에서 제 1 사용자의 사회성 점수를 감산한 값 중 큰 값을 제 1 사용자의 제 2 사용자에 대한 순수 평판도 강도로 계산할 수 있다. 순수 평판도 강도를 랜덤 워크 모델에 적용하여 순수 평판도 점수를 계산하는 과정은 수식 8에 나타나 있다. In operation S760, the pure reputation score calculator calculates a pure reputation score of the first user using the first reputation score and the sociality score of the first user. In this case, the net reputation score of the first user may be calculated by subtracting the first social user score from the first reputation score of the first user. In addition, in case a value obtained by subtracting the social user score of the first user from the initial reputation score of the first user is 0 or less, a predetermined threshold value and a value obtained by subtracting the social user score of the first user from the initial reputation score of the user. A large value can be calculated as the net reputation strength of the first user for the second user. The process of calculating the pure reputation score by applying the pure reputation intensity to the random walk model is shown in Equation 8.

한편, 순수 평판도 점수와 사회성 점수를 이용하여 최종 평판도 점수를 산출할 수도 있다. 최종 평판도 점수는 수식 9 에 의해 계산된다.Meanwhile, the final reputation score may be calculated using the pure reputation score and the sociality score. The final reputation score is calculated by Equation 9.

[수식 9][Equation 9]

이때, a 는 01 사이의 상수로 정해진다. At this time, a is set to a constant between 01.

도 8 은 본 발명에 따라 네트워크 사용자의 평판도 산출 장치의 개략적인 구성을 도시한 블록도이다. 수신부(800)는 복수의 온라인 네트워크 사용자가 작성한 복수의 온라인 네트워크 포스트에 대한 피드백 정보를 수신한다. 제 1 정보 획득부(810)은 사용자 중 제 1 사용자에게 사용자 중 제 1 사용자가 아닌 사용자 각각이 피드백을 제공할 확률을 계산한다. 최초 평판도 점수 산출부(820)는 제 1 정보 획득부(810)에서 획득된 제 1 정보를 이용하여 제 1 사용자의 최초 평판도 점수를 산출한다. 제 2 정보 획득부(830)는 제 1 사용자와 사용자 각각이 피드백을 교환할 확률인 제 2 정보를 획득한다. 사회성 점수 산출부(840)는 제 2 정보 획득부(830)에서 획득된 제 2 정보를 이용하여 제 1 사용자의 사회성 점수를 산출한다. 한편, 순수 평판도 점수 산출부(850)는 최초 평판도 점수 산출부(820)에서 산출된 최초 평판도 점수 및 사회성 점수 산출부(840)에서 산출된 사회성 점수를 이용하여 제 1 사용자의 순수 평판도 점수를 산출한다. 8 is a block diagram illustrating a schematic configuration of an apparatus for calculating a reputation of a network user according to the present invention. The receiver 800 receives feedback information about a plurality of online network posts written by a plurality of online network users. The first information acquisition unit 810 calculates a probability that each of the users other than the first user among the users provides feedback to the first user. The first reputation score calculator 820 calculates the first reputation score of the first user using the first information acquired by the first information acquirer 810. The second information acquisition unit 830 obtains second information that is a probability that the first user and each of the users exchange feedback. The social score calculator 840 calculates the social score of the first user using the second information acquired by the second information acquirer 830. Meanwhile, the pure reputation score calculator 850 calculates the pure reputation score of the first user using the initial reputation score calculated by the initial reputation score calculator 820 and the social score calculated by the social score calculator 840. do.

한편, 순수 평판도 점수 산출부(850)는 제 1 사용자의 최초 평판도 점수에서 제 1 사용자의 사회성 점수를 감산하는 감산부(860)를 더 포함할 수 있다. 또한, 평판도 산출 장치는 최종 평판도 점수 산출부(870)를 포함할 수 있으며, 최종 평판도 점수 산출부(870)는 사회성 점수 산출부(840)에서 산출된 사회성 점수와 순수 평판도 점수 산출부(850)에서 산출된 순수 평판도 점수를 이용하여 제 1 사용자의 최종 평판도 점수를 산출한다.Meanwhile, the pure reputation score calculator 850 may further include a subtraction unit 860 that subtracts the social score of the first user from the initial reputation score of the first user. Also, the reputation calculator may include a final reputation score calculator 870, and the final reputation score calculator 870 may be a social score and a pure reputation score calculator 850 calculated by the social score score 840. The final reputation score of the first user is calculated using the pure reputation score calculated in.

이하에서는 본 발명의 네트워크 사용자의 평판도 산출 방법 및 장치를 이용하여 사용자의 평판도를 산출하는 실험 결과에 대하여 설명하도록 한다.Hereinafter, an experimental result of calculating a user's reputation using the network user's reputation method and apparatus will be described.

실험 결과Experiment result

실험을 위하여 2 개의 데이터 세트를 수집하였다. 하나는 SLRCLUB(http://slrclub.com)으로서 Flickr와 같은 온라인 사진 공유 사이트이다. 144,514 명의 사용자에 의한 147,387 개의 포스트 및 1,883,239개의 피드백이 수집되었다. 또 다른 데이터 세트는 NAVER 블로그(http://blog.naver.com)으로서 1,107,208 명의 사용자에 의한 4,382,386 개의 블로그 포스트 및 7,642,440 개의 피드백이 수집되었다. 이 자료들은 대부분 2008년에 생성된 것이다. 표 1은 상기 수집된 데이터의 통계를 나타낸다.Two data sets were collected for the experiment. One is SLRCLUB ( http://slrclub.com ), an online photo sharing site like Flickr. 147,387 posts and 1,883,239 feedback by 144,514 users were collected. Another data set is the NAVER blog ( http://blog.naver.com ), which collected 4,382,386 blog posts and 7,642,440 feedback by 1,107,208 users. Most of these data were generated in 2008. Table 1 shows the statistics of the collected data.

[표 1][Table 1]

이하에서 최초 평판도 점수는 RR 점수로, 사회성 점수는 SR 점수로, 순수 평판도 점수는 PR 점수로 나타내기로 한다. 한편, 최초 평판도 랭크는 RR 랭크로, 사회성 랭크는 SR 랭크로, 순수 평판도 랭크는 PR 랭크로 나타내기로 한다. Hereinafter, an initial reputation score will be expressed as an RR score, a sociality score as an SR score, and a pure reputation score as a PR score. On the other hand, the first reputation rank is represented by RR rank, the social rank is represented by SR rank, and the pure reputation rank by PR rank.

도 9 는 본 발명에 따라 SLRCLUB 사용자 점수 분포를 나타낸 그래프이다. 도 9(a)는 RR 랭크에 따른 RR 점수를 나타낸 그래프이고, 도 9(b)는 PR랭크에 따른 PR점수를 나타낸 그래프이고, 도 9(c)는 SR랭크에 따른 SR점수를 나타낸 그래프이다. SLRCLUB의 144,514 명의 사용자 중 40.000 명의 사용자만이 0 이 아닌 사용자 점수를 갖고 있음을 알 수 있다. 한편, 5,000명의 사용자만이 0 이 아닌 SR점수를 갖고 있음을 알 수 있다. 이를 통해서 매우 작은 수의 사람들만이 사회적 상호 작용을 하며, PR점수는 SR점수를 감산하여 산출되기 때문에 일반적으로 PR 점수가 RR점수보다 작은 값을 가짐을 알 수 있다. SR 점수는 PR 점수 또는 RR 점수보다 매우 작은 값을 가진다. 9 is a graph showing a SLRCLUB user score distribution according to the present invention. Figure 9 (a) is a graph showing the RR score according to the RR rank, Figure 9 (b) is a graph showing the PR score according to the PR rank, Figure 9 (c) is a graph showing the SR score according to the SR rank. . Of the 144,514 users of SLRCLUB, only 40.000 users have a non-zero user score. On the other hand, it can be seen that only 5,000 users have an SR score other than zero. As a result, only a very small number of people interact with each other and the PR score is calculated by subtracting the SR score. Therefore, the PR score is generally smaller than the RR score. The SR score has a value much smaller than the PR score or the RR score.

도 10 은 본 발명에 따라 SLRCLUB 사용자의 랭크간 관계를 나타낸 그래프이다. 도 10(a)는 SR 랭크를 x 축으로 하고 PR 랭크를 y축으로 하여 사용자들을 나타낸 그래프이고, 도 10(b)는 SR 랭크를 x 축으로 하고 RR 랭크를 y 축으로 하여 사용자들을 나타낸 그래프이다. 도 10(a) 및 10(b)는 페이지랭크 알고리즘을 사용하여 산출된 것이다. 도 10(a)와 10(b)를 비교해 보면 10(b)가 x=y 라인 부근에 더 적은 데이터를 갖고 있음을 알 수 있다. 즉, PR 랭크가 RR 랭크에 비하여 SR 랭크에 의존하고 있지 않음을 나타낸다. 한편, 도 10(c) 및 10(d)는 HITS 알고리즘을 사용하여 각각 SR랭크-RR랭크의 관계 및 SR랭크-PR랭크의 관계를 나타낸 그래프로서, 10(a) 및 10(b)와 마찬가지로 PR 랭크가 RR 랭크에 비하여 SR 랭크에 의존하고 있지 않음을 알 수 있다. 10 is a graph illustrating a relationship between ranks of SLRCLUB users according to the present invention. Fig. 10 (a) is a graph showing users with SR rank as x axis and PR rank as y axis, and Fig. 10 (b) is a graph showing users with SR rank as x axis and RR rank as y axis. to be. 10 (a) and 10 (b) are calculated using the page rank algorithm. Comparing Figs. 10 (a) and 10 (b), it can be seen that 10 (b) has less data in the vicinity of the x = y line. That is, it shows that PR rank does not depend on SR rank compared with RR rank. 10 (c) and 10 (d) are graphs showing the relationship between SR rank-RR rank and SR rank-PR rank, respectively, using the HITS algorithm, similarly to 10 (a) and 10 (b). It can be seen that the PR rank does not depend on the SR rank compared to the RR rank.

도 11 은 본 발명에 따라 SLRCLUB 사용자 중 높은 랭크를 갖는 사용자를 나타낸 그래프이다. 사용자 1 은 RR 랭킹이 30 등이지만 낮은 SR 랭킹(274등)을 갖고 있다. 그러므로 사용자 1 의 사회성을 고려하여 PR 랭킹을 산출하면 1 등이 됨을 볼 수 있다. 한편, 사용자 2 는 RR 랭킹이 2 등이나 PR 랭킹은 12 등 밖에 되지 않는다. 이는 사용자 2 는 SR 랭킹이 2 등으로 활발한 사회성을 갖고 있기 때문에 사회성을 제외하여 평판도만 고려하는 PR 랭킹은 낮아지게 된다. 또한, 사용자 3 은 RR 랭킹이 1 등이고 SR 랭킹도 1 등이나 사회성을 고려한 PR 랭킹에서도 4등을 차지하여 PR 랭킹이 RR 랭킹에 비하여 많이 떨어지지 않았음을 알 수 있다. 11 is a graph illustrating a user having a high rank among SLRCLUB users according to the present invention. User 1 has a RR ranking of 30, but has a low SR ranking (274, etc.). Therefore, if the PR ranking is calculated in consideration of the sociality of user 1, it can be seen that it is 1, and so on. On the other hand, the user 2 has only RR ranking of 2 and PR ranking of 12 and the like. Since user 2 has an active sociality such as SR ranking of 2, PR ranking that considers reputation only except sociality becomes low. In addition, the user 3 can be found that the RR ranking is 1, the SR ranking is 1, and the PR ranking in consideration of sociality is 4th, and the PR ranking is not much lower than the RR ranking.

도 12 는 본 발명에 따라 SLRCLUB 사용자의 랭크 상관 관계를 나타낸 그래프이다. SR 랭크와 RR 랭크의 상관관계는 평균적으로 85% 정도의 상관성을 갖고 있음을 알 수 있다. 반면 SR 랭크와 PR 랭크의 상관 관계는 상대적으로 낮은 분포를 보이고 있다. 특히, 상위 30 명의 사용자에게서는 16% 의 상관성이 있으며, 전체적으로 보았을 때도 53% 의 상관성을 가지고 있어 SR 랭크와 RR 랭크간의 상관관계보다 낮은 값을 가지고 있음을 알 수 있다. 이는 사용자의 순수 평판도 점수는 사용자의 최초 평판도 점수에서 사회성 점수가 성공적으로 분리된 값임을 나타낸다.12 is a graph showing rank correlation of SLRCLUB users according to the present invention. It can be seen that the correlation between the SR rank and the RR rank has an average of about 85% correlation. On the other hand, the correlation between the SR rank and the PR rank shows a relatively low distribution. In particular, the top 30 users have a correlation of 16%, and as a whole, there is a 53% correlation, indicating that the value is lower than the correlation between the SR rank and the RR rank. This indicates that the pure reputation score of the user is a value in which the social score is successfully separated from the initial reputation score of the user.

도 13 은 본 발명에 따라 NAVER 사용자의 랭크간 관계를 나타낸 그래프이다. 도 13(a) 및 도 13(b)는 페이지랭크 알고리즘을 사용하여 각각 SR랭크-RR랭크의 관계 및 SR랭크-PR랭크의 관계를 나타낸 그래프이다. 한편, 도 13(c) 및 13(d)는 HITS 알고리즘을 사용하여 각각 SR랭크-RR랭크의 관계 및 SR랭크-PR랭크의 관계를 나타낸 그래프이다. 도 13 에서도 PR 랭크가 RR 랭크에 비하여 SR 랭크에 의존하고 있지 않음을 알 수 있다. 13 is a graph illustrating a relationship between ranks of NAVER users according to the present invention. 13 (a) and 13 (b) are graphs showing the relationship between SR rank and RR rank and the SR rank and PR rank, respectively, using a page rank algorithm. 13 (c) and 13 (d) are graphs showing the relationship between SR rank and RR rank and the SR rank and PR rank, respectively, using the HITS algorithm. It can be also seen from FIG. 13 that the PR rank does not depend on the SR rank as compared to the RR rank.

도 14 는 본 발명에 따라 NAVER 사용자의 랭크 상관 관계를 나타낸 그래프이다. 그러나, 도 12 에서와는 달리 SR 랭크와 RR 랭크간의 상관 관계가 낮음을 알 수 있다. 특히, 이러한 현상은 상위 랭커들 사이에서 더 두드러지는 것을 볼 수 있다. 이러한 차이는 네트워크의 특성에서 비롯되는 것으로 SLRCLUB과 같은 사진 공유 네트워크와는 달리 NAVER 블로그 공간에서는 주제가 다양하고, 사용자들은 적은 주제에만 관심을 보이며 활동하는 경향이 있다. 그러므로 사용자의 SR 전수가 일반적으로 낮으며, SR 랭크와 RR 랭크의 상관 관계 또한 낮게 된다. 이러한 사용자들간의 경향을 도 15 에서 클러스터 분포로 나타내었다.14 is a graph showing rank correlation of NAVER users according to the present invention. However, unlike in FIG. 12, it can be seen that the correlation between the SR rank and the RR rank is low. In particular, this phenomenon can be seen more prominent among the top rankers. This difference is due to the nature of the network. Unlike photo-sharing networks such as SLRCLUB, the NAVER blog space has a variety of topics, and users tend to be interested in only a few topics. Therefore, the total number of users' SRs is generally low, and the correlation between the SR rank and the RR rank is also low. The trend among these users is shown in the cluster distribution in FIG. 15.

도 15 는 본 발명에 따라 SLRCLUB과 NAVER 사용자의 클러스터(cluster) 분포를 나타낸 그래프이다. SLRCLUB의 사용자들은 중점을 중심으로 집중되는 경향이 있으나 NAVER 블로그의 사용자들은 NAVER 블로그 주제가 매우 다양하고 사용자 개개인의 관심 분야가 좁음으로 인하여 사용자들이 흩어지는 상태를 보이게 된다. 이러한 경향을 더 관측하기 위하여 NAVER 블로그 내에서도 2 개의 카테고리로 사용자들을 분류하고 사용자들의 특성을 관측하였다. 2 개의 카테고리는 레스토랑 카테고리와 여행 카테고리로 레스토랑 카테고리에서는 약 110 명의 사용자가 추출되었으며 여행 카테고리에서는 약 200 명의 사용자가 추출되었다.15 is a graph illustrating cluster distribution of SLRCLUB and NAVER users according to the present invention. The users of SLRCLUB tend to be concentrated around the emphasis, but the users of NAVER blogs are scattered due to the wide variety of NAVER blog topics and narrow personal interests. To further observe this trend, we classified users into two categories and observed their characteristics within the NAVER blog. The two categories are the restaurant category and the travel category, and about 110 users were extracted from the restaurant category and about 200 users were extracted from the travel category.

도 16 은 본 발명에 따라 NAVER 레스토랑 카테고리 및 여행 카테고리 내의 사용자의 랭크 상관 관계를 나타낸 그래프이다. 도 16(a) 및 도 16(b)는 페이지랭크 알고리즘을 사용하여 각각 SR랭크-RR랭크의 관계 및 SR랭크-PR랭크의 관계를 나타낸 그래프이다. 한편, 도 16(c) 및 16(d)는 HITS 알고리즘을 사용하여 각각 SR랭크-RR랭크의 관계 및 SR랭크-PR랭크의 관계를 나타낸 그래프이다. 도 16 에서도 PR 랭크가 RR 랭크에 비하여 SR 랭크에 의존하고 있지 않음을 알 수 있다. 16 is a graph showing rank correlation of users in a NAVER restaurant category and a travel category according to the present invention. 16 (a) and 16 (b) are graphs showing the relationship between SR rank and RR rank and the SR rank and PR rank, respectively, using a page rank algorithm. 16 (c) and 16 (d) are graphs showing the relationship between SR rank and RR rank and the SR rank and PR rank, respectively, using the HITS algorithm. It can also be seen from FIG. 16 that the PR rank does not depend on the SR rank as compared to the RR rank.

아래 표 2 는 NAVER 레스토랑 카테고리와 여행 카테고리의 SR랭크와 RR랭크간의 상관관계, SR랭크와 PR랭크간의 상관 관계, RR랭크와 PR랭크간의 상관관계를 나타낸 표이다.Table 2 below shows the correlation between the SR rank and the RR rank of the NAVER restaurant category and the travel category, the correlation between the SR rank and the PR rank, and the correlation between the RR rank and the PR rank.

[표 2][Table 2]

레스토랑 카테고리에서의 SR-RR 상관관계와 SR-PR 상관관계 사이의 차가 여행 카테고리에서의 차보다 크기 때문에 레스토랑 카테고리에서의 RR-PR 상관관계는 여행 카테고리에서의 RR-PR 상관관계보다 낮은 값을 갖게 된다.Since the difference between the SR-RR correlation and the SR-PR correlation in the restaurant category is larger than the difference in the travel category, the RR-PR correlation in the restaurant category has a lower value than the RR-PR correlation in the travel category. do.

한편, NDCG(Normalized Discounted Cumulative Gain)는 검색 결과의 랭크 정확도를 측정하는 평가 계량법으로서, SLRCLUB과 NAVER의 두 카테고리를 포함한 3 개의 데이터 세트에서 각각 상위 20 명의 사용자를 추출한 뒤, 10 명의 테스터를 대상으로 각각의 사용자에게 점수를 부여하였다.On the other hand, Normalized Discounted Cumulative Gain (NDCG) is an evaluation metric that measures the rank accuracy of search results.It extracts the top 20 users from three data sets, including two categories, SLRCLUB and NAVER, and then targets 10 testers. Each user was given a score.

도 17 은 본 발명에 따라 SLRCLUB, NAVER 여행 카테고리, NAVER 레스토랑 카테고리 내의 사용자의 랭크에 대한 NDCG(Normalized Discounted Cumulative Gain)을 나타낸 그래프이다. 각각의 그래프에서 PR이 다른 랭킹 알고리즘보다 높은 NDCG를 가짐을 확인할 수 있다. FIG. 17 is a graph illustrating a normalized discounted cumulative gain (NDCG) for a user's rank in SLRCLUB, NAVER travel category, and NAVER restaurant category according to the present invention. In each graph, it can be seen that PR has higher NDCG than other ranking algorithms.

앞서 설명한 실험 결과로 미루어 보아 온라인 네트워크 사용자들의 평판도 점수는 상당 부분이 사용자들간의 사회성에 기인함을 확인할 수 있었다. 또한, 정확한 사용자의 점수를 얻기 위해서는 사회성을 분리하는 것이 효과적인 방법임을 알 수 있었다.As a result of the above experiment, it can be seen that the reputation score of online network users is largely due to sociality among users. In addition, it was found that separating sociality is an effective way to obtain accurate user scores.

본 실시형태의 모듈, 기능 블록들 또는 수단들은 전자 회로, 집적 회로, ASIC (Application Specific Integrated Circuit) 등 공지된 다양한 소자들로 구현될 수 있으며, 각각 별개로 구현되거나 2 이상이 하나로 통합되어 구현될 수 있다.Modules, functional blocks or means of the present embodiment may be implemented in a variety of known elements, such as electronic circuits, integrated circuits, ASICs (Application Specific Integrated Circuit), each may be implemented separately, or two or more may be integrated into one Can be.

이상과 같이 본 발명의 이해를 위하여 그 실시예를 기술하였으나, 당업자라면 알 수 있듯이, 본 발명은 본 명세서에서 기술된 특정 실시예에 한정되는 것이 아니라, 본 발명의 범주를 벗어나지 않는 범위 내에서 다양하게 변형, 변경 및 대체될 수 있다. 따라서, 본 발명의 진정한 사상 및 범주에 속하는 모든 변형 및 변경을 특허청구범위에 의하여 모두 포괄하고자 한다.Although the embodiments have been described for the understanding of the present invention as described above, it will be understood by those skilled in the art, the present invention is not limited to the specific embodiments described herein, but variously without departing from the scope of the present invention. May be modified, changed and replaced. Therefore, it is intended that the present invention cover all modifications and variations that fall within the true spirit and scope of the present invention.

800 : 수신부 810 : 제 1 정보 획득부
820 : 최초 평판도 점수 산출부 830 : 제 2 정보 획득부
840 : 사회성 점수 산출부 850 : 순수 평판도 점수 산출부
860 : 감산부 870 : 최종 평판도 점수 산출부800: receiving unit 810: first information obtaining unit
820: First reputation score calculator 830: Second information acquisition unit
840: social grade score calculator 850: pure reputation score calculator
860: subtraction unit 870: final reputation score calculation unit

Claims

As a reputation calculation method of an online network user implemented in a reputation calculation device,
The flatness calculation device,
Receiving feedback information on an online network post written by a plurality of online network users at a receiving unit;
Acquiring, by a first information acquisition unit, first information that is a probability that each of the users provides feedback to a first user of the users;
Calculating an initial reputation score of the first user by using the first information in an initial reputation score calculator;
Acquiring, by a second information obtaining unit, second information which is a probability that each of the first user and the user exchanges feedback;
Calculating a sociability score of the first user by using the second information in a social score calculator; And
Calculating a pure reputation score of the first user by using the first reputation score and the sociality score of the first user in the pure reputation score calculator;
How to calculate the reputation of online network users.

A receiver configured to receive feedback information on an online network post written by a plurality of online network users;
A first information obtaining unit obtaining first information that is a probability that each of the users provides feedback to a first user of the users; And
And a reputation score calculator configured to calculate a reputation score of the first user using the first information.
Reputation calculator for online network users.

A receiver configured to receive feedback information on an online network post written by a plurality of online network users;
A first information obtaining unit obtaining first information that is a probability that each of the users provides feedback to a first user of the users;
An initial reputation score score calculator for calculating an initial reputation score of the first user using the first information;
A second information obtaining unit obtaining second information that is a probability that each of the first user and the user exchanges feedback;
A social score calculator that calculates a social score of the first user using the second information; And
It includes a pure reputation score calculation unit for calculating the pure reputation score of the first user using the first reputation score and sociality score of the first user
Reputation calculator for online network users.