KR20200103148A

KR20200103148A - Method and apparatus for recommending items based on multi-type pair-wise preference

Info

Publication number: KR20200103148A
Application number: KR1020190015746A
Authority: KR
Inventors: 김상욱; 이연창; 김태호
Original assignee: 한양대학교 산학협력단
Priority date: 2019-02-11
Filing date: 2019-02-11
Publication date: 2020-09-02
Also published as: KR102266853B1

Abstract

Disclosed is a method for recommending items based on multi-type pair-wise preference, capable or recommending items to users with high precision. A method of recommending an item using an evaluation result, which is a result of evaluating at least one item among a plurality of items by a plurality of users, according to an embodiment of the present invention includes the steps of: generating a comprehensive result by synthesizing a prediction result, which is a result of predicting an evaluation result of a plurality of unevaluated items included in the evaluation result, and the evaluation result using a collaborative filtering technique; classifying the plurality of unevaluated items into a plurality of non-information items and a plurality of indifferent items from the synthesis result; learning a model parameter based on the synthesis result and a plurality of item pairs selected from among item pairs consisting of two of an evaluation item, an unevaluated item, a no-information item, and an indifferent item included in the synthesis result; and recommending a predetermined number of items to at least one recommendation target among the plurality of users by using a final prediction result, which is a result of predicting evaluation results for the plurality of unevaluated items based on the learned model parameters.

Description

Method and device for recommending items based on multi-type pairwise {METHOD AND APPARATUS FOR RECOMMENDING ITEMS BASED ON MULTI-TYPE PAIR-WISE PREFERENCE}

본 발명은 평가 대상이 되는 아이템을 평가된 아이템, 미평가된 아이템, 무정보 아이템, 무관심 아이템으로 구분하고, 그에 기반하는 다중 유형의 아이템 페어를 이용하여 사용자에게 아이템을 추천하는 방법 및 장치에 관한 것이다.The present invention relates to a method and apparatus for dividing an item to be evaluated into an evaluated item, an unevaluated item, an uninformed item, and an indifference item, and recommending an item to a user using a multi-type item pair based thereon. will be.

협업 필터링(collaborative filtering, CF) 기법은 레이팅 등의 명시적인 평가 결과와 클릭 등의 암시적인 평가 결과와 같은 과거의 사용자의 행동을 이용하여 사용자에게 상위 N개의 아이템을 추천하는 기술로서, 다수의 추천 시스템에 이미 적용된 기술이다.Collaborative filtering (CF) is a technology that recommends the top N items to users by using past user actions such as explicit evaluation results such as ratings and implicit evaluation results such as clicks. This technology has already been applied to the system.

최근, 단일 클래스 설정에 관한 데이터셋을 처리하려는 수요가 증가함에 따라, 단일 클래스 기반 협업 필터링(one-class collaborative filtering, OCCF) 기법이 제안되었다. OCCF 기법은 사용자의 선호도(즉, 평가 결과)를 학습하는 방법에 따라 2가지로 구분될 수 있다.In recent years, as the demand to process datasets related to single class setting increases, a one-class collaborative filtering (OCCF) technique has been proposed. OCCF techniques can be divided into two types according to the method of learning user preferences (ie, evaluation results).

첫번째로, 개별 아이템(point-wise) 단위로 개별 사용자의 실제 평가 결과와 예측 평가 결과의 오차를 최소화하는 방법이다. 두번째로, 아이템 페어 (pair-wise) 단위로 개별 사용자의 아이템 페어에서 평가된 아이템의 평가 결과와 예측된 아이템의 평가 결과의 차이가 극대화되도록 하는 방법이다.First, it is a method of minimizing the error between the actual evaluation result and the predicted evaluation result of an individual user on an individual item (point-wise) basis. Second, the difference between the evaluation result of the item evaluated in the item pair of the individual user and the evaluation result of the predicted item is maximized in an item pair (pair-wise) unit.

여기서, 두번째 방법은 단일 유형 페어와이즈 기반 접근법(single-type pair-wise approach)로 불리며, 첫번째 방법보다 아이템의 평가 결과 예측에 있어서 우수한 성능을 나타낸 것으로 평가된다.Here, the second method is called a single-type pair-wise approach, and is evaluated to exhibit superior performance in predicting an evaluation result of an item than the first method.

그러나, 단일 유형 페어와이즈 기반 접근법은 사용자가 모든 미평가된 아이템에 관하여 동일한 정도의 부정적인 선호도를 가지고 있다는 가정(P1)과 사용자가 평가된 아이템을 모든 미평가된 아이템보다 더 선호한다는 가정(P2)에 기반하고 있다. 하지만, 이와 같은 가정들이 언제나 현실과 부합하는 것은 아니기 때문에, 아이템 추천에 있어서 오류를 가져오게 될 가능성이 있는 것이 사실이다.However, the single type pairwise-based approach assumes that the user has the same degree of negative preference for all unrated items (P1) and that the user prefers the rated items over all unrated items (P2). Is based on. However, since these assumptions do not always correspond to reality, it is true that there is a possibility of introducing errors in item recommendation.

따라서, 이러한 문제점을 해결하기 위한 다중 유형 페어와이즈 기반의 아이템 추천 방법 및 장치에 대한 필요성이 대두되고 있다.Accordingly, there is a need for a method and apparatus for recommending items based on a multi-type pairwise to solve this problem.

R. Steffen, F. Christoph, G. Zeno, and S.-T. Lars. 2009. BPR: Bayesian Personalized Ranking from Implicit Feedback. In Proc. of the 25th Int'l Conf. on Uncertainty in Artificial Intelligence (UAI). 452-461.R. Steffen, F. Christoph, G. Zeno, and S.-T. Lars. 2009. BPR: Bayesian Personalized Ranking from Implicit Feedback. In Proc. of the 25th Int'l Conf. on Uncertainty in Artificial Intelligence (UAI). 452-461.

본 발명은 아이템을 평가된 아이템, 미평가된 아이템, 무정보 아이템, 무관심 아이템으로 구분하고, 다중 유형의 아이템 페어에 기반하여 학습된 모델 파라미터를 이용함으로써, 사용자에게 보다 높은 정확도로 아이템을 추천하는 방법 및 장치를 제공하고자 한다.The present invention divides items into evaluated items, unevaluated items, no-information items, and indifferent items, and recommends items with higher accuracy to users by using model parameters learned based on multiple types of item pairs. It is intended to provide a method and apparatus.

본 발명이 해결하고자 하는 과제는 이상에서 언급한 과제(들)로 제한되지 않으며, 언급되지 않은 또 다른 과제(들)은 아래의 기재로부터 당업자에게 명확하게 이해될 수 있을 것이다.The problem to be solved by the present invention is not limited to the problem(s) mentioned above, and another problem(s) not mentioned will be clearly understood by those skilled in the art from the following description.

상기 목적을 달성하기 위한 본 발명의 일 실시예에 따른 다중 유형 페어와이즈 기반 아이템 추천 방법은 복수의 사용자가 복수의 아이템 중 적어도 하나의 아이템을 평가한 결과인 평가결과를 이용하여 아이템을 추천하는 방법에 있어서, 협업 필터링 기법을 이용하여 상기 평가결과에 포함된 복수의 미평가아이템의 평가 결과를 예측한 결과인 예측결과와 상기 평가결과를 종합하여 종합결과를 생성하는 단계; 상기 종합결과에서 상기 복수의 미평가아이템을 복수의 무정보아이템과 복수의 무관심아이템으로 분류하는 단계; 상기 종합결과 및 상기 종합결과에 포함된 평가아이템, 미평가아이템, 무정보아이템 및 무관심아이템 중 2개로 구성되는 아이템페어 중에서 선별된 복수의 선별아이템페어에 기초하여, 모델 파라미터를 학습시키는 단계; 및 상기 학습된 모델 파라미터에 기초하여 상기 복수의 미평가아이템에 대한 평가 결과를 예측한 결과인 최종예측결과를 이용하여, 상기 복수의 사용자 중 적어도 하나의 추천대상자에게 소정 개수의 아이템을 추천하는 단계를 포함한다.In order to achieve the above object, the multi-type pairwise-based item recommendation method according to an embodiment of the present invention is a method of recommending an item using an evaluation result that is a result of a plurality of users evaluating at least one item among a plurality of items. A method comprising: generating a comprehensive result by synthesizing a prediction result, which is a result of predicting an evaluation result of a plurality of unevaluated items included in the evaluation result, using a collaborative filtering technique and the evaluation result; Classifying the plurality of unevaluated items into a plurality of non-information items and a plurality of indifferent items from the synthesis result; Learning a model parameter based on the synthesis result and a plurality of selection item pairs selected from among item pairs consisting of two of an evaluation item, an unevaluated item, an uninformed item and an item of no interest included in the synthesis result; And recommending a predetermined number of items to at least one recommendation target among the plurality of users using a final prediction result that is a result of predicting evaluation results for the plurality of unevaluated items based on the learned model parameters. Includes.

바람직하게는, 상기 모델 파라미터를 학습시키는 단계는 상기 복수의 선별아이템페어 각각에 포함된 아이템 종류 간의 평가 결과 차이에 기반하는 목적함수를 생성하는 단계; 상기 복수의 사용자 중 선택된 하나의 선택사용자에 대하여, 상기 종합결과로부터 상기 복수의 선별아이템페어에 포함된 아이템 종류별 평가 결과를 포함하는 복수의 평가결과정보를 획득하는 단계; 상기 목적함수 및 상기 복수의 평가결과정보를 이용하여, 상기 목적함수의 값이 극대화되도록 상기 모델 파라미터값을 갱신하는 단계; 및 소정의 완료 조건을 만족할 때까지, 상기 복수의 평가결과정보를 획득하는 단계 및 상기 모델 파라미터값을 갱신하는 단계를 반복하는 단계를 포함할 수 있다.Preferably, the learning of the model parameters comprises: generating an objective function based on a difference in evaluation results between item types included in each of the plurality of selected item pairs; Acquiring a plurality of evaluation result information including evaluation results for each item type included in the plurality of selected item pairs from the synthesis result for one selected user selected from among the plurality of users; Updating the model parameter value to maximize the value of the objective function using the objective function and the plurality of evaluation result information; And repeating the steps of obtaining the plurality of evaluation result information and updating the model parameter values until a predetermined completion condition is satisfied.

바람직하게는, 상기 상기 모델 파라미터값을 갱신하는 단계는 상기 복수의 평가결과정보를 이용하여, 상기 목적함수의 그레디언트값을 산출하는 단계; 및 SGD(stochastic gradient descent) 기법에 기반하여 상기 그레디언트값을 적용함으로써, 상기 모델 파라미터값을 갱신하는 단계를 포함할 수 있다.Preferably, the updating of the model parameter values comprises: calculating a gradient value of the objective function using the plurality of evaluation result information; And updating the model parameter value by applying the gradient value based on a stochastic gradient descent (SGD) technique.

바람직하게는, 상기 복수의 평가결과정보는 상기 복수의 선별아이템페어에 포함된 아이템 종류별로 하나의 평가 결과를 포함할 수 있다.Preferably, the plurality of evaluation result information may include one evaluation result for each item type included in the plurality of selection item pairs.

바람직하게는, 상기 복수의 평가결과정보는 상기 복수의 선별아이템페어에 포함된 아이템 종류별로 복수의 평가 결과인 그룹평가결과를 포함하고, 상기 목적함수는 상기 그룹평가결과의 차이에 기반할 수 있다.Preferably, the plurality of evaluation result information includes a group evaluation result that is a plurality of evaluation results for each item type included in the plurality of selected item pairs, and the objective function may be based on a difference between the group evaluation results. .

바람직하게는, 상기 목적함수는 상기 평가아이템의 평가 결과가 상기 미평가아이템, 상기 무정보아이템 및 상기 무관심아이템의 평가 결과보다 높고, 상기 무정보아이템의 평가 결과가 상기 무관심아이템의 평가 결과보다 높다는 가정에 기반할 수 있다.Preferably, the objective function is that the evaluation result of the evaluation item is higher than the evaluation result of the unevaluated item, the no-information item and the indifferent item, and the evaluation result of the no-information item is higher than the evaluation result of the indifferent item. It can be based on assumptions.

바람직하게는, 상기 복수의 선별아이템페어는 상기 평가아이템과 상기 미평가아이템으로 구성되는 아이템페어와 상기 무정보아이템과 상기 무관심아이템으로 구성되는 아이템페어를 포함하거나, 상기 평가아이템과 상기 무정보아이템으로 구성되는 아이템페어, 상기 평가아이템과 상기 무관심아이템으로 구성되는 아이템페어 및 상기 무정보아이템과 상기 무관심아이템으로 구성되는 아이템페어를 포함할 수 있다.Preferably, the plurality of selection item pairs include an item pair consisting of the evaluation item and the unevaluated item and an item pair consisting of the no-information item and the indifferent item, or the evaluation item and the no-information item It may include an item pair composed of, an item pair composed of the evaluation item and the indifferent item, and an item pair composed of the no-information item and the indifferent item.

또한, 상기 목적을 달성하기 위한 본 발명의 일 실시예에 따른 다중 유형 페어와이즈 기반 아이템 추천 장치는 복수의 사용자가 복수의 아이템 중 적어도 하나의 아이템을 평가한 결과인 평가결과를 이용하여 아이템을 추천하는 장치에 있어서, 협업 필터링 기법을 이용하여, 상기 평가결과에 포함된 복수의 미평가아이템의 평가 결과를 예측한 결과인 예측결과와 상기 평가결과를 종합하여 종합결과를 생성하는 생성부; 상기 종합결과에서 상기 복수의 미평가아이템을 복수의 무정보아이템과 복수의 무관심아이템으로 분류하는 분류부; 상기 종합결과 및 상기 종합결과에 포함된 평가아이템, 미평가아이템, 무정보아이템 및 무관심아이템 중 2개로 구성되는 아이템페어 중에서 선별된 복수의 선별아이템페어에 기초하여, 모델 파라미터를 학습시키는 학습부; 및 상기 학습된 모델 파라미터에 기초하여 상기 복수의 미평가아이템에 대한 평가 결과를 예측한 결과인 최종예측결과를 이용하여, 상기 복수의 사용자 중 적어도 하나의 추천대상자에게 소정 개수의 아이템을 추천하는 추천부를 포함한다.In addition, the multi-type pairwise-based item recommendation apparatus according to an embodiment of the present invention for achieving the above object recommends an item using an evaluation result that is a result of a plurality of users evaluating at least one item among a plurality of items. What is claimed is: 1. An apparatus comprising: a generator configured to generate a comprehensive result by synthesizing a prediction result, which is a result of predicting an evaluation result of a plurality of unevaluated items included in the evaluation result, and the evaluation result using a collaborative filtering technique; A classification unit for classifying the plurality of unevaluated items into a plurality of non-information items and a plurality of non-interested items in the synthesis result; A learning unit that learns model parameters based on the synthesis result and a plurality of selection item pairs selected from among item pairs consisting of two of an evaluation item, an unevaluated item, a no-information item, and an indifferent item included in the synthesis result; And a recommendation for recommending a predetermined number of items to at least one recommendation target among the plurality of users using a final prediction result that is a result of predicting evaluation results for the plurality of unevaluated items based on the learned model parameters. Includes wealth.

바람직하게는, 상기 학습부는 상기 복수의 선별아이템페어 각각에 포함된 아이템 종류 간의 평가 결과 차이에 기반하는 목적함수를 생성하고, 상기 복수의 사용자 중 선택된 하나의 선택사용자에 대하여, 상기 종합결과로부터 상기 복수의 선별아이템페어에 포함된 아이템 종류별 평가 결과를 포함하는 복수의 평가결과정보를 획득하고, 상기 목적함수 및 상기 복수의 평가결과정보를 이용하여, 상기 목적함수의 값이 극대화되도록 상기 모델 파라미터값을 갱신하고, 소정의 완료 조건을 만족할 때까지, 상기 복수의 평가결과정보를 획득하는 과정과 상기 모델 파라미터값을 갱신하는 과정을 반복할 수 있다.Preferably, the learning unit generates an objective function based on an evaluation result difference between item types included in each of the plurality of selected item pairs, and for one selected user selected among the plurality of users, from the synthesis result Obtaining a plurality of evaluation result information including evaluation results for each item type included in a plurality of selected item pairs, and using the objective function and the plurality of evaluation result information, the model parameter value so that the value of the objective function is maximized. The process of acquiring the plurality of evaluation result information and the process of updating the model parameter value may be repeated until a predetermined completion condition is satisfied.

바람직하게는, 상기 학습부가 상기 모델 파라미터값을 갱신할 때, 상기 복수의 평가결과정보를 이용하여, 상기 목적함수의 그레디언트값을 산출하고, SGD(stochastic gradient descent) 기법에 기반하여 상기 그레디언트값을 적용함으로써, 상기 모델 파라미터값을 갱신할 수 있다.Preferably, when the learning unit updates the model parameter value, the gradient value of the objective function is calculated using the plurality of evaluation result information, and the gradient value is calculated based on a stochastic gradient descent (SGD) technique. By applying, the model parameter value can be updated.

본 발명은 평가 대상이 되는 아이템을 평가된 아이템, 미평가된 아이템, 무정보 아이템, 무관심 아이템으로 구분하고, 다중 유형의 아이템 페어에 기반하여 학습된 모델 파라미터를 이용함으로써, 사용자에게 보다 높은 정확도로 아이템을 추천할 수 있는 효과가 있다.The present invention divides the item to be evaluated into an evaluated item, an unevaluated item, an uninformed item, and an indifference item, and uses model parameters learned based on multiple types of item pairs. It has the effect of recommending items.

또한, 본 발명은 다중 유형의 아이템 페어를 이용하여 모델 파라미터를 학습시킬 때, 동일한 종류의 아이템에 관한 평가 결과를 그룹화하여 이용함으로써, 사용자에게 보다 높은 정확도로 아이템을 추천할 수 있는 효과가 있다.In addition, according to the present invention, when a model parameter is trained using multiple types of item pairs, evaluation results for the same type of item are grouped and used, so that an item can be recommended to a user with higher accuracy.

도 1은 (a) 단일 유형 페어와이즈 선호도와 (b) 다중 유형 페어와이즈 선호도를 설명하기 위한 도면이다.
도 2는 본 발명의 일 실시예에 따른 다중 유형 페어와이즈 기반 아이템 추천 방법을 설명하기 위한 흐름도이다.
도 3은 본 발명의 일 실시예에 따른 모델 파라미터의 학습 방법을 설명하기 위한 흐름도이다.
도 4는 본 발명의 일 실시예에 따른 다중 유형 페어와이즈 기반 아이템 추천 장치를 설명하기 위한 블록도이다.
도 5는 본 발명의 일 실시예에 따른 모델 파라미터 학습 알고리즘의 의사코드를 나타내는 도면이다.
도 6 및 7은 본 발명의 일 실시예에 따른 다중 유형 페어와이즈 기반 아이템 추천 방법의 효과를 설명하기 위한 블록도이다.FIG. 1 is a diagram for explaining (a) a single type pairwise preference and (b) a multiple type pairwise preference.
2 is a flowchart illustrating a method of recommending a multi-type pairwise-based item according to an embodiment of the present invention.
3 is a flowchart illustrating a method of learning model parameters according to an embodiment of the present invention.
4 is a block diagram illustrating a multi-type pairwise-based item recommendation apparatus according to an embodiment of the present invention.
5 is a diagram illustrating a pseudocode of a model parameter learning algorithm according to an embodiment of the present invention.
6 and 7 are block diagrams for explaining the effect of a multi-type pairwise-based item recommendation method according to an embodiment of the present invention.

본 발명은 다양한 변경을 가할 수 있고 여러 가지 실시예를 가질 수 있는 바, 특정 실시예들을 도면에 예시하고 상세한 설명에 상세하게 설명하고자 한다. 그러나, 이는 본 발명을 특정한 실시 형태에 대해 한정하려는 것이 아니며, 본 발명의 사상 및 기술 범위에 포함되는 모든 변경, 균등물 내지 대체물을 포함하는 것으로 이해되어야 한다. 각 도면을 설명하면서 유사한 참조부호를 유사한 구성요소에 대해 사용하였다.In the present invention, various modifications may be made and various embodiments may be provided, and specific embodiments will be illustrated in the drawings and described in detail in the detailed description. However, this is not intended to limit the present invention to a specific embodiment, it is to be understood to include all changes, equivalents, and substitutes included in the spirit and scope of the present invention. In describing each drawing, similar reference numerals have been used for similar elements.

제1, 제2, A, B 등의 용어는 다양한 구성요소들을 설명하는데 사용될 수 있지만, 상기 구성요소들은 상기 용어들에 의해 한정되어서는 안 된다. 상기 용어들은 하나의 구성요소를 다른 구성요소로부터 구별하는 목적으로만 사용된다. 예를 들어, 본 발명의 권리 범위를 벗어나지 않으면서 제1 구성요소는 제2 구성요소로 명명될 수 있고, 유사하게 제2 구성요소도 제1 구성요소로 명명될 수 있다. 및/또는 이라는 용어는 복수의 관련된 기재된 항목들의 조합 또는 복수의 관련된 기재된 항목들 중의 어느 항목을 포함한다.Terms such as first, second, A, and B may be used to describe various elements, but the elements should not be limited by the terms. These terms are used only for the purpose of distinguishing one component from another component. For example, without departing from the scope of the present invention, a first element may be referred to as a second element, and similarly, a second element may be referred to as a first element. The term and/or includes a combination of a plurality of related listed items or any of a plurality of related listed items.

어떤 구성요소가 다른 구성요소에 "연결되어" 있다거나 "접속되어" 있다고 언급된 때에는, 그 다른 구성요소에 직접적으로 연결되어 있거나 또는 접속되어 있을 수도 있지만, 중간에 다른 구성요소가 존재할 수도 있다고 이해되어야 할 것이다. 반면에, 어떤 구성요소가 다른 구성요소에 "직접 연결되어" 있다거나 "직접 접속되어" 있다고 언급된 때에는, 중간에 다른 구성요소가 존재하지 않는 것으로 이해되어야 할 것이다.When a component is referred to as being "connected" or "connected" to another component, it is understood that it may be directly connected or connected to the other component, but other components may exist in the middle. Should be. On the other hand, when a component is referred to as being "directly connected" or "directly connected" to another component, it should be understood that there is no other component in the middle.

본 출원에서 사용한 용어는 단지 특정한 실시예를 설명하기 위해 사용된 것으로, 본 발명을 한정하려는 의도가 아니다. 단수의 표현은 문맥상 명백하게 다르게 뜻하지 않는 한, 복수의 표현을 포함한다. 본 출원에서, "포함하다" 또는 "가지다" 등의 용어는 명세서상에 기재된 특징, 숫자, 단계, 동작, 구성요소, 부품 또는 이들을 조합한 것이 존재함을 지정하려는 것이지, 하나 또는 그 이상의 다른 특징들이나 숫자, 단계, 동작, 구성요소, 부품 또는 이들을 조합한 것들의 존재 또는 부가 가능성을 미리 배제하지 않는 것으로 이해되어야 한다.The terms used in the present application are only used to describe specific embodiments, and are not intended to limit the present invention. Singular expressions include plural expressions unless the context clearly indicates otherwise. In the present application, terms such as "comprise" or "have" are intended to designate the presence of features, numbers, steps, actions, components, parts, or combinations thereof described in the specification, but one or more other features. It is to be understood that the presence or addition of elements or numbers, steps, actions, components, parts, or combinations thereof, does not preclude in advance.

다르게 정의되지 않는 한, 기술적이거나 과학적인 용어를 포함해서 여기서 사용되는 모든 용어들은 본 발명이 속하는 기술 분야에서 통상의 지식을 가진 자에 의해 일반적으로 이해되는 것과 동일한 의미를 가지고 있다. 일반적으로 사용되는 사전에 정의되어 있는 것과 같은 용어들은 관련 기술의 문맥 상 가지는 의미와 일치하는 의미를 가지는 것으로 해석되어야 하며, 본 출원에서 명백하게 정의하지 않는 한, 이상적이거나 과도하게 형식적인 의미로 해석되지 않는다.Unless otherwise defined, all terms used herein, including technical or scientific terms, have the same meaning as commonly understood by one of ordinary skill in the art to which the present invention belongs. Terms as defined in a commonly used dictionary should be interpreted as having a meaning consistent with the meaning in the context of the related technology, and should not be interpreted as an ideal or excessively formal meaning unless explicitly defined in this application. Does not.

본 발명에서 복수의 사용자가 평가 대상이 되는 복수의 아이템 중 적어도 하나를 평가한 결과인 평가결과(예, 행렬)가 존재할 때, 그 평가결과에서 어떤 사용자가 평가한 아이템은 평가아이템(rated item)으로 정의하고, 그 사용자가 평가하지 않은 아이템은 미평가아이템(unrated item)으로 정의한다. 여기서, 미평가아이템 중 그 사용자의 평가 결과가 낮을 것으로 예상되는 아이템을 무관심아이템(uninteresting item)으로 정의하고, 미평가아이템 중에서 무관심아이템을 제외한 나머지를 선호도를 예측하기 어려운 아이템인 무정보아이템(unknown item)으로 정의한다.In the present invention, when there is an evaluation result (e.g., a matrix) that is a result of evaluating at least one of a plurality of items subject to evaluation by a plurality of users, the item evaluated by a user in the evaluation result is a rated item It is defined as, and items not rated by the user are defined as unrated items. Here, among the unevaluated items, an item that is expected to have a low evaluation result by the user is defined as an uninteresting item, and among the unevaluated items, the remaining items except for the indifference item are items that are difficult to predict the preference. item).

한편, 본 발명은 암시적인 피드백(예, 클릭)에 기반하는 단일 클래스 환경에 주로 적용될 수 있다. 하지만, 본 발명은 이에 한정되지 않으며, 적절한 데이터 변형 및 가공을 통해 명시적 피드백(예, 레이팅)에 기반하는 다중 클래스 환경에도 적용될 수 있음은 물론이다.Meanwhile, the present invention can be mainly applied to a single class environment based on implicit feedback (eg, click). However, the present invention is not limited thereto, and can be applied to a multi-class environment based on explicit feedback (eg, rating) through appropriate data transformation and processing.

도 1은 (a) 단일 유형 페어와이즈 선호도와 (b) 다중 유형 페어와이즈 선호도를 설명하기 위한 도면이다.FIG. 1 is a diagram for explaining (a) a single type pairwise preference and (b) a multiple type pairwise preference.

우선, 도 1(a)를 참조하면, 종래의 기술(즉, Bayesian Personalized Ranking, BPR)에 따른 단일 유형 페어와이즈 선호도가 나타나 있다. 이때, 종래의 기술에서는 평가아이템과 미평가아이템으로 단일 유형의 페어를 구성하며, 평가아이템에 관한 평가 결과가 미평가아이템에 관한 평가결과보다 높다는 전제(PP)하에서, 그 차이를 극대화하도록 모델 파라미터를 학습시킬 수 있다.First, referring to FIG. 1(a), a single type pairwise preference according to a conventional technique (ie, Bayesian Personalized Ranking, BPR) is shown. At this time, in the conventional technology, a single type of pair is composed of an evaluation item and an unevaluated item, and under the premise that the evaluation result for the evaluation item is higher than the evaluation result for the unevaluated item (PP), the model parameter is to maximize the difference. Can be learned.

또한, 도 1(b)를 참조하면, 본 발명에 따른 다중 유형 페어와이즈 선호도가 나타나 있다. 이때, 본 발명에서는 평가아이템과 미평가아이템의 페어뿐만 아니라, 평가아이템과 무정보아이템의 페어, 평가아이템과 무관심아이템의 페어, 무정보아이템과 무관심아이템의 페어까지 총 4개의 다중 유형 페어가 나타나 있다. 또한, 본 발명에서는 평가아이템에 관한 평가 결과가 미평가아이템, 무정보아이템 및 무관심아이템에 관한 평가 결과보다 높다는 전제(PP1, PP2, PP3)와 무정보아이템에 관한 평가 결과가 무관심아이템에 관한 평가 결과보다 높다는 전제(PP4)하에서, 그 차이를 극대화하도록 모델 파라미터를 학습시킬 수 있다. 이때, i₁, i₂, i₃는 평가아이템이고, k₁, k₂는 무정보아이템이고, l₁, l₂, l₃, l₄는 무관심아이템이다.Further, referring to FIG. 1(b), a multi-type pairwise preference according to the present invention is shown. At this time, in the present invention, not only a pair of evaluation items and unevaluated items, but also a pair of evaluation items and non-information items, a pair of evaluation items and indifference items, and a pair of non-information items and indifference items. have. In addition, in the present invention, the premise that the evaluation result for the evaluation item is higher than the evaluation result for unevaluated items, no information items, and indifferent items (PP1, PP2, PP3) and the evaluation results for no information items are evaluated for indifferent items. Under the premise of higher than the result (PP4), model parameters can be trained to maximize the difference. At this time, i ₁ , i ₂ , i ₃ are evaluation items, k ₁ and k ₂ are non-information items, and l ₁ , l ₂ , l ₃ and l ₄ are indifferent items.

도 2는 본 발명의 일 실시예에 따른 다중 유형 페어와이즈 기반 아이템 추천 방법을 설명하기 위한 흐름도이다.2 is a flowchart illustrating a method of recommending a multi-type pairwise-based item according to an embodiment of the present invention.

단계 S210에서는, 아이템 추천 장치가 협업 필터링 기법을 이용하여 평가결과에 포함된 복수의 미평가아이템의 평가 결과를 예측한 결과인 예측결과와 그 평가결과를 종합하여 종합결과를 생성한다.In step S210, the item recommendation device generates a comprehensive result by synthesizing a prediction result, which is a result of predicting the evaluation result of a plurality of unevaluated items included in the evaluation result, using a collaborative filtering technique and the evaluation result.

이때, 아이템 추천 장치는 협업 필터링 기법을 이용하여 미평가아이템에 대한 사용자의 평가 결과를 대략적으로 예측할 수 있다. 바람직하게는, 아이템 추천 장치는 WRMF(weighted regularized matrix factorization) 기법을 이용할 수 있다. 한편, WRMF는 잘 알려진 OCCF 기법으로서, 예측결과에 포함되는 미평가아이템을 1(positive) 또는 0(negative)로 나타낼 수 있다.In this case, the item recommendation device may roughly predict the user's evaluation result for the unevaluated item using a collaborative filtering technique. Preferably, the item recommendation device may use a weighted regularized matrix factorization (WRMF) technique. Meanwhile, WRMF is a well-known OCCF technique, and can represent unevaluated items included in the prediction result as 1 (positive) or 0 (negative).

또한, 아이템 추천 장치는 평가결과와 예측결과를 종합하여 종합결과를 생성할 수 있다. 예컨대, 종합결과는 평가결과에 포함된 평가아이템의 평가 결과와 예측결과에 포함된 미평가아이템의 평가 결과를 합친 것으로, 행렬의 형태일 수 있다.In addition, the item recommendation device may generate a comprehensive result by synthesizing the evaluation result and the prediction result. For example, the overall result is a sum of the evaluation result of the evaluation item included in the evaluation result and the evaluation result of the unevaluated item included in the prediction result, and may be in the form of a matrix.

단계 S220에서는, 아이템 추천 장치가 그 종합결과에서 복수의 미평가아이템을 무정보아이템과 무관심아이템으로 분류한다.In step S220, the item recommendation device classifies the plurality of unevaluated items into non-information items and indifferent items from the aggregate result.

예컨대, 아이템 추천 장치는 협업 필터링 기법을 통해 예측된 평가 결과가 낮은 미리 결정된 비율의 미평가아이템을 무관심아이템으로 분류할 수 있다. 또한, 아이템 추천 장치는 복수의 미평가아이템 중에서 복수의 무관심아이템을 제외한 나머지를 복수의 무정보아이템으로 분류할 수 있다.For example, the item recommendation device may classify unevaluated items of a predetermined ratio having a low predicted evaluation result through a collaborative filtering technique as indifferent items. In addition, the item recommendation apparatus may classify the rest of the plurality of unevaluated items excluding a plurality of indifferent items into a plurality of non-information items.

단계 S230에서는, 아이템 추천 장치가 그 종합결과 및 그 종합결과에 포함된 평가아이템, 미평가아이템, 무정보아이템 및 무관심아이템 중 2개로 구성되는 아이템페어 중에서 선별된 복수의 선별아이템페어에 기초하여, 모델 파라미터를 학습시킨다.In step S230, the item recommendation device is based on the overall result and a plurality of selected item pairs selected from item pairs consisting of two of an evaluation item, an unevaluated item, a no-information item, and an indifference item included in the total result, Train model parameters.

즉, 아이템 추천장치는 그 종합결과와 복수의 선별아이템페어를 이용하여, 아이템 추천 방법에서 사용되는 모델 파라미터를 학습시킬 수 있다.That is, the item recommendation device may learn model parameters used in the item recommendation method by using the aggregate result and a plurality of selected item pairs.

이때, 아이템페어는 평가아이템, 미평가아이템, 무정보아이템 및 무관심아이템 중에서 선별된 2개의 아이템으로 구성된다. 예컨대, 아이템페어는 {평가아이템, 미평가아이템}, {평가아이템, 무정보아이템}, {평가아이템, 무관심아이템}, {무정보아이템, 무관심아이템} 등이 될 수 있다.At this time, the item pair is composed of two items selected from evaluation items, unevaluated items, no information items, and indifferent items. For example, the item pair may be {evaluated item, unevaluated item}, {evaluated item, no information item}, {evaluated item, indifferent item}, {no information item, indifferent item}, etc.

한편, 아이템 추천 장치가 모델 파라미터를 학습시키는 자세한 내용은 도 3에 대한 설명에서 구체적으로 후술한다.Meanwhile, details of the item recommendation device learning model parameters will be described in detail later in the description of FIG. 3.

다른 실시예에서는, 복수의 선별아이템페어는 평가아이템과 미평가아이템으로 구성되는 아이템페어와 무정보아이템과 무관심아이템으로 구성되는 아이템페어를 포함할 수 있다. In another embodiment, the plurality of selection item pairs may include an item pair consisting of an evaluation item and an unevaluated item, and an item pair consisting of an uninformation item and an indifference item.

또한, 복수의 선별아이템페어는 평가아이템과 무정보아이템으로 구성되는 아이템페어, 평가아이템과 무관심아이템으로 구성되는 아이템페어 및 무정보아이템과 무관심아이템으로 구성되는 아이템페어를 포함할 수 있다.In addition, the plurality of selection item pairs may include an item pair consisting of an evaluation item and a non-information item, an item pair consisting of an evaluation item and an indifference item, and an item pair consisting of non-information items and indifferent items.

즉, 아이템 추천 장치는 보다 효율적으로 정확하게 아이템을 사용자에게 추천하기 위하여 복수의 선별아이템페어를 선별할 수 있다.That is, the item recommendation device may select a plurality of selection item pairs in order to more efficiently and accurately recommend an item to a user.

이를 위해, 복수의 선별아이템페어는 {평가아이템, 미평가아이템}, {무정보아이템, 무관심아이템}을 포함할 수 있다. 또한, 복수의 선별아이템페어는 {평가아이템, 무정보아이템}, {평가아이템, 무관심아이템}, {무정보아이템, 무관심아이템}을 포함할 수 있다.To this end, the plurality of selection item pairs may include {evaluated items, unevaluated items}, and {no information items, indifferent items}. In addition, the plurality of selection item pairs may include {evaluation item, no information item}, {evaluation item, indifferent item}, and {no information item, indifferent item}.

마지막으로 단계 S240에서는, 아이템 추천 장치가 그 학습된 모델 파라미터에 기초하여 복수의 미평가아이템에 대한 평가 결과를 예측한 결과인 최종예측결과를 이용하여, 복수의 사용자 중 적어도 하나의 추천대상자에게 소정 개수의 아이템을 추천한다.Finally, in step S240, the item recommendation device uses the final prediction result, which is a result of predicting evaluation results for a plurality of unevaluated items based on the learned model parameters, to at least one recommendation target among the plurality of users. Recommend a number of items.

즉, 아이템 추천 장치는 그 학습된 모델 파라미터를 이용하여 복수의 미평가아이템에 대한 평가 결과를 예측하여 최종예측결과를 생성할 수 있다. 최종예측결과는 앞서 언급한 예측결과보다 정확하게 예측된 결과로서, 아이템 추천 장치는 이 최종예측결과를 이용하여 추천대상자에게 상위 소정 개수의 아이템을 추천할 수 있다.That is, the item recommendation device may generate a final prediction result by predicting evaluation results for a plurality of unevaluated items by using the learned model parameters. The final prediction result is a result predicted more accurately than the above-described prediction result, and the item recommendation device may recommend a high-order predetermined number of items to a recommendation target by using the final prediction result.

이와 같이, 본 발명의 일 실시예에 따른 다중 유형 페어와이즈 기반 아이템 추천 방법은 복수의 아이템을 평가된 아이템, 미평가된 아이템, 무정보 아이템, 무관심 아이템으로 구분하고, 다중 유형의 아이템 페어에 기반하여 학습된 모델 파라미터를 이용함으로써, 사용자에게 보다 높은 정확도로 아이템을 추천할 수 있는 효과가 있다.As described above, the multi-type pairwise-based item recommendation method according to an embodiment of the present invention divides a plurality of items into evaluated items, unevaluated items, no-information items, and indifferent items, and is based on multi-type item pairs. Thus, by using the learned model parameters, it is possible to recommend items to users with higher accuracy.

도 3은 본 발명의 일 실시예에 따른 모델 파라미터의 학습 방법을 설명하기 위한 흐름도이다.3 is a flowchart illustrating a method of learning model parameters according to an embodiment of the present invention.

단계 S310에서는, 아이템 추천 장치가 복수의 선별아이템페어 각각에 포함된 아이템 종류 간의 평가 결과 차이에 기반하는 목적함수를 생성한다.In step S310, the item recommendation device generates an objective function based on a difference in evaluation results between item types included in each of the plurality of selected item pairs.

예컨대, 복수의 선별아이템페어가 {평가아이템, 미평가아이템}, {무정보아이템, 무관심아이템}을 포함하는 경우를 가정한다. 이때, 아이템 추천 장치는 복수의 사용자에 대하여 평가아이템과 미평가아이템의 평가 결과 차이와 무정보아이템과 무관심아이템의 평가 결과 차이의 총합을 나타내는 목적함수를 생성할 수 있다.For example, it is assumed that a plurality of selection item pairs include {evaluated items, unevaluated items}, and {no information items, indifferent items}. In this case, the item recommendation device may generate an objective function representing the sum of the difference between the evaluation results of the evaluation item and the non-evaluation item and the difference between the evaluation result of the non-information item and the indifferent item for a plurality of users.

다른 실시예에서는, 목적함수는 평가아이템의 평가 결과가 미평가아이템, 무정보아이템 및 무관심아이템의 평가 결과보다 높고, 무정보아이템의 평가 결과가 무관심아이템의 평가 결과보다 높다는 가정에 기반할 수 있다.In another embodiment, the objective function may be based on the assumption that the evaluation result of the evaluation item is higher than the evaluation result of the unevaluated item, the no-information item and the indifference item, and that the evaluation result of the no-information item is higher than the evaluation result of the indifference item .

이때, 도 1(b)를 참조하면, 평가아이템의 평가 결과가 미평가아이템의 평가 결과보다 높다는 가정은 PP1에 대응되고, 평가아이템의 평가 결과가 무정보아이템의 평가 결과보다 높다는 가정은 PP2에 대응되고, 평가아이템의 평가 결과가 무관심아이템의 평가 결과보다 높다는 가정은 PP3에 대응되고, 무정보아이템의 평가 결과가 무관심아이템의 평가 결과보다 높다는 가정은 PP4에 대응된다.At this time, referring to FIG. 1(b), the assumption that the evaluation result of the evaluation item is higher than the evaluation result of the unevaluated item corresponds to the PP1, and the assumption that the evaluation result of the evaluation item is higher than the evaluation result of the non-information item corresponds to PP2. Corresponds, and the assumption that the evaluation result of the evaluation item is higher than the evaluation result of the indifferent item corresponds to PP3, and the assumption that the evaluation result of the no-information item is higher than the evaluation result of the indifferent item corresponds to PP4.

즉, 아이템 추천 장치는 상술한 가정에 기반하여, 목적함수에서 그 차이가 극대화되도록 모델 파라미터를 학습시킬 수 있다.That is, the item recommendation device may train the model parameter to maximize the difference in the objective function based on the above-described assumption.

단계 S320에서는, 아이템 추천 장치가 그 복수의 사용자 중 선택된 하나의 선택사용자에 대하여, 종합결과로부터 복수의 선별아이템페어에 포함된 아이템 종류별 평가 결과를 포함하는 복수의 평가결과정보를 획득한다.In step S320, the item recommendation apparatus acquires a plurality of evaluation result information including evaluation results for each item type included in the plurality of selected item pairs from the aggregate result for one selected user selected from the plurality of users.

이때, 아이템 추천 장치는 그 복수의 사용자 중에서 무작위(random)로 하나의 선택사용자를 선정하는 것이 바람직할 수 있다. 그리고, 아이템 추천 장치는 종합결과에 포함된 선택사용자의 평가 결과로부터, 복수의 선별아이템페어에 포함된 아이템 종류별 평가 결과를 무작위로 하나 또는 복수로 추출하여, 복수의 평가결과정보를 획득할 수 있다.In this case, it may be preferable that the item recommendation device selects one selected user from among the plurality of users at random. In addition, the item recommendation device may randomly extract one or more evaluation results for each item type included in the plurality of selected item pairs from the evaluation results of the selected user included in the comprehensive result, and obtain a plurality of evaluation result information. .

예컨대, 복수의 선별아이템페어가 {평가아이템, 미평가아이템}, {무정보아이템, 무관심아이템}을 포함하는 경우를 가정한다. 이때, 아이템 추천 장치는 선택사용자의 평가 결과 중에서, 평가아이템, 미평가아이템, 무정보아이템, 무관심아이템에 대한 평가 결과를 각각 추출하여 복수의 평가결과정보를 획득할 수 있다. 또한, 미평가아이템에 대한 평가 결과는 무정보아이템과 무관심아이템 중 무작위로 선정된 아이템에 대한 평가 결과일 수 있다.For example, it is assumed that a plurality of selection item pairs include {evaluated items, unevaluated items}, and {no information items, indifferent items}. In this case, the item recommendation device may obtain a plurality of evaluation result information by extracting evaluation results for evaluation items, non-evaluated items, non-information items, and indifferent items from among the evaluation results of the selected user. In addition, the evaluation result for the unevaluated item may be an evaluation result for an item randomly selected among the no-information item and the indifference item.

다른 실시예에서는, 복수의 평가결과정보는 복수의 선별아이템페어에 포함된 아이템 종류별로 하나의 평가 결과를 포함할 수 있다.In another embodiment, the plurality of evaluation result information may include one evaluation result for each item type included in the plurality of selection item pairs.

즉, 아이템 추천 장치가 선택사용자에 대하여 아이템의 종류별로 하나씩의 아이템의 평가 결과를 이용하여, 목적함수값을 산출하고, 모델 파라미터를 학습시키기 위하여 하나씩의 평가 결과를 포함시킬 수 있다.That is, the item recommendation device may calculate an objective function value by using the evaluation result of one item for each item type for the selected user, and include each evaluation result in order to learn model parameters.

이는, 기존의 BPR(Bayesian Personalized Ranking)을 개별 아이템에 대한 다중 유형의 페어와이즈 평가 결과에 기반하도록 변경한 것으로, 본 발명에서는 M-BPR(Multi-type pair-wise preference BPR)로 명명한다.This is a change to the existing Bayesian Personalized Ranking (BPR) based on the result of multi-type pairwise evaluation for individual items, and is referred to as M-BPR (Multi-type pair-wise preference BPR) in the present invention.

또 다른 실시예에서는, 복수의 평가결과정보는 복수의 선별아이템페어에 포함된 아이템 종류별로 복수의 평가 결과인 그룹평가결과를 포함하고, 목적함수는 그룹평가결과의 차이에 기반할 수 있다.In another embodiment, the plurality of evaluation result information includes a group evaluation result that is a plurality of evaluation results for each item type included in the plurality of selection item pairs, and the objective function may be based on a difference between the group evaluation results.

즉, 아이템 추천 장치가 선택사용자에 대하여 아이템의 종류별로 복수의 아이템의 평가 결과(즉, 그룹평가결과)를 이용하여, 목적함수값을 산출하고, 모델 파라미터를 학습시키기 위하여 하나씩의 그룹평가 결과를 포함시킬 수 있다. 이때, 목적함수는 개별 아이템의 평가 결과 차이가 아니라, 그룹평가결과 차이에 기반하도록 생성될 수 있다. 또한, 그룹평가결과는 그 복수의 아이템의 평가 결과의 평균값을 이용하여 산출될 수 있다.That is, the item recommendation device calculates the objective function value by using the evaluation results of a plurality of items (i.e., group evaluation results) for each item type for the selected user, and calculates each group evaluation result to learn model parameters. Can be included. In this case, the objective function may be generated not based on differences in evaluation results of individual items, but differences in group evaluation results. Further, the group evaluation result may be calculated using an average value of the evaluation results of the plurality of items.

이는, 앞서 설명한 본 발명의 M-BPR을 그룹 단위의 아이템에 기반하도록 변경한 것으로서, 본 발명에서는 M-BPR^G로 명명한다.This is a modification of the M-BPR of the present invention described above to be based on an item in a group unit, and is referred to as M-BPR ^G in the present invention.

단계 S330에서는, 아이템 추천 장치가 목적함수 및 복수의 평가결과정보를 이용하여, 목적함수의 값이 극대화되도록 모델 파라미터값을 갱신한다.In step S330, the item recommendation device updates the model parameter value so that the value of the objective function is maximized by using the objective function and a plurality of evaluation result information.

예컨대, 아이템 추천 장치는 복수의 평가결과정보를 이용하여, 아이템 종류 간의 평가 결과의 차이(즉, 목적함수)가 극대화되도록 하는 모델 파라미터값을 결정해 나갈 수 있다. 이때, 아이템 추천 장치는 경사하강법(gradient descent) 등의 방법을 추가적으로 이용할 수 있다.For example, the item recommendation device may use a plurality of evaluation result information to determine a model parameter value that maximizes a difference in evaluation results between item types (ie, an objective function). In this case, the item recommendation device may additionally use a method such as gradient descent.

다른 실시예에서는, 아이템 추천 장치가 복수의 평가결과정보를 이용하여 목적함수의 그레디언트(gradient)값을 산출한 이후, SGD기법에 기반하여 그레디언트값을 적용함으로써 모델 파라미터값을 갱신할 수 있다.In another embodiment, after the item recommendation device calculates a gradient value of an objective function using a plurality of evaluation result information, the model parameter value may be updated by applying the gradient value based on the SGD technique.

우선, 아이템 추천 장치가 복수의 평가결과정보를 이용하여, 목적함수의 그레디언트값을 산출한다.First, the item recommendation device calculates a gradient value of the objective function by using a plurality of evaluation result information.

예컨대, 아이템 추천 장치는 목적함수의 그레디언트에 복수의 평가결과정보를 입력하여 그레디언트값을 산출할 수 있다.For example, the item recommendation device may calculate a gradient value by inputting a plurality of evaluation result information to the gradient of the objective function.

그리고, 아이템 추천 장치가 SGD(stochastic gradient descent) 기법에 기반하여 그레디언트값을 적용함으로써, 모델 파라미터값을 갱신한다.In addition, the item recommendation device updates the model parameter value by applying a gradient value based on a stochastic gradient descent (SGD) technique.

즉, 아이템 추천 장치는 SGD 기법에 기반하여, 산출된 그레디언트값을 적용하므로써, 기존의 모델 파라미터값을 새로 계산된 모델 파라미터값으로 갱신할 수 있다.That is, the item recommendation device may update the existing model parameter value with the newly calculated model parameter value by applying the calculated gradient value based on the SGD technique.

마지막으로 단계 S340에서는, 아이템 추천 장치가 소정의 완료 조건을 만족할 때까지, 단계 S320 및 단계 S330을 반복한다.Finally, in step S340, steps S320 and S330 are repeated until the item recommendation device satisfies a predetermined completion condition.

이때, 완료 조건은 미리 지정된 횟수를 반복하는 조건일 수 있다. 또는, 완료 조건은 갱신된 모델 파라미터값이 기존의 모델 파라미터값과 비교하여, 소정의 임계범위 이내로 수렴하는 조건일 수 있다.In this case, the completion condition may be a condition for repeating a predetermined number of times. Alternatively, the completion condition may be a condition in which the updated model parameter value is compared with the existing model parameter value and converged within a predetermined threshold range.

즉, 아이템 추천 장치는 이와 같은 완료 조건이 만족될 때까지, 단계 S320 및 단계 S330을 반복하여 지속적으로 모델 파라미터값을 갱신할 수 있다.That is, the item recommendation device may continuously update the model parameter value by repeating steps S320 and S330 until such a completion condition is satisfied.

한편, 본 발명의 M-BPR에서 복수의 선별아이템페어가 {평가아이템, 미평가아이템}, {무정보아이템, 무관심아이템}을 포함하는 경우의 목적함수는 수학식 1과 같다. 이때, 도 1(b)를 참조하면, 수학식 1은 PP1과 PP4를 이용하는 경우의 목적함수이다.On the other hand, in the M-BPR of the present invention, in the case where a plurality of selection item pairs include {evaluated items, unevaluated items} and {no information items, indifferent items}, the objective function is as shown in Equation 1. At this time, referring to FIG. 1(b), Equation 1 is an objective function when PP1 and PP4 are used.

[수학식 1][Equation 1]

여기서, f(θ)는 목적함수이고, θ 는 모델 파라미터이고, R(θ)는 모델 파라미터의 정규화 항이고, U는 사용자 집합이고, I_u ^rt는 사용자 u의 평가아이템 집합이고, I\I_u ^rt는 사용자 u의 미평가아이템 집합이고, I_u ^unk는 사용자 u의 무정보아이템 집합이고, I_u ^uin는 사용자 u의 무관심아이템 집합이고, w는 가중치고, σ는 sigmoid 함수이고, r_uij는 r_ui-r_uj이고, r_ukl은 r_uk-r_ul이고, r_ui, r_uj, r_uk, r_ul은는 사용자 u의 아이템 i, j, k, l에 대한 평가 결과이다.Here, f(θ) is the objective function, θ is the model parameter, R(θ) is the normalization term of the model parameter, U is the user set, I _u ^rt is the evaluation item set of user u, and I\I _u ^rt is the set of ^unevaluated items of user u, I _u ^unk is the set of ^uninformed items of user u, I _u ^uin is the set of indifference items of user u, w is the weight, σ is the sigmoid function, and r _uij is r -r _ui _uj, _uk r _ukl is r -r _ul, the evaluation results for _ui r, r _uj, _uk r, r _ul item eunneun user u i, j, k, l .

또한, 본 발명의 M-BPR에서 복수의 선별아이템페어가 {평가아이템, 무정보아이템}, {평가아이템, 무관심아이템}, {무정보아이템, 무관심아이템}을 포함하는 경우의 목적함수는 수학식 2와 같다. 이때, 도 1(b)를 참조하면, 수학식 2는 PP2, PP3 및 PP4를 이용하는 경우의 목적함수이다.In addition, in the M-BPR of the present invention, in the case where a plurality of selection item pairs include {evaluation item, no information item}, {evaluation item, indifferent item}, {no information item, indifferent item}, the objective function is Same as 2. At this time, referring to FIG. 1(b), Equation 2 is an objective function in the case of using PP2, PP3, and PP4.

[수학식 2][Equation 2]

여기서, α, β 및 γ는 PP2, PP4 및 PP4 각각에 대한 상대적인 학습 가중치를 제어하기 위한 신뢰계수(confident coefficient)이다.Here, α, β, and γ are confidence coefficients for controlling the relative learning weights for each of PP2, PP4, and PP4.

또한, 아래 수학식 3 및 4 각각은 수학식 1 및 2에 SGD알고리즘을 적용하여 최적화한 결과이다. In addition, each of Equations 3 and 4 below is a result of optimization by applying the SGD algorithm to Equations 1 and 2.

[수학식 3][Equation 3]

[수학식 4][Equation 4]

여기서, γ는 학습율이고, λθ는 정규화된 항에 관한 파라미터이고, 모델 파라미터 θ는 사용자 u의 잠재요인인 Uu·, 아이템 i의 잠재요인인 Vi· 및 아이템 i의 바이어스 bi를 포함한다.Here, γ is the learning rate, λθ is a parameter related to the normalized term, and the model parameter θ includes Uu· which is a latent factor of user u, Vi· which is a latent factor of item i, and bias bi of item i.

또한, 추가적으로 다음의 수학식 5와 같은 관계가 성립한다.In addition, the relationship as shown in Equation 5 below is additionally established.

[수학식 5][Equation 5]

또한, 본 발명의 M-BPR^G에서 복수의 선별아이템페어가 {평가아이템, 미평가아이템}, {무정보아이템, 무관심아이템}을 포함하는 경우의 목적함수는 수학식 6과 같다. 이때, 도 1(b)를 참조하면, 수학식 1은 PP1과 PP4를 이용하는 경우의 목적함수이다.In addition, in the case where a plurality of selection items in the M-BPR ^G of the present invention includes {evaluated items, unevaluated items}, and {no information items, indifferent items}, the objective function is shown in Equation 6. At this time, referring to FIG. 1(b), Equation 1 is an objective function when PP1 and PP4 are used.

[수학식 6][Equation 6]

또한, 본 발명의 M-BPR^G에서 복수의 선별아이템페어가 {평가아이템, 무정보아이템}, {평가아이템, 무관심아이템}, {무정보아이템, 무관심아이템}을 포함하는 경우의 목적함수는 수학식 7과 같다. 이때, 도 1(b)를 참조하면, 수학식 2는 PP2, PP3 및 PP4를 이용하는 경우의 목적함수이다.In addition, in the case of the M-BPR ^G of the present invention, when a plurality of selection item pairs include {evaluation item, no-information item}, {evaluation item, indifferent item}, and {no information item, indifferent item}, the objective function is mathematical Same as Equation 7. At this time, referring to FIG. 1(b), Equation 2 is an objective function in the case of using PP2, PP3, and PP4.

[수학식 7][Equation 7]

또한, 아래 수학식 8 및 9 각각은 수학식 6 및 7에 SGD알고리즘을 적용하여 최적화한 결과이다. In addition, each of Equations 8 and 9 below is a result of optimization by applying the SGD algorithm to Equations 6 and 7.

[수학식 8][Equation 8]

[수학식 9][Equation 9]

여기서, 다음의 수학식 10과 같은 관계가 성립한다.Here, the relationship as in Equation 10 below is established.

[수학식 10][Equation 10]

한편, 도 5를 참조하면, 본 발명의 일 실시예에 따른 모델 파라미터 학습 알고리즘의 의사코드가 나타나 있다. 즉, 본 발명의 M-BPR 및 M-BPR^G의 전 과정이 나타나 있다.Meanwhile, referring to FIG. 5, a pseudocode of a model parameter learning algorithm according to an embodiment of the present invention is shown. That is, the entire process of M-BPR and M-BPR ^G of the present invention is shown.

먼저, 아이템 추천 장치는 트레이닝 데이터셋 R = {(u, i)}에 포함된 복수의 미평가아이템에 대하여, 대략적인 평가 결과를 예측한다. 개별 사용자 u에 대하여 아이템 집합 I를 I_u ^rt, I_u ^unk, I_u ^uni로 분류한다. 다음으로, 모델 파라미터 θ를 T번 반복하면서 갱신한다. 이때, 그룹 기반이면(G=TRUE), 각 반복마다 사용자 u와 u에 대한 3종류의 그룹평가결과를 무작위로 획득하고, 그룹 기반이 아니면(G=FALSE), 각 반복마다 사용자 u와 u에 대한 3종류의 개별 아이템의 평가 결과를 무작위로 획득한다.First, the item recommendation device predicts an approximate evaluation result for a plurality of unevaluated items included in the training data set R = {(u, i)}. For each user u, the item set I is classified into I _u ^rt , I _u ^unk , and I _u ^uni . Next, the model parameter θ is updated while repeating T times. At this time, if group-based (G=TRUE), three types of group evaluation results for users u and u are randomly obtained for each iteration, and if not group-based (G=FALSE), users u and u for each iteration The evaluation results of 3 types of individual items are obtained at random.

또한, MODE=A이면, 복수의 선별아이템페어가 {평가아이템, 미평가아이템}, {무정보아이템, 무관심아이템}을 포함하는 경우로써, 미평가아이템에 관한 평가 결과를 더 획득할 수 있다. 또한, MODE=B이면, 복수의 선별아이템페어가 {평가아이템, 무정보아이템}, {평가아이템, 무관심아이템}, {무정보아이템, 무관심아이템}을 포함하는 경우이다.In addition, when MODE = A, as a case where the plurality of selection item pairs include {evaluated items, unevaluated items}, {no information items, indifferent items}, evaluation results for unevaluated items can be further obtained. In addition, if MODE=B, it is a case in which the plurality of selection item pairs include {evaluation item, no information item}, {evaluation item, indifferent item}, and {no information item, indifferent item}.

마지막으로, 아이템 추천 장치는 사용자 u에 대한 모델 파라미터 θ를 갱신한다. 이를 위해 아이템 추천 장치는 수학식 3, 4, 8 및 9를 이용할 수 있다.Finally, the item recommendation device updates the model parameter θ for the user u. For this, the item recommendation device may use Equations 3, 4, 8, and 9.

또한, 도 6을 참조하면, 종래 기술인 BPR, 본 발명의 2종류의 M-BPR, 본 발명의 2종류의 M-BPR^G에 대한 정확도가 나타나 있다. Further, referring to Fig. 6, the accuracy of the conventional BPR, the two types of M-BPR of the present invention, and the two types of M-BPR ^G of the present invention are shown.

이때, 2종류의 M-BPR은 선별아이템페어의 종류에 따른 것으로, (A)는 복수의 선별아이템페어가 {평가아이템, 미평가아이템}, {무정보아이템, 무관심아이템}을 포함하는 경우이고, (B)는 복수의 선별아이템페어가 {평가아이템, 무정보아이템}, {평가아이템, 무관심아이템}, {무정보아이템, 무관심아이템}을 포함하는 경우이다.At this time, the two types of M-BPR are according to the type of the selection item pair, and (A) is the case where the plurality of selection item pairs include {evaluated items, unevaluated items}, and {no information items, indifferent items}. , (B) is a case where a plurality of selected item pairs include {evaluation item, no information item}, {evaluation item, indifferent item}, and {no information item, indifferent item}.

구체적으로, M-BPR 및 M-BPR^G가 추천 아이템 개수(N)와 무관하게 종래 기술인 BPR보다 우수한 성능을 나타내고 있다. 또한, M-BPR와 M-BPR^G간에는 M-BPR^G가 M-BPR보다 우수한 성능을 나타내는 것을 알 수 있다.Specifically, M-BPR and M-BPR ^G exhibit superior performance than conventional BPR regardless of the number of recommended items (N). In addition, it can be seen that between M-BPR and M-BPR ^G, M-BPR ^G exhibits better performance than M-BPR.

또한, 도 7을 참조하면, 총 8개 방법의 추천 정확도가 나타나 있다.Also, referring to FIG. 7, recommendation accuracy of a total of eight methods is shown.

이때, 본 발명의 M-BPR이 나머지 모든 방법의 경우보다, 모든 데이터셋에 대하여 전반적으로 상당한 차이로 우수한 성능을 나타냈음을 알 수 있다. 보다 구체적으로, M-BPR의 GAIN은 12.84%(MovieLens100K), 27.41%(Watcha), 61.88%(Ciao), 17.82%(Last.fm)이다. At this time, it can be seen that the M-BPR of the present invention showed superior performance with a significant difference overall for all datasets than in the case of all other methods. More specifically, the GAIN of M-BPR is 12.84% (MovieLens100K), 27.41% (Watcha), 61.88% (Ciao), and 17.82% (Last.fm).

도 4는 본 발명의 일 실시예에 따른 다중 유형 페어와이즈 기반 아이템 추천 장치를 설명하기 위한 블록도이다.4 is a block diagram illustrating a multi-type pairwise-based item recommendation apparatus according to an embodiment of the present invention.

도 4를 참조하면, 본 발명의 일 실시예에 따른 다중 유형 페어와이즈 기반 아이템 추천 장치(400)는 생성부(410), 분류부(420), 학습부(430) 및 추천부(440)를 포함한다.4, a multi-type pairwise-based item recommendation device 400 according to an embodiment of the present invention includes a generation unit 410, a classification unit 420, a learning unit 430, and a recommendation unit 440. Include.

한편, 본 발명의 일 실시예에 따른 다중 유형 페어와이즈 기반 아이템 추천 장치(400)는 서버, 데스크탑PC, 노트북, 스마트폰 및 태블릿PC 등과 같은 컴퓨팅 장치에 탑재될 수 있다.Meanwhile, the multi-type pairwise-based item recommendation device 400 according to an embodiment of the present invention may be mounted on a computing device such as a server, a desktop PC, a notebook, a smart phone, and a tablet PC.

생성부(410)는 협업 필터링 기법을 이용하여, 평가결과에 포함된 복수의 미평가아이템의 평가 결과를 예측한 결과인 예측결과와 평가결과를 종합하여 종합결과를 생성한다.The generation unit 410 generates a comprehensive result by synthesizing the prediction result and the evaluation result, which is a result of predicting the evaluation result of a plurality of unevaluated items included in the evaluation result, using a collaborative filtering technique.

분류부(420)는 그 종합결과에서 복수의 미평가아이템을 복수의 무정보아이템과 복수의 비관심아이템으로 분류한다.The classification unit 420 classifies a plurality of unevaluated items into a plurality of non-information items and a plurality of non-interested items in the aggregate result.

학습부(430)는 그 종합결과 및 그 종합결과에 포함된 평가아이템, 미평가아이템, 무정보아이템 및 비관심아이템 중 2개로 구성되는 아이템페어 중에서 선별된 복수의 선별아이템페어에 기초하여, 모델 파라미터를 학습시킨다.The learning unit 430 is based on the overall result and a plurality of selected item pairs selected from among item pairs consisting of two of evaluation items, unevaluated items, no-information items, and non-interested items included in the overall result, and the model Learn parameters.

마지막으로 추천부(440)는 그 학습된 모델 파라미터에 기초하여 복수의 미평가아이템에 대한 평가 결과를 예측한 결과인 최종예측결과를 이용하여, 복수의 사용자 중 적어도 하나의 추천대상자에게 소정 개수의 아이템을 추천한다.Finally, the recommendation unit 440 uses the final prediction result, which is a result of predicting evaluation results for a plurality of unevaluated items, based on the learned model parameters, to at least one recommendation target among the plurality of users. Recommend the item.

다른 실시예에서는, 학습부(430)는 복수의 선별아이템페어 각각에 포함된 아이템 종류 간의 평가 결과 차이에 기반하는 목적함수를 생성하고, 복수의 사용자 중 선택된 하나의 선택사용자에 대하여, 종합결과로부터 상기 복수의 선별아이템페어에 포함된 아이템 종류별 평가 결과를 포함하는 복수의 평가결과정보를 획득하고, 목적함수 및 복수의 평가결과정보를 이용하여, 목적함수의 값이 극대화되도록 상기 모델 파라미터값을 갱신하고, 소정의 완료 조건을 만족할 때까지, 상기 복수의 평가결과정보를 획득하는 과정과 상기 모델 파라미터값을 갱신하는 과정을 반복할 수 있다.In another embodiment, the learning unit 430 generates an objective function based on an evaluation result difference between item types included in each of the plurality of selected item pairs, and for one selected user selected from among the plurality of users, from the synthesis result. Acquires a plurality of evaluation result information including evaluation results for each item type included in the plurality of selected item pairs, and updates the model parameter value so that the value of the objective function is maximized by using the objective function and the plurality of evaluation result information. Then, the process of acquiring the plurality of evaluation result information and the process of updating the model parameter values may be repeated until a predetermined completion condition is satisfied.

또 다른 실시예에서는, 학습부(430)가 모델 파라미터값을 갱신할 때, 복수의 평가결과정보를 이용하여, 목적함수의 그레디언트값을 산출하고, SGD(stochastic gradient descent) 기법에 기반하여 그레디언트값을 적용함으로써, 상기 모델 파라미터값을 갱신할 수 있다.In another embodiment, when the learning unit 430 updates a model parameter value, a gradient value of an objective function is calculated using a plurality of evaluation result information, and the gradient value is based on a stochastic gradient descent (SGD) technique. By applying, it is possible to update the model parameter value.

또 다른 실시예에서는, 복수의 평가결과정보는 복수의 선별아이템페어에 포함된 아이템 종류별로 하나의 평가 결과를 포함할 수 있다.In another embodiment, the plurality of evaluation result information may include one evaluation result for each item type included in the plurality of selection item pairs.

또 다른 실시예에서는, 목적함수는 평가아이템의 평가 결과가 미평가아이템, 무정보아이템 및 무관심아이템의 평가 결과보다 높고, 무정보아이템의 평가 결과가 무관심아이템의 평가 결과보다 높다는 가정에 기반할 수 있다.In another embodiment, the objective function may be based on the assumption that the evaluation result of the evaluation item is higher than the evaluation result of unevaluated items, no information items, and indifferent items, and that the evaluation result of no information items is higher than the evaluation result of indifferent items. have.

또 다른 실시예에서는, 복수의 선별아이템페어는 평가아이템과 미평가아이템으로 구성되는 아이템페어와 무정보아이템과 무관심아이템으로 구성되는 아이템페어를 포함하거나, 평가아이템과 무정보아이템으로 구성되는 아이템페어, 평가아이템과 무관심아이템으로 구성되는 아이템페어 및 무정보아이템과 무관심아이템으로 구성되는 아이템페어를 포함할 수 있다.In another embodiment, the plurality of selection item pairs include an item pair consisting of an evaluation item and an unevaluated item and an item pair consisting of an uninformed item and an indifferent item, or an item pair consisting of an evaluation item and an uninformed item. , An item pair consisting of an evaluation item and an indifference item, and an item pair consisting of an uninformed item and an indifference item.

상술한 본 발명의 실시예들은 컴퓨터에서 실행될 수 있는 프로그램으로 작성가능하고, 컴퓨터로 읽을 수 있는 기록매체를 이용하여 상기 프로그램을 동작시키는 범용 디지털 컴퓨터에서 구현될 수 있다.The above-described embodiments of the present invention can be written in a program that can be executed on a computer, and can be implemented in a general-purpose digital computer that operates the program using a computer-readable recording medium.

상기 컴퓨터로 읽을 수 있는 기록매체는 마그네틱 저장매체(예를 들면, 롬, 플로피 디스크, 하드디스크 등), 광학적 판독 매체(예를 들면, 시디롬, 디브이디 등) 를 포함한다.The computer-readable recording medium includes a magnetic storage medium (for example, a ROM, a floppy disk, a hard disk, etc.), and an optical reading medium (for example, a CD-ROM, a DVD, etc.).

이제까지 본 발명에 대하여 그 바람직한 실시예들을 중심으로 살펴보았다. 본 발명이 속하는 기술 분야에서 통상의 지식을 가진 자는 본 발명이 본 발명의 본질적인 특성에서 벗어나지 않는 범위에서 변형된 형태로 구현될 수 있음을 이해할 수 있을 것이다. 그러므로 개시된 실시예들은 한정적인 관점이 아니라 설명적인 관점에서 고려되어야 한다. 본 발명의 범위는 전술한 설명이 아니라 특허청구범위에 나타나 있으며, 그와 동등한 범위 내에 있는 모든 차이점은 본 발명에 포함된 것으로 해석되어야 할 것이다.
So far, the present invention has been looked at around its preferred embodiments. Those of ordinary skill in the art to which the present invention pertains will be able to understand that the present invention can be implemented in a modified form without departing from the essential characteristics of the present invention. Therefore, the disclosed embodiments should be considered from an illustrative point of view rather than a limiting point of view. The scope of the present invention is shown in the claims rather than the foregoing description, and all differences within the scope equivalent thereto should be construed as being included in the present invention.

Claims

In a method of recommending an item using an evaluation result that is a result of a plurality of users evaluating at least one item among a plurality of items,
Generating a comprehensive result by synthesizing a prediction result, which is a result of predicting an evaluation result of a plurality of unevaluated items included in the evaluation result, using a collaborative filtering technique and the evaluation result;
Classifying the plurality of unevaluated items into a plurality of non-information items and a plurality of indifferent items from the synthesis result;
Learning a model parameter based on the synthesis result and a plurality of selection item pairs selected from among item pairs consisting of two of an evaluation item, an unevaluated item, an uninformed item and an item of no interest included in the synthesis result; And
Recommending a predetermined number of items to at least one recommendation target among the plurality of users using a final prediction result that is a result of predicting evaluation results for the plurality of unevaluated items based on the learned model parameters
Multi-type pairwise-based item recommendation method comprising a.

The method of claim 1,
Learning the model parameters
Generating an objective function based on a difference in evaluation results between item types included in each of the plurality of selected item pairs;
Acquiring a plurality of evaluation result information including evaluation results for each item type included in the plurality of selected item pairs from the synthesis result for one selected user selected from among the plurality of users;
Updating the model parameter value to maximize the value of the objective function using the objective function and the plurality of evaluation result information; And
Repeating the steps of acquiring the plurality of evaluation result information and updating the model parameter values until a predetermined completion condition is satisfied.
Multi-type pairwise-based item recommendation method comprising a.

The method of claim 2,
Updating the model parameter value comprises:
Calculating a gradient value of the objective function using the plurality of evaluation result information; And
Updating the model parameter value by applying the gradient value based on a stochastic gradient descent (SGD) technique
Multi-type pairwise-based item recommendation method comprising a.

The method of claim 2,
The plurality of evaluation result information
A multi-type pairwise-based item recommendation method comprising one evaluation result for each item type included in the plurality of selected item pairs.

The method of claim 2,
The plurality of evaluation result information
And a group evaluation result that is a plurality of evaluation results for each item type included in the plurality of selected item pairs,
The objective function is
Multi-type pairwise-based item recommendation method, characterized in that based on the difference between the group evaluation results.

The method of claim 2,
The objective function is
Characterized in that based on the assumption that the evaluation result of the evaluation item is higher than the evaluation result of the unevaluated item, the no-information item and the indifferent item, and that the evaluation result of the no-information item is higher than the evaluation result of the indifferent item Multi-type pairwise-based item recommendation method.

The method of claim 1,
The plurality of selection item pairs are
An item pair consisting of the evaluation item and the unevaluated item and an item pair consisting of the non-information item and the indifferent item, or
An item pair consisting of the evaluation item and the non-information item, an item pair consisting of the evaluation item and the indifferent item, and an item pair consisting of the non-information item and the indifference item. Wise-based item recommendation method.

In the apparatus for recommending an item using an evaluation result that is a result of a plurality of users evaluating at least one item among a plurality of items,
A generator for generating a comprehensive result by synthesizing a prediction result, which is a result of predicting an evaluation result of a plurality of unevaluated items included in the evaluation result, and the evaluation result using a collaborative filtering technique;
A classification unit for classifying the plurality of unevaluated items into a plurality of non-information items and a plurality of non-interested items in the synthesis result;
A learning unit that learns model parameters based on the synthesis result and a plurality of selection item pairs selected from among item pairs consisting of two of an evaluation item, an unevaluated item, a no-information item, and an indifferent item included in the synthesis result; And
A recommendation unit that recommends a predetermined number of items to at least one recommendation target among the plurality of users using a final prediction result that is a result of predicting evaluation results for the plurality of unevaluated items based on the learned model parameters
Multi-type pairwise-based item recommendation device comprising a.

The method of claim 8,
The learning unit
Generate an objective function based on a difference in evaluation results between item types included in each of the plurality of selected item pairs,
For one selected user selected from among the plurality of users, a plurality of evaluation result information including evaluation results for each item type included in the plurality of selected item pairs are obtained from the comprehensive result,
Using the objective function and the plurality of evaluation result information, updating the model parameter value so that the value of the objective function is maximized,
A multi-type pairwise-based item recommendation device, characterized in that the process of acquiring the plurality of evaluation result information and the process of updating the model parameter value are repeated until a predetermined completion condition is satisfied.

The method of claim 9,
When the learning unit updates the model parameter value,
Using the plurality of evaluation result information, calculate the gradient value of the objective function,
A multi-type pairwise-based item recommendation device, characterized in that the model parameter value is updated by applying the gradient value based on a stochastic gradient descent (SGD) technique.

The method of claim 9,
The plurality of evaluation result information
A multi-type pairwise-based item recommendation device, comprising: one evaluation result for each item type included in the plurality of selected item pairs.

The method of claim 9,
The plurality of evaluation result information
And a group evaluation result that is a plurality of evaluation results for each item type included in the plurality of selected item pairs,
The objective function is
Multi-type pairwise-based item recommendation device, characterized in that based on the difference between the group evaluation results.

The method of claim 9,
The objective function is
Characterized in that based on the assumption that the evaluation result of the evaluation item is higher than the evaluation result of the unevaluated item, the no-information item and the indifferent item, and that the evaluation result of the no-information item is higher than the evaluation result of the indifferent item Multi-type pairwise-based item recommendation device.

The method of claim 8,
The plurality of selection item pairs are
An item pair consisting of the evaluation item and the unevaluated item and an item pair consisting of the non-information item and the indifferent item, or
An item pair consisting of the evaluation item and the non-information item, an item pair consisting of the evaluation item and the indifferent item, and an item pair consisting of the non-information item and the indifference item. Wise-based item recommendation device.