KR102537900B1

KR102537900B1 - Apparatus, method and recording medium storing commands for generating tags that indicate user characteristics

Info

Publication number: KR102537900B1
Application number: KR1020220023340A
Authority: KR
Inventors: 정태영
Original assignee: 현대카드 주식회사
Priority date: 2022-02-23
Filing date: 2022-02-23
Publication date: 2023-05-31

Abstract

본 개시의 일 측면에 따르면, 사용자 특징을 지시하는 태그를 생성하기 위한 장치가 제공될 수 있다. 본 개시에 따른 장치는, 하나 이상의 프로세서 및 상기 하나 이상의 프로세서에 의한 실행 시, 상기 하나 이상의 프로세서가 연산을 수행하도록 하는 명령들이 저장된 하나 이상의 메모리를 포함하고, 상기 하나 이상의 프로세서는, 상기 명령에 따라, 복수의 사용자 각각에 대한 사용자 정보 및 상기 복수의 사용자 각각에 대한 결제 정보를 획득하고, 상기 사용자 정보 및 상기 결제 정보에 기초하여, 상기 복수의 사용자 각각에 대해 하나 이상의 1차 태그를 생성하고 - 상기 하나 이상의 1차 태그는 하나 이상의 1차 사용자 특징을 각각 지시하고, 상기 하나 이상의 1차 사용자 특징 각각은 상기 사용자 정보 및 상기 결제 정보 중 적어도 하나에 의해 특정됨 - , 상기 복수의 사용자 중에서 미리 결정된 표본 사용자 그룹 내 표본 사용자 각각에 대한 하나 이상의 1차 태그에 기초하여, 상기 복수의 사용자 각각과 상기 표본 사용자 그룹 사이의 유사도를 결정하고, 상기 유사도에 기초하여, 상기 복수의 사용자 각각에 대해 2차 태그를 생성 - 상기 2차 태그는 상기 표본 사용자 그룹과 관련되 2차 사용자 특징을 지시함 - 할 수 있다. According to one aspect of the present disclosure, an apparatus for generating a tag indicating a user characteristic may be provided. An apparatus according to the present disclosure includes one or more processors and one or more memories storing instructions that, when executed by the one or more processors, cause the one or more processors to perform operations, and the one or more processors, according to the instructions , obtaining user information for each of a plurality of users and payment information for each of the plurality of users, and generating one or more primary tags for each of the plurality of users based on the user information and the payment information; The one or more primary tags indicate one or more primary user characteristics, respectively, and each of the one or more primary user characteristics is specified by at least one of the user information and the payment information - predetermined among the plurality of users. A similarity between each of the plurality of users and the sample user group is determined based on one or more primary tags for each of the sample users in the sample user group, and a secondary tag is determined for each of the plurality of users based on the degree of similarity. A tag may be created, wherein the secondary tag indicates secondary user characteristics associated with the sample user group.

Description

Recording medium on which devices, methods and commands for generating tags indicating user characteristics are recorded

본 개시는 사용자 특징을 지시하는 태그(tag)를 생성하기 위한 기술에 관한 것이다.The present disclosure relates to techniques for generating tags indicating user characteristics.

기업들은 마케팅 업무의 일환으로, 자신의 상품 또는 서비스를 구매할 잠재 고객(즉, 사용자)을 다양한 방식으로 탐색할 수 있다. 잠재 고객을 탐색하기 위하여 각 고객의 특징을 결정해야 할 수 있다. 각 고객이 어떠한 특징을 가지는지는 소위 휴리스틱(heuristic)한 방식으로 사람에 의해 결정될 수 있다. 그러나 휴리스틱한 방식은 사람이 직접 고객 특징을 결정한다는 점에서 결정 과정의 일관성이 결여될 수 있고, 그에 따라 고객 특징의 결정 결과에 대한 신뢰성이 떨어질 수 있다.As part of marketing tasks, companies may search potential customers (ie, users) for their products or services in various ways. To search for potential customers, you may need to determine the characteristics of each customer. What characteristics each customer has can be determined by humans in a so-called heuristic way. However, the heuristic method may lack consistency in the decision process in that a person directly determines the customer characteristics, and accordingly, the reliability of the customer characteristic determination result may decrease.

본 개시는 사용자 특징을 지시하는 태그를 생성하기 위한 기술을 제공한다.This disclosure provides techniques for creating tags that indicate user characteristics.

본 개시의 일 측면에 따르면, 사용자 특징을 지시하는 태그를 생성하기 위한 장치가 제공될 수 있다. 본 개시에 따른 장치는, 하나 이상의 프로세서; 및 상기 하나 이상의 프로세서에 의한 실행 시, 상기 하나 이상의 프로세서가 연산을 수행하도록 하는 명령들이 저장된 하나 이상의 메모리를 포함하고, 상기 하나 이상의 프로세서는, 상기 명령에 따라, 복수의 사용자 각각에 대한 사용자 정보 및 상기 복수의 사용자 각각에 대한 결제 정보를 획득하고, 상기 사용자 정보 및 상기 결제 정보에 기초하여, 상기 복수의 사용자 각각에 대해 하나 이상의 1차 태그를 생성하고 - 상기 하나 이상의 1차 태그는 하나 이상의 1차 사용자 특징을 각각 지시하고, 상기 하나 이상의 1차 사용자 특징 각각은 상기 사용자 정보 및 상기 결제 정보 중 적어도 하나에 의해 특정됨 - , 상기 복수의 사용자 중에서 미리 결정된 표본 사용자 그룹 내 표본 사용자 각각에 대한 하나 이상의 1차 태그에 기초하여, 상기 복수의 사용자 각각과 상기 표본 사용자 그룹 사이의 유사도를 결정하고, 상기 유사도에 기초하여, 상기 복수의 사용자 각각에 대해 2차 태그를 생성 - 상기 2차 태그는 상기 표본 사용자 그룹과 관련되 2차 사용자 특징을 지시함 - 할 수 있다. According to one aspect of the present disclosure, an apparatus for generating a tag indicating a user characteristic may be provided. An apparatus according to the present disclosure may include one or more processors; and one or more memories in which commands for causing the one or more processors to perform operations when executed by the one or more processors are stored, wherein the one or more processors, according to the commands, user information for each of a plurality of users and Acquiring payment information for each of the plurality of users, generating one or more primary tags for each of the plurality of users based on the user information and the payment information, wherein the one or more primary tags include one or more 1 Primary user characteristics are each indicated, and each of the one or more primary user characteristics is specified by at least one of the user information and the payment information, one for each sample user in a predetermined sample user group among the plurality of users. Based on the above primary tag, a similarity between each of the plurality of users and the sample user group is determined, and based on the similarity, a secondary tag is generated for each of the plurality of users - the secondary tag is Indicate secondary user characteristics in relation to the sample user group - may.

일 실시예에 있어서, 상기 하나 이상의 1차 태그 각각은, 상기 복수의 사용자 각각의 상기 사용자 정보, 결제처, 결제 품목, 결제 방식, 결제 시점, 결제 횟수, 결제 빈도 및 결제 액수 중에서 선택된 적어도 하나에 기초하여 특정되는 하나 이상의 1차 사용자 특징 각각을 지시할 수 있다.In one embodiment, each of the one or more primary tags is configured to at least one selected from among the user information of each of the plurality of users, a payment destination, a payment item, a payment method, a payment time point, a number of payments, a payment frequency, and a payment amount. Each of the one or more primary user characteristics specified on the basis may be indicated.

일 실시예에 있어서, 상기 복수의 사용자 각각에 대한 상기 사용자 정보는, 상기 복수의 사용자 각각의 행동 정보 또는 신용 정보를 포함할 수 있다.In one embodiment, the user information for each of the plurality of users may include behavior information or credit information of each of the plurality of users.

일 실시예에 있어서, 상기 하나 이상의 프로세서는, 상기 복수의 사용자 각각에 대한 1차 태그의 값과 상기 표본 사용자 각각에 대한 1차 태그의 값 사이의 차이에 기초하여, 상기 유사도를 결정할 수 있다.In an embodiment, the one or more processors may determine the similarity based on a difference between a primary tag value for each of the plurality of users and a primary tag value for each sample user.

일 실시예에 있어서, 상기 하나 이상의 프로세서는, 상기 복수의 사용자 각각에 대한 1차 태그의 값과 상기 표본 사용자 그룹의 1차 태그 기준값 사이의 차이에 기초하여, 상기 유사도를 결정 - 상기 1차 태그 기준값은 상기 표본 사용자 각각에 대한 1차 태그의 값에 기초하여 결정되는 값임 - 할 수 있다.In an embodiment, the one or more processors determine the degree of similarity - the primary tag based on a difference between a primary tag value for each of the plurality of users and a primary tag reference value of the sample user group. The reference value may be a value determined based on the value of the primary tag for each of the sample users.

일 실시예에 있어서, 상기 하나 이상의 메모리는, 상기 표본 사용자 각각에 대한 하나 이상의 1차 태그에 기초하여 학습된 신경망(Neural Network) 기반 학습 모델을 더 저장하고, 상기 하나 이상의 프로세서는, 상기 학습 모델에 기초하여, 상기 복수의 사용자 각각과 상기 표본 사용자 그룹 사이의 유사도를 결정할 수 있다.In one embodiment, the one or more memories further store a neural network-based learning model learned based on one or more primary tags for each of the sample users, and the one or more processors, the learning model Based on, a similarity between each of the plurality of users and the sample user group may be determined.

일 실시예에 있어서, 상기 하나 이상의 프로세서는, 상기 학습 모델에 상기 복수의 사용자 각각에 대한 하나 이상의 1차 태그의 값을 입력하고, 상기 학습 모델의 출력에 기초하여, 상기 유사도를 결정할 수 있다.In an embodiment, the one or more processors may input values of one or more primary tags for each of the plurality of users to the learning model, and determine the degree of similarity based on an output of the learning model.

일 실시예에 있어서, 상기 하나 이상의 프로세서는, 상기 복수의 사용자 중 상기 유사도가 높은 순으로 임의의 수의 사용자들을 결정하고, 상기 임의의 수의 사용자들 중 상기 표본 사용자의 비율을 결정하고, 상기 비율에 기초하여, 상기 복수의 사용자 각각에 대한 상기 2차 태그의 값을 결정할 수 있다.In one embodiment, the one or more processors determine a random number of users from among the plurality of users in an order of high similarity, determine a ratio of the sample user among the random number of users, and Based on the ratio, a value of the secondary tag for each of the plurality of users may be determined.

일 실시예에 있어서, 상기 2차 태그는, 상기 복수의 사용자 각각이 상기 2차 사용자 특징을 가지는지 여부 또는 상기 2차 사용자 특징에 부합하는 정도를 지시할 수 있다.In one embodiment, the secondary tag may indicate whether each of the plurality of users has the secondary user characteristics or a degree of matching with the secondary user characteristics.

본 개시의 다른 측면에 따르면, 사용자 특징을 지시하는 태그를 생성하기 위한 방법이 제공될 수 있다. 본 개시에 따른 방법은 하나 이상의 프로세서 및 상기 하나 이상의 프로세서에 의해 실행되기 위한 명령들이 저장된 하나 이상의 메모리를 포함하는 컴퓨팅 장치에서 수행되는 방법에 있어서, 상기 하나 이상의 프로세서가, 복수의 사용자 각각에 대한 사용자 정보 및 상기 복수의 사용자 각각에 대한 결제 정보를 획득하는 단계; 상기 하나 이상의 프로세서가, 상기 사용자 정보 및 상기 결제 정보에 기초하여, 상기 복수의 사용자 각각에 대해 하나 이상의 1차 태그를 생성하는 단계 - 상기 하나 이상의 1차 태그는 하나 이상의 1차 사용자 특징을 각각 지시하고, 상기 하나 이상의 1차 사용자 특징 각각은 상기 사용자 정보 및 상기 결제 정보 중 적어도 하나에 의해 특정됨 - ; 상기 하나 이상의 프로세서가, 상기 복수의 사용자 중에서 미리 결정된 표본 사용자 그룹 내 표본 사용자 각각에 대한 하나 이상의 1차 태그에 기초하여, 상기 복수의 사용자 각각과 상기 표본 사용자 그룹 사이의 유사도를 결정하는 단계; 및 상기 하나 이상의 프로세서가, 상기 유사도에 기초하여, 상기 복수의 사용자 각각에 대해 2차 태그를 생성하는 단계 - 상기 2차 태그는 상기 표본 사용자 그룹과 관련된 2차 사용자 특징을 지시함 - 를 포함할 수 있다. According to another aspect of the present disclosure, a method for generating a tag indicating a user characteristic may be provided. A method according to the present disclosure is performed in a computing device including one or more processors and one or more memories in which instructions to be executed by the one or more processors are stored, wherein the one or more processors, the user for each of a plurality of users obtaining information and payment information for each of the plurality of users; Generating, by the one or more processors, one or more primary tags for each of the plurality of users based on the user information and the payment information, the one or more primary tags indicating one or more primary user characteristics, respectively. and each of the one or more primary user characteristics is specified by at least one of the user information and the payment information; determining, by the one or more processors, a degree of similarity between each of the plurality of users and the sample user group, based on one or more primary tags for each sample user in a predetermined sample user group among the plurality of users; and generating, by the one or more processors, a secondary tag for each of the plurality of users based on the degree of similarity, wherein the secondary tag indicates a secondary user characteristic related to the sample user group. can

일 실시예에 있어서, 상기 복수의 사용자 각각에 대한 사용자 정보는, 상기 복수의 사용자 각각의 행동 정보 또는 신용 정보를 포함할 수 있다.In one embodiment, the user information for each of the plurality of users may include behavior information or credit information of each of the plurality of users.

일 실시예에 있어서, 상기 유사도를 결정하는 단계는, 상기 복수의 사용자 각각에 대한 1차 태그의 값과 상기 표본 사용자 각각에 대한 1차 태그의 값 사이의 차이에 기초하여 상기 유사도를 결정하는 단계를 포함할 수 있다.In an embodiment, the determining of the degree of similarity may include determining the degree of similarity based on a difference between a value of a primary tag for each of the plurality of users and a value of a primary tag for each of the sample users. can include

일 실시예에 있어서, 상기 유사도를 결정하는 단계는, 상기 복수의 사용자 각각에 대한 1차 태그의 값과 상기 표본 사용자 그룹의 1차 태그 기준값 사이의 차이에 기초하여, 상기 유사도를 결정하는 단계 - 상기 1차 태그 기준값은 상기 표본 사용자 각각에 대한 1차 태그의 값에 기초하여 결정되는 값임 - 를 포함할 수 있다.In an embodiment, the determining of the similarity may include determining the similarity based on a difference between a primary tag value for each of the plurality of users and a primary tag reference value of the sample user group - The primary tag reference value is a value determined based on the value of the primary tag for each of the sample users.

일 실시예에 있어서, 상기 하나 이상의 메모리는 상기 표본 사용자 각각에 대한 하나 이상의 1차 태그에 기초하여 학습된 신경망(Neural Network) 기반 학습 모델을 더 저장하고, 상기 유사도를 결정하는 단계는, 상기 학습 모델에 기초하여 상기 복수의 사용자 각각과 상기 표본 사용자 그룹 사이의 유사도를 결정하는 단계를 포함할 수 있다.In one embodiment, the one or more memories further store a neural network-based learning model learned based on one or more primary tags for each of the sample users, and the determining of the similarity may include the learning and determining a degree of similarity between each of the plurality of users and the sample user group based on a model.

일 실시예에 있어서, 상기 유사도를 결정하는 단계는, 상기 학습 모델에 상기 복수의 사용자 각각에 대한 하나 이상의 1차 태그의 값을 입력하는 단계; 및 상기 학습 모델의 출력에 기초하여 상기 유사도를 결정하는 단계를 포함할 수 있다.In an embodiment, the determining of the degree of similarity may include inputting one or more primary tag values for each of the plurality of users to the learning model; and determining the degree of similarity based on an output of the learning model.

일 실시예에 있어서, 상기 2차 태그를 생성하는 단계는, 상기 복수의 사용자 중 상기 유사도가 높은 순으로 임의의 수의 사용자들을 결정하는 단계; 상기 임의의 수의 사용자들 중 상기 표본 사용자의 비율을 결정하는 단계; 및 상기 비율에 기초하여, 상기 복수의 사용자 각각에 대한 상기 2차 태그의 값을 결정하는 단계를 포함할 수 있다.In an embodiment, the generating of the secondary tag may include determining a random number of users from among the plurality of users in an order of high similarity; determining a proportion of the sample users among the random number of users; and determining a value of the secondary tag for each of the plurality of users based on the ratio.

본 개시의 다른 측면에 따르면, 사용자 특징을 지시하는 태그를 생성하기 위해 컴퓨터 상에서 수행되기 위한 명령들을 기록한 비일시적 컴퓨터 판독 가능 기록 매체가 제공될 수 있다. 본 개시에 따른 기록 매체에 기록된 상기 명령들은, 하나 이상의 프로세서에 의한 실행시, 상기 하나 이상의 프로세서가, 복수의 사용자 각각에 대한 사용자 정보 및 상기 복수의 사용자 각각에 대한 결제 정보를 획득하고, 상기 사용자 정보 및 상기 결제 정보에 기초하여, 상기 복수의 사용자 각각에 대해 하나 이상의 1차 태그를 생성하고 - 상기 하나 이상의 1차 태그는 하나 이상의 1차 사용자 특징을 각각 지시하고, 상기 하나 이상의 1차 사용자 특징 각각은 상기 사용자 정보 및 상기 결제 정보 중 적어도 하나에 의해 특정됨 - , 상기 복수의 사용자 중에서 미리 결정된 표본 사용자 그룹 내 표본 사용자 각각에 대한 하나 이상의 1차 태그에 기초하여, 상기 복수의 사용자 각각과 상기 표본 사용자 그룹 사이의 유사도를 결정하고, 상기 유사도에 기초하여, 상기 복수의 사용자 각각에 대해 2차 태그를 생성하도록 - 상기 2차 태그는 상기 표본 사용자 그룹과 관련된 2차 사용자 특징을 지시함 - 할 수 있다.According to another aspect of the present disclosure, a non-transitory computer-readable recording medium recording instructions to be executed on a computer to generate a tag indicating a user characteristic may be provided. When the instructions recorded on the recording medium according to the present disclosure are executed by one or more processors, the one or more processors acquire user information for each of a plurality of users and payment information for each of the plurality of users, and generate one or more primary tags for each of the plurality of users based on the user information and the payment information, wherein the one or more primary tags each indicate one or more primary user characteristics; Each feature is specified by at least one of the user information and the payment information - Based on one or more primary tags for each sample user in a predetermined sample user group among the plurality of users, each of the plurality of users and determine a similarity between the sample user groups, and generate a secondary tag for each of the plurality of users based on the similarity, the secondary tag indicating a secondary user characteristic related to the sample user group; can do.

본 개시의 다양한 실시예들에 의하면, 사용자에 관한 다양한 데이터로부터, 해당 사용자의 특성을 객관적으로 결정할 수 있다.According to various embodiments of the present disclosure, characteristics of a corresponding user may be objectively determined from various data on the user.

본 개시의 다양한 실시예들에 의하면, 목표한 사용자 특징에 관한 표본 사용자 그룹과 각 사용자와의 유사도라는 일관된 기준에 기초하여, 복수의 사용자 각각이 해당 사용자 특징을 가지는지 여부를 결정할 수 있다.According to various embodiments of the present disclosure, it is possible to determine whether each of a plurality of users has a corresponding user characteristic based on a consistent criterion of similarity between each user and a sample user group related to a target user characteristic.

본 개시의 다양한 실시예들에 의하면, 유사도라는 일관된 기준에 기초하여 특정 상품 또는 서비스를 구매할 사용자를 탐색하므로, 탐색 결과에 대한 신뢰성이 향상될 수 있다.According to various embodiments of the present disclosure, since a user who will purchase a specific product or service is searched for based on a consistent criterion of similarity, the reliability of the search result can be improved.

도 1은 본 개시의 일 실시예에 따른 컴퓨팅 장치가 동작하는 과정을 도시한 도면이다.
도 2는 본 개시의 일 실시예에 따른 사용자 정보, 결제 정보, 행동 정보, 신용 정보, 1차 태그 집합 및 2차 태그를 설명하기 위한 도면이다.
도 3은 본 개시의 일 실시예에 따른 컴퓨팅 장치(100)의 블록도이다.
도 4는 본 개시의 일 실시예에 따른 복수의 사용자와 표본 사용자 그룹의 관계를 설명하기 위한 도면이다.
도 5는 본 개시의 일 실시예에 따른 신경망 기반 학습 모델에 기초한 유사도 결정 과정을 설명하기 위한 도면이다.
도 6은 본 개시의 일 실시예에 따른 유사도에 기초한 2차 태그 결정 과정을 설명하는 도면이다.
도 7은 본 개시의 일 실시예에 따른 컴퓨팅 장치의 동작 흐름도이다.
도 8은 본 개시의 일 실시예에 따른 컴퓨팅 장치의 동작 흐름도이다.1 is a diagram illustrating a process of operating a computing device according to an embodiment of the present disclosure.
2 is a diagram for explaining user information, payment information, behavior information, credit information, a primary tag set, and a secondary tag according to an embodiment of the present disclosure.
3 is a block diagram of a computing device 100 according to one embodiment of the present disclosure.
4 is a diagram for explaining a relationship between a plurality of users and a sample user group according to an embodiment of the present disclosure.
5 is a diagram for explaining a similarity determination process based on a neural network-based learning model according to an embodiment of the present disclosure.
6 is a diagram illustrating a process of determining a secondary tag based on similarity according to an embodiment of the present disclosure.
7 is a flowchart illustrating an operation of a computing device according to an embodiment of the present disclosure.
8 is a flowchart illustrating an operation of a computing device according to an embodiment of the present disclosure.

본 문서에 기재된 다양한 실시예들은, 본 개시의 기술적 사상을 명확히 설명하기 위한 목적으로 예시된 것이며, 이를 특정한 실시 형태로 한정하려는 것이 아니다. 본 개시의 기술적 사상은, 본 문서에 기재된 각 실시예의 다양한 변경(modifications), 균등물(equivalents), 대체물(alternatives) 및 각 실시예의 전부 또는 일부로부터 선택적으로 조합된 실시예를 포함한다. 또한 본 개시의 기술적 사상의 권리 범위는 이하에 제시되는 다양한 실시예들이나 이에 대한 구체적 설명으로 한정되지 않는다.Various embodiments described in this document are illustrated for the purpose of clearly explaining the technical spirit of the present disclosure, and are not intended to be limited to specific embodiments. The technical spirit of the present disclosure includes various modifications, equivalents, and alternatives of each embodiment described in this document, and embodiments selectively combined from all or parts of each embodiment. In addition, the scope of the technical idea of the present disclosure is not limited to various embodiments or specific descriptions thereof presented below.

기술적이거나 과학적인 용어를 포함해서, 본 문서에서 사용되는 용어들은, 달리 정의되지 않는 한, 본 개시가 속하는 기술 분야에서 통상의 지식을 가진 자에게 일반적으로 이해되는 의미를 가질 수 있다.Terms used in this document, including technical or scientific terms, may have meanings commonly understood by those of ordinary skill in the art to which this disclosure belongs, unless otherwise defined.

본 문서에서 사용되는 "포함한다", "포함할 수 있다", "구비한다", "구비할 수 있다", "가진다", "가질 수 있다" 등과 같은 표현들은, 대상이 되는 특징(예: 기능, 동작 또는 구성요소 등)이 존재함을 의미하며, 다른 추가적인 특징의 존재를 배제하지 않는다. 즉, 이와 같은 표현들은 다른 실시예를 포함할 가능성을 내포하는 개방형 용어(open-ended terms)로 이해되어야 한다.Expressions such as "comprises", "may include", "has", "may have", "has", "may have", etc. used in this document refer to the target characteristics (e.g. function, operation or component), and does not preclude the presence of other additional features. That is, such expressions should be understood as open-ended terms that include the possibility of including other embodiments.

본 문서에서 사용되는 단수형의 표현은, 문맥상 다르게 뜻하지 않는 한 복수형의 의미를 포함할 수 있으며, 이는 청구항에 기재된 단수형의 표현에도 마찬가지로 적용된다.Expressions in the singular form used in this document may include plural meanings unless the context indicates otherwise, and this applies equally to expressions in the singular form in the claims.

본 문서에서 사용되는 "제1", "제2", 또는 "첫째", "둘째" 등의 표현은, 문맥상 다르게 뜻하지 않는 한, 복수의 동종 대상들을 지칭함에 있어 한 대상을 다른 대상과 구분하기 위해 사용되며, 해당 대상들 간의 순서 또는 중요도를 한정하는 것은 아니다. 일 실시예에 있어서, 본 개시에 따른 복수의 사용자에 포함된 각각의 사용자들은 "제1 사용자", "제2 사용자"와 같이 표현됨으로써 서로 구분될 수 있다. 일 실시예에 있어서, 본 개시에 따른 표본 사용자 그룹에 포함된 복수의 표본 사용자들은 "제1 표본 사용자", "제2 표본 사용자"와 같이 표현됨으로써 서로 구분될 수 있다. 일 실시예에 있어서, 본 개시에 따른 하나 이상의 1차 태그들은 "제1의 1차 태그", "제2의 1차 태그"와 같이 표현됨으로써 서로 구분될 수 있다.As used herein, expressions such as "first", "second", or "first" or "second" refer to plural objects of the same kind, unless the context dictates otherwise, distinguishing one object from another. It is used to do so, and does not limit the order or importance between the objects. In one embodiment, each user included in the plurality of users according to the present disclosure may be distinguished from each other by being expressed as a “first user” or a “second user”. In one embodiment, a plurality of sample users included in the sample user group according to the present disclosure may be distinguished from each other by being expressed as "first sample user" and "second sample user". In one embodiment, one or more primary tags according to the present disclosure may be distinguished from each other by being expressed as “first primary tag” or “second primary tag”.

본 문서에서 사용되는 "A, B, 및 C", "A, B, 또는 C", "A, B, 및/또는 C" 또는 "A, B, 및 C 중 적어도 하나", "A, B, 또는 C 중 적어도 하나", "A, B, 및/또는 C 중 적어도 하나" 등의 표현은, 각각의 나열된 항목 또는 나열된 항목들의 가능한 모든 조합들을 의미할 수 있다. 예를 들어, "A 또는 B 중 적어도 하나"는, (1) 적어도 하나의 A, (2) 적어도 하나의 B, (3) 적어도 하나의 A 및 적어도 하나의 B를 모두 지칭할 수 있다.As used herein, "A, B, and C", "A, B, or C", "A, B, and/or C" or "at least one of A, B, and C", "A, B Expressions such as “at least one of , or C”, “at least one of A, B, and/or C” may mean each listed item or all possible combinations of the listed items. For example, “at least one of A or B” can refer to (1) at least one A, (2) at least one B, and (3) both at least one A and at least one B.

본 문서에서 사용되는 "부"라는 표현은, 소프트웨어, 또는 FPGA(field-programmable gate array), ASIC(application specific integrated circuit)과 같은 하드웨어 구성요소를 의미할 수 있다. 그러나, "부"는 하드웨어 및 소프트웨어에 한정되는 것은 아니다. "부"는 어드레싱할 수 있는 저장 매체에 저장되어 있도록 구성될 수도 있고, 하나 또는 그 이상의 프로세서들을 실행시키도록 구성될 수도 있다. 일 실시예에서, "부"는 소프트웨어 구성요소들, 객체지향 소프트웨어 구성요소들, 클래스 구성요소들 및 태스크 구성요소들과 같은 구성요소들과, 프로세서, 함수, 속성, 프로시저, 서브루틴, 프로그램 코드의 세그먼트, 드라이버, 펌웨어, 마이크로코드, 회로, 데이터, 데이터베이스, 데이터 구조, 테이블, 어레이 및 변수를 포함할 수 있다.The expression "unit" used in this document may mean software or a hardware component such as a field-programmable gate array (FPGA) or an application specific integrated circuit (ASIC). However, "unit" is not limited to hardware and software. A "unit" may be configured to be stored in an addressable storage medium or configured to execute one or more processors. In one embodiment, "unit" refers to components such as software components, object-oriented software components, class components and task components, processors, functions, properties, procedures, subroutines, programs. It can include segments of code, drivers, firmware, microcode, circuits, data, databases, data structures, tables, arrays and variables.

본 문서에서 사용되는 "~에 기초하여"라는 표현은, 해당 표현이 포함되는 어구 또는 문장에서 기술되는, 결정, 판단의 행위 또는 동작에 영향을 주는 하나 이상의 인자를 기술하는데 사용되고, 이 표현은 해당 결정, 판단의 행위 또는 동작에 영향을 주는 추가적인 인자를 배제하지 않는다.As used herein, the expression "based on" is used to describe one or more factors that affect the act or operation of a decision, judgment, or action described in a phrase or sentence in which the expression is included, and the expression refers to that It does not preclude additional factors that may affect the decision, the act of judgment, or the action.

본 문서에서 사용되는, 어떤 구성요소(예: 제1 구성요소)가 다른 구성요소(예: 제2 구성요소)에 "연결되어" 있다거나 "접속되어" 있다는 표현은, 상기 어떤 구성요소가 상기 다른 구성요소에 직접적으로 연결 또는 접속되는 것뿐 아니라, 새로운 다른 구성요소(예: 제3 구성요소)를 매개로 하여 연결 또는 접속되는 것을 의미할 수 있다.As used in this document, the expression that a certain component (eg, a first component) is "connected" or "connected" to another component (eg, a second component) refers to a component as described above. It may mean not only being directly connected or connected to another component, but also being connected or connected through another new component (eg, a third component).

본 문서에서 사용된 표현 "~하도록 구성된(configured to)"은 문맥에 따라, "~하도록 설정된", "~하는 능력을 가지는", "~하도록 변경된", "~하도록 만들어진", "~를 할 수 있는" 등의 의미를 가질 수 있다. 해당 표현은, "하드웨어적으로 특별히 설계된"의 의미로 제한되지 않으며, 예를 들어 특정 동작을 수행하도록 구성된 프로세서란, 해당 특정 동작을 수행하도록, 프로그래밍을 통해 구조화된 특수 목적 컴퓨터(special purpose computer)를 의미할 수 있다.The expression "configured to" as used in this document means "configured to", "having the ability to", "modified to", "made to", "to do", depending on the context. can have meanings such as "can". The expression is not limited to the meaning of "specifically designed in hardware", for example, a processor configured to perform a specific operation is a special purpose computer structured through programming to perform that specific operation. can mean

본 개시에서, 인공지능(Artificial Intelligence, AI)은 인간의 학습능력, 추론능력, 지각능력 등을 모방하고, 이를 컴퓨터로 구현하는 기술을 의미할 수 있다. 인공지능은 기계학습 내지 기계학습을 활용한 요소 기술 등을 포함할 수 있다. 기계학습은 입력되는 데이터를 분류하기 위해 적어도 하나의 학습용 데이터의 특징을 추출하는 알고리즘을 의미할 수 있다. 또한, 기계학습의 알고리즘을 활용하여 인간 두뇌의 인지, 판단 등의 기능을 모사하는 기술들 역시 인공지능의 범주로 이해될 수 있다. 예를 들어, 언어적 이해, 시각적 이해, 추론/예측, 지식 표현, 동작 제어 등의 기술 분야가 포함될 수 있다.In the present disclosure, artificial intelligence (AI) may refer to a technology that imitates human learning ability, reasoning ability, perception ability, and the like, and implements it with a computer. Artificial intelligence may include machine learning or element technology using machine learning. Machine learning may refer to an algorithm that extracts at least one feature of training data in order to classify input data. In addition, technologies that use machine learning algorithms to mimic the functions of the human brain, such as recognition and judgment, can also be understood as a category of artificial intelligence. For example, technical fields such as linguistic understanding, visual understanding, inference/prediction, knowledge expression, and motion control may be included.

이하, 첨부된 도면들을 참조하여, 본 개시의 다양한 실시예들을 설명한다. 첨부된 도면 및 도면에 대한 설명에서, 동일하거나 실질적으로 동등한(substantially equivalent) 구성요소에는 동일한 참조부호가 부여될 수 있다. 또한, 이하 다양한 실시예들의 설명에 있어서, 동일하거나 대응하는 구성요소를 중복하여 기술하는 것이 생략될 수 있으나, 이는 해당 구성요소가 그 실시예에 포함되지 않는 것을 의미하지는 않는다.Hereinafter, various embodiments of the present disclosure will be described with reference to the accompanying drawings. In the accompanying drawings and description of the drawings, identical or substantially equivalent components may be given the same reference numerals. In addition, in the following description of various embodiments, overlapping descriptions of the same or corresponding components may be omitted, but this does not mean that the corresponding components are not included in the embodiments.

도 1은 본 개시의 일 실시예에 따른 컴퓨팅 장치가 동작하는 과정을 도시한 도면이다. 본 개시에 따른 컴퓨팅 장치(100)는 적어도 하나의 다른 장치(110)로부터 복수의 사용자 각각에 대한 사용자 정보(210) 및/또는 결제 정보(220)를 획득할 수 있다. 본 개시에서 사용자 정보(210)는 사용자의 개인 인적 사항에 관한 적어도 하나의 정보를 포함할 수 있다. 예를 들어, 사용자 정보(210)는 사용자의 성별, 사용자의 연령, 사용자의 집 주소, 사용자의 혼인 여부 등에 관한 정보를 포함할 수 있다. 일 실시예에서, 사용자 정보(210)는 해당 사용자의 행동 정보(215) 및/또는 신용 정보(217)를 포함할 수 있다. 행동 정보(215)는 해당 사용자의 단말 기기 또는 해당 사용자를 대상으로 마케팅을 수행하는 주체로부터 전달된 데이터에 기초하여 생성되는 적어도 하나의 정보를 포함할 수 있다. 예를 들어, 행동 정보(215)는 웹사이트 로그 기록, 어플리케이션 로그 기록, TM(Telemarketing) 반응 정보, LMS(Long Message Service) 반응 정보 등을 포함할 수 있다. 여기서 TM 반응 정보는, 예를 들어, 텔레마케터의 마케팅 전화에 대해 사용자가 수신 거부를 설정했는지 여부 또는 마케팅 전화를 수신한 후 통화가 지속된 시간 등에 기초하는 정보일 수 있다. 또한 LMS 반응 정보는, 예를 들어, LMS 형식으로 전달되는 마케팅 문자에 대해 사용자가 수신 거부를 설정했는지 여부 또는 사용자가 LMS 형식의 마케팅 문자를 이용하여 마케팅 대상 웹사이트로 접속하였는지 여부 등에 기초하는 정보일 수 있다. 본 개시에서 신용 정보(217)는 해당 사용자의 신용도를 나타내기 위한 적어도 하나의 정보를 포함할 수 있다. 예를 들어, 신용 정보(217)는 소정의 신용평가기관(Credit Bureau)이 해당 사용자에 대해 부여한 신용 점수를 포함할 수 있다. 본 개시에 따른 결제 정보(220)는 사용자의 결제 행위에 기초하여 생성되는 적어도 하나의 정보를 포함할 수 있다. 예를 들어, 결제 정보(220)는 결제처, 결제 품목, 결제 방식, 결제 시점, 결제 횟수, 결제 빈도, 결제 액수 등에 관한 정보를 포함할 수 있다. 본 개시에서, 사용자 정보(210) 및/또는 결제 정보(220)는 컴퓨팅 장치(100)에 의해 가공되지 않은 상태의 로(raw) 데이터를 의미할 수 있다.1 is a diagram illustrating a process of operating a computing device according to an embodiment of the present disclosure. The computing device 100 according to the present disclosure may obtain user information 210 and/or payment information 220 for each of a plurality of users from at least one other device 110 . In the present disclosure, user information 210 may include at least one piece of information about a user's personal information. For example, the user information 210 may include information about the user's gender, the user's age, the user's home address, and the user's marital status. In one embodiment, the user information 210 may include behavioral information 215 and/or credit information 217 of the corresponding user. The behavioral information 215 may include at least one piece of information generated based on data transmitted from a terminal device of a corresponding user or an entity performing marketing for the corresponding user. For example, the behavior information 215 may include website log records, application log records, TM (Telemarketing) response information, LMS (Long Message Service) response information, and the like. Here, the TM response information may be, for example, information based on whether the user has set reception rejection for the telemarketer's marketing call or the duration of the call after receiving the marketing call. In addition, the LMS response information is, for example, information based on whether the user has set reception rejection for the marketing text delivered in the LMS format or whether the user has access to the marketing target website using the marketing text in the LMS format. can be In the present disclosure, the credit information 217 may include at least one piece of information to indicate the credit level of the corresponding user. For example, the credit information 217 may include a credit score assigned to a corresponding user by a predetermined credit bureau. The payment information 220 according to the present disclosure may include at least one piece of information generated based on a user's payment behavior. For example, the payment information 220 may include information about a payment destination, payment item, payment method, payment timing, payment number, payment frequency, payment amount, and the like. In the present disclosure, user information 210 and/or payment information 220 may mean raw data in a state not processed by the computing device 100 .

본 개시에서, "태그"는 사용자와 관련된 소정의 특징을 의미하거나, 설명하거나, 지시하거나, 유형화할 수 있는 정보일 수 있다. 도 1의 컴퓨팅 장치(100)는 사용자 정보(210) 및/또는 결제 정보(220)에 기초하여, 복수의 사용자 각각에 대해 하나 이상의 1차 태그를 생성할 수 있다. 본 1차 태그 각각은 1차 사용자 특징 각각을 지시할 수 있다. 본 개시에서 "1차 사용자 특징"은 사용자 정보(210) 및 결제 정보(220) 중 적어도 하나에 포함된 로(raw) 데이터들에 기초하여 1차적으로 도출되는 특징일 수 있다. 즉 1차 사용자 특징은, 해당 사용자의 사용자 정보 및/또는 결제 정보로부터 소정의 연산 과정(예: 사칙연산, 통계적 연산 등)을 통해 도출되는 특징일 수 있다. 예를 들어, 1차 사용자 특징(예: 홈쇼핑을 통해 지난 30일간 결제한 금액)은 해당 사용자의 사용자 정보(210) 또는 결제 정보(220)에 포함된 로 데이터의 수, 로 데이터의 평균값, 로 데이터의 사분위값, 로 데이터의 총합값 등에 기초하여 도출될 수 있다. 또한 한 사용자에 대한 1차 사용자 특징은, 다른 사용자의 사용자 정보, 결제 정보 또는 그들 정보와의 유사도와는 무관하게, 해당 사용자에 대해 독립적으로 도출될 수 있다. In the present disclosure, a "tag" may be information that may mean, describe, indicate, or categorize a certain characteristic related to a user. The computing device 100 of FIG. 1 may generate one or more primary tags for each of a plurality of users based on the user information 210 and/or the payment information 220 . Each of these primary tags may indicate each primary user characteristic. In the present disclosure, a “primary user feature” may be a feature that is primarily derived based on raw data included in at least one of the user information 210 and the payment information 220 . That is, the primary user characteristics may be characteristics derived from user information and/or payment information of the corresponding user through a predetermined calculation process (eg, four arithmetic operations, statistical calculations, etc.). For example, the primary user characteristics (eg, the amount paid through home shopping for the past 30 days) are the number of raw data included in the user information 210 or payment information 220 of the user, the average value of the raw data, and It may be derived based on the quartile value of data, the total value of raw data, and the like. In addition, the primary user characteristics for one user may be independently derived for that user, regardless of user information, payment information, or similarity with other users' information.

본 개시에 있어서 1차 태그는 해당 1차 태그에 대한 값을 가질 수 있다. 1차 태그의 값은, 사용자가 해당 1차 사용자 특징(예: OO 마트 이용 경과일 60일 이상)을 가지는지 여부(예: 1(참, True) / 0(거짓, False)), 또는 해당 1차 사용자 특징(예: 홈쇼핑을 통해 지난 30일간 결제한 금액)에 부합하는 정도(예: 30만원)를 지시할 수 있다. 컴퓨팅 장치(100)는 1차 사용자 특징을 지시하는 1차 태그 및 그 1차 태그의 값을 각 사용자에 대해 생성할 수 있다.In the present disclosure, the primary tag may have a value for the corresponding primary tag. The value of the primary tag is whether the user has the corresponding primary user characteristic (eg, OO mart usage age of 60 days or more) (eg, 1 (True) / 0 (False)), or The degree of conformity (eg, 300,000 won) to the primary user characteristics (eg, the amount paid for the past 30 days through home shopping) can be indicated. The computing device 100 may generate a primary tag indicating a primary user characteristic and a value of the primary tag for each user.

컴퓨팅 장치(100)는 하나 이상의 1차 태그에 기초하여, 복수의 사용자 각각에 대해 2차 태그(250)를 생성할 수 있다. 본 개시에서 2차 태그(250)는 2차 사용자 특징을 지시할 수 있다. 2차 사용자 특징은 마케팅 목적에 의해 임의로 설정된 사용자 특징(예: 중국 음식 이용 지수 등)일 수 있다. 일 실시예에서, 2차 사용자 특징에 대한 표본 사용자 그룹이 미리 결정될 수 있다. 표본 사용자 그룹은 복수의 사용자 중 2차 사용자 특징을 가질 것으로 추정된 표본 사용자들의 집합일 수 있다. 일 실시예에서, 표본 사용자 그룹은 타겟 고객 그룹, 정답 고객 그룹 등으로 불릴 수 있다. 일 실시예에서, 컴퓨팅 장치(100)는 표본 사용자 각각에 대한 하나 이상의 1차 태그에 기초하여, 복수의 사용자 각각과 표본 사용자 그룹 사이의 유사도를 결정할 수 있다. 컴퓨팅 장치(100)는 해당 유사도에 기초하여, 복수의 사용자 각각에 대해 해당 2차 사용자 특징을 지시하는 2차 태그(250)를 생성(부여)할 수 있다.The computing device 100 may generate a secondary tag 250 for each of a plurality of users based on one or more primary tags. In the present disclosure, the secondary tag 250 may indicate secondary user characteristics. Secondary user characteristics may be user characteristics arbitrarily set for marketing purposes (eg, Chinese food consumption index, etc.). In one embodiment, a sample user group for a secondary user characteristic may be predetermined. The sample user group may be a set of sample users estimated to have secondary user characteristics among a plurality of users. In one embodiment, the sample user group may be referred to as a target customer group, a correct customer group, or the like. In an embodiment, the computing device 100 may determine a degree of similarity between each of a plurality of users and a sample user group based on one or more primary tags for each sample user. Based on the similarity, the computing device 100 may generate (apply) a secondary tag 250 indicating a corresponding secondary user characteristic to each of a plurality of users.

본 개시에 있어서 2차 태그는 해당 2차 태그에 대한 값을 가질 수 있다. 2차 태그의 값은, 사용자가 해당 2차 사용자 특징(예: 중국 음식을 이용하는 사용자인지 여부)에 해당하는지(예: 1(참, True) / 0(거짓, False)), 또는 해당 2차 사용자 특징(예: 중국 음식 이용 지수)에 부합하는 정도(예: 5)를 지시할 수 있다. 컴퓨팅 장치(100)는 2차 사용자 특징을 지시하는 2차 태그 및 그 2차 태그의 값을 각 사용자에 대해 생성할 수 있다.In the present disclosure, a secondary tag may have a value for the corresponding secondary tag. The value of the secondary tag is whether the user corresponds to that secondary user characteristic (e.g., whether or not he or she is a Chinese food eater) (e.g., 1 (True) / 0 (False)), or whether the corresponding secondary tag The degree of conformity (eg, 5) to user characteristics (eg, Chinese food usage index) may be indicated. The computing device 100 may generate a secondary tag indicating a secondary user characteristic and a value of the secondary tag for each user.

본 개시에 있어서 "2차 사용자 특징"은 하나 이상의 1차 태그에 기초하여 2차적으로 도출되는 특징이라는 점에서, 로 데이터로부터 직접 도출되는 1차 사용자 특징과 구별될 수 있다. 본 개시에 있어서 각 사용자에 대한 "2차 사용자 특징"은 해당 사용자가 표본 사용자 그룹에 대하여 갖는 상대적인 특징일 수 있다. 예컨대 2차 사용자 특징은 각 사용자에 관한 정보들(예: 사용자 정보, 결제 정보)만으로 독립적으로 도출될 수는 없고, 기준이 되는 표본 사용자 그룹에 관한 정보들과 비교하여 상대적으로 도출되는 특징일 수 있다.In the present disclosure, “secondary user characteristics” may be distinguished from primary user characteristics directly derived from raw data in that “secondary user characteristics” are characteristics that are secondarily derived based on one or more primary tags. In the present disclosure, “secondary user characteristics” for each user may be relative characteristics of the corresponding user with respect to a sample user group. For example, secondary user characteristics cannot be derived independently with only information about each user (e.g., user information, payment information), but can be relatively derived by comparing information about a standard sample user group. there is.

본 개시에 있어서, "1차 태그와 1차 태그의 값" 또는 "2차 태그와 2차 태그의 값" 사이의 관계는, 데이터 처리를 위한 키(key) 값과 각 키 값에 대응되는 벨류(value) 값의 관계로 이해될 수 있다.In the present disclosure, the relationship between "the primary tag and the value of the primary tag" or "the secondary tag and the value of the secondary tag" is a key value for data processing and a value corresponding to each key value. (value) It can be understood as a relationship of values.

이와 같이 컴퓨팅 장치(100)는 복수의 사용자 각각에 대해 하나 이상의 1차 태그 및 하나 이상의 2차 태그를 생성할 수 있다. 일 실시예에서, 컴퓨팅 장치(100)는 하나 이상의 1차 태그 및 하나 이상의 2차 태그를 다른 장치(110)에 전달할 수 있다. 이를 통해 컴퓨팅 장치(100)는 다른 장치(110)의 운영 주체(예: 마케팅을 실시하려는 기업)에게 복수의 사용자 각각의 사용자 특징에 대한 분석을 제공할 수 있다.As such, the computing device 100 may generate one or more primary tags and one or more secondary tags for each of a plurality of users. In one embodiment, computing device 100 may communicate one or more primary tags and one or more secondary tags to another device 110 . Through this, the computing device 100 may provide an analysis of user characteristics of each of a plurality of users to an operator (eg, a company intending to conduct marketing) of the other device 110 .

도 2는 본 개시의 일 실시예에 따른 사용자 정보, 결제 정보, 1차 태그 집합 및 2차 태그를 설명하기 위한 도면이다. 본 개시에 있어서 하나 이상의 1차 태그들로 구성된 집합은 1차 태그 집합(230)으로 지칭될 수 있다. 전술한 바와 같이, 1차 태그는 1차 사용자 특징을 지시할 수 있다. 일 실시예에서, 1차 사용자 특징은 사용자의 사용자 정보(210) 및 결제 정보(220)의 로 데이터 중 적어도 하나에 의해 특정될 수 있다. 일 실시예에서, 1차 사용자 특징은 사용자의 사용자 정보(예: 성별 등), 결제처, 결제 품목, 결제 방식, 결제 시점, 결제 횟수, 결제 빈도 및 결제 액수 중에서 선택된 적어도 하나에 기초하여 특정될 수 있다. 또한 사용자 정보(210)는 해당 사용자의 행동 정보(215) 및/또는 신용 정보(217)를 포함할 수 있으므로, 1차 사용자 특징은 사용자의 웹 로그, 앱 로그, TM 반응 정보, LMS 반응 정보, 및 신용 점수 중에서 선택된 적어도 하나에 추가적으로 기초하여 특정될 수 있다.2 is a diagram for explaining user information, payment information, a primary tag set, and a secondary tag according to an embodiment of the present disclosure. In the present disclosure, a set composed of one or more primary tags may be referred to as a primary tag set 230 . As mentioned above, a primary tag may indicate a primary user characteristic. In one embodiment, the primary user characteristic may be specified by at least one of raw data of user information 210 and payment information 220 of the user. In one embodiment, the primary user characteristic may be specified based on at least one selected from user information (eg, gender, etc.) of the user, payment destination, payment item, payment method, payment time, payment number, payment frequency, and payment amount. can In addition, since the user information 210 may include the user's behavioral information 215 and/or credit information 217, the primary user characteristics include the user's web log, app log, TM response information, LMS response information, and a credit score.

본 개시의 일 실시예에 따르면, 하나 이상의 1차 태그에 포함된 제1의 1차 태그(231)는 "지난 1주일간 중국 음식 결제 횟수"를 의미하는 1차 사용자 특징을 지시할 수 있다. 이 때 "지난 1주일간 중국 음식 결제 횟수"를 의미하는 1차 사용자 특징은 결제 정보(220)에 포함된 결제처 및 결제 시점 등에 관한 정보에 기초하여 특정될 수 있다. 컴퓨팅 장치(100)는 결제처 및 결제 시점에 기초하여 "지난 1주일간 중국 음식 결제 횟수"를 지시하는 제1의 1차 태그(231)의 값을 산출할 수 있다. "지난 1주일간 중국 음식 결제 횟수"를 지시하는 제1의 1차 태그(231)의 값은, "횟수"를 표현하는 0 이상의 정수일 수 있다. 구체적으로, 컴퓨팅 장치(100) 획득한 결제 정보(220) 중 임의의 일 사용자에 대한 결제 정보가 포함하는 결제 행위들 중에서, "결제처"가 "중국 음식점"이고 "결제 시점"이 지난 1주일 이내에 해당하는 결제 행위들에 기초하여 "지난 1주일간 중국 음식 결제 횟수"를 지시하는 제1의 1차 태그(231)의 값을 산출할 수 있다. 예를 들어, 제1의 1차 태그(231)의 값은 3 또는 10 등일 수 있다.According to an embodiment of the present disclosure, the first primary tag 231 included in one or more primary tags may indicate a primary user characteristic meaning "number of payments for Chinese food in the past week". In this case, the primary user characteristic meaning "the number of payments for Chinese food in the past week" may be specified based on information about the payment destination and payment time included in the payment information 220 . The computing device 100 may calculate the value of the first primary tag 231 indicating "the number of payments for Chinese food in the past week" based on the payment destination and payment time. A value of the first primary tag 231 indicating "number of payments for Chinese food in the past week" may be an integer of 0 or more representing "number of times". Specifically, among the payment actions included in the payment information for any one user among the payment information 220 acquired by the computing device 100, the "payment destination" is a "Chinese restaurant" and the "payment point" is the last one week. A value of the first primary tag 231 indicating "the number of payments for Chinese food in the past week" may be calculated based on the payment actions corresponding to within the range. For example, the value of the first primary tag 231 may be 3 or 10.

본 개시의 다른 일 실시예에 따르면, 하나 이상의 1차 태그에 포함된 제2의 1차 태그(232)는 "홈쇼핑에서 지난 30일간 결제한 금액"을 의미하는 1차 사용자 특징을 지시할 수 있다. 이 때 "홈쇼핑에서 지난 30일간 결제한 금액"을 의미하는 1차 사용자 특징은 결제 정보(220)에 포함된 결제처, 결제 시점 및 결제 액수 등에 관한 정보 기초하여 특정될 수 있다. 컴퓨팅 장치(100)는 결제처, 결제 시점 및 결제 액수에 기초하여 "홈쇼핑에서 지난 30일간 결제한 금액"을 지시하는 제2의 1차 태그(232)의 값을 산출할 수 있다. "홈쇼핑에서 지난 30일간 결제한 금액"을 지시하는 제2의 1차 태그(232)의 값은, "금액"을 표현하는 0 이상의 실수일 수 있다. 구체적으로, 컴퓨팅 장치(100)는 획득한 결제 정보(220) 중 임의의 일 사용자에 대한 결제 정보가 포함하는 결제 행위들 중에서, "결제처"가 홈쇼핑 관련 업체이면서 "결제 시점"이 30일 이내인 결제 행위들의 "결제 액수"를 합산함으로써, "홈쇼핑에서 지난 30일간 결제한 금액"을 지시하는 제2의 1차 태그의 값을 산출할 수 있다. 예를 들어 제2의 1차 태그(232)의 값은 100,000 또는 300,000 등일 수 있다.According to another embodiment of the present disclosure, the second primary tag 232 included in one or more primary tags may indicate a primary user characteristic meaning "amount paid in home shopping for the past 30 days". . In this case, the primary user characteristic meaning "amount paid in home shopping for the past 30 days" may be specified based on information about the payment destination, payment time, and payment amount included in the payment information 220 . The computing device 100 may calculate the value of the second primary tag 232 indicating "amount paid for the past 30 days in home shopping" based on the payment destination, payment time, and payment amount. A value of the second primary tag 232 indicating "amount paid in home shopping for the past 30 days" may be a real number greater than or equal to 0 representing "amount". Specifically, the computing device 100 determines that among the payment actions included in the payment information for any one user among the acquired payment information 220, the "payment destination" is a home shopping related company and the "time of payment" is within 30 days. A value of the second primary tag indicating "amount paid for the last 30 days in home shopping" may be calculated by summing the "payment amount" of the payment activities. For example, the value of the second primary tag 232 may be 100,000 or 300,000.

본 개시의 일 실시예에 따른 컴퓨팅 장치(100)는 위 예시들과 동일 또는 유사한 방법으로, "주유소 이용 건수"를 의미하는 1차 사용자 특징에 관한 제3의 1차 태그(233), "B 마트 이용 경과일 60일 이상"을 의미하는 1차 사용자 특징에 관한 제4의 1차 태그(234), 또는 "외식업 관련 지난 30일간 결제 금액"을 의미하는 1차 사용자 특징에 관한 제5의 1차 태그(235) 등을 생성하고, 각각의 1차 태그에 대응되는 값을 산출할 수 있다. 그 밖에도 본 개시의 다양한 실시예에 따른 컴퓨팅 장치(100)는 행동 정보(215)에 포함된 웹 로그 또는 앱 로그에 기초하여 "지난 30일간 A 사이트 접속 횟수"를 의미하는 1차 태그, 행동 정보(215)에 포함된 TM 반응 정보 또는 LMS 반응 정보에 기초하여 "마케팅 연락 거부 횟수"를 의미하는 1차 태그, 또는 신용 정보(217)에 포함된 신용 점수에 기초하여 "신용등급 변화 여부"를 의미하는 1차 태그 등을 생성할 수도 있다. 전술한 1차 태그 또는 1차 태그의 값에 관한 기재 내용은 본 개시의 설명을 위한 예시적인 서술에 불과하며 본 개시를 제한하지 않는다.The computing device 100 according to an embodiment of the present disclosure, in the same or similar manner to the above examples, includes a third primary tag 233 related to a primary user characteristic meaning "the number of gas station uses", "B" The 4th primary tag 234 related to the primary user characteristic meaning "more than 60 days of mart usage" or the 5th primary tag 234 related to the primary user characteristic meaning "restaurant-related payment amount for the past 30 days" Secondary tags 235 and the like may be created, and values corresponding to respective primary tags may be calculated. In addition, the computing device 100 according to various embodiments of the present disclosure includes a primary tag meaning "the number of accesses to site A in the past 30 days" based on the web log or app log included in the behavior information 215, and behavior information. Based on the TM response information or LMS response information included in (215), the primary tag meaning "the number of marketing contact rejections" or "whether or not the credit rating has changed" based on the credit score included in the credit information (217) It is also possible to create primary tags and the like. The foregoing description of the primary tag or the value of the primary tag is only an exemplary description for description of the present disclosure and does not limit the present disclosure.

본 개시의 일 실시예에 따르면, 컴퓨팅 장치(100)는 1차 태그 집합(230)에 포함된 적어도 하나의 1차 태그에 기초하여 2차 태그(250)를 생성하고 해당 2차 태그(250)의 값을 결정할 수 있다. 일례로, 컴퓨팅 장치(100)는 "지난 1주일간 중국 음식 결제 횟수"를 지시하는 제1의 1차 태그(231)에 기초하여 "중국 음식 이용지수"를 지시하는 2차 태그(250)의 값을 결정할 수 있다. 다른 일례로, 컴퓨팅 장치(100)는 "지난 1주일간 중국 음식 결제 횟수"를 지시하는 제1의 1차 태그(231) 및 "외식업 관련 지난 30일간 결제 금액"을 지시하는 제5의 1차 태그(235)에 기초하여서도 "중국 음식 이용지수"를 지시하는 2차 태그(250)의 값을 결정할 수 있다. 여기서, 제5의 1차 태그(235)가 지칭하는 외식업은 중국 음식 관련업으로 한정될 수도 있다. 다른 일례로, 컴퓨팅 장치(100)는 1차 태그 집합의 1차 태그들 모두에 기초하여서 "중국 음식 이용지수"를 지시하는 2차 태그(250)의 값을 결정할 수 있다.According to an embodiment of the present disclosure, the computing device 100 generates a secondary tag 250 based on at least one primary tag included in the primary tag set 230, and the corresponding secondary tag 250 value can be determined. For example, the computing device 100 sets the value of the second tag 250 indicating the "Chinese food usage index" based on the first primary tag 231 indicating "the number of payments for Chinese food in the past week". can determine As another example, the computing device 100 includes a first primary tag 231 indicating "the number of payments for Chinese food in the past week" and a fifth primary tag indicating "amount of payment related to the restaurant business in the past 30 days". Also based on (235), the value of the secondary tag 250 indicating the "Chinese food availability index" can be determined. Here, the restaurant business indicated by the fifth primary tag 235 may be limited to a Chinese food-related business. As another example, the computing device 100 may determine the value of the secondary tag 250 indicating “Chinese food availability index” based on all of the primary tags of the primary tag set.

도 3은 본 개시의 일 실시예에 따른 컴퓨팅 장치(100)의 블록도이다. 본 개시의 일 실시예에 따르면, 컴퓨팅 장치(100)는 하나 이상의 프로세서(310) 및/또는 하나 이상의 메모리(320)를 구성요소로서 포함할 수 있다. 어떤 실시예에서는, 컴퓨팅 장치(100)의 이 구성요소들 중 적어도 하나가 생략되거나, 다른 구성요소가 컴퓨팅 장치(100)에 추가될 수 있다. 어떤 실시예에서는, 추가적으로 또는 대체적으로(in alternative to), 일부의 구성요소들이 통합되어 구현되거나, 단수 또는 복수의 개체로 구현될 수 있다. 본 개시에서, 하나 이상의 프로세서(310)는 프로세서(310)라고 표현될 수 있다. 프로세서(310)라는 표현은, 문맥상 명백히 다르게 표현하지 않는 이상, 하나 또는 그 이상의 프로세서의 집합을 의미할 수 있다. 본 개시에서, 하나 이상의 메모리(320)는 메모리(320)라고 표현될 수 있다. 메모리(320)라는 표현은, 문맥상 명백히 다르게 표현하지 않는 이상, 하나 또는 그 이상의 메모리의 집합을 의미할 수 있다.3 is a block diagram of a computing device 100 according to one embodiment of the present disclosure. According to one embodiment of the present disclosure, the computing device 100 may include one or more processors 310 and/or one or more memories 320 as components. In some embodiments, at least one of these components of computing device 100 may be omitted or other components may be added to computing device 100 . In some embodiments, additionally or alternatively, some components may be integrated and implemented, or implemented as a single or multiple entities. In this disclosure, one or more processors 310 may be referred to as processor 310 . The expression processor 310 may mean a set of one or more processors, unless the context clearly dictates otherwise. In this disclosure, one or more memories 320 may be referred to as memory 320 . The expression memory 320 may mean a set of one or more memories, unless the context clearly dictates otherwise.

컴퓨팅 장치(100) 내, 외부의 구성요소들 중 적어도 일부의 구성요소들은 버스, GPIO(general purpose input/output), SPI(serial peripheral interface) 또는 MIPI(mobile industry processor interface) 등을 통해 서로 연결되어, 데이터 및/또는 시그널을 주고 받을 수 있다.At least some of the internal and external components of the computing device 100 are connected to each other through a bus, general purpose input/output (GPIO), serial peripheral interface (SPI), or mobile industry processor interface (MIPI). , data and/or signals can be sent and received.

다양한 실시예에 따른 프로세서(310)는 소프트웨어(예: 명령, 프로그램 등)를 구동하여 프로세서(310)에 연결된 컴퓨팅 장치(100)의 적어도 하나의 구성요소를 제어할 수 있다. 또한 프로세서(310)는 본 개시와 관련된 다양한 연산, 처리, 데이터 생성, 가공 등의 동작을 수행할 수 있다. 또한 프로세서(310)는 데이터 등을 메모리(320)로부터 로드하거나, 메모리(320)에 저장할 수 있다.The processor 310 according to various embodiments may control at least one component of the computing device 100 connected to the processor 310 by driving software (eg, instructions, programs, etc.). In addition, the processor 310 may perform operations such as various calculations, processing, data generation, and processing related to the present disclosure. Also, the processor 310 may load data or the like from the memory 320 or store data in the memory 320 .

다양한 실시예에 따른 메모리(320)는 다양한 데이터를 저장할 수 있다. 메모리(320)에 저장되는 데이터는, 컴퓨팅 장치(100)의 적어도 하나의 구성요소에 의해 획득되거나, 처리되거나, 사용되는 데이터로서, 소프트웨어(예: 명령, 프로그램 등)를 포함할 수 있다. 메모리(320)는 휘발성 및/또는 비휘발성 메모리를 포함할 수 있다. 본 개시에서, 명령 내지 프로그램은 메모리(320)에 저장되는 소프트웨어로서, 컴퓨팅 장치(100)의 리소스를 제어하기 위한 운영체제, 어플리케이션 및/또는 어플리케이션이 컴퓨팅 장치(100)의 리소스들을 활용할 수 있도록 다양한 기능을 어플리케이션에 제공하는 미들 웨어 등을 포함할 수 있다. 일 실시예에서, 메모리(320)는 프로세서(310)에 의한 실행 시 프로세서(310)가 연산을 수행하도록 하는 명령들을 저장할 수 있다. 일 실시예에서, 메모리(320)는 복수의 사용자 각각에 대한 사용자 정보, 복수의 사용자 각각에 대한 결제 정보, 복수의 사용자에 대한 1차 태그, 복수의 사용자에 대한 2차 태그 등을 저장할 수 있다.The memory 320 according to various embodiments may store various data. Data stored in the memory 320 is data acquired, processed, or used by at least one component of the computing device 100 and may include software (eg, commands, programs, etc.). Memory 320 may include volatile and/or non-volatile memory. In the present disclosure, a command or a program is software stored in the memory 320, and various functions such as an operating system for controlling resources of the computing device 100, an application, and/or an application to utilize the resources of the computing device 100. may include middleware that provides the application. In one embodiment, memory 320 may store instructions that, when executed by processor 310, cause processor 310 to perform an operation. In one embodiment, the memory 320 may store user information for each of a plurality of users, payment information for each of a plurality of users, a primary tag for a plurality of users, a secondary tag for a plurality of users, and the like. .

일 실시예에서, 컴퓨팅 장치(100)는 통신 회로(330)를 더 포함할 수 있다. 통신 회로(330)는, 컴퓨팅 장치(100)와 서버 또는 컴퓨팅 장치(100)와 다른 장치들 간의 무선 또는 유선 통신을 수행할 수 있다. 예를 들어, 통신 회로(330)는 eMBB(enhanced Mobile Broadband),　URLLC(Ultra Reliable Low-Latency Communications), MMTC(Massive Machine Type Communications), LTE(Long-Term Evolution), LTE-A(LTE Advance), NR(New Radio), UMTS(Universal Mobile Telecommunications System), GSM(Global System for Mobile communications), CDMA(Code Division Multiple Access), WCDMA(Wideband CDMA), WiBro(Wireless Broadband), WiFi(Wireless Fidelity), 블루투스(Bluetooth), NFC(Near Field Communication), GPS(Global Positioning System) 또는 GNSS(Global Navigation Satellite System) 등의 방식에 따른 무선 통신을 수행할 수 있다. 예를 들어, 통신 회로(330)는 USB(Universal Serial Bus), HDMI(High Definition Multimedia Interface), RS-232(Recommended Standard-232) 또는 POTS(Plain Old Telephone Service) 등의 방식에 따른 유선 통신을 수행할 수 있다. 일 실시예에서, 통신 회로(330)는 다른 장치(110)와 통신을 수행할 수 있다.In one embodiment, computing device 100 may further include communication circuitry 330 . The communication circuitry 330 may perform wireless or wired communication between the computing device 100 and a server or between the computing device 100 and other devices. For example, the communication circuit 330 may include enhanced mobile broadband (eMBB), ultra reliable low-latency communications (URLLC), massive machine type communications (MMTC), long-term evolution (LTE), and LTE Advance (LTE-A). , New Radio (NR), Universal Mobile Telecommunications System (UMTS), Global System for Mobile communications (GSM), Code Division Multiple Access (CDMA), Wideband CDMA (WCDMA), Wireless Broadband (WiBro), Wireless Fidelity (WiFi), Wireless communication may be performed according to a method such as Bluetooth, Near Field Communication (NFC), Global Positioning System (GPS), or Global Navigation Satellite System (GNSS). For example, the communication circuit 330 performs wired communication according to a method such as Universal Serial Bus (USB), High Definition Multimedia Interface (HDMI), Recommended Standard-232 (RS-232), or Plain Old Telephone Service (POTS). can be done In one embodiment, communication circuitry 330 may communicate with another device 110 .

일 실시예에서, 컴퓨팅 장치(100)는 입력부(340)를 더 포함할 수 있다. 입력부(340)는 외부로부터 수신된 데이터를 컴퓨팅 장치(100)에 포함된 적어도 하나의 구성요소에 전달하기 위한 구성요소일 수 있다. 예를 들어, 입력부(340)는 마우스, 키보드, 터치 패드 등을 포함할 수 있다.In one embodiment, the computing device 100 may further include an input unit 340 . The input unit 340 may be a component for transferring data received from the outside to at least one component included in the computing device 100 . For example, the input unit 340 may include a mouse, keyboard, or touch pad.

본 개시의 다양한 실시예들에 따른 컴퓨팅 장치(100)는 다양한 형태의 장치가 될 수 있다. 예를 들어, 컴퓨팅 장치는 휴대용 통신 장치, 휴대용 멀티미디어 장치, 웨어러블 장치, 가전 장치 또는 상술한 장치들 중 하나 또는 그 이상의 조합에 따른 장치일 수 있다. 본 개시의 컴퓨팅 장치는 전술한 장치들에 한정되지 않는다.The computing device 100 according to various embodiments of the present disclosure may be various types of devices. For example, a computing device may be a portable communication device, a portable multimedia device, a wearable device, a consumer electronics device, or a device according to a combination of one or more of the foregoing. The computing device of the present disclosure is not limited to the devices described above.

일 실시예에서, 컴퓨팅 장치(100)는, 서버를 통해서 또는 직접, 외부의 다른 장치(110)와 통신할 수 있다. 다른 장치(110)는 컴퓨팅 장치(100)와 동일한 또는 다른 종류의 장치일 수 있다. 컴퓨팅 장치(100)는 다른 장치(110)로부터 전달받은 정보 등을 이용하여 후속되는 동작들을 수행할 수 있다. 컴퓨팅 장치(100)는 다른 장치(110)로부터 복수의 사용자 각각에 대한 사용자 정보(210) 및/또는 결제 정보(220)를 전달 받아, 각 사용자에 연관된 1차 태그 집합(230) 및/또는 2차 태그(250)를 생성하기 위한 후속 동작들을 수행할 수 있다. 이와 같은 과정을 수행하기 위하여, 클라우드 컴퓨팅 또는 분산 컴퓨팅 등의 방식이 사용될 수 있다.In one embodiment, the computing device 100 may communicate with another external device 110 through a server or directly. The other device 110 may be the same or a different type of device than the computing device 100 . The computing device 100 may perform subsequent operations using information received from the other device 110 . The computing device 100 receives user information 210 and/or payment information 220 for each of a plurality of users from the other device 110, and sets a primary tag set 230 and/or 2 associated with each user. Subsequent operations to create tea tag 250 may be performed. In order to perform such a process, a method such as cloud computing or distributed computing may be used.

도 4는 본 개시의 일 실시예에 따른 복수의 사용자와 표본 사용자 그룹의 관계를 설명하기 위한 도면이다. 전술한 바와 같이, 2차 태그(250)가 지시하는 2차 사용자 특징은 마케팅 목적에 의해 임의로 설정된 사용자 특징으로서, 2차 사용자 특징에 대한 표본 사용자 그룹이 미리 결정될 수 있다. 이하에서는 몇몇 실시예들을 통해 복수의 사용자 중에서 표본 사용자 그룹을 미리 결정하는 방법을 구체적으로 서술한다.4 is a diagram for explaining a relationship between a plurality of users and a sample user group according to an embodiment of the present disclosure. As described above, the secondary user characteristics indicated by the secondary tag 250 are user characteristics arbitrarily set for marketing purposes, and a sample user group for the secondary user characteristics may be predetermined. Hereinafter, a method of pre-determining a sample user group from among a plurality of users will be described in detail through some embodiments.

본 개시의 일 실시예에 따르면, 컴퓨팅 장치(100)는 외부로부터 복수의 사용자(410) 중 하나 이상의 사용자를 선택하기 위한 신호를 수신함으로써 표본 사용자 그룹(430)을 미리 결정할 수 있다. 컴퓨팅 장치(100)는 입력부(340)를 통해 복수의 사용자(410) 중 하나 이상의 사용자를 표본 사용자로서 선택하는 입력 신호를 수신하고, 이에 기초하여 표본 사용자 그룹(430)을 결정할 수 있다. 컴퓨팅 장치(100)는 통신 회로(330)를 통해 외부 서버 또는 다른 장치(110)로부터 복수의 사용자(410) 중 하나 이상의 사용자를 표본 사용자로서 선택하는 신호를 수신하고, 이에 기초하여 표본 사용자 그룹(430)을 결정할 수도 있다.According to an embodiment of the present disclosure, the computing device 100 may pre-determine the sample user group 430 by receiving a signal for selecting one or more users from among the plurality of users 410 from the outside. The computing device 100 may receive an input signal for selecting at least one user among the plurality of users 410 as a sample user through the input unit 340 and determine the sample user group 430 based on the input signal. The computing device 100 receives a signal for selecting one or more users among a plurality of users 410 as a sample user from an external server or other device 110 through the communication circuit 330, and based on the signal, a sample user group ( 430) may be determined.

본 개시의 일 실시예에 따르면, 컴퓨팅 장치(100)는 하나 이상의 임의의 조건에 기초하여 표본 사용자 그룹(430)을 미리 결정할 수 있다. 여기서 한 조건은 1차 태그 및 그 1차 태그 값에 대한 임계값을 포함할 수 있다.According to one embodiment of the present disclosure, the computing device 100 may pre-determine the sample user group 430 based on one or more arbitrary conditions. Here, one condition may include a primary tag and a threshold value for the primary tag value.

일례로, 컴퓨팅 장치(100)는 "최근 한 달 간 중국 음식점 이용 횟수 10회 이상"이라는 조건 하나에 기초하여 표본 사용자 그룹(430)을 결정할 수 있다. 즉, 해당 조건은 "최근 한달 간 중국 음식점 이용 횟수"를 지시하는 1차 태그 및 그 임계값인 10을 포함할 수 있다. 컴퓨팅 장치(100)는 복수의 사용자(410) 중 "최근 한달 간 중국 음식점 이용 횟수"에 해당하는 1차 태그의 값이 10 이상인 사용자들을 표본 사용자로 결정할 수 있다.For example, the computing device 100 may determine the sample user group 430 based on one condition, "the number of times of using a Chinese restaurant in the last month is 10 or more." That is, the corresponding condition may include a first tag indicating "the number of times of Chinese restaurant use in the last month" and a threshold value of 10. The computing device 100 may determine, among the plurality of users 410 , users having a value of 10 or more in a primary tag corresponding to “the number of times of using a Chinese restaurant in the last month” as sample users.

다른 일례로, 컴퓨팅 장치(100)는 "최근 한달 간 중국 음식점 이용 횟수가 10회 이상"이라는 조건과 "최근 한달 간 외식업 관련 결제 총액이 300만원 이상"이라는 조건에 기초하여 표본 사용자 그룹(430)을 결정할 수 있다. 즉, "최근 한달 간 중국 음식점 이용 횟수"를 지시하는 1차 태그 및 그 임계값이 10인 조건과, "최근 한달 간 외식업 관련 결제 총액"을 지시하는 1차 태그 및 그 임계값이 3,000,000인 조건이 모두 사용될 수 있다. 컴퓨팅 장치(100)는 복수의 사용자(410) 중 "최근 한달 간 중국 음식점 이용 횟수가 10회 이상"에 해당하는 1차 태그의 값이 10 이상이면서, "최근 한달 간 외식업 관련 결제 총액"에 해당하는 1차 태그의 값이 3,000,000 이상인 사용자들을 표본 사용자로 결정할 수 있다.As another example, the computing device 100 selects a sample user group 430 based on the condition that "the number of times of using a Chinese restaurant in the past month is 10 or more times" and the condition that "the total amount of payment related to the restaurant business in the past month is 3 million won or more" can determine That is, the first tag indicating “the number of times of Chinese restaurant use in the last month” and its threshold value is 10, and the first tag indicating “the total number of restaurant-related payments in the last month” and its threshold value is 3,000,000. All of these can be used. In the computing device 100, among the plurality of users 410, the value of the first tag corresponding to "the number of times of using a Chinese restaurant in the last month is 10 or more" is 10 or more, and corresponds to "the total amount of payments related to the restaurant business in the last month" Users whose primary tag value of 3,000,000 or more may be determined as sample users.

전술한 표본 사용자 그룹(430)을 결정하기 위한 임의의 조건에 관한 예시적 기재들은 본 개시를 한정하지 않으며, 다양한 실시예들을 따르는 컴퓨팅 장치(100)는 하나 이상의 1차 태그 각각에 관한 임의의 조건에 기초하여 복수의 사용자(410) 중에서 표본 사용자 그룹(430)을 결정할 수 있다.The illustrative descriptions of any conditions for determining the sample user group 430 described above are not limiting of this disclosure, and computing device 100 according to various embodiments may use any conditions for each of the one or more primary tags. Based on this, a sample user group 430 may be determined from among the plurality of users 410 .

전술한 바와 같이, 컴퓨팅 장치(100)는 복수의 사용자(410) 중에서 미리 결정된 표본 사용자 그룹(430) 내 표본 사용자 각각에 대한 하나 이상의 1차 태그에 기초하여, 복수의 사용자(410) 각각과 표본 사용자 그룹(430) 사이의 유사도를 결정할 수 있다. 이하에서는 몇몇 실시예들을 통해 복수의 사용자(410) 각각과 표본 사용자 그룹(430) 사이의 유사도를 결정하는 방법을 구체적으로 서술한다.As described above, the computing device 100 determines each of the plurality of users 410 and the sample based on one or more primary tags for each of the sample users in a predetermined sample user group 430 among the plurality of users 410 . A degree of similarity between user groups 430 may be determined. Hereinafter, a method for determining the degree of similarity between each of the plurality of users 410 and the sample user group 430 will be described in detail through some embodiments.

본 개시의 일 실시예에 따른 컴퓨팅 장치(100)는 복수의 사용자(410) 각각에 대한 1차 태그의 값과 표본 사용자 그룹(430) 내 표본 사용자 각각에 대한 1차 태그의 값 사이의 차이에 기초하여, 복수의 사용자(410) 각각과 표본 사용자 그룹(430) 사이의 유사도를 결정할 수 있다. 즉 컴퓨팅 장치(100)는 복수의 사용자(410) 중 한 사용자(이하, 제1 사용자(411))와 표본 사용자 각각에 대한 관계에서 1차 태그의 값의 차이 값을 연산하고, 그 연산 결과에 기초하여 제1 사용자(411)와 표본 사용자 그룹(430) 사이의 유사도를 결정할 수 있다. 이하 제1 사용자(411)를 기준으로 유사도 결정 방식을 설명하지만, 제1 사용자(411)에 대한 유사도 결정 방식과 유사한 방식으로 복수의 사용자 각각에 대한 유사도가 결정될 수 있다.The computing device 100 according to an embodiment of the present disclosure determines the difference between the value of the primary tag for each of the plurality of users 410 and the value of the primary tag for each sample user in the sample user group 430. Based on this, a similarity between each of the plurality of users 410 and the sample user group 430 may be determined. That is, the computing device 100 calculates the difference between the values of the primary tags in the relationship between one user (hereinafter, first user 411 hereinafter) and each sample user among the plurality of users 410 and calculates the difference value according to the calculation result. Based on this, a degree of similarity between the first user 411 and the sample user group 430 may be determined. The similarity determination method is described below based on the first user 411 , but the similarity determination method for each of the plurality of users may be determined in a manner similar to the similarity determination method for the first user 411 .

예를 들어, 컴퓨팅 장치(100)는 제1 사용자(411)의 소정의 1차 태그의 값과 각 표본 사용자가 가지는 해당 1차 태그의 값 각각 사이의 차이의 평균 값(이하, 제1 평균 오차)을 산출할 수 있다. 컴퓨팅 장치(100)는 제1 평균 오차에 기초하여, 제1 사용자(411)의 표본 사용자 그룹(430)에 대한 유사도를 결정할 수 있다. 구체적으로, 컴퓨팅 장치(100)는 제1 평균 오차가 작을수록 제1 사용자(411)가 표본 사용자 그룹(430)에 대해 높은 유사도를 가진다고 판단하고, 높은 유사도 값을 부여할 수 있다.For example, the computing device 100 calculates an average value of a difference between a predetermined primary tag value of the first user 411 and a corresponding primary tag value of each sample user (hereinafter referred to as a first average error). ) can be calculated. The computing device 100 may determine a similarity of the first user 411 to the sample user group 430 based on the first average error. Specifically, the computing device 100 may determine that the first user 411 has a high degree of similarity to the sample user group 430 as the first average error decreases, and may assign a high degree of similarity value.

이와 유사한 방식으로, 둘 이상의 1차 태그의 값의 차이 값에 기초하여 유사도가 결정될 수도 있다. 컴퓨팅 장치(100)는 제1 사용자(411)의 소정의 1차 태그의 값과 각 표본 사용자가 가지는 해당 1차 태그의 값 각각 사이의 차이의 평균값(즉, 제1 평균 오차)를 산출할 수 있다. 또한 컴퓨팅 장치(100)는 제1 사용자(411)의 다른 1차 태그의 값과 각 표본 사용자가 가지는 해당 다른 1차 태그의 값 각각 사이의 차이의 평균값(이하, 제2 평균 오차)를 산출할 수 있다. 컴퓨팅 장치(100)는 제1 평균 오차 및 제2 평균 오차에 기초하여, 제1 사용자(411)의 표본 사용자 그룹(430)에 대한 유사도를 결정할 수 있다. 일례로, 컴퓨팅 장치(100)는 복수의 평균 오차를 합산함으로써, 제1 사용자(411)의 표본 사용자 그룹(430)에 대한 유사도를 결정할 수 있다. 컴퓨팅 장치(100)는 복수의 평균 오차들에 대한 합산 값이 작을수록 제1 사용자(411)에 대해 높은 유사도 값을 부여할 수 있다. 다른 일례로, 컴퓨팅 장치(100)는 복수의 평균 오차 각각에 대해 가중치를 적용하여 가중 합산(weighted sum) 연산을 수행함으로써, 제1 사용자(411)의 표본 사용자 그룹(430)에 대한 유사도를 결정할 수 있다. 구체적으로, 컴퓨팅 장치(100)는 제1 평균 오차에 대해 가중치

, 제2 평균 오차에 대해 가중치

를 각각 곱한 후 합산하여 평균 오차를 가중 합산할 수 있다. 컴퓨팅 장치(100)는 복수의 평균 오차들에 대한 가중 합산 값이 작을수록 제1 사용자(411)에 대해 높은 유사도 값을 부여할 수 있다.In a similar way, the degree of similarity may be determined based on a difference between values of two or more primary tags. The computing device 100 may calculate an average value of a difference between a predetermined primary tag value of the first user 411 and a corresponding primary tag value of each sample user (ie, a first average error). there is. In addition, the computing device 100 may calculate an average value of differences between values of other primary tags of the first user 411 and values of corresponding other primary tags of each sample user (hereinafter referred to as a second average error). can The computing device 100 may determine a degree of similarity between the first user 411 and the sample user group 430 based on the first average error and the second average error. For example, the computing device 100 may determine the degree of similarity between the first user 411 and the sample user group 430 by summing a plurality of mean errors. The computing device 100 may assign a higher similarity value to the first user 411 as the sum of the plurality of average errors is smaller. As another example, the computing device 100 determines the degree of similarity of the first user 411 to the sample user group 430 by applying a weight to each of a plurality of mean errors and performing a weighted sum operation. can Specifically, the computing device 100 weights the first mean error.

, weighted for the second mean error

The weighted sum of average errors can be obtained by multiplying and summing each. The computing device 100 may assign a higher similarity value to the first user 411 as the weighted sum of the plurality of average errors is smaller.

본 개시에서 제1 사용자(411)의 표본 사용자 그룹(430)에 대한 유사도를 결정함에 있어, 컴퓨팅 장치(100)는 전술한 "평균 오차"에 기반한 방법 외에도 다양한 통계 분석 방법을 활용할 수 있다. 예를 들어, 컴퓨팅 장치(100)는 평균 값 연산, 표준 편차 연산, 분산 값 연산, 총합 연산, 제N 백분위수 연산(N은 0이상 100이하의 정수) 등의 동작들을 수행함으로써 복수의 사용자 각각에 대한 유사도를 결정할 수 있다. 통계 분석을 위한 다양한 연산 방법에 관한 위 예시들은 설명을 위한 기재일 뿐 본 개시를 한정하지 않는다.In determining the degree of similarity of the first user 411 to the sample user group 430 in the present disclosure, the computing device 100 may utilize various statistical analysis methods other than the method based on the above-described “average error”. For example, the computing device 100 performs operations such as average value calculation, standard deviation calculation, variance value calculation, total sum calculation, and Nth percentile calculation (N is an integer of 0 or more and 100 or less) by performing operations such as each of a plurality of users. The degree of similarity can be determined. The above examples of various computational methods for statistical analysis are for explanation only and do not limit the present disclosure.

본 개시의 일 실시예에 따른 컴퓨팅 장치(100)는 제1 사용자(411)에 대한 1차 태그의 값과 표본 사용자 그룹(430)의 1차 태그 기준값 사이의 차이에 기초하여, 제1 사용자(411)와 표본 사용자 그룹(430) 사이의 유사도를 결정할 수 있다. 본 개시에 있어서 소정의 1차 태그에 대한 "1차 태그 기준값"이란, 해당 1차 태그에 대해 표본 사용자 각각이 가지는 1차 태그의 값에 기초하여 결정되는 값을 의미할 수 있다. 컴퓨팅 장치(100)는 표본 사용자 그룹(430)의 특징을 대표하는 값을 설정하기 위해 소정의 방법에 따라 표본 사용자 각각에 대한 1차 태그의 값에 기초하여 1차 태그 기준값을 결정할 수 있다. 일례로, 컴퓨팅 장치(100)는 표본 사용자 각각에 대한 1차 태그 값의 평균 값을 표본 사용자 그룹(430)의 1차 태그 기준값으로 결정할 수 있다. 다른 일례로, 컴퓨팅 장치(100)는 표본 사용자 각각에 대한 1차 태그 값의 중간(median) 값을 표본 사용자 그룹(430)의 1차 태그 기준값으로 결정할 수 있다. 다른 일례로, 컴퓨팅 장치(100)는 표본 사용자 각각에 대한 1차 태그 값의 최빈(mode) 값을 표본 사용자 그룹(430)의 1차 태그 기준값으로 결정할 수 있다. 전술한 1차 태그 기준값에 관한 예시들은 본 개시를 제한하지 않는다.Based on the difference between the value of the primary tag for the first user 411 and the reference value of the primary tag of the sample user group 430, the computing device 100 according to an embodiment of the present disclosure determines the first user ( 411) and the sample user group 430 may be determined. In the present disclosure, a “primary tag reference value” for a predetermined primary tag may mean a value determined based on a primary tag value of each sample user for the corresponding primary tag. The computing device 100 may determine the primary tag reference value based on the value of the primary tag for each sample user according to a predetermined method in order to set a value representing the characteristics of the sample user group 430 . For example, the computing device 100 may determine an average value of primary tag values for each sample user as the primary tag reference value of the sample user group 430 . As another example, the computing device 100 may determine a median value of primary tag values for each sample user as the primary tag reference value of the sample user group 430 . As another example, the computing device 100 may determine a mode value of the primary tag value for each sample user as the primary tag reference value of the sample user group 430 . Examples of the primary tag reference value described above do not limit the present disclosure.

본 개시의 일 실시예에 따른 컴퓨팅 장치(100)는 소정의 1차 태그에 대한 제1 사용자(411)의 1차 태그의 값과 표본 사용자 그룹(430)의 1차 태그 기준값을 비교하여, 그 차이 값이 작을수록 제1 사용자(411)에 대해 높은 유사도 값을 부여할 수 있다. 예를 들어, 제1 사용자(411) 및 복수의 사용자 중 다른 한 사용자(이하, 제2 사용자(412))가 유사도 결정의 대상이 되는 사용자이고, 특정 1차 태그에 대한 각 사용자의 1차 태그 값과 1차 태그 기준값이 "제1 사용자(411): 6, 제2 사용자(412): 1, 1차 태그 기준값: 9"와 같다고 가정한다. 이 때 컴퓨팅 장치(100)는 각 사용자의 1차 태그의 값과 1차 태그 기준값의 차이를 연산하고, 연산 결과에 따른 1차 태그 기준값과의 차이가 작을수록, 제1 사용자(411) 및 제2 사용자(412) 각각에 대해 높은 유사도를 부여할 수 있다. 위의 예시에서 제1 사용자(411)의 경우 "3", 제2 사용자(412)의 경우 "8"과 같이 1차 태그 기준값과의 차이가 연산 될 수 있다. 이 경우 컴퓨팅 장치(100)는 더 작은 차이를 갖는 제1 사용자(411)에 대해 제2 사용자(412)보다 높은 유사도를 부여할 수 있다.Computing device 100 according to an embodiment of the present disclosure compares the value of the primary tag of the first user 411 for a predetermined primary tag with the reference value of the primary tag of the sample user group 430, As the difference value is smaller, a higher similarity value may be assigned to the first user 411 . For example, a first user 411 and another user among a plurality of users (hereinafter referred to as a second user 412) are users who are subject to similarity determination, and each user's primary tag for a specific primary tag. It is assumed that the value and the primary tag reference value are "first user 411: 6, second user 412: 1, primary tag reference value: 9". At this time, the computing device 100 calculates the difference between the value of each user's primary tag and the primary tag reference value, and as the difference between the primary tag reference value and the primary tag reference value according to the calculation result is smaller, the first user 411 and A high degree of similarity may be assigned to each of the two users 412 . In the above example, the difference from the primary tag reference value may be calculated as “3” for the first user 411 and “8” for the second user 412 . In this case, the computing device 100 may assign a higher degree of similarity to the first user 411 having a smaller difference than the second user 412 .

도 5는 본 개시의 일 실시예에 따른 신경망 기반 학습 모델에 기초한 유사도 결정 과정을 설명하기 위한 도면이다.5 is a diagram for explaining a similarity determination process based on a neural network-based learning model according to an embodiment of the present disclosure.

본 개시의 일 실시예에서, 컴퓨팅 장치(100)는 신경망 기반 학습 모델(Neural-Network-based trained model, 550)에 기초하여, 복수의 사용자(예: 제1 사용자(411), 제2 사용자(412) 등) 각각과 표본 사용자 그룹(430) 사이의 유사도를 결정할 수 있다. 여기서 학습 모델(550)은 표본 사용자 그룹(430) 내 표본 사용자 각각에 대한 1차 태그 집합(531 등)에 기초하여 학습될 수 있다. 1차 태그 집합 각각은 하나 이상의 1차 태그 및 그에 대한 값을 포함할 수 있다. 일 실시예에서, 하나 이상의 메모리(320)는 신경망 기반 학습 모델을 저장할 수 있다.In one embodiment of the present disclosure, the computing device 100 is based on a neural-network-based trained model (550), a plurality of users (eg, the first user 411, the second user ( 412), etc.) and the similarity between each sample user group 430 may be determined. Here, the learning model 550 may be learned based on the primary tag set 531 and the like for each sample user in the sample user group 430 . Each primary tag set may include one or more primary tags and their values. In one embodiment, one or more memories 320 may store neural network-based learning models.

본 개시에서, 신경망 기반 학습 모델은 인간의 뇌 구조를 컴퓨터 상에서 구현하도록 설계될 수 있으며, 인간의 신경망의 뉴런(neuron)을 모의하며 가중치를 가지는 복수의 네트워크 노드들을 포함할 수 있다. 복수의 네트워크 노드들은 뉴런이 시냅스(synapse)를 통하여 신호를 주고 받는 뉴런의 시냅틱(synaptic) 활동을 모의하여, 서로 간의 연결 관계를 가질 수 있다. 신경망 기반 학습 모델에서 복수의 네트워크 노드들은 서로 다른 깊이의 레이어에 위치하면서 컨볼루션(convolution) 연결 관계에 따라 데이터를 주고 받을 수 있다. 신경망 기반 학습 모델은, 예를 들어, 인공 신경망 모델(Artificial Neural Network), 컨볼루션 신경망 모델(Convolution Neural Network) 등일 수 있다. 본 개시에서 신경망 기반 학습 모델은, 소정의 기계 학습 방식에 따라 학습된 모델로서, 학습되지 않은 모델에 포함된 적어도 하나의 네트워크 노드에 대한 가중치가 기계 학습에 의해 결정된 모델을 의미할 수 있다. 기계 학습은, 데이터 및 데이터를 처리한 경험을 이용한 학습을 통해, 컴퓨터 소프트웨어가 데이터 처리 능력을 향상시키는 것을 의미할 수 있다. 신경망 기반 학습 모델은 데이터 사이의 상관 관계를 모델링하여 구축된 것으로서, 그 상관 관계는 복수의 파라미터에 의해 표현될 수 있다. 신경망 기반 학습 모델은 주어진 데이터로부터 특징들을 추출하고 분석하여 데이터 간의 상관 관계를 도출하는데, 이러한 과정을 반복하여 신경망 기반 학습 모델의 파라미터를 최적화 해나가는 것이 기계 학습이라고 할 수 있다. 예를 들어, 신경망 기반 학습 모델은 입출력 쌍으로 주어지는 데이터에 대하여, 입력과 출력 사이의 매핑(상관 관계)을 학습할 수 있다. 또는 신경망 기반 학습 모델은 입력 데이터만 주어지는 경우에도 주어진 데이터 사이의 규칙성을 도출하여 그 관계를 학습할 수도 있다.In the present disclosure, a neural network-based learning model may be designed to implement a human brain structure on a computer, and may include a plurality of network nodes that simulate neurons of a human neural network and have weights. A plurality of network nodes may have a connection relationship between them by simulating synaptic activities of neurons that transmit and receive signals through synapses. In a neural network-based learning model, a plurality of network nodes may exchange data according to a convolutional connection relationship while being located in layers of different depths. The neural network-based learning model may be, for example, an artificial neural network model, a convolutional neural network model, and the like. In the present disclosure, a neural network-based learning model is a model learned according to a predetermined machine learning method, and may mean a model in which a weight for at least one network node included in an untrained model is determined by machine learning. Machine learning may mean that computer software improves data processing capabilities through learning using data and experiences of processing the data. A neural network-based learning model is constructed by modeling correlations between data, and the correlations may be expressed by a plurality of parameters. A neural network-based learning model derives a correlation between data by extracting and analyzing features from given data, and optimizing the parameters of a neural network-based learning model by repeating this process can be referred to as machine learning. For example, a neural network-based learning model may learn a mapping (correlation) between an input and an output with respect to data given as an input/output pair. Alternatively, the neural network-based learning model may learn the relationship by deriving a regularity between given data even when only input data is given.

이하 전술한 신경망 기반 학습 모델(550)을 이용하여 복수의 사용자 각각과 표본 사용자 그룹(430) 사이의 유사도를 결정하는 과정에 대해 설명한다. 이 과정은 다양한 실시예에 의해 수행될 수 있으나, 이하 제1 실시예 및 제2 실시예를 들어 설명한다. 그러나 두 실시예는 본 개시의 범위를 한정하지 않는다.Hereinafter, a process of determining the degree of similarity between each of a plurality of users and the sample user group 430 using the aforementioned neural network-based learning model 550 will be described. Although this process may be performed by various embodiments, the first and second embodiments will be described below. However, the two embodiments do not limit the scope of the present disclosure.

이하 제1 실시예에 대해 설명한다. 제1 실시예에 있어서, 학습 모델(550)은 복수의 표본 사용자 각각에 대한 1차 태그 집합을 입력 받고, 해당 복수의 1차 태그 집합의 공통적 특징을 학습할 수 있다. 일례에서, 학습 모델(550)은 복수의 표본 사용자 각각에 대한 로 데이터(즉, 사용자 정보(210) 및/또는 결제 정보(220))를 입력 받고, 해당 복수의 로 데이터들 사이에 존재하는 공통적 특징을 학습할 수도 있다. 다른 일례에서, 학습 모델(550)은 복수의 표본 사용자 각각에 대한 1차 태그 집합 및 로 데이터를 입력 받고, 해당 복수의 "1차 태그 집합 및 로 데이터"들 사이에 존재하는 공통적 특징을 학습할 수도 있다. 이하 제1 실시예에 대한 설명에 있어서는, 설명의 편의를 위해, 학습 모델(550)이 복수의 표본 사용자 각각에 대한 1차 태그 집합을 입력 받았다고 가정한다. 학습 모델(550)은 복수의 1차 태그 집합의 공통적 특징을 학습하기 위해, 입력 데이터(즉, 복수의 1차 태그 집합)의 차원을 감소시켜 중간 데이터를 생성하고, 생성된 중간 데이터의 차원을 다시 입력 데이터와 같은 차원을 가지는 출력 데이터로 복원하는 동작을 수행할 수 있다. 구체적으로, 학습을 위하여 입력되는 복수의 1차 태그 집합은 제1 표본 사용자의 1차 태그 집합(531), 제2 표본 사용자의 1차 태그 집합, ..., 제n 표본 사용자의 1차 태그 집합을 포함할 수 있다. 각 표본 사용자의 1차 태그 집합은 같은 종류의 1차 태그들을 포함할 수 있다. 1차 태그 집합이 K개의 1차 태그들을 포함하는 경우(K는 자연수), 1차 태그 집합은 K개의 원소를 갖는 벡터로 변환되어 학습 모델(550)에 입력될 수 있다. 여기서 해당 벡터의 K개의 원소들은 K개의 1차 태그들의 값 각각에 해당할 수 있다. 학습 모델(550)은, 적어도 하나의 네트워크 노드에 기초한 연산을 통해, 벡터의 크기가 K인 입력 데이터를 벡터의 크기가 L인 중간 데이터로 변환할 수 있다(L은 K보다 작은 자연수). 학습 모델(550)은 입력 데이터의 차원을 축소함에 따라 입력된 1차 태그 집합의 주요 정보를 압축하여 보존할 수 있다. 학습 모델(550)은 적어도 하나의 네트워크 노드에 기초한 연산을 통해, 벡터의 크기가 L인 중간 데이터로부터 벡터의 크기가 K인 출력 데이터를 복원할 수 있다. 학습 모델(550)은 입력 데이터와 출력 데이터 사이의 오차를 계산할 수 있다. 입력 데이터와 출력 데이터 사이의 오차가 작을수록, 학습 모델(550)은 복수의 학습 데이터(즉, 복수의 1차 태그 집합)의 공통적 특징을 잘 학습한 것으로 평가될 수 있다. 학습 모델(550)은 이러한 데이터 압축 및 복원 과정을 통해 학습되므로, 정답에 해당하는 학습 데이터(소정의 2차 사용자 특징을 가지는 것으로 미리 결정된 표본 사용자들의 1차 태그 집합들)만으로도, 2차 사용자 특징을 가지는 사용자들의 1차 사용자 특징들의 경향을 학습할 수 있다. 학습 모델(550)은, 상술한 데이터의 압축 및 복원 과정을 통해, 미리 결정된 표본 사용자들의 1차 태그 집합들에 관한 입력 데이터의 특징이 출력 데이터에 잘 보존되도록 학습될 수 있다. 즉, 학습 모델(550)은 입력 데이터와 출력 데이터 사이의 오차를 역전파(back-propagation)하는 동작에 기초하여, 입력 데이터를 압축하여 중간 데이터를 생성하는 과정 또는 중간 데이터를 복원하여 출력 데이터를 생성하는 과정에 관여하는 적어도 하나의 네트워크 노드에 대한 가중치를 갱신할 수 있다. 그 결과, 학습 모델(550)은 새롭게 입력된 입력 데이터가 미리 결정된 표본 사용자들의 1차 태그 집합에 대한 데이터와 유사할수록 "오차가 작음(즉, 유사도가 높음)"으로 판단할 수 있다.The first embodiment will be described below. In the first embodiment, the learning model 550 may receive primary tag sets for each of a plurality of sample users and learn common features of the plurality of primary tag sets. In one example, the learning model 550 receives raw data (ie, user information 210 and/or payment information 220) for each of a plurality of sample users, and obtains common data among the plurality of raw data. Features can also be learned. In another example, the learning model 550 receives a primary tag set and raw data for each of a plurality of sample users, and learns a common feature existing among the plurality of “primary tag sets and raw data”. may be In the following description of the first embodiment, for convenience of description, it is assumed that the learning model 550 receives a primary tag set for each of a plurality of sample users. The learning model 550 generates intermediate data by reducing the dimension of input data (ie, the plurality of primary tag sets) in order to learn common features of a plurality of primary tag sets, and converts the dimension of the generated intermediate data to Again, an operation of restoring output data having the same dimensions as the input data may be performed. Specifically, the plurality of primary tag sets input for learning include the primary tag set 531 of the first sample user, the primary tag set of the second sample user, ..., the primary tag of the nth sample user. may contain sets. The primary tag set of each sample user may include primary tags of the same type. When the primary tag set includes K primary tags (K is a natural number), the primary tag set may be converted into a vector having K elements and input to the learning model 550 . Here, K elements of the corresponding vector may correspond to values of K primary tags. The learning model 550 may convert input data having a vector size K into intermediate data having a vector size L (L is a natural number smaller than K) through an operation based on at least one network node. As the dimension of the input data is reduced, the learning model 550 may compress and preserve main information of the input primary tag set. The learning model 550 may restore output data having a vector size of K from intermediate data having a vector size of L through an operation based on at least one network node. The learning model 550 may calculate an error between input data and output data. As the error between the input data and the output data is smaller, it can be evaluated that the learning model 550 has learned well the common features of the plurality of learning data (ie, the plurality of primary tag sets). Since the learning model 550 is learned through such a data compression and restoration process, only the training data corresponding to the correct answer (primary tag sets of sample users pre-determined to have predetermined secondary user characteristics), secondary user characteristics It is possible to learn the tendency of primary user characteristics of users having . The learning model 550 may learn to well preserve the characteristics of input data of primary tag sets of predetermined sample users through the above-described data compression and restoration processes in output data. That is, the learning model 550 is a process of generating intermediate data by compressing input data or restoring intermediate data to obtain output data based on an operation of back-propagating an error between input data and output data. A weight for at least one network node involved in the creation process may be updated. As a result, the learning model 550 may determine that "the error is small (ie, the degree of similarity is high)" as the newly input input data is similar to the data for the primary tag set of predetermined sample users.

컴퓨팅 장치(100)는 제1 실시예에 기초하여 학습된 학습 모델(550)에 복수의 사용자 각각에 대한 하나 이상의 1차 태그(즉, 각 사용자의 1차 태그 집합)를 입력할 수 있다. 입력에 따라, 컴퓨팅 장치(100)는 복수의 사용자 각각과 표본 사용자 그룹(430) 사이의 유사도를 획득할 수 있다. 구체적으로, 컴퓨팅 장치(100)는 입력 데이터(즉, 제1 사용자(411)의 1차 태그 집합)를 학습 모델(550)에 입력할 수 있다. 학습 모델(550)은 적어도 하나의 네트워크 노드에 기초한 연산을 통해, 입력 데이터와 벡터 차원의 크기가 동일한 출력 데이터를 생성할 수 있다. 학습 모델(550)은 표본 사용자 그룹(430)의 표본 사용자 각각의 1차 태그 집합에 기초하여 학습된 바, 입력된 제1 사용자(411)의 1차 태그 집합이 표본 사용자 그룹(430)의 1차 태그 집합들과 유사할수록, 입력 데이터와 차이가 적은 출력 데이터를 생성할 수 있다. 학습 모델(550)은 이러한 입력 데이터와 출력 데이터 사이의 오차에 기초하여, 제1 사용자(411)와 표본 사용자 그룹(430) 사이의 유사도를 결정할 수 있다. 즉 학습 모델(550)은 입력 데이터와 출력 데이터의 오차가 작을수록 제1 사용자(411)에 대해 높은 유사도를 부여할 수 있다. 컴퓨팅 장치(100)는 학습 모델(550)로부터 제1 사용자(411)에 대한 유사도를 획득할 수 있다. 동일한 방식으로, 컴퓨팅 장치(100)는 학습 모델(550)로부터 복수의 사용자 각각에 대한 유사도를 획득할 수 있다.The computing device 100 may input one or more primary tags for each of a plurality of users (ie, a set of primary tags for each user) to the learning model 550 learned based on the first embodiment. According to the input, the computing device 100 may obtain a degree of similarity between each of the plurality of users and the sample user group 430 . Specifically, the computing device 100 may input input data (ie, a primary tag set of the first user 411 ) into the learning model 550 . The learning model 550 may generate output data having the same vector dimension size as input data through an operation based on at least one network node. Since the learning model 550 is learned based on the primary tag set of each sample user of the sample user group 430, the input primary tag set of the first user 411 is 1 of the sample user group 430. As the car tag sets are similar, output data having a smaller difference from input data can be generated. The learning model 550 may determine a degree of similarity between the first user 411 and the sample user group 430 based on the error between the input data and the output data. That is, the learning model 550 may assign a higher degree of similarity to the first user 411 as the difference between the input data and the output data is smaller. The computing device 100 may obtain a degree of similarity with respect to the first user 411 from the learning model 550 . In the same way, the computing device 100 may obtain a degree of similarity for each of a plurality of users from the learning model 550 .

이하 제2 실시예에 대해 설명한다. 제2 실시예에 있어서, 학습 모델(550)은 복수의 표본 사용자 각각에 대한 1차 태그 집합을 입력 받고, 군집화(clustering) 기법에 기초하여, 해당 복수의 1차 태그 집합의 공통적 특징을 학습할 수 있다. 일례에서, 학습 모델(550)은 복수의 표본 사용자 각각에 대한 로 데이터(즉, 사용자 정보(210) 및/또는 결제 정보(220))를 입력 받고, 군집화 기법에 기초하여, 해당 복수의 로 데이터들 사이에 존재하는 공통적 특징을 학습할 수 있다. 다른 일례에서, 학습 모델(550)은 복수의 표본 사용자 각각에 대한 1차 태그 집합 및 로 데이터를 입력 받고, 군집화 기법에 기초하여, 해당 복수의 "1차 태그 집합 및 로 데이터"들 사이에 존재하는 공통적 특징을 학습할 수 있다. 이하 제2 실시예에 대한 설명에 있어서는, 설명의 편의를 위해, 학습 모델(550)이 복수의 표본 사용자 각각에 대한 1차 태그 집합을 입력 받았다고 가정한다. 예를 들어, 학습 모델(550)은 K-means 군집화 기법에 기초하여 학습될 수 있다. 학습 모델(550)은 복수의 1차 태그 집합을 군집화하기 위해, 입력 데이터(즉, 복수의 표본 사용자의 1차 태그 집합들)를 N차원의 잠재 공간(latent space)에 각각 매핑할 수 있다(N은 자연수). 본 개시에 있어서 잠재 공간은, 학습 모델(550)에 입력된 입력 데이터가 표현될 수 있는 임의의 차원을 갖는 좌표계일 수 있다. 원래의 입력 데이터가 가지는 좌표 공간을 "관측 공간"이라고 할 때, 학습 모델(550)은 좌표계 변환을 위한 소정의 연산을 통해 입력 데이터를 관측 공간 상의 한 점에서 잠재 공간 상의 한 점으로 일대일 대응(즉, 매핑)시킬 수 있다. 전술한 바와 같이, 각 표본 사용자의 1차 태그 집합들은 서로 같은 종류의 1차 태그들을 포함할 수 있다. 컴퓨팅 장치(100)는 학습 모델(550)에 각 표본 사용자의 1차 태그 집합을 입력하기 위해, 해당 1차 태그 집합을 표현하는 입력 데이터를 생성할 수 있다. 예를 들어, 1차 태그 집합이 K개의 1차 태그를 포함하는 경우, 해당 1차 태그 집합은 각 K개의 1차 태그의 값들을 좌표 성분으로 가지는 입력 데이터(즉, K차원 좌표)로 변환될 수 있다. 학습 모델(550)은 적어도 하나의 네트워크 노드에 대한 가중치에 기초한 연산을 통해, 입력 데이터인 K차원 좌표를 N차원의 잠재 공간 내 한 좌표에 대응되는 N차원 좌표로 변환할 수 있다(N은 K보다 작거나 같은 자연수). 즉, 학습 모델(550)은 이러한 변환을 통해, 하나의 1차 태그 집합을 N차원 잠재 공간 상의 한 좌표에 매핑할 수 있다. 학습 모델(550)은 N차원 잠재 공간에 매핑된 각각의 데이터들이 소정의 기준점으로부터 사전 결정된 임계 거리 이상 멀어지지 않도록 학습될 수 있다. 여기서 기준점은 N차원 잠재 공간 내 존재하는 한 점으로서, 표본 사용자 그룹(430)에 포함된 임의의 한 표본 사용자가 갖는 1차 태그 집합에 대한 매핑 데이터이거나, 가상의 1차 태그 집합에 대한 매핑 데이터일 수 있다. 일례로, 기준점은 표본 사용자 그룹(430)을 결정하기 위해 사용된 소정의 기준을 나타내는 1차 태그 집합에 대한 매핑 데이터일 수 있다. N차원 잠재 공간에 매핑된 데이터와 기준점 사이의 거리는, 예를 들어, 유클리드 거리(Euclidean distance)로서 계산될 수 있다. 구체적으로, 잠재 공간의 차원은 3, 잠재 공간 내 기준점의 좌표가 (0,0,0), 사전 결정된 임계 거리가 1이고, 학습 모델(550)의 연산에 의한 한 사용자의 1차 태그 집합에 관한 매핑 데이터의 좌표는 (2,2,2)일 수 있다. 이 경우 기준점과 매핑 데이터의 거리는

이므로 사전 결정된 임계 거리 1을 초과하는 바, 학습 모델(550)은 학습을 위한 오차 역전파 과정에서, 거리의 차이 값(즉,

을 오차로 하여, 해당 매핑 데이터에 대한 적어도 하나의 네트워크 노드에 대한 가중치를 학습할 수 있다. 전술한 학습 모델(550)의 학습 방법에 관한 일부 구체적인 기재들은 발명의 설명을 위한 예시적 기재일 뿐, 본 개시를 한정하지 않는다.A second embodiment will be described below. In the second embodiment, the learning model 550 receives primary tag sets for each of a plurality of sample users, and learns common characteristics of the plurality of primary tag sets based on a clustering technique. can In one example, the learning model 550 receives raw data (ie, user information 210 and/or payment information 220) for each of a plurality of sample users, and based on a clustering technique, the plurality of raw data Common characteristics that exist among them can be learned. In another example, the learning model 550 receives a primary tag set and raw data for each of a plurality of sample users and, based on a clustering technique, exists among the plurality of “primary tag sets and raw data”. Common characteristics can be learned. In the following description of the second embodiment, for convenience of explanation, it is assumed that the learning model 550 receives a primary tag set for each of a plurality of sample users. For example, the learning model 550 may be trained based on a K-means clustering technique. The learning model 550 may map input data (ie, primary tag sets of a plurality of sample users) to an N-dimensional latent space, respectively, in order to cluster a plurality of primary tag sets ( N is a natural number). In the present disclosure, the latent space may be a coordinate system having an arbitrary dimension in which input data input to the learning model 550 can be expressed. When the coordinate space of the original input data is referred to as an “observation space,” the learning model 550 performs a one-to-one correspondence between a point on the observation space and a point on the latent space through a predetermined operation for converting the coordinate system ( that is, mapping). As described above, the primary tag sets of each sample user may include primary tags of the same type. The computing device 100 may generate input data representing the corresponding primary tag set in order to input the primary tag set of each sample user to the learning model 550 . For example, if the primary tag set includes K primary tags, the primary tag set is converted into input data (ie, K-dimensional coordinates) having values of each K primary tag as coordinate components. can The learning model 550 may convert K-dimensional coordinates, which are input data, into N-dimensional coordinates corresponding to one coordinate in an N-dimensional latent space through an operation based on a weight for at least one network node (where N is K natural number less than or equal to). That is, the learning model 550 may map one primary tag set to one coordinate in the N-dimensional latent space through such transformation. The learning model 550 may be trained so that each data mapped to the N-dimensional latent space is not farther than a predetermined threshold distance from a predetermined reference point. Here, the reference point is a point existing in the N-dimensional latent space, and is mapping data for a primary tag set of a sample user included in the sample user group 430 or mapping data for a virtual primary tag set. can be For example, the reference point may be mapping data for a primary tag set representing a predetermined criterion used to determine the sample user group 430 . The distance between data mapped in the N-dimensional latent space and a reference point may be calculated as, for example, a Euclidean distance. Specifically, the dimension of the latent space is 3, the coordinates of the reference point in the latent space are (0,0,0), the predetermined critical distance is 1, and the primary tag set of one user by the operation of the learning model 550 The coordinates of the mapping data may be (2,2,2). In this case, the distance between the reference point and the mapping data is

Since it exceeds the predetermined threshold distance 1, the learning model 550, in the error back-propagation process for learning, the difference value of the distance (i.e.,

A weight value for at least one network node for the corresponding mapping data may be learned by taking as an error. Some specific descriptions of the learning method of the learning model 550 described above are only exemplary descriptions for explanation of the invention, and do not limit the present disclosure.

컴퓨팅 장치(100)는 전술한 제2 실시예에 기초하여 학습된 학습 모델(550)에 복수의 사용자 각각에 대한 하나 이상의 1차 태그(즉, 각 사용자의 1차 태그 집합)을 입력할 수 있다. 입력에 따라, 컴퓨팅 장치(100)는 복수의 사용자 각각과 표본 사용자 그룹(430) 사이의 유사도를 획득할 수 있다. 구체적으로, 컴퓨팅 장치(100)는 입력 데이터(즉, 제1 사용자(411)에 대한 1차 태그 집합)를 학습 모델(550)에 입력할 수 있다. 학습 모델(550)은 적어도 하나의 네트워크 노드에 기초한 연산을 통해, 해당 입력 데이터 역시 N차원의 잠재 공간에 매핑할 수 있다. 복수의 표본 사용자의 1차 태그 집합들을 N차원 잠재 공간에 매핑함으로써 학습된 학습 모델(550)은, 매핑된 복수의 표본 사용자의 1차 태그 집합들의 N차원 잠재 공간 상 중심점을 산출할 수 있다(이 중심점 역시 N차원 좌표로 표현됨). 입력된 제1 사용자(411)의 1차 태그 집합이 표본 사용자 그룹(430)의 1차 태그 집합들과 유사할수록, 제1 사용자(411)의 1차 태그 집합은 N차원 공간 상에서 복수의 표본 사용자의 1차 태그 집합들의 중심점과 가까운 거리에 매핑될 수 있다. 학습 모델(550)은 제1 사용자(411)의 1차 태그 집합이 매핑된 좌표와 전술한 중심점과의 거리에 기초하여, 제1 사용자(411)와 표본 사용자 그룹(430) 사이의 유사도를 결정할 수 있다. 즉 학습 모델(550)은 N차원 공간 상 복수의 표본 사용자의 1차 태그 집합들의 중심점과 제1 사용자(411)의 1차 태그 집합 사이의 거리가 가까울수록 제1 사용자(411)에 대해 높은 유사도를 부여할 수 있다. 컴퓨팅 장치(100)는 학습 모델(550)로부터 제1 사용자(411)에 대한 유사도를 획득할 수 있다. 동일한 방식으로, 컴퓨팅 장치(100)는 학습 모델(550)로부터 복수의 사용자 각각에 대한 유사도를 획득할 수 있다.The computing device 100 may input one or more primary tags for each of a plurality of users (ie, a set of primary tags for each user) to the learning model 550 learned based on the second embodiment described above. . According to the input, the computing device 100 may obtain a degree of similarity between each of the plurality of users and the sample user group 430 . Specifically, the computing device 100 may input input data (ie, a primary tag set for the first user 411 ) into the learning model 550 . The learning model 550 may also map corresponding input data to an N-dimensional latent space through an operation based on at least one network node. The learning model 550 learned by mapping the primary tag sets of a plurality of sample users to the N-dimensional latent space may calculate a center point on the N-dimensional latent space of the mapped primary tag sets of the plurality of sample users ( This center point is also expressed in N-dimensional coordinates). As the input primary tag set of the first user 411 is similar to the primary tag sets of the sample user group 430, the primary tag set of the first user 411 becomes a plurality of sample users in the N-dimensional space. It can be mapped to a distance close to the center point of the primary tag sets of . The learning model 550 determines the degree of similarity between the first user 411 and the sample user group 430 based on the distance between the coordinates to which the primary tag set of the first user 411 is mapped and the aforementioned center point. can That is, the learning model 550 has a higher similarity to the first user 411 as the distance between the center point of the primary tag sets of the plurality of sample users in the N-dimensional space and the primary tag set of the first user 411 is closer. can be granted. The computing device 100 may obtain a degree of similarity with respect to the first user 411 from the learning model 550 . In the same way, the computing device 100 may obtain a degree of similarity for each of a plurality of users from the learning model 550 .

본 개시의 신경망 기반 학습 모델(550)은 컴퓨팅 장치(100)에 의해 학습되어 컴퓨팅 장치(100)의 메모리(320)에 저장될 수 있다. 또한, 본 개시의 신경망 기반 학습 모델(550)은 외부 서버 또는 다른 장치(110)에 의해 학습된 후 통신 회로(330)를 통해 컴퓨팅 장치(100)로 전달되어 메모리(320)에 저장될 수도 있다.The neural network-based learning model 550 of the present disclosure may be learned by the computing device 100 and stored in the memory 320 of the computing device 100 . In addition, the neural network-based learning model 550 of the present disclosure may be learned by an external server or other device 110 and then transmitted to the computing device 100 through the communication circuit 330 and stored in the memory 320. .

도 6은 본 개시의 일 실시예에 따른 유사도에 기초한 2차 태그 결정 과정을 설명하는 도면이다. 전술한 바와 같이, 2차 태그는 표본 사용자 그룹과 관련된 2차 사용자 특징을 지시할 수 있다. 본 개시의 일 실시예에 따른 컴퓨팅 장치(100)는 복수의 사용자 각각과 표본 사용자 그룹 사이에서 결정된 유사도에 기초하여, 복수의 사용자 각각에 대해 2차 태그를 생성할 수 있다. 본 개시에 있어서 컴퓨팅 장치(100)가 생성하는 2차 태그는 해당 태그에 대한 값을 가질 수 있다. 즉, 컴퓨팅 장치(100)는 복수의 사용자 각각에 대해 2차 태그를 생성하고, 해당 2차 태그에 대응되는 2차 태그의 값을 결정할 수 있다.6 is a diagram illustrating a process of determining a secondary tag based on similarity according to an embodiment of the present disclosure. As mentioned above, secondary tags may indicate secondary user characteristics associated with a sample group of users. The computing device 100 according to an embodiment of the present disclosure may generate a secondary tag for each of the plurality of users based on the degree of similarity determined between each of the plurality of users and the sample user group. In the present disclosure, a secondary tag generated by the computing device 100 may have a corresponding tag value. That is, the computing device 100 may generate a secondary tag for each of a plurality of users and determine a value of a secondary tag corresponding to the corresponding secondary tag.

본 개시의 일 실시예에 따라 컴퓨팅 장치(100)는 복수의 사용자 중 유사도가 높은 순으로 임의의 수의 사용자들을 결정할 수 있다. 이 때, 임의의 수는 2차 태그의 값을 결정하기 위해 임시적으로 설정되는 수로서, 2차 태그 값 결정 과정에서 변경될 수 있다. 일례에서, 임의의 수는 마케팅 대상으로 삼고자 하는 잠재 고객의 수를 고려하여 설정될 수 있다. 예를 들어 임의의 수는 잠재 고객 수(예: 20,000명)를 약분한 수(예: 1,000명)로 설정된 후, 이후 2차 태그 값 결정 과정에서 더 큰 수(예: 2,000명, 3,000명 ...)로 증가되어 갈 수 있다. 또한 예를 들어, 임의의 수는 잠재 고객 수의 배수로 설정될 수도 있다. 컴퓨팅 장치(100)는 유사도가 높은 순으로 결정된 임의의 수의 사용자들 중 표본 사용자의 비율을 결정하고, 결정된 비율에 기초하여, 복수의 사용자 각각에 대한 2차 태그의 값을 결정할 수 있다.According to an embodiment of the present disclosure, the computing device 100 may determine an arbitrary number of users from among a plurality of users in an order of high similarity. At this time, the arbitrary number is a number temporarily set to determine the value of the secondary tag, and may be changed in the process of determining the value of the secondary tag. In one example, an arbitrary number may be set considering the number of potential customers to be marketed to. For example, a random number is set to a smaller number (eg, 1,000) of the number of potential customers (eg, 20,000), then a larger number (eg, 2,000, 3,000 . ..) can be increased. Also, for example, an arbitrary number may be set as a multiple of the number of potential customers. The computing device 100 may determine a ratio of a sample user among a random number of users determined in order of similarity, and determine a value of a secondary tag for each of a plurality of users based on the determined ratio.

도 6에 도시된 레이블 집합(600)은 복수의 레이블들을 포함할 수 있다. 각각의 레이블은 복수의 사용자 각각과 일대일 대응될 수 있다. 레이블 집합(600) 내의 각 레이블들은, 해당 레이블에 대응되는 사용자와 표본 사용자 그룹 사이의 유사도에 기초하여, 유사도가 높은 순으로 나열될 수 있다. 즉 레이블 집합(600)에 포함된 복수의 레이블들은, 레이블 각각에 대응되는 사용자 번호가 증가할수록 레이블에 대응되는 사용자와 표본 사용자 그룹 사이의 유사도가 감소하도록 나열될 수 있다. 표본 사용자 그룹은 복수의 사용자 내에서 결정되므로, 레이블 집합(600)에 포함된 일부 레이블들은 표본 사용자와 대응될 수 있다. 도 6에서, 표본 사용자와 대응되는 레이블들(예: 사용자 #1, 사용자 #2, 사용자 #1000, 사용자 #1600 등)은 음영으로 표시되어 있고, 표본 사용자가 아닌 사용자와 대응되는 레이블들(예: 사용자 #4, 사용자 #5, 사용자 #1595, 사용자 #1602 등)은 음영 없이 표시되어 있다.The label set 600 shown in FIG. 6 may include a plurality of labels. Each label may have a one-to-one correspondence with each of a plurality of users. Each label in the label set 600 may be listed in order of high similarity based on a similarity between a user corresponding to the corresponding label and a sample user group. That is, the plurality of labels included in the label set 600 may be arranged so that the similarity between the user corresponding to the label and the sample user group decreases as the user number corresponding to each label increases. Since the sample user group is determined within a plurality of users, some labels included in the label set 600 may correspond to the sample user. In FIG. 6, labels corresponding to sample users (eg, user #1, user #2, user #1000, user #1600, etc.) are shaded, and labels corresponding to users other than sample users (eg, user #1000, user #1600, etc.) : User #4, User #5, User #1595, User #1602, etc.) are displayed without shading.

본 개시의 일 실시예에 따라 컴퓨팅 장치(100)는 복수의 사용자 각각에 대해 "2차 사용자 특징을 가지는지 여부"를 나타내는 2차 태그 값을 결정할 수 있다. 2차 사용자 특징을 가지는지 여부를 나타내는 2차 태그 값은, 예를 들어 이진(binary) 분류를 위한 태그 값일 수 있으며 예를 들어, 2차 사용자 특징을 가질 경우 "1", 2차 사용자 특징을 가지지 않을 경우 "0" 로 결정될 수 있다. 컴퓨팅 장치(100)는 복수의 사용자 중 유사도가 높은 순서대로 임의의 수의 사용자들을 결정할 수 있다. 예를 들어 임의의 사용자 수가 "1000"인 경우, 컴퓨팅 장치(100)는 유사도가 높은 순으로 1000명의 사용자들을 결정할 수 있다. 도 6의 참조 번호 611은 사용자 #1000에 대응되는 레이블을 지시한다. 컴퓨팅 장치(100)는 임의의 수의 사용자들 중 표본 사용자의 비율을 결정할 수 있다. 1000명의 사용자들 중 표본 사용자가 900명이라고 가정할 때, 컴퓨팅 장치(100)는 표본 사용자 비율을 90%로 결정할 수 있다.According to an embodiment of the present disclosure, the computing device 100 may determine a secondary tag value representing “whether or not a secondary user has a characteristic” for each of a plurality of users. The secondary tag value indicating whether or not the secondary user characteristic is present may be, for example, a tag value for binary classification. If not, it may be determined as "0". The computing device 100 may determine an arbitrary number of users in the order of high similarity among the plurality of users. For example, if the number of arbitrary users is “1000”, the computing device 100 may determine 1000 users in order of similarity. Reference number 611 in FIG. 6 indicates a label corresponding to user #1000. Computing device 100 may determine a percentage of a sample user out of an arbitrary number of users. Assuming that 900 sample users are among 1000 users, the computing device 100 may determine the sample user ratio as 90%.

컴퓨팅 장치(100)는 이와 같이 결정되는 표본 사용자 비율에 기초하여 복수의 사용자 각각에 대해 2차 태그의 값을 결정할 수 있다. 컴퓨팅 장치(100)는 결정된 표본 사용자 비율과 사전 결정된 임계 비율을 비교하여 복수의 사용자 각각에 대해 2차 태그의 값을 결정할 수 있다. 본 개시에 따른 이하 다양한 실시예들의 설명을 위해 임의의 사용자 수를 N, N명의 사용자들 중 표본 사용자의 비율을 x%, 사전 결정된 임계 비율을 y%라고 가정한다. 이 때, x가 y보다 크거나 같으면, 컴퓨팅 장치(100)는 복수의 사용자들 중 유사도 기준 상위 N명의 사용자들에 대해 2차 사용자 특징이 존재함을 나타내는 2차 태그 값을 부여하고, 복수의 사용자들 중 유사도 기준 상위 N명이 아닌 나머지 사용자들에 대해 2차 사용자 특징이 부존재함을 나타내는 2차 태그 값을 부여할 수 있다. 컴퓨팅 장치(100)는 임의의 사용자 수(N)를 순차적으로 증가시키는 동작에 추가적으로 기초하여 복수의 사용자 각각에 대한 2차 태그 값을 결정할 수 있다. 컴퓨팅 장치(100)는 증가된 임의의 사용자 수(N)에 기초하여, 복수의 사용자 중 유사도가 높은 순으로 N명의 사용자들을 결정하고, 결정된 N명의 사용자들 중 표본 사용자의 비율(x%)을 결정할 수 있다. 컴퓨팅 장치(100)는 표본 사용자의 비율(x%)과 사전 결정된 임계 비율(y%)을 비교하여 표본 사용자의 비율(x%)이 사전 결정된 임계 비율(y%)보다 최초로 작아지는 임의의 사용자 수(N)를 결정할 수 있다. 컴퓨팅 장치(100)는 결정된 사용자 수(N)에 따라, 복수의 사용자 중, 유사도 기준 상위 N명에 대해 2차 사용자 특징이 존재함을 나타내는 2차 태그 값을 부여하고, 나머지 사용자에 대해 2차 사용자 특징이 부존재함을 나타내는 2차 태그 값을 부여할 수 있다. 예를 들어 임계 비율이 50%이고, 유사도 기준 상위 600명의 사용자 중 300명이 표본 사용자, 나머지 300명이 표본 사용자가 아닌 사용자이어서 해당 600명의 사용자 중 표본 사용자의 비율이 50%인 경우, 해당 600명의 사용자에는 "1"의 2차 태그 값을 부여하고, 나머지 유사도 기준 600등 이하의 사용자에는 "0"의 2차 태그 값을 부여할 수 있다.The computing device 100 may determine the value of the secondary tag for each of the plurality of users based on the sample user ratio determined as described above. The computing device 100 may determine the value of the secondary tag for each of the plurality of users by comparing the determined sample user ratio with a predetermined threshold ratio. For description of various embodiments according to the present disclosure, it is assumed that an arbitrary number of users is N, a ratio of a sample user among N users is x%, and a predetermined threshold ratio is y%. At this time, if x is greater than or equal to y, the computing device 100 assigns a secondary tag value indicating that secondary user characteristics exist to the top N users based on similarity among the plurality of users, and A secondary tag value representing non-existence of secondary user characteristics may be assigned to the remaining users other than the top N users among users based on similarity. The computing device 100 may determine a secondary tag value for each of a plurality of users additionally based on an operation of sequentially increasing the number N of arbitrary users. The computing device 100 determines N users in order of similarity among a plurality of users based on the increased number of random users (N), and calculates a ratio (x%) of sample users among the determined N users. can decide The computing device 100 compares the sample user ratio (x%) with a predetermined threshold ratio (y%), and compares the sample user ratio (x%) to a random user whose ratio (x%) is less than the predetermined threshold ratio (y%) for the first time. number (N) can be determined. According to the determined number of users (N), the computing device 100 assigns a secondary tag value indicating that a secondary user characteristic exists to the top N users based on similarity among the plurality of users, and assigns a secondary tag value to the remaining users. A secondary tag value representing non-existence of user characteristics may be assigned. For example, if the threshold rate is 50%, 300 of the top 600 users based on similarity are sample users, and the remaining 300 are non-sample users, so the percentage of sample users out of the 600 users is 50%, then the 600 users A secondary tag value of “1” may be assigned to the user, and a secondary tag value of “0” may be assigned to the remaining users of 600 or less based on similarity.

본 개시의 일 실시예에 따라 컴퓨팅 장치(100)는 복수의 사용자 각각에 대해 "2차 사용자 특징에 부합하는 정도"를 나타내는 2차 태그 값을 결정할 수도 있다. 2차 태그의 값은 2차 사용자 특징에 부합하는 정도에 따라 사용자들을 하나 이상의 분류로 구분하기 위한 값일 수 있다. 예를 들어, 2차 사용자 특징에 부합하는 정도에 따라 소정의 최대값(예: 10) 및 최소값(예: 1) 사이에서 2차 태그 값이 차등적으로 결정될 수 있다. 본 개시에 따른 컴퓨팅 장치(100)가 복수의 사용자 각각에 대해 "2차 사용자 특징에 부합하는 정도"를 나타내는 2차 태그 값을 결정하는 과정은, 전술한 "2차 사용자 특징을 가지는지 여부"를 나타내는 2차 태그 값 결정 과정을 2회 이상 수행하는 것과 동일 또는 유사하게 이해될 수 있다.According to an embodiment of the present disclosure, the computing device 100 may determine a secondary tag value indicating "a degree of conformity to secondary user characteristics" for each of a plurality of users. The value of the secondary tag may be a value for classifying users into one or more categories according to the degree of matching with secondary user characteristics. For example, secondary tag values may be differentially determined between a predetermined maximum value (eg, 10) and a predetermined minimum value (eg, 1) according to a degree of conformity to secondary user characteristics. The process of determining, by the computing device 100 according to the present disclosure, a secondary tag value representing “the degree to which a secondary user characteristic is matched” for each of a plurality of users is “whether or not the user has the secondary user characteristic” described above. It can be understood as the same as or similar to performing the process of determining the secondary tag value indicating 2 or more times.

구체적으로, 컴퓨팅 장치(100)는 복수의 사용자 중 유사도가 높은 순으로 임의의 수의 사용자를 결정할 수 있다. 컴퓨팅 장치(100)는 "2차 사용자 특징에 부합하는 정도"를 나타내는 2차 태그 값의 결정을 위해 임의의 사용자 수를 순차적으로 증가시킬 수 있다. 컴퓨팅 장치(100)는 임의의 사용자들 중 표본 사용자의 비율을 결정하고, 결정된 표본 사용자 비율을 복수의 사전 결정된 임계 비율과 비교하여 복수의 사용자 각각에 대한 2차 태그 값을 결정할 수 있다.Specifically, the computing device 100 may determine an arbitrary number of users in order of high similarity among the plurality of users. The computing device 100 may sequentially increase the number of users in order to determine a secondary tag value indicating "a degree of conformity to secondary user characteristics". The computing device 100 may determine a ratio of a sample user among random users and compare the determined sample user ratio with a plurality of predetermined threshold ratios to determine a secondary tag value for each of a plurality of users.

본 개시의 일 실시예에 따른 컴퓨팅 장치(100)의 동작을 설명하기 위해 복수의 사전 결정된 임계 비율은 "제1 임계 비율 - 90%", "제2 임계 비율 - 80%"을 포함한다고 가정한다. 컴퓨팅 장치(100)는 임의의 사용자 M1 명의 사용자에 포함된 표본 사용자 비율이 90% 이상일 경우, 표본 사용자 비율이 제1 임계 비율 이상이므로, 유사도 기준 상위 M1 명에 대한 2차 태그 값을 "10"으로 결정할 수 있다. 그 후, 컴퓨팅 장치(100)는 2차 태그 값 결정을 위해 임의의 사용자 수의 크기를 M1보다 큰 값으로 결정할 수 있다. 컴퓨팅 장치(100)는 M2명(M2는 M1보다 큰 자연수)의 사용자에 포함된 표본 사용자 비율이 90% 미만 80% 이상일 경우, 표본 사용자 비율이 제1 임계 비율 미만 제2 임계 비율 이상이므로, 유사도 기준 상위 M2 명 중 상위 M1 명을 제외한 나머지 사용자에 대한 2차 태그 값을 "9"로 결정할 수 있다.In order to describe the operation of the computing device 100 according to an embodiment of the present disclosure, it is assumed that the plurality of predetermined critical ratios include "first critical ratio - 90%" and "second critical ratio - 80%". . When the sample user ratio included in the random user M1 users is 90% or more, the computing device 100 sets the secondary tag value for the top M1 people based on similarity to “10” because the sample user ratio is equal to or greater than the first threshold ratio. can be determined by After that, the computing device 100 may determine the size of the number of arbitrary users as a value greater than M1 in order to determine the secondary tag value. The computing device 100 calculates similarity when the sample user ratio included in M2 users (M2 is a natural number greater than M1) is less than 90% and greater than or equal to 80%, since the sample user ratio is less than the first threshold ratio and greater than or equal to the second threshold ratio. A secondary tag value for the remaining users excluding the top M1 among the top M2 people may be determined to be "9".

예를 들어 도 6에서, 사용자 #1부터 사용자 #1000까지의 사용자들에 포함된 표본 사용자의 수는 900, 사용자 #1부터 사용자 #1600까지의 사용자들에 포함된 표본 사용자의 수는 1280이라고 가정한다. 추가적으로, 복수의 사전 결정된 임계 비율은 제1 임계 비율과 제2 임계 비율을 포함하고, 제1 임계 비율 및 제2 임계 비율은 각각 90% 및 80%로 결정된 것으로 가정한다. 컴퓨팅 장치(100)는 임의의 사용자 수를 "1000"으로 결정할 수 있다. 컴퓨팅 장치(100)는 1000명의 사용자들 중 표본 사용자의 비율을 결정할 수 있다. 컴퓨팅 장치(100)는 사용자 #1000에 대응되는 레이블(611)보다 순위가 같거나 높은 레이블들 중 표본 사용자와 대응되는 레이블들(예: 사용자 #1, #2, #6, #997, #1000 등)의 개수를 카운트(count)하여, 1000명의 사용자들 중 표본 사용자의 비율을 결정할 수 있다. 가정에 따라, 1000명의 사용자들 중 표본 사용자의 수는 900이므로, 1000명의 사용자들 중 표본 사용자의 비율은 90%일 수 있다. 컴퓨팅 장치(100)는 1000명의 사용자들 중 표본 사용자의 비율(90%)이 제1 임계 비율(90%) 이상이므로, 복수의 사용자들 중 유사도 기준 상위 1000명의 사용자들(사용자 #1부터 사용자 #1000까지)에 대한 2차 태그 값을 "10"으로 결정할 수 있다.For example, in FIG. 6, it is assumed that the number of sample users included in users from user #1 to user #1000 is 900 and the number of sample users included in users from user #1 to user #1600 is 1280. do. Additionally, it is assumed that the plurality of predetermined critical ratios include a first critical ratio and a second critical ratio, and the first critical ratio and the second critical ratio are determined to be 90% and 80%, respectively. The computing device 100 may determine an arbitrary number of users as “1000”. Computing device 100 may determine a percentage of sample users out of 1000 users. The computing device 100 selects labels corresponding to sample users (eg, users #1, #2, #6, #997, and #1000) among labels having the same or higher rank than the label 611 corresponding to user #1000. etc.) can be counted to determine the ratio of sample users among 1000 users. According to the assumption, since the number of sample users among 1000 users is 900, the proportion of sample users among 1000 users may be 90%. Since the ratio (90%) of sample users among 1000 users is equal to or greater than the first threshold ratio (90%), the computing device 100 determines the top 1000 users (user #1 to user #) based on similarity among a plurality of users. up to 1000) can be determined as "10".

계속해서, 컴퓨팅 장치(100)는 임의의 사용자 수를 "1000"보다 큰 값으로 결정할 수 있다. 컴퓨팅 장치(100)는 임의의 사용자 수를 "1600"으로 결정할 수 있다. 컴퓨팅 장치(100)는 1600명의 사용자들 중 표본 사용자의 비율을 결정할 수 있다. 컴퓨팅 장치(100)는 사용자 #1600에 대응되는 레이블(613)보다 순위가 같거나 높은 레이블들 중 표본 사용자와 대응되는 레이블들(예: 사용자 #1, #2, #6, #997, #1000, #1594, #1599, #1600 등)의 개수를 카운트하여, 1600명의 사용자들 중 표본 사용자의 비율을 결정할 수 있다. 상술한 가정에 의해 1600명의 사용자들 중 표본 사용자의 수는 1280이므로, 1600명의 사용자들 중 표본 사용자의 비율은 80%일 수 있다. 컴퓨팅 장치(100)는 1600명의 사용자들 중 표본 사용자의 비율(80%)이 제1 임계 비율(90%) 미만 제2 임계 비율(80%) 이상이므로, 복수의 사용자들 중 기 2차 태그 값이 결정된 유사도 기준 상위 1000명의 사용자들(사용자 #1부터 사용자 #1000까지)을 제외한 나머지 사용자들(사용자 #1001부터 사용자 #1600까지)에 대한 2차 태그 값을 "9"로 결정할 수 있다. 컴퓨팅 장치(100)는 위와 같은 방식을 반복하여 복수의 사용자들을 둘 이상의 그룹으로 나누고 각 그룹마다 상응하는 2차 태그 값을 부여할 수 있다.Continuing, the computing device 100 may determine the number of arbitrary users as a value greater than “1000”. The computing device 100 may determine an arbitrary number of users as “1600”. Computing device 100 may determine a percentage of the sample users out of the 1600 users. The computing device 100 selects labels corresponding to sample users (eg, users #1, #2, #6, #997, and #1000) among labels having the same or higher rank than the label 613 corresponding to user #1600. , #1594, #1599, #1600, etc.) may be counted to determine a sample user ratio among 1600 users. According to the above assumption, the number of sample users among 1600 users is 1280, so the ratio of sample users among 1600 users may be 80%. The computing device 100 determines that the secondary tag value among the plurality of users is determined because the ratio (80%) of the sample users among the 1600 users is less than the first threshold ratio (90%) and is greater than or equal to the second threshold ratio (80%). Secondary tag values for the remaining users (users #1001 to user #1600) excluding the top 1000 users (users #1 to user #1000) based on the determined similarity may be determined to be “9”. The computing device 100 may divide the plurality of users into two or more groups by repeating the above method and assign a corresponding secondary tag value to each group.

본 개시의 일 실시예에 따른 컴퓨팅 장치(100)의 동작을 설명하기 위해 복수의 사전 결정된 임계 비율은 "제1 임계 비율 - 90%", "제2 임계 비율 - 80%", "제3 임계 비율 - 70%", "제4 임계 비율 - 60%", 및 "제5 임계 비율 - 50%"을 포함한다고 가정한다. 복수의 임의 사용자 수를 각각 "N_1", "N_2", "N_3", "N_4", "N_5"와 같이 표현할 때, 복수의 임의 사용자 수는 "N_1 < N_2 < N_3 < N_4 < N_5"와 대소 관계를 가질 수 있다. 컴퓨팅 장치(100)는 임의의 사용자 수를 1부터 단조 증가시켜 각각의 사용자 수에 있어서 표본 사용자 비율을 결정할 수 있다. 컴퓨팅 장치(100)는 유사도 기준 상위 N_1 명의 사용자에 포함된 표본 사용자 비율이 90% 이상인 경우, 복수의 사용자 중 유사도 기준 상위 N_1명에 대해 2차 태그 값을 "10"으로 결정할 수 있다. 컴퓨팅 장치(100)는 유사도 기준 상위 N_2 명의 사용자에 포함된 표본 사용자 비율이 80% 이상인 경우, 복수의 사용자 중 유사도 기준 상위 N_2 명을 선택하고, 그 중에서 유사도 기준 상위 N_1 명을 제외한 나머지 사용자에 대해 2차 태그 값을 "9"로 결정할 수 있다. 컴퓨팅 장치(100)는 유사도 기준 상위 N_3 명의 사용자에 포함된 표본 사용자 비율이 70% 이상인 경우, 복수의 사용자 중 유사도 기준 상위 N_3 명을 선택하고, 그 중에서 유사도 기준 상위 N_2 명을 제외한 나머지 사용자에 대해 2차 태그 값을 "8"로 결정할 수 있다. 컴퓨팅 장치(100)는 유사도 기준 상위 N_4 명의 사용자에 포함된 표본 사용자 비율이 60% 이상인 경우, 복수의 사용자 중 유사도 기준 상위 N_4 명을 선택하고, 그 중에서 유사도 기준 상위 N_3 명을 제외한 나머지 사용자에 대해 2차 태그 값을 "7"로 결정할 수 있다. 컴퓨팅 장치(100)는 유사도 기준 상위 N_5 명의 사용자에 포함된 표본 사용자 비율이 50% 이상인 경우, 복수의 사용자 중 유사도 기준 상위 N_5 명을 선택하고, 그 중에서 유사도 기준 상위 N_4 명을 제외한 나머지 사용자에 대해 2차 태그 값을 "6"으로 결정할 수 있다.In order to describe the operation of the computing device 100 according to an embodiment of the present disclosure, a plurality of predetermined threshold ratios are "first threshold ratio - 90%", "second threshold ratio - 80%", "third threshold ratio" ratio - 70%", "fourth threshold ratio - 60%", and "fifth threshold ratio - 50%". When the number of random users is expressed as "N_1", "N_2", "N_3", "N_4", and "N_5", respectively, the number of random users is equivalent to "N_1 < N_2 < N_3 < N_4 < N_5" can have a relationship. The computing device 100 may monotonically increase a certain number of users from 1 to determine a sample user ratio for each number of users. When the ratio of sample users included in the top N_1 users based on similarity is 90% or more, the computing device 100 may determine a secondary tag value of “10” for the top N_1 users based on similarity among the plurality of users. When the ratio of sample users included in the top N_2 users based on similarity is 80% or more, the computing device 100 selects the top N_2 users based on similarity among the plurality of users, and among the users other than the top N_1 users based on similarity, the computing device 100 The secondary tag value can be determined as "9". When the proportion of sample users included in the top N_3 users based on similarity is 70% or more, the computing device 100 selects the top N_3 users based on similarity among a plurality of users, and among the users except for the top N_2 users based on similarity, the computing device 100 The secondary tag value can be determined as "8". When the proportion of sample users included in the top N_4 users based on similarity is 60% or more, the computing device 100 selects the top N_4 users based on similarity among the plurality of users, and among the users except for the top N_3 users based on similarity, the computing device 100 The secondary tag value can be determined as "7". When the ratio of sample users included in the top N_5 users based on similarity is 50% or more, the computing device 100 selects the top N_5 users based on similarity among the plurality of users, and among the users other than the top N_4 users based on similarity, the computing device 100 The secondary tag value can be determined as "6".

컴퓨팅 장치(100)는 복수의 사용자 중 유사도 기준 상위 N_5 명을 제외한 나머지 사용자에 대해서도 2차 태그 값을 결정할 수 있다. 이 때 유사도 기준 상위 N_5 명을 제외한 나머지 사용자들에 대해서는, 전술한 방식과는 달리, 나머지 사용자의 수를 둘 이상의 균등 구간으로 분할하여 각 구간에 따라 서로 다른 2차 태그 값을 결정하는 방식을 취할 수 있다. 예를 들어 유사도 기준 상위 N_5 명을 제외한 나머지 사용자가 100명일 경우, 컴퓨팅 장치(100)는 해당 나머지 사용자 100명 중에서도 다시 상위 100명에 대해서는 2차 태그 값으로 "5"를 부여하고, 차상위 100명에 대해서는 2차 태그 값으로 "4"를 부여하는 등의 방식으로 2차 태그 값을 결정할 수 있다.The computing device 100 may also determine secondary tag values for the remaining users excluding the top N_5 users based on similarity among the plurality of users. At this time, for the remaining users excluding the top N_5 users based on similarity, unlike the above method, a method of dividing the number of remaining users into two or more equal sections and determining different secondary tag values according to each section is taken. can For example, if there are 100 users excluding the top N_5 users based on similarity, the computing device 100 assigns "5" as a secondary tag value to the top 100 users among the remaining 100 users, and assigns "5" to the next top 100 users. For , the secondary tag value can be determined in a manner such as giving "4" as the secondary tag value.

본 개시의 다양한 실시예들에 따르면, 컴퓨팅 장치(100)는 복수의 사용자에 대한 로 데이터로부터, 정량화된 단계를 통해 복수의 사용자 각각에 대해 2차 태그를 생성할 수 있다. 복수의 사용자에 대한 로 데이터는 개별 데이터 관점에서 정보의 밀도가 낮아 그 자체로 의미있는 정보를 전달하기 힘들 수 있다. 본 개시는 이러한 로 데이터들에 대한 수치 해석을 통해 일정 수준의 의미를 갖는 "1차 사용자 특징"에 관한 1차 태그를 생성하는 방법을 개시한다. 또한 본 개시는 복수의 "1차 사용자 특징"을 지시하는 1차 태그에 기초하여 "2차 사용자 특징"을 나타내는 2차 태그를 생성하는 방법을 구체적으로 개시함으로써, 방대한 양의 데이터들로부터 적어도 둘 이상의 연산 단계를 통해 주요 정보를 효과적으로 추출하는 방법을 개시한다.According to various embodiments of the present disclosure, the computing device 100 may generate a secondary tag for each of the plurality of users from raw data of the plurality of users through quantified steps. Raw data for a plurality of users may have low density of information in terms of individual data, so it may be difficult to convey meaningful information by itself. The present disclosure discloses a method of generating a primary tag related to “primary user characteristics” having a certain level of meaning through numerical analysis of such raw data. In addition, the present disclosure specifically discloses a method of generating a secondary tag representing a “second user characteristic” based on a plurality of primary tags indicating a plurality of “primary user characteristics,” thereby providing at least two tags from a vast amount of data. A method for effectively extracting key information through the above calculation steps is disclosed.

본 개시의 다양한 실시예들에 따르면, 컴퓨팅 장치(100)는 각 사용자가 갖는 1차 태그 집합과 표본 사용자 그룹에 포함된 표본 사용자가 갖는 1차 태그 집합의 유사한 정도에 기초하여 각 사용자에 대해 2차 태그를 부여하는 방법을 개시함으로써, 목표 고객 집단과 가장 유사한 패턴을 가지는 잠재 고객 집단을 효과적으로 탐색하는 방법을 개시한다. 또한, 컴퓨팅 장치(100)가 복수의 사용자에 대해 표본 사용자 그룹의 2차 사용자 특징에 부합하는 정도를 나타내는 2차 태그 값을 결정할 경우, 컴퓨팅 장치(100)는 복수의 사용자를 둘 이상의 그룹으로 나누어 각 그룹별로 서로 다른 크기의 2차 태그 값(예: 10, 7, 3 등)을 결정함으로써 2차 태그 값에 따라 복수의 사용자들 사이에 마케팅 우선순위를 결정할 수 있다. 이와 같이 본 개시는 따라 복수의 사용자들 사이에서 마케팅 대상 우선순위를 결정하는 방법을 개시함으로써 마케팅의 효율을 증대시키는 한편, 마케팅 과정에 소요되는 자원의 낭비를 줄일 수 있다.According to various embodiments of the present disclosure, the computing device 100 generates secondary tags for each user based on a degree of similarity between the primary tag set of each user and the primary tag set of sample users included in the sample user group. By disclosing a method of assigning a tag, a method of effectively searching for a potential customer group having the most similar pattern to a target customer group is disclosed. In addition, when the computing device 100 determines a secondary tag value indicating the degree to which the plurality of users match the secondary user characteristics of the sample user group, the computing device 100 divides the plurality of users into two or more groups. By determining secondary tag values (eg, 10, 7, 3, etc.) of different sizes for each group, marketing priority among a plurality of users can be determined according to the secondary tag value. As described above, according to the present disclosure, by disclosing a method for determining a marketing target priority among a plurality of users, the efficiency of marketing can be increased, while waste of resources required for marketing can be reduced.

본 개시의 다양한 실시예들에 따르면, 로 데이터로부터 하나 이상의 1차 태그를 생성하고, 하나 이상의 1차 태그를 이용하여 사용자 간 유사도 연산을 수행해 2차 태그를 생성함으로써, 로 데이터를 이용해 유사도 연산을 수행하는 것에 비하여 컴퓨팅 장치(100)에서의 연산 부담을 줄일 수 있다.According to various embodiments of the present disclosure, one or more primary tags are generated from raw data, and a similarity calculation is performed between users using the one or more primary tags to generate a secondary tag, thereby calculating similarity using the raw data. Comparatively, the calculation load on the computing device 100 may be reduced.

도 7은 본 개시의 일 실시예에 따른 컴퓨팅 장치의 동작 흐름도이다. 본 개시의 일 실시예에 따른 컴퓨팅 장치(100)는, 동작 S710에서, 복수의 사용자 각각에 대한 사용자 정보 및 복수의 사용자 각각에 대한 결제 정보를 획득할 수 있다. 사용자 정보에는 성별, 나이, 자택 주소, 기혼 여부 등에 관한 정보가 포함될 수 있다. 결제 정보에는 결제처, 결제 시점, 결제 횟수, 결제 빈도, 결제 액수 등에 관한 적어도 하나의 정보가 포함될 수 있다.7 is a flowchart illustrating an operation of a computing device according to an embodiment of the present disclosure. The computing device 100 according to an embodiment of the present disclosure may obtain user information for each of a plurality of users and payment information for each of a plurality of users in operation S710. User information may include information about gender, age, home address, marital status, and the like. The payment information may include at least one piece of information about a payment destination, payment time, number of payments, frequency of payment, amount of payment, and the like.

본 개시의 일 실시예에 따른 컴퓨팅 장치(100)는, 동작 S720에서, 사용자 정보 및/또는 결제 정보에 기초하여 복수의 사용자 각각에 대해 하나 이상의 1차 태그를 생성할 수 있다. 하나 이상의 1차 태그 각각은 사용자에 관한 1차 사용자 특징을 지시할 수 있다. 전술한 바와 같이, 1차 사용자 특징은 사용자 정보(210) 및/또는 결제 정보(220)에 포함된 로 데이터들에 기초하여 1차적으로 도출되는 특징일 수 있다. 즉 1차 사용자 특징은, 해당 사용자의 사용자 정보 및/또는 결제 정보로부터 소정의 연산 과정(예: 사칙연산, 통계적 연산 등)을 통해 도출되는 특징일 수 있다.The computing device 100 according to an embodiment of the present disclosure may generate one or more primary tags for each of a plurality of users based on user information and/or payment information in operation S720. Each of the one or more primary tags may indicate a primary user characteristic about the user. As described above, the primary user characteristics may be characteristics that are primarily derived based on raw data included in the user information 210 and/or the payment information 220 . That is, the primary user characteristics may be characteristics derived from user information and/or payment information of the corresponding user through a predetermined calculation process (eg, four arithmetic operations, statistical calculations, etc.).

본 개시의 일 실시예에 따른 컴퓨팅 장치(100)는, 동작 S730에서, 표본 사용자 그룹 내 표본 사용자 각각에 대한 하나 이상의 1차 태그에 기초하여 복수의 사용자 각각과 표본 사용자 그룹 사이의 유사도를 결정할 수 있다. 전술한 바와 같이, 표본 사용자 그룹은 복수의 사용자 중 2차 사용자 특징을 가질 것으로 추정된 표본 사용자들의 집합일 수 있다. 각 사용자와 표본 사용자 그룹 사이에서 결정되는 유사도는, 각 사용자가 표본 사용자 그룹과 얼마나 유사한지를 나타내는 지표일 수 있다.The computing device 100 according to an embodiment of the present disclosure may determine a degree of similarity between each of a plurality of users and the sample user group based on one or more primary tags for each sample user in the sample user group, in operation S730. there is. As described above, the sample user group may be a set of sample users estimated to have secondary user characteristics among a plurality of users. The degree of similarity determined between each user and the sample user group may be an index indicating how similar each user is to the sample user group.

본 개시의 일 실시예에 따른 컴퓨팅 장치(100)는, 동작 S740에서, 복수의 사용자 각각에 대해 결정된 표본 사용자 그룹과의 유사도에 기초하여, 2차 태그 생성할 수 있다. 사용자에 대한 2차 태그는 사용자에 관한 2차 사용자 특징을 지시할 수 있다. 전술한 바와 같이, 2차 사용자 특징은 마케팅 목적에 의해 임의로 설정된 사용자 특징(예: 중국 음식 이용 지수 등)일 수 있다. 2차 사용자 특징은 하나 이상의 1차 태그에 기초하여 2차적으로 도출되는 특징이라는 점에서, 로 데이터로부터 직접 도출되는 1차 사용자 특징과 구별될 수 있다.In operation S740, the computing device 100 according to an embodiment of the present disclosure may generate a secondary tag based on a degree of similarity between each of the plurality of users and the determined sample user group. A secondary tag for a user may indicate secondary user characteristics about the user. As described above, the secondary user characteristics may be user characteristics arbitrarily set for marketing purposes (eg, Chinese food usage index, etc.). Secondary user characteristics may be distinguished from primary user characteristics directly derived from raw data in that they are characteristics that are secondarily derived based on one or more primary tags.

전술한 바와 같이, 복수의 사용자 각각에 대한 사용자 정보(210)는 복수의 사용자 각각에 대한 행동 정보(215) 및/또는 복수의 사용자 각각에 대한 신용 정보(217)를 포함할 수 있다. 따라서 컴퓨팅 장치(100)는 사용자 정보(210) 내의 행동 정보(215) 및/또는 신용 정보(217)에 추가적으로 기초하여, 복수의 사용자 각각에 대해 하나 이상의 1차 태그를 생성할 수 있다.As described above, the user information 210 for each of the plurality of users may include behavior information 215 for each of the plurality of users and/or credit information 217 for each of the plurality of users. Accordingly, computing device 100 may generate one or more primary tags for each of a plurality of users based further on behavioral information 215 and/or credit information 217 in user information 210 .

도 8은 본 개시의 일 실시예에 따른 컴퓨팅 장치의 동작 흐름도이다. 도 8의 동작 S810 및 동작 S820은 각각 도 7의 동작 S710 및 동작 S720과 동일 또는 유사하게 수행될 수 있으며, 이하 중복된 내용에 대한 설명은 생략한다.8 is a flowchart illustrating an operation of a computing device according to an embodiment of the present disclosure. Operations S810 and S820 of FIG. 8 may be performed identically or similarly to operations S710 and S720 of FIG. 7 , respectively, and descriptions of overlapping contents are omitted below.

본 개시의 일 실시예에 따른 컴퓨팅 장치(100)는, 동작 S830에서, 표본 사용자 그룹 내 표본 사용자 각각에 대한 하나 이상의 1차 태그(즉, 1차 태그 그룹)에 기초하여 학습된 신경망 기반 학습 모델에, 복수의 사용자 각각에 대한 하나 이상의 1차 태그를 입력할 수 있다. 신경망 기반 학습 모델은 컴퓨팅 장치(100) 또는 외부 장치에 의해 기 학습된 모델일 수 있다. 신경망 기반 학습 모델은 표본 사용자 그룹에 포함된 표본 사용자 각각에 대한 하나 이상의 1차 태그에 기초하여 학습된 모델인 바, 새롭게 입력되는 데이터(즉, 복수의 사용자 각각의 하나 이상의 1차 태그(1차 태그 그룹))가 표본 사용자 그룹의 데이터들과 유사한 지 여부를 판단할 수 있다.The computing device 100 according to an embodiment of the present disclosure, in operation S830, performs a neural network-based learning model learned based on one or more primary tags (ie, primary tag groups) for each sample user in the sample user group. In, one or more primary tags for each of a plurality of users may be input. The neural network-based learning model may be a model previously learned by the computing device 100 or an external device. Since the neural network-based learning model is a model learned based on one or more primary tags for each sample user included in a sample user group, newly input data (ie, one or more primary tags of each of a plurality of users (primary tags) It may be determined whether the tag group)) is similar to the data of the sample user group.

본 개시의 일 실시예에 따른 컴퓨팅 장치(100)는, 동작 S840에서, 신경망 기반 학습 모델에 기초하여, 복수의 사용자 각각과 표본 사용자 그룹 사이의 유사도를 결정할 수 있다. 컴퓨팅 장치(100)는 신경망 기반 학습 모델이 출력하는 유사도 값을 그대로 이용하여 사용자 각각의 유사도를 결정할 수 있다. 컴퓨팅 장치(100)는 신경망 기반 학습 모델이 출력하는 유사도 값에 적어도 하나의 추가적인 처리를 거쳐 사용자 각각의 유사도를 결정할 수도 있다.The computing device 100 according to an embodiment of the present disclosure may determine a degree of similarity between each of a plurality of users and a sample user group based on a neural network-based learning model in operation S840. The computing device 100 may determine the similarity of each user by directly using the similarity value output from the neural network-based learning model. The computing device 100 may determine the similarity of each user by performing at least one additional process on the similarity value output from the neural network-based learning model.

본 개시의 일 실시예에 따른 컴퓨팅 장치(100)는, 동작 S850에서, 유사도에 기초하여 복수의 사용자 각각에 대해 표본 사용자 그룹과 관련된 2차 사용자 특징을 지시하는 2차 태그를 생성할 수 있다.The computing device 100 according to an embodiment of the present disclosure may generate a secondary tag indicating a secondary user characteristic related to a sample user group for each of a plurality of users based on the degree of similarity in operation S850.

본 개시에 도시된 각 흐름도에서 본 개시에 따른 방법 또는 알고리즘의 각 단계들이 순차적인 순서로 설명되었지만, 각 단계들은 순차적으로 수행되는 것 외에, 본 개시에 의해 임의로 조합될 수 있는 순서에 따라 수행될 수도 있다. 본 흐름도에 따른 설명은, 방법 또는 알고리즘에 변화 또는 수정을 가하는 것을 제외하지 않으며, 임의의 단계가 필수적이거나 바람직하다는 것을 의미하지 않는다. 일 실시예에서, 적어도 일부의 단계가 병렬적, 반복적 또는 휴리스틱하게 수행될 수 있다. 일 실시예에서, 적어도 일부의 단계가 생략되거나, 다른 단계가 추가될 수 있다.Although each step of the method or algorithm according to the present disclosure has been described in a sequential order in each flowchart shown in this disclosure, each step may be performed in an order that may be arbitrarily combined according to the present disclosure, in addition to being performed sequentially. may be The description according to this flowchart does not exclude changes or modifications to the method or algorithm, and does not imply that any step is necessary or desirable. In one embodiment, at least some of the steps may be performed in parallel, iteratively or heuristically. In one embodiment, at least some steps may be omitted or other steps may be added.

본 개시의 다양한 실시예들은 기기(machine)가 읽을 수 있는 저장매체(machine-readable storage medium)에 소프트웨어로 구현될 수 있다. 소프트웨어는 본 개시의 다양한 실시예들을 구현하기 위한 소프트웨어일 수 있다. 소프트웨어는 본 개시가 속하는 기술분야의 프로그래머들에 의해 본 개시의 다양한 실시예들로부터 추론될 수 있다. 예를 들어 소프트웨어는 기기가 읽을 수 있는 명령어(예: 코드 또는 코드 세그먼트)를 포함하는 프로그램일 수 있다. 기기는 저장 매체로부터 호출된 명령어에 따라 동작이 가능한 장치로서, 예를 들어 컴퓨터일 수 있다. 일 실시예에서, 기기는 본 개시의 실시예들에 따른 컴퓨팅 장치(100)일 수 있다. 일 실시예에서, 기기의 프로세서는 호출된 명령어를 실행하여, 기기의 구성요소들이 해당 명령어에 해당하는 기능을 수행하게 할 수 있다. 일 실시예에서, 프로세서는 본 개시의 실시예들에 따른 프로세서(310)일 수 있다. 저장 매체는 기기에 의해 읽혀질 수 있는, 데이터가 저장되는 모든 종류의 기록 매체(recording medium)를 의미할 수 있다. 저장 매체는, 예를 들어 ROM, RAM, CD-ROM, 자기 테이프, 플로피 디스크, 광 데이터 저장장치 등을 포함할 수 있다. 일 실시예에서, 저장 매체는 메모리(320)일 수 있다. 일 실시예에서, 저장매체는 네트워크로 연결된 컴퓨터 시스템 등에 분산된 형태로서 구현될 수도 있다. 소프트웨어는 컴퓨터 시스템 등에 분산되어 저장되고, 실행될 수 있다. 저장 매체는 비일시적(non-transitory) 저장매체일 수 있다. 비일시적 저장매체는, 데이터가 반영구적 또는 임시적으로 저장되는 것과 무관하게 실재하는 매체(tangible medium)를 의미하며, 일시적(transitory)으로 전파되는 신호(signal)를 포함하지 않는다.Various embodiments of the present disclosure may be implemented as software in a machine-readable storage medium. The software may be software for implementing various embodiments of the present disclosure. Software can be inferred from various embodiments of this disclosure by programmers skilled in the art. For example, software may be a program containing machine-readable instructions (eg, code or code segments). A device is a device capable of operating according to a command called from a storage medium, and may be, for example, a computer. In one embodiment, the device may be a computing device 100 according to embodiments of the present disclosure. In one embodiment, the processor of the device may execute the called command, so that components of the device perform functions corresponding to the command. In one embodiment, the processor may be processor 310 according to embodiments of the present disclosure. The storage medium may mean any type of recording medium in which data that can be read by a device is stored. The storage medium may include, for example, ROM, RAM, CD-ROM, magnetic tape, floppy disk, optical data storage, and the like. In one embodiment, the storage medium may be memory 320 . In one embodiment, the storage medium may be implemented in a distributed form such as a computer system connected to a network. The software may be distributed, stored, and executed on a computer system or the like. The storage medium may be a non-transitory storage medium. A non-transitory storage medium means a tangible medium regardless of whether data is stored semi-permanently or temporarily, and does not include a signal propagated temporarily.

이상 다양한 실시예들에 의해 본 개시의 기술적 사상이 설명되었지만, 본 개시의 기술적 사상은 본 개시가 속하는 기술 분야에서 통상의 지식을 가진 자가 이해할 수 있는 범위에서 이루어질 수 있는 다양한 치환, 변형 및 변경을 포함한다. 또한, 그러한 치환, 변형 및 변경은 첨부된 청구범위 내에 포함될 수 있는 것으로 이해되어야 한다.Although the technical idea of the present disclosure has been described by various embodiments, the technical idea of the present disclosure includes various substitutions, modifications, and changes that can be made within the scope understandable by those skilled in the art to which the present disclosure belongs. include It is also to be understood that such substitutions, modifications and alterations may be included within the scope of the appended claims.

Claims

one or more processors; and
one or more memories storing instructions that, when executed by the one or more processors, cause the one or more processors to perform operations;
The one or more processors, according to the instructions,
Acquiring user information for each of a plurality of users and payment information for each of the plurality of users;
generate one or more primary tags for each of the plurality of users based on the user information and the payment information, wherein the one or more primary tags each indicate one or more primary user characteristics; Each user characteristic is specified by at least one of the user information and the payment information.
determining a degree of similarity between each of the plurality of users and the sample user group based on one or more primary tags for each sample user in a predetermined sample user group among the plurality of users;
Based on the degree of similarity, generate a secondary tag for each of the plurality of users, wherein the secondary tag indicates a secondary user characteristic associated with the sample user group, the secondary user characteristic being associated with the one or more primary users. specified by the tag - ,
determining a predetermined number of users from among the plurality of users in order of the similarity;
determine a proportion of the sample users among the predetermined number of users;
based on the ratio, determine a value of the secondary tag for each of the plurality of users;
The value of the secondary tag indicates whether each of the plurality of users has the secondary user characteristics or a degree of matching with the secondary user characteristics.

According to claim 1,
Each of the one or more primary tags may include one or more tags specified based on at least one selected from among the user information, payment destination, payment item, payment method, payment time, payment number, payment frequency, and payment amount of each of the plurality of users. A device that indicates each of the primary user characteristics.

According to claim 1,
The user information for each of the plurality of users includes behavior information or credit information of each of the plurality of users.

According to claim 1,
The one or more processors,
and determining the degree of similarity based on a difference between a value of a primary tag for each of the plurality of users and a value of a primary tag for each of the sample users.

According to claim 1,
The one or more processors,
determining the degree of similarity based on a difference between the value of the primary tag for each of the plurality of users and the primary tag reference value of the sample user group - the primary tag reference value is the primary tag for each of the sample users - It is a value determined based on the value of the device.

According to claim 1,
The one or more memories,
Further storing a learning model based on a neural network learned based on one or more primary tags for each of the sample users;
The one or more processors,
and determining a degree of similarity between each of the plurality of users and the sample user group based on the learning model.

According to claim 6,
The one or more processors,
Entering a value of one or more primary tags for each of the plurality of users into the learning model;
and determining the degree of similarity based on an output of the learning model.

delete

A method performed in a computing device comprising one or more processors and one or more memories storing instructions for execution by the one or more processors,
obtaining, by the one or more processors, user information for each of a plurality of users and payment information for each of the plurality of users;
Generating, by the one or more processors, one or more primary tags for each of the plurality of users based on the user information and the payment information, the one or more primary tags indicating one or more primary user characteristics, respectively. and each of the one or more primary user characteristics is specified by at least one of the user information and the payment information;
determining, by the one or more processors, a degree of similarity between each of the plurality of users and the sample user group, based on one or more primary tags for each sample user in a predetermined sample user group among the plurality of users;
generating, by the one or more processors, a secondary tag for each of the plurality of users based on the degree of similarity, wherein the secondary tag indicates a secondary user characteristic related to the sample user group, and the secondary user Characteristics specified by the one or more primary tags;
determining a predetermined number of users from among the plurality of users in an order of high similarity;
determining a proportion of the sample users among the predetermined number of users; and
based on the ratio, determining a value of the secondary tag for each of the plurality of users;
The value of the secondary tag indicates whether each of the plurality of users has the secondary user characteristics or a degree of matching with the secondary user characteristics.

According to claim 10,
Each of the one or more primary tags may include one or more tags specified based on at least one selected from among the user information, payment destination, payment item, payment method, payment time, payment number, payment frequency, and payment amount of each of the plurality of users. Indicating each of the primary user characteristics.

According to claim 10,
The user information for each of the plurality of users includes behavior information or credit information of each of the plurality of users.

According to claim 10,
wherein the determining of the degree of similarity includes determining the degree of similarity based on a difference between a value of a primary tag for each of the plurality of users and a value of a primary tag for each of the sample users.

According to claim 10,
The determining of the similarity may include determining the similarity based on a difference between a primary tag value for each of the plurality of users and a primary tag reference value of the sample user group - the primary tag reference value is and is a value determined based on a value of a primary tag for each of the sample users.

According to claim 10,
The one or more memories further store a neural network-based learning model learned based on one or more primary tags for each of the sample users;
Wherein the determining of the degree of similarity comprises determining a degree of similarity between each of the plurality of users and the sample user group based on the learning model.

According to claim 15,
The step of determining the degree of similarity is,
inputting a value of one or more primary tags for each of the plurality of users into the learning model; and
determining the degree of similarity based on an output of the learned model.

delete

In a non-transitory computer-readable recording medium recording instructions to be executed on a computer,
The instructions, when executed by one or more processors, cause the one or more processors to:
Acquiring user information for each of a plurality of users and payment information for each of the plurality of users;
generate one or more primary tags for each of the plurality of users based on the user information and the payment information, wherein the one or more primary tags each indicate one or more primary user characteristics; Each user characteristic is specified by at least one of the user information and the payment information.
determining a degree of similarity between each of the plurality of users and the sample user group based on one or more primary tags for each sample user in a predetermined sample user group among the plurality of users;
Based on the degree of similarity, generate a secondary tag for each of the plurality of users, wherein the secondary tag indicates a secondary user characteristic associated with the sample user group, the secondary user characteristic being associated with the one or more primary users. specified by the tag - ,
determining a predetermined number of users from among the plurality of users in order of the similarity;
determine a proportion of the sample users among the predetermined number of users;
determine a value of the secondary tag for each of the plurality of users based on the ratio;
The value of the secondary tag indicates whether each of the plurality of users has the secondary user characteristics or a degree of matching with the secondary user characteristics.