KR20210028554A

KR20210028554A - Method and apparatus for training AI model using user's time series behavior data

Info

Publication number: KR20210028554A
Application number: KR1020200031360A
Authority: KR
Inventors: 홍승표
Original assignee: (주)빅인사이트
Priority date: 2020-03-13
Filing date: 2020-03-13
Publication date: 2021-03-12

Abstract

According to one disclosure of the present invention, a method for training an AI model using time series behavior data of a user may be provided. The method comprises: a step of acquiring behavioral data for all time-series behaviors of users, who have accessed an integrated platform; a labeling step of setting a label by determining whether target data determined by a manager of the integrated platform has been reached for each of all behavior data of the user; and learning the AI model by repeating the labeling step for all time-series behavioral data of other users accessing the integrated platform.

Description

TECHNICAL FIELD [Method and apparatus for training AI model using user's time series behavior data]

일 개시에 의하여 통합 플랫폼에 접속한 사용자의 모든 행동 데이터를 이용하여 통합 플랫폼의 목적 데이터에 도달할지 여부에 대한 확률을 판단하기 위한 인공지능 모델을 학습시키는 방법 및 장치가 개시된다.Disclosed is a method and apparatus for training an artificial intelligence model for determining a probability of reaching target data of an integrated platform using all behavioral data of a user who has accessed the integrated platform by one disclosure.

인공 지능(artificial intelligence, AI)은 인간의 지능으로 할 수 있는 사고, 학습, 자기계발 등을 컴퓨터가 할 수 있도록 하는 방법을 연구하는 컴퓨터 공학 및 정보기술의 한 분야로, 컴퓨터가 인간의 지능적인 행동을 모방할 수 있도록 하는 것을 의미한다.Artificial intelligence (AI) is a branch of computer science and information technology that studies how computers can do the thinking, learning, and self-development that human intelligence can do. It means being able to imitate behavior.

새로운 유형의 멀티미디어 콘텐츠, SNS(social network service)의 광범위한 확장, 그리고 스마트 기기들의 보급과 이용으로 인해 웹상에서 발생 및 유통되는 데이터의 규모가 기하 급수적으로 늘어나고 있다. 웹상에서 존재하고 지금도 늘어나고 있는 엄청난 양의 데이터는 세상을 해석하기 위해 사용될 수 있다. 이것이 바로 빅 데이터로서, 쉽게 말해 디지털화된 방대한 양의 정보를 뜻한다. 빅 데이터에서 불필요한 데이터들을 걸러내고 유용한 정보만을 추출 및 분석하여 사람들의 생각과 의견, 트랜드를 읽어내고 더 나아가 그들의 행동을 미리 예측할 수 있다. 빅 데이터는 이러한 유용성으로 인해 현재 우리나라에서뿐만 아니라 전세계적으로 각광받고 있는 차세대 IT(information technology) 기술 중 하나이다.Due to the widespread expansion of new types of multimedia contents, social network service (SNS), and the spread and use of smart devices, the scale of data generated and distributed on the web is increasing exponentially. The vast amount of data that exists and continues to grow on the web can be used to interpret the world. This is big data, which simply means a vast amount of digitized information. By filtering out unnecessary data from big data and extracting and analyzing only useful information, it is possible to read people's thoughts, opinions, and trends, and further predict their behavior in advance. Big data is one of the next generation IT (information technology) technologies that are currently in the limelight not only in Korea but also around the world due to such usefulness.

국내 빅 데이터 시장은 2015년 3,000억 원대를 형성하며, 2020년 1조원 규모로 성장할 것으로 예상된다. 빅 데이터와 관련된 국내 시장 규모도 매년 28% 이상 성장하고 있다. 그러나, 빅데이터 분석은 폭발적으로 증가하고 있으나 빅데이터 분석 기술 발전만큼 데이터 전처리에 관한 기술의 발전 속도는 느리며 이에 따라 자동화된 데이터 전처리 기술 개발의 필요성이 대두되고 있다.The domestic big data market is expected to reach KRW 300 billion in 2015 and is expected to grow to KRW 1 trillion in 2020. The size of the domestic market related to big data is also growing by more than 28% every year. However, although big data analysis is explosively increasing, the development speed of data preprocessing technology is as slow as the development of big data analysis technology, and accordingly, the need for the development of automated data preprocessing technology is emerging.

또한, 다양화된 온라인 서비스에 따라 제공 목적 및 분석 요소가 다변화되고 있어 빅데이터를 분석하는 방법만으로는 사용자가 원하는 데이터를 분석하고 처리하기에 한계점이 존재하며, 종래의 데이터 수집형태로 취합된 데이터를 분석하는 경우 다양한 차원의 분석을 수행할 수 없는 한계가 나타났다.In addition, as the purpose of providing and analysis elements are diversifying according to diversified online services, there is a limitation in analyzing and processing the data desired by users only by the method of analyzing big data. In the case of analysis, there was a limitation in that it was impossible to perform analysis of various dimensions.

따라서, 비정형적 데이터 수집을 통한 다차원적 데이터 분석을 수행할 수 있는 인공지능 모델에 대한 필요성이 증가하였다.Accordingly, the need for an artificial intelligence model capable of performing multidimensional data analysis through informal data collection has increased.

0001)한국공개특허 제10-2016-0075971호(공개일자: 2016. 06. 30)0001) Korean Patent Publication No. 10-2016-0075971 (Publication date: 2016. 06. 30) 0002)한국공개특허 제10-2019-0062848호(공개일자: 2019.06.07)0002) Korean Patent Publication No. 10-2019-0062848 (Publication date: 2019.06.07)

일 개시에 따른 기술적 과제는 통합 플랫폼에 접속한 시점부터 종료한 시점까지 수행된 사용자의 모든 행동 데이터를 수집하고, 수집된 행동 데이터가 목적 데이터에 해당하는지 여부를 라벨링함으로써 사용자의 행동 패턴을 분석하는 인공지능 모델을 학습시키는 방법 및 장치를 제공함에 있다.The technical task according to the start of work is to collect all the user's behavior data performed from the point of access to the integrated platform to the point of termination, and analyze the user's behavior pattern by labeling whether the collected behavior data corresponds to the purpose data. It is to provide a method and apparatus for training an artificial intelligence model.

제 1 실시예에 의하여 통합 플랫폼에 접속한 사용자의 모든 시계열적 행동에 대한 행동 데이터를 획득하는 단계, 사용자의 모든 행동 데이터 각각에 대하여, 통합 플랫폼의 관리자에 의하여 정해진 목적 데이터에 도달하였는지 여부를 판단하여 라벨(label)을 설정하는 라벨링(labeling) 단계 및 통합 플랫폼에 접속한 다른 사용자의 모든 시계열적 행동 데이터에 대하여 라벨링 단계를 반복함으로써 인공지능 모델을 학습시키는 단계를 포함하는 사용자의 시계열적 행동 데이터를 이용하여 인공지능 모델을 학습시키는 방법을 제공할 수 있다.Acquiring behavioral data on all time-series behaviors of a user who has accessed the integrated platform according to the first embodiment, and for each of all behavioral data of the user, it is determined whether or not the target data determined by the administrator of the integrated platform has been reached. User time-series behavior data including the step of learning an artificial intelligence model by repeating the labeling step of setting a label and the labeling step for all time-series behavior data of other users who have accessed the integrated platform. It can provide a method for training an artificial intelligence model using.

제 2 실시예에 의하여 프로세서 및 프로세서에 의해 실행 가능한 명령어들을 저장하는 메모리를 포함하고 프로세서는 명령어들을 실행함으로써, 통합 플랫폼에 접속한 사용자의 모든 시계열적 행동 데이터를 획득하고, 사용자의 모든 행동 데이터 각각에 대하여, 통합 플랫폼의 관리자에 의하여 정해진 목적 데이터에 도달하였는지 여부를 판단하여 라벨(label)을 설정하고, 통합 플랫폼에 접속한 다른 사용자의 모든 시계열적 행동 데이터에 대하여 라벨을 설정하는 것을 반복함으로써 인공지능 모델을 학습시키는 것인, 사용자의 시계열적 행동 데이터를 이용하여 인공지능 모델을 학습시키는 장치를 제공할 수 있다.According to the second embodiment, the processor includes a processor and a memory storing instructions executable by the processor, and the processor executes the instructions, thereby acquiring all time-series behavior data of a user accessing the integrated platform, and each of all behavior data of the user. For the artificial platform, it is determined whether or not the target data determined by the administrator of the integrated platform has been reached, a label is set, and the label is repeatedly set for all time-series behavior data of other users who have accessed the integrated platform. It is possible to provide an apparatus for learning an artificial intelligence model using time-series behavior data of a user, which is to train an intelligence model.

제 3 실시예에 따른, 사용자의 시계열적 행동 데이터를 이용하여 인공지능 모델을 학습시키는 방법을 실행하기 위하여 프로세서에 의해 실행 가능한 명령어들이 저장된 비일시적 컴퓨터 판독 가능 저장매체와, 저장매체에 기록된 컴퓨터프로그램을 제공할 수 있다.A non-transitory computer-readable storage medium in which instructions executable by a processor are stored in order to execute a method of learning an artificial intelligence model using time-series behavior data of a user according to the third embodiment, and a computer recorded in the storage medium You can provide a program.

일 개시에 의하여 사용자의 행동을 트래킹하여 사용자 중심의 데이터를 수집하고 분석함으로써, 사용자의 경험을 최적화 한 다양한 차원의 데이터 분석 결과를 도출해낼 수 있다.By tracking the user's behavior at the start of work, collecting and analyzing user-centered data, it is possible to derive data analysis results of various dimensions that optimize the user's experience.

일 개시에 의하여 기존 실무자의 역량에 의하여 좌우되는 데이터 분석 결과를 인공지능 학습 모델을 통해 자동화함으로써, 데이터 분석, 실행, 전략으로 구성된 자동 데이터 분석 과정에 따라 빠른 주기의 의사결정 및 일관된 기준의 분석 결과를 제공할 수 있다. 특히, 본원발명으로 인하여 실무자의 업무시간은 약 95% 단축되는 효과가 있으며, 데이터 분석에 필요한 인원도 2.8명 감소시킬 수 있어, 뛰어난 경제적 효과를 제공한다.By automating the data analysis results, which are influenced by the capabilities of the existing practitioners by the start of work, through an artificial intelligence learning model, decision-making in a fast cycle and analysis results of consistent standards according to the automatic data analysis process consisting of data analysis, execution, and strategy. Can provide. In particular, due to the present invention, the working hours of practitioners can be reduced by about 95%, and the number of personnel required for data analysis can also be reduced by 2.8, providing excellent economic effects.

일 개시에 의하여 비식별 데이터를 이용한 기존의 분석과 달리, 실제 사용자의 프로파일을 수집하고 모든 사용자 행동을 분석함으로써, 보다 직접적인 개인화 커뮤니케이션이 가능해지며 친밀한 브랜드 경험을 제공할 수 있다.Unlike the existing analysis using non-identifying data by one disclosure, by collecting the profile of an actual user and analyzing all user behaviors, more direct personalized communication is possible and an intimate brand experience can be provided.

일 개시에 의하여, 기존의 테이블 형태의 데이터를 저장하는 관계형 데이터베이스에 비하여, 각각의 데이터들을 하나의 객체로서 저장하는 그래프 데이터베이스를 사용함으로써 정형 데이터 및 비정형 데이터의 통합적인 저장관리에 유리하다.According to one disclosure, compared to a relational database that stores data in the form of a table, a graph database that stores each data as an object is advantageous for integrated storage and management of structured and unstructured data.

일 개시에 의하여 본 발명의 데이터 분석 시스템 및 방법에 의하면, 정형 및 비정형 데이터의 수집 및 전처리 시 데이터 간의 연관관계를 자동으로 찾아냄으로써 완성된 그래프 데이터 모델을 자동적으로 생성할 수 있고, 자동 연관관계 생성을 통하여 기존 테이블(table) 형태의 2차원적인 데이터에서 볼 수 없었던 시스템적인 통찰력을 달성하여 상품 추천, 사용자 행동 예측, 침해사고 예방 분석 고도화에 활용할 수 있다.According to the data analysis system and method of the present invention according to the disclosure, a completed graph data model can be automatically generated by automatically finding a correlation between data when collecting and pre-processing structured and unstructured data, and automatically generating a correlation relationship. Through this, it can be used for product recommendation, user behavior prediction, and intrusion accident prevention analysis advancement by achieving systemic insight that could not be seen in the existing table-type two-dimensional data.

도 1은 일 실시예에 의한 사용자의 시계열적 행동 데이터를 이용하여 인공지능 모델을 학습시키는 방법을 개략적으로 설명하기 위한 도면이다.
도 2는 일 개시에 의한 사용자의 시계열적 행동 데이터를 이용하여 인공지능 모델을 학습시키는 방법 설명하기 위한 흐름도이다.
도 3은 일 개시에 의한 그래프 데이터를 이용하여 인공지능 모델을 학습하는 특징을 설명하기 위한 도면이다.
도 4는 일 개시에 의한 성공 라벨이 부여된 행동 데이터를 이용하여 인공지능 모델을 학습하는 특징을 설명하기 위한 도면이다.
도 5는 일 개시에 의한 실패 라벨이 부여된 행동 데이터를 이용하여 인공지능 모델을 학습하는 특징을 설명하기 위한 도면이다.
도 6은 일 개시에 의한 사용자 행동을 획득하고, 사용자 정보와 매핑하여 행동 데이터를 분석하는 특징을 설명하기 위한 도면이다.
도 7은 일 개시에 의한 사용자의 통합 플랫폼 이용 내역에 따라 사용자 그룹을 관리하는 특징을 설명하기 위한 도면이다.
도 8은 사용자의 시계열적 행동 데이터를 이용하여 인공지능 모델을 학습시키는 장치의 구조를 설명하기 위한 도면이다.1 is a diagram schematically illustrating a method of training an artificial intelligence model using time-series behavior data of a user according to an exemplary embodiment.
2 is a flowchart illustrating a method of training an artificial intelligence model using time-series behavior data of a user according to an initiation.
3 is a diagram for explaining a feature of learning an artificial intelligence model using graph data according to one disclosure.
FIG. 4 is a diagram for describing a feature of learning an artificial intelligence model using behavioral data to which a success label is assigned according to an initiation.
5 is a diagram for explaining a feature of learning an artificial intelligence model using behavior data to which a failure label is assigned due to an initiation.
6 is a diagram for explaining a feature of acquiring a user behavior according to an initiation and analyzing behavior data by mapping it with user information.
FIG. 7 is a diagram for explaining a feature of managing a user group according to a user's use of an integrated platform according to an initiation.
FIG. 8 is a diagram for explaining the structure of an apparatus for training an artificial intelligence model using time-series behavior data of a user.

본 명세서에서 사용되는 용어에 대해 간략히 설명하고, 본 발명의 일 실시예에 대해 구체적으로 설명하기로 한다. The terms used in the present specification will be briefly described, and an embodiment of the present invention will be described in detail.

본 명세서에서 사용되는 용어는 본 발명에서의 기능을 고려하면서 가능한 현재 널리 사용되는 일반적인 용어들을 선택하였으나, 이는 당 분야에 종사하는 기술자의 의도 또는 판례, 새로운 기술의 출현 등에 따라 달라질 수 있다. 또한, 특정한 경우는 출원인이 임의로 선정한 용어도 있으며, 이 경우 해당되는 발명의 설명 부분에서 상세히 그 의미를 기재할 것이다. 따라서 본 명세서에서 사용되는 용어는 단순한 용어의 명칭이 아닌, 그 용어가 가지는 의미와 본 발명의 전반에 걸친 내용을 토대로 정의되어야 한다. Terms used in the present specification have selected general terms that are currently widely used as possible while taking functions of the present invention into consideration, but this may vary according to the intention or precedent of a technician working in the field, the emergence of new technologies, and the like. In addition, in certain cases, there are terms arbitrarily selected by the applicant, and in this case, the meaning of the terms will be described in detail in the description of the corresponding invention. Therefore, the terms used in the present specification should be defined based on the meaning of the term and the overall contents of the present invention, not a simple name of the term.

명세서 전체에서 어떤 부분이 어떤 구성요소를 "포함"한다고 할 때, 이는 특별히 반대되는 기재가 없는 한 다른 구성요소를 제외하는 것이 아니라 다른 구성요소를 더 포함할 수 있음을 의미한다. 또한, 명세서에 기재된 "...부", "모듈" 등의 용어는 적어도 하나의 기능이나 동작을 처리하는 단위를 의미하며, 이는 하드웨어 또는 소프트웨어로 구현되거나 하드웨어와 소프트웨어의 결합으로 구현될 수 있다.When a part of the specification is said to "include" a certain component, it means that other components may be further included rather than excluding other components unless specifically stated to the contrary. In addition, terms such as "... unit" and "module" described in the specification mean a unit that processes at least one function or operation, which may be implemented as hardware or software or a combination of hardware and software. .

머신 러닝(machine learning)은 인공지능의 한 분야로, 컴퓨터에 명시적인 프로그램 없이 배울 수 있는 능력을 부여하는 연구 분야이다.Machine learning is a branch of artificial intelligence, a field of research that gives computers the ability to learn without explicit programming.

구체적으로 머신 러닝은, 경험적 데이터를 기반으로 학습을 하고 예측을 수행하고 스스로의 성능을 향상시키는 시스템과 이를 위한 알고리즘을 연구하고 구축하는 기술이라 할 수 있다. 머신 러닝의 알고리즘들은 엄격하게 정해진 정적인 프로그램 명령들을 수행하는 것이라기보다, 입력 데이터를 기반으로 예측이나 결정을 이끌어내기 위해 특정한 모델을 구축하는 방식을 취한다.Specifically, machine learning can be said to be a technology that studies and builds a system that learns based on empirical data, performs prediction, and improves its own performance, and algorithms for it. Rather than executing strictly defined static program instructions, machine learning algorithms build specific models to derive predictions or decisions based on input data.

인공 신경망은 훈련 데이터(training data)를 이용하여 학습(training)될 수 있다. 여기서 학습이란, 입력 데이터를 분류(classification)하거나 회귀분석(regression)하거나 군집화(clustering)하는 등의 목적을 달성하기 위하여, 학습 데이터를 이용하여 인공 신경망의 파라미터(parameter)를 결정하는 과정을 의미할 수 있다. 인공 신경망의 파라미터의 대표적인 예시로써, 시냅스에 부여되는 가중치(weight)나 뉴런에 적용되는 편향(bias)을 들 수 있다.The artificial neural network can be trained using training data. Here, learning means a process of determining parameters of an artificial neural network using training data in order to achieve the purpose of classifying, regressing, or clustering input data. I can. Representative examples of parameters of an artificial neural network include weights applied to synapses or biases applied to neurons.

훈련 데이터에 의하여 학습된 인공 신경망은, 입력 데이터를 입력 데이터가 가지는 패턴에 따라 분류하거나 군집화 할 수 있다.The artificial neural network learned by the training data may classify or cluster input data according to a pattern of the input data.

인공 신경망의 학습 방식은 크게, 지도 학습, 비 지도 학습, 준 지도 학습(Semi-Supervised Learning), 강화 학습(Reinforcement Learning)으로 분류될 수 있다.Learning methods of artificial neural networks can be broadly classified into supervised learning, unsupervised learning, semi-supervised learning, and reinforcement learning.

지도 학습은 훈련 데이터로부터 하나의 함수를 유추해내기 위한 기계 학습의 한 방법이다. 그리고 이렇게 유추되는 함수 중, 연속 적인 값을 출력하는 것을 회귀분석(Regression)이라 하고, 입력 벡터의 클래스(class)를 예측하여 출력하는 것을 분류(Classification)라고 할 수 있다.Supervised learning is a method of machine learning to infer a function from training data. And among the functions to be inferred, outputting a continuous value is called regression, and predicting and outputting a class of an input vector can be called classification.

지도 학습에서는, 훈련 데이터에 대한 라벨(label)이 주어진 상태에서 인공 신경망을 학습시킨다.In supervised learning, an artificial neural network is trained with a label for training data.

여기서 레이블이란, 훈련 데이터가 인공 신경망에 입력되는 경우 인공 신경망이 추론해 내야 하는 정답(또는 결과 값)을 의미할 수 있다.Here, the label may mean a correct answer (or result value) that the artificial neural network must infer when training data is input to the artificial neural network.

또한 본 명세서에서는, 인공 신경망의 학습을 위하여 훈련 데이터에 레이블을 설정하는 것을, 훈련 데이터에 레이블링 데이터를 라벨링(labeling) 한다고 명칭 한다.In addition, in this specification, setting a label on training data for learning an artificial neural network is referred to as labeling the training data with labeling data.

이 경우 훈련 데이터와 훈련 데이터에 대응하는 레이블)은 하나의 트레이닝 셋(training set)을 구성하고, 인공 신경망에는 트레이닝 셋의 형태로 입력될 수 있다.In this case, the training data and the label corresponding to the training data) constitute one training set, and may be input to the artificial neural network in the form of a training set.

한편 훈련 데이터는 복수의 특징(feature)을 나타내고, 훈련 데이터에 레이블이 레이블링 된다는 것은 훈련 데이터가 나타내는 특징에 레이블이 달린다는 것을 의미할 수 있다. 이 경우 훈련 데이터는 입력 개체의 특징을 벡터 형태로 나타낼 수 있다.Meanwhile, the training data represents a plurality of features, and labeling of the training data may mean that a label is attached to the feature represented by the training data. In this case, the training data may represent the characteristics of the input entity in the form of a vector.

인공 신경망은 훈련 데이터와 레이블링 데이터를 이용하여, 훈련 데이터와 레이블링 데이터의 연관 관계에 대한 함수를 유추할 수 있다. 그리고, 인공 신경망에서 유추된 함수에 대한 평가를 통해 인공 신경망의 파라미터가 결정(최적화)될 수 있다.The artificial neural network can infer a function for the correlation between the training data and the labeling data by using the training data and the labeling data. In addition, parameters of the artificial neural network may be determined (optimized) through evaluation of a function inferred from the artificial neural network.

비 지도 학습은 기계 학습의 일종으로, 훈련 데이터에 대한 레이블이 주어지지 않는다.Unsupervised learning is a type of machine learning, where no labels are given for training data.

구체적으로, 비 지도 학습은, 훈련 데이터 및 훈련 데이터에 대응하는 레이블의 연관 관계 보다는, 훈련 데이터 자체에서 패턴을 찾아 분류하도록 인공 신경망을 학습시키는 학습 방법일 수 있다. 통합 플랫폼은 사용자 단말에서 설치 또는 구동되는 애플리케이션, 프로그램, 웹 페이지 등을 포함하는 웹 플랫폼 또는 모바일 플랫폼일 수 있다.Specifically, the unsupervised learning may be a learning method of training an artificial neural network to find and classify patterns in the training data itself, rather than an association relationship between training data and a label corresponding to the training data. The integrated platform may be a web platform or a mobile platform including applications, programs, web pages, etc. installed or driven in the user terminal.

여기서, 웹 브라우저는 웹(WWW: World Wide Web) 서비스를 이용할 수 있게 하는 프로그램으로 HTML(hyper text mark-up language)로 서술된 하이퍼 텍스트를 받아서 보여주는 프로그램을 의미하며, 예를 들어 넷스케이프(Netscape), 익스플로러(Explorer), 크롬(chrome) 등을 포함한다. 또한 애플케이션은 단말 상의 응용프로그램(application)을 의미하며, 예를 들어, 모바일 단말(스마트폰)에서 실행되는 앱(app)을 포함한다. 앱(app)은 모바일 콘텐츠를 자유롭게 사고 파는 가상의 장터인 애플리케이션 마켓에서 다운로드 받아서 설치할 수 있다. Here, the web browser is a program that enables you to use the Web (WWW: World Wide Web) service, which means a program that receives and displays hypertext described in HTML (hyper text mark-up language). For example, Netscape , Explorer, chrome, etc. In addition, an application means an application program on a terminal, and includes, for example, an app running on a mobile terminal (smart phone). Apps can be downloaded and installed from the application market, a virtual marketplace for freely buying and selling mobile content.

일 개시에 의하여 사용자의 시계열적 행동 데이터를 이용하여 인공지능 모델을 학습시키는 장치(100)는 인공지능 모델 학습 장치(100)로 축약하여 설명하도록 한다.The apparatus 100 for training an artificial intelligence model using time-series behavior data of a user according to one disclosure will be abbreviated as the artificial intelligence model learning apparatus 100 and described.

도 1은 일 실시예에 의한 사용자의 시계열적 행동 데이터를 이용하여 인공지능 모델을 학습시키는 방법을 개략적으로 설명하기 위한 도면이다.1 is a diagram schematically illustrating a method of training an artificial intelligence model using time-series behavior data of a user according to an exemplary embodiment.

일 개시에 의하여 인공지능 모델 학습 장치(100)는 통합 플랫폼, 예를 들어 쇼핑 사이트에 접속한 사용자의 모든 행동 데이터를 수집할 수 있다. 여기서 행동 데이터는 통합 플랫폼에 접속하는 방문 트래픽 데이터, 통합 플랫폼에 포함된 컨텐트에 대한 반응 데이터, 통합 플랫폼에서 제공하는 이벤트에 대한 반응 데이터, 통합 플랫폼에 등록된 사용자의 식별 데이터 및 통합 플랫폼에 접속한 동안 발생하는 비정형 데이터를 포함할 수 있다.By the start, the artificial intelligence model learning apparatus 100 may collect all behavioral data of a user who has accessed an integrated platform, for example, a shopping site. Here, the behavioral data is visited traffic data connected to the integrated platform, reaction data for content included in the integrated platform, reaction data for events provided by the integrated platform, identification data of users registered on the integrated platform, and access to the integrated platform. It may contain unstructured data that occurs during.

보다 상세하게 인공지능 모델 학습 장치(100)는 웹사이트에 접속한 사용자의 경로, 접속 시작 시점, 접속 종료 시점, 페이지의 스크롤 범위, 로그인 여부, 사용자의 컨텐츠 구매 유형, 구매 단계의 진행여부, 배송 정보, 결제 정보 등의 모든 정형적, 비정형적 데이터를 획득할 수 있다. In more detail, the artificial intelligence model learning device 100 includes the path of the user who accessed the website, the connection start time point, the connection end time point, the scroll range of the page, whether or not to log in, the user's content purchase type, whether the purchase step is in progress, and delivery. All formal and unstructured data such as information and payment information can be obtained.

인공지능 모델 학습 장치(100)는 도 1에서와 같이, SNS광고를 통해 접속한 A사용자의 구매 완료까지의 모든 행동 데이터를 획득할 수 있다. 인공지능 모델 학습 장치(100)는 실시간으로 사용자의 행동 데이터를 획득할 수 있다.As shown in FIG. 1, the artificial intelligence model learning apparatus 100 may acquire all behavioral data until purchase completion of user A accessed through an SNS advertisement. The artificial intelligence model learning apparatus 100 may acquire user behavior data in real time.

일 개시에 의하여, 인공지능 모델 학습 장치(100)는 사용자의 행동 데이터가 통합 플랫폼의 관리자에 의하여 미리 지정된 목표 데이터를 포함하고 있는지 여부를 자동으로 판단할 수 있다. 예를 들어, 쇼핑몰 사이트의 목표 데이터가 “구매”인 경우, 인공지능 모델 학습 장치(100)는 A사용자의 모든 행동 데이터에 구매 데이터가 존재하는지 여부를 자동으로 판단할 수 있다. 인공지능 모델 학습 장치(100)는 A사용자의 행동 데이터에 구매 데이터가 존재하는 경우, A 사용자의 모든 행동 데이터에 성공 라벨을 부여할 수 있다. 즉, 인공지능 모델 학습 장치(100)는 A가 웹사이트에 접속한 시점에서부터 종료한 시점까지 획득한 모든 데이터를 성공 라벨로 결정할 수 있다. According to an initiation, the artificial intelligence model learning apparatus 100 may automatically determine whether the user's behavior data includes target data previously designated by the administrator of the integrated platform. For example, when the target data of the shopping mall site is “purchase”, the artificial intelligence model learning apparatus 100 may automatically determine whether purchase data exists in all behavioral data of user A. The artificial intelligence model learning apparatus 100 may assign a success label to all behavior data of user A when purchase data exists in the behavior data of user A. That is, the artificial intelligence model learning apparatus 100 may determine all the data acquired from the time point A accesses the website to the time point where it ends as the success label.

일 개시에 의하여, 인공지능 모델 학습 장치(100)는 라벨링 단계를 거친 행동 데이터들을 반복학습함으로써, 인공지능 모델을 생성할 수 있다. 여기서 인공지능 모델은 기존의 특정한 도메인에 대해서만 학습을 수행한 것과 달리, 목적 데이터가 정해지기만 하면 사용자의 행동 데이터로부터 자동으로 성공 여부를 도출해낸다는 점에서 다차원의 결과에 속하는 목록을 생성할 수 있으며, 이로 인하여 사용자의 행동 데이터를 분석하여 다양한 마케팅 리소스로 활용할 수 있는 장점이 있다.By an initiation, the artificial intelligence model learning apparatus 100 may generate an artificial intelligence model by iteratively learning behavioral data that has been subjected to the labeling step. Here, the artificial intelligence model can generate a list belonging to multidimensional results in that it automatically derives the success or failure from the user's behavior data once the target data is determined, unlike the existing learning only for a specific domain. This has the advantage of analyzing user behavior data and utilizing it as a variety of marketing resources.

개시된 실시 예에서, 인공지능 모델 학습 장치(100)는 라벨링 단계를 거친 행동 데이터들을 수집하여 학습 데이터를 획득할 수 있다. 인공지능 모델 학습 장치(100)는 획득된 학습 데이터를 이용하여 인공지능 모델을 학습시킬 수 있다.In the disclosed embodiment, the apparatus 100 for learning an artificial intelligence model may acquire training data by collecting behavioral data that has been subjected to a labeling step. The artificial intelligence model learning apparatus 100 may train an artificial intelligence model by using the acquired training data.

예를 들어, 인공지능 모델 학습 장치(100)는 기 설정된 주기로, 혹은 기 설정된 양의 학습 데이터가 수집될 때마다 학습을 수행할 수 있으며, 상술한 바와 같이 라벨링 단계를 거친 행동 데이터들을 반복학습함으로써 업데이트될 수도 있다.For example, the artificial intelligence model learning apparatus 100 may perform training at a preset period or whenever a preset amount of training data is collected, and iteratively learns the behavioral data that passed the labeling step as described above. It may be updated.

일 실시 예에서, 학습된 인공지능 모델 및 인공지능 모델의 업데이트를 위한 모듈이 메인 서버에서 생성된 후, 인공지능 모델 학습 장치(100)에 다양한 형태(예를 들어, API 및 SDK 등)로 제공될 수 있다. 최초에 제공되는 모듈은 기본적인 학습이 완료된 상태일 수 있으며, 이에 더하여 통합 플랫폼의 관리자가 입력한 목표 및 기 수집된 정보와 이후 수집되는 정보들에 기반하여 학습 데이터를 획득하여 학습하는 형태로 구성될 수 있다.In one embodiment, the learned artificial intelligence model and the module for updating the artificial intelligence model are generated in the main server, and then provided to the artificial intelligence model learning device 100 in various forms (for example, API and SDK, etc.) Can be. The module initially provided may be in a state where basic learning has been completed, and in addition to this, it will be configured in the form of acquiring and learning learning data based on the goals entered by the administrator of the integrated platform, previously collected information, and information collected later. I can.

일 개시에 의하여 생성된 인공지능 모델을 이용하여 사용자의 행동 데이터에 기반한 맞춤형 이벤트를 생성할 수 있으며, 보다 상세하게는 사용자의 행동 데이터의 획득 시점 및 행동 데이터의 경로에 기반하여 사용자에게 맞춤형 이벤트를 제공하여 사용자의 행동을 목적데이터로 유도하도록 하는 효과를 제공한다.A customized event based on the user's behavior data can be created using the artificial intelligence model generated by the initiation of the day, and in more detail, a customized event is provided to the user based on the acquisition time of the user's behavior data and the path of the behavior data. It provides the effect of inducing the user's actions to target data.

또한, 일 개시에 의하여 생성된 인공지능 모델을 이용하여 맞춤형 이벤트를 제공하기 위한 잠재고객 판단을 수행할 수 있다. 예를 들어, 일 개시에 의하여 생성된 인공지능 모델을 이용하여 사용자의 행동 데이터를 분석한 결과, 사용자가 기 설정된 목적에 해당하는 동작을 수행할 확률이 기 설정된 확률 이상인 것으로 판단되는 경우, 해당 사용자를 잠재 고객으로 판단할 수 있다.In addition, it is possible to determine a potential customer to provide a customized event by using the artificial intelligence model generated by the initiation of the day. For example, if, as a result of analyzing the user's behavior data using an artificial intelligence model created at the beginning of the day, that the probability that the user will perform an action corresponding to a preset purpose is greater than or equal to the preset probability, the user Can be judged as a potential customer.

도 2는 일 개시에 의한 사용자의 시계열적 행동 데이터를 이용하여 인공지능 모델을 학습시키는 방법 설명하기 위한 흐름도이다.2 is a flowchart illustrating a method of training an artificial intelligence model using time-series behavior data of a user according to an initiation.

블록 201에서 일 개시에 의한 인공지능 모델 학습 장치(100)는 통합 플랫폼에 접속한 사용자의 모든 시계열적 행동에 대한 행동 데이터를 획득할 수 있다.In block 201, the apparatus 100 for learning an artificial intelligence model according to an initiation may acquire behavior data for all time-series behaviors of a user accessing the integrated platform.

일 개시에 의한 인공지능 모델 학습 장치(100)는 사용자의 식별정보와 별개로 사용자의 행동 데이터를 수집할 수 있으며, 사용자의 식별정보에 매핑하여 저장하거나, 사용자의 행동 데이터만을 별개로 저장할 수 있다.The artificial intelligence model learning apparatus 100 according to one initiation may collect user's behavior data separately from the user's identification information, map and store the user's identification information, or separately store only the user's behavior data. .

일 개시에 의한 인공지능 모델 학습 장치(100)는 사용자의 행동 데이터 각각이 발생한 시간, 위치, 접속 환경 정보, 네트워크 이용정보 등을 동시에 획득할 수 있다.The apparatus 100 for learning an artificial intelligence model according to an initiation may simultaneously acquire time, location, access environment information, network usage information, etc. of each occurrence of the user's behavior data.

블록 202에서 일 개시에 의한 인공지능 모델 학습 장치(100)는 사용자의 모든 행동 데이터 각각에 대하여, 통합 플랫폼의 관리자에 의하여 정해진 목적 데이터에 도달하였는지 여부를 판단하여 라벨(label)을 설정하는 라벨링 단계를 수행할 수 있다.In block 202, the apparatus 100 for learning an artificial intelligence model according to one start is a labeling step of setting a label by determining whether the target data determined by the administrator of the integrated platform has been reached for each of the user's behavioral data. You can do it.

라벨링은 사용자의 모든 행동 데이터가 목적 데이터에 도달하였는지 여부를 기준으로 성공 또는 실패로 결정될 수 있다. 즉, 사용자의 행동 데이터 중 성공 라벨이 부여될 목적 데이터가 존재하는 경우를 제외하고는 모두 실패로 간주한다. Labeling may be determined as success or failure based on whether all behavioral data of the user has reached the target data. In other words, all of the user's behavior data is regarded as failure, except for the case in which purpose data to be assigned a success label exists.

라벨링은 경우에 따라 성공/실패 외의 다양한 결과값을 가질 수 있다. 예를 들어, 쇼핑몰 사이트의 다양한 마케팅 방법을 도출하기 위하여, 제품 구매, 회원 가입, 팔로우, 좋아요, 댓글, 이메일 구독 등의 기 설정된 결과를 라벨링할 수도 있다.In some cases, labeling may have various result values other than success/failure. For example, in order to derive various marketing methods of a shopping mall site, preset results such as product purchase, membership registration, follow, like, comment, and email subscription may be labeled.

여기서 라벨이란, 훈련 데이터가 인공 신경망에 입력되는 경우 인공 신경망이 추론해 내야 하는 정답(또는 결과 값)을 의미할 수 있다. 또한, 인공 신경망의 학습을 위하여 훈련 데이터에 레이블을 설정하는 것을, 훈련 데이터에 레이블링 데이터를 라벨링(labeling) 한다고 명칭 한다.Here, the label may mean a correct answer (or result value) that the artificial neural network must infer when training data is input to the artificial neural network. In addition, for training of an artificial neural network, labeling of training data is referred to as labeling of training data with labeling data.

이 경우 훈련 데이터와 훈련 데이터에 대응하는 라벨은 하나의 트레이닝 셋(training set)을 구성하고, 인공 신경망에는 트레이닝 셋의 형태로 입력될 수 있다. 한편 훈련 데이터는 복수의 특징(feature)을 나타내고, 훈련 데이터에 레이블이 레이블링 된다는 것은 훈련 데이터가 나타내는 특징에 레이블이 달린다는 것을 의미할 수 있다. 이 경우 훈련 데이터는 입력 개체의 특징을 벡터 형태로 나타낼 수 있다.In this case, the training data and the label corresponding to the training data constitute one training set, and may be input to the artificial neural network in the form of a training set. Meanwhile, the training data represents a plurality of features, and labeling of the training data may mean that a label is attached to the feature represented by the training data. In this case, the training data may represent the characteristics of the input entity in the form of a vector.

블록 203에서 일 개시에 의한 인공지능 모델 학습 장치(100)는 통합 플랫폼에 접속한 다른 사용자의 모든 시계열적 행동 데이터에 대하여 라벨링 단계를 반복함으로써 인공지능 모델을 학습시킬 수 있다.In block 203, the apparatus 100 for learning an artificial intelligence model according to an initiation may train the artificial intelligence model by repeating the labeling step for all time-series behavior data of other users accessing the integrated platform.

일 개시에 의하여 인공지능 모델 학습 장치(100)는 통합 플랫폼의 관리자로부터 목적 데이터를 획득할 수 있다. 목적 데이터는 통합 플랫폼의 관리자에 의하여 지정된 적어도 하나의 행동 데이터로서, 인공지능 모델은 목적 데이터의 속성에 기초하여 개별적으로 학습되는 것을 특징으로 한다. 예를 들어, 쇼핑몰 사이트의 목적 데이터는 “컨텐츠의 구매”일 수 있다. 다른 예로, 쇼핑몰 사이트의 목적 데이터는 설정에 따라 회원 가입, 팔로우, 좋아요, 댓글, 이메일 구독 등 다양하게 설정될 수 있으며, 이에 제한되지 않는다.According to an initiation, the artificial intelligence model learning apparatus 100 may acquire target data from the administrator of the integrated platform. The target data is at least one behavioral data designated by the administrator of the integrated platform, and the artificial intelligence model is individually learned based on the attributes of the target data. For example, the purpose data of the shopping mall site may be "purchase of contents". As another example, the purpose data of the shopping mall site may be variously set, such as membership registration, follow, likes, comments, and email subscriptions according to settings, but is not limited thereto.

다른 예로, 쇼핑몰 사이트의 목적 데이터는 사용자의 재방문 및 재구매를 포함할 수 있다. 이 경우, 사용자의 행동 데이터를 수집한 후, 해당 사용자의 사이트 재방문 여부 및 이에 따른 재구매 여부에 대한 데이터를 획득하고, 이에 기반하여 라벨링을 수행할 수도 있다.As another example, the purpose data of the shopping mall site may include a user's revisiting and repurchasing. In this case, after collecting the user's behavioral data, data on whether the user revisits the site and whether to repurchase accordingly may be obtained, and labeling may be performed based on this.

또한, 인공지능 모델 학습 장치(100)는 사용자의 모든 행동 데이터 중 통합 플랫폼의 목적 데이터가 존재하는 경우, 사용자의 모든 행동 데이터 각각에 대하여 성공 라벨을 설정할 수 있다.In addition, the artificial intelligence model learning apparatus 100 may set a success label for each of all the user's behavior data when the target data of the integrated platform exists among all the user's behavior data.

인공지능 모델 학습 장치(100)는 사용자의 모든 행동 데이터 중 통합 플랫폼의 목적 데이터가 존재하지 않는 경우, 사용자의 모든 행동 데이터 각각에 대하여 실패 라벨을 설정할 수 있다. 인공지능 모델 학습 장치(100)는 라벨이 부여된 행동 데이터들을 이용하여 인공지능 모델을 학습시킬 수 있다. 일 실시 예에서, 인공지능 모델 학습 장치(100)는 동일한 라벨이 부여된 행동 데이터들을 이용하여 인공지능 모델을 학습시킬 수 있으나, 이에 제한되는 것은 아니다.The apparatus 100 for learning an artificial intelligence model may set a failure label for each of all the user's behavior data when the target data of the integrated platform does not exist among all the user's behavior data. The artificial intelligence model learning apparatus 100 may train an artificial intelligence model by using labeled behavioral data. In an embodiment, the apparatus 100 for learning an artificial intelligence model may train an artificial intelligence model using behavioral data to which the same label is assigned, but is not limited thereto.

도 3은 일 개시에 의한 그래프 데이터를 이용하여 인공지능 모델을 학습하는 특징을 설명하기 위한 도면이다.3 is a diagram for explaining a feature of learning an artificial intelligence model using graph data according to one disclosure.

일 개시에 의하여 인공지능 모델 학습 장치(100)는 동일한 라벨이 설정된 행동 데이터 각각을 나타내는 정점(vertex)들을 방향성을 갖는 벡터인 엣지(edge)로 연결하여 그래프 데이터를 생성할 수 있다.According to one disclosure, the apparatus 100 for learning an artificial intelligence model may generate graph data by connecting vertices representing each of the behavioral data to which the same label is set with an edge, which is a vector having a direction.

그래프 데이터는 데이터의 저장 및 관리 기술로서 각 데이터 객체와 이 객체가 가진 연관 관계를 저장하는 수단으로 사용된다. 그래프 데이터를 이용한 인공지능 학습은, 테이블을 사전에 정의한 스키마(schema) 구조 형태에 맞추어 각 객체들을 저장하는 관계형 데이터베이스에 비하여, 데이터들을 하나의 객체로서 저장하기 때문에 정형 및 비정형 데이터의 통합적인 저장 관리에 유리하다.Graph data is a data storage and management technology, and is used as a means of storing each data object and its relationship. Artificial intelligence learning using graph data stores data as one object compared to a relational database that stores each object in accordance with a predefined schema structure, so it is an integrated storage and management of structured and unstructured data. Is advantageous to

인공지능 모델 학습 장치(100)는 정점들에 연결된 엣지의 수에 기초하여 그래프 데이터에서 생성될 수 있는 엣지의 발생 확률을 산출할 수 있다. 여기서 엣지는 각 정점을 연결하는 가상의 연관관계로서, 행동 데이터로부터 다른 행동 데이터를 예측할 수 있는지 확인하는 지표이다. 일 개시에 의하여, 엣지의 발생 확률을 산출하는 것은 나이브 베이즈(Naive Bayes) 통계확률기법을 기초로 수행될 수 있다.The artificial intelligence model learning apparatus 100 may calculate a probability of occurrence of an edge that may be generated from graph data based on the number of edges connected to the vertices. Here, the edge is a virtual association that connects each vertex, and is an index that checks whether other behavioral data can be predicted from the behavioral data. According to one initiation, calculating the probability of occurrence of an edge may be performed based on a Naive Bayes statistical probability technique.

인공지능 모델 학습 장치(100)는 엣지의 발생 확률이 임계값 이상인 경우, 그래프 데이터에 새로운 엣지를 생성하여 정점들의 연관관계를 도출할 수 있다. 여기서 임계값은 연관관계(엣지)의 발생 확률에 따라 해당 연관관계를 그래프 데이터에 반영할 것인지를 판단하는 기준이 되는 것으로서, 관리자에 의하여 지정될 수 있다. 이로써, 명시적인 연관관계가 없는 행동 데이터 간의 연관관계를 자동으로 생성할 수 있다.The artificial intelligence model learning apparatus 100 may derive a correlation between the vertices by generating a new edge in graph data when the probability of occurrence of an edge is greater than or equal to a threshold value. Here, the threshold value is a criterion for determining whether to reflect the correlation in graph data according to the probability of occurrence of the correlation (edge), and may be designated by the administrator. As a result, it is possible to automatically create an association relationship between behavioral data that does not have an explicit association relationship.

또한, 인공지능 모델 학습 장치(100)는 연관관계를 이용하여 사용자의 행동 데이터로부터 목적 데이터에 도달할 수 있는 확률을 학습할 수 있다. 일 개시에 의하여 연관관계를 자동으로 생성하고 학습하기 위하여는, 그래프 데이터 상에서 연관관계의 발생 여부를 검토할 수 있는 데이터 객체들에 대응하는 정점과, 정점들을 연결하는 엣지 정보 및 기 획득한 데이터 객체들과 유사한 속성의 정점 및 엣지정보가 필요하다. 일 개시에 따라, 인공지능 모델 학습 장치(100)는 실시간으로 획득한 행동 데이터를 기존 데이터에 병합하여 지속적인 학습을 이어갈 수 있다.In addition, the artificial intelligence model learning apparatus 100 may learn a probability of reaching the target data from the user's behavior data by using the association relationship. In order to automatically generate and learn the relationship by initiation, the vertices corresponding to the data objects that can be checked for the occurrence of the relationship on the graph data, edge information connecting the vertices, and the previously acquired data objects Vertex and edge information with properties similar to those of the fields are required. According to the start, the apparatus 100 for learning an artificial intelligence model may continue learning by merging the behavioral data acquired in real time with the existing data.

종래에는 문서 또는 카테고리 등의 특정 데이터를 대상으로 이와 다른 데이터 사이의 공통 토픽의 빈도 수 존재 확률을 통하여, 데이터 객체 사이의 연관관계를 도출하였다. 그러나, 이러한 종래의 방법은 정형 및 비정형 데이터를 통합하고 그래프 데이터로 변경하는 과정에서 수집·정제되는 데이터 간의 연관관계가 존재하지 않는 경우가 빈번히 발생하는 한계를 갖는다.Conventionally, a relationship between data objects was derived through the probability of the existence of the frequency number of common topics between different data and specific data such as documents or categories. However, such a conventional method has a limitation in that there is frequently no association between data collected and purified in the process of integrating structured and unstructured data and converting it into graph data.

도 3은 그래프 데이터를 가시적으로 나타낸 것으로서, 각각의 원은 정점을 나타내며, 원 사이를 연결하는 화살표는 정점 사이의 연관관계에 상응하는 벡터인 엣지를 나타낸다. 도 3에 도시된 본 발명의 일 실시예에 따르면, 현재 물리적으로 존재하지 않는 엣지(320)의 존재 조건을 결정하기 위하여 방향성을 가진 전체 그래프 모델 G에 대한 확률을 검토하며, 이 확률과 통합 플랫폼의 관리자가 설정한 임계값을 비교함으로써, 그래프 데이터 내의 관계정보와 확률통계기법을 통합적으로 활용하여 행동 데이터(311, 312) 사이의 엣지(320)라는 신규 연관관계 데이터를 자동으로 도출할 수 있다.3 is a visual representation of graph data, where each circle represents a vertex, and an arrow connecting the circles represents an edge, which is a vector corresponding to an association relationship between the vertices. According to an embodiment of the present invention shown in FIG. 3, in order to determine the existence condition of the edge 320 that does not currently exist physically, the probability of the entire graph model G with directionality is reviewed, and this probability and the integrated platform By comparing the threshold value set by the administrator of the graph data, the new relationship data, called the edge 320 between the behavior data 311 and 312, can be automatically derived by integrating the relationship information in the graph data and the probability statistics technique. .

인공지능 모델 학습 장치(100)는, 그래프 데이터 기반의 인공지능 학습을 통해 정형 및 비정형 데이터의 수집 및 전처리 시 데이터 간의 연관관계를 자동으로 찾아냄으로써 완성된 그래프 데이터 모델을 자동적으로 생성할 수 있다. 또한, 자동 연관관계 생성을 통하여 기존 테이블 형태의 2차원적인 데이터에서 볼 수 없었던 시스템적인 통찰력을 달성함으로써, 상품 추천, 사용자 행동 예측, 맞춤형 이벤트 생성, 마케팅 방법 도출 등에 유용하게 활용될 수 있다. 나아가, 데이터 간의 연관관계를 자동으로 만들어 내고 행동 데이터 내부의 각종 연관관계 패턴을 도출할 수 있으므로, 한정된 데이터만을 이용한 유의미한 해석이 가능하여 인공지능 예비 연구 기술 등에 유용하게 활용될 수 있다.The artificial intelligence model learning apparatus 100 may automatically generate a completed graph data model by automatically finding a relationship between data during collection and preprocessing of structured and unstructured data through artificial intelligence learning based on graph data. In addition, it can be usefully used for product recommendation, predicting user behavior, creating customized events, and deriving marketing methods by achieving systemic insights that could not be seen in existing table-type two-dimensional data through automatic association generation. Furthermore, since it is possible to automatically create associations between data and derive various association patterns within behavioral data, meaningful interpretations using only limited data are possible, which can be usefully used for artificial intelligence preliminary research techniques.

도 4는 일 개시에 의한 성공 라벨이 부여된 행동 데이터를 이용하여 인공지능 모델을 학습하는 특징을 설명하기 위한 도면이다.4 is a diagram for explaining a feature of learning an artificial intelligence model by using behavior data to which a success label is assigned according to an initiation.

일 개시에 의한 인공지능 모델 학습 장치(100)는 성공 라벨로 설정된 모든 사용자의 모든 그래프 데이터의 유사성을 이용하여, 성공 라벨의 중복 경로를 검출할 수 있다. 예를 들어, 쇼핑사이트에서 컨텐츠를 구매한 사용자 A, B 및 C의 성공 데이터를 살펴보면, A, B 및 C 모두 상품을 조회한 후, 해당 상품의 상품평을 조회한 것을 확인할 수 있다. 즉, 상품 조회 행동 후, 상품평 조회 행동을 연속적으로 수행한 행동은 구매로 이어질 확률이 높음을 확인할 수 있다. 이러한 상품 조회 및 상품평 조회의 경로는 3회 중복되었으며, 접속 경로의 동일성은 SNS 광고 통한 접속이 2회, 구매전 로그인은 2회, 할인쿠폰 발급은 2회로 검출되었음을 확인할 수 있다.The artificial intelligence model learning apparatus 100 according to one disclosure may detect a redundant path of the success label by using the similarity of all graph data of all users set as the success label. For example, looking at the success data of users A, B, and C who purchase content on a shopping site, it can be confirmed that after all of A, B, and C have searched for a product, they have searched for a product review of the corresponding product. In other words, it can be seen that after the product inquiry action, the action that continuously performs the product review inquiry action has a high probability of leading to a purchase. It can be seen that the paths of such product inquiry and product review inquiry were duplicated three times, and the identity of the connection path was detected as two times for access through SNS advertisement, two times for login before purchase, and two times for issuance of discount coupons.

인공지능 모델 학습 장치(100)는 성공 라벨의 중복 경로에 누적된 중복 횟수에 기초하여 가중치를 부여할 수 있다. 인공지능 모델 학습 장치(100)는 공통의 행동 데이터를 수행하는 사용자의 수가 증가할 수록 높은 가중치를 부여할 수 있다. 즉, 수행되는 행동 데이터가 많아질 수록, 해당 행동 데이터를 수행하는 사람은 목적 데이터로 도달할 확률이 높다는 것을 나타낸다. The artificial intelligence model learning apparatus 100 may assign a weight to the overlapping path of the success label based on the accumulated number of overlaps. The artificial intelligence model learning apparatus 100 may assign a higher weight as the number of users performing common behavioral data increases. That is, as the number of behavioral data to be performed increases, the person performing the behavioral data has a higher probability of reaching the target data.

또한, 성공 데이터 중 반복되는 데이터에 가중치를 부여하는 것을 통해, 해당 통합 플랫폼의 다각적인 마케팅 방법을 도출해낼 수 있다. 예를 들어, 가중치가 높은 접속 경로로 접속한 사용자에게 연관 상품을 추천한다거나, 가중치가 낮은 접속 경로로 접속한 사용자의 목록을 활용하여 상품의 쿠폰을 제공하거나, 접속 경로의 광고 방법을 변경할 수도 있다.In addition, by assigning weights to repeated data among success data, it is possible to derive a multifaceted marketing method of the corresponding integrated platform. For example, it is possible to recommend related products to users who have accessed through a high-weight access path, provide coupons for products using a list of users who have accessed through a low-weight access path, or change the advertising method of the access path. .

인공지능 모델 학습 장치(100)는 성공 라벨의 중복 경로에 부여된 가중치에 기초하여, 성공 라벨의 중복 경로에 포함된 소정의 행동 데이터로부터 목적 데이터까지 최단 경로를 결정하고, 최단 경로와 연관된 성공 확률 증가형 이벤트를 생성할 수 있다. The artificial intelligence model learning apparatus 100 determines the shortest path from predetermined behavior data included in the overlapping path of the success label to the target data based on the weight given to the overlapping path of the success label, and a success probability associated with the shortest path. You can create incremental events.

일 개시에 의하여 최단 경로는 현재 사용자의 행동에 기초하여 가장 성공 확률을 높일 수 있는 행동 데이터의 경로를 의미한다. 최단 경로는 목적 데이터의 종류에 따라 다르게 설정될 수 있으며, 행동 데이터의 속성에 따라 다르게 설정될 수 있다. 예를 들어, 사용자가 제 1 제품의 상세설명 및 상품평에 많은 시간을 할애하고 있는 행동 데이터를 획득한 경우, 사용자에게 제 1 제품의 쿠폰을 제공함으로써 제 1 제품의 구매를 유도할 수 있다. 또는, 제 1 제품의 페이지와 제 2 제품의 페이지를 번갈아가며 클릭하고 시간을 할애하는 행동 데이터를 획득한 경우, 2개의 제품을 동시에 구매하는 경우 할인해주는 이벤트를 최단 경로로서 설정할 수 있다.The shortest path by initiation refers to a path of behavior data that can increase the probability of success most based on the current user's behavior. The shortest path may be set differently according to the type of target data, and may be set differently according to the property of the behavior data. For example, when a user acquires behavioral data that spends a lot of time on a detailed description of a first product and a product review, it is possible to induce a purchase of the first product by providing a coupon for the first product to the user. Alternatively, when the page of the first product and the page of the second product are alternately clicked and behavioral data for dedicating time is obtained, an event for discounting when two products are purchased at the same time may be set as the shortest path.

인공지능 모델 학습 장치(100)는 인공지능 학습 모델을 이용하여 성공 확률 증가형 이벤트를 결정할 수 있으며, 또는, 통합 플랫폼의 관리자로부터 제공받은 적어도 하나의 이벤트 중 어느 하나를 성공 확률 증가형 이벤트로 결정할 수 있다.The artificial intelligence model learning device 100 may determine a success probability increase type event using an artificial intelligence learning model, or determine any one of at least one event provided from the administrator of the integrated platform as a success probability increase type event. I can.

도 5는 일 개시에 의한 실패 라벨이 부여된 행동 데이터를 이용하여 인공지능 모델을 학습하는 특징을 설명하기 위한 도면이다.FIG. 5 is a diagram for explaining a feature of learning an artificial intelligence model using behavior data to which a failure label is assigned due to an initiation.

인공지능 모델 학습 장치(100)는 실패 라벨이 부여된 모든 사용자의 모든 그래프 데이터의 유사성을 이용하여, 중복 경로를 검출할 수 있다. 예를 들어, D 및 E의 비구매로 인한 실패 데이터를 이용하여, D와 E가 여러 상품을 짧은 시간만 조회하고 다른 상품으로 넘어간 경로의 유사성을 이용하여 짧은 시간 동안 상품을 조회하고 페이지를 변경하는 행동 데이터를 중복 경로로 검출할 수 있다. The artificial intelligence model learning apparatus 100 may detect the redundant path by using the similarity of all graph data of all users to which the failure label is assigned. For example, using data on failures due to non-purchase of D and E, D and E search for several products for a short time, and use the similarity of the path to other products to search for products and change pages for a short time. Behavioral data can be detected as redundant paths.

인공지능 모델 학습 장치(100)는 실패 라벨의 중복 경로에 누적된 중복 횟수에 기초하여 가중치를 부여할 수 있다. 인공지능 모델 학습 장치(100)는 반복된 실패 원인을 분석하고, 가중치를 부여함으로써, 어떤 원인으로 사용자가 이탈하는지 여부를 확인할 수 있다.The artificial intelligence model learning apparatus 100 may assign a weight to the overlapping path of the failure label based on the accumulated number of overlaps. The artificial intelligence model learning apparatus 100 analyzes the cause of the repeated failure and assigns a weight to it, so that it can be checked for what cause the user leaves.

인공지능 모델 학습 장치(100)는 실패 라벨의 중복 경로에 부여된 가중치에 기초하여 실패 확률 개선형 이벤트를 생성할 수 있다. 예를 들어, 인공지능 모델 학습 장치(100)는 상품의 상세 정보 페이지에서 페이지 이탈률이 상대적으로 높게 나오는 경우, 종료한 사용자의 목록을 활용하여 사용자에게 쿠폰을 제공함으로써 구매 전환을 유도할 수 있다.The artificial intelligence model learning apparatus 100 may generate a failure probability improvement type event based on a weight assigned to a duplicate path of the failure label. For example, when the page bounce rate is relatively high on the product detail information page, the artificial intelligence model learning apparatus 100 may induce a purchase conversion by providing a coupon to the user using a list of users who have ended.

일 개시에 의하여 인공지능 모델 학습 장치(100)는 사용자의 행동 데이터로부터 실패 라벨의 중복 경로가 검출되는 경우, 실패 확률 개선형 이벤트를 제공할 수 있다. 예를 들어, 특정 상품의 페이지에서 종료한 이력이 있는 사용자에게 특정 상품의 쿠폰을 제공함으로써, 사용자의 구매 전환을 유도할 수 있다.According to an initiation, the apparatus 100 for learning an artificial intelligence model may provide a failure probability improvement event when a duplicate path of a failure label is detected from the user's behavior data. For example, by providing a coupon for a specific product to a user who has a history of ending on a page of a specific product, it is possible to induce a user's purchase conversion.

인공지능 모델 학습 장치(100)는 인공지능 학습 모델을 이용하여 실패 확률 개선형 이벤트를 결정할 수 있으며, 또는, 통합 플랫폼의 관리자로부터 제공받은 적어도 하나의 이벤트 중 어느 하나를 실패 확률 개선형 이벤트로 결정할 수 있다.The artificial intelligence model learning device 100 may determine a failure probability improvement type event using the artificial intelligence learning model, or determine any one of at least one event provided from the administrator of the integrated platform as a failure probability improvement type event. I can.

일 실시 예에서, 인공지능 모델 학습 장치(100)는 각 사용자의 행동 데이터 및 이에 따라 제공된 목적 달성을 위한 유도책에 대한 정보 및 이에 따른 목적 달성여부에 대한 정보를 수집 및 저장할 수 있다. 인공지능 모델 학습 장치(100)는 수집된 정보에 기반하여 사용자의 행동 데이터에 기반하여, 사용자의 목적 달성을 유도할 수 있는 유도책에 대한 정보를 도출할 수 있는 인공지능 모델을 학습시킬 수 있다.In an embodiment, the artificial intelligence model learning apparatus 100 may collect and store information about each user's behavioral data, information about an induction plan for achieving a purpose provided accordingly, and information about whether or not the purpose is achieved accordingly. The artificial intelligence model learning apparatus 100 may train an artificial intelligence model that can derive information on guidance measures that can induce the user's goal attainment based on the user's behavior data based on the collected information. .

실시 예에 따라서, 인공지능 모델 학습 장치(100)는 목적 달성여부 외에도 기존 달성확률 대비 유도책이 제공된 후 달성확률의 변화에 대한 정보를 획득할 수도 있다. 인공지능 모델 학습 장치(100)는 달성확률이 기 설정된 기준값 이상 변경(상승)된 유도책에 대하여 유효 라벨링을, 그렇지 못한 유도책에 대하여 무효 라벨링을 수행할 수도 있다.According to an embodiment, the apparatus 100 for learning an artificial intelligence model may acquire information on a change in an achievement probability after providing an induction plan compared to an existing achievement probability in addition to whether or not an object has been achieved. The artificial intelligence model learning apparatus 100 may perform valid labeling for induction books whose achievement probability is changed (increased) by more than a preset reference value, and invalid labeling for induction books that are not.

인공지능 모델 학습 장치(100)는 라벨링된 학습 데이터에 기반하여, 사용자에게 제공할 유도책을 결정하는 인공지능 모델을 학습시킬 수 있다.The artificial intelligence model learning apparatus 100 may train an artificial intelligence model that determines an induction book to be provided to a user based on the labeled learning data.

일 실시 예에서, 인공지능 모델 학습 장치(100)는 달성하고자 하는 목적의 난이도를 설정할 수 있다. 예를 들어, 제품 구매의 난이도가 10이라 할 때, 장바구니에 담기의 난이도는 7, 이메일 구독의 난이도는 5, 페이지 팔로우의 난이도는 3, 좋아요 누르기의 난이도는 2로 각각 설정될 수 있다.In an embodiment, the apparatus 100 for learning an artificial intelligence model may set a difficulty level of an objective to be achieved. For example, when the difficulty of purchasing a product is 10, the difficulty of adding to the cart is 7, the difficulty of subscribing to an email is 5, the difficulty of following a page is 3, and the difficulty of clicking a like is 2, respectively.

이에 따라, 인공지능 모델 학습 장치(100)는 최종 목적으로 제품 구매가 설정된 경우에, 사용자의 행동 데이터에 기반하여 예측되는 목적 달성 확률이 70%인 경우, 현재 보고 있는 제품을 장바구니에 담을 것인지를 묻는 질의 혹은 팝업 페이지를 사용자에게 제공할 수 있다. Accordingly, when the product purchase is set for the final purpose, the artificial intelligence model learning apparatus 100 determines whether to put the currently viewed product in the shopping cart when the predicted goal achievement probability based on the user's behavior data is 70%. You can provide the user with a query or pop-up page that asks.

마찬가지로, 사용자의 행동 데이터에 기반하여 예측되는 목적 달성 확률이 50%인 경우, 사용자에게 이메일 구독을 요청하는 메시지를 제공할 수 있다. 즉, 인공지능 모델 학습 장치(100)에 따라 학습된 모델 및 이를 이용하는 시스템의 경우, 최종 목적을 달성할 수 있도록 사용자에게 유도책을 제공하는 동시에, 목적 달성에 실패하는 경우에도 최대한의 성과를 얻을 수 있도록 이에 대응하는 작업들을 사용자에게 요청함으로써, 사용자의 긍정적인 행동 및 피드백을 유도할 수 있다.Similarly, when the predicted goal achievement probability based on the user's behavior data is 50%, a message requesting an email subscription to the user may be provided. In other words, in the case of a model trained according to the artificial intelligence model learning apparatus 100 and a system using the same, at the same time, in the case of failure to achieve the goal, the maximum result is obtained while providing guidance to the user to achieve the final purpose. By requesting actions corresponding to this to the user, it is possible to induce a positive action and feedback of the user.

온라인 마케팅의 경우, 오프라인 마케팅을 통해 사용자와 실시간으로 인터렉션하며 상황에 맞게 상담하는 것과 달리 기 설정된 목적을 달성하느냐, 달성하지 못하느냐의 이분법적 결과를 가져오는 것이 일반적이다. 개시된 실시 예에 따르면, 사용자의 행동 데이터에 기반하여 최종 목적을 달성하지 못하더라도, 사용자의 현재 상태에 따라 최대한의 부차적인 성과를 얻을 수 있도록 하는 모델을 제공하는 장점이 있다.In the case of online marketing, it is common for users to interact in real time through offline marketing and to provide a dichotomous result of whether a predetermined purpose is achieved or not achieved, unlike consulting according to the situation. According to the disclosed embodiment, there is an advantage of providing a model capable of obtaining a maximum secondary performance according to the current state of the user even if the final purpose is not achieved based on the user's behavior data.

또한, 인공지능 모델 학습 장치(100)는 개시된 실시 예에 따른 모델에 기반하여 사용자의 행동 데이터에 따른 목적 달성 가능성을 판단하고, 이에 따라 사용자에게 비용 할인 쿠폰을 지급할 것을 결정할 수 있다.In addition, the artificial intelligence model learning apparatus 100 may determine the possibility of achieving a goal according to the user's behavior data based on the model according to the disclosed embodiment, and accordingly determine to provide a discount coupon to the user.

예를 들어, 인공지능 모델 학습 장치(100)는 사용자가 제품을 구매할 확률이 제1 임계값 이상인 경우, 별도의 할인 쿠폰을 지급하지 않을 수 있다. 반면, 사용자가 제품을 구매할 확률이 제2 임계값 이상 제1 임계값 이하인 경우, 할인 쿠폰을 지급하여 사용자의 구매를 유도할 수도 있다.For example, the artificial intelligence model learning apparatus 100 may not provide a separate discount coupon when the probability that the user purchases a product is greater than or equal to a first threshold value. On the other hand, when the probability that the user purchases the product is equal to or greater than the second threshold and equal to or less than the first threshold, a discount coupon may be provided to induce the user's purchase.

일 실시 예에서, 할인 쿠폰의 할인율은 동일한 제품에 대한 다른 플랫폼에서의 판매가격에 기초하여 결정될 수 있다. 예를 들어, 인공지능 모델 학습 장치(100)는 현재 사용자가 보고 있는 제품의 인터넷 최저가 및 타 플랫폼에서의 평균 가격 등을 포함하는 가격정보를 획득하고, 현재 사용자가 보고 있는 페이지에서의 판매가격과 비교할 수 있다. 이때, 인공지능 모델 학습 장치(100)는 해당 제품의 인터넷 최저가와 현재 사용자가 보고 있는 페이지에서의 판매가격 중 인터넷 최저가가 더 낮은 경우, 인터넷 최저가 혹은 인터넷 최저가와 현재 페이지의 가격 중 중간에 해당하는 특정 비용의 할인가격을 제공할 수 있도록 하는 할인율을 결정하고, 이에 따른 할인쿠폰을 제공할 수 있다.In an embodiment, the discount rate of the discount coupon may be determined based on the selling price of the same product on different platforms. For example, the artificial intelligence model learning apparatus 100 acquires price information including the lowest Internet price of the product currently viewed by the user and the average price on other platforms, and the selling price on the page currently viewed by the user Can be compared. At this time, the artificial intelligence model learning apparatus 100 is the lowest Internet price among the Internet lowest price of the product and the selling price on the page currently viewed by the user, which corresponds to the middle of the Internet lowest price or the Internet lowest price A discount rate for providing a discount price of a specific cost may be determined, and a discount coupon may be provided accordingly.

도 6은 일 개시에 의한 사용자 행동을 획득하고, 사용자 정보와 매핑하여 행동 데이터를 분석하는 특징을 설명하기 위한 도면이다.6 is a diagram for explaining a characteristic of acquiring a user behavior according to an initiation and analyzing behavior data by mapping it with user information.

일 개시에 의하여 도 6(a)에서는 인공지능 모델 학습 장치(100)는 가 실제로 사용자 행동을 수집한 실시예를 보여준다. 일 개시에 의하여 인공지능 모델 학습 장치(100)는 사용자의 통합 플랫폼의 접속 시점에서부터 종료 시점까지의 행동 데이터를 수집할 수 있다. 예를 들어, 8시 3분에 사용자가 상품 페이지를 조회하였으며, 8시 4분에 상품을 구매하였다는 행동 데이터를 수집할 수 있다.According to one disclosure, in FIG. 6(a), the apparatus 100 for learning an artificial intelligence model shows an embodiment in which the user behavior is actually collected. By initiation, the artificial intelligence model learning apparatus 100 may collect behavioral data from a user's access point to an end point of the integrated platform. For example, behavioral data indicating that a user viewed a product page at 8:3 and purchased a product at 8:04 may be collected.

일 개시에 의하여 도 6(b)에서는 통합 플랫폼에 접속한 사용자의 정보를 보여준다. 일 개시에 의한 사용자 정보는 사용자의 개인 식별 정보 및 최근 방문 정보를 포함할 수 있다. According to one disclosure, FIG. 6(b) shows information of a user who has accessed the integrated platform. User information according to an initiation may include personal identification information and recent visit information of the user.

일 개시에 의하여, 인공지능 모델 학습 장치(100)는 사용자 정보와 행동 데이터를 개별적으로 저장하여 각각에 대한 데이터를 분석함으로써, 사용자의 행동 데이터에 기반한 성공 확률을 판단할 수 있다. 나아가, 인공지능 모델 학습 장치(100)는 사용자 정보와 사용자의 행동 데이터를 매핑하여 저장하고 분석함으로써, 사용자 맞춤형 데이터 분석이 가능하다. 따라서, 사용자의 행동 패턴에 따른 맞춤형 분석이 가능해진다.At the beginning, the artificial intelligence model learning apparatus 100 may determine a success probability based on the user's behavior data by separately storing user information and behavior data and analyzing the data for each. Further, the artificial intelligence model learning apparatus 100 maps, stores, and analyzes user information and user behavior data, thereby enabling user-customized data analysis. Therefore, customized analysis according to the user's behavior pattern becomes possible.

도 7은 일 개시에 의한 사용자의 통합 플랫폼 이용 내역에 따라 사용자 그룹을 관리하는 특징을 설명하기 위한 도면이다.FIG. 7 is a diagram for describing a feature of managing a user group according to a user's use of an integrated platform according to an initiation.

일 개시에 의하여, 인공지능 모델 학습 장치(100)는 사용자의 통합 플랫폼 이용 내역을 확인하여 각 특성에 맞는 사용자 그룹을 생성할 수 있다. According to one disclosure, the artificial intelligence model learning apparatus 100 may check the user's use of the integrated platform and create a user group suitable for each characteristic.

일 개시에 의하여 인공지능 모델 학습 장치(100)는 사용자의 통합 플랫폼 이용 내역 정보에 기초하여, 기존에 목적 데이터에 도달한 이력이 있는 사용자를 제 1 그룹으로 분류할 수 있다. 예를 들어, 인공지능 모델 학습 장치(100)는 사용자의 구매 횟수에 따라 1회 구매자 그룹, 2회 구매자 그룹 등으로 분류할 수 있다. 또한, 인공지능 모델 학습 장치(100)는 기존에 구매이력이 없는 사용자 그룹을 생성할 수 있으며, 보다 구체적으로 어떤 행동 데이터를 기반으로 구매를 하지 않았는지 여부를 기준으로 세부 그룹을 생성할 수 있다.According to one disclosure, the apparatus 100 for learning an artificial intelligence model may classify users who have a history of reaching target data into a first group based on the user's information on using the integrated platform. For example, the artificial intelligence model learning apparatus 100 may be classified into a one-time purchaser group, a two-time purchaser group, or the like according to the number of purchases by the user. In addition, the artificial intelligence model learning apparatus 100 may create a user group that does not have a previous purchase history, and more specifically, may create a detailed group based on whether or not a purchase has been made based on which behavior data. .

또한, 인공지능 모델 학습 장치(100)는 제 1 그룹이 기 도달한 목적 데이터의 성질에 기초하여, 제 1 그룹에 제공할 적어도 하나의 추천 컨텐츠를 결정하고, 적어도 하나의 추천 컨텐츠를 제 1 그룹에 제공할 수 있다. 예를 들어, 인공지능 모델 학습 장치(100)는 사용자가 적어도 2회 이상 구매한 그룹에 속한 경우, 구매한 컨텐츠 이력을 기준으로 사용자에게 추천할 컨텐츠를 결정할 수 있다. 또는 사용자가 적어도 2회 이상 클릭한 특정 상품을 추천 컨텐츠로 결정하고, 특정 상품에 대한 쿠폰을 발급할 수도 있다.In addition, the artificial intelligence model learning apparatus 100 determines at least one recommended content to be provided to the first group based on the property of the target data previously reached by the first group, and provides at least one recommended content to the first group. Can be provided to. For example, when the user belongs to a group that has purchased at least two or more times, the artificial intelligence model learning apparatus 100 may determine content to be recommended to the user based on the purchased content history. Alternatively, a specific product that the user clicks at least twice or more may be determined as recommended content, and a coupon for a specific product may be issued.

즉, 인공지능 모델 학습 장치(100)는 사용자의 모든 행동을 분석하고 그룹화함으로써, 마케팅 자료로서 활용할 수 있다.That is, the artificial intelligence model learning apparatus 100 can be used as marketing data by analyzing and grouping all user actions.

도 8은 사용자의 시계열적 행동 데이터를 이용하여 인공지능 모델을 학습시키는 장치의 구조를 설명하기 위한 도면이다.FIG. 8 is a diagram for explaining the structure of an apparatus for training an artificial intelligence model using time-series behavior data of a user.

그러나, 도 8에 도시된 구성 요소 모두가 인공지능 모델 학습 장치(100)의 필수 구성 요소인 것은 아니다. 도 8에 도시된 구성 요소보다 많은 구성 요소에 의해 인공지능 모델 학습 장치(100)가 구현될 수도 있고, 더 적은 구성 요소에 의해 구현될 수도 있다.However, not all of the components shown in FIG. 8 are essential components of the apparatus 100 for learning an artificial intelligence model. The apparatus 100 for learning the artificial intelligence model may be implemented by more components than those shown in FIG. 8, or by fewer components.

프로세서(1100)는, 통상적으로 인공지능 모델 학습 장치(100의 전반적인 동작을 제어한다. 일 개시에 의하여 제어부는 프로세서라고도 칭한다. 예를 들어, 프로세서(1100)는, 메모리(1200)에 저장된 프로그램들을 실행함으로써, 사용자 입력부(1100), 출력부(1200), 센싱부(1400), 통신부(1500), A/V 입력부(1600) 등을 전반적으로 제어할 수 있다. 또한, 프로세서(1100)는 메모리(1200)에 저장된 프로그램들을 실행함으로써, 인공지능 모델 학습 장치(100의 기능을 수행할 수 있다. 프로세서(1100)는 적어도 하나의 프로세서를 구비할 수 있다. 프로세서(1100)는 그 기능 및 역할에 따라, 복수의 프로세서들을 포함하거나, 통합된 형태의 하나의 프로세서를 포함할 수 있다. 일 실시예에서, 프로세서(1100)는 메모리(1200)에 저장된 적어도 하나의 프로그램을 실행함으로써 알림 메시지를 제공하도록 하는 적어도 하나의 프로세서를 포함할 수 있다.The processor 1100 generally controls the overall operation of the artificial intelligence model learning apparatus 100. According to one disclosure, the controller is also referred to as a processor. For example, the processor 1100 includes programs stored in the memory 1200. By executing, the user input unit 1100, the output unit 1200, the sensing unit 1400, the communication unit 1500, the A/V input unit 1600 can be controlled overall. By executing programs stored in 1200, the function of the artificial intelligence model learning apparatus 100 may be performed. The processor 1100 may include at least one processor. The processor 1100 may be provided with its functions and roles. Accordingly, a plurality of processors may be included, or an integrated type of processor may be included. In one embodiment, the processor 1100 executes at least one program stored in the memory 1200 to provide a notification message. It may include at least one processor.

메모리(1200)는, 프로세서(1100)의 처리 및 제어를 위한 프로그램을 저장할 수 있고, 행동 분석 장치(100)로 입력되거나 행동 분석 장치(100)로부터 출력되는 데이터를 저장할 수도 있다. The memory 1200 may store a program for processing and controlling the processor 1100, and may store data input to the behavior analysis apparatus 100 or output from the behavior analysis apparatus 100.

메모리(1200)는 플래시 메모리 타입(flash memory type), 하드디스크 타입(hard disk type), 멀티미디어 카드 마이크로 타입(multimedia card micro type), 카드 타입의 메모리(예를 들어 SD 또는 XD 메모리(1200) 등), 램(RAM, Random Access Memory) SRAM(Static Random Access Memory), 롬(ROM, Read-Only Memory), EEPROM(Electrically Erasable Programmable Read-Only Memory), PROM(Programmable Read-Only Memory), 자기 메모리, 자기 디스크, 광디스크 중 적어도 하나의 타입의 저장매체를 포함할 수 있다. The memory 1200 is a flash memory type, a hard disk type, a multimedia card micro type, a card type memory (for example, SD or XD memory 1200). ), RAM (Random Access Memory) SRAM (Static Random Access Memory), ROM (ROM, Read-Only Memory), EEPROM (Electrically Erasable Programmable Read-Only Memory), PROM (Programmable Read-Only Memory), Magnetic Memory , A magnetic disk, and an optical disk.

메모리(1200)에 저장된 프로그램들은 그 기능에 따라 복수 개의 모듈들로 분류할 수 있는데, 여기서, 복수 개의 모듈들은 하드웨어가 아닌 소프트웨어로서, 기능적으로 동작하는 모듈을 의미한다.Programs stored in the memory 1200 may be classified into a plurality of modules according to their functions. Here, the plurality of modules are software, not hardware, and refer to modules that operate functionally.

일 실시예에 따른 프로세서(1100)는 데이터 학습부(1101) 및 데이터 인식부(1102)를 포함할 수 있다.The processor 1100 according to an embodiment may include a data learning unit 1101 and a data recognition unit 1102.

데이터 학습부(1101) 및 데이터 인식부(1102) 중 적어도 하나는, 적어도 하나의 하드웨어 칩 형태로 제작되어 전자 장치에 탑재될 수 있다. 예를 들어, 데이터 학습부(1101) 및 데이터 인식부(1102) 중 적어도 하나는 인공 지능(AI; artificial intelligence)을 위한 전용 하드웨어 칩 형태로 제작될 수도 있고, 또는 기존의 범용 프로세서(예: CPU 또는 application processor) 또는 그래픽 전용 프로세서(예: GPU)의 일부로 제작되어 전술한 각종 전자 장치에 탑재될 수도 있다.At least one of the data learning unit 1101 and the data recognition unit 1102 may be manufactured in the form of at least one hardware chip and mounted on an electronic device. For example, at least one of the data learning unit 1101 and the data recognition unit 1102 may be manufactured in the form of a dedicated hardware chip for artificial intelligence (AI), or an existing general-purpose processor (eg, a CPU Alternatively, it may be manufactured as a part of an application processor) or a graphics dedicated processor (eg, a GPU) and mounted on the aforementioned various electronic devices.

이 경우, 데이터 학습부(1101) 및 데이터 인식부(1102)는 하나의 전자 장치에 탑재될 수도 있으며, 또는 별개의 전자 장치들에 각각 탑재될 수도 있다. 예를 들어, 데이터 학습부(1101) 및 데이터 인식부(1102) 중 하나는 전자 장치에 포함되고, 나머지 하나는 서버에 포함될 수 있다. 또한, 데이터 학습부(1101) 및 데이터 인식부(1102)는 유선 또는 무선으로 통하여, 데이터 학습부(1101)가 구축한 모델 정보를 데이터 인식부(1102)로 제공할 수도 있고, 데이터 인식부(1102)로 입력된 데이터가 추가 학습 데이터로서 데이터 학습부(1101)로 제공될 수도 있다.In this case, the data learning unit 1101 and the data recognition unit 1102 may be mounted on one electronic device, or may be mounted on separate electronic devices, respectively. For example, one of the data learning unit 1101 and the data recognition unit 1102 may be included in the electronic device, and the other may be included in the server. In addition, the data learning unit 1101 and the data recognition unit 1102 may provide model information built by the data learning unit 1101 to the data recognition unit 1102 through wired or wireless communication, or the data recognition unit ( Data input to 1102 may be provided to the data learning unit 1101 as additional learning data.

한편, 데이터 학습부(1101) 및 데이터 인식부(1102) 중 적어도 하나는 소프트웨어 모듈로 구현될 수 있다. 데이터 학습부(1101) 및 데이터 인식부(1102) 중 적어도 하나가 소프트웨어 모듈(또는, 인스트럭션(instruction) 포함하는 프로그램 모듈)로 구현되는 경우, 소프트웨어 모듈은 컴퓨터로 읽을 수 있는 판독 가능한 비일시적 판독 가능 기록매체(non-transitory computer readable media)에 저장될 수 있다. 또한, 이 경우, 적어도 하나의 소프트웨어 모듈은 OS(Operating System)에 의해 제공되거나, 소정의 애플리케이션에 의해 제공될 수 있다. 또는, 적어도 하나의 소프트웨어 모듈 중 일부는 OS(Operating System)에 의해 제공되고, 나머지 일부는 소정의 애플리케이션에 의해 제공될 수 있다.Meanwhile, at least one of the data learning unit 1101 and the data recognition unit 1102 may be implemented as a software module. When at least one of the data learning unit 1101 and the data recognition unit 1102 is implemented as a software module (or a program module including an instruction), the software module is computer-readable and non-transitory. It may be stored in a non-transitory computer readable media. Also, in this case, at least one software module may be provided by an operating system (OS) or may be provided by a predetermined application. Alternatively, some of the at least one software module may be provided by an operating system (OS), and the remaining part may be provided by a predetermined application.

본 발명의 일 실시예에 따른 방법은 다양한 컴퓨터 수단을 통하여 수행될 수 있는 프로그램 명령 형태로 구현되어 컴퓨터 판독 가능 매체에 기록될 수 있다. 상기 컴퓨터 판독 가능 매체는 프로그램 명령, 데이터 파일, 데이터 구조 등을 단독으로 또는 조합하여 포함할 수 있다. 상기 매체에 기록되는 프로그램 명령은 본 발명을 위하여 특별히 설계되고 구성된 것들이거나 컴퓨터 소프트웨어 당업자에게 공지되어 사용 가능한 것일 수도 있다. 컴퓨터 판독 가능 기록 매체의 예에는 하드 디스크, 플로피 디스크 및 자기 테이프와 같은 자기 매체(magnetic media), CD-ROM, DVD와 같은 광기록 매체(optical media), 플롭티컬 디스크(floptical disk)와 같은 자기-광 매체(magneto-optical media), 및 롬(ROM), 램(RAM), 플래시 메모리 등과 같은 프로그램 명령을 저장하고 수행하도록 특별히 구성된 하드웨어 장치가 포함된다. 프로그램 명령의 예에는 컴파일러에 의해 만들어지는 것과 같은 기계어 코드뿐만 아니라 인터프리터 등을 사용해서 컴퓨터에 의해서 실행될 수 있는 고급 언어 코드를 포함한다. The method according to an embodiment of the present invention may be implemented in the form of program instructions that can be executed through various computer means and recorded in a computer-readable medium. The computer-readable medium may include program instructions, data files, data structures, and the like alone or in combination. The program instructions recorded on the medium may be specially designed and configured for the present invention, or may be known and usable to those skilled in computer software. Examples of computer-readable recording media include magnetic media such as hard disks, floppy disks, and magnetic tapes, optical media such as CD-ROMs and DVDs, and magnetic media such as floptical disks. -A hardware device specially configured to store and execute program instructions such as magneto-optical media, and ROM, RAM, flash memory, and the like. Examples of program instructions include not only machine language codes such as those produced by a compiler, but also high-level language codes that can be executed by a computer using an interpreter or the like.

이상에서 본 발명의 실시예에 대하여 상세하게 설명하였지만 본 발명의 권리범위는 이에 한정되는 것은 아니고 다음의 청구범위에서 정의하고 있는 본 발명의 기본 개념을 이용한 당업자의 여러 변형 및 개량 형태 또한 본 발명의 권리범위에 속한다.Although the embodiments of the present invention have been described in detail above, the scope of the present invention is not limited thereto, and various modifications and improvements by those skilled in the art using the basic concept of the present invention defined in the following claims are also provided. It belongs to the scope of rights.

Claims

Acquiring behavioral data on all time-series behaviors of a user accessing the integrated platform;
A labeling step of setting a label by determining whether target data determined by an administrator of the integrated platform has been reached for each of the user's behavior data; And
Including the step of learning an artificial intelligence model by repeating the labeling step for all time-series behavior data of other users who have accessed the integrated platform,
A method of training an artificial intelligence model using the user's time series behavioral data.

The method of claim 1,
The labeling step,
Obtaining target data from an administrator of the integrated platform;
Setting a success label for each of all of the user's behavioral data when object data of the integrated platform exists among all the behavioral data of the user;
Setting a failure label for each of the user's all behavioral data when the object data of the integrated platform does not exist among all the behavioral data of the user; And
Including the step of training an artificial intelligence model by using the label-set behavioral data,
A method of training an artificial intelligence model using the user's time series behavioral data.

The method of claim 1,
The behavioral data includes visit traffic data accessing the integrated platform, response data for content included in the integrated platform, response data for events provided by the integrated platform, identification data of a user registered in the integrated platform, and the Characterized in that it contains unstructured data generated while accessing the integrated platform,
A method of training an artificial intelligence model using the user's time series behavioral data.

The method of claim 1,
The target data is at least one behavioral data designated by an administrator of the integrated platform, wherein the artificial intelligence model is individually learned based on an attribute of the target data,
A method of training an artificial intelligence model using the user's time series behavioral data.

The method of claim 1,
The step of training the artificial intelligence model,
Generating graph data by connecting vertices representing each of the behavioral data for which the same label is set with an edge, which is a vector having a directionality;
Calculating a probability of occurrence of an edge that can be generated from the graph data based on the number of edges connected to the vertices;
When the probability of occurrence of the edge is greater than or equal to a threshold value, generating a new edge in the graph data to derive a relationship between the vertices; And
Including the step of learning the probability of reaching the target data from the user's behavior data using the association,
A method of training an artificial intelligence model using the user's time series behavioral data.

The method of claim 5,
Detecting a redundant path of the success label by using the similarity of all graph data of all users set as the success label;
Assigning a weight to the duplicate path of the success label based on the accumulated number of duplicates;
Determining a shortest path from predetermined behavioral data included in the overlapping path of the success label to target data based on a weight assigned to the overlapping path of the success label;
Generating a success probability increasing event associated with the shortest path; And
In case a duplicate path of the success label is detected from the user's behavior data, providing the success probability increase type event,
A method of training an artificial intelligence model using the user's time series behavioral data.

The method of claim 5,
Detecting a redundant path by using the similarity of all graph data of all users to which a failure label is assigned;
Assigning a weight to the duplicate path of the failure label based on the accumulated number of duplicates;
Generating a failure probability improvement type event based on a weight assigned to a duplicate path of the failure label; And
In case a duplicate path of the failure label is detected from the user's behavior data, providing the failure probability improvement type event,
A method of training an artificial intelligence model using the user's time series behavioral data.

A memory for storing one or more instructions; And
A processor that executes the one or more instructions stored in the memory,
The processor executes the one or more instructions,
An apparatus for performing the method of claim 1.

A computer program combined with a computer as hardware and stored in a recording medium readable by a computer to perform the method of claim 1.