KR20240034078A

KR20240034078A - Method and system for dynamic linkage of classification system

Info

Publication number: KR20240034078A
Application number: KR1020220138921A
Authority: KR
Inventors: 유선희; 원동규; 임종연
Original assignee: 한국과학기술정보연구원
Priority date: 2022-09-06
Filing date: 2022-10-26
Publication date: 2024-03-13

Abstract

분류체계의 동적 연계 방법 및 그 방법이 적용된 시스템이 제공된다. 본 개시의 일 실시예에 따른 분류체계의 동적 연계 방법은, 제1 분류 체계 상의 제1 분류 코드의 제1 설명문(description)을 획득하는 단계, 제2 분류 체계 상의 제2 분류 코드의 제2 설명문을 획득하는 단계, 상기 제1 설명문을 표현하는 입력 피처를 벡터 추출 모델에 입력하고, 상기 벡터 추출 모델에서 출력된 제1 벡터를 획득하는 단계, 상기 제2 설명문을 표현하는 입력 피처를 상기 벡터 추출 모델에 입력하고, 상기 벡터 추출 모델에서 출력된 제2 벡터를 획득하는 단계, 및 상기 제1 벡터와 상기 제2 벡터 사이의 특징 공간 상의 거리를 기반으로, 상기 제1 분류 코드와 상기 제2 분류 코드의 대응 여부를 추론하는 단계를 포함할 수 있다.A dynamic linkage method of a classification system and a system to which the method is applied are provided. The dynamic linkage method of classification systems according to an embodiment of the present disclosure includes obtaining a first description of a first classification code on a first classification system, and a second description of a second classification code on a second classification system. Obtaining, inputting an input feature representing the first description into a vector extraction model, obtaining a first vector output from the vector extraction model, extracting the input feature representing the second description into the vector Inputting a model and obtaining a second vector output from the vector extraction model, and based on the distance in feature space between the first vector and the second vector, the first classification code and the second classification It may include a step of inferring whether the code corresponds.

Description

Dynamic linkage method of classification system and its system {METHOD AND SYSTEM FOR DYNAMIC LINKAGE OF CLASSIFICATION SYSTEM}

본 개시는 분류체계를 동적으로 연계하는 방법 및 그 시스템에 관한 것이다. 보다 자세하게는, 기술, 제품, 시장 및 산업 간의 연계를 자동화하는 방법 및 그 시스템에 관한 것이다.This disclosure relates to a method and system for dynamically linking classification systems. More specifically, it relates to methods and systems for automating linkages between technologies, products, markets, and industries.

기술 혁신을 고려할 경우 가장 기본적이고 우선시되는 기반은 해당 기술의 응용 분야 또는 관련 제품 및 서비스를 정의하는 것인데, 기존의 정의 방법은 전문가 의존성이 높아, 자동화된 맞춤형 분석 정보 지원 체제를 구축하기 어렵다.When considering technological innovation, the most basic and priority basis is to define the application field of the technology or related products and services, but the existing definition method is highly dependent on experts, making it difficult to build an automated, customized analysis information support system.

기존의 기술에 대한 대표적인 특허는 기술적 관점에서 작성된 특허 분류체계를 따르는데, 기술적 관점에서 작성된 분류체계는 연구 개발 비용이나 생산성과 같이 산업 분류체계에 따라 수집되는 산업 데이터 및 경제 데이터와의 연계가 어렵다는 문제점이 있다.Representative patents for existing technologies follow a patent classification system created from a technical perspective, but the classification system created from a technical perspective is difficult to link with industrial data and economic data collected according to the industrial classification system, such as research and development costs or productivity. There is a problem.

또한, 새로운 기술이 창조되었을 때 새로운 분류의 추가 생성도 중요하지만, 새로운 분류 생성 전까지 임시적으로 과거의 분류체계와의 연계를 필요로 하는 경우 실시간적으로 대응하기 어렵다는 문제점이 있다.In addition, it is important to create additional new classifications when a new technology is created, but there is a problem that it is difficult to respond in real time when a temporary connection with the past classification system is required until a new classification is created.

수많은 기술 - 제품 - 산업 간의 분류체계 연계에 관련된 연구가 있어왔으나, 계속해서 각자의 분류체계가 독자적으로 업데이트 및 갱신됨에 따라, 실시간적으로 적용하고, 실질적으로 이용할 수 있는 분류체계가 부재하다.There have been numerous studies on linking classification systems between technologies, products, and industries, but as each classification system continues to be independently updated and updated, there is no classification system that can be applied in real time and practically used.

한국등록특허 제10-2371359호 (2022.03.02 등록)Korean Patent No. 10-2371359 (registered on 2022.03.02) 한국등록특허 제10-2277787호 (2021.07.09 등록)Korean Patent No. 10-2277787 (registered on 2021.07.09) 한국공개특허 제10-2020-0177506호 (2020.12.17 출원)Korean Patent Publication No. 10-2020-0177506 (filed on December 17, 2020) 한국공개특허 제10-2020-0182207호 (2020.12.23 출원)Korean Patent Publication No. 10-2020-0182207 (filed on December 23, 2020)

본 개시가 몇몇 실시예들에서 해결하고자 하는 기술적 과제는, 인공지능 기반의 자동화된 분석정보 지원체제를 구축하는 것이다.The technical problem that this disclosure aims to solve in some embodiments is to build an automated analysis information support system based on artificial intelligence.

본 개시가 몇몇 실시예들에서 해결하고자 하는 기술적 과제는, 다른 관점의 분류체계를 따라 작성된 정보를 연계하는 방법을 제공하는 것이다.The technical problem that the present disclosure aims to solve in some embodiments is to provide a method of linking information prepared according to classification systems from different perspectives.

본 개시가 몇몇 실시예들에서 해결하고자 하는 기술적 과제는, 상이한 분류체계를 동적으로 연계하는 방법을 제공하는 것이다.The technical problem that the present disclosure seeks to solve in some embodiments is to provide a method for dynamically linking different classification systems.

본 개시가 몇몇 실시예들에서 해결하고자 하는 기술적 과제는, 신기술의 분류 코드를 자동으로 기존의 분류체계에 포함되는 분류 코드에 연계하는 방법을 제공하는 것이다.The technical problem that the present disclosure seeks to solve in some embodiments is to provide a method of automatically linking classification codes of new technologies to classification codes included in an existing classification system.

본 개시의 기술적 과제들은 이상에서 언급한 기술적 과제들로 제한되지 않으며, 언급되지 않은 또 다른 기술적 과제들은 아래의 기재로부터 본 개시의 기술분야에서의 통상의 기술자에게 명확하게 이해될 수 있을 것이다.The technical problems of the present disclosure are not limited to the technical problems mentioned above, and other technical problems not mentioned can be clearly understood by those skilled in the art from the description below.

상기 기술적 과제를 해결하기 위한 일 실시예에 따른 분류체계의 동적 연계 시스템을 제공하는 방법은, 제1 분류 체계 상의 제1 분류 코드의 제1 설명문(description)을 획득하는 단계와, 제2 분류 체계 상의 제2 분류 코드의 제2 설명문을 획득하는 단계와, 상기 제1 설명문을 표현하는 입력 피처를 벡터 추출 모델에 입력하고, 상기 벡터 추출 모델에서 출력된 제1 벡터를 획득하는 단계와, 상기 제2 설명문을 표현하는 입력 피처를 상기 벡터 추출 모델에 입력하고, 상기 벡터 추출 모델에서 출력된 제2 벡터를 획득하는 단계와, 상기 제1 벡터와 상기 제2 벡터 사이의 특징 공간 상의 거리를 기반으로, 상기 제1 분류 코드와 상기 제2 분류 코드의 대응 여부를 추론하는 단계를 포함하되, 상기 벡터 추출 모델은, 상기 제1 분류 체계와 상기 제2 분류 체계의 각 분류 코드의 대응표를 이용하여 생성된 학습 데이터를 이용하여 기계 학습된 것일 수 있다.A method of providing a dynamic linking system of classification systems according to an embodiment for solving the above technical problem includes obtaining a first description of a first classification code in a first classification system, and a second classification system. obtaining a second description of the second classification code of the above, inputting an input feature representing the first description into a vector extraction model, and obtaining a first vector output from the vector extraction model; 2 Inputting an input feature representing a description into the vector extraction model and obtaining a second vector output from the vector extraction model, based on the distance in the feature space between the first vector and the second vector , Including the step of inferring whether the first classification code and the second classification code correspond, wherein the vector extraction model is generated using a correspondence table of each classification code of the first classification system and the second classification system. It may be machine learned using trained learning data.

일 실시예에서, 상기 제1 벡터와 상기 제2 벡터 사이의 특징 공간 상의 거리를 기반으로, 상기 제1 분류 코드와 상기 제2 분류 코드의 대응 여부를 추론하는 단계는, 특징 공간 상에서 상기 제1 벡터와의 거리가 기 정의된 거리 이하인 벡터에 대응되는 분류 코드를 제2 분류 코드로 지정하는 단계를 포함할 수 있다.In one embodiment, the step of inferring whether the first classification code corresponds to the second classification code based on the distance in the feature space between the first vector and the second vector includes: It may include the step of designating a classification code corresponding to a vector whose distance from the vector is less than or equal to a predefined distance as the second classification code.

일 실시예에서, 상기 특징 공간 상에서 상기 제1 벡터와의 거리가 기 정의된 거리 이하인 벡터에 대응되는 분류 코드를 제2 분류 코드로 지정하는 단계는, 사용자로부터 분류체계의 폭에 반영할 벡터 간 거리의 기준을 입력받는 단계를 포함할 수 있다.In one embodiment, the step of designating as a second classification code a classification code corresponding to a vector whose distance from the first vector in the feature space is less than or equal to a predefined distance is performed between the vectors to be reflected in the width of the classification system from the user. It may include a step of inputting a distance standard.

상기 기술적 과제를 해결하기 위한 일 실시예에 따른 이종 분류 체계 상의 분류 코드 매칭을 위한 기계 학습 방법은, 제1 분류 체계와 상기 제2 분류 체계의 각 분류 코드의 대응표를 분석하여, 서로 대응하는 제1 분류 체계 상의 제1 분류 코드 및 제2 분류 체계 상의 제2 분류 코드의 순서쌍의 세트를 획득하는 단계와, 상기 세트에 포함된 각각의 순서쌍을 이용하여 학습 데이터를 획득하는 단계와, 상기 학습 데이터의 제1 분류 코드의 설명문을 벡터 추출 모델에 입력하고, 상기 벡터 추출 모델에서 출력된 제1 벡터를 획득하는 단계와, 상기 학습 데이터의 제2 분류 코드의 설명문을 상기 벡터 추출 모델에 입력하고, 상기 벡터 추출 모델에서 출력된 제2 벡터를 획득하는 단계와, 상기 제1 벡터와 상기 제2 벡터의 특징 공간 상의 거리를 기반으로 하는 손실 함수(loss function)를 이용한 오류 역전파(back-propagation)를 이용하여 상기 벡터 추출 모델의 가중치를 업데이트하는 단계와, 상기 학습 데이터가 소진될 때까지, 각각의 학습 데이터에 대하여 상기 제1 벡터를 획득하는 단계, 상기 제2 벡터를 획득하는 단계 및 상기 가중치를 업데이트하는 단계를 반복하는 단계를 포함할 수 있다.A machine learning method for matching classification codes in heterogeneous classification systems according to an embodiment to solve the above technical problem analyzes the correspondence table of each classification code of the first classification system and the second classification system, and provides 1 Obtaining a set of ordered pairs of a first classification code on a classification system and a second classification code on a second classification system, obtaining learning data using each ordered pair included in the set, and the learning data Inputting the description of the first classification code of into the vector extraction model, obtaining the first vector output from the vector extraction model, inputting the description of the second classification code of the learning data into the vector extraction model, Obtaining a second vector output from the vector extraction model, and error back-propagation using a loss function based on the distance in the feature space of the first vector and the second vector. Updating the weights of the vector extraction model using, obtaining the first vector for each training data until the training data is exhausted, obtaining the second vector, and the weight It may include repeating the step of updating.

상기 기술적 과제를 해결하기 위한 일 실시예에 따른 분류체계의 동적 연계 시스템을 제공하는 방법은, 제1 기술의 관련 데이터를 획득하되, 상기 제1 기술의 관련 데이터는 제1 분류 체계 상의 제1 분류 코드를 포함하는 단계와, 상기 제1 분류 코드의 제1 설명문(DESCRIPTION)을 표현하는 입력 피처를 제1 인공지능 모델에 입력하고, 상기 제1 인공지능 모델에서 출력된 출력 데이터를 이용하여 상기 제1 분류 코드의 분석 데이터를 획득하되, 상기 제1 분류 코드의 분석 데이터는 설명문의 문맥을 반영하는 단계와, 기 제1 분류 코드 분석 데이터를 표현하는 입력 피처를 제2 인공지능 모델에 입력하고, 상기 제2 인공지능 모델에서 출력된 제1 벡터를 획득하는 단계와, 제2 분류 체계 상의 제2 분류 코드를 제1 분류 코드에 연계하는 단계를 포함하되, 상기 제2 인공지능 모델은 특징 공간 상의 거리를 기반으로, 상기 제1 분류 코드의 분석 데이터와 상기 제2 분류 코드의 분석 데이터의 대응 여부를 추론하고, 상기 제2 인공지능 모델은, 상기 제1 분류 체계와 상기 제2 분류 체계의 각 분류 코드의 대응표를 이용하여 생성된 학습 데이터를 이용하여 기계 학습된 것일 수 있다.A method of providing a dynamic linkage system of a classification system according to an embodiment for solving the above technical problem includes obtaining related data of a first technology, wherein the related data of the first technology is classified into a first classification on the first classification system. Including a code, inputting an input feature representing a first description (DESCRIPTION) of the first classification code into a first artificial intelligence model, and using output data output from the first artificial intelligence model 1. Obtaining analysis data of a classification code, wherein the analysis data of the first classification code reflects the context of the explanatory text, inputting an input feature representing the first classification code analysis data into a second artificial intelligence model, Obtaining a first vector output from the second artificial intelligence model, and linking a second classification code on the second classification system to the first classification code, wherein the second artificial intelligence model is Based on the distance, it is inferred whether the analysis data of the first classification code corresponds to the analysis data of the second classification code, and the second artificial intelligence model is used for each of the first classification system and the second classification system. It may be machine learned using learning data generated using a correspondence table of classification codes.

일 실시예에서, 상기 제1 분류 체계 상의 제1 분류 코드의 제1 설명문을 획득하는 단계는, 상기 제1 기술이 신기술일 경우, 상기 제1 기술에 대한 설명문을 표현하는 입력 피처를 언어 모델에 입력하고, 상기 언어 모델에서 출력된 제1 벡터를 획득하는 단계와, 상기 제1 벡터와 특징 공간 상 가장 가까운 벡터에 대응되는 기술의 분류 코드를 제1 분류 코드로 지정하는 단계를 포함하되, 상기 언어모델은, 상기 제1 기술의 전문 분야 데이터와 상기 제1 기술의 분류 코드를 이용하여 생성된 학습 데이터를 이용하여 기계 학습된 것 일 수 있다.In one embodiment, the step of obtaining the first description of the first classification code in the first classification system includes, when the first technology is a new technology, inputting an input feature representing the description for the first technology to a language model. Inputting and obtaining a first vector output from the language model, and specifying a classification code of the technology corresponding to the closest vector in the feature space to the first vector as the first classification code, The language model may be machine-learned using training data generated using specialty data of the first technology and classification codes of the first technology.

상기 기술적 과제를 해결하기 위한 본 개시의 일 실시예에 따른 컴퓨팅 시스템은, 하나 이상의 프로세서와, 상기 프로세서에 의하여 수행되는 컴퓨터 프로그램을 로드(load)하는 메모리와, 상기 컴퓨터 프로그램을 저장하는 스토리지를 포함할 수 있다. 상기 컴퓨터 프로그램은, 제1 분류 체계 상의 제1 분류 코드의 제1 설명문(description)을 획득하는 인스트럭션과, 제2 분류 체계 상의 제2 분류 코드의 제2 설명문을 획득하는 인스트럭션과, 상기 제1 설명문을 표현하는 입력 피처를 벡터 추출 모델에 입력하고, 상기 벡터 추출 모델에서 출력된 제1 벡터를 획득하는 인스트럭션과, 상기 제2 설명문을 표현하는 입력 피처를 상기 벡터 추출 모델에 입력하고, 상기 벡터 추출 모델에서 출력된 제2 벡터를 획득하는 인스트럭션과, 상기 제1 벡터와 상기 제2 벡터 사이의 특징 공간 상의 거리를 기반으로, 상기 제1 분류 코드와 상기 제2 분류 코드의 대응 여부를 추론하는 인스트럭션을 포함하되, 상기 벡터 추출 모델은, 상기 제1 분류 체계와 상기 제2 분류 체계의 각 분류 코드의 대응표를 이용하여 생성된 학습 데이터를 이용하여 기계 학습된 것일 수 있다.A computing system according to an embodiment of the present disclosure for solving the technical problem includes one or more processors, a memory for loading a computer program performed by the processor, and storage for storing the computer program. can do. The computer program includes instructions for obtaining a first description of a first classification code on a first classification system, instructions for obtaining a second description of a second classification code on a second classification system, and the first description. Inputting an input feature representing an instruction to a vector extraction model, obtaining a first vector output from the vector extraction model, inputting an input feature representing the second description into the vector extraction model, and extracting the vector. An instruction for obtaining a second vector output from a model, and an instruction for inferring whether the first classification code corresponds to the second classification code based on the distance in the feature space between the first vector and the second vector. Including, the vector extraction model may be machine-learned using learning data generated using the correspondence table of each classification code of the first classification system and the second classification system.

상기 기술적 과제를 해결하기 위한 본 개시의 일 실시예에 따른 컴퓨팅 시스템은, 하나 이상의 프로세서와, 상기 프로세서에 의하여 수행되는 컴퓨터 프로그램을 로드(load)하는 메모리와, 상기 컴퓨터 프로그램을 저장하는 스토리지를 포함할 수 있다. 상기 컴퓨터 프로그램은, 제1 분류 체계와 상기 제2 분류 체계의 각 분류 코드의 대응표를 분석하여, 서로 대응하는 제1 분류 체계 상의 제1 분류 코드 및 제2 분류 체계 상의 제2 분류 코드의 순서쌍의 세트를 획득하는 인스트럭션과, 상기 세트에 포함된 각각의 순서쌍을 이용하여 학습 데이터를 획득하는 인스트럭션과, 상기 학습 데이터의 제1 분류 코드의 설명문을 벡터 추출 모델에 입력하고, 상기 벡터 추출 모델에서 출력된 제1 벡터를 획득하는 인스트럭션과, 상기 학습 데이터의 제2 분류 코드의 설명문을 상기 벡터 추출 모델에 입력하고, 상기 벡터 추출 모델에서 출력된 제2 벡터를 획득하는 인스트럭션과, 상기 제1 벡터와 상기 제2 벡터의 특징 공간 상의 거리를 기반으로 하는 손실 함수(loss function)를 이용한 오류 역전파(back-propagation)를 이용하여 상기 벡터 추출 모델의 가중치를 업데이트하는 인스트럭션과, 상기 학습 데이터가 소진될 때까지, 각각의 학습 데이터에 대하여 상기 제1 벡터를 획득하는 단계, 상기 제2 벡터를 획득하는 단계 및 상기 가중치를 업데이트하는 단계를 반복하는 인스트럭션을 포함할 수 있다.A computing system according to an embodiment of the present disclosure for solving the technical problem includes one or more processors, a memory for loading a computer program performed by the processor, and storage for storing the computer program. can do. The computer program analyzes the correspondence table of each classification code of the first classification system and the second classification system, and creates an ordered pair of the first classification code on the first classification system and the second classification code on the second classification system that corresponds to each other. Input an instruction for acquiring a set, an instruction for acquiring learning data using each ordered pair included in the set, and a description of the first classification code of the learning data into a vector extraction model, and output from the vector extraction model. an instruction for obtaining a first vector, inputting a description of a second classification code of the learning data into the vector extraction model, and obtaining a second vector output from the vector extraction model, and the first vector and An instruction for updating the weights of the vector extraction model using error back-propagation using a loss function based on the distance in the feature space of the second vector, and an instruction for updating the weights of the vector extraction model when the learning data is exhausted. It may include instructions for repeating the steps of acquiring the first vector, acquiring the second vector, and updating the weight for each training data until.

상기 기술적 과제를 해결하기 위한 본 개시의 일 실시예에 따른 컴퓨팅 시스템은, 하나 이상의 프로세서와, 상기 프로세서에 의하여 수행되는 컴퓨터 프로그램을 로드(load)하는 메모리와, 상기 컴퓨터 프로그램을 저장하는 스토리지를 포함할 수 있다. 상기 컴퓨터 프로그램은, 제1 기술의 관련 데이터를 획득하되, 상기 제1 기술의 관련 데이터는 제1 분류 체계 상의 제1 분류 코드를 포함하는 인스트럭션과, 상기 제1 분류 코드의 제1 설명문(DESCRIPTION)을 표현하는 입력 피처를 제1 인공지능 모델에 입력하고, 상기 제1 인공지능 모델에서 출력된 출력 데이터를 이용하여 상기 제1 분류 코드의 분석 데이터를 획득하되, 상기 제1 분류 코드의 분석 데이터는 설명문의 문맥을 반영하는 인스트럭션과, 상기 제1 분류 코드 분석 데이터를 표현하는 입력 피처를 제2 인공지능 모델에 입력하여 제2 분류 체계 상의 제2 분류 코드를 제1 분류 코드에 연계하는 인스트럭션을 포함하되, 상기 제2 인공지능 모델은 분류 코드 간의 유사도가 기 정의된 기준 이상인 분류체계를 연결해주는 것일 수 있다.A computing system according to an embodiment of the present disclosure for solving the technical problem includes one or more processors, a memory for loading a computer program performed by the processor, and storage for storing the computer program. can do. The computer program acquires relevant data of a first technology, wherein the relevant data of the first technology includes instructions including a first classification code on a first classification system, and a first description of the first classification code. An input feature representing an input feature is input to a first artificial intelligence model, and analysis data of the first classification code is obtained using output data output from the first artificial intelligence model, wherein the analysis data of the first classification code is Includes instructions that reflect the context of the explanation and input features representing the first classification code analysis data into a second artificial intelligence model to link the second classification code in the second classification system to the first classification code. However, the second artificial intelligence model may connect classification systems in which the similarity between classification codes is higher than a predefined standard.

도 1은 본 개시의 일 실시예에 따른 분류체계의 동적 연계 시스템이 적용될 수 있는 예시적인 환경을 도시한다.
도 2 내지 3은 본 개시의 일 실시예에 따른, 변화 신호의 성장성 예측 방법의 순서도이다.
도 4는 본 개시의 다른 실시예에 따른, 변화 신호의 성장성 예측 방법의 순서도이다.
도 5는 도 4에 도시된 제1 분류 코드 분석 데이터 획득 단계를 보다 자세하게 설명하기 위한 순서도이다.
도 6 내지 도 8은 본 개시의 몇몇 실시예를 설명하기 위한 분류체계표와 연계표이다.
도 11은 본 개시의 몇몇 실시예에서 구성 요소로서 이용되는 분류체계의 동적 연계 시스템의 하드웨어 구성도이다.1 shows an example environment in which a dynamic linking system of classification systems according to an embodiment of the present disclosure can be applied.
2 and 3 are flowcharts of a method for predicting growth of a change signal, according to an embodiment of the present disclosure.
Figure 4 is a flowchart of a method for predicting growth of a change signal according to another embodiment of the present disclosure.
FIG. 5 is a flow chart to explain in more detail the first classification code analysis data acquisition step shown in FIG. 4.
Figures 6 to 8 are a classification system table and a linkage table for explaining some embodiments of the present disclosure.
11 is a hardware configuration diagram of a dynamic linkage system of a classification system used as a component in some embodiments of the present disclosure.

이하, 첨부된 도면을 참조하여 본 개시의 다양한 실시예들을 상세히 설명한다. 본 개시의 이점 및 특징, 그리고 그것들을 달성하는 방법은 첨부되는 도면과 함께 상세하게 후술되어 있는 실시예들을 참조하면 명확해질 것이다. 그러나, 본 개시의 기술적 사상은 이하의 실시예들에 한정되는 것이 아니라 서로 다른 다양한 형태로 구현될 수 있으며, 단지 이하의 실시예들은 본 개시의 기술적 사상을 완전하도록 하고, 본 개시가 속한 기술분야에서 통상의 지식을 가진 자에게 본 개시의 범주를 완전하게 알려주기 위해 제공되는 것이며, 본 개시의 기술적 사상은 청구항의 범주에 의해 정의될 뿐이다.Hereinafter, various embodiments of the present disclosure will be described in detail with reference to the attached drawings. The advantages and features of the present disclosure and methods for achieving them will become clear by referring to the embodiments described in detail below along with the accompanying drawings. However, the technical idea of the present disclosure is not limited to the following embodiments and may be implemented in various different forms. The following examples are merely intended to complete the technical idea of the present disclosure and to cover the technical field to which the present disclosure belongs. is provided to fully inform those skilled in the art of the scope of the present disclosure, and the technical idea of the present disclosure is only defined by the scope of the claims.

본 개시의 다양한 실시예들을 설명함에 있어, 관련된 공지 구성 또는 기능에 대한 구체적인 설명이 본 개시의 요지를 흐릴 수 있다고 판단되는 경우에는 그 상세한 설명은 생략한다.In describing various embodiments of the present disclosure, if it is determined that a detailed description of a related known configuration or function may obscure the gist of the present disclosure, the detailed description will be omitted.

다른 정의가 없다면, 이하의 실시예들에서 사용되는 용어(기술 및 과학적 용어를 포함)는 본 개시가 속한 기술분야에서 통상의 지식을 가진 자에게 공통적으로 이해될 수 있는 의미로 사용될 수 있으나, 이는 관련 분야에 종사하는 기술자의 의도 또는 판례, 새로운 기술의 출현 등에 따라 달라질 수도 있다. 본 개시에서 사용된 용어는 실시예들을 설명하기 위한 것이며 본 개시의 범주를 제한하고자 하는 것은 아니다.Unless otherwise defined, terms (including technical and scientific terms) used in the following embodiments may be used in a meaning that can be commonly understood by those skilled in the art in the technical field to which this disclosure pertains. It may vary depending on the intentions or precedents of engineers working in related fields, the emergence of new technologies, etc. The terminology used in this disclosure is for describing embodiments and is not intended to limit the scope of this disclosure.

이하의 실시예들에서 사용되는 단수의 표현은 문맥상 명백하게 단수인 것으로 특정되지 않는 한, 복수의 개념을 포함한다. 또한, 복수의 표현은 문맥상 명백하게 복수인 것으로 특정되지 않는 한, 단수의 개념을 포함한다.The singular expressions used in the following embodiments include plural concepts, unless the context clearly specifies singularity. Additionally, plural expressions include singular concepts, unless the context clearly specifies plurality.

또한, 이하의 실시예들에서 사용되는 제1, 제2, A, B, (a), (b) 등의 용어는 어떤 구성요소를 다른 구성요소와 구별하기 위해 사용되는 것일 뿐, 그 용어에 의해 해당 구성요소의 본질이나 차례 또는 순서 등이 한정되지는 않는다.In addition, terms such as first, second, A, B, (a), (b) used in the following embodiments are only used to distinguish one component from another component, and the terms The nature, sequence, or order of the relevant components are not limited.

이하, 첨부된 도면들을 참조하여 본 개시의 다양한 실시예들에 대하여 상세하게 설명한다.Hereinafter, various embodiments of the present disclosure will be described in detail with reference to the attached drawings.

도 1은 본 개시의 일 실시예에 따른 분류체계의 동적 연계 시스템이 적용될 수 있는 예시적인 환경을 도시한다. 도 1에 도시된 바와 같이 본 실시예에 따른 분류체계 연계 시스템(100)은 분류체계 DB(200) 및 사용자 장치(300)을 포함하도록 구성될 수 있다.1 illustrates an example environment in which a dynamic linkage system of classification systems according to an embodiment of the present disclosure can be applied. As shown in FIG. 1, the classification system linkage system 100 according to this embodiment may be configured to include a classification system DB 200 and a user device 300.

특히, 도 1에는 분류체계 DB에서 분류체계 데이터를 획득하는 경우가 예로써 도시되었다. 단, 도 1에 도시된 구성은 본 발명의 목적을 달성하기 위한 바람직한 실시예일 뿐이며, 필요에 따라 일부 구성 요소가 추가되거나 삭제될 수 있음은 물론이다. 또한, 도 1에 도시된 분류체계의 동적 연계 시스템의 각각의 구성 요소들은 기능적으로 구분되는 기능 요소들을 나타낸 것으로서, 적어도 하나의 구성 요소가 실제 물리적 환경에서는 서로 통합되는 형태로 구현될 수도 있음에 유의한다. 이하, 상기 분류체계의 동적 연계 시스템의 각 구성 요소에 대하여 설명한다.In particular, Figure 1 shows an example of obtaining classification system data from a classification system DB. However, the configuration shown in FIG. 1 is only a preferred embodiment for achieving the purpose of the present invention, and of course, some components may be added or deleted as needed. In addition, note that each component of the dynamic linkage system of the classification system shown in FIG. 1 represents functionally distinct functional elements, and at least one component may be implemented in an integrated form in the actual physical environment. do. Hereinafter, each component of the dynamic linkage system of the classification system will be described.

상기 분류체계의 동적 연계 시스템에서, 분류체계 연계 시스템(100)은 분류체계 DB(200)에서 획득된 분류체계에 대한 분석을 수행하고, 분석 정보를 학습하는 컴퓨팅 시스템이다.In the dynamic linking system of the classification system, the classification system linking system 100 is a computing system that performs analysis on the classification system obtained from the classification system DB 200 and learns the analysis information.

상기 분류체계의 동적 연계 시스템에서, 분류체계 DB(200)는 다른 관점으로 분류된 하나 이상의 분류체계를 포함할 수 있다. HS분류, KSIC분류, IPC분류, CPC분류가 분류체계 DB(200)에 포함되는 분류체계일 수 있다. HS분류는 대외 무역거래에서 거래 상품의 종류를 숫자 코드로 분류하는 관점으로 분류된 분류체계이다. KSIC분류는 한국에서 산업을 분류하는 관점으로 분류된 분류체계이다. IPC분류는 국제 특허 분류 관점으로 분류된 분류체계이다. CPC 분류는 협력적 특허분류 관점으로 분류된 분류체계이다.In the dynamic linkage system of the classification system, the classification system DB 200 may include one or more classification systems classified from different perspectives. HS classification, KSIC classification, IPC classification, and CPC classification may be classification systems included in the classification system DB 200. HS classification is a classification system that classifies the types of traded products in foreign trade transactions using numerical codes. KSIC classification is a classification system classified from the perspective of classifying industries in Korea. IPC classification is a classification system classified from the perspective of international patent classification. CPC classification is a classification system classified from a cooperative patent classification perspective.

분류체계 DB(200)는 하나 이상의 분류체계 연계표를 포함할 수 있다. 산업품목-특허 연계표가 분류체계 DB(200)에 포함되는 분류체계 연계표일 수 있다. 예시적으로 분류체계 DB(200)에 포함될 수 있는 분류체계와 분류체계 연계표는 도 6 내지 8에 도시되어 있다. 단, 도 6 내지 8에 도시된 실시예는 본 발명의 목적을 달성하기 위한 바람직한 실시예일 뿐이며, 필요에 따라 다른 관점으로 분류된 하나 이상의 분류체계와 분류체계 연계표가 추가되거나 삭제될 수 있음은 물론이다.The classification system DB 200 may include one or more classification system linkage tables. The industrial item-patent linkage table may be a classification system linkage table included in the classification system DB (200). Exemplarily, classification systems that can be included in the classification system DB 200 and classification system linkage tables are shown in FIGS. 6 to 8. However, the embodiments shown in Figures 6 to 8 are only preferred embodiments to achieve the purpose of the present invention, and one or more classification systems and classification system linkage tables classified from different perspectives may be added or deleted as needed. am.

상기 분류체계의 동적 연계 시스템에서, 사용자 장치(300)는 분류체계 연계 정보를 제공받기 위해 사용자가 이용하는 단말이다. In the dynamic linkage system of the classification system, the user device 300 is a terminal used by the user to receive classification system linkage information.

도 1에 도시된 분류체계의 동적 연계 시스템의 각 구성 요소는 네트워크를 통해 통신할 수 있다. 여기서, 상기 네트워크는 근거리 통신망(Local Area Network; LAN), 광역통신망(Wide Area Network; WAN), 이동 통신망(mobile radio communication network), Wibro(Wireless Broadband Internet)등과 같은 모든 종류의 유/무선 네트워크로 구현될 수 있다.Each component of the dynamic linkage system of the classification system shown in Figure 1 can communicate through a network. Here, the network is all types of wired/wireless networks such as Local Area Network (LAN), Wide Area Network (WAN), mobile radio communication network, Wibro (Wireless Broadband Internet), etc. It can be implemented.

지금까지 도 1을 참조하여 본 발명의 일 실시예에 따른 분류체계의 동적 연계 시스템에 대하여 설명하였다. 다음으로, 도 2 내지 도 5를 참조하여 본 개시의 몇몇 실시예에 따른, 분류체계의 동적 연계 방법을 설명한다. So far, the dynamic linking system of the classification system according to an embodiment of the present invention has been described with reference to FIG. 1. Next, a dynamic linkage method of classification systems according to some embodiments of the present disclosure will be described with reference to FIGS. 2 to 5.

도 2 내지 도 3은 본 실시예에 따른, 분류체계의 동적 연계 방법의 순서도이다. 단, 이는 본 개시의 목적을 달성하기 위한 바람직한 실시예일 뿐이며, 필요에 따라 일부 단계가 추가되거나 삭제될 수 있음은 물론이다.2 to 3 are flowcharts of a method for dynamic linking of classification systems according to this embodiment. However, this is only a preferred embodiment for achieving the purpose of the present disclosure, and of course, some steps may be added or deleted as needed.

도 2를 참조하면, 분류체계 연계 시스템은 분류 코드를 획득(S100)할 수 있다. 상기 분류 코드는 분류체계 DB(200)에서 수신된 데이터일 수 있다. 상기 분류 코드는 대상 기술을 하나의 관점에서 분류한 하나의 분류체계에 포함되는 하나의 갈래이다. 예를 들어, 도 6을 참고하여 설명하면, 한국표준산업의 관점에서 기술을 분류한 한국표준산업분류(KSIC) 분류체계에 포함되는 A01. 농업이 하나의 분류 코드이다. 또, 도 7을 참고하여 설명하면, 국제 특허 분류의 관점에서 기술을 분류한 국제특허분류(IPC) 분류체계에 포함되는 A01. 농업이 하나의 분류 코드이다.Referring to FIG. 2, the classification system linkage system can acquire a classification code (S100). The classification code may be data received from the classification system DB 200. The classification code is one branch included in one classification system that classifies the target technology from one perspective. For example, if explained with reference to FIG. 6, A01 included in the Korean Standard Industrial Classification (KSIC) classification system, which classifies technologies from the perspective of the Korean standard industry. Agriculture is one classification code. In addition, if explained with reference to FIG. 7, A01 is included in the International Patent Classification (IPC) classification system that classifies technologies from the perspective of international patent classification. Agriculture is one classification code.

다음으로 단계 S200에서, 분류체계 연계 시스템은 획득한 분류 코드의 설명문을 획득할 수 있다. 상기 분류 코드의 설명문은 분류체계 DB(200)에서 수신된 데이터일 수 있다. 상기 분류 코드의 설명문은 기존의 분류체계에 포함되어 있는 코드에 관한 설명이다. 상기 분류 코드의 설명문은 단어와, 단어 간의 관계를 의미하는 문맥과, 문장 간의 관계를 의미하는 문맥을 특징으로 구성될 수 있다.Next, in step S200, the classification system linking system may obtain a description of the obtained classification code. The description of the classification code may be data received from the classification system DB 200. The description of the classification code is an explanation of the code included in the existing classification system. The description of the classification code may be comprised of words, context meaning the relationship between words, and context meaning the relationship between sentences.

다음으로 분류체계 연계 시스템은, 단계 S300에서, 상기 분류 코드의 설명문의 입력 피처를 벡터 추출 모델에 입력하고, 단계 S400에서, 상기 벡터 추출 모델에서 분류 코드의 벡터를 획득할 수 있다. 상기 분류 코드의 설명문의 입력 피처는 상기 분류 코드의 설명문의 특징인 단어와, 단어 간의 관계를 의미하는 문맥과, 문장 간의 관계를 의미하는 문맥을 반영하는 형식으로 구성될 수 있다.Next, the classification system linkage system may input the input features of the description of the classification code into the vector extraction model in step S300 and obtain the vector of the classification code from the vector extraction model in step S400. The input features of the description of the classification code may be configured in a format that reflects words that are characteristic of the description of the classification code, context meaning the relationship between words, and context meaning the relationship between sentences.

상기 벡터 추출 모델은 분류체계 DB(200)에 포함된 분류 코드의 설명문과 분류체계 연계표를 이용하여 생성된 학습 데이터를 이용하여 기계 학습된 것일 수 있다. 상기 벡터 추출 모델은 분류체계 연계표에 포함된 각 분류 코드의 대응표를 분석하고, 서로 대응되는 제1 분류 체계 상의 제1 분류 코드 및 제2 분류 체계 상의 제2 분류 코드의 순서쌍 세트를 이용하여 생성된 학습 데이터를 이용하여 기계 학습된 것일 수 있다. The vector extraction model may be machine-learned using learning data generated using a description of the classification code included in the classification system DB 200 and a classification system linkage table. The vector extraction model analyzes the correspondence table of each classification code included in the classification system linkage table, and is generated using a set of ordered pairs of the first classification code in the first classification system and the second classification code in the second classification system that correspond to each other. It may be machine learned using training data.

일 실시예에서, 상기 분류 코드의 설명문과 분류체계 연계표를 이용하여 생성된 학습 데이터의 제1 분류 코드의 설명문을 벡터 추출 모델에 입력하면, 상기 벡터 추출 모델에서 제1 벡터가 출력될 수 있다. 상기 분류 코드의 설명문과 분류체계 연계표를 이용하여 생성된 학습 데이터의 제2 분류 코드의 설명문을 벡터 추출 모델에 입력하면, 상기 벡터 추출 모델에서 제2 벡터가 출력될 수 있다.In one embodiment, when the description of the classification code and the description of the first classification code of the learning data generated using the classification system linkage table are input to the vector extraction model, the first vector may be output from the vector extraction model. When the description of the classification code and the description of the second classification code of the learning data generated using the classification system linkage table are input to the vector extraction model, a second vector may be output from the vector extraction model.

분류체계 연계 시스템은 상기 출력된 제1 벡터와 상기 출력된 제2 벡터의 특징 공간 상의 거리를 기반으로 하는 손실 함수(loss function)를 이용한 오류 역전파(back-propagation)를 이용하여 상기 벡터 추출 모델의 가중치를 업데이트할 수 있다. 상기 손실 함수(loss function)를 이용한 오류 역전파(back-propagation)를 이용하여 학습된 벡터 추출 모델은 타깃과 예측값의 오차를 최소화하는 방향으로 모델의 파라미터를 조정해 가장 알맞은 모델을 구축할 수 있다.The classification system linkage system extracts the vector using error back-propagation using a loss function based on the distance in the feature space of the output first vector and the output second vector. The weights of can be updated. The vector extraction model learned using error back-propagation using the loss function can build the most appropriate model by adjusting the model parameters in the direction of minimizing the error between the target and predicted values. .

산업분류(KSIC)체계와 특허분류(IPC)체계를 연계하는 분류체계 연계표인 도 8을 참고하여 설명하면, 제1 분류 코드인 KSIC 분류체계의 B05를 제1 분류 코드로 지정할 시 석탄, 원유 및 천연가스 광업을 아우르는 분류 코드에 대응되는 설명문이 제1 분류 코드의 설명문으로 지정되고, 제1 분류 코드에 연계되는 IPC 분류체계의 E21D가 제2 분류 코드로 지정되고, 채광, 채석을 아우르는 분류 코드에 대응되는 설명문이 제2 분류 코드의 설명문으로 지정될 수 있다. 제1 분류 코드인 B05와 B05의 설명문을 표현하는 입력 피처를 벡터 추출 모델에 입력하고, 제2 분류 코드인 E21D와 E21D의 설명문을 표현하는 입력 피처를 벡터 추출 모델에 입력하면, 상기 제1 분류 코드와 제2 분류 코드가 분류체계 연계표 상에서 연계되는 분류 코드이므로 제1 분류 코드를 벡터 추출 모델에 입력할 시 추출되는 제1 벡터와 제2 분류 코드를 벡터 추출 모델에 입력할 시 추출되는 제2 백터의 특징 공간 상의 거리를 줄이는 가중치가 업데이트될 수 있다.If explained with reference to Figure 8, which is a classification system linkage table linking the Industrial Classification (KSIC) system and the Patent Classification (IPC) system, when B05 of the KSIC classification system, which is the first classification code, is designated as the first classification code, coal, crude oil and The explanatory text corresponding to the classification code encompassing natural gas mining is designated as the explanatory text of the first classification code, E21D of the IPC classification system linked to the first classification code is designated as the second classification code, and the classification code encompassing mining and quarrying The description corresponding to may be designated as the description of the second classification code. When input features representing the descriptions of the first classification codes B05 and B05 are input into the vector extraction model, and input features representing the descriptions of the second classification codes E21D and E21D are input into the vector extraction model, the first classification Since the code and the second classification code are classification codes linked on the classification system linkage table, the first vector extracted when the first classification code is input into the vector extraction model and the second vector extracted when the second classification code is input into the vector extraction model Weights that reduce the distance in the vector's feature space may be updated.

분류체계 연계 시스템은 상기 학습 데이터가 소진될 때까지 가중치를 업데이트하는 단계를 반복하여 벡터 추출 모델을 학습시킬 수 있다. 분류체계 연계 시스템은 상기 벡터 추출 모델로 딥러닝 기술 중 하나인 트랜스포머 언어모델 기술들에서 공개하여 제공하고 있는 모델을 활용할 수 있다.The classification system linkage system can learn a vector extraction model by repeating the step of updating weights until the learning data is exhausted. The classification system linkage system can use the model provided publicly by Transformer language model technologies, one of the deep learning technologies, as the vector extraction model.

분류체계 연계 시스템은 도 2에서 설명한 과정에 따라 생성된 벡터 추출 모델에 분류 코드와 분류 코드의 설명문을 표현하는 입력 피처를 입력하여 분류 코드의 벡터를 획득할 수 있다(S510). 일 실시예에서, 상기 벡터 추출 모델에는 벡터 추출 모델의 학습 데이터를 생성하는 데 포함된 분류 코드와 분류 코드의 설명문을 표현하는 입력 피처가 입력될 수 있다. 일 실시예에서, 상기 벡터 추출 모델에는 벡터 추출 모델의 학습 데이터를 생성하는 데 포함되지 않은 분류 코드와 분류 코드의 설명문을 표현하는 입력 피처가 입력될 수 있다.The classification system linkage system can obtain a vector of the classification code by inputting input features representing the classification code and the description of the classification code into the vector extraction model generated according to the process described in FIG. 2 (S510). In one embodiment, input features representing a classification code and a description of the classification code included in generating training data of the vector extraction model may be input to the vector extraction model. In one embodiment, input features representing a classification code and a description of the classification code that are not included in generating training data of the vector extraction model may be input to the vector extraction model.

다음으로 단계 S520에서 분류체계 연계 시스템은 특징 공간 상의 벡터 간 거리를 기반으로 분류 코드 간의 유사도를 도출할 수 있다. 상기 벡터 추출 모델은 입력된 설명문을 표현하는 입력 피처에 대응되는 설명문이 유의한 분류 코드를 근접한 벡터로 도출하도록 학습된 벡터 모델이므로, 분류 코드간의 유사도는 분류 코드에 대응되는 벡터 모델간의 거리에 따라 도출될 수 있다. 분류 코드간의 유사도를 분류 코드에 대응되는 벡터 모델간의 거리에 따라 도출하는 방법은 도 9 내지 도 10을 참조하여 후술한다.Next, in step S520, the classification system linkage system can derive the similarity between classification codes based on the distance between vectors in the feature space. The vector extraction model is a vector model learned so that the description corresponding to the input feature representing the input description derives a meaningful classification code as a close vector, so the similarity between classification codes depends on the distance between the vector models corresponding to the classification code. can be derived. A method of deriving the similarity between classification codes according to the distance between vector models corresponding to the classification codes will be described later with reference to FIGS. 9 and 10.

다음으로, 분류체계 연계 시스템은 분류 코드 연계 폭에 반영할 기준을 사용자로부터 선택받을 수 있다(S530). 상기 분류 코드 연계 폭은 제1 분류 코드에 연계될 제2 분류 코드의 기준일 수 있다. 산업분류(KSIC)체계와 특허분류(IPC)체계를 연계하는 분류체계 연계표인 도 8을 참고하여 설명하면, 산업분류(KSIC)체계의 A01 분류 코드에 연계된 제2 분류 코드가 특허분류(IPC)체계의 A01B27/02, A01C, A01D, A01G, A01H, A01K인 경우, 특허분류(IPC)체계의 A01B27/02, A01C, A01D, A01G, A01H, A01K는 제1 분류 코드 연계 폭에 반영할 기준에 부합하는 분류 코드일 수 있다. 사용자는 사용자 장치(300)를 이용하여 분류체계 연계 시스템에 분류 코드 연계 폭에 반영할 기준 선택을 입력할 수 있다. 상기 사용자가 선택을 입력한 분류 코드 연계 폭에 반영할 기준은 분류 코드 간의 유사도를 단위로 할 수 있다.Next, the classification system linkage system can select the criteria to be reflected in the classification code linkage width from the user (S530). The classification code linkage width may be a standard for the second classification code to be linked to the first classification code. 8, which is a classification system linkage table linking the Industrial Classification (KSIC) system and the Patent Classification (IPC) system, the second classification code linked to the A01 classification code of the Industrial Classification (KSIC) system is the Patent Classification (IPC) system. ) In the case of A01B27/02, A01C, A01D, A01G, A01H, A01K of the system, A01B27/02, A01C, A01D, A01G, A01H, A01K of the patent classification (IPC) system is the standard to be reflected in the first classification code linkage width. It may be a classification code that matches. The user can use the user device 300 to input a selection of criteria to be reflected in the classification code linkage width into the classification system linkage system. The standard to be reflected in the linkage width of the classification code for which the user entered the selection may be the degree of similarity between classification codes.

일 실시예에서, 사용자는 제1 분류체계에 포함된 제1 분류 코드에 대해서 연계되는 제2 분류체계에 포함된 분류 코드를 유사도에 따라 여러 개 리스트업 하고, 리스트업 되어 있는 분류 코드 중 상위 n개 선택하여 연계되는 분류 코드로 지정할 수 있다.In one embodiment, the user lists several classification codes included in the second classification system linked to the first classification code included in the first classification system according to similarity, and selects the top n classification codes among the listed classification codes. You can select one and designate it with a related classification code.

일 실시예에서, 사용자는 제1 분류체계에 포함된 제1 분류 코드에 대해서 연계되는 제2 분류체계에 포함된 분류 코드를 유사도에 따라 여러 개 리스트업 하고, 리스트업 되어 있는 분류 코드 중 제1 분류 코드와의 유사도가 n% 이상인 분류 코드를 선택하여 연계되는 분류 코드로 지정할 수 있다. In one embodiment, the user lists several classification codes included in the second classification system linked to the first classification code included in the first classification system according to similarity, and selects the first classification code among the listed classification codes. You can select a classification code with a similarity of n% or more to the classification code and designate it as a linked classification code.

상기 제1 분류 코드와 제2 분류체계에 포함된 분류 코드의 유사도는 도 9 내지 도 10을 참조하여 후술한다.The similarity between the first classification code and the classification code included in the second classification system will be described later with reference to FIGS. 9 and 10.

일 실시예에서, 분류체계 연계 시스템은 사용자의 선택이 입력되지 않아도 기 정의된 분류 코드 연계 폭에 반영할 기준에 반영할 기준을 생성하고 반영할 수 있다.In one embodiment, the classification system linkage system can create and reflect criteria to be reflected in the criteria to be reflected in the predefined classification code linkage width even if the user's selection is not input.

다음으로 단계 S540에서, 분류체계 연계 시스템은 사용자에 의해 선택된 기준에 부합하는 분류 코드를 제2 분류 코드로 지정할 수 있다. 사용자에 의해 선택된 기준에 부합하는 분류 코드를 제2 분류 코드로 지정함으로써 용도에 맞게 분류체계의 폭을 선택하는 유연성을 제공할 수 있다.Next, in step S540, the classification system linking system may designate a classification code that meets the criteria selected by the user as the second classification code. By designating a classification code that meets the criteria selected by the user as the second classification code, flexibility can be provided in selecting the width of the classification system to suit the purpose.

도 4는 본 개시의 다른 실시예에 따른, 변화 신호의 성장성 예측 방법의 순서도이다. 도 4를 참조하면, 분류체계 연계 시스템은 제1 기술 관련 데이터를 획득(S600)할 수 있다. 상기 제1 기술 관련 데이터는 제1 기술이 해당되는 제1 분류 코드를 포함할 수 있다. 상기 제1 기술 관련 데이터는 제1 기술이 해당되는 제1 분류 코드를 포함하지 않을 수 있다. 상기 제1 기술 관련 데이터가 제1 기술이 해당되는 제1 분류 코드를 포함하지 않을 경우 제1 분류 코드와 제1 분류 코드의 분석 데이터를 획득하는 보다 자세한 과정은 도 5를 참조하여 후술한다.Figure 4 is a flowchart of a method for predicting growth of a change signal according to another embodiment of the present disclosure. Referring to FIG. 4, the classification system linkage system may acquire first technology-related data (S600). The first technology-related data may include a first classification code to which the first technology corresponds. The first technology-related data may not include the first classification code to which the first technology corresponds. If the first technology-related data does not include the first classification code to which the first technology corresponds, a more detailed process of obtaining the first classification code and analysis data of the first classification code will be described later with reference to FIG. 5.

제1 분류 코드를 획득한 분류체계 연계 시스템은 제1 분류 코드의 분석 데이터를 획득(S700)할 수 있다. 상기 제1 분류 코드의 분석 데이터는 제1 분류 코드를 설명하는 제1 설명문을 표현하는 입력 피처를 제1 인공지능 모델에 입력할 시 출력되는 데이터를 이용하여 생성할 수 있다. The classification system linking system that has acquired the first classification code may acquire analysis data of the first classification code (S700). The analysis data of the first classification code can be generated using data output when inputting an input feature representing the first description describing the first classification code into the first artificial intelligence model.

상기 제1 설명문을 표현하는 입력 피처는 설명문의 특징인 단어와, 단어 간의 관계를 의미하는 문맥과, 문장 간의 관계를 의미하는 문맥을 반영하는 형식으로 구성될 수 있다.The input feature representing the first description may be configured in a format that reflects words that are characteristic of the description, context meaning the relationship between words, and context meaning the relationship between sentences.

다음으로, 상기 제1 분류 코드의 분석 데이터를 획득한 분류체계 연계 시스템은 제1 분류 코드의 분석 데이터를 표현하는 입력 피처를 제2 인공지능 모델에 입력하여, 제1 분류 코드 벡터를 획득(S800)하고, 특징 공간 상의 거리를 기반으로, 제2 분류 체계 상의 제2 분류 코드를 획득할 수 있다(S900).Next, the classification system linkage system that obtained the analysis data of the first classification code inputs the input feature representing the analysis data of the first classification code into the second artificial intelligence model to obtain the first classification code vector (S800 ), and based on the distance in the feature space, a second classification code on the second classification system can be obtained (S900).

다음으로 도 5를 참조하여 제1 기술 관련 데이터가 제1 기술이 해당되는 제1 분류 코드를 포함하지 않을 경우 제1 분류 코드와 제1 분류 코드의 분석 데이터를 획득하는 과정을 설명한다.Next, referring to FIG. 5, when the first technology-related data does not include the first classification code to which the first technology corresponds, a process for obtaining the first classification code and analysis data of the first classification code will be described.

분류체계 연계 시스템은 제1 분류 코드가 제1 기술 관련 데이터에 존재하는지 판단(S610)하여, 제1 분류 코드가 제1 기술 관련 데이터에 존재하는 경우 바로 제1 분류 코드와 제1 분류 코드의 설명문을 획득하지만, 제1 분류 코드가 제1 기술 관련 데이터에 존재하지 않는 경우, 제1 기술 설명문을 획득(S620)하고, 획득한 제1 기술 설명문을 표현하는 피처를 언어 모델에 입력하여(S630) 제1 분류 코드를 획득(S640)하고, 제1 분류 코드와 제1 분류 코드의 설명문을 획득한다. 상기 언어 모델은 제1 기술 분야의 논문, 특허, 보고서를 포함하는 제1 기술의 전문 분야 데이터와 제1 기술 분야의 분류 코드를 이용하여 생성된 학습 데이터를 이용하여 기계 학습된 모델이다.The classification system linkage system determines whether the first classification code exists in the first technology-related data (S610), and if the first classification code exists in the first technology-related data, the first classification code and an explanation of the first classification code are immediately sent. is obtained, but if the first classification code does not exist in the first technology-related data, the first technology description is acquired (S620), and the feature representing the obtained first technology description is input into the language model (S630) The first classification code is acquired (S640), and the first classification code and a description of the first classification code are obtained. The language model is a machine-learned model using training data generated using specialized field data of the first technology, including papers, patents, and reports in the first technology field, and classification codes of the first technology field.

일 실시예에서, 다양한 관점의 분류 코드와 연계하려는 대상 기술이 신기술로서 분류 코드를 포함하지 않는 기술일 경우, 대상 기술 분야의 논문, 특허, 보고서를 포함하는 기술의 전문 분야 데이터와 대상 기술 분야의 분류 코드를 이용하여 생성된 학습 데이터를 이용하여 기계 학습된 언어 모델을 이용하여 가장 벡터상 근접한 하나의 분류 코드를 대상 기술의 임시 분류 코드로 지정할 수 있다.In one embodiment, when the target technology to be linked with classification codes from various perspectives is a new technology that does not include a classification code, the technical field data including papers, patents, and reports in the target technology field and the target technology field Using learning data generated using classification codes, a machine-learned language model can be used to designate the closest classification code in terms of vector as the temporary classification code of the target technology.

도 9 내지 10은 본 개시의 몇몇 실시예에 따른, 특징 공간에 사상되는 벡터를 설명하기 위한 도면이다. 분류체계 연계 시스템은 특징 공간에 사상되는 벡터를 이용하여 분류 코드간의 유사도를 도출할 수 있다.9 to 10 are diagrams for explaining vectors mapped to a feature space according to some embodiments of the present disclosure. The classification system linkage system can derive the similarity between classification codes using vectors mapped to the feature space.

분류체계 연계 시스템은, 하나 이상의 분류 코드의 설명문을 표현하는 입력 피처(400)를 벡터 추출 모델(500)에 입력하고, 상기 벡터 추출 모델에서 입력한 분류 코드에 대응되는 벡터(600)를 획득할 수 있다. 상기 분류 코드의 설명문의 입력 피처는 상기 분류 코드의 설명문의 특징인 단어와, 단어 간의 관계를 의미하는 문맥과, 문장 간의 관계를 의미하는 문맥을 반영하는 형식으로 구성될 수 있다.The classification system linkage system inputs an input feature 400 representing a description of one or more classification codes into a vector extraction model 500, and obtains a vector 600 corresponding to the classification code input from the vector extraction model. You can. The input features of the description of the classification code may be configured in a format that reflects words that are characteristic of the description of the classification code, context meaning the relationship between words, and context meaning the relationship between sentences.

벡터 추출 모델은 분류체계 DB(200)에 포함된 분류 코드의 설명문과 분류체계 연계표를 이용하여 생성된 학습 데이터를 이용하여 기계 학습된 것일 수 있다. 상기 벡터 추출 모델은 분류체계 연계표에 포함된 각 분류 코드의 대응표를 분석하고, 서로 대응되는 제1 분류 체계 상의 제1 분류 코드 및 제2 분류 체계 상의 제2 분류 코드의 순서쌍 세트를 이용하여 생성된 학습 데이터를 이용하여 기계 학습된 것일 수 있다. 벡터 추출 모델은 출력된 제1 벡터와 출력된 제2 벡터의 특징 공간 상의 거리를 기반으로 하는 손실 함수(loss function)를 이용한 오류 역전파(back-propagation)를 이용하여 상기 벡터 추출 모델의 가중치를 업데이트하는 방식으로 기계 학습될 수 있다.The vector extraction model may be machine-learned using learning data generated using the description of the classification code included in the classification system DB 200 and the classification system linkage table. The vector extraction model analyzes the correspondence table of each classification code included in the classification system linkage table, and is generated using a set of ordered pairs of the first classification code in the first classification system and the second classification code in the second classification system that correspond to each other. It may be machine learned using training data. The vector extraction model uses error back-propagation using a loss function based on the distance in the feature space of the output first vector and the output second vector to adjust the weights of the vector extraction model. It can be machine learned by updating.

일 실시예에서, 상기 제1 벡터와 상기 제2 벡터 사이의 유사도는 유클리드 유사도(euclidean similarity)로 산출될 수 있다. 즉, 상기 제1 벡터와 제2 벡터 사이의 유사도는 벡터 공간 모델(vector space model)에서 두 벡터가 가리키는 포인트 사이의 유클리드 거리에 기초하여 산출될 수도 있다.In one embodiment, the similarity between the first vector and the second vector may be calculated as Euclidean similarity. That is, the similarity between the first vector and the second vector may be calculated based on the Euclidean distance between the points indicated by the two vectors in a vector space model.

분류 코드에 대응되는 벡터(600)를 도시한 도 10을 예로서 설명하면, A 노드에 대응되는 분류 코드와 연계되는 분류 코드 후보에는 A 노드와 인접하게 위치하는 B, C, D 노드에 대응되는 분류 코드가 포함될 수 있다. 분류 체계 연계 시스템은 상기 사용자로부터 입력된 분류체계의 폭에 반영할 벡터 기준을 이용하여 제1 분류 코드에 대해서 연계되는 분류 코드를 지정할 수 있다. 제1 벡터와 제2 벡터 사이의 유사도는 벡터 공간 모델에서 두 벡터가 가리키는 포인트 사이의 유클리드 거리에 기초하여 산출되므로, A 노드와의 거리가 짧은 순으로 노드를 리스트업 하고, 리스트업 되어 있는 노드 중 A 노드와의 거리가 상기 사용자로부터 입력된 분류체계의 폭에 해당되는 노드에 대응되는 분류 코드를 제1 분류 코드에 대응되는 분류 코드로 지정할 수 있다. 10 showing the vector 600 corresponding to the classification code as an example, the classification code candidates associated with the classification code corresponding to the A node include those corresponding to the B, C, and D nodes located adjacent to the A node. A classification code may be included. The classification system linkage system can specify a classification code linked to the first classification code using vector criteria to be reflected in the width of the classification system input from the user. Since the similarity between the first vector and the second vector is calculated based on the Euclidean distance between the points indicated by the two vectors in the vector space model, the nodes are listed in order of the shortest distance to the A node, and the listed nodes The classification code corresponding to the node whose distance from node A corresponds to the width of the classification system input by the user may be designated as the classification code corresponding to the first classification code.

기술 관련데이터에 대응되는 벡터(600)를 도시한 도 10을 예로서 설명하면, 분류 코드 도출 대상 기술 관련데이터에 대응되는 A 노드와 가장 인접한 B 노드가 대응되는 기술 관련데이터가 포함되는 분류 코드를 분류 코드 도출 대상 기술의 분류 코드로 지정할 수 있다. 10 showing the vector 600 corresponding to technology-related data as an example, the A node and the closest B node corresponding to the technology-related data to be derived from the classification code generate a classification code containing the corresponding technology-related data. The classification code can be designated as the classification code of the target technology.

이외에도, 상기 제1 벡터와 제2 벡터 사이의 유사도는 다양한 방식으로 산출될 수 있다.In addition, the similarity between the first vector and the second vector can be calculated in various ways.

도 11은 본 개시의 몇몇 실시예들에서 구성 요소로 이용될 수 있는 컴퓨팅 시스템의 하드웨어 구성도이다. 도 11에 도시된 분류체계 연계 시스템(100)은, 하나 이상의 프로세서(1100), 시스템 버스(1600), 통신 인터페이스(1200), 프로세서(1100)에 의하여 수행되는 컴퓨터 프로그램(1500)을 로드(load)하는 메모리(1400)와, 컴퓨터 프로그램(1500)을 저장하는 스토리지(1300)를 포함할 수 있다.11 is a hardware configuration diagram of a computing system that can be used as a component in some embodiments of the present disclosure. The classification system linking system 100 shown in FIG. 11 loads one or more processors 1100, a system bus 1600, a communication interface 1200, and a computer program 1500 performed by the processor 1100. ) and a storage 1300 that stores a computer program 1500.

프로세서(1100)는 분류체계 연계 시스템(100)의 각 구성의 전반적인 동작을 제어한다. 프로세서(1100)는 본 개시의 다양한 실시예들에 따른 방법/동작을 실행하기 위한 적어도 하나의 애플리케이션 또는 프로그램에 대한 연산을 수행할 수 있다. 메모리(1400)는 각종 데이터, 명령 및/또는 정보를 저장한다. The processor 1100 controls the overall operation of each component of the classification system linking system 100. The processor 1100 may perform operations on at least one application or program to execute methods/operations according to various embodiments of the present disclosure. The memory 1400 stores various data, commands and/or information.

메모리(1400)는 본 개시의 다양한 실시예들에 따른 방법/동작들을 실행하기 위하여 스토리지(1300)로부터 하나 이상의 컴퓨터 프로그램(1500)을 로드(load) 할 수 있다. The memory 1400 may load one or more computer programs 1500 from the storage 1300 to execute methods/operations according to various embodiments of the present disclosure.

스토리지(1300)는 하나 이상의 컴퓨터 프로그램(1500)을 비임시적으로 저장할 수 있다. 컴퓨터 프로그램(1500)은 본 개시의 다양한 실시예들에 따른 방법/동작들이 구현된 하나 이상의 인스트럭션들(instructions)을 포함할 수 있다. Storage 1300 may non-temporarily store one or more computer programs 1500. The computer program 1500 may include one or more instructions implementing methods/operations according to various embodiments of the present disclosure.

컴퓨터 프로그램(1500)이 메모리(1400)에 로드 되면, 프로세서(1100)는 상기 하나 이상의 인스트럭션들을 실행시킴으로써 본 개시의 다양한 실시예들에 따른 방법/동작들을 수행할 수 있다.When the computer program 1500 is loaded into the memory 1400, the processor 1100 can perform methods/operations according to various embodiments of the present disclosure by executing the one or more instructions.

몇몇 실시예들에서, 도 11을 참조하여 설명된 키워드 그룹 생성 시스템은 가상 머신 등 클라우드 기술에 기반하여 서버 팜(server farm)에 포함된 하나 이상의 물리 서버(physical server)를 이용하여 구성될 수 있다. 이 경우, 도 11에 도시된 구성 요소 중 프로세서(1100), 메모리(1400) 및 스토리지(1300) 중 적어도 일부는 가상 하드웨어(virtual hardware)일 수 있을 것이며, 통신 인터페이스(1200) 또한 가상 스위치(virtual switch) 등 가상화된 네트워킹 요소로 구성될 수 있을 것이다.In some embodiments, the keyword group creation system described with reference to FIG. 11 may be configured using one or more physical servers included in a server farm based on cloud technology such as a virtual machine. . In this case, at least some of the processor 1100, memory 1400, and storage 1300 among the components shown in FIG. 11 may be virtual hardware, and the communication interface 1200 may also be a virtual switch. It may be composed of virtualized networking elements such as switches.

컴퓨터 프로그램(1500)은 키워드 그룹 자동 생성 동작을 수행하기 위한 인스트럭션들(instructions)을 포함할 수 있다. 컴퓨터 프로그램(1500)이 메모리(1400)에 로드되면, 프로세서(1100)는 상기 인스트럭션들을 실행시킴으로써 본 개시의 다양한 실시예들에 따른 방법/동작들을 수행할 수 있다.The computer program 1500 may include instructions for performing an automatic keyword group creation operation. When the computer program 1500 is loaded into the memory 1400, the processor 1100 can perform methods/operations according to various embodiments of the present disclosure by executing the instructions.

지금까지 도 1 내지 도 11을 참조하여 본 개시의 다양한 실시예들 및 그 실시예들에 따른 효과들을 언급하였다. 본 개시의 기술적 사상에 따른 효과들은 이상에서 언급한 효과들로 제한되지 않으며, 언급되지 않은 또 다른 효과들은 아래의 기재로부터 통상의 기술자에게 명확하게 이해될 수 있을 것이다.So far, various embodiments of the present disclosure and effects according to the embodiments have been mentioned with reference to FIGS. 1 to 11 . The effects according to the technical idea of the present disclosure are not limited to the effects mentioned above, and other effects not mentioned can be clearly understood by those skilled in the art from the description below.

지금까지 설명된 본 개시의 기술적 사상은 컴퓨터가 읽을 수 있는 매체 상에 컴퓨터가 읽을 수 있는 코드로 구현될 수 있다. 상기 컴퓨터로 읽을 수 있는 기록 매체에 기록된 상기 컴퓨터 프로그램은 인터넷 등의 네트워크를 통하여 다른 컴퓨팅 시스템에 전송되어 상기 다른 컴퓨팅 시스템에 설치될 수 있고, 이로써 상기 다른 컴퓨팅 시스템에서 사용될 수 있다.The technical ideas of the present disclosure described so far can be implemented as computer-readable code on a computer-readable medium. The computer program recorded on the computer-readable recording medium can be transmitted to another computing system through a network such as the Internet, installed in the other computing system, and thus used in the other computing system.

도면에서 동작들이 특정한 순서로 도시되어 있지만, 반드시 동작들이 도시된 특정한 순서로 또는 순차적 순서로 실행되어야만 하거나 또는 모든 도시 된 동작들이 실행되어야만 원하는 결과를 얻을 수 있는 것으로 이해되어서는 안 된다. 특정 상황에서는, 멀티태스킹 및 병렬 처리가 유리할 수도 있다. 이상 첨부된 도면을 참조하여 본 개시의 실시예들을 설명하였지만, 본 개시가 속하는 기술분야에서 통상의 지식을 가진 자는 그 기술적 사상이나 필수적인 특징을 변경하지 않고서 본 개시가 다른 구체적인 형태로도 실시될 수 있다는 것을 이해할 수 있다. 그러므로 이상에서 기술한 실시예들은 모든 면에서 예시적인 것이며 한정적인 것이 아닌 것으로 이해해야만 한다. 본 개시의 보호 범위는 아래의 청구범위에 의하여 해석되어야 하며, 그와 동등한 범위 내에 있는 모든 기술 사상은 본 개시에 의해 정의되는 기술적 사상의 권리범위에 포함되는 것으로 해석되어야 할 것이다.Although operations are shown in the drawings in a specific order, it should not be understood that the operations must be performed in the specific order shown or sequential order or that all illustrated operations must be performed to obtain the desired results. In certain situations, multitasking and parallel processing may be advantageous. Although embodiments of the present disclosure have been described above with reference to the attached drawings, those skilled in the art will understand that the present disclosure can be implemented in other specific forms without changing the technical idea or essential features. I can understand that there is. Therefore, the embodiments described above should be understood in all respects as illustrative and not restrictive. The scope of protection of this disclosure should be interpreted in accordance with the claims below, and all technical ideas within the equivalent scope should be interpreted as being included in the scope of rights of the technical ideas defined by this disclosure.

Claims

In a method performed by a computing system,
Obtaining a first description of a first classification code on a first classification system;
Obtaining a second description of a second classification code on a second classification system;
Inputting an input feature representing the first description into a vector extraction model and obtaining a first vector output from the vector extraction model;
Inputting an input feature representing the second description into the vector extraction model and obtaining a second vector output from the vector extraction model; and
Including the step of inferring whether the first classification code and the second classification code correspond based on the distance in feature space between the first vector and the second vector,
The vector extraction model is machine learned using learning data generated using the correspondence table of each classification code of the first classification system and the second classification system,
Dynamic linking method of classification system.

According to claim 1,
The step of inferring whether the first classification code and the second classification code correspond based on the distance in feature space between the first vector and the second vector includes:
Comprising the step of designating a classification code corresponding to a vector whose distance from the first vector in the feature space is less than or equal to a predefined distance as a second classification code,
Dynamic linking method of classification system.

According to clause 2,
The step of designating a classification code corresponding to a vector whose distance from the first vector in the feature space is less than or equal to a predefined distance as a second classification code,
Including the step of receiving a standard for the distance between vectors to be reflected in the width of the classification system from the user,
Dynamic linking method of classification system.

In a method performed by a computing system,
Analyzing the correspondence table of each classification code of the first classification system and the second classification system, obtaining a set of ordered pairs of the first classification code on the first classification system and the second classification code on the second classification system corresponding to each other. ;
Obtaining learning data using each ordered pair included in the set;
Inputting a description of the first classification code of the learning data into a vector extraction model and obtaining a first vector output from the vector extraction model;
Inputting a description of a second classification code of the learning data into the vector extraction model and obtaining a second vector output from the vector extraction model;
Updating weights of the vector extraction model using error back-propagation using a loss function based on the distance in the feature space of the first vector and the second vector; and
Comprising repeating the steps of obtaining the first vector, obtaining the second vector, and updating the weight for each training data until the training data is exhausted,
A machine learning method for matching classification codes in heterogeneous classification systems.

In a method performed by a computing system,
Obtaining relevant data of a first technology, wherein the relevant data of the first technology includes a first classification code in a first classification system;
An input feature representing the first description (DESCRIPTION) of the first classification code is input into a first artificial intelligence model, and analysis data of the first classification code is analyzed using the output data output from the first artificial intelligence model. Obtaining analysis data of the first classification code reflecting the context of the description;
Inputting an input feature representing the first classification code analysis data into a second artificial intelligence model and obtaining a first vector output from the second artificial intelligence model; and
Linking a second classification code on a second classification system to a first classification code,
The second artificial intelligence model infers whether the analysis data of the first classification code corresponds to the analysis data of the second classification code based on the distance in the feature space,
The second artificial intelligence model is machine learned using learning data generated using the correspondence table of each classification code of the first classification system and the second classification system,
Dynamic linking method of classification system.

According to clause 5,
The step of obtaining the first description of the first classification code in the first classification system is,
When the first technology is a new technology, inputting an input feature representing a description of the first technology into a language model and obtaining a first vector output from the language model;
Including the step of designating a classification code of the technology corresponding to the closest vector in the feature space to the first vector as the first classification code,
The language model is machine learned using training data generated using specialty data of the first technology and classification codes of the first technology,
Dynamic linking method of classification system.

According to clause 6,
The specialty data of the first technology is,
Including papers, patents, and reports in the first technical field,
Dynamic linking method of classification system.

One or more processors;
a memory that loads a computer program executed by the processor; and
Including storage for storing the computer program,
The computer program is,
instructions for obtaining a first description of a first classification code on a first classification system;
instructions for obtaining a second description of a second classification code on a second classification system;
Instructions for inputting an input feature representing the first description into a vector extraction model and obtaining a first vector output from the vector extraction model;
Instructions for inputting an input feature representing the second description into the vector extraction model and obtaining a second vector output from the vector extraction model; and
An instruction for inferring whether the first classification code corresponds to the second classification code based on the distance in feature space between the first vector and the second vector,
The vector extraction model is machine learned using learning data generated using the correspondence table of each classification code of the first classification system and the second classification system,
Dynamic linkage system of classification system.

One or more processors;
a memory that loads a computer program executed by the processor; and
Including storage for storing the computer program,
The computer program is,
An instruction for analyzing the correspondence table of each classification code of the first classification system and the second classification system to obtain a set of ordered pairs of the first classification code on the first classification system and the second classification code on the second classification system that correspond to each other. ;
Instructions for obtaining learning data using each ordered pair included in the set;
Instructions for inputting a description of a first classification code of the learning data into a vector extraction model and obtaining a first vector output from the vector extraction model;
Instructions for inputting a description of a second classification code of the learning data into the vector extraction model and obtaining a second vector output from the vector extraction model;
Instructions for updating weights of the vector extraction model using error back-propagation using a loss function based on the distance in the feature space of the first vector and the second vector; and
Comprising instructions for repeating the steps of obtaining the first vector, acquiring the second vector, and updating the weight for each training data until the training data is exhausted,
A machine learning system for matching classification codes in heterogeneous classification systems.

One or more processors;
a memory that loads a computer program executed by the processor; and
Including storage for storing the computer program,
The computer program is,
Instructions for obtaining relevant data of a first technology, wherein the relevant data of the first technology includes a first classification code on a first classification system;
An input feature representing the first description (DESCRIPTION) of the first classification code is input into a first artificial intelligence model, and analysis data of the first classification code is analyzed using the output data output from the first artificial intelligence model. Obtaining instructions, wherein the analysis data of the first classification code reflects the context of the description; and
Including instructions for inputting an input feature representing the first classification code analysis data into a second artificial intelligence model to link the second classification code in the second classification system to the first classification code, wherein the second artificial intelligence model Connecting classification systems where the similarity between classification codes is greater than a predefined standard,
Dynamic linkage system of classification system.