KR102635609B1

KR102635609B1 - Method and apparatus for predicting and classifying irregular clinical time-series data

Info

Publication number: KR102635609B1
Application number: KR1020210094286A
Authority: KR
Inventors: 석흥일; 이유림
Original assignee: 고려대학교 산학협력단
Priority date: 2021-07-19
Filing date: 2021-07-19
Publication date: 2024-02-08
Also published as: KR20230013485A

Abstract

불규칙 임상 시계열 데이터 예측 및 분류 방법, 및 장치가 개시된다. 불규칙 임상 시계열 데이터 예측 및 분류 장치는, 불규칙한 임상 시계열 데이터, 관측 여부 데이터 및 시간 정보 데이터를 통합하여 상기 불규칙한 임상 시계열 데이터에 대한 심층 표현을 생성하는 다시점 통합 자가주의 모듈; 상기 심층 표현을 이용하여 보조적인 결측 대체값을 생성하는 보조 해석부; 및 상기 심층 표현을 이용하여 환자 상태 예측 또는 진단을 분류하는 예측 및 분류기를 포함한다. A method and apparatus for predicting and classifying irregular clinical time series data are disclosed. The irregular clinical time series data prediction and classification device includes: a multi-view integrated self-attention module that integrates irregular clinical time series data, observation data and temporal information data to generate a deep representation of the irregular clinical time series data; an auxiliary analysis unit that generates auxiliary missing replacement values using the deep expression; and a prediction and classifier that classifies patient condition prediction or diagnosis using the deep representation.

Description

Method and apparatus for predicting and classifying irregular clinical time-series data}

본 발명은 불규칙 임상 시계열 데이터 예측 및 분류 방법, 및 장치에 관한 것이다. The present invention relates to a method and device for predicting and classifying irregular clinical time series data.

전자 건강 데이터(EHR: electronic health data, 이하 EHR이라 칭하기로 함)는 진단, 활력 징후, 실험실 테스트 기록, 절차 및 텍스트 설명과 같은 환자의 건강 데이터 모음이다. 최근 10년 동안 풍부한 건강 데이터 세트로 인해 다변수 EHR 시계열을 사용하여 딥 러닝 모델을 개발하는데 상당한 진전이 있었다. Electronic health data (EHR) is a collection of patient health data such as diagnoses, vital signs, laboratory test records, procedures, and text descriptions. Over the past decade, significant progress has been made in developing deep learning models using multivariate EHR time series due to the abundance of health datasets.

특히 EHR을 포함하는 임상 데이터는 시간적, 측정 변수적으로 큰 불규칙성을 보일 뿐만 아니라 높은 결측률로 인해 환자 상태를 예측하고 분류하는데 낮은 성능을 보이고 있다. In particular, clinical data including EHR not only show large irregularities in time and measurement variables, but also show low performance in predicting and classifying patient conditions due to a high missing rate.

기계학습 모델을 이용하여 측정값의 특징을 추출하여 예측하는 연구가 많이 진행되었지만, 결측값의 대체나 불규칙 데이터를 규칙형으로 바꾸는 등의 추가적인 데이터의 전처리가 이루어져야 한다는 한계가 존재한다.Although much research has been conducted to extract and predict the characteristics of measured values using machine learning models, there is a limitation that additional data preprocessing, such as replacing missing values or converting irregular data into regular data, is required.

최근의 딥러닝 기법은 특징의 추출과 예측 및 분류를 동시에 할 수 있다는 장점이 있기 때문에 다양한 자연어처리 등의 다른 분야에서 각광을 받았고 동시에 많은 발전을 보였다. 그러나 딥러닝 기법을 임상 시계열 데이터 분류 분야에서의 예측 및 분류에 사용하기에는 불규칙성이나 높은 결측률 등의 요인 때문에 한계가 존재한다. Recently, deep learning techniques have the advantage of being able to extract features, predict, and classify features at the same time, so they have been in the spotlight in other fields such as natural language processing and have shown great progress at the same time. However, there are limitations in using deep learning techniques for prediction and classification in the field of clinical time series data classification due to factors such as irregularity and high missingness rate.

삭제delete

(01) 특허문헌 한국등록특허 제10-2248732호(2021.04.29.)(01) Patent Document Korean Patent No. 10-2248732 (2021.04.29.)

본 발명은 불규칙 임상 시계열 데이터 예측 및 분류 방법, 및 장치를 제공하기 위한 것이다. The present invention is to provide a method and device for predicting and classifying irregular clinical time series data.

또한, 본 발명은 자가주의 매커니즘(self-attention mechanism)을 기반으로 결측값이 높고 불규칙적인 임상 데이터와 이에 포함된 결측 패턴 정보를 함께 모델링하여 예측 및 분류 결과 제공이 가능한 불규칙 임상 시계열 데이터 예측 및 분류 방법, 및 장치를 제공하기 위한 것이다. In addition, the present invention predicts and classifies irregular clinical time series data that can provide prediction and classification results by modeling irregular clinical data with high missing values and missing pattern information included therein based on a self-attention mechanism. To provide a method and device.

또한, 본 발명은 불규칙적 임상 시계열 데이터와 그에서 파생된 결측 패턴에 대한 정보를 자가주의 메커니즘 기반의 방법으로 통합하여 데이터에 대한 추가적인 전처리 없이 예측 및 분류의 성능을 향상시킬 수 있는 불규칙 임상 시계열 데이터 예측 및 분류 방법, 및 장치를 제공하기 위한 것이다. In addition, the present invention integrates information about irregular clinical time series data and missing patterns derived therefrom into a method based on a self-attention mechanism to predict irregular clinical time series data, which can improve prediction and classification performance without additional preprocessing of the data. and to provide a classification method and device.

본 발명의 일 측면에 따르면, 불규칙 임상 시계열 데이터 예측 및 분류 장치가 제공된다. According to one aspect of the present invention, an apparatus for predicting and classifying irregular clinical time series data is provided.

본 발명의 일 실시예에 따르면, 규칙한 임상 시계열 데이터, 관측 여부 데이터 및 시간 정보 데이터를 통합하여 상기 불규칙한 임상 시계열 데이터에 대한 심층 표현을 생성하는 다시점 통합 자가주의 모듈; 상기 심층 표현을 이용하여 보조적인 결측 대체값을 생성하는 보조 해석부; 및 상기 심층 표현을 이용하여 환자 상태 예측 또는 진단을 분류하는 예측 및 분류기를 포함하는 불규칙 임상 시계열 데이터 예측 및 분류 장치가 제공될 수 있다. According to one embodiment of the present invention, there is provided a multi-view integrated self-attention module that integrates regular clinical time series data, observation data, and temporal information data to generate a deep representation of the irregular clinical time series data; an auxiliary analysis unit that generates auxiliary missing replacement values using the deep expression; A prediction and classification device for irregular clinical time series data may be provided, including a prediction and classifier that classifies patient condition prediction or diagnosis using the deep expression.

상기 예측 및 분류기 학습시, 상기 보조적인 결측 대체값이 더 이용될 수 있다. When learning the prediction and classifier, the auxiliary missing replacement value may be further used.

상기 다시점 통합 자가주의 모듈은, 상기 불규칙 임상 시계열 데이터, 상기 관측 여부 데이터 및 상기 시간 정보 데이터 각각을 임베딩하여 각각의 특징값을 구하고, 자가주의 매커니즘을 이용하여 상기 관측 여부 데이터와 상기 시간 정보 데이터의 특징값을 통합하여 결측 패턴 정보를 생성하고, 상기 결측 패턴 정보와 상기 불규칙 임상 시계열 데이터를 자가주의 블록을 이용하여 통합한 후 순방향 신경망(FFN: feed-forward network) 모듈에 적용하여 상기 심층 표현을 생성할 수 있다. The multi-view integrated self-attention module obtains each feature value by embedding each of the irregular clinical time series data, the observation status data, and the time information data, and uses a self-attention mechanism to obtain the observation status data and the time information data. Missing pattern information is generated by integrating the feature values, and the missing pattern information and the irregular clinical time series data are integrated using a self-attention block and then applied to a forward neural network (FFN: feed-forward network) module to express the deep expression. can be created.

상기 보조 해석부는, 상기 불규칙한 임상 시계열 데이터 중 일부를 마스킹한 후 상기 심층 표현을 이용하여 마스킹된 값을 복원하여 보조적인 결측 대체값을 생성할 수 있다. The auxiliary analysis unit may mask some of the irregular clinical time series data and then restore the masked values using the deep expression to generate auxiliary missing replacement values.

본 발명의 다른 측면에 따르면, 불규칙 임상 시계열 데이터 예측 및 분류 방법이 제공된다. According to another aspect of the present invention, a method for predicting and classifying irregular clinical time series data is provided.

본 발명의 일 실시예에 따르면, (a) 불규칙한 임상 시계열 데이터, 관측 여부 데이터 및 시간 정보 데이터를 통합하여 상기 불규칙한 임상 시계열 데이터에 대한 심층 표현을 생성하는 단계; (b) 상기 불규칙한 임상 시계열 데이터 중 일부를 마스킹한 후 상기 심층 표현을 이용하여 마스킹된 값을 복원하여 보조적인 결측 대체값을 생성하는 단계; 및 (c) 상기 심층 표현 및 상기 보조적인 결측 대체값을 이용하여 환자 상태 예측 또는 진단을 분류하는 예측 및 분류기 모델을 학습시키는 단계를 포함하는 불규칙 임상 시계열 데이터 예측 및 분류 방법이 제공될 수 있다. According to an embodiment of the present invention, (a) integrating irregular clinical time series data, observation status data, and time information data to generate a deep representation of the irregular clinical time series data; (b) masking some of the irregular clinical time series data and then restoring the masked values using the deep representation to generate auxiliary missing replacement values; and (c) learning a prediction and classifier model that classifies a patient condition prediction or diagnosis using the deep expression and the auxiliary missing replacement value. A method for predicting and classifying irregular clinical time series data may be provided.

상기 (a) 단계는, 상기 불규칙 임상 시계열 데이터, 상기 관측 여부 데이터 및 상기 시간 정보 데이터 각각을 임베딩하여 각각의 특징값을 구하는 단계; 제1 자가주의 모듈을 기반으로 상기 관측 여부 데이터와 상기 시간 정보 데이터의 특징값을 통합하여 결측 패턴 정보를 생성하는 단계; 및 제2 자가주의 모듈을 기반으로 상기 불규칙한 임상 시계열 데이터의 특징값과 상기 결측 패턴 정보를 통합한 후 순방향 신경망(FFN: feed-forward network)에 적용하여 결측 패턴에 대한 심층 표현을 생성하는 단계를 포함할 수 있다. The step (a) includes calculating feature values by embedding each of the irregular clinical time series data, the observation status data, and the time information data; generating missing pattern information by integrating feature values of the observation status data and the time information data based on a first self-attention module; And a step of integrating the feature values of the irregular clinical time series data and the missing pattern information based on a second self-attention module and then applying it to a feed-forward network (FFN) to generate a deep representation of the missing pattern. It can be included.

본 발명의 일 실시예에 따른 불규칙 임상 시계열 데이터 분류, 예측 방법 및 장치를 제공함으로써, 자가주의 매커니즘(self-attention mechanism)을 기반으로 결측값이 높고 불규칙적인 임상 데이터와 이에 포함된 결측 패턴 정보를 함께 모델링하여 정확한 예측 및 분류 결과 제공이 가능한 이점이 있다. By providing a method and apparatus for classifying and predicting irregular clinical time series data according to an embodiment of the present invention, clinical data with high missing values and irregular clinical data and missing pattern information included therein are classified based on a self-attention mechanism. There is an advantage in being able to provide accurate prediction and classification results by modeling together.

또한, 본 발명은 불규칙적 임상 시계열 데이터와 그에서 파생된 결측 패턴에 대한 정보를 자가주의 메커니즘 기반의 방법으로 통합하여 데이터에 대한 추가적인 전처리 없이 예측 및 분류의 성능을 향상시킬 수 있는 이점이 있다. In addition, the present invention has the advantage of improving prediction and classification performance without additional preprocessing of the data by integrating information about irregular clinical time series data and missing patterns derived therefrom into a method based on a self-attention mechanism.

도 1은 본 발명의 일 실시예에 따른 불규칙한 임상 시계열 데이터 예측 및 분류를 위한 장치의 구성을 개략적으로 도시한 도면.
도 2는 본 발명의 일 실시예에 따른 다시점 통합 자가주의 모듈의 내부 구성을 개략적으로 도시한 도면.
도 3은 본 발명의 일 실시예에 따른 보조 해석부의 내부 구성을 개략적으로 도시한 블록도.
도 4는 본 발명의 일 실시예에 따른 불규칙한 임상 시계열 데이터 예측 및 분류 방법을 위한 의사 코드를 도시한 도면.
도 5는 본 발명의 일 실시예에 따른 불규칙 임상 시계열 데이터 분류, 예측 방법을 나타낸 순서도. 1 is a diagram schematically showing the configuration of an apparatus for predicting and classifying irregular clinical time series data according to an embodiment of the present invention.
Figure 2 is a diagram schematically showing the internal configuration of a multi-viewpoint integrated self-attention module according to an embodiment of the present invention.
Figure 3 is a block diagram schematically showing the internal configuration of an auxiliary analysis unit according to an embodiment of the present invention.
Figure 4 is a diagram illustrating pseudo code for a method for predicting and classifying irregular clinical time series data according to an embodiment of the present invention.
Figure 5 is a flowchart showing a method for classifying and predicting irregular clinical time series data according to an embodiment of the present invention.

본 명세서에서 사용되는 단수의 표현은 문맥상 명백하게 다르게 뜻하지 않는 한, 복수의 표현을 포함한다. 본 명세서에서, "구성된다" 또는 "포함한다" 등의 용어는 명세서상에 기재된 여러 구성 요소들, 또는 여러 단계들을 반드시 모두 포함하는 것으로 해석되지 않아야 하며, 그 중 일부 구성 요소들 또는 일부 단계들은 포함되지 않을 수도 있고, 또는 추가적인 구성 요소 또는 단계들을 더 포함할 수 있는 것으로 해석되어야 한다. 또한, 명세서에 기재된 "...부", "모듈" 등의 용어는 적어도 하나의 기능이나 동작을 처리하는 단위를 의미하며, 이는 하드웨어 또는 소프트웨어로 구현되거나 하드웨어와 소프트웨어의 결합으로 구현될 수 있다.As used herein, singular expressions include plural expressions unless the context clearly dictates otherwise. In this specification, terms such as “consists of” or “comprises” should not be construed as necessarily including all of the various components or steps described in the specification, and some of the components or steps may be included in the specification. It may not be included, or it should be interpreted as including additional components or steps. In addition, terms such as "... unit" and "module" used in the specification refer to a unit that processes at least one function or operation, which may be implemented as hardware or software, or as a combination of hardware and software. .

이하, 첨부된 도면들을 참조하여 본 발명의 실시예를 상세히 설명한다. Hereinafter, embodiments of the present invention will be described in detail with reference to the attached drawings.

도 1은 본 발명의 일 실시예에 따른 불규칙한 임상 시계열 데이터 예측 및 분류를 위한 장치의 구성을 개략적으로 도시한 도면이고, 도 2는 본 발명의 일 실시예에 따른 다시점 통합 자가주의 모듈의 내부 구성을 개략적으로 도시한 도면이고, 도 3은 본 발명의 일 실시예에 따른 보조 해석부의 내부 구성을 개략적으로 도시한 블록도이며, 도 4는 본 발명의 일 실시예에 따른 불규칙한 임상 시계열 데이터 예측 및 분류 방법을 위한 의사 코드를 도시한 도면이다. Figure 1 is a diagram schematically showing the configuration of an apparatus for predicting and classifying irregular clinical time series data according to an embodiment of the present invention, and Figure 2 is a diagram showing the interior of a multi-view integrated self-attention module according to an embodiment of the present invention. It is a diagram schematically showing the configuration, Figure 3 is a block diagram schematically showing the internal structure of the auxiliary analysis unit according to an embodiment of the present invention, and Figure 4 is a prediction of irregular clinical time series data according to an embodiment of the present invention. and a diagram showing pseudo code for the classification method.

도 1을 참조하면, 본 발명의 일 실시예에 따른 불규칙 임상 시계열 데이터 예측 및 분류 장치(100)는 다시점 통합 자가주의 모듈(110), 보조 해석부(120), 예측 및 분류기(130), 메모리(140) 및 프로세서(150)를 포함하여 구성된다. Referring to FIG. 1, the apparatus 100 for predicting and classifying irregular clinical time series data according to an embodiment of the present invention includes a multi-view integrated self-attention module 110, an auxiliary analysis unit 120, a prediction and classifier 130, It is configured to include a memory 140 and a processor 150.

다시점 통합 자가주의 모듈(110)은 불규칙한 임상 시계열 데이터, 관측 여부 데이터, 시간 정보 데이터를 각각 입력 데이터로 입력받은 후 이를 순차적으로 통합하여 특징을 찾은 후 순방향 신경망 모델에 적용하여 불규칙한 임상 시계열 데이터에 대한 심층 표현을 생성하기 위한 수단이다. The multi-view integration self-attention module 110 receives irregular clinical time series data, observation data, and time information data as input data, sequentially integrates them, finds features, and applies them to the forward neural network model to the irregular clinical time series data. It is a means to create an in-depth expression of

다시점 통합 자가주의 모듈(110)은 데이터 임베딩부(210), 통합 모듈(220) 및 순방향 신경망 모듈(230)을 포함하여 구성된다. The multi-view integrated self-attention module 110 includes a data embedding unit 210, an integration module 220, and a forward neural network module 230.

각 서브젝트 n에 대해, 길이의 시점 의 D-차원 다변량 시계열 집합이 주어지면, 관측 시계열을 와 같이 나타내기로 한다. 여기서, 는 모든 변수의 t번째 관측을 나타내고, 는 내의 d번째 엘리먼트를 나타낸다. For each subject n, point of length Given a set of D-dimensional multivariate time series of It will be expressed as follows. here, represents the tth observation of all variables, Is Indicates the dth element within.

시계열 데이터 은 결측값(missing values)를 포함하기 때문에, 시계열 데이터에 대해 관측 또는 결측 변수를 마크하기 위해 과 동일한 사이즈를 가지는 관측 여부 정보(즉, 결측 지표(missing indicator)) 를 도입하기로 한다. time series data contains missing values, so to mark observed or missing variables for time series data, Observation information (i.e., missing indicator) with the same size as decided to introduce.

관측 여부 데이터는 만일 이 관측되면, 이며, 그렇지 않은 경우 과 같이 생성될 수 있다. 또한, 불규칙 임상 시계열 데이터의 관측이 결측되는 경우, 입력은 내에 상응하는 요소에 대해 제로(0)으로 설정될 수 있다. 각 변수 d에 대해 시간 간격을 로 나타내기로 한다. 여기서, 은 수학식 1과 같이 정의될 수 있다. Observation data is When this is observed, , and if not It can be created as follows. Additionally, if observations of irregular clinical time series data are missing, the input The corresponding element within can be set to zero (0). For each variable d, the time interval is It will be expressed as . here, Can be defined as Equation 1.

불규칙 임상 시계열 데이터셋이 와 같이 주어지면, 예측 라벨 정의를 위한 매핑 함수 구축이 가능하다. Irregular clinical time series dataset Given as , predicted label It is possible to build a mapping function for definition.

데이터 임베딩부(210)는 각 입력 데이터(즉, 불규칙한 임상 시계열 데이터, 관측 여부 데이터, 시간 정보 데이터)를 각각 임베딩하여 특징값을 도출한다. 다시점 통합 자가주의 모듈(110)은 연속적인 시간 값을 입력으로 취하는 위치 인코딩의 변형으로 시간 임베딩(TE: time embedding)을 사용하고, 시간 임베딩을 인코딩 벡터 표현으로 변환할 수 있다. The data embedding unit 210 derives feature values by embedding each input data (i.e., irregular clinical time series data, observation status data, and time information data). The multi-view integrated self-attention module 110 uses time embedding (TE) as a variation of position encoding that takes continuous time values as input, and can convert the time embedding into an encoding vector representation.

이를 통해, 정확한 시점과 시간 간격을 고려하여 불규칙하게 샘플링된 임상 시계열 데이터를 처리할 수 있다. Through this, it is possible to process irregularly sampled clinical time series data by considering the exact time point and time interval.

시간 임베딩은 수학식 2와 같이 나타낼 수 있다. Time embedding can be expressed as Equation 2.

여기서, t, d, 및 는 각각 정확한 시간 포인트, 변수의 인덱스, 모델 차원, 데이터의 최대 시간 길이를 나타낸다. 시간 임베딩은 학습된 입력 임베딩에 추가될 수 있다. 이를 수학식으로 나타내면 수학식 3 내지 수학식 5와 같다. where t, d, and represents the exact time point, variable index, model dimension, and maximum time length of data, respectively. Temporal embeddings can be added to the learned input embeddings. If this is expressed in a mathematical formula, it is the same as Equation 3 to Equation 5.

여기서, 는 각각 다시점 입력에 대한 상응하는 임베딩 가중치 행렬을 나타낸다. here, represents the corresponding embedding weight matrix for the multi-view input, respectively.

종래에는 결측 지표와 시간 간격과 같은 결측치의 소스를 활용할뿐, 결측 데이터의 표현(representation)을 학습하지 않음으로써, 결측에 대한 부적절한 모델링으로 인해 신뢰할 수 없는 평가와 측정 변경에 강건하지 않은 모델 생성으로 이어지는 문제점이 있었다. Conventionally, only sources of missing values such as missing indicators and time intervals are utilized, but the representation of missing data is not learned, resulting in unreliable evaluation due to inappropriate modeling of missingness and creation of models that are not robust to measurement changes. There was a problem that followed.

따라서, 본 발명의 일 실시예에서는 결측 지표와 시간 간격을 모두 효과적으로 활용하여 불규칙한 임상 시계열 데이터의 심층 표현을 학습할 수 있다. Therefore, in one embodiment of the present invention, both missing indicators and time intervals can be effectively utilized to learn a deep representation of irregular clinical time series data.

모든 시간에서, D 측정값이 주어지는 경우, 다시점 통합 자가주의 모듈(110)은 불규칙 임상 시계열 데이터, 관측 여부 데이터 및 시간 정보 데이터에 대한 입력 데이터의 임베딩하고, 이를 통해 각 입력 데이터(즉, 불규칙 임상 시계열 데이터, 관측 여부 데이터 및 시간 정보 데이터)에 대한 특징값을 구할 수 있다. At any time, given the D measurement value, the multi-view integrated self-attention module 110 embeds the input data for irregular clinical time series data, observational data, and temporal information data, and thereby embeds the input data for each input data (i.e., irregular Feature values for clinical time series data, observation status data, and time information data) can be obtained.

다시점 통합 자가주의 모듈(110)은 쿼리(Q), 키(K) 및 값(V) 집합에 대해 스케일된 도트 프로덕트 어텐션(scaled dot-product attention)이 계산되는 다중헤드 자가주의 블록(MHA: multi-head attention)을 기반으로 한다. The multi-view integrated self-attention module 110 is a multi-head self-attention block (MHA: It is based on multi-head attention.

자가주의 블록에 기초하여 관측 X를 포함하는 다시점 불규칙 임상 시계열 데이터의 어텐션 표현(attention representation)이 학습될 수 있다. Based on the self-attention block, an attention representation of multi-view irregular clinical time series data including observation X can be learned.

예를 들어, 각 입력 집합은 자가주의 블록을 통해 자신의 표현 을 학습할 수 있다. 여기서, 각 데이터 포인트는 자체 가중치 행렬과 선형적으로 결합되어 해당하는 Q, K 및 V로 공급될 수 있다. For example, each set of inputs Express yourself through self-attention blocks You can learn. Here, each data point can be linearly combined with its own weight matrix to feed into the corresponding Q, K, and V.

각 입력 데이터에 대한 어텐션 표현은 수학식 6 내지 수학식 8과 같이 수학식으로 정리될 수 있다. The attention expression for each input data can be organized into a mathematical equation, such as Equation 6 to Equation 8.

여기서, W는 학습 가능한 가중치 행렬의 집합을 나타내고, 는 SoftMax 활성화 함수를 나타내고, 는 키 벡터의 차원을 나타낸다. Here, W represents the set of learnable weight matrices, represents the SoftMax activation function, represents the dimension of the key vector.

통합 모듈(220)은 자가주의 블록에 의존하며, 입력 데이터의 특징값들을 통합하기 위한 수단이다. The integration module 220 relies on the self-attention block and is a means for integrating feature values of input data.

통합 모듈(220)은 입력 데이터 중 관측 여부 데이터와 시간 정보 데이터의 임베딩된 특징값을 통합하여 복잡한 결측 패턴 정보를 생성할 수 있다. 또한, 통합 모듈(220)은 불규칙한 임상 시계열 데이터의 특징값과 결측 패턴 정보를 표현 공간에서 결합할 수 있다. The integration module 220 can generate complex missing pattern information by integrating the embedded feature values of the observation status data and time information data among the input data. Additionally, the integration module 220 can combine feature values and missing pattern information of irregular clinical time series data in the expression space.

이와 같이, 결측 자체로부터 기본 표현을 학습하면 결측값을 전가할 필요가 없으며, 어떠한 휴리스틱한 함수를 특정할 필요가 없다. In this way, if the basic expression is learned from the missingness itself, there is no need to impute missing values and there is no need to specify any heuristic function.

따라서, 통합 모듈(220)은 두 단계의 통합 과정(제1 통합 단계, 제2 통합 단계)가 수반될 수 있다. 이에 대해 보다 상세히 설명하기로 한다. Accordingly, the integration module 220 may involve a two-step integration process (a first integration step and a second integration step). Let us explain this in more detail.

즉, 제1 통합 단계에서, 통합 모듈(220)은 제1 자가주의 블록을 이용하여 시간 정보 데이터에 기초한 시간 간격의 표현과 관측 여부 데이터에 기반한 결측 지표의 표현을 통합하여 결측 패턴의 표현()을 획득할 수 있다. 이를 수학식으로 나타내면 수학식 9와 같다. That is, in the first integration step, the integration module 220 integrates the expression of the time interval based on the time information data and the expression of the missing indicator based on the observation data using the first self-attention block to express the missing pattern ( ) can be obtained. If this is expressed mathematically, it is as shown in Equation 9.

제2 통합 단계에서, 통합 모듈(220)은 제2 자가주의 블록을 이용하여 불규칙한 임상 시계열 데이터와 결측 패턴 정보의 표현을 통합할 수 있다. 이를 수학식으로 나타내면 수학식 10과 같다. In the second integration step, the integration module 220 may integrate the representation of irregular clinical time series data and missing pattern information using the second self-attention block. If this is expressed mathematically, it is as shown in Equation 10.

상술한 바와 같이, 통합 모듈(220)은 복수의 자가주의 모듈을 기반으로, 입력 데이터들을 순차적으로 통합할 수 있다. As described above, the integration module 220 may sequentially integrate input data based on a plurality of self-attention modules.

통합 모듈(220)에 의해 입력 데이터들이 순차적으로 통합된 결과는 순방향 신경망 모듈(230)로 입력되며, 순방향 신경망 모듈(230)을 통해 불규칙한 임상 시계열 데이터와 통합된 결측 패턴 간의 관계를 모델링하여 공동 학습된 심층 표현을 출력할 수 있다.The result of sequentially integrating the input data by the integration module 220 is input to the forward neural network module 230, and joint learning is performed by modeling the relationship between the irregular clinical time series data and the integrated missing pattern through the forward neural network module 230. A deep expression can be output.

순방향 신경망 모듈(230)은 변수간의 종속성을 모델링하도록 각 시점에 동일하게 적용될수 있다. The forward neural network module 230 can be applied equally at each time point to model dependencies between variables.

최종 심층 표현은 보조적인 결측 대체값 생성 및 환자 상태 예측 또는 진단을 분류에 이용될 수 있다. Final deep representation Can be used to generate auxiliary missing replacement values and classify patient condition prediction or diagnosis.

보조 해석부(120)는 다시점 통합 자가주의 모듈(110)에서 출력된 심층 표현()을 이용하여 보조적인 결측 대체값을 생성하기 위한 수단이다. The auxiliary analysis unit 120 is a deep expression output from the multi-view integrated self-attention module 110 ( ) is a means for generating auxiliary missing replacement values.

보조 해석부(120)는 불규칙한 임상 시계열 데이터의 일부를 고의로 마스킹한 후 다시점 통합 자가주의 모듈(110)에서 생성된 최종 심층 표현을 이용하여 마스킹한 값을 복원하는 과정을 통해 보조적인 결측 대체값을 생성할 수 있다. The auxiliary analysis unit 120 intentionally masks part of the irregular clinical time series data and then restores the masked value using the final deep expression generated by the multi-view integrated self-attention module 110 to provide auxiliary missing replacement values. can be created.

보조 해석부(120)의 상세 구조는 도 3에 도시된 바와 같다.The detailed structure of the auxiliary analysis unit 120 is as shown in FIG. 3.

도 3에 도시된 바와 같이, 보조 해석부(120)는 다중헤드 자가주의 모듈(310)과 순방향 신경망 모듈(320)을 포함하여 구성된다. As shown in FIG. 3, the auxiliary analysis unit 120 includes a multi-head self-attention module 310 and a forward neural network module 320.

보조 해석부(120)는 쿼리로서 다시점 통합 자가주의 모듈(110)에서 출력된 심층 표현()과 키와 값으로 불규칙 임상 시계열 데이터() 사이의 다중헤드 자가주의 모듈(310)을 적용하고, 순방향 신경망 모듈(320)을 뒤에 배치할 수 있다. The auxiliary analysis unit 120 uses a deep expression output from the multi-view integrated self-attention module 110 as a query ( ) and irregular clinical time series data with keys and values ( ) The multi-head self-attention module 310 can be applied, and the forward neural network module 320 can be placed behind it.

출력 계층은 학습된 임베딩 행렬()을 사용하여 타겟 시간 시퀀스에 매핑되고, 대치된 데이터()를 생성할 수 있다. 이를 수학식으로 나타내면 수학식 11과 같다. The output layer uses the learned embedding matrix ( ), and the imputed data ( ) can be created. If this is expressed mathematically, it is as shown in Equation 11.

마스킹된 값을 표시하기 위해 도입된 다른 마스킹 벡터가 주어지면, 전가 손실 은 불규칙한 임상 시계열 데이터(X)와 에 의해 마스킹된 값을 위한 전가된 샘플() 사이의 마스킹된 평균 제곱 에러(MSE: mean squared error)에 의해 계산될 수 있다. Different masking vectors introduced to represent masked values. Given , impute loss is irregular clinical time series data (X) and Imputed samples for values masked by ( ) can be calculated by the masked mean squared error (MSE) between.

이를 수학식으로 나타내면, 수학식 12와 같다. If this is expressed mathematically, it is as shown in Equation 12.

목적 함수에 전가 손실을 도입하여 최적의 예측 결과를 달성하기 위한 보조적인 정보 제공이 가능케 할 수 있다. By introducing an imputation loss into the objective function, it is possible to provide auxiliary information to achieve optimal prediction results.

합성 손실은 예측 및 전가 손실을 누적하여 수학식 13과 같이 정의될 수 있다. The synthetic loss can be defined as Equation 13 by accumulating the prediction and imputation losses.

여기서, 와 는 두 손실 사이의 비율 조절을 위한 하이퍼매개변수를 나타낸다. here, and represents the hyperparameter for adjusting the ratio between the two losses.

합성 손실()을 통해 단대단(end-to-end) 방식으로 모델의 모든 매개변수가 최적화실 수 있다. Composite loss ( ), all parameters of the model can be optimized in an end-to-end manner.

예측 및 분류기(130)는 다시점 통합 자가주의 모듈(110)로부터 출력된 최종 심층 표현을 기반으로 환자 상태 예측 또는 진단을 분류하기 위한 수단이다.The prediction and classifier 130 is a means for predicting or classifying a patient's condition based on the final deep expression output from the multi-view integrated self-attention module 110.

예측 및 분류기(130)는 학습시 심층 표현과 보조 해석부(120)에 의해 출력된 보조적인 결측 대체값을 이용하여 학습될 수 있다. 예측 및 분류기(130)는 심층 표현과 보조적인 결측 대체값을 이용하여 보조적인 결측 대체값에 대한 손실값이 최소가 되도록 학습될 수 있다. The prediction and classifier 130 may be trained using deep expressions and auxiliary missing replacement values output by the auxiliary analysis unit 120 during learning. The prediction and classifier 130 can be trained using deep representations and auxiliary missing replacement values to minimize loss for auxiliary missing replacement values.

보조적인 결측 대체값은 예측 및 분류기(130)의 학습에만 이용되며, 학습이 완료된 이후에는 이용되지 않을 수 있다. The auxiliary missing replacement value is used only for prediction and training of the classifier 130, and may not be used after training is completed.

예측 및 분류기(130)는 심층 표현을 기반으로 환자 상태를 예측하거나 진단을 분류할 수 있다. The prediction and classifier 130 can predict the patient's condition or classify the diagnosis based on the deep expression.

예를 들어, 예측 및 분류기(130)는 병원내 사망률이나 LOS 예측을 수행할 수 있다. For example, prediction and classifier 130 may perform in-hospital mortality or LOS prediction.

병원내 사망률 예측은 환자가 입원 기간 동안 사망했는지 또는 퇴원할 예정인지를 나타내는 이진 분류로, 자에 대한 타겟 라벨은 과 같다. In-hospital mortality prediction is a binary classification indicating whether the patient died during hospitalization or is scheduled to be discharged, with the target label for Same as

LOS 예측은 환자의 입원과 입원 종료 사이의 일수를 예측하는 회귀 작업일 수 있다. LOS prediction can be a regression task predicting the number of days between a patient's admission and end of hospitalization.

최종 심층 표현()이 주어지면, 타임스탬프에 대한 평균 풀링을 수행하고, 풀링된 최종 표현() 다음에 다층 퍼셉트론(MLP)가 뒤 따른다. 최종 계층은 특정 작업에 따라 달라지며 분류 및 다중 레이블 분류에 시그모이드 계층이 사용될 수 있다. Final deep representation ( ), perform average pooling on the timestamps, and obtain the final pooled representation ( ) followed by a multilayer perceptron (MLP). The final layer depends on the specific task, and sigmoid layers can be used for classification and multi-label classification.

예측 및 분류기(130)는 의료 데이터의 특성상 불균형한 데이터에서의 열악한 분류 성능 문제를 해결하기 위해 각 환자의 타겟 레이블()과 예측된 레이블() 사이의 분류 손실을 계산하기 위해 수학식 14 내지 수학식 16과 같은 목적 함수를 이용할 수 있다. The prediction and classifier 130 uses the target label ( ) and the predicted label ( ) To calculate the classification loss, objective functions such as Equation 14 to Equation 16 can be used.

이진 분류 손실 수학식 14를 이용하여 계산될 수 있다. Binary classification loss can be calculated using Equation 14.

회귀 손실은 수학식 15를 이용하여 계산될 수 있다. Regression loss can be calculated using Equation 15.

다중 레벨 분류 손실은 수학식 16을 이용하여 계산될 수 있다. Multi-level classification loss can be calculated using Equation 16.

여기서, N은 전체 환자의 수를 나타내고, K는 데이터베이스내의 라벨의 전체 수를 나타내며, 는 마이너리티 클래스를 위한 포커싱 매개변수를 나타내며, 는 클래스간의 중요도 균형을 위한 가중치를 나타낸다. Here, N represents the total number of patients, K represents the total number of labels in the database, represents the focusing parameter for the minority class, represents the weight for balancing importance between classes.

메모리(140)는 본 발명의 일 실시예에 따른 불규칙 임상 시계열 데이터 분류, 예측 방법을 수행하기 위해 필요한 다양한 명령어를 저장한다. The memory 140 stores various commands necessary to perform a method for classifying and predicting irregular clinical time series data according to an embodiment of the present invention.

프로세서(150)는 본 발명의 일 실시예에 따른 불규칙 임상 시계열 데이터 예측 및 분류 장치의 내부 구성 요소들(예를 들어, 다시점 통합 자가주의 모듈(110), 보조 해석부(120), 예측 및 분류기(130), 메모리(140) 등)을 제어하기 위한 수단이다. 도 4에는 본 발명의 일 실시예에 따른 불규칙 임상 시계열 데이터 분류, 예측 방법에 대한 의사 코드가 예시되어 있다. The processor 150 includes internal components (e.g., multi-view integrated self-attention module 110, auxiliary analysis unit 120, prediction and It is a means for controlling the classifier 130, memory 140, etc.). Figure 4 illustrates pseudo code for a method for classifying and predicting irregular clinical time series data according to an embodiment of the present invention.

도 5는 본 발명의 일 실시예에 따른 불규칙 임상 시계열 데이터 분류, 예측 방법을 나타낸 순서도이다. Figure 5 is a flowchart showing a method for classifying and predicting irregular clinical time series data according to an embodiment of the present invention.

단계 510에서 불규칙 임상 시계열 데이터 예측 및 분류 장치(100)는 불규칙한 임상 시계열 데이터, 관측 여부 데이터 및 시간 정보 데이터를 통합하여 상기 불규칙한 임상 시계열 데이터에 대한 심층 표현을 생성한다. In step 510, the irregular clinical time series data prediction and classification device 100 integrates the irregular clinical time series data, observation data, and time information data to generate an in-depth representation of the irregular clinical time series data.

이미 전술한 바와 같이, 불규칙 임상 시계열 데이터 예측 및 분류 장치(100)는 불규칙 임상 시계열 데이터, 상기 관측 여부 데이터 및 상기 시간 정보 데이터 각각을 임베딩하여 각각의 특징값을 구하고, 자가주의 매커니즘을 이용하여 상기 관측 여부 데이터와 상기 시간 정보 데이터의 특징값을 통합하여 결측 패턴 정보를 생성하고, 상기 결측 패턴 정보와 상기 불규칙 임상 시계열 데이터를 자가주의 블록을 이용하여 통합한 후 순방향 신경망(FFN: feed-forward network) 모듈에 적용하여 상기 심층 표현을 생성할 수 있다. As already described above, the apparatus 100 for predicting and classifying irregular clinical time series data obtains each feature value by embedding each of the irregular clinical time series data, the observation status data, and the time information data, and uses a self-attention mechanism to obtain the respective feature values. Missing pattern information is generated by integrating the feature values of the observation data and the time information data, and the missing pattern information and the irregular clinical time series data are integrated using a self-attention block, followed by a feed-forward network (FFN). ) module can be applied to generate the deep representation.

단계 515에서 불규칙 임상 시계열 데이터 예측 및 분류 장치(100)는 불규칙한 임상 시계열 데이터와 심층 표현을 이용하여 보조적인 결측 대체값을 생성한다. In step 515, the irregular clinical time series data prediction and classification device 100 generates auxiliary missing replacement values using the irregular clinical time series data and deep expression.

이미 전술한 바와 같이, 불규칙 임상 시계열 데이터 예측 및 분류 장치(100)는 불규칙한 임상 시계열 데이터 중 일부를 마스킹 처리한 후 심층 표현을 이용하여 마스킹 처리된 값을 복원하는 과정을 통해 보조적인 결측 대체값을 생성할 수 있다. As already described above, the irregular clinical time series data prediction and classification device 100 masks some of the irregular clinical time series data and then restores the masked values using deep expression to provide auxiliary missing replacement values. can be created.

단계 520에서 불규칙 임상 시계열 데이터 예측 및 분류 장치(100)는 심층 표현 및 보조적인 결측 대체값을 이용하여 모델(즉, 예측 및 분류기(130))를 학습한다. 이와 같은 학습 과정이 완료되면, 심층 표현을 이용하여 환자 상태 예측 또는 진단을 분류할 수 있다. In step 520, the apparatus 100 for predicting and classifying irregular clinical time series data learns a model (i.e., prediction and classifier 130) using deep expressions and auxiliary missing replacement values. Once this learning process is completed, the patient condition prediction or diagnosis can be classified using the deep expression.

본 발명의 실시 예에 따른 장치 및 방법은 다양한 컴퓨터 수단을 통하여 수행될 수 있는 프로그램 명령 형태로 구현되어 컴퓨터 판독 가능 매체에 기록될 수 있다. 컴퓨터 판독 가능 매체는 프로그램 명령, 데이터 파일, 데이터 구조 등을 단독으로 또는 조합하여 포함할 수 있다. 컴퓨터 판독 가능 매체에 기록되는 프로그램 명령은 본 발명을 위하여 특별히 설계되고 구성된 것들이거나 컴퓨터 소프트웨어 분야 통상의 기술자에게 공지되어 사용 가능한 것일 수도 있다. 컴퓨터 판독 가능 기록 매체의 예에는 하드 디스크, 플로피 디스크 및 자기 테이프와 같은 자기매체(magnetic media), CD-ROM, DVD와 같은 광기록 매체(optical media), 플롭티컬 디스크(floptical disk)와 같은 자기-광 매체(magneto-optical media) 및 롬(ROM), 램(RAM), 플래시 메모리 등과 같은 프로그램 명령을 저장하고 수행하도록 특별히 구성된 하드웨어 장치가 포함된다. 프로그램 명령의 예에는 컴파일러에 의해 만들어지는 것과 같은 기계어 코드뿐만 아니라 인터프리터 등을 사용해서 컴퓨터에 의해서 실행될 수 있는 고급 언어 코드를 포함한다.Devices and methods according to embodiments of the present invention may be implemented in the form of program instructions that can be executed through various computer means and recorded on a computer-readable medium. Computer-readable media may include program instructions, data files, data structures, etc., singly or in combination. Program instructions recorded on a computer-readable medium may be specially designed and constructed for the present invention or may be known and usable by those skilled in the computer software field. Examples of computer-readable recording media include magnetic media such as hard disks, floppy disks, and magnetic tapes, optical media such as CD-ROMs and DVDs, and magnetic media such as floptical disks. -Includes magneto-optical media and hardware devices specifically configured to store and execute program instructions, such as ROM, RAM, flash memory, etc. Examples of program instructions include machine language code, such as that produced by a compiler, as well as high-level language code that can be executed by a computer using an interpreter, etc.

상술한 하드웨어 장치는 본 발명의 동작을 수행하기 위해 하나 이상의 소프트웨어 모듈로서 작동하도록 구성될 수 있으며, 그 역도 마찬가지이다.The hardware devices described above may be configured to operate as one or more software modules to perform the operations of the present invention, and vice versa.

이제까지 본 발명에 대하여 그 실시 예들을 중심으로 살펴보았다. 본 발명이 속하는 기술 분야에서 통상의 지식을 가진 자는 본 발명이 본 발명의 본질적인 특성에서 벗어나지 않는 범위에서 변형된 형태로 구현될 수 있음을 이해할 수 있을 것이다. 그러므로 개시된 실시 예들은 한정적인 관점이 아니라 설명적인 관점에서 고려되어야 한다. 본 발명의 범위는 전술한 설명이 아니라 특허청구범위에 나타나 있으며, 그와 동등한 범위 내에 있는 모든 차이점은 본 발명에 포함된 것으로 해석되어야 할 것이다.So far, the present invention has been examined focusing on its embodiments. A person skilled in the art to which the present invention pertains will understand that the present invention may be implemented in a modified form without departing from the essential characteristics of the present invention. Therefore, the disclosed embodiments should be considered from an illustrative rather than a restrictive perspective. The scope of the present invention is indicated in the claims rather than the foregoing description, and all differences within the equivalent scope should be construed as being included in the present invention.

100: 불규칙 임상 시계열 데이터 예측 및 분류 장치.
110: 다시점 통합 자가주의 모듈
120: 보조 해석부
130: 예측 및 분류기
140: 메모리
150: 프로세서100: Irregular clinical time series data prediction and classification device.
110: Multi-view integrated self-attention module
120: Auxiliary analysis unit
130: Prediction and classifier
140: memory
150: processor

Claims

a multi-view integrated self-attention module that integrates irregular clinical time series data, observational presence data, and temporal information data to generate an in-depth representation of the irregular clinical time series data;
an auxiliary analysis unit that generates auxiliary missing replacement values using the deep expression; and
An irregular clinical time series data prediction and classification device comprising a prediction and classifier that classifies patient condition prediction or diagnosis using the deep representation.

According to claim 1,
Irregular clinical time series data prediction and classification device, characterized in that the auxiliary missing replacement value is further used when learning the prediction and classifier.

According to claim 1,
The multi-view integrated self-attention module is,
Each feature value is obtained by embedding the irregular clinical time series data, the observation data, and the time information data, and the feature values of the observation data and the time information data are integrated using a self-attention mechanism to provide missing pattern information. Generating, integrating the missing pattern information and the irregular clinical time series data using a self-attention block and then applying it to a feed-forward network (FFN) module to generate the deep representation. Time series data prediction and classification device.

According to claim 1,
The auxiliary analysis unit,
An irregular clinical time series data prediction and classification device, characterized in that it masks some of the irregular clinical time series data and then restores the masked values using the deep expression to generate auxiliary missing replacement values.

In the irregular clinical time series data prediction and classification method performed in the irregular clinical time series data prediction and classification device,
(a) integrating irregular clinical time series data, observational availability data, and temporal information data to generate a deep representation of the irregular clinical time series data;
(b) masking some of the irregular clinical time series data and then restoring the masked values using the deep representation to generate auxiliary missing replacement values; and
(c) a method for predicting and classifying irregular clinical time series data, comprising the step of training a prediction and classifier model for classifying patient condition prediction or diagnosis using the deep representation and the auxiliary missing replacement values.

According to clause 5,
In step (a),
Obtaining feature values for each of the irregular clinical time series data, the observation status data, and the time information data;
generating missing pattern information by integrating feature values of the observation status data and the time information data based on a first self-attention module; and
Integrating the feature values of the irregular clinical time series data and the missing pattern information based on a second self-attention module and then applying it to a feed-forward network (FFN) to generate a deep representation of the missing pattern. A method for predicting and classifying irregular clinical time series data.