KR20230013485A

KR20230013485A - Method and apparatus for predicting and classifying irregular clinical time-series data

Info

Publication number: KR20230013485A
Application number: KR1020210094286A
Authority: KR
Inventors: 석흥일; 이유림
Original assignee: 고려대학교 산학협력단
Priority date: 2021-07-19
Filing date: 2021-07-19
Publication date: 2023-01-26
Also published as: KR102635609B1

Abstract

Disclosed are a method and a device for predicting and classifying irregular clinical time-series data. The device for predicting and classifying irregular clinical time-series data includes: a multi-view integration self-attention module for generating a deep expression of irregular clinical time-series data by integrating the irregular clinical time-series data, observation/absence data, and temporal information data; an auxiliary analysis unit for generating an auxiliary missing replacement value using the deep expression; and a predictor and classifier for classifying a patient condition prediction or diagnosis using the deep expression.

Description

Method and apparatus for predicting and classifying irregular clinical time-series data

본 발명은 불규칙 임상 시계열 데이터 예측 및 분류 방법, 및 장치에 관한 것이다. The present invention relates to a method and apparatus for predicting and classifying irregular clinical time series data.

전자 건강 데이터(EHR: electronic health data, 이하 EHR이라 칭하기로 함)는 진단, 활력 징후, 실험실 테스트 기록, 절차 및 텍스트 설명과 같은 환자의 건강 데이터 모음이다. 최근 10년 동안 풍부한 건강 데이터 세트로 인해 다변수 EHR 시계열을 사용하여 딥 러닝 모델을 개발하는데 상당한 진전이 있었다. Electronic health data (EHR) is a collection of patient health data such as diagnoses, vital signs, laboratory test records, procedures and text descriptions. In the recent decade, significant progress has been made in developing deep learning models using multivariate EHR time series due to rich health data sets.

특히 EHR을 포함하는 임상 데이터는 시간적, 측정 변수적으로 큰 불규칙성을 보일 뿐만 아니라 높은 결측률로 인해 환자 상태를 예측하고 분류하는데 낮은 성능을 보이고 있다. In particular, clinical data including EHR not only show large irregularities in terms of temporal and measurement variables, but also show low performance in predicting and classifying patient conditions due to high missing rates.

기계학습 모델을 이용하여 측정값의 특징을 추출하여 예측하는 연구가 많이 진행되었지만, 결측값의 대체나 불규칙 데이터를 규칙형으로 바꾸는 등의 추가적인 데이터의 전처리가 이루어져야 한다는 한계가 존재한다.Although many studies have been conducted to extract and predict the characteristics of measured values using machine learning models, there is a limitation that additional data preprocessing, such as replacing missing values or changing irregular data into regular data, exists.

최근의 딥러닝 기법은 특징의 추출과 예측 및 분류를 동시에 할 수 있다는 장점이 있기 때문에 다양한 자연어처리 등의 다른 분야에서 각광을 받았고 동시에 많은 발전을 보였다. 그러나 딥러닝 기법을 임상 시계열 데이터 분류 분야에서의 예측 및 분류에 사용하기에는 불규칙성이나 높은 결측률 등의 요인 때문에 한계가 존재한다. Recent deep learning techniques have the advantage of being able to simultaneously extract, predict, and classify features, so they have been in the limelight in other fields such as various natural language processing and have shown many developments at the same time. However, there are limitations in using deep learning techniques for prediction and classification in the field of clinical time series data classification due to factors such as irregularity or high missing rate.

본 발명은 불규칙 임상 시계열 데이터 예측 및 분류 방법, 및 장치를 제공하기 위한 것이다. The present invention is to provide a method and apparatus for predicting and classifying irregular clinical time series data.

또한, 본 발명은 자가주의 매커니즘(self-attention mechanism)을 기반으로 결측값이 높고 불규칙적인 임상 데이터와 이에 포함된 결측 패턴 정보를 함께 모델링하여 예측 및 분류 결과 제공이 가능한 불규칙 임상 시계열 데이터 예측 및 분류 방법, 및 장치를 제공하기 위한 것이다. In addition, the present invention predicts and classifies irregular clinical time series data that can provide prediction and classification results by modeling irregular clinical data with high missing values and missing pattern information included therein based on a self-attention mechanism. It is to provide a method and apparatus.

또한, 본 발명은 불규칙적 임상 시계열 데이터와 그에서 파생된 결측 패턴에 대한 정보를 자가주의 메커니즘 기반의 방법으로 통합하여 데이터에 대한 추가적인 전처리 없이 예측 및 분류의 성능을 향상시킬 수 있는 불규칙 임상 시계열 데이터 예측 및 분류 방법, 및 장치를 제공하기 위한 것이다. In addition, the present invention predicts irregular clinical time series data that can improve the performance of prediction and classification without additional preprocessing of data by integrating irregular clinical time series data and information on missing patterns derived therefrom into an autologous mechanism-based method. and a classification method, and apparatus.

본 발명의 일 측면에 따르면, 불규칙 임상 시계열 데이터 예측 및 분류 장치가 제공된다. According to one aspect of the present invention, an apparatus for predicting and classifying irregular clinical time series data is provided.

본 발명의 일 실시예에 따르면, 규칙한 임상 시계열 데이터, 관측 여부 데이터 및 시간 정보 데이터를 통합하여 상기 불규칙한 임상 시계열 데이터에 대한 심층 표현을 생성하는 다시점 통합 자가주의 모듈; 상기 심층 표현을 이용하여 보조적인 결측 대체값을 생성하는 보조 해석부; 및 상기 심층 표현을 이용하여 환자 상태 예측 또는 진단을 분류하는 예측 및 분류기를 포함하는 불규칙 임상 시계열 데이터 예측 및 분류 장치가 제공될 수 있다. According to an embodiment of the present invention, a multi-view integration self-attention module generating an in-depth representation of the irregular clinical time series data by integrating regular clinical time series data, observation data, and time information data; an auxiliary analyzer generating an auxiliary missing replacement value using the deep expression; and a predictor and classifier for classifying patient condition prediction or diagnosis using the deep expression.

상기 예측 및 분류기 학습시, 상기 보조적인 결측 대체값이 더 이용될 수 있다. In the prediction and training of the classifier, the auxiliary missing replacement value may be further used.

상기 다시점 통합 자가주의 모듈은, 상기 불규칙 임상 시계열 데이터, 상기 관측 여부 데이터 및 상기 시간 정보 데이터 각각을 임베딩하여 각각의 특징값을 구하고, 자가주의 매커니즘을 이용하여 상기 관측 여부 데이터와 상기 시간 정보 데이터의 특징값을 통합하여 결측 패턴 정보를 생성하고, 상기 결측 패턴 정보와 상기 불규칙 임상 시계열 데이터를 자가주의 블록을 이용하여 통합한 후 순방향 신경망(FFN: feed-forward network) 모듈에 적용하여 상기 심층 표현을 생성할 수 있다. The multi-view integrated self-attention module obtains a feature value by embedding each of the irregular clinical time-series data, the observation/nonexistence data, and the time information data, and uses the self-attention mechanism to obtain the observation/nonexistence data and the time information data. Missing pattern information is generated by integrating feature values of , and after integrating the missing pattern information and the irregular clinical time series data using an autologous attention block, it is applied to a feed-forward network (FFN) module to express the depth. can create

상기 보조 해석부는, 상기 불규칙한 임상 시계열 데이터 중 일부를 마스킹한 후 상기 심층 표현을 이용하여 마스킹된 값을 복원하여 보조적인 결측 대체값을 생성할 수 있다. After masking some of the irregular clinical time series data, the auxiliary analysis unit may generate an auxiliary missing replacement value by restoring the masked value using the deep expression.

본 발명의 다른 측면에 따르면, 불규칙 임상 시계열 데이터 예측 및 분류 방법이 제공된다. According to another aspect of the present invention, a method for predicting and classifying irregular clinical time series data is provided.

본 발명의 일 실시예에 따르면, (a) 불규칙한 임상 시계열 데이터, 관측 여부 데이터 및 시간 정보 데이터를 통합하여 상기 불규칙한 임상 시계열 데이터에 대한 심층 표현을 생성하는 단계; (b) 상기 불규칙한 임상 시계열 데이터 중 일부를 마스킹한 후 상기 심층 표현을 이용하여 마스킹된 값을 복원하여 보조적인 결측 대체값을 생성하는 단계; 및 (c) 상기 심층 표현 및 상기 보조적인 결측 대체값을 이용하여 환자 상태 예측 또는 진단을 분류하는 예측 및 분류기 모델을 학습시키는 단계를 포함하는 불규칙 임상 시계열 데이터 예측 및 분류 방법이 제공될 수 있다. According to an embodiment of the present invention, (a) generating an in-depth representation of the irregular clinical time series data by integrating irregular clinical time series data, observation data, and time information data; (b) generating an auxiliary missing replacement value by masking some of the irregular clinical time series data and restoring the masked value using the deep representation; and (c) training a predictor and classifier model for classifying patient condition prediction or diagnosis using the deep representation and the auxiliary missing replacement value.

상기 (a) 단계는, 상기 불규칙 임상 시계열 데이터, 상기 관측 여부 데이터 및 상기 시간 정보 데이터 각각을 임베딩하여 각각의 특징값을 구하는 단계; 제1 자가주의 모듈을 기반으로 상기 관측 여부 데이터와 상기 시간 정보 데이터의 특징값을 통합하여 결측 패턴 정보를 생성하는 단계; 및 제2 자가주의 모듈을 기반으로 상기 불규칙한 임상 시계열 데이터의 특징값과 상기 결측 패턴 정보를 통합한 후 순방향 신경망(FFN: feed-forward network)에 적용하여 결측 패턴에 대한 심층 표현을 생성하는 단계를 포함할 수 있다. Step (a) may include: embedding the irregular clinical time-series data, the observation data, and the time information data to obtain respective feature values; generating missing pattern information by integrating feature values of the observation data and the time information data based on a first self-attention module; and integrating the feature values of the irregular clinical time series data and the missing pattern information based on a second self-attention module, and applying the result to a feed-forward network (FFN) to generate a deep representation of the missing pattern. can include

본 발명의 일 실시예에 따른 불규칙 임상 시계열 데이터 분류, 예측 방법 및 장치를 제공함으로써, 자가주의 매커니즘(self-attention mechanism)을 기반으로 결측값이 높고 불규칙적인 임상 데이터와 이에 포함된 결측 패턴 정보를 함께 모델링하여 정확한 예측 및 분류 결과 제공이 가능한 이점이 있다. By providing a method and apparatus for classifying and predicting irregular clinical time series data according to an embodiment of the present invention, irregular clinical data with high missing values and missing pattern information included therein based on a self-attention mechanism are provided. It has the advantage of being able to provide accurate prediction and classification results by modeling together.

또한, 본 발명은 불규칙적 임상 시계열 데이터와 그에서 파생된 결측 패턴에 대한 정보를 자가주의 메커니즘 기반의 방법으로 통합하여 데이터에 대한 추가적인 전처리 없이 예측 및 분류의 성능을 향상시킬 수 있는 이점이 있다. In addition, the present invention has the advantage of improving the performance of prediction and classification without additional preprocessing of data by integrating irregular clinical time series data and information on missing patterns derived therefrom into an autologous mechanism-based method.

도 1은 본 발명의 일 실시예에 따른 불규칙한 임상 시계열 데이터 예측 및 분류를 위한 장치의 구성을 개략적으로 도시한 도면.
도 2는 본 발명의 일 실시예에 따른 다시점 통합 자가주의 모듈의 내부 구성을 개략적으로 도시한 도면.
도 3은 본 발명의 일 실시예에 따른 보조 해석부의 내부 구성을 개략적으로 도시한 블록도.
도 4는 본 발명의 일 실시예에 따른 불규칙한 임상 시계열 데이터 예측 및 분류 방법을 위한 의사 코드를 도시한 도면.
도 5는 본 발명의 일 실시예에 따른 불규칙 임상 시계열 데이터 분류, 예측 방법을 나타낸 순서도. 1 is a diagram schematically showing the configuration of an apparatus for predicting and classifying irregular clinical time-series data according to an embodiment of the present invention.
2 is a diagram schematically showing the internal configuration of a multi-view integration self-attention module according to an embodiment of the present invention.
Figure 3 is a block diagram schematically showing the internal configuration of the secondary analysis unit according to an embodiment of the present invention.
4 is a diagram showing pseudo code for a method for predicting and classifying irregular clinical time series data according to an embodiment of the present invention.
5 is a flowchart illustrating a method of classifying and predicting irregular clinical time series data according to an embodiment of the present invention.

본 명세서에서 사용되는 단수의 표현은 문맥상 명백하게 다르게 뜻하지 않는 한, 복수의 표현을 포함한다. 본 명세서에서, "구성된다" 또는 "포함한다" 등의 용어는 명세서상에 기재된 여러 구성 요소들, 또는 여러 단계들을 반드시 모두 포함하는 것으로 해석되지 않아야 하며, 그 중 일부 구성 요소들 또는 일부 단계들은 포함되지 않을 수도 있고, 또는 추가적인 구성 요소 또는 단계들을 더 포함할 수 있는 것으로 해석되어야 한다. 또한, 명세서에 기재된 "...부", "모듈" 등의 용어는 적어도 하나의 기능이나 동작을 처리하는 단위를 의미하며, 이는 하드웨어 또는 소프트웨어로 구현되거나 하드웨어와 소프트웨어의 결합으로 구현될 수 있다.Singular expressions used herein include plural expressions unless the context clearly dictates otherwise. In this specification, terms such as "consisting of" or "comprising" should not be construed as necessarily including all of the various components or steps described in the specification, and some of the components or some of the steps It should be construed that it may not be included, or may further include additional components or steps. In addition, terms such as "...unit" and "module" described in the specification mean a unit that processes at least one function or operation, which may be implemented as hardware or software or a combination of hardware and software. .

이하, 첨부된 도면들을 참조하여 본 발명의 실시예를 상세히 설명한다. Hereinafter, embodiments of the present invention will be described in detail with reference to the accompanying drawings.

도 1은 본 발명의 일 실시예에 따른 불규칙한 임상 시계열 데이터 예측 및 분류를 위한 장치의 구성을 개략적으로 도시한 도면이고, 도 2는 본 발명의 일 실시예에 따른 다시점 통합 자가주의 모듈의 내부 구성을 개략적으로 도시한 도면이고, 도 3은 본 발명의 일 실시예에 따른 보조 해석부의 내부 구성을 개략적으로 도시한 블록도이며, 도 4는 본 발명의 일 실시예에 따른 불규칙한 임상 시계열 데이터 예측 및 분류 방법을 위한 의사 코드를 도시한 도면이다. 1 is a diagram schematically showing the configuration of an apparatus for predicting and classifying irregular clinical time-series data according to an embodiment of the present invention, and FIG. 2 is an interior of a multi-view integrated self-attention module according to an embodiment of the present invention. Figure 3 is a block diagram schematically showing the internal structure of the auxiliary analysis unit according to an embodiment of the present invention, Figure 4 is a diagram showing irregular clinical time series data prediction according to an embodiment of the present invention and a pseudo code for a classification method.

도 1을 참조하면, 본 발명의 일 실시예에 따른 불규칙 임상 시계열 데이터 예측 및 분류 장치(100)는 다시점 통합 자가주의 모듈(110), 보조 해석부(120), 예측 및 분류기(130), 메모리(140) 및 프로세서(150)를 포함하여 구성된다. Referring to FIG. 1 , an apparatus 100 for predicting and classifying irregular clinical time series data according to an embodiment of the present invention includes a multi-view integration self-attention module 110, an auxiliary analysis unit 120, a prediction and classifier 130, It is configured to include a memory 140 and a processor 150 .

다시점 통합 자가주의 모듈(110)은 불규칙한 임상 시계열 데이터, 관측 여부 데이터, 시간 정보 데이터를 각각 입력 데이터로 입력받은 후 이를 순차적으로 통합하여 특징을 찾은 후 순방향 신경망 모델에 적용하여 불규칙한 임상 시계열 데이터에 대한 심층 표현을 생성하기 위한 수단이다. The multi-view integration self-attention module 110 receives irregular clinical time series data, observation data, and time information data as input data, sequentially integrates them to find features, and applies them to a forward neural network model to determine irregular clinical time series data. It is a means for creating in-depth representations of

다시점 통합 자가주의 모듈(110)은 데이터 임베딩부(210), 통합 모듈(220) 및 순방향 신경망 모듈(230)을 포함하여 구성된다. The multi-view integration self-attention module 110 includes a data embedding unit 210, an integration module 220, and a forward neural network module 230.

각 서브젝트 n에 대해,

길이의 시점

의 D-차원 다변량 시계열 집합이 주어지면, 관측 시계열을

와 같이 나타내기로 한다. 여기서,

는 모든 변수의 t번째 관측을 나타내고,

는

내의 d번째 엘리먼트를 나타낸다. For each subject n,

point of length

Given a set of D-dimensional multivariate time series of

to be expressed as here,

denotes the tth observation of all variables,

Is

Indicates the d-th element in

시계열 데이터

은 결측값(missing values)를 포함하기 때문에, 시계열 데이터에 대해 관측 또는 결측 변수를 마크하기 위해

과 동일한 사이즈를 가지는 관측 여부 정보(즉, 결측 지표(missing indicator))

를 도입하기로 한다. time series data

to mark observed or missing variables for time series data, since they contain missing values.

Observation information having the same size as (i.e., missing indicator)

to introduce

관측 여부 데이터는 만일

이 관측되면,

이며, 그렇지 않은 경우

과 같이 생성될 수 있다. 또한, 불규칙 임상 시계열 데이터의 관측이 결측되는 경우, 입력은

내에 상응하는 요소에 대해 제로(0)으로 설정될 수 있다. 각 변수 d에 대해 시간 간격을

로 나타내기로 한다. 여기서,

은 수학식 1과 같이 정의될 수 있다. Observation data is

If this is observed,

and, if not

can be created as In addition, if observations of irregular clinical time series data are missing, the input is

It can be set to zero (0) for the corresponding element in For each variable d, the time interval

to be represented by here,

Can be defined as in Equation 1.

불규칙 임상 시계열 데이터셋이

와 같이 주어지면, 예측 라벨

정의를 위한 매핑 함수 구축이 가능하다. Irregular clinical time series dataset

Given as, the predicted label

It is possible to build a mapping function for definition.

데이터 임베딩부(210)는 각 입력 데이터(즉, 불규칙한 임상 시계열 데이터, 관측 여부 데이터, 시간 정보 데이터)를 각각 임베딩하여 특징값을 도출한다. 다시점 통합 자가주의 모듈(110)은 연속적인 시간 값을 입력으로 취하는 위치 인코딩의 변형으로 시간 임베딩(TE: time embedding)을 사용하고, 시간 임베딩을 인코딩 벡터 표현으로 변환할 수 있다. The data embedding unit 210 derives feature values by embedding each input data (ie, irregular clinical time series data, observation data, and time information data). The multi-view integration self-attention module 110 may use time embedding (TE) as a variant of positional encoding that takes continuous time values as input, and converts the time embedding into an encoding vector representation.

이를 통해, 정확한 시점과 시간 간격을 고려하여 불규칙하게 샘플링된 임상 시계열 데이터를 처리할 수 있다. Through this, it is possible to process irregularly sampled clinical time series data in consideration of the exact time point and time interval.

시간 임베딩은 수학식 2와 같이 나타낼 수 있다. Time embedding can be expressed as in Equation 2.

여기서, t, d,

및

는 각각 정확한 시간 포인트, 변수의 인덱스, 모델 차원, 데이터의 최대 시간 길이를 나타낸다. 시간 임베딩은 학습된 입력 임베딩에 추가될 수 있다. 이를 수학식으로 나타내면 수학식 3 내지 수학식 5와 같다. Here, t, d,

and

represents the exact time point, the index of the variable, the model dimension, and the maximum time length of the data, respectively. Temporal embeddings can be added to learned input embeddings. If this is expressed as an equation, it is equivalent to Equations 3 to 5.

여기서,

는 각각 다시점 입력에 대한 상응하는 임베딩 가중치 행렬을 나타낸다. here,

denotes the corresponding embedding weight matrix for each multi-view input.

종래에는 결측 지표와 시간 간격과 같은 결측치의 소스를 활용할뿐, 결측 데이터의 표현(representation)을 학습하지 않음으로써, 결측에 대한 부적절한 모델링으로 인해 신뢰할 수 없는 평가와 측정 변경에 강건하지 않은 모델 생성으로 이어지는 문제점이 있었다. In the past, only missing data sources such as missing indexes and time intervals were utilized, but representations of missing data were not learned, resulting in unreliable evaluation due to inadequate modeling of missing data and creation of models that were not robust to measurement changes. There was a follow up problem.

따라서, 본 발명의 일 실시예에서는 결측 지표와 시간 간격을 모두 효과적으로 활용하여 불규칙한 임상 시계열 데이터의 심층 표현을 학습할 수 있다. Therefore, in an embodiment of the present invention, it is possible to learn a deep representation of irregular clinical time series data by effectively utilizing both the missing indicator and the time interval.

모든 시간에서, D 측정값이 주어지는 경우, 다시점 통합 자가주의 모듈(110)은 불규칙 임상 시계열 데이터, 관측 여부 데이터 및 시간 정보 데이터에 대한 입력 데이터의 임베딩하고, 이를 통해 각 입력 데이터(즉, 불규칙 임상 시계열 데이터, 관측 여부 데이터 및 시간 정보 데이터)에 대한 특징값을 구할 수 있다. At all times, when the measured value D is given, the multi-view integrated self-attention module 110 embeds input data for irregular clinical time series data, observation data, and time information data, and through this, each input data (i.e., irregular feature values for clinical time series data, observation data, and time information data) can be obtained.

다시점 통합 자가주의 모듈(110)은 쿼리(Q), 키(K) 및 값(V) 집합에 대해 스케일된 도트 프로덕트 어텐션(scaled dot-product attention)이 계산되는 다중헤드 자가주의 블록(MHA: multi-head attention)을 기반으로 한다. The multi-view integrated self-attention module 110 is a multi-headed self-attention block (MHA: It is based on multi-head attention.

자가주의 블록에 기초하여 관측 X를 포함하는 다시점 불규칙 임상 시계열 데이터의 어텐션 표현(attention representation)이 학습될 수 있다. An attention representation of multi-view irregular clinical time series data including observation X may be learned based on the self-attention block.

예를 들어, 각 입력 집합

은 자가주의 블록을 통해 자신의 표현

을 학습할 수 있다. 여기서, 각 데이터 포인트는 자체 가중치 행렬과 선형적으로 결합되어 해당하는 Q, K 및 V로 공급될 수 있다. For example, each input set

self-expression through self-attention blocks

can learn Here, each data point can be linearly combined with its own weight matrix and fed into the corresponding Q, K and V.

각 입력 데이터에 대한 어텐션 표현은 수학식 6 내지 수학식 8과 같이 수학식으로 정리될 수 있다. The expression of attention for each input data can be organized into equations such as Equations 6 to 8.

여기서, W는 학습 가능한 가중치 행렬의 집합을 나타내고,

는 SoftMax 활성화 함수를 나타내고,

는 키 벡터의 차원을 나타낸다. Here, W represents a set of learnable weight matrices,

denotes the SoftMax activation function,

represents the dimension of the key vector.

통합 모듈(220)은 자가주의 블록에 의존하며, 입력 데이터의 특징값들을 통합하기 위한 수단이다. The integration module 220 relies on the autoattention block and is a means for integrating the feature values of the input data.

통합 모듈(220)은 입력 데이터 중 관측 여부 데이터와 시간 정보 데이터의 임베딩된 특징값을 통합하여 복잡한 결측 패턴 정보를 생성할 수 있다. 또한, 통합 모듈(220)은 불규칙한 임상 시계열 데이터의 특징값과 결측 패턴 정보를 표현 공간에서 결합할 수 있다. The integration module 220 may generate complex missing pattern information by integrating feature values embedded in observation data and time information data among input data. In addition, the integration module 220 may combine feature values of irregular clinical time series data and missing pattern information in the expression space.

이와 같이, 결측 자체로부터 기본 표현을 학습하면 결측값을 전가할 필요가 없으며, 어떠한 휴리스틱한 함수를 특정할 필요가 없다. In this way, if the basic expression is learned from the missing value itself, there is no need to impute the missing value, and there is no need to specify any heuristic function.

따라서, 통합 모듈(220)은 두 단계의 통합 과정(제1 통합 단계, 제2 통합 단계)가 수반될 수 있다. 이에 대해 보다 상세히 설명하기로 한다. Accordingly, the integration module 220 may involve a two-step integration process (a first integration step and a second integration step). This will be described in more detail.

즉, 제1 통합 단계에서, 통합 모듈(220)은 제1 자가주의 블록을 이용하여 시간 정보 데이터에 기초한 시간 간격의 표현과 관측 여부 데이터에 기반한 결측 지표의 표현을 통합하여 결측 패턴의 표현(

)을 획득할 수 있다. 이를 수학식으로 나타내면 수학식 9와 같다. That is, in the first integration step, the integration module 220 integrates the expression of the time interval based on the time information data and the expression of the missing indicator based on the observation data using the first self-attention block to express the missing pattern (

) can be obtained. If this is expressed as an equation, it is equal to Equation 9.

제2 통합 단계에서, 통합 모듈(220)은 제2 자가주의 블록을 이용하여 불규칙한 임상 시계열 데이터와 결측 패턴 정보의 표현을 통합할 수 있다. 이를 수학식으로 나타내면 수학식 10과 같다. In the second integration step, the integration module 220 may integrate irregular clinical time series data and representation of missing pattern information using the second autologous attention block. If this is expressed as an equation, it is equivalent to Equation 10.

상술한 바와 같이, 통합 모듈(220)은 복수의 자가주의 모듈을 기반으로, 입력 데이터들을 순차적으로 통합할 수 있다. As described above, the integration module 220 may sequentially integrate input data based on a plurality of self-attention modules.

통합 모듈(220)에 의해 입력 데이터들이 순차적으로 통합된 결과는 순방향 신경망 모듈(230)로 입력되며, 순방향 신경망 모듈(230)을 통해 불규칙한 임상 시계열 데이터와 통합된 결측 패턴 간의 관계를 모델링하여 공동 학습된 심층 표현을 출력할 수 있다.The result of sequential integration of the input data by the integration module 220 is input to the forward neural network module 230, and joint learning is performed by modeling the relationship between the irregular clinical time series data and the integrated missing pattern through the forward neural network module 230. In-depth expression can be output.

순방향 신경망 모듈(230)은 변수간의 종속성을 모델링하도록 각 시점에 동일하게 적용될수 있다. The forward neural network module 230 can be equally applied at each point in time to model dependencies between variables.

최종 심층 표현

은 보조적인 결측 대체값 생성 및 환자 상태 예측 또는 진단을 분류에 이용될 수 있다. final deep expression

can be used to generate auxiliary missing replacement values and classify patient condition prediction or diagnosis.

보조 해석부(120)는 다시점 통합 자가주의 모듈(110)에서 출력된 심층 표현(

)을 이용하여 보조적인 결측 대체값을 생성하기 위한 수단이다. The auxiliary analysis unit 120 is a deep expression output from the multi-view integration self-attention module 110 (

) is used to generate an auxiliary missing replacement value.

보조 해석부(120)는 불규칙한 임상 시계열 데이터의 일부를 고의로 마스킹한 후 다시점 통합 자가주의 모듈(110)에서 생성된 최종 심층 표현을 이용하여 마스킹한 값을 복원하는 과정을 통해 보조적인 결측 대체값을 생성할 수 있다. The auxiliary analysis unit 120 deliberately masks part of the irregular clinical time series data, and then restores the masked value using the final deep expression generated by the multi-view integration self-attention module 110 to obtain an auxiliary missing replacement value. can create

보조 해석부(120)의 상세 구조는 도 3에 도시된 바와 같다.A detailed structure of the auxiliary analysis unit 120 is as shown in FIG. 3 .

도 3에 도시된 바와 같이, 보조 해석부(120)는 다중헤드 자가주의 모듈(310)과 순방향 신경망 모듈(320)을 포함하여 구성된다. As shown in FIG. 3 , the auxiliary analysis unit 120 includes a multi-head self-attention module 310 and a forward neural network module 320 .

보조 해석부(120)는 쿼리로서 다시점 통합 자가주의 모듈(110)에서 출력된 심층 표현(

)과 키와 값으로 불규칙 임상 시계열 데이터(

) 사이의 다중헤드 자가주의 모듈(310)을 적용하고, 순방향 신경망 모듈(320)을 뒤에 배치할 수 있다. The auxiliary analysis unit 120 outputs the deep expression from the multi-view integration self-attention module 110 as a query (

) and irregular clinical time series data as keys and values (

), and the forward neural network module 320 can be placed behind.

출력 계층은 학습된 임베딩 행렬(

)을 사용하여 타겟 시간 시퀀스에 매핑되고, 대치된 데이터(

)를 생성할 수 있다. 이를 수학식으로 나타내면 수학식 11과 같다. The output layer is the learned embedding matrix (

) to the target time sequence, and the imputed data (

) can be created. This is expressed as Equation 11.

마스킹된 값을 표시하기 위해 도입된 다른 마스킹 벡터

가 주어지면, 전가 손실

은 불규칙한 임상 시계열 데이터(X)와

에 의해 마스킹된 값을 위한 전가된 샘플(

) 사이의 마스킹된 평균 제곱 에러(MSE: mean squared error)에 의해 계산될 수 있다. Another masking vector introduced to represent the masked value

Given , the imputed loss

is irregular clinical time series data (X) and

Imputed samples for values masked by (

) can be calculated by the masked mean squared error (MSE) between

이를 수학식으로 나타내면, 수학식 12와 같다. If this is expressed as an equation, it is equivalent to Equation 12.

목적 함수에 전가 손실을 도입하여 최적의 예측 결과를 달성하기 위한 보조적인 정보 제공이 가능케 할 수 있다. By introducing imputation loss into the objective function, it is possible to provide auxiliary information for achieving optimal prediction results.

합성 손실은 예측 및 전가 손실을 누적하여 수학식 13과 같이 정의될 수 있다. The synthetic loss can be defined as Equation 13 by accumulating prediction and imputation losses.

여기서,

와

는 두 손실 사이의 비율 조절을 위한 하이퍼매개변수를 나타낸다. here,

Wow

represents a hyperparameter for adjusting the ratio between the two losses.

합성 손실(

)을 통해 단대단(end-to-end) 방식으로 모델의 모든 매개변수가 최적화실 수 있다. composite loss (

) allows all parameters of the model to be optimized in an end-to-end manner.

예측 및 분류기(130)는 다시점 통합 자가주의 모듈(110)로부터 출력된 최종 심층 표현을 기반으로 환자 상태 예측 또는 진단을 분류하기 위한 수단이다.The predictor and classifier 130 is a means for classifying a patient condition prediction or diagnosis based on the final deep expression output from the multi-view integration autoattention module 110 .

예측 및 분류기(130)는 학습시 심층 표현과 보조 해석부(120)에 의해 출력된 보조적인 결측 대체값을 이용하여 학습될 수 있다. 예측 및 분류기(130)는 심층 표현과 보조적인 결측 대체값을 이용하여 보조적인 결측 대체값에 대한 손실값이 최소가 되도록 학습될 수 있다. The predictor and classifier 130 may be trained using a deep expression and an auxiliary missing replacement value output by the auxiliary analyzer 120 during learning. The predictor and classifier 130 may be trained to minimize the loss value for the auxiliary missing replacement value using the deep representation and the auxiliary missing replacement value.

보조적인 결측 대체값은 예측 및 분류기(130)의 학습에만 이용되며, 학습이 완료된 이후에는 이용되지 않을 수 있다. The auxiliary missing replacement value is used only for learning of the predictor and classifier 130, and may not be used after learning is completed.

예측 및 분류기(130)는 심층 표현을 기반으로 환자 상태를 예측하거나 진단을 분류할 수 있다. The predictor and classifier 130 may predict a patient condition or classify a diagnosis based on the deep representation.

예를 들어, 예측 및 분류기(130)는 병원내 사망률이나 LOS 예측을 수행할 수 있다. For example, predictor and classifier 130 may perform in-hospital mortality or LOS prediction.

병원내 사망률 예측은 환자가 입원 기간 동안 사망했는지 또는 퇴원할 예정인지를 나타내는 이진 분류로, 자에 대한 타겟 라벨은

과 같다. In-hospital mortality prediction is a binary classification that indicates whether a patient has died during hospitalization or is expected to be discharged, with a target label for

Same as

LOS 예측은 환자의 입원과 입원 종료 사이의 일수를 예측하는 회귀 작업일 수 있다. LOS prediction can be a regression task that predicts the number of days between a patient's hospitalization and the end of hospitalization.

최종 심층 표현(

)이 주어지면, 타임스탬프에 대한 평균 풀링을 수행하고, 풀링된 최종 표현(

) 다음에 다층 퍼셉트론(MLP)가 뒤 따른다. 최종 계층은 특정 작업에 따라 달라지며 분류 및 다중 레이블 분류에 시그모이드 계층이 사용될 수 있다. final deep expression (

), perform mean pooling over timestamps, and pool the final expression (

) followed by the multilayer perceptron (MLP). The final layer depends on the specific task, and sigmoid layers can be used for classification and multi-label classification.

예측 및 분류기(130)는 의료 데이터의 특성상 불균형한 데이터에서의 열악한 분류 성능 문제를 해결하기 위해 각 환자의 타겟 레이블(

)과 예측된 레이블(

) 사이의 분류 손실을 계산하기 위해 수학식 14 내지 수학식 16과 같은 목적 함수를 이용할 수 있다. The predictor and classifier 130 is a target label (

) and predicted labels (

), objective functions such as Equations 14 to 16 may be used to calculate the classification loss between

이진 분류 손실 수학식 14를 이용하여 계산될 수 있다. The binary classification loss can be calculated using Equation 14.

회귀 손실은 수학식 15를 이용하여 계산될 수 있다. The regression loss can be calculated using Equation 15.

다중 레벨 분류 손실은 수학식 16을 이용하여 계산될 수 있다. The multi-level classification loss can be calculated using Equation 16.

여기서, N은 전체 환자의 수를 나타내고, K는 데이터베이스내의 라벨의 전체 수를 나타내며,

는 마이너리티 클래스를 위한 포커싱 매개변수를 나타내며,

는 클래스간의 중요도 균형을 위한 가중치를 나타낸다. where N represents the total number of patients, K represents the total number of labels in the database,

represents the focusing parameter for the minority class,

represents the weight for the balance of importance between classes.

메모리(140)는 본 발명의 일 실시예에 따른 불규칙 임상 시계열 데이터 분류, 예측 방법을 수행하기 위해 필요한 다양한 명령어를 저장한다. The memory 140 stores various commands required to perform the method of classifying and predicting irregular clinical time series data according to an embodiment of the present invention.

프로세서(150)는 본 발명의 일 실시예에 따른 불규칙 임상 시계열 데이터 예측 및 분류 장치의 내부 구성 요소들(예를 들어, 다시점 통합 자가주의 모듈(110), 보조 해석부(120), 예측 및 분류기(130), 메모리(140) 등)을 제어하기 위한 수단이다. 도 4에는 본 발명의 일 실시예에 따른 불규칙 임상 시계열 데이터 분류, 예측 방법에 대한 의사 코드가 예시되어 있다. The processor 150 includes internal components (eg, the multi-view integration self-attention module 110, the auxiliary analysis unit 120, prediction and It is a means for controlling the classifier 130, memory 140, etc.). 4 illustrates pseudo code for a method for classifying and predicting irregular clinical time series data according to an embodiment of the present invention.

도 5는 본 발명의 일 실시예에 따른 불규칙 임상 시계열 데이터 분류, 예측 방법을 나타낸 순서도이다. 5 is a flowchart illustrating a method of classifying and predicting irregular clinical time series data according to an embodiment of the present invention.

단계 510에서 불규칙 임상 시계열 데이터 예측 및 분류 장치(100)는 불규칙한 임상 시계열 데이터, 관측 여부 데이터 및 시간 정보 데이터를 통합하여 상기 불규칙한 임상 시계열 데이터에 대한 심층 표현을 생성한다. In step 510, the apparatus 100 for predicting and classifying irregular clinical time series data generates an in-depth representation of the irregular clinical time series data by integrating the irregular clinical time series data, observation data, and time information data.

이미 전술한 바와 같이, 불규칙 임상 시계열 데이터 예측 및 분류 장치(100)는 불규칙 임상 시계열 데이터, 상기 관측 여부 데이터 및 상기 시간 정보 데이터 각각을 임베딩하여 각각의 특징값을 구하고, 자가주의 매커니즘을 이용하여 상기 관측 여부 데이터와 상기 시간 정보 데이터의 특징값을 통합하여 결측 패턴 정보를 생성하고, 상기 결측 패턴 정보와 상기 불규칙 임상 시계열 데이터를 자가주의 블록을 이용하여 통합한 후 순방향 신경망(FFN: feed-forward network) 모듈에 적용하여 상기 심층 표현을 생성할 수 있다. As already described above, the apparatus 100 for predicting and classifying irregular clinical time series data obtains each characteristic value by embedding the irregular clinical time series data, the observation data, and the time information data, and uses a self-attention mechanism to Missing pattern information is generated by integrating feature values of the observation data and the time information data, and after integrating the missing pattern information and the irregular clinical time series data using a self-attention block, a feed-forward network (FFN) ) module to generate the deep expression.

단계 515에서 불규칙 임상 시계열 데이터 예측 및 분류 장치(100)는 불규칙한 임상 시계열 데이터와 심층 표현을 이용하여 보조적인 결측 대체값을 생성한다. In step 515, the apparatus 100 for predicting and classifying irregular clinical time series data generates an auxiliary missing replacement value using the irregular clinical time series data and deep expression.

이미 전술한 바와 같이, 불규칙 임상 시계열 데이터 예측 및 분류 장치(100)는 불규칙한 임상 시계열 데이터 중 일부를 마스킹 처리한 후 심층 표현을 이용하여 마스킹 처리된 값을 복원하는 과정을 통해 보조적인 결측 대체값을 생성할 수 있다. As already described above, the apparatus 100 for predicting and classifying irregular clinical time series data masks some of the irregular clinical time series data, and then restores the masked values using a deep representation to generate an auxiliary missing replacement value. can create

단계 520에서 불규칙 임상 시계열 데이터 예측 및 분류 장치(100)는 심층 표현 및 보조적인 결측 대체값을 이용하여 모델(즉, 예측 및 분류기(130))를 학습한다. 이와 같은 학습 과정이 완료되면, 심층 표현을 이용하여 환자 상태 예측 또는 진단을 분류할 수 있다. In step 520, the apparatus 100 for predicting and classifying irregular clinical time series data learns a model (ie, the predictor and classifier 130) using the deep representation and the auxiliary missing replacement value. When such a learning process is completed, patient state prediction or diagnosis may be classified using deep expression.

본 발명의 실시 예에 따른 장치 및 방법은 다양한 컴퓨터 수단을 통하여 수행될 수 있는 프로그램 명령 형태로 구현되어 컴퓨터 판독 가능 매체에 기록될 수 있다. 컴퓨터 판독 가능 매체는 프로그램 명령, 데이터 파일, 데이터 구조 등을 단독으로 또는 조합하여 포함할 수 있다. 컴퓨터 판독 가능 매체에 기록되는 프로그램 명령은 본 발명을 위하여 특별히 설계되고 구성된 것들이거나 컴퓨터 소프트웨어 분야 통상의 기술자에게 공지되어 사용 가능한 것일 수도 있다. 컴퓨터 판독 가능 기록 매체의 예에는 하드 디스크, 플로피 디스크 및 자기 테이프와 같은 자기매체(magnetic media), CD-ROM, DVD와 같은 광기록 매체(optical media), 플롭티컬 디스크(floptical disk)와 같은 자기-광 매체(magneto-optical media) 및 롬(ROM), 램(RAM), 플래시 메모리 등과 같은 프로그램 명령을 저장하고 수행하도록 특별히 구성된 하드웨어 장치가 포함된다. 프로그램 명령의 예에는 컴파일러에 의해 만들어지는 것과 같은 기계어 코드뿐만 아니라 인터프리터 등을 사용해서 컴퓨터에 의해서 실행될 수 있는 고급 언어 코드를 포함한다.Devices and methods according to embodiments of the present invention may be implemented in the form of program instructions that can be executed through various computer means and recorded in computer readable media. Computer readable media may include program instructions, data files, data structures, etc. alone or in combination. Program instructions recorded on a computer readable medium may be specially designed and configured for the present invention, or may be known and usable to those skilled in the art in the field of computer software. Examples of computer-readable recording media include magnetic media such as hard disks, floppy disks and magnetic tapes, optical media such as CD-ROMs and DVDs, and magnetic media such as floptical disks. - Includes hardware devices specially configured to store and execute program instructions, such as magneto-optical media and ROM, RAM, flash memory, etc. Examples of program instructions include high-level language codes that can be executed by a computer using an interpreter, as well as machine language codes such as those produced by a compiler.

상술한 하드웨어 장치는 본 발명의 동작을 수행하기 위해 하나 이상의 소프트웨어 모듈로서 작동하도록 구성될 수 있으며, 그 역도 마찬가지이다.The hardware device described above may be configured to operate as one or more software modules to perform the operations of the present invention, and vice versa.

이제까지 본 발명에 대하여 그 실시 예들을 중심으로 살펴보았다. 본 발명이 속하는 기술 분야에서 통상의 지식을 가진 자는 본 발명이 본 발명의 본질적인 특성에서 벗어나지 않는 범위에서 변형된 형태로 구현될 수 있음을 이해할 수 있을 것이다. 그러므로 개시된 실시 예들은 한정적인 관점이 아니라 설명적인 관점에서 고려되어야 한다. 본 발명의 범위는 전술한 설명이 아니라 특허청구범위에 나타나 있으며, 그와 동등한 범위 내에 있는 모든 차이점은 본 발명에 포함된 것으로 해석되어야 할 것이다.So far, the present invention has been looked at mainly by its embodiments. Those skilled in the art to which the present invention pertains will be able to understand that the present invention can be implemented in a modified form without departing from the essential characteristics of the present invention. Therefore, the disclosed embodiments should be considered from a descriptive point of view rather than a limiting point of view. The scope of the present invention is shown in the claims rather than the foregoing description, and all differences within the equivalent scope will be construed as being included in the present invention.

100: 불규칙 임상 시계열 데이터 예측 및 분류 장치.
110: 다시점 통합 자가주의 모듈
120: 보조 해석부
130: 예측 및 분류기
140: 메모리
150: 프로세서100: Irregular clinical time series data prediction and classification device.
110: multi-view integration self-attention module
120: auxiliary analysis unit
130: predictor and classifier
140: memory
150: processor

Claims

a multi-view integration self-attention module for generating an in-depth representation of the irregular clinical time-series data by integrating irregular clinical time-series data, observation data, and time information data;
an auxiliary analyzer generating an auxiliary missing replacement value using the deep expression; and
An apparatus for predicting and classifying irregular clinical time series data including a predictor and classifier for classifying patient condition prediction or diagnosis using the deep expression.

According to claim 1,
The apparatus for predicting and classifying irregular clinical time series data, characterized in that the auxiliary missing replacement value is further used when learning the prediction and classifier.

According to claim 1,
The multi-view integration self-attention module,
Each feature value is obtained by embedding the irregular clinical time series data, the observation data, and the time information data, and missing pattern information is obtained by integrating the observation data and the feature value of the time information data using an autoattention mechanism. and generating the deep expression by integrating the missing pattern information and the irregular clinical time series data using an autologous attention block and then applying it to a feed-forward network (FFN) module. Time series data forecasting and classification device.

According to claim 1,
The auxiliary analysis unit,
An apparatus for predicting and classifying irregular clinical time series data, characterized in that, after masking some of the irregular clinical time series data, restoring the masked value using the deep expression to generate an auxiliary missing replacement value.

(a) generating an in-depth representation of the irregular clinical time series data by integrating irregular clinical time series data, observation data, and time information data;
(b) generating an auxiliary missing replacement value by masking some of the irregular clinical time series data and restoring the masked value using the deep representation; and
(c) training a predictor and classifier model for classifying patient condition prediction or diagnosis using the deep representation and the auxiliary missing replacement value.

According to claim 5,
In step (a),
obtaining feature values by embedding each of the irregular clinical time-series data, the observation data, and the time information data;
generating missing pattern information by integrating feature values of the observation data and the time information data based on a first self-attention module; and
Including integrating the feature values of the irregular clinical time series data and the missing pattern information based on a second self-attention module and then applying them to a feed-forward network (FFN) to generate a deep representation of the missing pattern. A method for predicting and classifying irregular clinical time series data, characterized in that.