KR102329938B1

KR102329938B1 - Method for processing conebeam computed tomography image using artificial neural network and apparatus therefor

Info

Publication number: KR102329938B1
Application number: KR1020190123051A
Authority: KR
Inventors: 예종철; 한요섭; 김준영
Original assignee: 한국과학기술원
Priority date: 2019-10-04
Filing date: 2019-10-04
Publication date: 2021-11-23
Also published as: KR20210040607A

Abstract

뉴럴 네트워크를 이용한 콘빔 단층촬영 영상 처리 방법 및 그 장치가 개시된다. 본 발명의 일 실시예에 따른 영상 처리 방법은 콘빔 전산단층 촬영 데이터를 수신하는 단계; 및 콘빔 아티팩트(artifact)를 제거하는 뉴럴 네트워크를 이용하여 상기 콘빔 전산단층 촬영 데이터에 대한 영상을 복원하는 단계를 포함하며, 상기 영상을 복원하는 단계는 차분 역투영(differentiated backprojection, DBP) 알고리즘을 포함하는 해석적 알고리즘을 이용하여 상기 콘빔 전산단층 촬영 데이터로부터 적어도 두 개 이상의 방향에 대한 최초 영상들을 획득하는 단계; 및 상기 뉴럴 네트워크를 이용하여 상기 최초 영상들로부터 상기 콘빔 전산단층 촬영 데이터에 대한 영상을 복원하는 단계를 포함하고, 상기 영상을 복원하는 단계는 스펙트럴 블렌딩 기법을 이용하여 상기 복원된 영상에 대한 최종 복원 영상을 생성할 수 있다.Disclosed are a method and apparatus for processing cone beam tomography images using a neural network. An image processing method according to an embodiment of the present invention includes receiving cone-beam computed tomography data; and reconstructing an image of the cone-beam computed tomography data using a neural network that removes cone-beam artifacts, wherein reconstructing the image includes a differential backprojection (DBP) algorithm. obtaining initial images in at least two directions from the cone beam computed tomography data using an analytic algorithm; and reconstructing an image of the cone-beam computed tomography data from the initial images using the neural network, wherein the reconstructing of the image includes a final image of the reconstructed image using a spectral blending technique. A restored image can be created.

Description

Cone beam tomography image processing method and apparatus using neural network

본 발명은 콘빔 단층촬영 영상 처리 방법 및 그 장치에 관한 것으로, 보다 구체적으로는 콘빔 아티팩트(conebeam artifact)를 제거하는 뉴럴 네트워크를 이용하여 콘빔 단층촬영 영상을 고품질의 영상으로 복원할 수 있는 영상 처리 방법 및 그 장치에 관한 것이다.The present invention relates to a method and apparatus for processing a cone-beam tomography image, and more particularly, to an image processing method capable of reconstructing a cone-beam tomography image into a high-quality image using a neural network that removes cone-beam artifacts and to the device.

원형 궤도를 갖는 콘빔 전산화 단층촬영(Cone-Beam Computed Tomography, CBCT)는 간단한 지오메트리(geometry)로 인하여 자주 사용되는 CT 시스템 중 하나이다. 이러한 CBCT는 상대적으로 간단한 스캐너 형상으로 고해상도 투영 영상을 획득할 수 있으므로 중재적 이미징과 치과 CT 등에 자주 사용된다. 일반적으로 CBCT 시스템에서 얻어진 투영 데이터(projection data)를 이용하여 영상을 복원하기 위하여 Feldkamp, Davis and Kress(FDK)라 불리는 복원 알고리즘이 일반적으로 사용된다. 하지만, FDK 알고리즘의 경우 CBCT 시스템이 갖는 콘 각(cone-angle)이 커짐에 따라 콘빔 아티팩트(conebeam artifact)로 불리는 영상 노이즈가 심해지게 되어 복원 영상의 형태 왜곡을 발생 시킨다.Cone-Beam Computed Tomography (CBCT) with a circular orbit is one of the frequently used CT systems due to its simple geometry. Since such CBCT can acquire a high-resolution projection image with a relatively simple scanner shape, it is often used for interventional imaging and dental CT. In general, a reconstruction algorithm called Feldkamp, Davis and Kress (FDK) is generally used to reconstruct an image using projection data obtained from a CBCT system. However, in the case of the FDK algorithm, as the cone-angle of the CBCT system increases, image noise called a cone-beam artifact becomes severe, which causes distortion of the shape of the reconstructed image.

수학적으로 콘빔 아티팩트는 원형 궤적에 내재된 결함으로 인해 발생하는데, 이는 Tuy의 조건을 만족시키지 못한다. 이것은 푸리에 도메인에서 스캐너 지오메트리에 의해 결정되는 특정 주파수에서 누락된 스펙트럴 구성요소로 나타난다. 따라서 추가 사전 정보를 사용하지 않으면 콘빔 아티팩트의 정확한 제거가 불가능할 수 있다.Mathematically, cone beam artifacts are caused by defects inherent in circular trajectories, which do not satisfy Tuy's condition. This appears as a missing spectral component at a specific frequency determined by the scanner geometry in the Fourier domain. Therefore, accurate removal of cone beam artifacts may not be possible without additional prior information.

이러한 영상의 왜곡 또는 영상의 노이즈를 제거하기 위하여 다양한 모델 기반 반복적 복원 알고리즘(Model-Based Iterative Reconstruction, MBIR)이 일반적으로 사용되었지만, 많은 연산량이 소요되는 CT의 투영 연산과 역투영 연산을 반복적으로 수행해야 한다는 한계가 존재한다.In order to remove such image distortion or image noise, various model-based iterative reconstruction algorithms (Model-Based Iterative Reconstruction, MBIR) have been generally used, but the CT projection operation and back-projection operation, which require a lot of computation, are repeatedly performed. There are limits to what must be done.

근래에 들어, 뉴럴 네트워크 (neural network)을 이용한 전산화 단층촬영 복원기법에 관한 연구가 활발히 진행되기 시작하였다. 이미 저선량 CT, 희소뷰 CT, 그리고 내부 단층 촬영 CT 에서 뉴럴 네트워크를 이용하여 높은 수준의 복원 성능을 보여준다는 것이 검증되었다. 뉴럴 네트워크를 이용한 복원기법의 경우, 복원수행시간이 굉장히 짧음에도 불구하고 우수한 성능을 보여주는 것으로 알려져 있다. 또한, 최근의 이론적 연구에서 인코더-디코더 구조를 가진 딥 컨볼루션 뉴럴 네트워크가 조합형 컨볼루션 프레임을 이용한 새로운 프레임 확장과 관련이 있음을 보여주었다. Recently, studies on computed tomography reconstruction techniques using neural networks have begun to be actively conducted. It has already been verified that low-dose CT, sparse-view CT, and internal tomography CT show a high level of restoration performance using neural networks. In the case of a restoration technique using a neural network, it is known that it shows excellent performance despite the very short restoration execution time. In addition, recent theoretical studies have shown that deep convolutional neural networks with encoder-decoder structures are related to new frame extensions using combinatorial convolutional frames.

본 발명의 실시예들은, 콘빔 아티팩트를 제거하는 뉴럴 네트워크를 이용하여 콘빔 단층촬영 영상을 고품질의 영상으로 복원할 수 있는 영상 처리 방법 및 그 장치를 제공한다.Embodiments of the present invention provide an image processing method and apparatus capable of reconstructing a cone-beam tomography image into a high-quality image using a neural network that removes cone-beam artifacts.

본 발명의 실시예들은, 뉴럴 네트워크에 의해 콘빔 아티팩트가 제거된 콘빔 영상을 입력으로 하는 스펙트럴 블렌딩(spectral blending) 기법을 이용하여 잔존하는 노이즈를 제거함으로써, 콘빔 단층촬영 영상을 고품질의 영상으로 복원할 수 있는 영상 처리 방법 및 그 장치를 제공한다.Embodiments of the present invention restore a cone-beam tomography image to a high-quality image by removing residual noise using a spectral blending technique in which a cone-beam image from which cone-beam artifacts are removed by a neural network is removed as an input. Provided are an image processing method and an apparatus for the same.

본 발명의 일 실시예에 따른 영상 처리 방법은 콘빔 전산단층 촬영 데이터를 수신하는 단계; 및 콘빔 아티팩트(artifact)를 제거하는 뉴럴 네트워크를 이용하여 상기 콘빔 전산단층 촬영 데이터에 대한 영상을 복원하는 단계를 포함한다.An image processing method according to an embodiment of the present invention includes receiving cone-beam computed tomography data; and reconstructing an image of the cone-beam computed tomography data using a neural network that removes cone-beam artifacts.

상기 영상을 복원하는 단계는 차분 역투영(differentiated backprojection, DBP) 알고리즘을 포함하는 해석적 알고리즘을 이용하여 상기 콘빔 전산단층 촬영 데이터로부터 적어도 두 개 이상의 방향에 대한 최초 영상들을 획득하는 단계; 및 상기 뉴럴 네트워크를 이용하여 상기 최초 영상들로부터 상기 콘빔 전산단층 촬영 데이터에 대한 영상을 복원하는 단계를 포함할 수 있다.The reconstructing of the image may include: obtaining initial images in at least two directions from the cone-beam computed tomography data using an analytic algorithm including a differential backprojection (DBP) algorithm; and reconstructing an image of the cone-beam computed tomography data from the initial images by using the neural network.

상기 영상을 복원하는 단계는 차분 역투영 알고리즘을 이용하여 제1 방향의 최초 영상과 상기 제1 방향과 직교하는 제2 방향의 최초 영상을 획득하고, 상기 뉴럴 네트워크를 이용하여 상기 제1 방향의 콘빔 아티팩트를 제거한 제1 영상과 상기 제2 방향의 콘빔 아티팩트를 제거한 제2 영상을 복원할 수 있다.The restoring of the image includes obtaining an initial image in a first direction and an initial image in a second direction orthogonal to the first direction using a differential inverse projection algorithm, and using the neural network to obtain a cone beam in the first direction. The first image from which the artifacts are removed and the second image from which the cone beam artifacts in the second direction are removed may be reconstructed.

상기 영상을 복원하는 단계는 상기 제1 영상과 상기 제2 영상을 입력으로 하는 스펙트럴 블렌딩(spectral blending) 기법을 이용하여 상기 콘빔 전산단층 촬영 데이터에 대한 최종 복원 영상을 생성할 수 있다.The reconstructing of the image may include generating a final reconstructed image of the cone-beam computed tomography data by using a spectral blending technique in which the first image and the second image are input.

상기 영상을 복원하는 단계는 스펙트럴 블렌딩 기법을 이용하여 상기 복원된 영상에 대한 최종 복원 영상을 생성할 수 있다.The reconstructing of the image may include generating a final reconstructed image of the reconstructed image by using a spectral blending technique.

상기 뉴럴 네트워크는 컨볼루션 프레임렛(convolution framelet) 기반의 뉴럴 네트워크를 포함할 수 있다.The neural network may include a neural network based on a convolution framelet.

상기 뉴럴 네트워크는 풀링(pooling) 레이어와 언풀링(unpooling) 레이어를 포함하는 다중 해상도 뉴럴 네트워크를 포함할 수 있다.The neural network may include a multi-resolution neural network including a pooling layer and an unpooling layer.

상기 뉴럴 네트워크는 상기 풀링 레이어에서 상기 언풀링 레이어로의 바이패스 연결을 포함할 수 있다.The neural network may include a bypass connection from the pooling layer to the unpooling layer.

본 발명의 일 실시예에 따른 영상 처리 장치는 콘빔 전산단층 촬영 데이터를 수신하는 수신부; 및 콘빔 아티팩트(artifact)를 제거하는 뉴럴 네트워크를 이용하여 상기 콘빔 전산단층 촬영 데이터에 대한 영상을 복원하는 복원부를 포함한다.An image processing apparatus according to an embodiment of the present invention includes: a receiver configured to receive cone-beam computed tomography data; and a restoration unit for reconstructing an image of the cone-beam computed tomography data using a neural network that removes cone-beam artifacts.

상기 복원부는 차분 역투영(differentiated backprojection, DBP) 알고리즘을 포함하는 해석적 알고리즘을 이용하여 상기 콘빔 전산단층 촬영 데이터로부터 적어도 두 개 이상의 방향에 대한 최초 영상들을 획득하고, 상기 뉴럴 네트워크를 이용하여 상기 최초 영상들로부터 상기 콘빔 전산단층 촬영 데이터에 대한 영상을 복원할 수 있다.The restoration unit acquires initial images in at least two directions from the cone-beam computed tomography data using an analytic algorithm including a differential backprojection (DBP) algorithm, and uses the neural network to obtain the initial images An image of the cone-beam computed tomography data may be reconstructed from the images.

상기 복원부는 차분 역투영 알고리즘을 이용하여 제1 방향의 최초 영상과 상기 제1 방향과 직교하는 제2 방향의 최초 영상을 획득하고, 상기 뉴럴 네트워크를 이용하여 상기 제1 방향의 콘빔 아티팩트를 제거한 제1 영상과 상기 제2 방향의 콘빔 아티팩트를 제거한 제2 영상을 복원할 수 있다.The restoration unit obtains an initial image in a first direction and an initial image in a second direction orthogonal to the first direction by using a differential back-projection algorithm, and removes the cone beam artifact in the first direction using the neural network. The first image and the second image from which the cone beam artifact in the second direction is removed may be reconstructed.

상기 복원부는 상기 제1 영상과 상기 제2 영상을 입력으로 하는 스펙트럴 블렌딩(spectral blending) 기법을 이용하여 상기 콘빔 전산단층 촬영 데이터에 대한 최종 복원 영상을 생성할 수 있다.The reconstructor may generate a final reconstructed image for the cone-beam computed tomography data by using a spectral blending technique in which the first image and the second image are input.

상기 복원부는 스펙트럴 블렌딩 기법을 이용하여 상기 복원된 영상에 대한 최종 복원 영상을 생성할 수 있다.The reconstructor may generate a final reconstructed image of the reconstructed image by using a spectral blending technique.

본 발명의 실시예들에 따르면, 콘빔 아티팩트를 제거하는 뉴럴 네트워크를 이용하여 콘빔 단층촬영 영상으로 콘빔 아티팩트를 제거하고, 콘빔 아티팩트가 제거된 콘빔 영상을 입력으로 하는 스펙트럴 블렌딩 기법을 이용하여 잔존하는 노이즈를 제거함으로써, 콘빔 단층촬영 영상을 고품질의 영상으로 복원할 수 있다.According to embodiments of the present invention, a cone beam artifact is removed with a cone beam tomography image using a neural network that removes the cone beam artifact, and a spectral blending technique in which the cone beam image from which the cone beam artifact has been removed is used as an input. By removing the noise, the cone-beam tomography image can be restored to a high-quality image.

전산화 단층촬영(CT)은 엑스선(X-ray)을 실험자에게 투영하여 얻어진 투영 영상을 이용하여 실험자의 단층영상을 복원하는 기법이다. 특히, 콘빔 전산화 단층촬영(CBCT)은 사용하는 장비의 궤도가 간단하여, 치과용 3D CBCT 와 성형외과용 3D CBCT, 그리고 환자의 치료 경과를 살피기 위한 3D CBCT 영상으로 다양한 방면에서 사용되고 있다. 본 발명의 실시예들에 따르면, 차분 역투영(Differential Back-Projection, DBP) 영상을 통한 열화 보정을 수행함으로써, 그 성능이 치과용 또는 성형외과용과 같은 용도와 무관하게 안정적인 성능을 보일 수 있다. 따라서, 본 발명은 다양한 3D CBCT 시스템에 적용할 수 있다.Computed tomography (CT) is a technique for reconstructing a tomography image of an experimenter using a projection image obtained by projecting X-rays onto an experimenter. In particular, cone-beam computed tomography (CBCT) is used in various fields as 3D CBCT for dentistry, 3D CBCT for plastic surgery, and 3D CBCT image for monitoring the patient's treatment progress because the track of the equipment used is simple. According to embodiments of the present invention, by performing deterioration correction through a differential back-projection (DBP) image, the performance may show stable performance regardless of uses such as dentistry or plastic surgery. Therefore, the present invention can be applied to various 3D CBCT systems.

치과용 3D CBCT와 성형외과용 3D CBCT는 사용빈도가 광장히 높으며 이미 장비들이 대거 도입이 되어 있는 실정이다. 이와 같이, 3D CBCT에 관한 인프라가 충분히 갖춰 있는 상황에서 본 발명과 같은 코어 어플리케이션이 개발되지 않은 상황이다. 따라서, 본 발명은 치과용과 성형외과용 3D CBCT에 바로 적용가능한 인프라에 충분히 구축이 되어 있기 때문에 즉시 도입할 수 있으며, 이를 통해 고품질의 영상을 제공할 수 있다.3D CBCT for dentistry and 3D CBCT for plastic surgery have a high frequency of use, and a lot of equipment has already been introduced. As such, in a situation in which the infrastructure for 3D CBCT is sufficiently equipped, a core application such as the present invention has not been developed. Therefore, the present invention can be introduced immediately because it is sufficiently established in the infrastructure that can be directly applied to 3D CBCT for dental and plastic surgery, and through this, it is possible to provide high-quality images.

도 1은 본 발명의 일 실시예에 따른 영상 처리 방법에 대한 동작 흐름도를 나타낸 것이다.
도 2는 본 발명의 방법을 설명하기 위한 차분 역투영 도메인 딥 러닝에 대한 일 예시도를 나타낸 것이다.
도 3은 인자화된 콘빔 지오메트리의 좌표계에 대한 일 예시도를 나타낸 것이다.
도 4는 관심 평면 상에서 소스 지오메트리와 필터링 방향에 대한 일 예시도를 나타낸 것이다.
도 5는 누락된 주파수 영역을 설명하기 위한 일 예시도를 나타낸 것이다.
도 6은 스펙트럴 블렌딩 기법을 설명하기 위한 일 예시도를 나타낸 것이다.
도 7은 디스크 팬텀에 대한 일 예시도를 나타낸 것이다.
도 8은 콘빔 아티팩트를 제거하는 뉴럴 네트워크의 입력과 출력에 대한 일 예시도를 나타낸 것이다.
도 9는 본 발명의 방법에서의 딥 뉴럴 네트워크 아키텍처에 대한 일 예시도를 나타낸 것이다.
도 10은 에포크 수에 따른 목적함수의 수렴에 대한 일 예시도를 나타낸 것이다.
도 11과 도 12는 반복적 복원기법(Total variation), FDK-CNN과 본 발명의 방법에 의한 복원 성능을 비교한 예시도들을 나타낸 것이다.
도 13은 본 발명의 일 실시예에 따른 영상 처리 장치에 대한 구성을 나타낸 것이다.1 is a flowchart illustrating an image processing method according to an embodiment of the present invention.
2 shows an exemplary diagram for differential inverse-projection domain deep learning for explaining the method of the present invention.
3 shows an exemplary view of a coordinate system of a factorized cone beam geometry.
4 is a diagram illustrating an example of a source geometry and a filtering direction on a plane of interest.
5 is a diagram illustrating an example for explaining a missing frequency region.
6 is a diagram illustrating an example for explaining a spectral blending technique.
7 is a diagram illustrating an example of a disk phantom.
8 is a diagram illustrating an example of input and output of a neural network that removes cone beam artifacts.
9 shows an exemplary diagram of a deep neural network architecture in the method of the present invention.
10 shows an exemplary diagram for convergence of the objective function according to the number of epochs.
11 and 12 show exemplary views comparing restoration performance by an iterative restoration technique (total variation), FDK-CNN, and the method of the present invention.
13 illustrates a configuration of an image processing apparatus according to an embodiment of the present invention.

본 발명의 이점 및 특징, 그리고 그것들을 달성하는 방법은 첨부되는 도면과 함께 상세하게 후술되어 있는 실시예들을 참조하면 명확해질 것이다. 그러나, 본 발명은 이하에서 개시되는 실시예들에 한정되는 것이 아니라 서로 다른 다양한 형태로 구현될 것이며, 단지 본 실시예들은 본 발명의 개시가 완전하도록 하며, 본 발명이 속하는 기술분야에서 통상의 지식을 가진 자에게 발명의 범주를 완전하게 알려주기 위해 제공되는 것이며, 본 발명은 청구항의 범주에 의해 정의될 뿐이다.Advantages and features of the present invention and methods of achieving them will become apparent with reference to the embodiments described below in detail in conjunction with the accompanying drawings. However, the present invention is not limited to the embodiments disclosed below, but will be implemented in various different forms, and only these embodiments allow the disclosure of the present invention to be complete, and common knowledge in the art to which the present invention pertains It is provided to fully inform those who have the scope of the invention, and the present invention is only defined by the scope of the claims.

본 명세서에서 사용된 용어는 실시예들을 설명하기 위한 것이며, 본 발명을 제한하고자 하는 것은 아니다. 본 명세서에서, 단수형은 문구에서 특별히 언급하지 않는 한 복수형도 포함한다. 명세서에서 사용되는 "포함한다(comprises)" 및/또는 "포함하는(comprising)"은 언급된 구성요소, 단계, 동작 및/또는 소자는 하나 이상의 다른 구성요소, 단계, 동작 및/또는 소자의 존재 또는 추가를 배제하지 않는다.The terminology used herein is for the purpose of describing the embodiments, and is not intended to limit the present invention. As used herein, the singular also includes the plural unless specifically stated otherwise in the phrase. As used herein, "comprises" and/or "comprising" refers to the presence of one or more other components, steps, operations and/or elements mentioned. or addition is not excluded.

다른 정의가 없다면, 본 명세서에서 사용되는 모든 용어(기술 및 과학적 용어를 포함)는 본 발명이 속하는 기술분야에서 통상의 지식을 가진 자에게 공통적으로 이해될 수 있는 의미로 사용될 수 있을 것이다. 또한, 일반적으로 사용되는 사 전에 정의되어 있는 용어들은 명백하게 특별히 정의되어 있지 않는 한 이상적으로 또는 과도하게 해석되지 않는다.Unless otherwise defined, all terms (including technical and scientific terms) used herein may be used with the meaning commonly understood by those of ordinary skill in the art to which the present invention belongs. In addition, terms defined in a commonly used dictionary are not to be interpreted ideally or excessively unless specifically defined explicitly.

이하, 첨부한 도면들을 참조하여, 본 발명의 바람직한 실시예들을 보다 상세하게 설명하고자 한다. 도면 상의 동일한 구성요소에 대해서는 동일한 참조 부호를 사용하고 동일한 구성요소에 대해서 중복된 설명은 생략한다.Hereinafter, preferred embodiments of the present invention will be described in more detail with reference to the accompanying drawings. The same reference numerals are used for the same components in the drawings, and repeated descriptions of the same components are omitted.

본 발명의 실시예들은, 콘빔 아티팩트를 제거하는 뉴럴 네트워크를 이용하여 콘빔 단층촬영 영상으로 콘빔 아티팩트를 제거함으로써, 콘빔 단층촬영 영상을 고품질의 영상으로 복원하는 것을 그 요지로 한다.Embodiments of the present invention have a gist of restoring a cone-beam tomography image to a high-quality image by removing the cone-beam artifact with the cone-beam tomography image using a neural network that removes the cone-beam artifact.

여기서, 본 발명은 콘빔 아티팩트가 제거된 콘빔 영상을 입력으로 하는 스펙트럴 블렌딩 기법을 이용하여 잔존하는 노이즈를 제거함으로써, 콘빔 단층촬영 영상을 고품질의 영상으로 복원할 수 있다.Here, the present invention can restore a cone beam tomography image to a high quality image by removing residual noise using a spectral blending technique that takes a cone beam image from which the cone beam artifact has been removed as an input.

이 때, 본 발명에서 사용하는 뉴럴 네트워크는 컨볼루션 프레임렛(convolution framelet) 기반의 뉴럴 네트워크를 포함할 수 있으며, 풀링(pooling) 레이어와 언풀링(unpooling) 레이어를 포함하는 다중 해상도 뉴럴 네트워크를 포함할 수 있다. 나아가, 다중 해상도 뉴럴 네트워크는 풀링 레이어에서 언풀링 레이어로의 바이패스 연결을 포함할 수 있다.In this case, the neural network used in the present invention may include a convolution framelet-based neural network, and includes a multi-resolution neural network including a pooling layer and an unpooling layer. can do. Furthermore, the multi-resolution neural network may include a bypass connection from the pooling layer to the unpooling layer.

본 발명의 뉴럴 네트워크는 콘빔 전산단층 촬영 영상을 차분 역투영(differentiated backprojection, DBP) 알고리즘을 포함하는 해석적 알고리즘을 이용하여 콘빔 전산단층 촬영 영상으로부터 획득된 적어도 두 개 이상의 방향에 대한 최초 영상들을 입력으로 하고, 최초 영상들에 대한 콘빔 아티팩트를 제거함으로써, 콘빔 전산단층 촬영 영상에 대한 고품질의 영상을 복원할 수 있다.The neural network of the present invention inputs initial images for at least two directions obtained from cone-beam computed tomography images using an analytic algorithm including a differential backprojection (DBP) algorithm for cone-beam computed tomography images. , and by removing cone-beam artifacts for the initial images, it is possible to reconstruct a high-quality image of the cone-beam computed tomography image.

도 1은 본 발명의 일 실시예에 따른 영상 처리 방법에 대한 동작 흐름도를 나타낸 것이고, 도 2는 본 발명의 방법을 설명하기 위한 차분 역투영 도메인 딥 러닝에 대한 일 예시도를 나타낸 것이다.1 is a flowchart illustrating an image processing method according to an embodiment of the present invention, and FIG. 2 is a diagram illustrating an example of differential inverse-projection domain deep learning for explaining the method of the present invention.

도 1과 도 2를 참조하면, 본 발명의 일 실시예에 따른 영상 처리 방법은 콘빔 전산단층 촬영 데이터를 수신하는 단계(S110), 콘빔 아티팩트를 제거하는 뉴럴 네트워크를 이용하여 콘빔 전산단층 촬영 데이터에 대한 영상을 복원하는 단계(S120) 및 스펙트럴 블렌딩 기법을 이용한 복원된 영상에 대한 최종 복원 영상을 생성하는 단계(S130)를 포함한다.1 and 2 , the image processing method according to an embodiment of the present invention includes the steps of receiving cone-beam computed tomography data (S110), and adding the cone-beam computed tomography data to the cone-beam computed tomography data using a neural network that removes cone-beam artifacts. It includes a step of reconstructing an image of the . (S120) and a step of generating a final reconstructed image of the reconstructed image using a spectral blending technique (S130).

여기서, 단계 S120은 내부 전산단층 촬영 데이터를 미리 설정된 해석적 알고리즘 예를 들어, 차분 역투영(differentiated backprojection, DBP) 알고리즘을 이용하여 콘빔 전산단층 촬영 데이터로부터 적어도 두 개 이상의 방향에 대한 최초 영상들 예를 들어, 관상(coronal) 방향과 시상(sagittal) 방향 각각에 대한 최초 영상들을 획득하고, 최초 영상들(CNN input)을 입력으로 하는 뉴럴 네트워크를 이용하여 적어도 두 개 이상의 방향에 대한 최초 영상들로부터 콘빔 아티팩트를 제거한 영상(CNN output)을 출력함으로써, 콘빔 전산단층 촬영 데이터에 대한 영상을 복원할 수 있다. 이 때, 단계 S120은 직교하는 두 방향의 최초 영상을 획득할 수도 있고, 직교하지 않는 두 방향의 최초 영상을 획득할 수도 있으며, 뉴럴 네트워크를 이용하여 최초 영상들 각각의 콘빔 아티팩트를 제거함으로써, 콘빔 아티팩트가 제거된 두 방향 각각의 복원 영상을 획득할 수 있다. 즉, 도 2에 도시된 바와 같이 차분 역투영 연산을 수행하여 투영 데이터를 서로 다른 두 방향 예를 들어, 관상 방향과 시상 방향에 대한 차분 역투영(DBP) 영상으로 생성하고, 양 방향의 DBP 영상에 대하여 뉴럴 네트워크를 각각 적용하여 복원 영상을 생성한다.Here, in step S120, the internal computed tomography data are first images in at least two directions from the cone beam computed tomography data using a preset analytic algorithm, for example, a differential backprojection (DBP) algorithm. For example, first images for each of the coronal and sagittal directions are acquired, and from the first images for at least two or more directions using a neural network that takes the first images (CNN input) as input. By outputting an image (CNN output) from which the cone-beam artifact is removed, an image for the cone-beam computed tomography data may be reconstructed. In this case, in step S120, initial images in two directions orthogonal to each other may be obtained, or initial images in two non-orthogonal directions may be obtained, and cone beam artifacts of each of the initial images are removed by using a neural network, so that the cone beam Reconstructed images in each of two directions from which artifacts are removed may be obtained. That is, as shown in FIG. 2 , a differential inverse projection operation is performed to generate projection data as a differential inverse projection (DBP) image in two different directions, for example, a coronal direction and a sagittal direction, and DBP images in both directions. A reconstructed image is generated by applying a neural network to each.

본 발명에서 사용되는 뉴럴 네트워크는 컨볼루션 프레임렛(convolution framelet) 기반의 뉴럴 네트워크를 포함할 수 있으며, 풀링(pooling) 레이어와 언풀링(unpooling) 레이어를 포함하는 다중 해상도 뉴럴 네트워크를 포함할 수 있다.The neural network used in the present invention may include a convolution framelet-based neural network, and may include a multi-resolution neural network including a pooling layer and an unpooling layer. .

여기서, 컨볼루션 프레임렛은 입력신호에 대하여 국소 기저와 비국소 기저를 이용하여 표현하는 방식을 의미할 수 있다.Here, the convolutional framelet may refer to a method of expressing an input signal using a local basis and a non-local basis.

나아가, 뉴럴 네트워크는 풀링 레이어에서 언풀링 레이어로의 바이패스 연결을 포함할 수 있다.Furthermore, the neural network may include a bypass connection from the pooling layer to the unpooling layer.

단계 S120에서 뉴럴 네트워크를 통해 적어도 두 개 이상의 방향에 대한 최초 영상들의 콘빔 아티팩트가 제거되면, 콘빔 아티팩트가 제거된 적어도 두 개 이상의 방향에 대한 최초 영상들(spectral blending input)을 입력으로 하는 스펙트럴 블렌딩 기법을 이용하여 최종 복원 영상(reconstructed images)을 생성한다(S130).When the cone beam artifacts of the original images in at least two directions are removed through the neural network in step S120, spectral blending is performed by inputting spectral blending inputs in the at least two directions in which the cone beam artifacts are removed. A final reconstructed image is generated using the technique (S130).

여기서, 단계 S130은 콘빔 아티팩트가 제거된 적어도 두 개 이상의 방향에 대한 최초 영상들 각각을 이용하여 각 방향에 대한 최초 영상들을 통해 3D 영상을 복원하고, 복원된 3D 영상들 각각에 대한 다른 방향의 슬라이스 영상 페어를 입력으로 하는 스펙트럴 블렌딩 기법을 이용하여 잔존하는 노이즈를 제거함으로써, 최종 복원 영상을 생성할 수 있다. 즉, 단계 S130은 다른 방향의 슬라이스 영상 페어에 잔존하는 노이즈를 제거하여 잔존하는 노이즈가 제거된 다른 방향 예를 들어, 제3 방향의 슬라이스 영상을 획득하고, 노이즈가 제거된 제3 방향의 슬라이스 영상들을 이용하여 콘빔 전산단층 촬영 데이터에 대한 고품질의 3D 영상을 생성할 수 있다.Here, in step S130, a 3D image is reconstructed through the initial images for each direction by using each of the initial images for at least two directions from which the cone beam artifact is removed, and a slice in a different direction for each of the reconstructed 3D images A final reconstructed image may be generated by removing residual noise using a spectral blending technique using an image pair as an input. That is, in step S130, the noise remaining in the slice image pair in the other direction is removed to obtain a slice image in the other direction from which the remaining noise is removed, for example, a slice image in the third direction, and the slice image in the third direction from which the noise is removed. High-quality 3D images of cone-beam computed tomography data can be generated using

이러한 본 발명에 따른 방법에 대해 도 3 내지 도 12를 참조하여 설명한다.The method according to the present invention will be described with reference to FIGS. 3 to 12 .

표기법(notation)notation

x = (x, y, z)를

의 점이라 하면 x, y 및 z는 환자에 대한 직교 좌표를 의미한다. 원형 궤적의 경우 엑스선 소스(source)가 물체 f(x)를 중심으로 회전하면 소스 궤적은 아래 <수학식 1>과 같이 나타낼 수 있다.x = (x, y, z)

For a point in x, y and z mean the Cartesian coordinates for the patient. In the case of a circular trajectory, when the X-ray source rotates around the object f(x), the source trajectory can be expressed as in Equation 1 below.

[수학식 1][Equation 1]

여기서, R은 원형 스캔의 반지름을 의미하고, 심볼 ┳는 행렬 또는 벡터의 전치(transpose)를 의미할 수 있다.Here, R may mean a radius of a circular scan, and symbol ┳ may mean transpose of a matrix or vector.

f(x)를 선적분의 세트로 매핑하는 엑스선 변환 D_f는 아래 <수학식 2>와 같이 정의될 수 있다. _{An X-ray transformation D f} that maps f(x) to a set of line integrals may be defined as in Equation 2 below.

[수학식 2][Equation 2]

여기서, θ는 단위 구

의 벡터를 의미할 수 있다.where θ is the unit sphere

can mean a vector of .

3D 역 푸리에(Fourier) 변환은 아래 <수학식 3>과 같이 나타낼 수 있다.The 3D inverse Fourier transform can be expressed as in Equation 3 below.

[수학식 3][Equation 3]

여기서,

는 f(x)의 푸리에 스펙트럼을 의미하고,

은 각주파수를 의미하며,

을 의미할 수 있다.here,

is the Fourier spectrum of f(x),

is the angular frequency,

can mean

차분difference 역투영back projection (DBP)(DBP)

주어진 엑스선 소스의 궤적 a(λ)의 경우, 엑스선 소스 궤적 a(λ),

의 투영 데이터를 이용하는 점(point)

에서 차분 역투영(DBP)은 아래 <수학식 4>와 같이 나타낼 수 있다.For a given X-ray source trajectory a(λ), X-ray source trajectory a(λ),

point using projection data of

In , the differential back-projection (DBP) can be expressed as in Equation 4 below.

[수학식 4][Equation 4]

여기서, 1/||x-a(λ)||는 배율 팩터 종속 가중치(magnification factor dependent weighting)를 의미할 수 있다.Here, 1/||x-a(λ)|| may mean magnification factor dependent weighting.

DBP의 가장 근본적인 특성 중 하나는 푸리에 변환과의 관계이다. 구체적으로는 상기 수학식 4의 DBP 데이터는 아래 <수학식 5> 내지 <수학식 7>과 같이 나타낼 수 있다.One of the most fundamental properties of DBP is its relationship to the Fourier transform. Specifically, the DBP data of Equation 4 may be expressed as in <Equation 5> to <Equation 7> below.

[수학식 5][Equation 5]

[수학식 6][Equation 6]

[수학식 7][Equation 7]

상기 수학식 3의 푸리에 공식과 비교하면, 상기 수학식 5의 주요 차이점은

라는 추가적인 항이다. 이것은 DBP가 f(x)의 필터링된 버전을 제공한다는 것을 의미한다. 필터

의 스펙트럼 또한 x에 의존하기 때문에 해당 필터는 공간적으로 변화하는 필터이다.Compared with the Fourier formula of Equation 3, the main difference of Equation 5 is

is an additional term. This means that DBP provides a filtered version of f(x). filter

Since the spectrum of is also dependent on x, the filter is a spatially varying filter.

콘빔Corn Beam 지오메트리의geometric 인자화printing 표현(Factorized Representation of Factorized Representation of ConebeamConebeam Geometry) Geometry)

원형 궤적이 있는 콘빔 CT의 경우, 기존 연구에서 인자화 방법을 제안하였으며, 유사한 아이디어가 다른 연구에서도 제안되었다. 본 발명은 이러한 인자화 방법의 핵심 아이디어를 이용할 수 있다.In the case of cone beam CT with a circular trajectory, a factorization method has been proposed in previous studies, and similar ideas have been proposed in other studies as well. The present invention can utilize the core idea of this factorization method.

도 3은 인자화된 콘빔 지오메트리의 좌표계에 대한 일 예시도를 나타낸 것으로, 원형 궤적을 가진 콘빔 CT의 지오메트리적 구조를 보여준다. 여기서, z축과 평행하고 a(λ^-)와 a(λ⁺) 두 위치에서 소스 궤적을 교차하는 평면을 고려한다.

가 관심 평면을 의미한다 가정한다. 인자화 방법의 주요 목표는 3D 복원 문제를 관심 평면에서 연속적인 2D 문제로 전환하는 것이다. 특히 관심 평면

에서 가상의 코드선(chord line)과 가상의 소스 위치를 정의한다. 가상 코드선 좌표계는 가상 코드선 방향 e, z축 e_z 및 수직축 e┴에 의해 정의된다. 가상의 소스 a_v(λ⁺)와 a_v(λ^-)는 아래 <수학식 8>과 같이 간단히 계산될 수 있다.3 shows an exemplary view of a coordinate system of a factorized cone beam geometry, and shows a geometric structure of a cone beam CT having a circular trajectory. Here, consider a plane parallel to the z-axis and intersecting the source trajectory at two positions ^{a(λ -} ) and a(λ ^{+ ).}

Assume that is the plane of interest. The main goal of the factorization method is to transform the 3D reconstruction problem into a continuous 2D problem in the plane of interest. plane of particular interest

defines a virtual chord line and a virtual source location. The virtual code line coordinate system is defined by the virtual code line direction e, the z-axis e _z and the vertical axis e┴. The virtual sources a _v (λ ⁺ ) and a _v (λ ^- ) can be simply calculated as shown in Equation 8 below.

[수학식 8][Equation 8]

그런 다음, 직교 좌표 x는 아래 <수학식 9>와 같이 새 좌표(t, s, z)로 변환된다.Then, the Cartesian coordinate x is converted into a new coordinate (t, s, z) as shown in Equation 9 below.

[수학식 9][Equation 9]

표기법을 다소 남용하면, 고정된 s의 관심 평면 P에서 오브젝트 밀도와 DBP 데이터는 아래 <수학식 10>과 같은 2D 함수로 나타낼 수 있다.If the notation is slightly abused, the object density and DBP data in the plane of interest P of fixed s can be expressed as a 2D function as shown in Equation 10 below.

[수학식 10][Equation 10]

상기 수학식 6의

의 첫 번째 항과 두 번째 항은 각각 α(λ^-,x)와 α(λ⁺,x) 방향에 따른 힐버트(Hilbert) 변환에 해당하기 때문에 기존 연구(F. Dennerlein, F. Noo, H. Schondube, G. Lauritsch, and J. Hornegger, "A factorization approach for cone-beam reconstruction on a circular short-scan," IEEE Transactions on Medical Imaging, vol. 27, no. 7, pp. 887-896, 2008.)를 통해 상기 수학식 5를 아래 <수학식 11>과 같이 나타낼 수 있다.of Equation 6 above

The first and second terms of α(λ ^- ,x) and α(λ ⁺ ,Because x) corresponds to the Hilbert transform along the direction, previous studies (F. Dennerlein, F. Noo, H. Schondube, G. Lauritsch, and J. Hornegger, “A factorization approach for cone-beam reconstruction on a circular Short-scan," IEEE Transactions on Medical Imaging, vol. 27, no. 7, pp. 887-896, 2008.) through Equation 5 can be expressed as <Equation 11> below.

[수학식 11][Equation 11]

여기서, h_H(t)는 힐버트 변환을 의미하며, z₁(τ)과 z₂(τ)는 도 4와 같이

과

선을 따른 좌표를 의미할 수 있다. 이는 DBP 도메인에서 디컨볼루션 문제를 해결함으로써 콘빔 재구성 문제를 해결할 수 있음을 의미한다.Here, h _H (t) means the Hilbert transform, and z ₁ (τ) and z ₂ (τ) are as shown in FIG. 4 .

class

It may mean coordinates along a line. This means that the cone beam reconstruction problem can be solved by solving the deconvolution problem in the DBP domain.

에서 디컨볼루션을 위한 인코더-디코더 CNN

at Encoder-decoder CNNs for deconvolution

상기 수학식 11은 디컨볼루션 알고리즘이 각 관심 평면

의 g(t, z)에서 원본 영상 f(t, z)를 복구할 수 있다고 제안하지만, 두 가지 기술적 어려움이 있다. 첫째, t에 의존하는 z₁(t) 및 z₂(t) 좌표 때문에 알려지지 않은 영상의 그리드는 표준 직교 그리드가 아니다. 실제로 f(t, z₁(t))와 f(t, z₂(t))는 새로운 좌표계 상 f(t, z)의 변형된 영상(deformed image)으로 볼 수 있다. 따라서, 컨볼루션 필터는 공간적으로 다르기 때문에 표준 디컨볼루션 방법은 작동하지 않는다. 둘째, 각 DBP 데이터 g(t, z)는 f(t, z₁(t))와 f(t, z₂(t))라는 두 개의 변형된 영상으로부터 기여하기 때문에 디컨볼루션 문제는 대단히 부적절하게 정의 되어진다. 따라서, 기존 연구(F. Dennerlein, F. Noo, H. Schondube, G. Lauritsch, and J. Hornegger, "A factorization approach for cone-beam reconstruction on a circular short-scan," IEEE Transactions on Medical Imaging, vol. 27, no. 7, pp. 887-896, 2008.)는 알려지지 않은 영상에 대하여 원래의 고정된 직교 그리드에 대하여 통합 이산 후 정규화된 행렬 반전 접근법을 제안하였다.Equation (11) is the deconvolution algorithm for each plane of interest.

We propose that we can recover the original image f(t, z) from g(t, z), but there are two technical difficulties. First, the grid of the unknown image is not a standard orthogonal grid because of the t- _{dependent z 1} (t) and z _{2 (t) coordinates.} In fact, f(t, z ₁ (t)) and f(t, z ₂ (t)) can be viewed as deformed images of f(t, z) on the new coordinate system. Therefore, the standard deconvolution method does not work because the convolution filters are spatially different. Second, since each DBP data g(t, z) _{contributes from two transformed images f(t, z 1} (t)) and f(t, z ₂ (t)), the deconvolution problem is very inappropriate. is well defined Therefore, previous studies (F. Dennerlein, F. Noo, H. Schondube, G. Lauritsch, and J. Hornegger, “A factorization approach for cone-beam reconstruction on a circular short-scan,” IEEE Transactions on Medical Imaging, vol. (27, no. 7, pp. 887-896, 2008.) proposed a normalized matrix inversion approach after integration discrete on the original fixed orthogonal grid for unknown images.

일반적으로 상기 수학식 11에 대한 디컨볼루션 알고리즘은 아래 <수학식 12>오 같이 매핑으로 나타낼 수 있다.In general, the deconvolution algorithm for Equation 11 can be expressed as a mapping as shown in Equation 12 below.

[수학식 12][Equation 12]

여기서,

는 DBP 데이터 g(t, z)가 있는 입력 공간을 의미하며,

는 영상 f(t, z)가 속한 공간을 의미할 수 있다. Tikhonov 정규화의 경우

는 폐쇄 형태를 가지고 있지만, l₁ 또는 총 변동(TV; total variation)과 같은 일반 정규화 함수에 대해서는 역 매핑

가 일반적으로 비선형이며, 계산적으로 비싼 반복법을 이용하여 찾아야 한다. 런타임 계산 복잡성을 줄이기 위한 빠른 해결책은 비선형 매핑

를 사전 계산하는 것이다. 불행하게도 매핑

는 입력에 의존하기 때문에 모든 입력에 대해

를 저장하는 것은 엄청난 양의 메모리를 필요로 할 것이고 실현 가능하지도 않을 것이다.here,

denotes the input space with DBP data g(t, z),

may mean a space to which the image f(t, z) belongs. For Tikhonov regularization

has a closed form, but is inversely mapped for general regularization functions such as _{l 1 or total variation (TV).}

is generally non-linear and must be found using computationally expensive iterative methods. A quick solution to reduce runtime computational complexity is non-linear mapping

is to be pre-calculated. unfortunately mapping

is input dependent, so for every input

Storing them would require a huge amount of memory and would not be feasible.

이런 점에서 정류된 선형 유닛(ReLU) 비선형성을 이용한 인코더-디코더 CNN(E-D CNN)은 이 문제를 해결할 수 있다. 구체적으로, 자기 연계를 위한 부록에도 간략하게 요약되어 있는 최근의 이론적 연구(J. C. Ye and W. K. Sung, "Understanding geometry of encoder-decoder CNNs," in Proceedings of the 36th International Conference on Machine Learning, ser. Proceedings of Machine Learning Research, K. Chaudhuri and R. Salakhutdinov, Eds., vol. 97. Long Beach, California, USA: PMLR, 09-15 Jun 2019, pp. 7064-7073.)에서, 본 발명의 출원인은 ReLU 비선형성을 가진 인코더-디코더 CNN이 많은 수의 국지적 선형 매핑을 생성한다는 것을 보여주었다. 더 구체적으로, 입력 공간

를 각 영역에 대한 입력이 공통의 선형 표현을 공유하는 겹치지 않는 영역으로 분할한다. 그런 다음, 각 입력에 대한 해당 선형 표현으로의 전환은 ReLU 활성화 패턴을 기반으로 즉시 수행할 수 있다.In this respect, an encoder-decoder CNN (ED CNN) using rectified linear unit (ReLU) nonlinearity can solve this problem. Specifically, recent theoretical studies (JC Ye and WK Sung, "Understanding geometry of encoder-decoder CNNs," in Proceedings of the 36th International Conference on Machine Learning, ser. Proceedings of In Machine Learning Research, K. Chaudhuri and R. Salakhutdinov, Eds., vol. 97. Long Beach, California, USA: PMLR, 09-15 Jun 2019, pp. 7064-7073.) Applicants of the present invention describe ReLU nonlinear We have shown that encoder-decoder CNNs with gender generate a large number of local linear mappings. More specifically, the input space

split into non-overlapping regions where the inputs to each region share a common linear representation. Then, the conversion to the corresponding linear representation for each input can be performed immediately based on the ReLU activation pattern.

딥 뉴럴 네트워크의 또 다른 유니크한 측면은 이렇게 기하급수적으로 많은 선형 표현들이 ReLU 비선형성의 조합적 특성 덕분에 작은 필터 세트에서 도출될 수 있다는 것이다. 특히, 네트워크 트레이닝은 필터 세트

를 추정하는 것으로, 아래 <수학식 13>과 같이 나타낼 수 있다.Another unique aspect of deep neural networks is that these exponentially many linear representations can be derived from a small set of filters thanks to the combinatorial nature of ReLU nonlinearity. In particular, network training is a filter set

, which can be expressed as <Equation 13> below.

[수학식 13][Equation 13]

여기서,

은 실측 자료(ground truth) 영상과 DBP 영상으로 구성된 트레이닝 데이터 세트를 의미하고,

는

에 의해 매개변수화된 역 매핑을 의미할 수 있다.here,

is a training data set consisting of a ground truth image and a DBP image,

Is

may mean inverse mapping parameterized by .

일반적으로 필터 매개변수

는 훨씬 더 작은 메모리와 공간을 요구한다. 반면에, 연관된 선형 표현들의 수는 네트워크 깊이, 너비, 스킵된 연결(skipped connection)과 함께 기하급수적으로 증가한다.Filter parameters in general

requires much less memory and space. On the other hand, the number of associated linear representations increases exponentially with network depth, width, and skipped connections.

이러한 표현과 입력 적응성은 영상 복원 문제에 대한 딥 뉴럴 네트워크 네트워크의 주요 성공 이유 중 하나로 여겨진다.This representation and input adaptability are considered to be one of the main reasons for the success of deep neural network networks in the image reconstruction problem.

스펙트럴spectral 블렌딩blending (spectral blending)(spectral blending)

인자화 방법은 각 관심 평면에 대해 딥 학습 접근으로 이어지지만, 이전 연구(M. Lee, Y. Han, J. P. Ward, M. Unser, and J. C. Ye, "Interior tomography using 1d generalized total variation. part ii: Multiscale implementation," SIAM Journal on Imaging Sciences, vol. 8, no. 4, pp. 2452-2486, 2015.)에서 누락된 주파수 영역은 관심 평면의 방향에 따라 다르다는 것을 보여주었다. 단, 이전 연구에서도 다른 필터링 방향에 대하여 누락된 주파수 영역을 적절히 조합하면 누락된 주파수 영역의 아티팩트를 최소화할 수 있음을 보여 주었다. 이를 자세히 설명하면 다음과 같다.The factorization method leads to a deep learning approach for each plane of interest, but a previous study (M. Lee, Y. Han, JP Ward, M. Unser, and JC Ye, "Interior tomography using 1d generalized total variation. part ii: Multiscale implementation," SIAM Journal on Imaging Sciences, vol. 8, no. 4, pp. 2452-2486, 2015.) showed that the missing frequency domain depends on the orientation of the plane of interest. However, previous studies also showed that artifacts in the missing frequency domain can be minimized by appropriately combining the missing frequency domains for different filtering directions. This is explained in detail as follows.

DBP 데이터에서 복셀을 x = (x, y, z)로 복구하는 데 관심이 있다고 가정한다. 도 5(a)는 점 x와 소스 궤도의 평면도를 나타낸 것으로, 4(a)에서 a_v(λ)는 해당하는 가상의 소스 위치를 의미한다.Suppose we are interested in recovering voxels from DBP data to x = (x, y, z). 5(a) is a plan view of a point x and a source trajectory. In 4(a), a _v (λ) means a corresponding virtual source position.

관심 평면

가 수평 방향으로 정렬되는 관상 방향(예를 들면, a_v(λ₁ ^-)과 a_v(λ₁ ⁺) 사이)의 영상 재구성을 고려해 본다. 이전 연구(M. Lee, Y. Han, J. P. Ward, M. Unser, and J. C. Ye, "Interior tomography using 1d generalized total variation. part ii: Multiscale implementation," SIAM Journal on Imaging Sciences, vol. 8, no. 4, pp. 2452-2486, 2015.)의 스펙트럴 분석에 따르면, DBP 결과 데이터는 도 5(b)와 같이 시상 방향을 따라 누락된 주파수 영역을 가지고 있다. 따라서, 관상 방향 DBP 데이터를 사용하는 디컨볼루션 알고리즘은 누락된 주파수 영역의 부적절한 정의로 인해 시상 방향을 따라 노이즈 부스팅을 가질 수 있다. 마찬가지로, 시상 방향(예를 들면 a_v(λ₂ ^-)과 a_v(λ₂ ⁺) 사이)의 DBP 데이터에 대해서는 도 5(c)에 해당 주파수 영역이 제시되어 디코볼루션 후 노이즈 부스팅이 발생한다. plane of interest

Consider the image reconstruction in the coronal direction (eg, between a _v (λ ₁ ^- ) and a _v (λ ₁ ^{+ )) in which is horizontally aligned.} A previous study (M. Lee, Y. Han, JP Ward, M. Unser, and JC Ye, "Interior tomography using 1d generalized total variation. part ii: Multiscale implementation," SIAM Journal on Imaging Sciences, vol. 8, no. 4, pp. 2452-2486, 2015.), the DBP result data has a missing frequency domain along the sagittal direction as shown in FIG. Therefore, the deconvolution algorithm using the coronal direction DBP data may have noise boosting along the sagittal direction due to improper definition of the missing frequency domain. Similarly, for DBP data in the sagittal direction (eg, between a _v (λ ₂ ^- ) and a _v (λ ₂ ⁺ )), the corresponding frequency domain is presented in FIG. 5(c), and noise boosting occurs after decovolution. do.

전체 스캔 콘빔 CT의 경우 3D 데이터의 2D 인자화에는 중복성이 있으며, 이를 이용하여 관상 방향과 시상 방향 모두 재구성 또는 복원 할 수 있고, 이를 결합할 수 있다. 도 5(d)는 두 방향의 재구성 결과를 조합하여 결과 누락된 주파수 영역을 나타낸 것이다. 특히 공통의 누락 주파수 영역은 원점을 중심으로 한 어두운 사각형으로, 관상 및 시상 처리에서 두 개의 누락된 주파수 영역의 교차점이다. 이는 공통 주파수 영역을 제외하고 관상 방향 처리에서 누락된 스펙트럴 구성요소는 시상 처리 결과에 의해 보상될 수 있으며, 그 반대의 경우도 마찬가지임을 의미한다.In the case of full-scan cone-beam CT, there is redundancy in the 2D digitization of 3D data. FIG. 5( d ) shows the missing frequency domain by combining the reconstruction results in two directions. In particular, the common missing frequency domain is a dark rectangle centered at the origin, which is the intersection of the two missing frequency domains in coronal and sagittal processing. This means that, except for the common frequency domain, the spectral component missing from coronal direction processing can be compensated for by the sagittal processing result, and vice versa.

도 6은 스펙트럴 블렌딩 기법을 설명하기 위한 일 예시도를 나타낸 것으로, 이러한 시너지 조합을 달성하기 위해 스펙트럴 블렌딩의 흐름도를 나타낸 것이다. 도 6(a)에 도시된 바와 같이, 관상 및 시상의 재구성(또는 복원)된 3D 볼륨의 축 영상(또는 슬라이스 영상) f^cor 및 f^sag는 2D 푸리에 변환을 사용하여 먼저 스펙트럴 영역으로 변환된다. 누락된 주파수 영역을 따라 부적절하게 정의되어 있으므로 도 6(b)의 노란색 화살표로 표시된 스트레이트 패턴의 아티팩트는 대개 누락된 주파수 영역을 따른 푸리에 영역에서 볼 수 있다. 그런 다음, 누락된 주파수 영역에서 신호를 억제하기 위해 보우(bow) 타입 스펙트럴 가중치를 적용하여 함께 결합한다. 이 과정은 수학적으로 아래 <수학식 14>와 같이 표현될 수 있다.6 shows an exemplary diagram for explaining a spectral blending technique, and shows a flowchart of spectral blending to achieve such a synergistic combination. As shown in Fig. 6(a), the axial images (or slice images) f ^cor and f ^sag of the coronal and sagittal reconstructed (or reconstructed) 3D volumes are first transformed into the spectral domain using a 2D Fourier transform. . Since it is improperly defined along the missing frequency domain, the artifact of the straight pattern indicated by the yellow arrow in Fig. 6(b) is usually visible in the Fourier domain along the missing frequency domain. Then, to suppress the signal in the missing frequency domain, a bow-type spectral weight is applied and combined together. This process can be mathematically expressed as <Equation 14> below.

[수학식 14][Equation 14]

여기서,

및

은 각각 푸리에 변환과 역 푸리에 변환을 의미하고, ω는 보우-타이(bow-tie) 스펙트럴 마스크를 의미하며,

는 요소별 곱(element-wise multiplication)을 의미할 수 있다.here,

and

denotes Fourier transform and inverse Fourier transform, respectively, ω denotes a bow-tie spectral mask,

may mean element-wise multiplication.

데이터 세트data set

본 발명에서는 American Association of Phsicists in Medicine(AAPM)의 저선량 CT Grand Challenge의 10개 주제 데이터 세트를 사용할 수 있다. 볼륨 영상 데이터에서 전방 콘빔 투영을 사용하여 콘빔 CT 사인파 데이터를 수치적으로 구한다. 검출기 수는 피치 1mm²의 1440×1440 요소이고, 뷰 수는 1200이며, 소스에서 발신지까지의 거리(DSO)는 500mm, 소스에서 검출기까지의 거리(DSD)는 1000mm일 수 있다. 최대 원추각는 35.8°로 계산된다. 영상의 x - y 크기는 512Х512이고 z 크기는 400 ~ 600이지만 환자마다 다를 수 있다. 환자 10명 중 8명의 환자 데이터를 트레이닝 세트로 사용하고, 환자 데이터 1명은 유효성 검사 세트로 사용할 수 있으며, 다른 환자 데이터는 테스트 세트에 사용될 수 있다. 본 발명에서의 뉴럴 네트워크는 관상 슬라이스와 시상 슬라이스에 따라 관심 평면에서 트레이닝되므로, 트레이닝 데이터 세트는 또한 관상 슬라이스와 시상 슬라이스로 구성된다. 이는 각각 트레이닝 및 유효성 검사를 위한 8192(=512Х8Х2), 1024(=512Х1Х2) 관상/시상 슬라이스에 해당한다. 물론, 본 발명에서의 뉴럴 네트워크는 관상 슬라이스와 시상 슬라이스에 따라 트레이닝되는 것으로 한정되지 않으며, 서로 다른 적어도 두 방향의 슬라이스들에 따라 트레이닝될 수 있고, 반드시 두 방향이 서로 직교하는 것으로 한정되지 않는다. 또한, 본 발명에서의 뉴럴 네트워크는 두 방향 이상의 슬라이스들에 대해 트레이닝되기 때문에 세 방향 이상의 슬라이스들을 이용하여 트레이닝될 수도 있다. 물론, 이러한 경우 세 방향 이상의 슬라이스들을 뉴럴 네트워크 입력으로 이용할 수도 있고, 두 방향 이상의 슬라이스들을 뉴럴 네트워크 입력으로 이용할 수도 있다.In the present invention, a 10-subject data set from the American Association of Phsicists in Medicine (AAPM) low-dose CT Grand Challenge can be used. Numerically obtain cone-beam CT sine wave data using anterior cone-beam projection from volumetric image data. The number of detectors may be ^{1440×1440 elements with a pitch of 1 mm 2} , the number of views may be 1200, the source to source distance (DSO) may be 500 mm, and the source to detector distance (DSD) may be 1000 mm. The maximum cone angle is calculated to be 35.8°. The x - y size of the image is 512Х512 and the z size is 400 to 600, but it may vary from patient to patient. Patient data from 8 out of 10 patients can be used as the training set, 1 patient data can be used as the validation set, and other patient data can be used as the test set. Since the neural network in the present invention is trained in the plane of interest according to coronal slices and sagittal slices, the training data set also consists of coronal slices and sagittal slices. This corresponds to 8192 (=512Х8Х2) and 1024 (=512Х1Х2) coronal/sagittal slices for training and validation, respectively. Of course, the neural network in the present invention is not limited to being trained according to the coronal slice and the sagittal slice, and may be trained according to slices in at least two different directions, and the two directions are not necessarily limited to being orthogonal to each other. In addition, since the neural network in the present invention is trained on slices in two or more directions, it may be trained using slices in three or more directions. Of course, in this case, slices in three or more directions may be used as a neural network input, and slices in two or more directions may be used as a neural network input.

또한, 본 발명은 네트워크 성능을 보다 정량적으로 조사하기 위해 수치 팬텀을 이용한 실험을 실시할 수 있다. 도 7은 디스크 팬텀에 대한 일 예시도를 나타낸 것으로, 인접한 디스크 사이의 간격과 두께가 동일한 디스크로 구성된 수치 팬텀의 전형적인 예를 보여준다. 디스크의 반경은 80mm일 수 있으며, 아래 <표 1>과 같이 크기, 반지름, 숫자, 두께, 간격을 변경하여 다양한 팬텀을 만들 수 있다. 여기서, 표 1의 (a) 내지 (c)는 미세 뉴닝을 위해 사용될 수 있으며, (d)는 테스트를 위해 사용될 수 있다.In addition, the present invention may conduct an experiment using a numerical phantom to more quantitatively investigate network performance. 7 shows an exemplary view of a disk phantom, and shows a typical example of a numerical phantom composed of disks having the same thickness and spacing between adjacent disks. The radius of the disk can be 80mm, and various phantoms can be made by changing the size, radius, number, thickness, and spacing as shown in Table 1 below. Here, (a) to (c) of Table 1 may be used for fine nuning, and (d) may be used for testing.

알고리즘 구현Algorithm implementation

도 8은 콘빔 아티팩트를 제거하는 뉴럴 네트워크의 입력과 출력에 대한 일 예시도를 나타낸 것으로, 본 발명의 딥 학습 방법의 입력과 출력에 대한 일 예시도를 나타낸 것이다. 도 8에 도시된 바와 같이, 두 방향의 영상 예를 들어, 관상 및 시상 뷰 DBP 영상이 모두 입력으로 사용되며, 아티팩트가 없는 관상 및 시상 뷰 영상이 라벨로 사용된다. 전체 스캔 콘빔 CT의 경우

평면의 DBP 영상은 두 개의 보완 소스 궤적, 즉 각 λ[λ^-, λ⁺]와 λ[0, 2π]\[λ^-, λ⁺]에서 a(λ)를 사용하여 얻을 수 있으며, 여기서 \은 차세트, 즉 A\B는 A∩B^c를 나타낸다. 이 경우 도 8과 같이 두 개의 DBP 영상을 모두 입력으로 사용한다.8 shows an exemplary diagram of an input and output of a neural network for removing cone beam artifacts, and shows an exemplary diagram of an input and output of the deep learning method of the present invention. As shown in FIG. 8 , images from two directions, for example, coronal and sagittal view DBP images are used as inputs, and artifact-free coronal and sagittal view images are used as labels. For full scan cone beam CT

A planar DBP image can be obtained using a(λ) from two complementary source trajectories: λ[λ ^- , λ ⁺ ] and λ[0, 2π]\[λ ^- , λ ^{+ ], where \} is the difference set, that is, A\B denotes A∩B ^c . In this case, both DBP images are used as inputs as shown in FIG. 8 .

도 9는 본 발명의 방법에서의 딥 뉴럴 네트워크 아키텍처에 대한 일 예시도를 나타낸 것으로, 동일한 뉴럴 네트워크 예를 들어, 하나의 뉴럴 네트워크가 DBP 영상 즉, 최초 영상 각각에 대해 사용될 수 있다.9 shows an exemplary diagram of a deep neural network architecture in the method of the present invention, and the same neural network, for example, one neural network may be used for each DBP image, that is, the initial image.

네트워크 백본은 U-Net의 수정된 아키텍처에 해당한다. 도 9에서의 뉴럴 네트워크는 선형 변환(linear transform) 연산을 수행하는 컨볼루션 레이어, 정규화(normalization) 연산을 수행하는 배치 노말라이제이션(batch normalization) 레이어, 비선형 함수(nonlinear function) 연산을 수행하는 ReLU(rectified linear unit) 레이어 및 연쇄를 가진 경로 연결(contracting path connection with concatenation)을 포함한다. 특히, 각 스테이지는 3 Х 3 커널들을 갖는 컨볼루션, 배치 노말라이제이션 및 ReLU 레이어들로 구성된 4개의 순차적인 레이어들을 포함한다. 마지막 스테이지는 두 개의 순차적인 레이어들과 마지막 레이어를 포함하며, 마지막 레이어는 최초 영상에 대한 복원 영상을 생성하는 1 Х 1 커널을 갖는 컨볼루션만을 포함한다. 각 컨볼루션 레이어에 대한 채널 수는 도 9에 도시되어 있으며, 각 풀링 레이어 후에 채널 수는 두 배가 되며, 계층들의 크기를 4배로 감소시킨다. 여기서, 풀링 레이어는 2 Х 2 평균 풀링 레이어일 수 있으며, 풀링 레이어는 각 스테이지들 사이에 위치할 수 있다. 또한 언풀링 레이어도 2 Х 2 평균 언풀링 레이어일 수 있다. 트레이닝 가능한 파라미터의 총 수는 약 22,000,000개 이다.The network backbone corresponds to the modified architecture of U-Net. The neural network in FIG. 9 is a convolution layer that performs a linear transform operation, a batch normalization layer that performs a normalization operation, and a ReLU that performs a nonlinear function operation. Includes a (rectified linear unit) layer and a contracting path connection with concatenation. In particular, each stage contains 4 sequential layers consisting of Convolution, Batch Normalization and ReLU layers with 3 Х 3 kernels. The last stage includes two sequential layers and a last layer, and the last layer includes only a convolution with a 1 Х 1 kernel that generates a reconstructed image for the first image. The number of channels for each convolutional layer is shown in Fig. 9, the number of channels is doubled after each pooling layer, and the size of the layers is reduced by a factor of four. Here, the pooling layer may be a 2 Х 2 average pooling layer, and the pooling layer may be located between each stage. Also, the unpooling layer may be a 2 Х 2 average unpooling layer. The total number of trainable parameters is about 22,000,000.

본 발명은 두가지 추가적인 알고리즘 예를 들어, FDK 재구성(FDK-CNN)을 이용한 딥 컨볼루션 뉴럴 네트워크와 TV 페널티가 부과된 MBIR 알고리즘을 구현하여 비교할 수 있다. FDK-CNN은 원추각으로 인한 불완전한 투영 데이터를 사용하여 FDK 알고리즘의 재구성된 영상에서 아티팩트가 없는 영상을 학습하도록 트레이닝 받았다. 입력 영상은 콘빔 아티팩트로 손상된 FDK 영상인 반면, 아티팩트가 없는 데이터는 실측 자료로 사용된다. 공정한 비교를 위해 관상 및 시상 뷰 영상은 본 발명의 DBP 도메인 네트워크와 유사한 네트워크 입력으로 사용된다. FDK 재구성은 모든 전체 스캔 각도에서 투영을 결합하므로 각 처리 방향에 대해 서로 다른 두 개의 FDK 영상을 얻을 수 없다. 이것이 관상 뷰 및 시상 뷰를 따라 두 개의 영상만 입력으로 제공하는 이유이다. FDK 도메인 뉴럴 네트워크에도 동일한 U-Net 아키텍처가 사용될 수 있다. 따라서, 본 발명의 DBP 도메인 심층 네트워크와의 차이점은 입력 영상 뿐이다.The present invention can be compared by implementing two additional algorithms, for example, a deep convolutional neural network using FDK reconstruction (FDK-CNN) and an MBIR algorithm to which a TV penalty is imposed. FDK-CNN was trained to learn artifact-free images from the reconstructed images of the FDK algorithm using incomplete projection data due to cone angle. The input image is an FDK image damaged by cone-beam artifacts, while the artifact-free data is used as the actual measurement data. For fair comparison, coronal and sagittal view images are used as network inputs similar to the DBP domain network of the present invention. Since FDK reconstruction combines projections from all full scan angles, it is not possible to obtain two different FDK images for each processing direction. This is why only two images along the coronal and sagittal views are provided as inputs. The same U-Net architecture can be used for FDK domain neural networks. Therefore, the only difference from the DBP domain deep network of the present invention is the input image.

총 변동(TV) 페널티가 부과된 재구성은 아래 <수학식 15>와 같은 최적화 문제를 해결함으로써 구현될 수 있다.Reconstruction to which a total variation (TV) penalty is imposed can be implemented by solving an optimization problem as shown in Equation 15 below.

[수학식 15][Equation 15]

여기서, y와 f는 측정된 사이노그램 데이터와 그것의 3D 영상을 의미하고, A는 시스템 행렬을 의미하며, ∇_x,y,z는 (x, y, z) 축을 따르는 차분 연산자를 의미하고, λ는 정규화 팩터를 의미할 수 있다.Here, y and f mean the measured sinogram data and its 3D image, A means the system matrix, ∇ _{x, y, z} means the difference operator along the (x, y, z) axis, , λ may mean a normalization factor.

정량적 평가를 위해 정규화된 평균 제곱 오차(NMSE) 값이 사용될 수 있으며, 이 값은 아래 <수학식 16>과 같이 나타낼 수 있다.For quantitative evaluation, a normalized mean square error (NMSE) value may be used, and this value may be expressed as in Equation 16 below.

[수학식 16][Equation 16]

여기서, f와 f^*는 각각 재구성된 영상과 실측 영상을 의미하고, M과 N은 픽셀의 행과 열의 수를 의미할 수 있다.Here, f and f ^* may mean a reconstructed image and a measured image, respectively, and M and N may mean the number of rows and columns of pixels.

또한, 아래 <수학식 17>과 같이 정의된 피크 신호 대 노이즈 비율(PSNR)을 사용할 수 있다.Also, a peak signal-to-noise ratio (PSNR) defined as in Equation 17 below may be used.

[수학식 17][Equation 17]

또한, 아래 <수학식 18>과 같이 정의된 구조 유사성 지수를 사용할 수 있다.In addition, a structural similarity index defined as in Equation 18 below may be used.

[수학식 18][Equation 18]

여기서, μ_f는 f의 평균을 의미하고, σ² _f는 f의 분산을 의미하며, σ_ff*는f와 f*의 교차 공분산을 의미하고, 분배를 안정시키기 위한 변수는 c₁ = (k₁L)²와 c₂ = (k₂L)²이며, L은 픽셀 강도의 동적 범위를 의미하고, k₁과 k₂는 상수로서 기본값은 k₁ = 0.01 및 k₂ = 0.03일 수 있다.where μ _f means the mean of f, σ ² _f means the variance of f, and σ _ff* ismeans the cross covariance of f and f*, the variables for stabilizing the distribution are c ₁ = (k ₁ L) ² and c ₂ = (k ₂ L) ² , L means the dynamic range of pixel intensity, k ₁ and k ₂ are constants, and the default values may be k ₁ = 0.01 and k ₂ = 0.03.

네트워크 트레이닝network training

본 발명은 MATLAB R2015a 환경에서 FDK 및 DBP 도메인 네트워크를 구현하기 위해 MatConvNet 툴박스(ver.24)를 사용할 수 있으며, 처리 유닛은 인텔 코어 i7-7700(3.60GHz) 중앙처리장치(CPU)와 GTX 1080Ti 그래픽처리장치(GPU)일 수 있다. l₂ 손실은 트레이닝의 객관적 기능으로 사용될 수 있으며, 트레이닝 네트워크 에포크(epoch)의 개수는 300개일 수 있다. 확률적 경사 하강법(stochastic gradient descent, SGD)이 네트워크 트레이닝을 위한 최적화기(optimier)로서 사용될 수 있다. 초기 학습율은 10^-4로, 각 에포크(epoch)에서 점차 10^-5로 떨어질 수 있으며, 정규화 파라미터(regularization parameter)는 10^-4일 수 있다. 데이터 증대를 위해 전체 데이터 세트를 수평 및 수직 플립으로 수행할 수 있다. 미니배치는 4로 사용되고, 입력패치의 크기는 256Х256이다. 컨볼루션 연산은 공간적으로 불변하므로, 트레이닝된 필터는 추정 단계에서 전체 입력 데이터에 사용될 수 있다. 트레이닝 시간은 일정 기간 예를 들어, 약 4일일 수 있다. The present invention can use MatConvNet toolbox (ver.24) to implement FDK and DBP domain networks in MATLAB R2015a environment, and the processing unit is an Intel Core i7-7700 (3.60GHz) central processing unit (CPU) and GTX 1080Ti graphics It may be a processing unit (GPU). l ₂ The loss may be used as an objective function of training, and the number of training network epochs may be 300. Stochastic gradient descent (SGD) can be used as an optimizer for network training. The initial learning rate may be 10 ^-4 ^{, and may gradually drop to 10 -5} at each epoch, and the regularization parameter may be ^{10 -4.} For data augmentation, the entire data set can be performed with horizontal and vertical flips. The mini-batch is used as 4, and the size of the input patch is 256Х256. Since the convolution operation is spatially invariant, the trained filter can be used on the entire input data in the estimation step. The training time may be a period of time, for example, about 4 days.

도 10은 에포크 수에 따른 목적함수의 수렴에 대한 일 예시도를 나타낸 것으로, 점선과 실선은 각각 트레이닝 및 검증 단계의 목적함수를 나타낸다. 도 10에 도시된 바와 같이, 트레이닝과 유효성 검사 곡선이 밀접하게 수렴되기 때문에 본 발명의 네트워크가 잘 트레이닝되어 있고 오버 피팅(over-fitted)되지 않은 것을 알 수 있다.10 shows an exemplary diagram of the convergence of the objective function according to the number of epochs, wherein the dotted line and the solid line indicate the objective function of the training and verification steps, respectively. As shown in Figure 10, it can be seen that the network of the present invention is well trained and not over-fitted because the training and validation curves converge closely.

수치 팬텀은 AAPM 데이터 세트와 다르기 때문에 FDK-CNN과 본 발명의 네트워크는 다른 디스크 팬텀의 재구성 성능을 평가하기 위해 디스크 팬텀의 부분 세트를 사용하여 미세 조정될 수 있다. 구체적으로는 상기 표 1의 (a)(b)(c)의 매개변수를 활용한 디스크 팬텀을 정밀 조정에 사용할 수 있고, (d)의 디스크 팬텀을 테스트 단계에서 사용할 수 있다.Since numerical phantoms are different from AAPM data sets, FDK-CNN and our network can be fine-tuned using subsets of disk phantoms to evaluate the reconstruction performance of other disk phantoms. Specifically, the disk phantom using the parameters of (a) (b) (c) of Table 1 can be used for precise adjustment, and the disk phantom of (d) can be used in the test step.

도 11과 도 12는 반복적 복원기법(Total variation), FDK-CNN과 본 발명의 방법에 의한 복원 성능을 비교한 예시도들을 나타낸 것으로, 복원 성능과 PSNR, NMSE 그리고 SSIM과 같은 다양한 정량적 지표들을 나타낸 것이다.11 and 12 show exemplary views comparing restoration performance by iterative restoration technique (total variation), FDK-CNN and the method of the present invention, and various quantitative indicators such as restoration performance and PSNR, NMSE and SSIM. will be.

도 11과 도 12를 통해 알 수 있듯이, FDK 복원 영상의 경우, 큰 콘 각을 갖는 부분에서 영상이 어두워지고 그 형상이 왜곡되는 것을 알 수 있으며, TV 복원방법의 경우 강도는 복원이 되지만, 그 텍스쳐와 형태적 왜곡이 충분히 복원되지 못하는 것을 알 수 있더. FDK-CNN의 경우 강도와 텍스쳐는 복원이 되었지만, 그 형태적 왜곡이 복원되지 못하는 반면 본 발명의 방법은 강도와 텍스쳐는 물론 영상의 왜곡까지도 온전히 복원하는 것을 알 수 있다.11 and 12, in the case of the FDK reconstruction image, it can be seen that the image is darkened and the shape is distorted in the portion having a large cone angle. In the case of the TV restoration method, the intensity is restored, but the It can be seen that the texture and morphological distortions are not sufficiently restored. In the case of FDK-CNN, the intensity and texture are restored, but the morphological distortion is not restored, while the method of the present invention completely restores the intensity and texture as well as the distortion of the image.

또한, 아래 <표 2>의 정량적 지표에서도 알 수 있듯이, 본 발명의 방법이 다른 알고리즘들과 비교하여도 모든 정량적 지표에서 우수한 성능을 보이는 것을 알 수 있다.In addition, as can be seen from the quantitative indicators in <Table 2> below, it can be seen that the method of the present invention shows excellent performance in all quantitative indicators compared to other algorithms.

또한, 본 발명은 컨볼루션 프레임렛 기반의 뉴럴 네트워크를 이용할 수 있으며, 컨볼루션 프레임렛은 입력신호에 대하여 국소 기저와 비국소 기저를 이용하여 표현하는 방식으로, 이에 대해 설명하면 다음과 같다.In addition, the present invention can use a neural network based on a convolutional framelet, and the convolutional framelet is a method of expressing an input signal using a local basis and a non-local basis, which will be described as follows.

컨볼루션 프레임렛은 입력신호 f에 대하여 국소 기저(

)와 비국소 기저 (

)를 이용하여 표현한 것으로, 아래 <수학식 19>와 같이 나타낼 수 있다.The convolutional framelet is a local basis (

) and non-localized basal (

), which can be expressed as in <Equation 19> below.

[수학식 19][Equation 19]

여기서,

는 비국소 기저 벡터를 가지는 선형 변환 연산을 의미하고,

는 국소 기저 벡터를 가지는 선형 변환 연산을 의미할 수 있다.here,

is a linear transformation operation with a non-local basis vector,

may mean a linear transformation operation having a local basis vector.

이 때, 국소 기저 벡터와 비국소 기저 벡터는 각각 서로 직교하는 듀얼 기저 벡터

와

를 가질 수 있으며, 기저 벡터들의 직교 관계는 아래 <수학식 20>과 같이 정의될 수 있다.In this case, the local basis vector and the non-local basis vector are dual basis vectors orthogonal to each other, respectively.

Wow

, and the orthogonal relationship between the basis vectors may be defined as in Equation 20 below.

[수학식 20][Equation 20]

상기 수학식 20을 이용하면 컨볼루션 프레임렛은 아래 <수학식 21>과 같이 표현할 수 있다.Using Equation 20, the convolutional framelet can be expressed as in Equation 21 below.

[수학식 21][Equation 21]

여기서,

는 행켈 행렬 연산(Hankel matrix operator)을 의미하는 것으로, 컨볼루션 연산을 행렬곱(matrix multiplication)으로 표현할 수 있게 해주며, C는 국소 기저와 비국소 기저에 의하여 변환된 신호인 컨볼루션 프레임렛 계수(convolution framelet coefficient)를 의미할 수 있다.here,

stands for the Hankel matrix operator, which allows the convolution operation to be expressed as matrix multiplication, and C is the convolutional framelet coefficient, which is a signal transformed by a local basis and a non-local basis. (convolution framelet coefficient).

컨볼루션 프레임렛 계수 C는 듀얼 기저 벡터

를 적용하여 본래의 신호로 복원될 수 있다. 신호 복원 과정은 아래 <수학식 22>와 같이 나타낼 수 있다.Convolutional framelet coefficients C are dual basis vectors

can be restored to the original signal by applying . The signal restoration process can be expressed as in Equation 22 below.

[수학식 22][Equation 22]

이와 같이, 국소 기저 및 비국소 기저를 통해 입력 신호를 표현하는 방식을 컨볼루션 프레임렛이라 한다.As described above, a method of representing an input signal through a local basis and a non-local basis is called a convolutional framelet.

이와 같이, 본 발명의 실시예에 따른 방법은 콘빔 아티팩트를 제거하는 뉴럴 네트워크를 이용하여 콘빔 단층촬영 영상으로 콘빔 아티팩트를 제거하고, 콘빔 아티팩트가 제거된 콘빔 영상을 입력으로 하는 스펙트럴 블렌딩 기법을 이용하여 잔존하는 노이즈를 제거함으로써, 콘빔 단층촬영 영상을 고품질의 영상으로 복원할 수 있다.As described above, the method according to an embodiment of the present invention removes the cone beam artifact with a cone beam tomography image using a neural network that removes the cone beam artifact, and uses a spectral blending technique in which the cone beam image from which the cone beam artifact is removed is input. By removing the residual noise, the cone-beam tomography image can be restored to a high-quality image.

도 13은 본 발명의 일 실시예에 따른 영상 처리 장치에 대한 구성을 나타낸 것으로, 도 1 내지 도 12의 방법을 수행하는 장치에 대한 구성을 나타낸 것이다.13 is a diagram illustrating a configuration of an image processing apparatus according to an embodiment of the present invention, and shows a configuration of an apparatus performing the method of FIGS. 1 to 12 .

도 13을 참조하면, 본 발명의 일 실시예에 따른 영상 처리 장치(1300)는 수신부(1310) 및 복원부(1320)를 포함한다.Referring to FIG. 13 , an image processing apparatus 1300 according to an embodiment of the present invention includes a receiving unit 1310 and a restoration unit 1320 .

수신부(1310)는 콘빔 전산단층 촬영 데이터 예를 들어, 투영 데이터를 수신한다.The receiver 1310 receives cone beam computed tomography data, for example, projection data.

복원부(1320)는 콘빔 아티팩트를 제거하는 뉴럴 네트워크를 이용하여 콘빔 전산단층 촬영 데이터에 대한 영상을 복원한다.The restoration unit 1320 reconstructs an image of the cone-beam computed tomography data using a neural network that removes cone-beam artifacts.

이 때, 복원부(1320)는 내부 전산단층 촬영 데이터를 미리 설정된 해석적 알고리즘을 이용하여 콘빔 전산단층 촬영 데이터로부터 적어도 두 개 이상의 방향에 대한 최초 영상들 예를 들어, 관상(coronal) 방향과 시상(sagittal) 방향 각각에 대한 최초 영상들을 획득하고, 최초 영상들(CNN input)을 입력으로 하는 뉴럴 네트워크를 이용하여 적어도 두 개 이상의 방향에 대한 최초 영상들로부터 콘빔 아티팩트를 제거한 영상(CNN output)을 출력함으로써, 콘빔 전산단층 촬영 데이터에 대한 영상을 복원할 수 있다. 여기서, 복원부(1320)는 직교하는 두 방향의 최초 영상을 획득할 수도 있고, 직교하지 않는 두 방향의 최초 영상을 획득할 수도 있으며, 뉴럴 네트워크를 이용하여 최초 영상들 각각의 콘빔 아티팩트를 제거함으로써, 콘빔 아티팩트가 제거된 두 방향 각각의 복원 영상을 획득할 수 있다. 해석적 알고리즘은 차분 역투영(differentiated backprojection, DBP) 알고리즘을 포함할 수 있다.At this time, the restoration unit 1320 sets the internal computed tomography data to the initial images in at least two directions from the cone beam computed tomography data using a preset analytic algorithm, for example, the coronal direction and the sagittal direction. First images for each (sagittal) direction are acquired, and an image (CNN output) from which cone beam artifacts are removed from the first images for at least two directions using a neural network inputting the first images (CNN input) By outputting the image, it is possible to reconstruct the image for the cone-beam computed tomography data. Here, the restoration unit 1320 may acquire initial images in two orthogonal directions or may acquire initial images in two non-orthogonal directions, and by using a neural network to remove cone beam artifacts in each of the initial images. , reconstructed images in each of two directions from which cone beam artifacts are removed may be obtained. The analytic algorithm may include a differential backprojection (DBP) algorithm.

복원부(1320)에서 이용하는 뉴럴 네트워크는 컨볼루션 프레임렛(convolution framelet) 기반의 뉴럴 네트워크를 포함할 수 있으며, 풀링(pooling) 레이어와 언풀링(unpooling) 레이어를 포함하는 다중 해상도 뉴럴 네트워크를 포함할 수 있다. 또한, 뉴럴 네트워크는 풀링 레이어에서 언풀링 레이어로의 바이패스 연결을 포함할 수 있다.The neural network used by the reconstruction unit 1320 may include a convolution framelet-based neural network, and may include a multi-resolution neural network including a pooling layer and an unpooling layer. can In addition, the neural network may include a bypass connection from the pooling layer to the unpooling layer.

나아가, 복원부(1320)는 스펙트럴 블렌딩 기법을 이용한 복원된 영상에 대한 최종 복원 영상을 생성한다.Furthermore, the reconstruction unit 1320 generates a final reconstructed image for the reconstructed image using the spectral blending technique.

여기서, 복원부(1320)는 콘빔 아티팩트가 제거된 적어도 두 개 이상의 방향에 대한 최초 영상들 각각을 이용하여 각 방향에 대한 최초 영상들을 통해 3D 영상을 복원하고, 복원된 3D 영상들 각각에 대한 다른 방향의 슬라이스 영상 페어를 입력으로 하는 스펙트럴 블렌딩 기법을 이용하여 잔존하는 노이즈를 제거함으로써, 최종 복원 영상을 생성할 수 있다.Here, the restoration unit 1320 reconstructs a 3D image through initial images for each direction by using each of the initial images for at least two directions from which the cone beam artifact has been removed, and another 3D image for each of the reconstructed 3D images. A final reconstructed image may be generated by removing residual noise using a spectral blending technique that takes a slice image pair in the direction as an input.

비록, 도 13의 장치에서 그 설명이 생략되었더라도, 도 10을 구성하는 각 구성 수단은 도 1 내지 도 12에서 설명한 모든 내용을 포함할 수 있으며, 이는 이 기술 분야에 종사하는 당업자에게 있어서 자명하다.Although the description of the device of FIG. 13 is omitted, each component constituting FIG. 10 may include all the contents described with reference to FIGS. 1 to 12 , which is apparent to those skilled in the art.

이상에서 설명된 장치는 하드웨어 구성요소, 소프트웨어 구성요소, 및/또는 하드웨어 구성요소 및 소프트웨어 구성요소의 조합으로 구현될 수 있다. 예를 들어, 실시예들에서 설명된 장치 및 구성요소는, 예를 들어, 프로세서, 콘트롤러, ALU(arithmetic logic unit), 디지털 신호 프로세서(digital signal processor), 마이크로컴퓨터, FPA(field programmable array), PLU(programmable logic unit), 마이크로프로세서, 또는 명령(instruction)을 실행하고 응답할 수 있는 다른 어떠한 장치와 같이, 하나 이상의 범용 컴퓨터 또는 특수 목적 컴퓨터를 이용하여 구현될 수 있다. 처리 장치는 운영 체제(OS) 및 상기 운영 체제 상에서 수행되는 하나 이상의 소프트웨어 애플리케이션을 수행할 수 있다. 또한, 처리 장치는 소프트웨어의 실행에 응답하여, 데이터를 접근, 저장, 조작, 처리 및 생성할 수도 있다. 이해의 편의를 위하여, 처리 장치는 하나가 사용되는 것으로 설명된 경우도 있지만, 해당 기술분야에서 통상의 지식을 가진 자는, 처리 장치가 복수 개의 처리 요소(processing element) 및/또는 복수 유형의 처리 요소를 포함할 수 있음을 알 수 있다. 예를 들어, 처리 장치는 복수 개의 프로세서 또는 하나의 프로세서 및 하나의 콘트롤러를 포함할 수 있다. 또한, 병렬 프로세서(parallel processor)와 같은, 다른 처리 구성(processing configuration)도 가능하다.The device described above may be implemented as a hardware component, a software component, and/or a combination of the hardware component and the software component. For example, devices and components described in the embodiments may include, for example, a processor, a controller, an arithmetic logic unit (ALU), a digital signal processor, a microcomputer, a field programmable array (FPA), It may be implemented using one or more general purpose or special purpose computers, such as a programmable logic unit (PLU), microprocessor, or any other device capable of executing and responding to instructions. The processing device may execute an operating system (OS) and one or more software applications running on the operating system. The processing device may also access, store, manipulate, process, and generate data in response to execution of the software. For convenience of understanding, although one processing device is sometimes described as being used, one of ordinary skill in the art will recognize that the processing device includes a plurality of processing elements and/or a plurality of types of processing elements. It can be seen that can include For example, the processing device may include a plurality of processors or one processor and one controller. Other processing configurations are also possible, such as parallel processors.

소프트웨어는 컴퓨터 프로그램(computer program), 코드(code), 명령(instruction), 또는 이들 중 하나 이상의 조합을 포함할 수 있으며, 원하는 대로 동작하도록 처리 장치를 구성하거나 독립적으로 또는 결합적으로(collectively) 처리 장치를 명령할 수 있다. 소프트웨어 및/또는 데이터는, 처리 장치에 의하여 해석되거나 처리 장치에 명령 또는 데이터를 제공하기 위하여, 어떤 유형의 기계, 구성요소(component), 물리적 장치, 가상 장치(virtual equipment), 컴퓨터 저장 매체 또는 장치에 구체화(embody)될 수 있다. 소프트웨어는 네트워크로 연결된 컴퓨터 시스템 상에 분산되어서, 분산된 방법으로 저장되거나 실행될 수도 있다. 소프트웨어 및 데이터는 하나 이상의 컴퓨터 판독 가능 기록 매체에 저장될 수 있다.The software may comprise a computer program, code, instructions, or a combination of one or more thereof, which configures a processing device to operate as desired or is independently or collectively processed You can command the device. The software and/or data may be any kind of machine, component, physical device, virtual equipment, computer storage medium or device, to be interpreted by or to provide instructions or data to the processing device. may be embodied in The software may be distributed over networked computer systems, and stored or executed in a distributed manner. Software and data may be stored in one or more computer-readable recording media.

실시예에 따른 방법은 다양한 컴퓨터 수단을 통하여 수행될 수 있는 프로그램 명령 형태로 구현되어 컴퓨터 판독 가능 매체에 기록될 수 있다. 상기 컴퓨터 판독 가능 매체는 프로그램 명령, 데이터 파일, 데이터 구조 등을 단독으로 또는 조합하여 포함할 수 있다. 상기 매체에 기록되는 프로그램 명령은 실시예를 위하여 특별히 설계되고 구성된 것들이거나 컴퓨터 소프트웨어 당업자에게 공지되어 사용 가능한 것일 수도 있다. 컴퓨터 판독 가능 기록 매체의 예에는 하드 디스크, 플로피 디스크 및 자기 테이프와 같은 자기 매체(magnetic media), CD-ROM, DVD와 같은 광기록 매체(optical media), 플롭티컬 디스크(floptical disk)와 같은 자기-광 매체(magneto-optical media), 및 롬(ROM), 램(RAM), 플래시 메모리 등과 같은 프로그램 명령을 저장하고 수행하도록 특별히 구성된 하드웨어 장치가 포함된다. 프로그램 명령의 예에는 컴파일러에 의해 만들어지는 것과 같은 기계어 코드뿐만 아니라 인터프리터 등을 사용해서 컴퓨터에 의해서 실행될 수 있는 고급 언어 코드를 포함한다. The method according to the embodiment may be implemented in the form of program instructions that can be executed through various computer means and recorded in a computer-readable medium. The computer-readable medium may include program instructions, data files, data structures, etc. alone or in combination. The program instructions recorded on the medium may be specially designed and configured for the embodiment, or may be known and available to those skilled in the art of computer software. Examples of the computer-readable recording medium include magnetic media such as hard disks, floppy disks and magnetic tapes, optical media such as CD-ROMs and DVDs, and magnetic such as floppy disks. - includes magneto-optical media, and hardware devices specially configured to store and carry out program instructions, such as ROM, RAM, flash memory, and the like. Examples of program instructions include not only machine language codes such as those generated by a compiler, but also high-level language codes that can be executed by a computer using an interpreter or the like.

이상과 같이 실시예들이 비록 한정된 실시예와 도면에 의해 설명되었으나, 해당 기술분야에서 통상의 지식을 가진 자라면 상기의 기재로부터 다양한 수정 및 변형이 가능하다. 예를 들어, 설명된 기술들이 설명된 방법과 다른 순서로 수행되거나, 및/또는 설명된 시스템, 구조, 장치, 회로 등의 구성요소들이 설명된 방법과 다른 형태로 결합 또는 조합되거나, 다른 구성요소 또는 균등물에 의하여 대치되거나 치환되더라도 적절한 결과가 달성될 수 있다.As described above, although the embodiments have been described with reference to the limited embodiments and drawings, various modifications and variations are possible by those skilled in the art from the above description. For example, the described techniques are performed in a different order than the described method, and/or the described components of the system, structure, apparatus, circuit, etc., are combined or combined in a different form than the described method, or other components Or substituted or substituted by equivalents may achieve an appropriate result.

그러므로, 다른 구현들, 다른 실시예들 및 특허청구범위와 균등한 것들도 후술하는 특허청구범위의 범위에 속한다.Therefore, other implementations, other embodiments, and equivalents to the claims are also within the scope of the following claims.

Claims

Receiving the cone beam computed tomography data in the receiver; and
reconstructing an image of the cone beam computed tomography data using a neural network that removes cone beam artifacts in a restoration unit
includes,
Restoring the image
An initial image in a first direction and an initial image in a second direction orthogonal to the first direction are obtained using a differential inverse projection algorithm, and the neural network is performed with respect to the initial image in the first direction and the initial image in the second direction. reconstructing a first image from which the cone beam artifact in the first direction is removed and a second image from which the cone beam artifact in the second direction is removed using a network;
Restoring the image
A first image in which the cone beam artifact in the first direction is removed using a spectral blending technique in which a first image from which the cone beam artifact in the first direction is removed and a second image from which the cone beam artifact in the second direction is removed are input Removes noise remaining in the first image and the second image from which the cone beam artifacts in the second direction are removed, and slices in the third direction using the image in the first direction and the image in the second direction from which the residual noise is removed An image processing method of acquiring images and generating a final reconstructed image of the cone beam computed tomography data using the obtained slice images in the third direction.

delete

According to claim 1,
The neural network is
An image processing method comprising a neural network based on a convolution framelet.

According to claim 1,
The neural network is
An image processing method comprising a multi-resolution neural network including a pooling layer and an unpooling layer.

8. The method of claim 7,
The neural network is
and bypass connection from the pooling layer to the unpooling layer.

a receiver for receiving cone-beam computed tomography data; and
A restoration unit that restores an image of the cone-beam computed tomography data using a neural network that removes cone-beam artifacts
includes,
the restoration unit
An initial image in a first direction and an initial image in a second direction orthogonal to the first direction are obtained using a differential inverse projection algorithm, and the neural network is performed with respect to the initial image in the first direction and the initial image in the second direction. reconstructing a first image from which the cone beam artifact in the first direction is removed and a second image from which the cone beam artifact in the second direction is removed using a network;
the restoration unit
A first image in which the cone beam artifact in the first direction is removed using a spectral blending technique in which a first image from which the cone beam artifact in the first direction is removed and a second image from which the cone beam artifact in the second direction is removed are input Removes noise remaining in the first image and the second image from which the cone beam artifacts in the second direction are removed, and slices in the third direction using the image in the first direction and the image in the second direction from which the residual noise is removed An image processing apparatus for acquiring images and generating a final reconstructed image of the cone-beam computed tomography data using the obtained slice images in the third direction.

delete

10. The method of claim 9,
The neural network is
An image processing apparatus comprising a neural network based on a convolution framelet.

10. The method of claim 9,
The neural network is
An image processing apparatus comprising a multi-resolution neural network including a pooling layer and an unpooling layer.

16. The method of claim 15,
The neural network is
and bypass connection from the pooling layer to the unpooling layer.