JP2023143628A - Image generation device, method and program, learning device, method and program, segmentation model, and image processing device, method and program

Info

Publication number: JP2023143628A
Application number: JP2022150250A
Authority: JP
Inventors: 朗子流石; Akiko Sasuga; 嘉郎北村; Yoshiro Kitamura; 彰工藤; Akira Kudo
Original assignee: Fujifilm Corp
Current assignee: Fujifilm Corp
Priority date: 2022-03-25
Filing date: 2022-09-21
Publication date: 2023-10-06

Abstract

To make it possible to provide a machine learning model that can perform segmentation accurately, in an image generation device, method, and program; a learning device, method, and program; a segmentation model; and an image processing device, method, and program.

SOLUTION: A processor obtains an original image and a mask image in which a mask is applied to one or more regions representing each of one or more objects, including a target object, in the original image; derives a pseudo mask image by processing the mask in the mask image; and derives, on the basis of the original image and the pseudo mask image, a pseudo image that has a region based on the mask included in the pseudo mask image and has the same representation format as the original image.

SELECTED DRAWING: Figure 3

Description

The present disclosure relates to an image generation device, method, and program; a learning device, method, and program; a segmentation model; and an image processing device, method, and program.

As a machine learning model for handling images, a convolutional neural network (hereinafter abbreviated as CNN) that performs semantic segmentation, identifying the objects in an image pixel by pixel, is known. For example, Non-Patent Document 1 proposes segmentation using a U-shaped convolutional neural network (U-Net).

In the medical field, machine learning models are also used to segment medical images and to judge the degree of disease progression in the segmented regions.

Training a machine learning model, however, requires a large amount of training data. In the medical field, data on rare diseases is difficult to collect, which makes it difficult to provide a machine learning model that can perform segmentation accurately.

For this reason, to train a machine learning model for detecting skin cancer, a method has been proposed that uses existing medical images, together with ground-truth data in which the skin cancer regions of those images are masked, to generate pseudo images containing skin cancers of various sizes (see Non-Patent Document 2). A method has also been proposed that learns the spatial distribution of three-dimensional objects with existing shapes, such as chairs, and generates an image of a chair with an unknown shape (see Non-Patent Document 3).

Non-Patent Document 1: Olaf Ronneberger et al., "U-Net: Convolutional Networks for Biomedical Image Segmentation", 2015
Non-Patent Document 2: Kumar Abhishek et al., "Mask2Lesion: Mask-Constrained Adversarial Skin Lesion Image Synthesis", 2019
Non-Patent Document 3: C. Nash et al., "The shape variational autoencoder: A deep generative model of part-segmented 3D objects", 2017

However, merely using ground-truth data for existing images, as in the method described in Non-Patent Document 2, cannot generate data with features that are rare in the existing training data. Consequently, even if the generated images are added to the training data, it is difficult to build a machine learning model that can accurately segment objects that rarely appear in the existing images. Likewise, merely changing the shape of an existing object, as in the method described in Non-Patent Document 3, makes it difficult to build a machine learning model that can accurately segment objects that differ from the existing ones. In particular, for rare diseases such as advanced cancer, the cancer tissue has often invaded the surrounding areas, so accurate segmentation of such advanced cancers is desired.

The present disclosure has been made in view of the above circumstances, and aims to make it possible to provide a machine learning model that can perform segmentation accurately.

An image generation device according to a first aspect of the present disclosure includes at least one processor.
The processor obtains an original image and a mask image in which a mask is applied to one or more regions representing each of one or more objects, including a target object, in the original image,
derives a pseudo mask image by processing the mask in the mask image, and
derives, based on the original image and the pseudo mask image, a pseudo image that has a region based on the mask included in the pseudo mask image and has the same representation format as the original image.
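
The three steps of the first aspect can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and not the processing of the embodiment: the mask image is taken to be a 2-D grid of integer labels, the "processing" is a plain 4-neighbour dilation of the target label, and `generator` is a hypothetical callable standing in for whatever model derives the pseudo image.

```python
from typing import Callable, List, Tuple

Grid = List[List[int]]


def dilate_label(mask: Grid, label: int, steps: int = 1) -> Grid:
    """Grow the region carrying `label` by `steps` pixels (4-neighbourhood).

    Assumption: growing a region is only one possible way to "process" a mask.
    Neighbouring labels are overwritten, loosely mimicking a region that
    spreads into its surroundings. The input grid is not modified.
    """
    height, width = len(mask), len(mask[0])
    current = [row[:] for row in mask]
    for _ in range(steps):
        grown = [row[:] for row in current]
        for y in range(height):
            for x in range(width):
                if current[y][x] != label:
                    continue
                for dy, dx in ((-1, 0), (1, 0), (0, -1), (0, 1)):
                    ny, nx = y + dy, x + dx
                    if 0 <= ny < height and 0 <= nx < width:
                        grown[ny][nx] = label
        current = grown
    return current


def derive_pseudo_pair(
    original: Grid,
    mask: Grid,
    target_label: int,
    generator: Callable[[Grid, Grid], Grid],
    steps: int = 1,
) -> Tuple[Grid, Grid]:
    """Return (pseudo mask, pseudo image) for one original image and its mask.

    `generator` is a hypothetical stand-in: it receives the original image and
    the pseudo mask and returns an image in the same format as the original.
    """
    pseudo_mask = dilate_label(mask, target_label, steps)
    pseudo_image = generator(original, pseudo_mask)
    return pseudo_mask, pseudo_image
```

In this sketch the pseudo mask and the pseudo image are returned as a pair, so that the pair could later serve as one training sample.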

An image generation device according to a second aspect of the present disclosure is the image generation device according to the first aspect, in which the pseudo mask image and the pseudo image may be used as training data for training a segmentation model that segments objects included in an image.

An image generation device according to a third aspect of the present disclosure is the image generation device according to the second aspect, in which the processor may accumulate the pseudo mask image and the pseudo image as training data.

An image generation device according to a fourth aspect of the present disclosure is the image generation device according to any one of the first to third aspects, in which the processor may derive a pseudo mask image from which a pseudo image can be generated that includes a target object of a class different from the class indicated by the target object.

Here, "the class is different" means that the type of shape of the target object differs or, if the target object is a lesion included in a medical image, that the degree of progression of the lesion differs. A "different class" also means a class that, within the training data on hand, appears less frequently than other classes or does not appear at all. Accordingly, "deriving a pseudo mask image from which a pseudo image can be generated that includes a target object of a class different from the class indicated by the target object" makes it possible to prepare training data for classes that are scarce in, or absent from, the existing training data. Training the segmentation model with such training data together with the existing training data therefore makes it possible to build a segmentation model that can segment the target object in images containing target objects that appear infrequently.
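
As a rough illustration of "a class that appears less frequently than other classes or does not appear at all", the following sketch counts how often each class occurs in the training data on hand. The class names, the `max_share` threshold, and the function name are all hypothetical; the disclosure does not prescribe how scarcity is measured.

```python
from collections import Counter
from typing import Iterable, List


def rare_classes(
    sample_classes: Iterable[str],
    all_classes: Iterable[str],
    max_share: float = 0.1,
) -> List[str]:
    """List the classes whose share of the training samples is at most `max_share`.

    Classes that never occur have a share of 0 and are therefore included.
    The threshold is an illustrative assumption, not a value from the disclosure.
    """
    counts = Counter(sample_classes)
    total = sum(counts.values())
    rare = []
    for name in all_classes:
        share = counts[name] / total if total else 0.0
        if share <= max_share:
            rare.append(name)
    return rare


# Hypothetical example: nine samples of one class, one of another, none of a third.
existing = ["early"] * 9 + ["advanced"]
candidates = rare_classes(existing, ["early", "advanced", "invasive"], max_share=0.2)
```

Classes reported by such a count would be natural candidates for pseudo mask generation, since they are the ones the existing training data covers least.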

An image generation device according to a fifth aspect of the present disclosure is the image generation device according to any one of the first to fourth aspects, in which, for a medical image, the processor may derive the pseudo mask image by processing the mask, based on a lesion shape evaluation index that is used as an evaluation index in clinical practice, so that at least one of the shape and the degree of progression of the lesion differs from that of the lesion included in the original image.

An image generation device according to a sixth aspect of the present disclosure is the image generation device according to any one of the first to fifth aspects, in which, for a medical image, the processor may derive the pseudo mask image by processing the mask of a normal organ until it takes a shape that would be evaluated as a lesion based on a clinical measurement index.

An image generation device according to a seventh aspect of the present disclosure is the image generation device according to any one of the first to sixth aspects, in which the processor may refer to at least one style image having a predetermined density, color, or texture and generate a pseudo image having a density, color, or texture according to the style image.

A "style image" is an image representing an object of the same type as the target object, with a density, color, and texture that the target object could have.

An image generation device according to an eighth aspect of the present disclosure is the image generation device according to any one of the first to seventh aspects, in which the processor may receive an instruction on the degree of processing of the mask and derive the pseudo mask image by processing the mask based on the instruction.

An image generation device according to a ninth aspect of the present disclosure is the image generation device according to the eighth aspect, in which the processor may accept, as the instruction on the degree of processing, a designation of the position of an end point of the processed mask and a designation of the processing amount.

An image generation device according to a tenth aspect of the present disclosure is the image generation device according to the eighth or ninth aspect, in which the processor may accept the instruction on the degree of processing of the mask in accordance with preset constraint conditions.
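
The eighth to tenth aspects describe an instruction made up of an end-point position and a processing amount, accepted under preset constraints. A minimal sketch of one such arrangement follows. The constraints used here (the end point must lie inside the image, the amount must lie between zero and a maximum) and all names are assumptions made for illustration; this passage does not state what the constraint conditions are.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass(frozen=True)
class MaskEditInstruction:
    """Hypothetical container for one instruction on the degree of processing."""
    end_point: Tuple[int, int]  # (row, column) of the mask end point after processing
    amount: float               # processing amount, in arbitrary illustrative units


def apply_constraints(
    instruction: MaskEditInstruction,
    image_shape: Tuple[int, int],
    max_amount: float,
) -> MaskEditInstruction:
    """Clamp an instruction so that it satisfies the assumed constraints."""
    height, width = image_shape
    row, col = instruction.end_point
    clamped_point = (min(max(row, 0), height - 1), min(max(col, 0), width - 1))
    clamped_amount = min(max(instruction.amount, 0.0), max_amount)
    return MaskEditInstruction(clamped_point, clamped_amount)


# An out-of-range request is pulled back into the permitted range.
accepted = apply_constraints(MaskEditInstruction((12, -3), 9.5), (10, 10), 5.0)
```

Clamping is only one design choice; an implementation could equally reject an out-of-range instruction and ask the user to enter it again.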

An image generation device according to an eleventh aspect of the present disclosure is the image generation device according to any one of the first to tenth aspects, in which, when the original image includes a plurality of objects and partial regions of the target object and of an object other than the target object are in an inclusion relationship with each other, the regions in the inclusion relationship may be given, in the mask image, a mask different from that of the regions not in the inclusion relationship.
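
One way to picture the eleventh aspect is a label map in which the area shared by the target object and another object carries its own label. The sketch below assumes two binary 2-D grids and hypothetical label values; it illustrates the idea of a distinct mask for the overlapping region and is not the labelling scheme of the embodiment.

```python
from typing import List

Grid = List[List[int]]


def label_with_overlap(
    target: Grid,
    other: Grid,
    target_label: int = 1,
    other_label: int = 2,
    overlap_label: int = 3,
) -> Grid:
    """Combine two binary masks into one label map.

    Pixels covered by both masks receive `overlap_label`, so the overlapping
    region is distinguishable from the parts of either object that do not
    overlap. Pixels covered by neither mask are labelled 0 (background).
    """
    labelled = []
    for target_row, other_row in zip(target, other):
        row = []
        for in_target, in_other in zip(target_row, other_row):
            if in_target and in_other:
                row.append(overlap_label)
            elif in_target:
                row.append(target_label)
            elif in_other:
                row.append(other_label)
            else:
                row.append(0)
        labelled.append(row)
    return labelled
```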

An image generation device according to a twelfth aspect of the present disclosure is the image generation device according to the eleventh aspect, in which, when the other object in the inclusion relationship is an object that is fixed in the original image, the processor may derive the pseudo mask image by processing the mask applied to the target object so that it conforms to the shape of the mask applied to the fixed object.

An image generation device according to a thirteenth aspect of the present disclosure is the image generation device according to any one of the first to twelfth aspects, in which, when the original image is a three-dimensional image, the processor may derive the pseudo mask image by processing the mask while maintaining the three-dimensional continuity of the mask applied to the region of the target object.
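
One possible reading of "maintaining three-dimensional continuity" is that the processed mask should still form a single connected region across slices. The following sketch checks that property with a breadth-first search over 6-connected voxels. Both this reading and the choice of 6-connectivity are assumptions; the disclosure does not define continuity in these terms here.

```python
from collections import deque
from typing import List

Volume = List[List[List[int]]]  # indexed as volume[z][y][x]

# Face-sharing neighbours only (6-connectivity), an illustrative choice.
NEIGHBOUR_OFFSETS = (
    (-1, 0, 0), (1, 0, 0),
    (0, -1, 0), (0, 1, 0),
    (0, 0, -1), (0, 0, 1),
)


def is_connected_3d(volume: Volume, label: int) -> bool:
    """Return True if all voxels carrying `label` form one connected region."""
    voxels = {
        (z, y, x)
        for z, plane in enumerate(volume)
        for y, row in enumerate(plane)
        for x, value in enumerate(row)
        if value == label
    }
    if not voxels:
        return True  # an empty mask has nothing that could be disconnected
    start = next(iter(voxels))
    seen = {start}
    queue = deque([start])
    while queue:
        z, y, x = queue.popleft()
        for dz, dy, dx in NEIGHBOUR_OFFSETS:
            neighbour = (z + dz, y + dy, x + dx)
            if neighbour in voxels and neighbour not in seen:
                seen.add(neighbour)
                queue.append(neighbour)
    return len(seen) == len(voxels)
```

A mask edit that is applied slice by slice could be followed by such a check, and an edit that splits the region into separate pieces could then be rejected or repaired.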

An image generation device according to a fourteenth aspect of the present disclosure is the image generation device according to any one of the first to thirteenth aspects, in which the original image may be a three-dimensional medical image and
the target object may be a lesion included in the medical image.

An image generation device according to a fifteenth aspect of the present disclosure is the image generation device according to the fourteenth aspect, in which the medical image may include the rectum of a human body,
the target object may be rectal cancer, and the objects other than the target object may be at least one of the mucosal layer of the rectum, the submucosa of the rectum, the muscularis propria of the rectum, the subserosa of the rectum, and the background other than these.

An image generation device according to a sixteenth aspect of the present disclosure is the image generation device according to the fourteenth aspect, in which the medical image may include a joint of a human body,
the target object may be a bone forming the joint, and the object other than the target object may be the background other than the bones forming the joint.

A learning device according to a seventeenth aspect of the present disclosure includes at least one processor.
The processor builds a segmentation model that segments the regions of one or more objects, including a target object, in an input image by performing machine learning using, as training data, a plurality of pairs of a pseudo image and a pseudo mask image generated by the image generation device according to any one of the first to sixteenth aspects of the present disclosure.

A learning device according to an eighteenth aspect of the present disclosure is the learning device according to the seventeenth aspect, in which the processor may build the segmentation model by performing machine learning that additionally uses a plurality of pairs of an original image and a mask image as training data.
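
The seventeenth and eighteenth aspects amount to training on pseudo pairs alone or on pseudo pairs together with original pairs. A minimal sketch of the combined case follows; the pairs are represented by placeholder strings, and the shuffling with a fixed seed is an illustrative choice and not something the disclosure specifies.

```python
import random
from typing import List, Sequence, Tuple, TypeVar

Image = TypeVar("Image")
Mask = TypeVar("Mask")


def build_training_set(
    original_pairs: Sequence[Tuple[Image, Mask]],
    pseudo_pairs: Sequence[Tuple[Image, Mask]],
    seed: int = 0,
) -> List[Tuple[Image, Mask]]:
    """Merge (image, mask) pairs from both sources and shuffle them reproducibly."""
    combined = list(original_pairs) + list(pseudo_pairs)
    random.Random(seed).shuffle(combined)
    return combined


# Placeholder identifiers stand in for actual image and mask data.
real = [("original_a", "mask_a"), ("original_b", "mask_b")]
pseudo = [("pseudo_a", "pseudo_mask_a")]
training_set = build_training_set(real, pseudo, seed=1)
```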

A segmentation model according to a nineteenth aspect of the present disclosure is built by the learning device according to the seventeenth or eighteenth aspect of the present disclosure.

An image processing device according to a twentieth aspect of the present disclosure includes at least one processor.
The processor uses the segmentation model according to the nineteenth aspect of the present disclosure to segment the regions of one or more objects, including a target object, in a target image to be processed, thereby deriving a mask image in which the one or more objects included in the target image are masked.

An image processing device according to a twenty-first aspect of the present disclosure is the image processing device according to the twentieth aspect, in which the processor may determine the class of the target object masked in the mask image by using a discriminant model that determines the class of a target object included in a mask image.

An image generation method according to a twenty-second aspect of the present disclosure includes: obtaining an original image and a mask image in which a mask is applied to one or more regions representing each of one or more objects, including a target object, in the original image;
deriving a pseudo mask image by processing the mask in the mask image; and
deriving, based on the original image and the pseudo mask image, a pseudo image that has a region based on the mask included in the pseudo mask image and has the same representation format as the original image.

A learning method according to a twenty-third aspect of the present disclosure builds a segmentation model that segments the regions of one or more objects, including a target object, in an input image by performing machine learning using, as training data, a plurality of pairs of a pseudo image and a pseudo mask image generated by the image generation method according to the twenty-second aspect of the present disclosure.

An image processing method according to a twenty-fourth aspect of the present disclosure uses the segmentation model according to the nineteenth aspect of the present disclosure to segment the regions of one or more objects, including a target object, in a target image to be processed, thereby deriving a mask image in which the one or more objects included in the target image are masked.

An image generation program according to a twenty-fifth aspect of the present disclosure causes a computer to execute: a procedure of obtaining an original image and a mask image in which a mask is applied to one or more regions representing each of one or more objects, including a target object, in the original image;
a procedure of deriving a pseudo mask image by processing the mask in the mask image; and
a procedure of deriving, based on the original image and the pseudo mask image, a pseudo image that has a region based on the mask included in the pseudo mask image and has the same representation format as the original image.

A learning program according to a twenty-sixth aspect of the present disclosure causes a computer to execute a procedure of building a segmentation model that segments the regions of one or more objects, including a target object, in an input image by performing machine learning using, as training data, a plurality of pairs of a pseudo image and a pseudo mask image generated by the image generation method according to the twenty-second aspect of the present disclosure.

An image processing program according to a twenty-seventh aspect of the present disclosure causes a computer to execute a procedure of using the segmentation model according to the nineteenth aspect of the present disclosure to segment the regions of one or more objects, including a target object, in a target image to be processed, thereby deriving a mask image in which the one or more objects included in the target image are masked.

According to the present disclosure, a machine learning model that can perform segmentation accurately can be provided.

A diagram showing a schematic configuration of a diagnosis support system to which the image generation device, learning device, and image processing device according to an embodiment of the present disclosure are applied
A diagram showing the hardware configuration of the image generation device and the learning device according to this embodiment
A functional configuration diagram of the image generation device and the learning device according to this embodiment
A diagram schematically showing a cross section of the rectum to explain the progression of rectal cancer
A diagram showing an example of an original image and a mask image
A diagram showing the mask processing screen for rectal cancer
A diagram showing the mask processing screen for rectal cancer
A diagram showing the mask processing screen for rectal cancer
A diagram showing another example of mask processing
A diagram showing another example of mask processing
A diagram schematically showing the generator and its training
A diagram showing a pseudo mask and a pseudo image
A diagram showing the hardware configuration of the image processing device according to this embodiment
A functional configuration diagram of the image processing device according to this embodiment
A diagram showing the display screen
A flowchart of the image generation processing in this embodiment
A flowchart of the learning processing in this embodiment
A flowchart of the image processing in this embodiment
A diagram showing the mask processing screen for bone spurs
A diagram showing the mask processing screen for bone spurs
A diagram showing another example of the mask processing screen
A diagram showing another example of the mask processing screen
A diagram showing another example of the mask processing screen
A diagram showing another example of the mask processing screen
A diagram showing another example of the mask processing screen
A diagram showing another example of the mask processing screen
A diagram showing another example of the mask processing screen
A diagram showing another example of the mask processing screen
A diagram showing another example of the mask processing screen
A diagram showing another example of the mask processing screen

Embodiments of the present disclosure will be described below with reference to the drawings. First, the configuration of a medical information system to which the image generation device, learning device, and image processing device according to this embodiment are applied will be described. FIG. 1 is a diagram showing a schematic configuration of the medical information system. In the medical information system shown in FIG. 1, a computer 1 that contains the image generation device and the learning device according to this embodiment, a computer 2 that contains the image processing device according to this embodiment, an imaging device 3, and an image storage server 4 are connected via a network 5 so that they can communicate with one another.

The computer 1 contains the image generation device and the learning device according to this embodiment, and has the image generation program and the learning program of this embodiment installed on it. The computer 1 may be a workstation or a personal computer, or a server computer connected to them via a network. The image generation program and the learning program are stored, in an externally accessible state, in a storage device of a server computer connected to the network or in network storage, and are downloaded to and installed on the computer 1 on request. Alternatively, they are distributed on a recording medium such as a DVD (Digital Versatile Disc) or a CD-ROM (Compact Disc Read Only Memory) and installed on the computer 1 from that medium.

The computer 2 contains the image processing device according to this embodiment, and has the image processing program of this embodiment installed on it. The computer 2 may be a workstation or a personal computer, or a server computer connected to them via a network. The image processing program is stored, in an externally accessible state, in a storage device of a server computer connected to the network or in network storage, and is downloaded to and installed on the computer 2 on request. Alternatively, it is distributed on a recording medium such as a DVD or a CD-ROM and installed on the computer 2 from that medium.

The imaging device 3 generates a three-dimensional image representing a part of a subject by imaging the part to be diagnosed; specifically, it is a CT device, an MRI device, a PET (Positron Emission Tomography) device, or the like. The three-dimensional image generated by the imaging device 3, which consists of a plurality of tomographic images, is transmitted to the image storage server 4 and stored there. In this embodiment, the imaging device 3 is an MRI device and generates an MRI image of the human body that is the subject as the three-dimensional image. Also in this embodiment, the three-dimensional image includes the vicinity of the rectum of the human body. Therefore, when a patient with rectal cancer is imaged, the three-dimensional image includes the rectal cancer.

The image storage server 4 is a computer that stores and manages various data, and includes a large-capacity external storage device and database management software. The image storage server 4 communicates with other devices via the wired or wireless network 5 to send and receive image data and the like. Specifically, it acquires various data, including the image data of the three-dimensional images generated by the imaging device 3, via the network and stores and manages the data on a recording medium such as the large-capacity external storage device. The image storage server 4 also stores training data for building the machine learning models that are used, as described later, to derive pseudo images, detect abnormal regions, and determine the class of abnormal regions. The storage format of the image data and the communication between the devices via the network 5 are based on a protocol such as DICOM (Digital Imaging and Communications in Medicine).

Next, the image generation device and the learning device according to this embodiment will be described. FIG. 2 is a diagram showing the hardware configuration of the image generation device and the learning device according to this embodiment. As shown in FIG. 2, the image generation device and learning device 20 (hereinafter represented by the image generation device) includes a CPU (Central Processing Unit) 11, a nonvolatile storage 13, and a memory 16 serving as a temporary storage area. The image generation device 20 also includes a display 14 such as a liquid crystal display, an input device 15 such as a keyboard and a mouse, and a network I/F (InterFace) 17 connected to the network 5. The CPU 11, the storage 13, the display 14, the input device 15, the memory 16, and the network I/F 17 are connected to a bus 18. The CPU 11 is an example of the processor in the present disclosure.

ストレージ１３は、ＨＤＤ（Hard Disk Drive）、ＳＳＤ（Solid State Drive）、およびフラッシュメモリ等によって実現される。記憶媒体としてのストレージ１３には、画像生成プログラム１２Ａおよび学習プログラム１２Ｂが記憶される。ＣＰＵ１１は、ストレージ１３から画像生成プログラム１２Ａおよび学習プログラム１２Ｂを読み出してメモリ１６に展開し、展開した画像生成プログラム１２Ａおよび学習プログラム１２Ｂを実行する。 The storage 13 is realized by an HDD (Hard Disk Drive), an SSD (Solid State Drive), a flash memory, or the like. The storage 13 as a storage medium stores an image generation program 12A and a learning program 12B. The CPU 11 reads the image generation program 12A and the learning program 12B from the storage 13, expands them into the memory 16, and executes the expanded image generation program 12A and learning program 12B.

次いで、本実施形態による画像生成装置および学習装置の機能的な構成を説明する。図３は、本実施形態による画像生成装置および学習装置の機能的な構成を示す図である。図３に示すように画像生成装置２０は、情報取得部２１、疑似マスク導出部２２、疑似画像導出部２３および学習部２４を備える。そして、ＣＰＵ１１が画像生成プログラム１２Ａを実行することにより、ＣＰＵ１１は、情報取得部２１、疑似マスク導出部２２、および疑似画像導出部２３として機能する。また、ＣＰＵが学習プログラム１２Ｂを実行することにより、ＣＰＵ１１は学習部２４として機能する。 Next, the functional configurations of the image generation device and learning device according to this embodiment will be explained. FIG. 3 is a diagram showing the functional configuration of the image generation device and the learning device according to this embodiment. As shown in FIG. 3, the image generation device 20 includes an information acquisition section 21, a pseudo mask derivation section 22, a pseudo image derivation section 23, and a learning section 24. When the CPU 11 executes the image generation program 12A, the CPU 11 functions as an information acquisition section 21, a pseudo mask derivation section 22, and a pseudo image derivation section 23. Furthermore, the CPU 11 functions as the learning section 24 by executing the learning program 12B.

情報取得部２１は、後述する疑似画像を導出するために使用される原画像Ｇ０を画像保管サーバ４から取得する。また、情報取得部２１は、後述する学習済みモデルを構築するための教師データを画像保管サーバ４から取得する。 The information acquisition unit 21 acquires an original image G0 used for deriving a pseudo image, which will be described later, from the image storage server 4. The information acquisition unit 21 also acquires teacher data for constructing a learned model, which will be described later, from the image storage server 4.

ここで、本実施形態においては、原画像Ｇ０は、原画像Ｇ０に含まれる物体の領域にマスクが付与されたマスク画像Ｍ０と併せて保管されている。また、原画像Ｇ０に直腸がんが含まれる場合、直腸がんのステージを表す情報が原画像Ｇ０に付与されている。 Here, in this embodiment, the original image G0 is stored together with a mask image M0 in which a mask is applied to the object area included in the original image G0. Furthermore, when the original image G0 includes rectal cancer, information representing the stage of the rectal cancer is added to the original image G0.

マスクの付与は、原画像Ｇ０に対して入力デバイス１５を用いたマニュアル操作により行ってもよく、原画像Ｇ０をセグメンテーションすることにより行ってもよい。セグメンテーションとしては、画像の全ピクセルをピクセル単位でラベリングすることによりクラス分類を行うセマンティックセグメンテーションが用いられる。セマンティックセグメンテーションは、画像に含まれる物体の領域を抽出するように機械学習がなされることにより構築された機械学習モデルであるセマンティックセグメンテーションモデルにより行われる。セマンティックセグメンテーションモデルについては後述する。なお、画像保管サーバ４に保管される原画像Ｇ０およびマスク画像Ｍ０は、後述する本実施形態の画像処理装置により導出されたものであってもよい。 The mask may be applied to the original image G0 by manual operation using the input device 15, or by segmenting the original image G0. Semantic segmentation is used for segmentation, which performs class classification by labeling all pixels of an image pixel by pixel. Semantic segmentation is performed using a semantic segmentation model, which is a machine learning model constructed by machine learning to extract the region of an object included in an image. The semantic segmentation model will be described later. Note that the original image G0 and mask image M0 stored in the image storage server 4 may be those derived by the image processing device of this embodiment described later.

本実施形態においては、後述する画像処理装置において、処理対象となる対象画像から直腸がんを検出する。このため、原画像Ｇ０は、原画像Ｇ０に含まれる直腸がん、直腸の粘膜層、直腸の粘膜下層、直腸の固有筋層、直腸の漿膜下層およびこれら以外の背景のそれぞれの領域を識別するためのマスクがマスク画像Ｍ０として原画像Ｇ０に付与されている。原画像Ｇ０に含まれる直腸がんが対象物体の一例であり、直腸の粘膜層、直腸の粘膜下層、直腸の固有筋層、直腸の漿膜下層およびこれら以外の背景が、対象物体以外の他の物体の一例である。 In the present embodiment, rectal cancer is detected from a target image to be processed by an image processing apparatus to be described later. Therefore, the original image G0 identifies the respective areas of rectal cancer, the mucosal layer of the rectum, the submucosa of the rectum, the muscularis propria of the rectum, the subserosa of the rectum, and other background areas included in the original image G0. A mask for this purpose is added to the original image G0 as a mask image M0. The rectal cancer included in the original image G0 is an example of the target object, and the mucosal layer of the rectum, the submucosa of the rectum, the muscularis propria of the rectum, the subserosa of the rectum, and the background other than these are other than the target object. This is an example of an object.

ここで、初期の直腸がんは粘膜層にのみ存在するが、直腸がんが進行すると粘膜層から外側に広がり、粘膜下層および固有筋層へ浸潤し、粘膜下層および固有筋層と包含関係を有するものとなる。図４は直腸がんの進行を説明するための直腸の断面を模式的に示す図である。図４に示すように、直腸３０は、粘膜層３１、粘膜下層３２、固有筋層３３および漿膜下層３４からなる。図４には、がんの進行度合い、すなわちがんのステージに応じた直腸がん３５の領域にハッチングを付与して示す。図４に示すように、初期のステージＴ１，Ｔ２の直腸がん３５は直腸３０の粘膜層３１に位置するが、直腸がん３５が進行すると粘膜下層３２へ浸潤し（ステージＴ３ａｂ）、さらに固有筋層３３へ浸潤し（ステージＴ３ｃｄ）、徐々に漿膜下層３４へ近づく（ステージＴ３ＭＲＦ＋）。さらに進行すると固有筋層３３および漿膜下層３４を突き破り（ステージＴ４ａ）、他の臓器にまで到達する（ステージＴ４ｂ）。なお、直腸がんの各ステージが本開示の異なるクラスに対応する。 Here, early rectal cancer exists only in the mucosal layer, but as the rectal cancer progresses it spreads outward from the mucosal layer, invades the submucosa and the muscularis propria, and comes to have an inclusion relationship with the submucosa and the muscularis propria. FIG. 4 is a diagram schematically showing a cross section of the rectum to explain the progression of rectal cancer. As shown in FIG. 4, the rectum 30 consists of a mucosal layer 31, a submucosal layer 32, a muscularis propria 33, and a subserosal layer 34. In FIG. 4, the region of the rectal cancer 35 corresponding to each degree of progression, that is, each stage of the cancer, is shown hatched. As shown in FIG. 4, the rectal cancer 35 at the early stages T1 and T2 is located in the mucosal layer 31 of the rectum 30, but as the rectal cancer 35 progresses it invades the submucosal layer 32 (stage T3ab), further invades the muscularis propria 33 (stage T3cd), and gradually approaches the subserosal layer 34 (stage T3MRF+). As it progresses further, it breaks through the muscularis propria 33 and the subserosal layer 34 (stage T4a) and reaches other organs (stage T4b). Note that each stage of rectal cancer corresponds to a different class in the present disclosure.

図５は原画像Ｇ０およびマスク画像Ｍ０の例を示す図である。図５に示す原画像Ｇ０は直腸における直腸がんが存在する部分を直腸の中心軸に交わる方向に切断した断層面を示している。図５に示す原画像Ｇ０においては、直腸がん３５が粘膜層３１から外側に広がり、粘膜下層３２および固有筋層３３へ浸潤し、その結果、直腸がん３５と粘膜層３１、粘膜下層３２および固有筋層３３とは包含関係を有するものとなっている。なお、図５において直腸がん３５は漿膜下層３４には浸潤していない。 FIG. 5 is a diagram showing an example of the original image G0 and the mask image M0. The original image G0 shown in FIG. 5 shows a tomographic plane obtained by cutting the portion of the rectum where rectal cancer exists in a direction intersecting the central axis of the rectum. In the original image G0 shown in FIG. 5, the rectal cancer 35 spreads outward from the mucosal layer 31 and invades the submucosal layer 32 and the muscularis propria 33; as a result, the rectal cancer 35 has an inclusion relationship with the mucosal layer 31, the submucosal layer 32, and the muscularis propria 33. Note that in FIG. 5 the rectal cancer 35 has not invaded the subserosal layer 34.

本実施形態においては、原画像Ｇ０において包含関係にある領域は、包含関係にない領域とは異なるマスクが付与される。例えば、図５に示すマスク画像Ｍ０は、粘膜層３１、粘膜下層３２、固有筋層３３および漿膜下層３４にはそれぞれマスクＭ１，Ｍ２，Ｍ３，Ｍ４が付与されている。直腸がん３５において直腸のいずれの組織にも包含関係にない領域にはマスクＭ５が付与されている。直腸がん３５において粘膜層３１と包含関係にある領域にはマスクＭ６が付与され、粘膜下層３２と包含関係にある領域にはマスクＭ７が付与され、固有筋層３３と包含関係にある領域にはマスクＭ８が付与される。なお、図５においては背景のマスクは省略している。 In this embodiment, regions of the original image G0 that are in an inclusion relationship are given masks different from those of regions that are not. For example, in the mask image M0 shown in FIG. 5, masks M1, M2, M3, and M4 are applied to the mucosal layer 31, the submucosal layer 32, the muscularis propria 33, and the subserosal layer 34, respectively. A mask M5 is applied to the region of the rectal cancer 35 that is not in an inclusion relationship with any tissue of the rectum. Within the rectal cancer 35, a mask M6 is applied to the region in an inclusion relationship with the mucosal layer 31, a mask M7 to the region in an inclusion relationship with the submucosal layer 32, and a mask M8 to the region in an inclusion relationship with the muscularis propria 33. Note that the background mask is omitted in FIG. 5.
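The mask assignment described above can be illustrated with a minimal sketch. The numeric label values, the function name `compose_mask_image`, and the input representation (an integer tissue-layer map plus a boolean cancer mask) are assumptions made for illustration only; the disclosure names the masks M1 to M8 but does not specify how they are encoded. Cancer overlapping the subserosal layer is not covered by FIG. 5, so this sketch simply lets it fall under M5.

```python
import numpy as np

# Hypothetical numeric labels; the source only names the masks M1..M8.
BACKGROUND = 0
M1_MUCOSA, M2_SUBMUCOSA, M3_MUSCULARIS, M4_SUBSEROSA = 1, 2, 3, 4
M5_CANCER_ONLY = 5
# Cancer that overlaps a tissue layer receives its own mask (M6..M8).
OVERLAP_LABEL = {M1_MUCOSA: 6, M2_SUBMUCOSA: 7, M3_MUSCULARIS: 8}


def compose_mask_image(layer_labels, cancer):
    """Merge a tissue-layer label map and a boolean cancer mask into one label map."""
    layer_labels = np.asarray(layer_labels)
    cancer = np.asarray(cancer, dtype=bool)
    out = layer_labels.copy()
    # Start by marking every cancer pixel as "cancer only" (M5).
    out[cancer] = M5_CANCER_ONLY
    # Then relabel cancer pixels that coincide with a tissue layer (M6..M8).
    for layer, overlap in OVERLAP_LABEL.items():
        out[cancer & (layer_labels == layer)] = overlap
    return out


layers = np.array([[0, 1, 2, 3, 4]])
cancer = np.array([[True, True, True, True, False]])
mask_image = compose_mask_image(layers, cancer)
```

Keeping the overlap regions as separate labels lets a single-label-per-pixel segmentation model still express that a pixel belongs to both the cancer and a tissue layer.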

なお、マスク画像Ｍ０をセグメンテーションモデルにより導出する場合、セマンティックセグメンテーションモデルにより包含関係にある領域をセグメンテーションするためには、包含関係にある領域と包含関係にない領域とを異なるようにマスクした正解データを含む教師データを用意して機械学習を行うことにより、セグメンテーションモデルを構築すればよい。 Note that when the mask image M0 is derived by a segmentation model, a semantic segmentation model capable of segmenting regions in an inclusion relationship can be constructed by preparing teacher data whose correct data masks regions in an inclusion relationship differently from regions that are not, and performing machine learning with that teacher data.

疑似マスク導出部２２は、原画像Ｇ０のマスク画像Ｍ０に含まれるマスクを加工することにより疑似マスク画像を導出する。このために、表示制御部７４がマスク加工画面をディスプレイ１４に表示する。図６はマスク加工画面を示す図である。図６に示すようにマスク加工画面４０には、マスク画像Ｍ０、マスク画像Ｍ０に含まれるマスクＭ０が加工されることにより得られるマスクＭｆ０を含む疑似マスク画像Ｍｆ０、原画像Ｇ０に含まれる直腸がんの３次元モデル４１、および３次元モデル４１の変形の程度を設定するためのプルダウンメニュー２６、マスクの加工を実行させるための加工ボタン２７、および疑似画像を導出させるための変換ボタン２８が表示されている。 The pseudo mask deriving unit 22 derives a pseudo mask image by processing the mask included in the mask image M0 of the original image G0. For this purpose, the display control unit 74 displays a mask processing screen on the display 14. FIG. 6 is a diagram showing the mask processing screen. As shown in FIG. 6, the mask processing screen 40 displays the mask image M0, a pseudo mask image Mf0 including a mask Mf0 obtained by processing the mask M0 included in the mask image M0, a three-dimensional model 41 of the rectal cancer included in the original image G0, a pull-down menu 26 for setting the degree of deformation of the three-dimensional model 41, a processing button 27 for executing mask processing, and a conversion button 28 for deriving a pseudo image.

なお、図６に示すマスク画像Ｍ０および疑似マスク画像Ｍｆ０は、直腸における直腸がんが存在する部分を直腸の中心軸に交わる方向に切断した断層面を示している。また、ここでは説明のために、図６に示すマスク画像Ｍ０は原画像Ｇ０において加工を行う直腸がんの領域にのみマスクＭｓ０が付与された画像となっている。また、疑似マスク画像Ｍｆ０には加工される直腸がんのマスクのみに参照符号Ｍｓｆ０を付与している。図６に示すマスク画像Ｍ０と疑似マスク画像Ｍｆ０とはマスクの加工前であるため、それぞれに付与されたマスクＭｓ０，Ｍｓｆ０は同一形状である。ここで、図６に示す直腸がんは、図４に示す直腸がんのステージとしてはステージＴ３ａｂである。 Note that the mask image M0 and pseudo-mask image Mf0 shown in FIG. 6 show tomographic planes obtained by cutting a portion of the rectum where rectal cancer exists in a direction intersecting the central axis of the rectum. For the purpose of explanation, the mask image M0 shown in FIG. 6 is an image in which a mask Ms0 is applied only to the rectal cancer region to be processed in the original image G0. Further, in the pseudo mask image Mf0, only the rectal cancer mask to be processed is given the reference code Msf0. Since the mask image M0 and the pseudo mask image Mf0 shown in FIG. 6 are before mask processing, the masks Ms0 and Msf0 given to each have the same shape. Here, the rectal cancer shown in FIG. 6 is at stage T3ab as the stage of the rectal cancer shown in FIG. 4.

３次元モデル４１は、原画像Ｇ０における直腸がんの領域のみを抽出してボリュームレンダリングすることにより導出された３次元的な画像である。操作者は入力デバイス１５を操作することにより、３次元モデル４１の全方向の向きを変更することができる。 The three-dimensional model 41 is a three-dimensional image derived by extracting only the region of rectal cancer in the original image G0 and performing volume rendering. By operating the input device 15, the operator can change the orientation of the three-dimensional model 41 in all directions.

プルダウンメニュー２６は直腸がんのステージを選択可能とされている。すなわち、プルダウンメニュー２６は、図４に示す直腸がんのステージＴ１、Ｔ２、Ｔ３ａｂ、Ｔ３ｃｄ、Ｔ３ＭＲＦ＋、Ｔ４ａおよびＴ４ｂを選択可能となっている。 The pull-down menu 26 allows selection of the stage of rectal cancer. That is, the pull-down menu 26 allows selection of rectal cancer stages T1, T2, T3ab, T3cd, T3MRF+, T4a, and T4b shown in FIG. 4.

本実施形態においては、疑似マスク導出部２２は、対象物体が示すクラスとは異なるクラスの対象物体を含む疑似画像を生成可能な疑似マスク画像を導出する。すなわち、疑似マスク画像Ｍｆ０の導出に際して、疑似マスク導出部２２は、原画像Ｇ０に含まれる直腸がんのステージとは異なるステージの直腸がんを含む疑似画像を生成可能なように３次元モデル４１を変形させてマスクを加工する。例えば、原画像Ｇ０に含まれる直腸がんのステージが粘膜層にのみ存在するステージＴ１である場合、直腸がんの３次元モデル４１を固有筋層まで伸びるように変形することにより、３次元モデル４１は、例えばステージＴ１から進んだステージＴ３ａｂの直腸がんに相当するものとなる。さらに、直腸がんの３次元モデル４１を固有筋層を突き破るように変形することにより、３次元モデル４１はステージＴ４ａの直腸がんに相当するものとなる。 In the present embodiment, the pseudo mask deriving unit 22 derives a pseudo mask image from which a pseudo image can be generated that includes a target object of a class different from the class of the target object in the original image. That is, when deriving the pseudo mask image Mf0, the pseudo mask deriving unit 22 processes the mask by deforming the three-dimensional model 41 so that a pseudo image can be generated that includes rectal cancer at a stage different from that of the rectal cancer included in the original image G0. For example, if the rectal cancer included in the original image G0 is at stage T1, in which it exists only in the mucosal layer, deforming the three-dimensional model 41 of the rectal cancer so that it extends to the muscularis propria makes the three-dimensional model 41 correspond to rectal cancer at, for example, stage T3ab, which is more advanced than stage T1. Furthermore, deforming the three-dimensional model 41 of the rectal cancer so that it breaks through the muscularis propria makes the three-dimensional model 41 correspond to rectal cancer at stage T4a.

このため、操作者は、表示されたマスク画像Ｍ０を見て、疑似画像として生成する直腸がんのステージをプルダウンメニュー２６から選択する。図６においては、ステージＴ３ＭＲＦ＋が選択された状態を示している。直腸がんのステージの選択後、操作者により加工ボタン２７が選択されると、疑似マスク導出部２２は、３次元モデル４１により表される直腸がんのステージがＴ３ＭＲＦ＋となるように３次元モデル４１を変形する。さらに、疑似マスク導出部２２は、疑似マスク画像Ｍｆ０に付与されているマスクＭｓｆ０を、変形した３次元モデル４１の形状に適合するように加工する。 Therefore, the operator looks at the displayed mask image M0 and selects, from the pull-down menu 26, the stage of rectal cancer to be generated as a pseudo image. FIG. 6 shows a state in which stage T3MRF+ is selected. When the operator selects the processing button 27 after selecting the stage of rectal cancer, the pseudo mask deriving unit 22 deforms the three-dimensional model 41 so that the stage of the rectal cancer represented by the three-dimensional model 41 becomes T3MRF+. Furthermore, the pseudo mask deriving unit 22 processes the mask Msf0 given to the pseudo mask image Mf0 so that it conforms to the shape of the deformed three-dimensional model 41.

なお、疑似マスク画像Ｍｆ０の導出に際して、疑似マスク導出部２２は、医用画像に関して臨床において評価指標となっている病変形状評価指標に基づいて、病変の形状および／または進行度が原画像Ｇ０に含まれる病変とは異なるものとなるように３次元モデル４１を変形する。すなわち、原画像Ｇ０に含まれる直腸がんの形状および／または進行度が、例えばステージＴ３ａｂからステージＴ３ＭＲＦ＋となるように、直腸がんの形状を変形する。あるいは、医用画像に関して正常な臓器を臨床の計測指標に基づいて病変と評価される形状となるまで３次元モデル４１を変形する。 Note that when deriving the pseudo mask image Mf0, the pseudo mask deriving unit 22 deforms the three-dimensional model 41, based on a lesion shape evaluation index that serves as a clinical evaluation index for medical images, so that the shape and/or degree of progression of the lesion differs from that of the lesion included in the original image G0. That is, the shape of the rectal cancer is deformed so that the shape and/or degree of progression of the rectal cancer included in the original image G0 changes, for example, from stage T3ab to stage T3MRF+. Alternatively, the three-dimensional model 41 is deformed until a normal organ in the medical image takes a shape that would be evaluated as a lesion based on clinical measurement indices.

また、疑似マスク導出部２２は、直腸がんに付与されたマスクの３次元的な連続性を保持しつつマスクを加工する。例えば、３次元モデル４１を直腸の外側に向けて延ばすように変形する際に、３次元モデル４１における直腸の中心から離れるほど変形の程度を小さくする。これにより、元の３次元モデル４１における３次元的な連続性を保持しつつ３次元モデル４１を変形することができ、その結果、直腸がんに付与されたマスクの３次元的な連続性を保持しつつマスクを加工することができる。 Further, the pseudo mask deriving unit 22 processes the mask while maintaining the three-dimensional continuity of the mask applied to the rectal cancer. For example, when deforming the three-dimensional model 41 so as to extend it toward the outside of the rectum, the degree of deformation is made smaller with increasing distance from the center of the rectum in the three-dimensional model 41. This allows the three-dimensional model 41 to be deformed while maintaining the three-dimensional continuity of the original three-dimensional model 41, and as a result the mask applied to the rectal cancer can be processed while maintaining its three-dimensional continuity.
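As one possible reading of the distance-dependent deformation described above, the following sketch pushes the surface points of a model radially outward with a weight that decays with distance from the rectal center. The exponential weight, the function name `extend_outward`, and the parameters `amount` and `falloff` are assumptions for illustration; the disclosure states only that the degree of deformation is made smaller farther from the center.

```python
import numpy as np


def extend_outward(points, center, amount, falloff):
    """Move points radially away from `center`; farther points move less.

    points  : (N, 3) array of model surface points (hypothetical representation)
    center  : (3,) rectal center
    amount  : displacement that a point located at the center direction limit would get
    falloff : length scale of the exponential decay (assumed weighting)
    """
    points = np.asarray(points, dtype=float)
    offsets = points - np.asarray(center, dtype=float)
    dist = np.linalg.norm(offsets, axis=1, keepdims=True)
    # Unit radial direction; a point exactly at the center has no direction and stays put.
    direction = np.divide(offsets, dist, out=np.zeros_like(offsets), where=dist > 0)
    weight = np.exp(-dist / falloff)  # smaller weight farther from the center
    return points + amount * weight * direction


pts = np.array([[1.0, 0.0, 0.0], [4.0, 0.0, 0.0]])
moved = extend_outward(pts, center=[0.0, 0.0, 0.0], amount=2.0, falloff=2.0)
```

With this particular weight, the radial map r -> r + amount * exp(-r / falloff) is non-decreasing whenever amount <= falloff, so points keep their radial order and the deformed surface does not fold over itself, which is one simple way to keep the shape continuous.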

また、３次元モデル４１は直腸がんに対応し、直腸がんのステージを進行させるように３次元モデル４１を変形することは、直腸がんを直腸の粘膜下層さらには固有筋層に浸潤させることとなる。ここで、直腸がんは進行により拡大したり変形したりするが、その変形の仕方は直腸の形状に依存する。すなわち、直腸がんは直腸の形状に合うように拡大したり変形したりする。ここで、直腸は原画像Ｇ０内においては移動および変形することなく固定されている。このため、疑似マスク導出部２２は、直腸がんの周囲にある固定された粘膜下層および固有筋層に付与されたマスクの形状に合わせて３次元モデル４１を変形する。これにより、直腸の形状に合った自然な形状の直腸がんを表す疑似マスク画像Ｍｆ０を導出することができる。 Furthermore, the three-dimensional model 41 corresponds to the rectal cancer, and deforming the three-dimensional model 41 so as to advance the stage of the rectal cancer amounts to making the rectal cancer invade the submucosa and then the muscularis propria of the rectum. Rectal cancer expands and deforms as it progresses, but the way it deforms depends on the shape of the rectum. That is, rectal cancer expands and deforms to fit the shape of the rectum. Here, the rectum is fixed within the original image G0 and neither moves nor deforms. For this reason, the pseudo mask deriving unit 22 deforms the three-dimensional model 41 in accordance with the shapes of the masks applied to the fixed submucosal layer and muscularis propria around the rectal cancer. This makes it possible to derive a pseudo mask image Mf0 that represents rectal cancer with a natural shape matching the shape of the rectum.

図７にはマスク加工画面４０において、３次元モデル４１がステージＴ３ａｂの直腸がんからステージＴ３ＭＲＦ＋の直腸がんを表すものとなるように変形されて領域４１Ａが付加された状態を示している。３次元モデル４１の変形に伴い、疑似マスク画像Ｍｆ０には、マスク画像Ｍ０のマスクＭｓ０が加工されたマスクＭｓｆ０が付与されている。 FIG. 7 shows a state in which the three-dimensional model 41 has been transformed from stage T3ab rectal cancer to stage T3MRF+ rectal cancer and a region 41A has been added on the mask processing screen 40. As the three-dimensional model 41 is deformed, a mask Msf0 obtained by processing the mask Ms0 of the mask image M0 is added to the pseudo mask image Mf0.

なお、３次元モデル４１の変形の程度を操作者が指示できるようにしてもよい。図８は３次元モデルの変形の程度を操作者が指示可能なマスク加工画面を示す図である。なお、図８において図６と同一の構成要素については同一の参照番号を付与し、ここでは詳細な説明は省略する。図８に示すようにマスク加工画面４０には、図６に示すプルダウンメニュー２６および加工ボタン２７に代えて、３次元モデル４１の変形の程度を調整するためのスケール４２が表示されている。 Note that the operator may be able to instruct the degree of deformation of the three-dimensional model 41. FIG. 8 is a diagram showing a mask processing screen on which the operator can instruct the degree of deformation of the three-dimensional model. Note that in FIG. 8, the same reference numerals are given to the same components as in FIG. 6, and detailed explanations are omitted here. As shown in FIG. 8, the mask processing screen 40 displays a scale 42 for adjusting the degree of deformation of the three-dimensional model 41, in place of the pull-down menu 26 and processing button 27 shown in FIG.

図８に示すマスク加工画面４０Ａにおいて、操作者は、入力デバイス１５を用いてスケール４２の摘子４２Ａを移動させることにより、３次元モデル４１の変形の程度を指定する。これにより疑似マスク導出部２２は、指定された位置を起点として３次元モデル４１を延ばすように変形し、さらに疑似マスク画像Ｍｆ０に付与されているマスクＭｓｆ０を、伸ばした３次元モデル４１の形状に適合するように加工する。この場合においても、疑似マスク導出部２２は、操作者が指定した医用画像に関して臨床において評価指標となっている病変形状評価指標に基づいて、病変の形状および／または進行度が原画像Ｇ０に含まれる病変とは異なるものとなるように３次元モデル４１を変形する。また、疑似マスク導出部２２は、直腸がんに付与されたマスクの３次元的な連続性を保持しつつマスクを加工する。 On the mask processing screen 40A shown in FIG. 8, the operator specifies the degree of deformation of the three-dimensional model 41 by moving the knob 42A of the scale 42 using the input device 15. In response, the pseudo mask deriving unit 22 deforms the three-dimensional model 41 so as to extend it from the specified position, and further processes the mask Msf0 given to the pseudo mask image Mf0 so that it conforms to the shape of the extended three-dimensional model 41. In this case as well, the pseudo mask deriving unit 22 deforms the three-dimensional model 41, based on an operator-specified lesion shape evaluation index that serves as a clinical evaluation index for medical images, so that the shape and/or degree of progression of the lesion differs from that of the lesion included in the original image G0. Further, the pseudo mask deriving unit 22 processes the mask while maintaining the three-dimensional continuity of the mask applied to the rectal cancer.

なお、疑似マスク画像Ｍｆ０の導出は、上記のように指定された直腸がんのステージに応じたものには限定されない。例えば、浸潤リンパ節のような複数の棘状突起を有するマスクを生成するために、図６に示すマスク加工画面４０において、直腸がんのステージを選択するプルダウンメニュー２６に加えて、またはこれに代えて棘状突起の付与の有無を選択するためのプルダウンメニューを表示するようにしてもよい。また、図８に示すスケール４２に加えて、棘状突起の付与の有無を選択するためのプルダウンメニューを表示するようにしてもよい。この場合、マスク加工画面４０の３次元モデル４１は、図９に示すように浸潤リンパ節４３が付与されたものとなり、疑似マスク画像Ｍｆ０は直腸がんのマスクＭｓｆ０に浸潤リンパ節４３の領域が追加されたものとなる。 Note that the derivation of the pseudo mask image Mf0 is not limited to one that depends on the stage of rectal cancer specified as described above. For example, in order to generate a mask having multiple spinous processes, such as infiltrated lymph nodes, the mask processing screen 40 shown in FIG. 6 may display a pull-down menu for selecting whether to add spinous processes, in addition to or in place of the pull-down menu 26 for selecting the stage of rectal cancer. A pull-down menu for selecting whether to add spinous processes may also be displayed in addition to the scale 42 shown in FIG. 8. In this case, the three-dimensional model 41 on the mask processing screen 40 has infiltrated lymph nodes 43 added to it as shown in FIG. 9, and the pseudo mask image Mf0 is the rectal cancer mask Msf0 with the regions of the infiltrated lymph nodes 43 added.

また、血管浸潤のような小さな突起を有するマスクを生成するために、図６に示すマスク加工画面４０において、直腸がんのステージを選択するプルダウンメニュー２６に加えて、またはこれに代えて突起の付与の有無を選択するためのプルダウンメニューを表示するようにしてもよい。また、図８に示すスケール４２に加えて、突起の付与の有無を選択するためのプルダウンメニューを表示するようにしてもよい。なお、直腸がんにおける突起を付与する位置および突起の先端位置は操作者が入力デバイス１５を用いて指定することとなる。この場合、マスク加工画面４０の３次元モデル４１は、図１０に示すように突起を付与する位置および突起の先端位置が例えばスプライン補間等により補間されて、血管浸潤４４が付与されたものとなる。なお、図１０に示す３次元モデル４１においては、２つの血管浸潤が付与されている。また、疑似マスク画像Ｍｆ０は直腸がんのマスクに２つの血管浸潤４４の領域が追加されたものとなる。 Likewise, in order to generate a mask having small protrusions, such as vascular invasion, the mask processing screen 40 shown in FIG. 6 may display a pull-down menu for selecting whether to add protrusions, in addition to or in place of the pull-down menu 26 for selecting the stage of rectal cancer. A pull-down menu for selecting whether to add protrusions may also be displayed in addition to the scale 42 shown in FIG. 8. Note that the position on the rectal cancer at which a protrusion is added and the position of the tip of the protrusion are specified by the operator using the input device 15. In this case, as shown in FIG. 10, the three-dimensional model 41 on the mask processing screen 40 has vascular invasion 44 added to it by interpolating between the position at which the protrusion is added and the position of its tip, for example by spline interpolation. Note that two vascular invasions are added in the three-dimensional model 41 shown in FIG. 10. The pseudo mask image Mf0 is the rectal cancer mask with the regions of the two vascular invasions 44 added.

また、疑似マスク導出部２２は、導出された疑似マスク画像Ｍｆ０についての直腸がんのステージを表す情報を導出する。ここで、図６に示すマスク加工画面４０においては直腸がんのステージが操作者によりプルダウンメニュー２６から選択されているため、選択された直腸がんのステージをそのまま用いればよい。一方、図８に示すマスク加工画面４０Ａにおいては、加工の程度が操作者により指示されている。このため、疑似マスク導出部２２は、疑似マスク画像Ｍｆ０に含まれる直腸がんについて、マスクＭｓｆ０の粘膜層３１からの深さに応じてステージを判定し、ステージを表す情報を疑似マスク画像Ｍｆ０に付与する。なお、疑似マスク画像Ｍｆ０における直腸がんのステージを表す情報は、操作者による入力デバイス１５からの入力に基づくものであってもよい。 Further, the pseudo mask deriving unit 22 derives information representing the stage of rectal cancer for the derived pseudo mask image Mf0. On the mask processing screen 40 shown in FIG. 6, the stage of rectal cancer has been selected by the operator from the pull-down menu 26, so the selected stage can be used as is. On the mask processing screen 40A shown in FIG. 8, on the other hand, the operator specifies the degree of processing. In that case, the pseudo mask deriving unit 22 determines the stage of the rectal cancer included in the pseudo mask image Mf0 according to the depth of the mask Msf0 from the mucosal layer 31, and attaches information representing the stage to the pseudo mask image Mf0. Note that the information representing the stage of rectal cancer in the pseudo mask image Mf0 may instead be based on input from the input device 15 by the operator.
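The depth-based stage determination can be sketched as a lookup on the deepest tissue layer that the cancer mask reaches. The label values, the function name `stage_from_depth`, and the lookup table are illustrative assumptions loosely based on the description of FIG. 4; the disclosure does not give the actual decision rule. Stages that depend on distance to the subserosal layer or to other organs (T3MRF+, T4a, T4b) are deliberately left undetermined here.

```python
import numpy as np

# Hypothetical layer labels, ordered from the lumen outward.
MUCOSA, SUBMUCOSA, MUSCULARIS = 1, 2, 3
# Assumed mapping: deepest layer reached by the cancer -> coarse stage.
STAGE_BY_DEEPEST_LAYER = {MUCOSA: "T1/T2", SUBMUCOSA: "T3ab", MUSCULARIS: "T3cd"}


def stage_from_depth(layer_labels, cancer):
    """Return a coarse stage from the deepest layer overlapped by the cancer mask.

    Returns None when the mask is empty, touches no known layer, or reaches
    beyond the muscularis propria (cases this sketch does not cover).
    """
    layer_labels = np.asarray(layer_labels)
    cancer = np.asarray(cancer, dtype=bool)
    reached = [int(v) for v in np.unique(layer_labels[cancer])]
    if not reached or max(reached) > MUSCULARIS:
        return None
    known = [v for v in reached if v in STAGE_BY_DEEPEST_LAYER]
    if not known:
        return None
    return STAGE_BY_DEEPEST_LAYER[max(known)]


layers = np.array([[1, 2, 3, 4]])
shallow = stage_from_depth(layers, np.array([[True, False, False, False]]))
deeper = stage_from_depth(layers, np.array([[True, True, False, False]]))
```

A practical implementation would likely measure depth in physical units and apply clinical thresholds; the layer lookup above only shows where such a rule would plug in.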

疑似画像導出部２３は、マスク加工画面４０において変換ボタン２８が選択されると、原画像Ｇ０および疑似マスク画像Ｍｆ０に基づいて、疑似マスク画像Ｍｆ０に含まれるマスクに基づく領域を有する疑似画像を導出する。このために、疑似画像導出部２３は、敵対的生成ネットワーク（Generative Adversarial Networks：GAN）を用いて学習がなされたジェネレータ５０を有する。ジェネレータ５０は直腸がんを含む原画像Ｇ０およびマスク画像が入力されると、マスク画像に含まれるマスクに基づく領域を有する疑似画像を出力するように学習がなされることにより構築される。 When the conversion button 28 is selected on the mask processing screen 40, the pseudo image deriving unit 23 derives, on the basis of the original image G0 and the pseudo mask image Mf0, a pseudo image having a region based on the mask included in the pseudo mask image Mf0. For this purpose, the pseudo image deriving unit 23 includes a generator 50 trained using Generative Adversarial Networks (GAN). The generator 50 is constructed by training it so that, when an original image G0 containing rectal cancer and a mask image are input, it outputs a pseudo image having a region based on the mask included in the mask image.

本実施形態においては、疑似画像とは、原画像Ｇ０を取得したモダリティにより取得される画像と同一の表現形式を有する画像を意味する。すなわち、原画像Ｇ０がＭＲＩ撮影装置により取得されたＭＲＩ画像である場合、疑似画像とは、ＭＲＩ画像と同一の表現形式を有する画像を意味する。ここで、表現形式が同一であるとは、同一の組成を有する構造物については、同一の濃度あるいは輝度で表されることを意味する。 In this embodiment, a pseudo image means an image having the same expression format as an image acquired by the modality that acquired the original image G0. That is, when the original image G0 is an MRI image acquired by an MRI imaging device, the pseudo image means an image having the same expression format as an MRI image. Here, having the same expression format means that structures having the same composition are represented with the same density or brightness.

ここで、ジェネレータ５０は、例えば「Semantic Image Synthesis with Spatially-Adaptive Normalization、Parkら、arXiv:1903.07291v2 [cs.CV] 5 Nov 2019」に記載されたマスク画像から疑似画像を生成するＳＰＡＤＥの手法により構築されたジェネレータを用いることができる。 Here, as the generator 50, it is possible to use a generator constructed by, for example, the SPADE method, which generates a pseudo image from a mask image and is described in "Semantic Image Synthesis with Spatially-Adaptive Normalization, Park et al., arXiv:1903.07291v2 [cs.CV] 5 Nov 2019".

図１１はジェネレータおよびその学習を模式的に示す図である。図１１に示すように、ジェネレータ５０は、エンコーダ５１およびデコーダ５２を有する。本実施形態においては、ジェネレータ５０は後述するディスクリミネータ５３とともに敵対的生成ネットワーク（ＧＡＮ）を構成する。なお、図１１に示す例においては、直腸を含むＭＲＩ画像である学習用画像Ｓ１および学習用画像Ｓ１における直腸がん、粘膜層、粘膜下層、固有筋層および漿膜下層にマスクが付与された学習用マスク画像Ｓ２からなる教師データＳ０が用意される。 FIG. 11 is a diagram schematically showing a generator and its learning. As shown in FIG. 11, generator 50 includes an encoder 51 and a decoder 52. In this embodiment, the generator 50 constitutes a generative adversarial network (GAN) together with a discriminator 53, which will be described later. In the example shown in FIG. 11, the learning image S1 is an MRI image including the rectum, and the learning image in which masks are applied to the rectal cancer, mucosal layer, submucosal layer, muscularis propria, and subserosal layer in the learning image S1. Teacher data S0 consisting of a mask image S2 is prepared.

ジェネレータ５０を構成するエンコーダ５１は、複数の処理層が階層的に接続された多層ニューラルネットワークの１つである、畳み込みニューラルネットワーク（ＣＮＮ(Convolutional Neural Network)）からなり、本実施形態においては、学習用画像Ｓ１が入力されると直腸を含むＭＲＩ画像の特徴量を表す潜在表現ｚ０を出力する。 The encoder 51 constituting the generator 50 is a convolutional neural network (CNN), which is one of the multilayer neural networks in which a plurality of processing layers are hierarchically connected. When the image S1 is input, a latent expression z0 representing the feature amount of the MRI image including the rectum is output.

デコーダ５２は、エンコーダ５１が出力した潜在表現ｚ０をエンコードしつつ、学習用マスク画像Ｓ２に含まれる個々の領域のマスクを適用して各マスクが表す領域を生成し、マスク画像に含まれるマスクのそれぞれに基づく領域を有し、学習用画像Ｓ１と同一の表現形式を有する疑似画像Ｓ３を出力する。 The decoder 52 decodes the latent expression z0 output by the encoder 51 while applying the masks of the individual regions included in the learning mask image S2 to generate the region represented by each mask, and outputs a pseudo image S3 that has regions based on each of the masks included in the mask image and has the same expression format as the learning image S1.

ディスクリミネータ５３は、入力された画像が実画像であるかジェネレータ５０により生成された疑似画像であるかを判別して判別結果ＴＦ０を出力する。ここで、実画像とは、ジェネレータ５０が生成した画像ではなく、撮影装置３により被写体を撮影することにより取得された原画像である。これに対して、疑似画像はジェネレータ５０によりマスク画像から生成された原画像と同一の表現形式を有する画像である。 The discriminator 53 determines whether the input image is a real image or a pseudo image generated by the generator 50, and outputs a determination result TF0. Here, a real image is not an image generated by the generator 50 but an original image obtained by photographing a subject with the photographing device 3. A pseudo image, in contrast, is an image that the generator 50 generates from a mask image and that has the same expression format as the original image.

本実施形態においては、入力された画像が実画像であるか、ジェネレータ５０により生成された疑似画像であるかの判別結果ＴＦ０を正解するように、ディスクリミネータ５３が学習される。また、入力されたマスク画像から実画像に似せた疑似画像を導出し、ディスクリミネータ５３が判別結果ＴＦ０を不正解とするように、ジェネレータ５０が学習される。これにより、ジェネレータ５０はディスクリミネータ５３に識別されない、本物のＭＲＩ画像と同一の表現形式を有する疑似画像を生成できるようになる。 In this embodiment, the discriminator 53 is trained so as to correctly determine whether the input image is a real image or a pseudo image generated by the generator 50 as the determination result TF0. Further, the generator 50 is trained so that a pseudo image resembling the real image is derived from the input mask image, and the discriminator 53 determines that the determination result TF0 is incorrect. This allows the generator 50 to generate a pseudo image that is not identified by the discriminator 53 and has the same representation format as the real MRI image.
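The adversarial training described above can be written compactly if one assumes the standard GAN objective. This is an assumption for illustration: the disclosure does not state which loss is used, and the cited SPADE work uses a different (hinge-style) formulation. Here $G$ denotes the generator 50, $D$ the discriminator 53, $S_1$ a learning image, $S_2$ its learning mask image, and $G(S_1, S_2)$ the pseudo image $S_3$.

```latex
\min_{G}\,\max_{D}\;
\mathbb{E}_{S_1}\!\left[\log D(S_1)\right]
+ \mathbb{E}_{(S_1,\,S_2)}\!\left[\log\!\left(1 - D\!\left(G(S_1, S_2)\right)\right)\right]
```

The discriminator maximizes this value by answering TF0 correctly for both real and pseudo images, while the generator minimizes it by producing pseudo images that the discriminator classifies as real.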

疑似画像導出部２３は、このようにして構築されたジェネレータ５０により、原画像Ｇ０および疑似マスク導出部２２が導出した疑似マスク画像Ｍｆ０から疑似画像を導出する。例えば、ジェネレータ５０は、原画像Ｇ０および図１２に示すような疑似マスク画像Ｍｆ０が入力されると、原画像Ｇ０と同一の表現形式を有する疑似画像Ｇｆ０を出力する。導出された疑似画像Ｇｆ０は、疑似マスク画像Ｍｆ０および直腸がんのステージを表す情報と併せてストレージ１３に保存される。ここで、本実施形態においては、本実施形態による画像生成装置により生成されたものではない、既存の画像すなわち原画像Ｓ０、原画像Ｓ０の直腸がんにマスクが付与されたマスク画像Ｍ０および直腸がんのステージを表す情報が、後述するセグメンテーションモデルを学習するための教師データとしてストレージ１３に蓄積されている。そして、本実施形態においては、本実施形態による画像生成装置１により導出された、疑似画像Ｇｆ０、疑似マスク画像Ｍｆ０および直腸がんのステージを表す情報は、既存の教師データに加えてストレージ１３に教師データとして蓄積される。また、疑似画像Ｇｆ０は、疑似マスク画像Ｍｆ０および直腸がんのステージを表す情報を画像保管サーバ４に送信し、ここで既存の教師データと併せて蓄積するようにしてもよい。 The pseudo image deriving unit 23 uses the generator 50 constructed in this manner to derive a pseudo image from the original image G0 and the pseudo mask image Mf0 derived by the pseudo mask deriving unit 22. For example, when the original image G0 and a pseudo mask image Mf0 such as that shown in FIG. 12 are input, the generator 50 outputs a pseudo image Gf0 having the same expression format as the original image G0. The derived pseudo image Gf0 is stored in the storage 13 together with the pseudo mask image Mf0 and the information representing the stage of rectal cancer. In the present embodiment, existing images that were not generated by the image generation device according to the present embodiment, that is, the original image S0, the mask image M0 in which a mask is applied to the rectal cancer in the original image S0, and the information representing the stage of rectal cancer, are accumulated in the storage 13 as teacher data for training a segmentation model to be described later. The pseudo image Gf0, the pseudo mask image Mf0, and the information representing the stage of rectal cancer derived by the image generation device 1 according to the present embodiment are accumulated in the storage 13 as teacher data in addition to the existing teacher data. Further, the pseudo image Gf0, the pseudo mask image Mf0, and the information representing the stage of rectal cancer may be transmitted to the image storage server 4 and accumulated there together with the existing teacher data.

学習部２４は、直腸を含むＭＲＩ画像を複数の領域にセグメンテーションするセグメンテーションモデルを学習する。本実施形態においては、ＭＲＩ画像を直腸がん、直腸の粘膜層、直腸の粘膜下層、直腸の固有筋層、直腸の漿膜下層およびこれら以外の背景のそれぞれの領域にセグメンテーションするセマンティックセグメンテーションモデルの学習を行い、学習済みのセマンティックセグメンテーションモデルを構築する。セマンティックセグメンテーションモデル（以下、ＳＳ（Semantic Segmentation）モデルとする）は、周知のように、入力画像の各画素に対して抽出対象物（クラス）を表すマスクを付与した出力画像を出力する機械学習モデルである。本実施形態においては、ＳＳモデルへの入力画像は直腸の領域を含むＭＲＩ画像であり、出力画像はＭＲＩ画像における直腸がん、直腸の粘膜層、直腸の粘膜下層、直腸の固有筋層、直腸の漿膜下層およびこれら以外の背景のそれぞれの領域をそれぞれマスクしたマスク画像である。ＳＳモデルは、ＲｅｓＮｅｔ（Residual Networks）、Ｕ－Ｎｅｔ（U-shaped Networks）といった畳み込みニューラルネットワーク（ＣＮＮ；Convolutional neural network）により構築される。 The learning unit 24 learns a segmentation model that segments an MRI image including the rectum into a plurality of regions. In this embodiment, we learn a semantic segmentation model that segments an MRI image into rectal cancer, the mucosal layer of the rectum, the submucosa of the rectum, the muscularis propria of the rectum, the subserosa of the rectum, and other background regions. and build a trained semantic segmentation model. As is well known, a semantic segmentation model (hereinafter referred to as SS (Semantic Segmentation) model) is a machine learning model that outputs an output image in which each pixel of an input image is given a mask representing the extraction target (class). It is. In this embodiment, the input image to the SS model is an MRI image including the rectal region, and the output image is the rectal cancer in the MRI image, the mucosal layer of the rectum, the submucosa of the rectum, the muscularis propria of the rectum, and the rectal region. This is a mask image in which the subserosa layer and other background areas are masked. The SS model is constructed using a convolutional neural network (CNN) such as ResNet (Residual Networks) or U-Net (U-shaped Networks).

ＳＳモデルの学習に際しては、既存の教師データ、すなわち原画像Ｇ０および原画像Ｇ０のマスク画像Ｍ０の組み合わせからなる教師データに加えて、疑似マスク導出部２２が導出した疑似マスク画像Ｍｆ０および疑似画像導出部２３が導出した疑似画像Ｇｆ０の組み合わせからなる教師データが使用される。既存の教師データにおいては、原画像Ｇ０および疑似画像Ｇｆ０が学習用データであり、マスク画像Ｍ０および疑似マスク画像Ｍｆ０が正解データである。疑似画像Ｇｆ０および疑似マスク画像Ｍｆ０を含む教師データにおいては、疑似画像Ｇｆ０が学習用データであり、疑似マスク画像Ｍｆ０が正解データである。 When training the SS model, in addition to the existing teacher data, that is, teacher data consisting of a combination of the original image G0 and the mask image M0 of the original image G0, teacher data consisting of a combination of the pseudo mask image Mf0 derived by the pseudo mask deriving unit 22 and the pseudo image Gf0 derived by the pseudo image deriving unit 23 is used. In the existing teacher data, the original image G0 and the pseudo image Gf0 are learning data, and the mask image M0 and the pseudo mask image Mf0 are correct data. In the teacher data including the pseudo image Gf0 and the pseudo mask image Mf0, the pseudo image Gf0 is the learning data, and the pseudo mask image Mf0 is the correct data.

学習の際にはＳＳモデルに原画像Ｇ０および疑似画像Ｇｆ０が入力され、これらの画像に含まれる物体がセグメンテーションされたマスク画像が出力される。次に、ＳＳモデルが出力したマスク画像と正解データであるマスク画像Ｍ０および疑似マスク画像Ｍｆ０との相違が損失として導出される。そして、損失が小さくなるように複数の教師データを用いてＳＳモデルの学習が繰り返されて、ＳＳモデルが構築される。 During learning, the original image G0 and pseudo image Gf0 are input to the SS model, and a mask image in which objects included in these images are segmented is output. Next, the difference between the mask image output by the SS model and the mask image M0 and pseudo mask image Mf0, which are correct data, is derived as a loss. Then, learning of the SS model is repeated using a plurality of pieces of training data so that the loss is small, and the SS model is constructed.
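The loss described above can be sketched as follows. This is a minimal illustration only: the description does not name a specific loss function, so pixel-wise cross-entropy is assumed here, and the function name `pixelwise_cross_entropy` and its arguments are hypothetical.

```python
import math


def pixelwise_cross_entropy(pred_probs, target_labels, eps=1e-12):
    """Mean cross-entropy over all pixels (assumed loss, not stated in the source).

    pred_probs: H x W x C nested lists holding the class probabilities that
        the SS model outputs for each pixel.
    target_labels: H x W nested lists of integer class indices taken from the
        correct mask image (M0 or Mf0).
    """
    total, count = 0.0, 0
    for prob_row, label_row in zip(pred_probs, target_labels):
        for probs, label in zip(prob_row, label_row):
            # Penalise a low probability assigned to the correct class.
            total += -math.log(max(probs[label], eps))
            count += 1
    return total / count


# One row, two pixels, two classes (background = 0, lesion = 1).
predicted = [[[0.5, 0.5], [1.0, 0.0]]]
correct_mask = [[0, 0]]
loss = pixelwise_cross_entropy(predicted, correct_mask)
```

Training would then adjust the model parameters so that this value decreases over the combined set of existing and pseudo teacher data.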

また、学習部２４は、直腸を含むＭＲＩ画像について直腸がんのステージを判別する判別モデルの学習を行い、学習済みの判別モデルを構築する。本実施形態においては、判別モデルへの入力画像は直腸の領域を含むＭＲＩ画像およびＭＲＩ画像をセグメンテーションしたマスク画像であり、出力はＭＲＩ画像に含まれる直腸がんのステージである。判別モデルも、ＲｅｓＮｅｔ、Ｕ－Ｎｅｔといった畳み込みニューラルネットワークにより構築される。 The learning unit 24 also learns a discrimination model for determining the stage of rectal cancer for MRI images including the rectum, and constructs a learned discrimination model. In this embodiment, the input images to the discriminant model are an MRI image including the rectal region and a mask image obtained by segmenting the MRI image, and the output is the stage of rectal cancer included in the MRI image. The discriminant model is also constructed using a convolutional neural network such as ResNet or U-Net.

判別モデルの学習に際しては、既存の教師データすなわち原画像Ｇ０、原画像Ｇ０のマスク画像Ｍ０および原画像Ｇ０に含まれる直腸がんのステージを表す情報の組み合わせからなる教師データに加えて、疑似マスク導出部２２が導出した疑似マスク画像Ｍｆ０、疑似画像導出部２３が導出した疑似画像Ｇｆ０および疑似画像Ｇｆ０に含まれる直腸がんのステージを表す情報の組み合わせからなる教師データが使用される。既存の教師データにおいては、原画像Ｇ０およびマスク画像Ｍ０が学習用データであり、原画像Ｇ０の直腸がんのステージを表す情報が正解データである。疑似画像Ｇｆ０および疑似マスク画像Ｍｆ０を含む教師データにおいては、疑似画像Ｇｆ０および疑似マスク画像Ｍｆ０が学習用データであり、直腸がんのステージを表す情報が正解データである。 When training the discriminant model, in addition to the existing teacher data, that is, teacher data consisting of a combination of the original image G0, the mask image M0 of the original image G0, and information representing the stage of rectal cancer included in the original image G0, teacher data consisting of a combination of the pseudo mask image Mf0 derived by the pseudo mask deriving unit 22, the pseudo image Gf0 derived by the pseudo image deriving unit 23, and information representing the stage of rectal cancer included in the pseudo image Gf0 is used. In the existing teacher data, the original image G0 and the mask image M0 are learning data, and the information representing the stage of rectal cancer in the original image G0 is correct data. In the teacher data including the pseudo image Gf0 and the pseudo mask image Mf0, the pseudo image Gf0 and the pseudo mask image Mf0 are learning data, and the information representing the stage of rectal cancer is correct data.

学習の際には判別モデルに原画像Ｇ０およびマスク画像Ｍ０、並びに疑似画像Ｇｆ０および疑似マスク画像Ｍｆ０が入力され、これらの画像に含まれる直腸がんのステージを表す情報が出力される。直腸がんのステージを表す情報としては、直腸がんの各ステージであることの確率である。確率は０～１の値をとる。次に、直腸がんの各ステージであることの確率と、正解データの直腸がんのステージとの相違が損失として導出される。ここで、判別モデルが出力した直腸がんのステージ（Ｔ１，Ｔ２，Ｔ３，Ｔ４）＝（０．１，０．１，０．７，０．１）であり、正解データが（０，０，１，０）であるとすると、判別モデルが出力した直腸がんの各ステージの確率と正解データにおける直腸がんの各ステージの確率との相違が損失として導出される。そして、損失が小さくなるように複数の教師データを用いて判別モデルの学習が繰り返されて、判別モデルが構築される。 During learning, the original image G0 and mask image M0, as well as the pseudo image Gf0 and pseudo mask image Mf0, are input to the discriminant model, and information representing the stage of rectal cancer included in these images is output. Information representing the stage of rectal cancer is the probability of being at each stage of rectal cancer. Probability takes a value between 0 and 1. Next, the difference between the probability of each stage of rectal cancer and the stage of rectal cancer in the correct data is derived as a loss. Here, the stage of rectal cancer output by the discriminant model (T1, T2, T3, T4) = (0.1, 0.1, 0.7, 0.1), and the correct data is (0, 0 , 1, 0), the difference between the probability of each stage of rectal cancer output by the discriminant model and the probability of each stage of rectal cancer in the correct data is derived as a loss. Then, learning of the discriminant model is repeated using a plurality of pieces of training data so that the loss is small, and the discriminant model is constructed.
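As a worked illustration of the numerical example above, the following sketch computes one possible loss for the output (0.1, 0.1, 0.7, 0.1) against the correct data (0, 0, 1, 0). The description only says that the difference is derived as a loss; cross-entropy is an assumption made here, and the function name is hypothetical.

```python
import math


def cross_entropy(predicted, target, eps=1e-12):
    """Cross-entropy between predicted stage probabilities and one-hot correct data."""
    return -sum(t * math.log(max(p, eps)) for p, t in zip(predicted, target))


predicted = [0.1, 0.1, 0.7, 0.1]  # probabilities for stages T1, T2, T3, T4
target = [0, 0, 1, 0]             # correct data: stage T3
loss = cross_entropy(predicted, target)  # equals -ln(0.7), about 0.357
```

Under this assumption the loss depends only on the probability assigned to the correct stage, and it approaches 0 as that probability approaches 1.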

なお、入力画像についてのマスク画像のみが入力されると入力画像に含まれる直腸がんのステージを表す情報を出力するように判別モデルを構築してもよい。この場合、判別モデルの学習には、原画像Ｇ０についてのマスク画像Ｍ０を学習用データ、原画像Ｇ０に含まれる直腸がんのステージを表す情報を正解データとして含む教師データ、並びに疑似画像Ｇｆ０についての疑似マスク画像Ｍｆ０を学習用データ、疑似画像Ｇｆ０に含まれる直腸がんのステージを表す情報を正解データとして含む教師データが使用される。 Note that the discriminant model may be constructed so that, when only the mask image for an input image is input, it outputs information representing the stage of rectal cancer included in the input image. In this case, the discriminant model is trained using teacher data that includes the mask image M0 for the original image G0 as learning data and information representing the stage of rectal cancer included in the original image G0 as correct data, and teacher data that includes the pseudo mask image Mf0 for the pseudo image Gf0 as learning data and information representing the stage of rectal cancer included in the pseudo image Gf0 as correct data.

次いで、本実施形態による画像処理装置について説明する。図１３は、本実施形態による画像処理装置のハードウェア構成を示す図である。図１３に示すように、画像処理装置６０は、ＣＰＵ６１、不揮発性のストレージ６３、および一時記憶領域としてのメモリ６６を含む。また、画像処理装置６０は、液晶ディスプレイ等のディスプレイ６４、キーボードとマウス等の入力デバイス６５、およびネットワーク５に接続されるネットワークＩ／Ｆ（InterFace）６７を含む。ＣＰＵ６１、ストレージ６３、ディスプレイ６４、入力デバイス６５、メモリ６６およびネットワークＩ／Ｆ６７は、バス６８に接続される。なお、ＣＰＵ６１は、本開示におけるプロセッサの一例である。 Next, an image processing apparatus according to this embodiment will be explained. FIG. 13 is a diagram showing the hardware configuration of the image processing device according to this embodiment. As shown in FIG. 13, the image processing device 60 includes a CPU 61, a nonvolatile storage 63, and a memory 66 as a temporary storage area. The image processing device 60 also includes a display 64 such as a liquid crystal display, an input device 65 such as a keyboard and a mouse, and a network I/F (InterFace) 67 connected to the network 5. The CPU 61, storage 63, display 64, input device 65, memory 66, and network I/F 67 are connected to a bus 68. Note that the CPU 61 is an example of a processor in the present disclosure.

ストレージ６３には、画像処理プログラム６２が記憶される。ＣＰＵ６１は、ストレージ６３から画像処理プログラム６２を読み出してメモリ６６に展開し、展開した画像処理プログラム６２を実行する。 An image processing program 62 is stored in the storage 63. The CPU 61 reads the image processing program 62 from the storage 63, loads it into the memory 66, and executes the loaded image processing program 62.

次いで、本実施形態による画像処理装置の機能的な構成を説明する。図１４は、本実施形態による画像処理装置の機能的な構成を示す図である。図１４に示すように画像処理装置６０は、画像取得部７１、セグメンテーション部７２、判別部７３および表示制御部７４を備える。そして、ＣＰＵ６１が画像処理プログラム６２を実行することにより、ＣＰＵ６１は、画像取得部７１、セグメンテーション部７２、判別部７３および表示制御部７４として機能する。 Next, the functional configuration of the image processing apparatus according to this embodiment will be explained. FIG. 14 is a diagram showing the functional configuration of the image processing device according to this embodiment. As shown in FIG. 14, the image processing device 60 includes an image acquisition section 71, a segmentation section 72, a discrimination section 73, and a display control section 74. When the CPU 61 executes the image processing program 62, the CPU 61 functions as an image acquisition section 71, a segmentation section 72, a discrimination section 73, and a display control section 74.

画像取得部７１は、処理の対象となる対象画像Ｔ０を画像保管サーバ４から取得する。対象画像Ｔ０は患者の直腸を含むＭＲＩ画像である。 The image acquisition unit 71 acquires a target image T0 to be processed from the image storage server 4. The target image T0 is an MRI image including the patient's rectum.

セグメンテーション部７２は、対象画像Ｔ０に含まれる物体の領域をセグメンテーションして、対象画像Ｔ０に含まれる物体の領域をマスクしたマスク画像ＴＭ０を導出する。本実施形態においては、対象画像Ｔ０に含まれる直腸がん、直腸の粘膜層、直腸の粘膜下層、直腸の固有筋層、直腸の漿膜下層およびこれら以外の背景のそれぞれの領域をセグメンテーションし、各領域にマスクを付与したマスク画像ＴＭ０を導出する。このために、セグメンテーション部７２は、本実施形態による学習装置によって構築されたＳＳモデル７２Ａが適用されている。 The segmentation unit 72 segments the object region included in the target image T0 and derives a mask image TM0 in which the object region included in the target image T0 is masked. In this embodiment, each region of the rectal cancer, the mucosal layer of the rectum, the submucosa of the rectum, the muscularis propria of the rectum, the subserosa of the rectum, and the background other than these, which are included in the target image T0, is segmented. A mask image TM0 in which a mask is applied to a region is derived. For this purpose, an SS model 72A constructed by the learning device according to the present embodiment is applied to the segmentation unit 72.

判別部７３は、対象画像Ｔ０に含まれる直腸がんのステージを判別し、判別結果を出力する。このために、判別部７３は、本実施形態による学習装置によって構築された判別モデル７３Ａが適用されている。判別モデル７３Ａには対象画像Ｔ０およびセグメンテーション部７２が導出した対象画像Ｔ０のマスク画像ＴＭ０が入力され、対象画像Ｔ０に含まれる直腸がんのステージの判別結果を出力する。 The determining unit 73 determines the stage of rectal cancer included in the target image T0, and outputs the determination result. For this purpose, the discrimination model 73A constructed by the learning device according to the present embodiment is applied to the discrimination unit 73. The target image T0 and the mask image TM0 of the target image T0 derived by the segmentation unit 72 are input to the discrimination model 73A, and outputs the determination result of the stage of rectal cancer included in the target image T0.

表示制御部７４は、セグメンテーション部７２が導出したマスク画像ＴＭ０および判別部７３が導出した直腸がんのステージの判別結果をディスプレイ６４に表示する。図１５は、マスク画像ＴＭ０および判別結果の表示画面を示す図である。図１５に示すように表示画面８０には、対象画像Ｔ０、マスク画像ＴＭ０および判別結果８１が表示される。なお、図１５においては判別結果は「ステージＴ３」である。 The display control unit 74 displays on the display 64 the mask image TM0 derived by the segmentation unit 72 and the determination result of the stage of rectal cancer derived by the determination unit 73. FIG. 15 is a diagram showing a display screen of the mask image TM0 and the discrimination results. As shown in FIG. 15, the target image T0, mask image TM0, and determination result 81 are displayed on the display screen 80. In addition, in FIG. 15, the determination result is "stage T3".

次いで、本実施形態において行われる処理について説明する。図１６は本実施形態における画像生成処理のフローチャートである。まず，情報取得部２１が画像保管サーバ４から原画像Ｇ０およびマスク画像Ｍ０を取得する（ステップＳＴ１）。次いで、疑似マスク導出部２２がマスクを加工することにより疑似マスク画像Ｍｆ０を導出する（ステップＳＴ２）。そして、疑似画像導出部２３が疑似マスクに基づく領域を有する疑似画像Ｇｆ０を導出し（ステップＳＴ３）、疑似画像Ｇｆ０、疑似マスク画像Ｍｆ０および疑似画像Ｇｆ０に含まれる直腸がんのステージを表す情報を、教師データとしてストレージ１３あるいは画像保管サーバ４に、既存の教師データと併せて蓄積し（ステップＳＴ４）、処理を終了する。 Next, the processing performed in the present embodiment will be described. FIG. 16 is a flowchart of the image generation processing in the present embodiment. First, the information acquisition unit 21 acquires the original image G0 and the mask image M0 from the image storage server 4 (step ST1). Next, the pseudo mask deriving unit 22 derives the pseudo mask image Mf0 by processing the mask (step ST2). Then, the pseudo image deriving unit 23 derives the pseudo image Gf0 having a region based on the pseudo mask (step ST3), and accumulates the pseudo image Gf0, the pseudo mask image Mf0, and the information representing the stage of rectal cancer included in the pseudo image Gf0 as teacher data in the storage 13 or the image storage server 4 together with the existing teacher data (step ST4), and the processing ends.

図１７は本実施形態における学習処理のフローチャートである。まず、学習部２４が、疑似画像Ｇｆ０および疑似マスク画像Ｍｆ０の組み合わせからなる教師データを取得する（ステップＳＴ１１）。そして学習部２４は、教師データを用いてＳＳモデルの学習を行う（ステップＳＴ１２）。これにより学習済みのＳＳモデルが構築される。 FIG. 17 is a flowchart of learning processing in this embodiment. First, the learning unit 24 acquires teacher data consisting of a combination of the pseudo image Gf0 and the pseudo mask image Mf0 (step ST11). The learning unit 24 then learns the SS model using the teacher data (step ST12). As a result, a trained SS model is constructed.

図１８は本実施形態における画像処理のフローチャートである。まず、画像取得部７１が画像保管サーバ４から処理の対象となる対象画像Ｔ０を取得する（ステップＳＴ２１）。そして、セグメンテーション部７２がＳＳモデル７２Ａにより対象画像Ｔ０をセグメンテーションしてマスク画像ＴＭ０を導出する（ステップＳＴ２２）。次いで、判別部７３が判別モデル７３Ａにより対象画像Ｔ０に含まれる直腸がんのステージの判別結果を導出する（ステップＳＴ２３）。そして、表示制御部７４がマスク画像ＴＭ０および判別結果の表示画面を表示し（ステップＳＴ２４）、処理を終了する。 FIG. 18 is a flowchart of image processing in this embodiment. First, the image acquisition unit 71 acquires the target image T0 to be processed from the image storage server 4 (step ST21). Then, the segmentation unit 72 segments the target image T0 using the SS model 72A to derive a mask image TM0 (step ST22). Next, the discrimination unit 73 derives the discrimination result of the stage of rectal cancer included in the target image T0 using the discrimination model 73A (step ST23). Then, the display control unit 74 displays the mask image TM0 and the discrimination result on the display screen (step ST24), and the process ends.
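The flow of steps ST22 and ST23 can be sketched as a small pipeline. This is only an illustration under stated assumptions: the two models are passed in as plain callables, the stand-in models below are hypothetical placeholders rather than trained CNNs, and the stage result is assumed to be a mapping from stage name to probability.

```python
def process_target_image(target_image, ss_model, stage_model):
    """Segment the target image, then discriminate the stage from image and mask."""
    mask_image = ss_model(target_image)                  # step ST22
    stage_probs = stage_model(target_image, mask_image)  # step ST23
    # Report the stage with the highest probability as the discrimination result.
    predicted_stage = max(stage_probs, key=stage_probs.get)
    return mask_image, predicted_stage


# Stand-in models for illustration only (hypothetical, not from the source).
def dummy_ss_model(image):
    return [[1 if pixel > 0.5 else 0 for pixel in row] for row in image]


def dummy_stage_model(image, mask):
    return {"T1": 0.1, "T2": 0.1, "T3": 0.7, "T4": 0.1}


mask, stage = process_target_image(
    [[0.2, 0.9], [0.8, 0.1]], dummy_ss_model, dummy_stage_model
)
```

The returned mask and stage correspond to what the display control unit 74 would show in step ST24.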

ここで、進行がんのような出現頻度が少ない稀少疾患は症例が少ないため、セグメンテーションおよびステージの判別を行うための機械学習モデルを構築するための十分な量の教師データを用意することができない。このため、稀少疾患を精度よくセグメンテーションしたり、進行がんのステージのような稀少疾患のクラスを精度よく判別したりすることが可能な機械学習モデルを提供することが難しい。 Here, rare diseases that occur infrequently, such as advanced cancer, have few cases, so a sufficient amount of teacher data for constructing machine learning models for segmentation and stage discrimination cannot be prepared. For this reason, it is difficult to provide a machine learning model that can accurately segment rare diseases or accurately discriminate classes of rare diseases, such as stages of advanced cancer.

本実施形態においては、原画像Ｇ０についてのマスク画像Ｍ０におけるマスクを加工することにより疑似マスク画像Ｍｆ０を導出し、疑似マスク画像Ｍｆ０に基づく領域を有する疑似画像Ｇｆ０を導出するようにした。これにより、セグメンテーションモデルを構築するための既存の教師データにはないか、または全くないクラスの対象物体を含む教師データを用意することができる。例えば、進行した直腸がんを含む疑似画像Ｇｆ０を教師データとして用意することができる。このため、稀少疾患についての疑似画像Ｇｆ０を導出し、これを既存の教師データとともに蓄積して、セグメンテーションおよびステージの判別を行うための学習モデルの学習に用いることにより、稀少疾患についても精度よくセグメンテーションができるような十分な量の教師データを用意することができる。したがって、処理対象となる対象画像について、稀少疾患を精度よくセグメンテーションしたり、進行がんのステージのような稀少疾患のクラスを精度よく判別したりすることが可能な機械学習モデルを提供することが可能となる。 In the present embodiment, the pseudo mask image Mf0 is derived by processing the mask in the mask image M0 for the original image G0, and the pseudo image Gf0 having a region based on the pseudo mask image Mf0 is derived. This makes it possible to prepare teacher data that includes target objects of a class that is lacking in, or entirely absent from, the existing teacher data for constructing a segmentation model. For example, a pseudo image Gf0 including advanced rectal cancer can be prepared as teacher data. Therefore, by deriving pseudo images Gf0 for a rare disease, accumulating them together with the existing teacher data, and using them to train the learning models for segmentation and stage discrimination, a sufficient amount of teacher data can be prepared so that even a rare disease can be segmented accurately. Accordingly, it becomes possible to provide a machine learning model that can accurately segment a rare disease in a target image to be processed and accurately discriminate the class of a rare disease, such as the stage of advanced cancer.

なお、上記実施形態においては、対象物体を直腸がんとしているが、これに限定されるものではない。直腸以外の他の臓器または構造のがんあるいは腫瘍等の病変を対象物体とすることができる。例えば、関節にある骨棘を対象物体として疑似マスクおよび疑似画像の導出、並びにＳＳモデルの構築を行うことができる。以下、これを他の実施形態として説明する。 Note that in the above embodiment, the target object is rectal cancer, but the present invention is not limited to this. The target object can be a lesion such as cancer or tumor in an organ or structure other than the rectum. For example, a pseudo mask and a pseudo image can be derived, and an SS model can be constructed using a bone spur in a joint as a target object. This will be described below as another embodiment.

ここで、骨棘とは、関節面の軟骨が肥大増殖し、次第に硬くなって骨化して「とげ」のようになったものであり、関節面周辺にできる変形性関節症の特徴的な所見の１つである。このような場合、関節を構成する骨を対象物体として骨棘を形成するように疑似マスクを導出し、関節を構成する骨に骨棘が形成された疑似画像を導出するようにすればよい。 Here, an osteophyte (bone spur) is cartilage on an articular surface that has hypertrophied and proliferated, gradually hardened, and ossified into a thorn-like shape, and it is one of the characteristic findings of osteoarthritis that form around the articular surface. In such a case, a pseudo mask may be derived so as to form an osteophyte with the bone constituting the joint as the target object, and a pseudo image in which an osteophyte is formed on the bone constituting the joint may be derived.

図１９は骨棘についてのマスク加工画面を示す図である。図１９に示すようにマスク加工画面９０には、マスク画像Ｍ０、マスク画像Ｍ０におけるマスクが加工された疑似マスク画像Ｍｆ０、原画像Ｇ０に含まれる膝関節の３次元モデル９１、および３次元モデル９１の変形の程度を調整するためのスケール９２が表示されている。ここで、原画像Ｇ０は患者の膝関節のＭＲＩ画像である。また、膝関節の３次元モデル９１は脛骨の関節付近を表すものとなっている。なお、原画像Ｇ０においては関節に骨棘は形成されていない。 FIG. 19 is a diagram showing a mask processing screen for bone spurs. As shown in FIG. 19, the mask processing screen 90 includes a mask image M0, a pseudo mask image Mf0 obtained by processing the mask in the mask image M0, a three-dimensional model 91 of the knee joint included in the original image G0, and a three-dimensional model 91. A scale 92 for adjusting the degree of deformation is displayed. Here, the original image G0 is an MRI image of the patient's knee joint. Furthermore, the three-dimensional knee joint model 91 represents the vicinity of the tibia joint. Note that no osteophyte is formed in the joint in the original image G0.

なお、ここでは説明のために、図１９に示すマスク画像Ｍ０には加工を行う脛骨の領域にのみマスクＭｓ０を付与している。また、疑似マスク画像Ｍｆ０には加工される脛骨のマスクＭｓｆ０のみに参照符号を付与している。図１９に示すマスク画像Ｍ０と疑似マスク画像Ｍｆ０とはマスクの加工前であるため、それぞれに付与されたマスクは同一のものとなっている。 Note that for the purpose of explanation, a mask Ms0 is provided only to the region of the tibia to be processed in the mask image M0 shown in FIG. 19. Further, in the pseudo mask image Mf0, only the mask Msf0 of the tibia to be processed is given a reference numeral. Since the mask image M0 and the pseudo mask image Mf0 shown in FIG. 19 are before mask processing, the masks applied to each are the same.

３次元モデル９１は、原画像Ｇ０における脛骨の関節付近の領域のみを抽出してボリュームレンダリングすることにより導出された３次元的な画像である。操作者は入力デバイス１５を操作することにより、３次元モデル９１の全方向の向きを変更することができる。また、入力デバイス１５を用いて３次元モデル９１における所望とされる位置を指定することによりマスクの加工場所を指定する。そして、入力デバイス１５を用いてスケール９２の摘子９２Ａを移動させることにより、マスクの加工の程度を指定する。これにより疑似マスク導出部２２は、図２０に示すように指定された位置を起点として３次元モデル９１を延ばすように加工し、さらに疑似マスク画像Ｍｆ０に付与されているマスクＭｓｆ０に対して、３次元モデル９１の形状に適合するようにマスクＭｓｆ１を付与する。 The three-dimensional model 91 is a three-dimensional image derived by extracting only the region near the joint of the tibia in the original image G0 and performing volume rendering. By operating the input device 15, the operator can change the orientation of the three-dimensional model 91 in any direction. The operator also designates the location where the mask is to be processed by designating a desired position on the three-dimensional model 91 using the input device 15. Then, the operator designates the degree of mask processing by moving the knob 92A of the scale 92 using the input device 15. As a result, as shown in FIG. 20, the pseudo mask deriving unit 22 processes the three-dimensional model 91 so as to extend it from the designated position as a starting point, and further applies a mask Msf1 to the mask Msf0 in the pseudo mask image Mf0 so as to match the shape of the three-dimensional model 91.

この際、疑似マスク導出部２２は、脛骨に付与されたマスクの３次元的な連続性を保持しつつマスクを加工する。例えば、３次元モデル９１における指定された位置を延ばすように変形する際に、３次元モデル９１における指定された位置から離れるほど変形の程度を小さくする。これにより、元の３次元モデル９１における３次元的な連続性を保持しつつ３次元モデル９１を変形することができ、その結果、脛骨に付与されたマスクの３次元的な連続性を保持しつつ、骨棘が形成されたものとなるようにマスクを加工することができる。 At this time, the pseudo mask deriving unit 22 processes the mask while maintaining the three-dimensional continuity of the mask applied to the tibia. For example, when deforming the three-dimensional model 91 to extend a designated position, the degree of deformation is made smaller as the distance from the designated position in the three-dimensional model 91 increases. As a result, the three-dimensional model 91 can be transformed while maintaining the three-dimensional continuity of the original three-dimensional model 91, and as a result, the three-dimensional continuity of the mask applied to the tibia can be maintained. However, the mask can be processed to have bone spurs formed therein.
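The distance-dependent deformation described above can be sketched as follows. The description only states that the degree of deformation decreases with distance from the designated position; the Gaussian falloff, the `sigma` parameter, and the function name are assumptions introduced for illustration.

```python
import math


def deform_vertices(vertices, picked, displacement, sigma):
    """Move each vertex by `displacement`, weighted by a Gaussian falloff.

    vertices: list of (x, y, z) tuples on the surface of the 3D model.
    picked: (x, y, z) position designated by the operator.
    displacement: (dx, dy, dz) applied in full at the picked position.
    sigma: falloff radius (hypothetical parameter, same units as the vertices).
    """
    moved = []
    for vertex in vertices:
        dist_sq = sum((a - b) ** 2 for a, b in zip(vertex, picked))
        # Weight is 1 at the picked position and decays smoothly towards 0.
        weight = math.exp(-dist_sq / (2.0 * sigma ** 2))
        moved.append(tuple(a + weight * d for a, d in zip(vertex, displacement)))
    return moved


surface = [(0.0, 0.0, 0.0), (100.0, 0.0, 0.0)]
deformed = deform_vertices(surface, (0.0, 0.0, 0.0), (0.0, 0.0, 5.0), sigma=2.0)
```

Because the weight varies smoothly, neighbouring vertices move by similar amounts, which is one way to keep the surface continuous while pulling out a spur-like protrusion.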

疑似画像導出部２３は、骨棘が形成された疑似マスク画像Ｍｆ０から骨棘が形成された脛骨を含む疑似画像Ｇｆ０を導出する。そして、学習部２４は、骨棘が形成された疑似マスク画像Ｍｆ０および疑似画像Ｇｆ０を教師データとして用いてＳＳモデルを学習する。これにより、膝関節を含むＭＲＩ画像において、骨棘を精度よくセグメンテーションすることが可能な学習済みのＳＳモデルを構築することができる。したがって、このように構築したＳＳモデルを本実施形態による画像処理装置のセグメンテーション部７２に適用することにより、膝関節を含むＭＲＩ画像において骨棘の領域を精度よくセグメンテーションすることができる。 The pseudo-image deriving unit 23 derives a pseudo-image Gf0 including the tibia on which osteophytes are formed from the pseudo-mask image Mf0 on which osteophytes are formed. The learning unit 24 then learns the SS model using the pseudo mask image Mf0 and the pseudo image Gf0 in which bone spurs are formed as teacher data. Thereby, it is possible to construct a trained SS model that can accurately segment osteophytes in an MRI image including a knee joint. Therefore, by applying the SS model constructed in this manner to the segmentation unit 72 of the image processing apparatus according to this embodiment, it is possible to precisely segment the osteophyte region in an MRI image including the knee joint.

また、関節に骨棘が形成された脛骨を含む疑似画像Ｇｆ０を教師データとして用いることにより、脛骨の関節を含むＭＲＩ画像について、骨棘の有無を判別する判別モデルを構築することも可能である。骨棘の有無を判別する判別モデルを構築する際には、骨棘が形成されていない脛骨を含む原画像も教師データとして用いる。 Furthermore, by using the pseudo image Gf0 that includes a tibia with an osteophyte formed at the joint as teacher data, it is also possible to construct a discriminant model that discriminates the presence or absence of osteophytes in an MRI image including the joint of the tibia. When constructing a discriminant model that discriminates the presence or absence of osteophytes, original images that include a tibia without osteophytes are also used as teacher data.

なお、上記他の実施形態においては、骨棘を含まない原画像Ｇ０を加工することにより、疑似マスク画像Ｍｆ０さらには疑似画像Ｇｆ０を導出しているが、追加される病変は骨棘に限定されるものではない。任意の病変の追加の対象となる臓器を含み、かつ病変を含まない原画像Ｇ０に対して、病変を追加するように原画像Ｇ０を加工して、疑似マスク画像Ｍｆ０さらには疑似画像Ｇｆ０を導出するようにしてもよい。 Note that in the other embodiment described above, the pseudo mask image Mf0 and then the pseudo image Gf0 are derived by processing the original image G0 that does not include osteophytes, but the lesion to be added is not limited to osteophytes. For an original image G0 that includes an organ to which any lesion is to be added and that does not include the lesion, the original image G0 may be processed so as to add the lesion, and the pseudo mask image Mf0 and then the pseudo image Gf0 may be derived.

なお、上記各実施形態において、疑似マスク導出部２２がマスクを加工して疑似マスク画像Ｍｆ０を導出するに際しては、マスクの変形に対して拘束条件を設定し、拘束条件に従ってマスクの加工の程度の指定を受け付けるようにしてもよい。図２１はマスク加工画面の他の例を示す図である。図２１に示すマスク加工画面１００は、疑似マスク画像Ｍｆ０、拘束条件を設定するための条件リスト１０１、マスクの変形の程度を設定するためのプルダウンメニュー１０２および疑似画像の導出を実行させるための変換ボタン１０３が表示されている。 In each of the embodiments described above, when the pseudo mask deriving unit 22 processes a mask to derive the pseudo mask image Mf0, a constraint condition is set for the deformation of the mask, and the degree of processing of the mask is determined according to the constraint condition. It may also be possible to accept specifications. FIG. 21 is a diagram showing another example of the mask processing screen. The mask processing screen 100 shown in FIG. 21 includes a pseudo mask image Mf0, a condition list 101 for setting constraint conditions, a pull-down menu 102 for setting the degree of mask deformation, and a conversion for deriving a pseudo image. A button 103 is displayed.

図２１に示すマスク加工画面１００に表示された疑似マスク画像Ｍｆ０においては、直腸がんの領域にマスクＭｓｆ０が付与され、直腸の領域にマスクＭｓｆ１が付与されている。マスク加工画面１００に表示された疑似マスク画像Ｍｆ０に含まれるマスクＭｓｆ０，Ｍｓｆ１に対しては、加工するか否かを設定可能とされている。例えば、マウスカーソルを所望とするマスク上に移動させて右クリックする等の予め定められた操作により、選択したマスクを加工するか否かを設定可能とされている。本実施形態においては、直腸のマスクＭｓｆ１は加工せず、直腸がんのマスクＭｓｆ０を加工するように設定したものとする。なお、以降の説明においては加工しないように設定したマスクを固定マスクと称する。 In the pseudo mask image Mf0 displayed on the mask processing screen 100 shown in FIG. 21, a mask Msf0 is applied to the rectal cancer area, and a mask Msf1 is applied to the rectal area. It is possible to set whether or not to process the masks Msf0 and Msf1 included in the pseudo mask image Mf0 displayed on the mask processing screen 100. For example, by a predetermined operation such as moving the mouse cursor over a desired mask and right-clicking, it is possible to set whether or not to process the selected mask. In this embodiment, it is assumed that the rectal mask Msf1 is not processed and the rectal cancer mask Msf0 is processed. Note that in the following description, a mask set not to be processed will be referred to as a fixed mask.

条件リスト１０１には、加工するマスクの変形に対する拘束条件がチェックボックスにより選択可能に表示されている。図２１に示すように、拘束条件としては、例えば「固定マスクの中心に回転」、「固定マスクに内接」、「固定マスクからの距離が□ｍｍ」、「固定マスクの重心に合わせた回転」および「固定マスクの淵に沿った回転」が表示されている。操作者は、条件リスト１０１から所望とする１以上の拘束条件を選択することができる。なお、「固定マスクからの距離が□ｍｍ」の拘束条件については距離の数値を入力可能とされている。本実施形態において、拘束条件として「固定マスクの重心中心に回転」が選択されたとする。また、プルダウンメニュー１０２は、図６に示すプルダウンメニュー２６と同様に、直腸がんのステージＴ１、Ｔ２、Ｔ３ａｂ、Ｔ３ｃｄ、Ｔ３ＭＲＦ＋、Ｔ４ａおよびＴ４ｂを選択可能となっている。 In the condition list 101, constraint conditions for the deformation of the mask to be processed are displayed so as to be selectable by check boxes. As shown in FIG. 21, the constraint conditions displayed are, for example, "rotation around the center of the fixed mask", "inscribed in the fixed mask", "distance from the fixed mask is □mm", "rotation aligned with the center of gravity of the fixed mask", and "rotation along the edge of the fixed mask". The operator can select one or more desired constraint conditions from the condition list 101. For the constraint condition "distance from the fixed mask is □mm", a numerical value of the distance can be entered. In the present embodiment, it is assumed that "rotation around the center of gravity of the fixed mask" is selected as the constraint condition. Further, in the pull-down menu 102, similarly to the pull-down menu 26 shown in FIG. 6, the rectal cancer stages T1, T2, T3ab, T3cd, T3MRF+, T4a, and T4b can be selected.

拘束条件として「固定マスクの中心に回転」が選択されると、固定マスクである直腸のマスクＭｓｆ１の重心Ｃ０が疑似マスク画像Ｍｆ０に表示される。これにより、操作者がマスクＭｆｓ０を入力デバイス１５を用いて変形して加工する際に、マスクＭｓｆ０は固定マスクＭｓｆ１の重心Ｃ０を中心とした回転のみが可能なようにその変形が拘束される。図２２にはマスクＭｓｆ０を回転させた状態を示す。なお、図２２には変形前のマスクＭｓｆ０を仮想線で示している。また、操作者はプルダウンメニュー１０２において直腸がんのステージを選択することができる。図２２においては現在表示されている直腸がんと同一のステージであるＴ３ａｂが表示されている。 When "rotate around the center of the fixed mask" is selected as the constraint condition, the center of gravity C0 of the rectal mask Msf1, which is the fixed mask, is displayed on the pseudo mask image Mf0. Thereby, when the operator deforms and processes the mask Mfs0 using the input device 15, the deformation of the mask Msf0 is restricted so that it can only rotate about the center of gravity C0 of the fixed mask Msf1. FIG. 22 shows a state in which the mask Msf0 is rotated. Note that in FIG. 22, the mask Msf0 before deformation is shown by a virtual line. Further, the operator can select the stage of rectal cancer from the pull-down menu 102. In FIG. 22, T3ab, which is the same stage as the currently displayed rectal cancer, is displayed.
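The rotation constraint described above can be sketched in two dimensions. This is a minimal illustration, assuming that masks are given as lists of (x, y) pixel coordinates and that the center of gravity is the mean of the fixed-mask coordinates; the function names are hypothetical.

```python
import math


def centroid(points):
    """Center of gravity of a mask given as a list of (x, y) coordinates."""
    n = len(points)
    return (sum(x for x, _ in points) / n, sum(y for _, y in points) / n)


def rotate_about(points, center, angle_deg):
    """Rotate mask coordinates about `center`; no other deformation is allowed."""
    cx, cy = center
    angle = math.radians(angle_deg)
    cos_a, sin_a = math.cos(angle), math.sin(angle)
    return [
        (cx + (x - cx) * cos_a - (y - cy) * sin_a,
         cy + (x - cx) * sin_a + (y - cy) * cos_a)
        for x, y in points
    ]


fixed_mask = [(0, 0), (2, 0), (0, 2), (2, 2)]   # stands in for the rectum mask Msf1
c0 = centroid(fixed_mask)                       # stands in for the center of gravity C0
rotated = rotate_about([(2, 1)], c0, 90.0)      # stands in for the cancer mask Msf0
```

In practice the rotated coordinates would be resampled onto the pixel grid to form the processed mask.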

そしてマスクＭｓｆ０が加工されて疑似マスク画像Ｍｆ０が導出された後、変換ボタン１０３が選択されると、疑似画像導出部２３は、加工されたマスクＭｓｆ０および選択された直腸がんのステージに応じた疑似画像Ｇｆ０を導出する。 After the mask Msf0 is processed and the pseudo mask image Mf0 is derived, when the conversion button 103 is selected, the pseudo image derivation unit 23 converts the processed mask Msf0 and the selected stage of rectal cancer. A pseudo image Gf0 is derived.

図２３はマスク加工画面の他の例を示す図である。図２３に示すマスク加工画面１０５に表示された疑似マスク画像Ｍｆ０においては、図２１に示すマスクＭｆｓ０のうちの直腸と重なる領域が固定されたマスクＭｆｓ２として設定されている。これにより、直腸から外れた領域に存在するマスクＭｆｓ０のみが変形可能とされている。また、ここでは拘束条件として「固定マスクからの距離が□ｍｍ」が選択され、距離として３ｍｍが入力されたものとする。これにより、操作者が入力デバイス１５を用いてマスクＭｆｓ０を変形して加工する際に、マスクＭｓｆ０は固定マスクから最も離れた位置が３ｍｍ以下となるようにその変形が拘束される。図２４にはマスクＭｓｆ０を変形させた状態を示す。図２４に示すようにマスクＭｓｆ０における突出部分は固定マスクＭｓｆ１，Ｍｓｆ２からの距離が指定された距離（すなわち３ｍｍ）となるように変形されている。なお、変形のマスクＭｓｆ０の輪郭は仮想線で示している。 FIG. 23 is a diagram showing another example of the mask processing screen. In the pseudo mask image Mf0 displayed on the mask processing screen 105 shown in FIG. 23, the region of the mask Mfs0 shown in FIG. 21 that overlaps with the rectum is set as a fixed mask Mfs2. As a result, only the mask Mfs0 existing in the region outside the rectum can be deformed. Further, here, it is assumed that "the distance from the fixed mask is □ mm" is selected as the constraint condition, and 3 mm is input as the distance. Thereby, when the operator deforms and processes the mask Mfs0 using the input device 15, the deformation of the mask Msf0 is restrained so that the farthest position from the fixed mask is 3 mm or less. FIG. 24 shows a state in which the mask Msf0 is deformed. As shown in FIG. 24, the protruding portion of the mask Msf0 is deformed so that the distance from the fixed masks Msf1 and Msf2 is a specified distance (ie, 3 mm). Note that the outline of the deformation mask Msf0 is shown by a virtual line.
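The distance constraint described above can be checked with a brute-force sketch. It assumes masks are lists of (row, column) pixel indices and that the pixel spacing in millimetres is known; the function names and the `spacing_mm` parameter are hypothetical.

```python
import math


def max_distance_from_fixed(mask_pixels, fixed_pixels, spacing_mm=(1.0, 1.0)):
    """Largest distance (mm) from any mask pixel to its nearest fixed-mask pixel."""
    row_mm, col_mm = spacing_mm
    worst = 0.0
    for r, c in mask_pixels:
        nearest = min(
            math.hypot((r - fr) * row_mm, (c - fc) * col_mm)
            for fr, fc in fixed_pixels
        )
        worst = max(worst, nearest)
    return worst


def satisfies_distance_constraint(mask_pixels, fixed_pixels, limit_mm,
                                  spacing_mm=(1.0, 1.0)):
    """True if the deformed mask stays within `limit_mm` of the fixed mask."""
    return max_distance_from_fixed(mask_pixels, fixed_pixels, spacing_mm) <= limit_mm


fixed = [(0, 0), (0, 1)]
ok = satisfies_distance_constraint([(3, 0)], fixed, limit_mm=3.0)
too_far = satisfies_distance_constraint([(4, 0)], fixed, limit_mm=3.0)
```

A deformation requested by the operator could be rejected or clipped whenever this check fails. For large masks a distance transform would be more efficient than this pairwise search.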

図２５はマスク加工画面の他の例を示す図である。図２５に示すマスク加工画面１０６に表示されたマスク画像Ｍｆ０においては、図２１と同様のマスクＭｆｓ０，Ｍｓｆ１が設定されている。そして、条件リスト１０１において、「固定マスクの淵に沿った回転」が選択されたものとする。条件リスト１０１において、「固定マスクの淵に沿った回転」が選択されると、疑似マスク画像Ｍｆ０には固定マスクである直腸のマスクＭｓｆ１の輪郭とマスクＭｓｆ０の輪郭との交点Ｃ２，Ｃ３が表示される。これにより、操作者がマスクＭｆｓ０を入力デバイス１５を用いて変形して加工する際に、マスクＭｓｆ０は固定マスクＭｓｆ１の縁に沿った回転のみが可能なようにその変形が拘束される。図２６にはマスクＭｓｆ０を回転させた状態を示す。 FIG. 25 is a diagram showing another example of the mask processing screen. In the pseudo mask image Mf0 displayed on the mask processing screen 106 shown in FIG. 25, masks Msf0 and Msf1 similar to those in FIG. 21 are set. It is assumed that "rotation along the edge of the fixed mask" is selected in the condition list 101. When "rotation along the edge of the fixed mask" is selected in the condition list 101, the intersection points C2 and C3 between the outline of the rectal mask Msf1, which is the fixed mask, and the outline of the mask Msf0 are displayed in the pseudo mask image Mf0. Thereby, when the operator deforms and processes the mask Msf0 using the input device 15, the deformation of the mask Msf0 is constrained so that only rotation along the edge of the fixed mask Msf1 is possible. FIG. 26 shows a state in which the mask Msf0 has been rotated.

図２７はマスク加工画面の他の例を示す図である。上記図８に示す加工画面においては、スケール４２を用いてマスクＭｓｆ０を変形していたが、図２７に示すマスク加工画面１０７においては、操作者が入力デバイス１５を操作してマスクＭｓｆ０を直接変形するようにしたものである。この場合、操作者は入力デバイス１５を用いてマスクＭｓｆ０の端部の点Ｃ４を点Ｃ５に向けてドラッグすることにより、図２８に示すようにマスクＭｓｆ０を変形して加工することができる。なお、この場合、点Ｃ４のドラッグに代えて、点Ｃ５のみを入力デバイス１５を用いて指定できるようにしてもよい。例えば、点Ｃ５の位置をダブルクリックすることにより点Ｃ５の位置を指定できるようにしてもよい。この場合、点Ｃ５の指定とともにマスクＭｓｆ０が図２８に示すように変形されて加工されることとなる。 FIG. 27 is a diagram showing another example of the mask processing screen. In the processing screen shown in FIG. 8 above, the mask Msf0 is transformed using the scale 42, but in the mask processing screen 107 shown in FIG. 27, the operator directly transforms the mask Msf0 by operating the input device 15. It was designed to do so. In this case, the operator can deform and process the mask Msf0 as shown in FIG. 28 by dragging the point C4 at the end of the mask Msf0 toward the point C5 using the input device 15. In this case, instead of dragging the point C4, only the point C5 may be specified using the input device 15. For example, the position of point C5 may be specified by double-clicking the position of point C5. In this case, the mask Msf0 is deformed and processed as shown in FIG. 28 along with the designation of the point C5.

なお、図２８に示すマスク加工画面１０７においては、変形したマスクＭｓｆ０の突端の形状を設定できるようにしてもよい。例えば、上記図９に示す棘状突起を有するマスクを生成可能なように、棘状突起の付与の有無を選択するためのプルダウンメニューを表示するようにしてもよい。図２９は棘状突起の有無を選択するためのプルダウンメニューが表示されたマスク加工画面を示す図である。図２９に示すように棘状突起のプルダウンメニュー１０９において「有」が選択されると、操作者がマスクＭｓｆ０の端部の点Ｃ４を点Ｃ５に向けてドラッグすると、移動先の点Ｃ４，Ｃ５の先端に棘状突起を有するマスクが追加されるようにマスクＭｓｆ０が変形されて加工される。 Note that in the mask processing screen 107 shown in FIG. 28, the shape of the tip of the deformed mask Msf0 may be set. For example, a pull-down menu for selecting whether to add spinous processes may be displayed so that a mask having spinous processes shown in FIG. 9 can be generated. FIG. 29 is a diagram showing a mask processing screen on which a pull-down menu for selecting the presence or absence of spinous processes is displayed. As shown in FIG. 29, when "Present" is selected in the spinous process pull-down menu 109, when the operator drags the point C4 at the end of the mask Msf0 toward the point C5, the destination points C4, C5 The mask Msf0 is modified and processed so that a mask having a spinous process is added to the tip of the mask.

Further, in each of the above embodiments, the pseudo mask image Mf0 is derived by deforming the mask so that the rectal cancer is enlarged or by adding bone spurs, but the present disclosure is not limited to this. The technology of the present disclosure can also be applied to deriving a pseudo image Gf0 for a disease that involves contraction.

For example, when deriving a pseudo image Gf0 that includes vascular stenosis, the pseudo mask image Mf0 can be derived by processing the blood vessel mask of an original image G0 that does not include vascular stenosis so that the mask becomes thinner. A pseudo image Gf0 that includes vascular stenosis can then be derived from this pseudo mask image Mf0.
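As a minimal sketch of thinning a blood vessel mask, the following assumes a two-dimensional binary mask and uses binary erosion restricted to a chosen column range. Erosion is a technique swapped in here for illustration; the disclosure does not state which operation is used to make the mask thinner. The names `erode` and `narrow_vessel` and the parameters `col_range` and `steps` are hypothetical.

```python
import numpy as np

def erode(mask):
    """One step of binary erosion with a 4-connected (cross) structuring element."""
    p = np.pad(mask, 1, constant_values=False)
    return (p[1:-1, 1:-1] & p[:-2, 1:-1] & p[2:, 1:-1]
            & p[1:-1, :-2] & p[1:-1, 2:])

def narrow_vessel(mask, col_range, steps=1):
    """Thin the mask only inside columns [start, stop) to mimic a stenosis."""
    start, stop = col_range
    eroded = mask.copy()
    for _ in range(steps):
        eroded = erode(eroded)
    out = mask.copy()
    # Keep the original mask outside the chosen segment.
    out[:, start:stop] = eroded[:, start:stop]
    return out

vessel = np.zeros((15, 40), dtype=bool)
vessel[4:11, :] = True  # toy horizontal vessel, 7 pixels wide
pseudo = narrow_vessel(vessel, (15, 25), steps=2)
```

With these toy values the vessel keeps its 7-pixel width outside columns 15 to 24 and is narrowed to 3 pixels inside them, so the processed mask stays within the original mask.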

Further, in the above embodiments, the pseudo image derivation unit 23 derives the pseudo image Gf0 having the same representation form as the original image G0, but the present disclosure is not limited to this. When the pseudo image derivation unit 23 derives the pseudo image, at least one of the density, color, and texture of the pseudo image Gf0 may be changed. In this case, a plurality of style images having predetermined densities, colors, and textures may be stored in the storage 13, and the pseudo image Gf0 may be derived so as to have the density, color, or texture selected from the plurality of style images. FIG. 30 is a diagram showing a mask processing screen on which a plurality of style images are displayed. The mask processing screen 120 shown in FIG. 30 is the mask processing screen 40 shown in FIGS. 6 and 7 with the addition of a style image list 122 including a plurality of style images and a display of the pseudo image Gf0.

The style image list 122 displayed on the mask processing screen 120 shown in FIG. 30 includes style images of six different styles. The upper row of the list 122 contains style images 123A, 123B, and 123C of three different colors, and the lower row contains style images 124A, 124B, and 124C of three different textures. Style images of density may be included in addition to those of color and texture, and style images of two or all of density, color, and texture may be included. After the mask Msf0 is processed and the pseudo mask image Mf0 is derived, the operator selects a style image with the desired color and texture from the style image list 122 before selecting the execution button 29. Here, it is assumed that the color style image 123B is selected. When the operator selects the execution button 29 after selecting the style image, the pseudo image derivation unit 23 derives a pseudo image Gf0 containing rectal cancer in a color corresponding to the selected style image.
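The effect of a density style can be pictured with a minimal sketch that matches the mean and standard deviation of the pixel values inside the mask to those of a style image. This statistic matching is an assumption made purely for illustration and is not the derivation performed by the pseudo image derivation unit 23, which the disclosure does not detail here. The function name `match_density` and all sample data are hypothetical.

```python
import numpy as np

def match_density(image, mask, style_image):
    """Shift and scale intensities inside `mask` to the style image's mean and std."""
    out = image.astype(np.float64).copy()
    region = out[mask]
    if region.size == 0:
        return out  # nothing to restyle
    std = region.std()
    scale = style_image.std() / std if std > 0 else 1.0
    out[mask] = (region - region.mean()) * scale + style_image.mean()
    return out

rng = np.random.default_rng(0)
image = rng.normal(100.0, 10.0, size=(16, 16))  # toy stand-in for a pseudo image
mask = np.zeros((16, 16), dtype=bool)
mask[4:12, 4:12] = True                         # toy stand-in for the lesion mask
style = rng.normal(160.0, 5.0, size=(8, 8))     # toy stand-in for a density style image
styled = match_density(image, mask, style)
```

Pixels outside the mask are left unchanged, so only the masked region takes on the selected style.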

Further, in the above embodiments, the following various processors can be used as the hardware structure of the processing units that execute various kinds of processing, such as the information acquisition unit 21, the pseudo mask derivation unit 22, the pseudo image derivation unit 23, and the learning unit 24 of the image generation device 20, and the image acquisition unit 71, the segmentation unit 72, the discrimination unit 73, and the display control unit 74 of the image processing device 60. As described above, the various processors include, in addition to a CPU, which is a general-purpose processor that executes software (programs) to function as the various processing units, a programmable logic device (PLD), which is a processor whose circuit configuration can be changed after manufacture, such as an FPGA (Field Programmable Gate Array), and a dedicated electric circuit, which is a processor having a circuit configuration designed specifically to execute specific processing, such as an ASIC (Application Specific Integrated Circuit).

One processing unit may be configured with one of these various processors, or with a combination of two or more processors of the same or different types (for example, a combination of a plurality of FPGAs or a combination of a CPU and an FPGA). A plurality of processing units may also be configured with a single processor.

As examples of configuring a plurality of processing units with a single processor, first, there is a form in which one processor is configured with a combination of one or more CPUs and software, as typified by computers such as clients and servers, and this processor functions as the plurality of processing units. Second, there is a form that uses a processor that implements the functions of an entire system including the plurality of processing units with a single IC (Integrated Circuit) chip, as typified by a system on chip (SoC). In this way, the various processing units are configured using one or more of the above various processors as their hardware structure.

Furthermore, more specifically, an electric circuit (circuitry) in which circuit elements such as semiconductor elements are combined can be used as the hardware structure of these various processors.

1, 2 Computer
3 Imaging device
4 Image storage server
5 Network
11, 61 CPU
12A Image generation program
12B Learning program
13, 63 Storage
14, 64 Display
15, 65 Input device
16, 66 Memory
17, 67 Network I/F
18, 68 Bus
20 Image processing device
21 Information acquisition unit
22 Pseudo mask derivation unit
23 Pseudo image derivation unit
24 Learning unit
26, 102 Pull-down menu
27 Processing button
28, 103 Conversion button
30 Rectum
31 Mucosal layer
32 Submucosal layer
33 Muscularis propria
34 Subserosal layer
35 Rectal cancer
40, 90, 100, 105, 106, 107, 120 Mask processing screen
41, 91 Three-dimensional model
41A Added region
42, 92 Scale
42A, 92A Knob
43 Infiltrated lymph node
44 Vascular invasion
50 Generator
51 Encoder
52 Decoder
53 Discriminator
62 Image processing program
71 Image acquisition unit
72 Segmentation unit
72A Segmentation model
73 Discrimination unit
73A Discriminant model
74 Display control unit
80 Display screen
81 Discrimination result
101 Condition list
122 Style image list
123A to 123C, 124A to 124C Style image
C0 Center of gravity
C2 to C5 Point
G0 Original image
Gf0 Pseudo image
M0 Mask image
M1 to M8, Ms0, Msf0, Msf1 Mask
Mf0 Pseudo mask image
S0 Training data
S1 Learning image
S2 Learning mask image
S3 Pseudo image
T0 Target image
TF0 Discrimination result
TM0 Mask image
z0 Latent representation

Claims

An image generation device comprising at least one processor,
wherein the processor:
acquires an original image and a mask image in which a mask is applied to each of one or more regions respectively representing one or more objects, including a target object, in the original image;
derives a pseudo mask image by processing the mask in the mask image; and
derives, based on the original image and the pseudo mask image, a pseudo image that has a region based on the mask included in the pseudo mask image and has the same representation form as the original image.

The image generation device according to claim 1, wherein the pseudo mask image and the pseudo image are used as training data for training a segmentation model that segments the object included in an image.

The image generation device according to claim 2, wherein the processor accumulates the pseudo mask image and the pseudo image as the training data.

The image generation device according to claim 1, wherein the processor derives the pseudo mask image from which the pseudo image can be generated so as to include a target object of a class different from the class indicated by the target object.

The image generation device according to claim 1, wherein, for a medical image, the processor derives the pseudo mask image by processing the mask, based on a lesion shape evaluation index used as an evaluation index in clinical practice, so that at least one of the shape and the degree of progression of a lesion differs from that of the lesion included in the original image.

The image generation device according to claim 1, wherein the processor derives the pseudo mask image by processing the mask until a normal organ takes on a shape that is evaluated as a lesion based on a clinical measurement index for medical images.

The image generation device according to claim 1, wherein the processor refers to at least one style image having a predetermined density, color, or texture, and generates the pseudo image having a density, color, or texture according to the style image.

The image generation device according to claim 1, wherein the processor receives an instruction regarding the degree of processing of the mask, and derives the pseudo mask image by processing the mask based on the instruction.

The image generation device according to claim 8, wherein the processor receives, as the instruction regarding the degree of processing, a designation of the position of an end point of the mask after processing and a designation of a processing amount.

The image generation device according to claim 8, wherein the processor receives the instruction regarding the degree of processing of the mask in accordance with a preset constraint condition.

The image generation device according to claim 1, wherein, in a case where the original image includes a plurality of the objects and partial regions of the target object and of another object other than the target object are in an inclusion relationship with each other, the mask image has, in the region in the inclusion relationship, a mask different from that applied to regions not in the inclusion relationship.

The image generation device according to claim 11, wherein, in a case where the other object in the inclusion relationship is an object that is fixed in the original image, the processor derives the pseudo mask image by processing the mask applied to the target object so as to conform to the shape of the mask applied to the fixed object.

The image generation device according to claim 1, wherein, in a case where the original image is a three-dimensional image, the processor derives the pseudo mask image by processing the mask while maintaining the three-dimensional continuity of the mask applied to the region of the target object.

The image generation device according to claim 1, wherein the original image is a three-dimensional medical image, and
the target object is a lesion included in the medical image.

The image generation device according to claim 14, wherein the medical image includes a rectum of a human body,
the target object is rectal cancer, and the other object other than the target object is at least one of the mucosal layer of the rectum, the submucosal layer of the rectum, the muscularis propria of the rectum, the subserosal layer of the rectum, and a background other than these.

The image generation device according to claim 14, wherein the medical image includes a joint of a human body,
the target object is a bone forming the joint, and the other object other than the target object is a background other than the bone forming the joint.

A learning device comprising at least one processor,
wherein the processor
constructs a segmentation model that segments a region of one or more objects, including a target object, included in an input image, by performing machine learning using, as training data, a plurality of sets of pseudo images and pseudo mask images generated by the image generation device according to claim 1.

The learning device according to claim 17, wherein the processor
further constructs the segmentation model by performing machine learning using a plurality of sets of original images and mask images as training data.

A segmentation model constructed by the learning device according to claim 17.

An image processing device comprising at least one processor,
wherein the processor derives a mask image in which one or more objects included in a target image to be processed are masked, by segmenting a region of the one or more objects, including a target object, included in the target image using the segmentation model according to claim 19.

The image processing device according to claim 20, wherein the processor discriminates the class of the target object masked in the mask image using a discriminant model that discriminates the class of the target object included in the mask image.

An image generation method comprising:
acquiring an original image and a mask image in which a mask is applied to each of one or more regions respectively representing one or more objects, including a target object, in the original image;
deriving a pseudo mask image by processing the mask in the mask image; and deriving, based on the original image and the pseudo mask image, a pseudo image that has a region based on the mask included in the pseudo mask image and has the same representation form as the original image.

A learning method comprising constructing a segmentation model that segments a region of one or more objects, including a target object, included in an input image, by performing machine learning using, as training data, a plurality of sets of pseudo images and pseudo mask images generated by the image generation method according to claim 22.

An image processing method comprising deriving a mask image in which one or more objects included in a target image to be processed are masked, by segmenting a region of the one or more objects, including a target object, included in the target image using the segmentation model according to claim 19.

An image generation program causing a computer to execute:
a procedure of acquiring an original image and a mask image in which a mask is applied to each of one or more regions respectively representing one or more objects, including a target object, in the original image; a procedure of deriving a pseudo mask image by processing the mask in the mask image; and
a procedure of deriving, based on the original image and the pseudo mask image, a pseudo image that has a region based on the mask included in the pseudo mask image and has the same representation form as the original image.

A learning program causing a computer to execute a procedure of constructing a segmentation model that segments a region of one or more objects, including a target object, included in an input image, by performing machine learning using, as training data, a plurality of sets of pseudo images and pseudo mask images generated by the image generation method according to claim 22.

An image processing program causing a computer to execute a procedure of deriving a mask image in which one or more objects included in a target image to be processed are masked, by segmenting a region of the one or more objects, including a target object, included in the target image using the segmentation model according to claim 19.