JP5845154B2

JP5845154B2 - Biological model generation device, biological model generation method, cytometer, and program

Info

Publication number: JP5845154B2
Application number: JP2012176378A
Authority: JP
Inventors: 瑞樹蔦; 純一杉山; 晋川崎; かおり藤田; 正俊吉村; 佐々木　康彦; 康彦佐々木; 三雄武井; 石川　淳; 淳石川; 英樹中本; 裕介渡邊; 竹内　郁雄; 郁雄竹内
Original assignee: National Agriculture and Food Research Organization; Hitachi Power Solutions Co Ltd
Current assignee: National Agriculture and Food Research Organization; Hitachi Power Solutions Co Ltd
Priority date: 2012-08-08
Filing date: 2012-08-08
Publication date: 2016-01-20
Anticipated expiration: 2032-08-08
Also published as: JP2014035255A

Description

本発明は、生体数モデル生成装置、生体数モデル生成方法、サイトメーター、および、プログラムに関する。 The present invention relates to a biological number model generation device, a biological number model generation method, a cytometer, and a program.

従来、食品等の品質を管理するため、培養法等を用いて微生物モニタリングが行われている。培養法では、食品等の希釈液を培地に添加し、培養の結果得られたコロニーの数によって、微生物の濃度を調査している。 Conventionally, in order to control the quality of foods and the like, microbial monitoring is performed using a culture method or the like. In the culture method, a diluent such as food is added to a culture medium, and the concentration of microorganisms is examined based on the number of colonies obtained as a result of the culture.

ここで、培養法は正確であるが結果が出るまでに数日を要するため、微生物汚染の有無が確定するまで食品等の製品を出荷できず、その間に製品品質が劣化するというジレンマを抱えている。そのため、近年、フローサイトメトリーによって迅速に微生物モニタリングを行う装置が開発されている。 Here, the culture method is accurate, but it takes several days for results to be obtained, so we cannot ship products such as food until the presence or absence of microbial contamination is determined, and we have the dilemma that product quality deteriorates during that time. Yes. Therefore, in recent years, an apparatus for rapidly monitoring microorganisms by flow cytometry has been developed.

非特許文献１に記載の細菌数計測装置では、自家蛍光や非特異的吸着による蛍光と区別するために、全菌用蛍光色素２種類と死菌用蛍光色素１種類を用いて染色したサンプルをマイクロ流路に流しながら蛍光を時系列に沿って検出し、時系列の検出データに基づいて、同時点において全菌用蛍光色素の蛍光が観測され且つ死菌用蛍光色素の蛍光が観察されなかった場合に生菌と判定することが開示されている。 In the bacterial count measuring apparatus described in Non-Patent Document 1, in order to distinguish from autofluorescence and fluorescence due to non-specific adsorption, samples stained with two types of fluorescent dyes for whole bacteria and one type of fluorescent dye for killed bacteria are used. Fluorescence is detected in time series while flowing through the microchannel. Based on the time series detection data, fluorescence of the fluorescent dye for all bacteria is observed at the same time, and fluorescence of the fluorescent dye for killed bacteria is not observed. It is disclosed that it is determined as a viable bacteria in the case of

佐々木康彦・武井三雄著「カセット式迅速細菌数計測装置「カセットラボ（Ｒ）ＯＮＥ」の紹介」月刊ＨＡＣＣＰ２０１２年６月号，ｐｐ．３８〜４５Yasuhiko Sasaki & Mitsuo Takei “Introduction of cassette type rapid bacteria count device“ Cassette Lab (R) ONE ”” monthly HACCP June 2012 issue, pp. 38-45

しかしながら、従来のサイトメトリー法による生菌数の計測技術においては、迅速な細菌数の計測が可能となるものの、培養法による正確な実測値との間には未だ測定数の誤差が大きいという問題点を有していた。 However, while the conventional technique for measuring the number of viable cells by cytometry can quickly measure the number of bacteria, there is still a problem that there is still a large error in the number of measured cells between the actual measured values by the culture method. Had a point.

本発明は、上記問題点に鑑みてなされたもので、正確かつ迅速に生体数を計測することができる生体数モデル生成装置、生体数モデル生成方法、サイトメーター、および、プログラムを提供することを目的とする。 The present invention has been made in view of the above problems, and provides a living body number model generating device, a living body number model generating method, a cytometer, and a program capable of accurately and quickly measuring the living body number. Objective.

このような目的を達成するため、本発明の生体数モデル生成装置は、記憶部と制御部を少なくとも備えた生体数モデル生成装置において、上記記憶部は、生体の生死に応じて上記生体との相互作用の程度とそれに応じた発光強度が異なる蛍光標識を用いたサイトメトリーにより得られた、検出強度データを記憶する検出強度記憶手段、を備え、上記制御部は、上記検出強度データに基づいて、検出強度の区分毎の頻度を示すヒストグラムを生成するヒストグラム生成手段と、上記ヒストグラムの各区分における頻度値を説明変数として多変量解析を行い、得られた上記説明変数に対する各係数を用いて上記生体の有無または生体数を判定する数式である生体数モデルを生成する多変量解析手段と、を備えたことを特徴とする。 In order to achieve such an object, the living body number model generation device of the present invention is a living body number model generation device including at least a storage unit and a control unit, wherein the storage unit is connected to the living body according to the life or death of the living body. Detection intensity storage means for storing detection intensity data obtained by cytometry using fluorescent labels having different degrees of interaction and emission intensity corresponding thereto, and the control unit is based on the detection intensity data , A histogram generating means for generating a histogram indicating the frequency of each detected intensity section, and multivariate analysis using the frequency value in each section of the histogram as an explanatory variable, and using each coefficient for the obtained explanatory variable And multivariate analysis means for generating a biological number model that is a mathematical expression for determining the presence or absence of the living body or the number of living bodies.

また、本発明の生体数モデル生成装置は、上記記載の生体数モデル生成装置において、上記検出強度記憶手段は、複数の蛍光標識を用いたサイトメトリーにより得られた、上記各蛍光標識が発する蛍光の検出強度を示す上記検出強度データを記憶し、上記ヒストグラム生成手段は、上記検出強度データに基づいて、上記区分毎の頻度を示すヒストグラムを上記複数の蛍光標識において生成し、上記多変量解析手段は、上記蛍光標識の区分毎の頻度を上記説明変数とすることを特徴とする。 Further, the living body number model generating apparatus of the present invention is the living body number model generating apparatus described above, wherein the detected intensity storage means is a fluorescence emitted by each of the fluorescent labels obtained by cytometry using a plurality of fluorescent labels. The detected intensity data indicating the detected intensity of the image is stored, and the histogram generating means generates a histogram indicating the frequency for each of the categories based on the detected intensity data in the plurality of fluorescent labels, and the multivariate analyzing means Is characterized in that the frequency for each category of the fluorescent label is used as the explanatory variable.

また、本発明の生体数モデル生成装置は、上記記載の生体数モデル生成装置において、上記検出強度記憶手段は、複数の蛍光標識を用いたサイトメトリーにより得られた、上記各蛍光標識が発する蛍光の検出強度を示す上記検出強度データを記憶し、上記ヒストグラム生成手段は、上記検出強度データに基づいて、上記複数の蛍光標識のうち一つの上記蛍光標識の上記検出強度が所定の範囲にある場合の、他の上記蛍光標識の上記区分毎の頻度を示すヒストグラムを生成することを特徴とする。 Further, the living body number model generating apparatus of the present invention is the living body number model generating apparatus described above, wherein the detected intensity storage means is a fluorescence emitted by each of the fluorescent labels obtained by cytometry using a plurality of fluorescent labels. When the detection intensity data indicating the detection intensity of the plurality of fluorescent labels is stored in the predetermined range based on the detected intensity data, the histogram generation means stores the detected intensity data A histogram showing the frequency of each of the other fluorescent labels for each of the sections is generated.

また、本発明の生体数モデル生成装置は、上記記載の生体数モデル生成装置において、上記検出強度記憶手段は、複数の蛍光標識を用いたサイトメトリーにより得られた、上記各蛍光標識が発する蛍光の検出強度を示す上記検出強度データを記憶し、上記ヒストグラム生成手段は、同時に計測された上記蛍光標識の上記検出強度データ間で加減乗除し、加減乗除した値の区分毎の頻度を示すヒストグラムを生成することを特徴とする。 Further, the living body number model generating apparatus of the present invention is the living body number model generating apparatus described above, wherein the detected intensity storage means is a fluorescence emitted by each of the fluorescent labels obtained by cytometry using a plurality of fluorescent labels. The detected intensity data indicating the detected intensity is stored, and the histogram generating means adds / subtracts / divides between the detected intensity data of the fluorescent label simultaneously measured, and displays a histogram indicating the frequency for each division of the added / subtracted / divided values. It is characterized by generating.

また、本発明の生体数モデル生成装置は、上記記載の生体数モデル生成装置において、上記制御部は、上記多変量解析手段により得られた上記生体数モデルについて実測値との誤差を検証しながら、繰り返し上記ヒストグラム生成手段により生成される上記ヒストグラムの上記区分の間隔が変更されるよう制御することにより、上記生体数モデルの最適化を行うモデル最適化手段、を更に備えたことを特徴とする。 Further, the biological number model generation device of the present invention is the biological number model generation device described above, wherein the control unit verifies an error from an actual measurement value for the biological number model obtained by the multivariate analysis unit. The apparatus further comprises model optimization means for optimizing the biological number model by controlling so that the interval between the sections of the histogram repeatedly generated by the histogram generation means is changed. .

また、本発明の生体数モデル生成装置は、上記記載の生体数モデル生成装置において、上記ヒストグラム生成手段は、上記検出強度データに基づいて、上記検出強度を対数に変換して、変換した対数の区分毎の頻度を示すヒストグラムを生成することを特徴とする。 Further, the biological number model generation device of the present invention is the biological number model generation device described above, wherein the histogram generation means converts the detected intensity into a logarithm based on the detected intensity data, and converts the logarithm of the converted logarithm. A histogram showing the frequency for each category is generated.

また、本発明の生体数モデル生成装置は、上記記載の生体数モデル生成装置において、上記ヒストグラム生成手段は、生成した上記ヒストグラムをスペクトルとして、中心化、尺度化、平滑化、微分、および／または標準化の処理を行うことを特徴とする。 Further, the biological number model generation device of the present invention is the biological number model generation device described above, wherein the histogram generation means uses the generated histogram as a spectrum for centering, scaling, smoothing, differentiation, and / or It is characterized by performing standardization processing.

また、本発明の生体数モデル生成装置は、上記記載の生体数モデル生成装置において、上記多変量解析手段は、教師なしのパターン認識手法、教師ありの判別分析手法、または、教師ありの回帰分析手法を用いて、上記多変量解析を行うことを特徴とする。 Further, the biological number model generation device of the present invention is the biological number model generation device described above, wherein the multivariate analysis means includes an unsupervised pattern recognition method, a supervised discriminant analysis method, or a supervised regression analysis. The above-described multivariate analysis is performed using a technique.

また、本発明の生体数モデル生成方法は、記憶部と制御部を少なくとも備えたコンピュータにおいて実行される生体数モデル生成方法であって、上記記憶部は、生体の生死に応じて上記生体との相互作用の程度とそれに応じた発光強度が異なる蛍光標識を用いたサイトメトリーにより得られた、検出強度データを記憶する検出強度記憶手段、を備え、上記制御部において実行される、上記検出強度データに基づいて、検出強度の区分毎の頻度を示すヒストグラムを生成するヒストグラム生成ステップと、上記ヒストグラムの各区分における頻度値を説明変数として多変量解析を行い、得られた上記説明変数に対する各係数を用いて上記生体の有無または生体数を判定する数式である生体数モデルを生成する多変量解析ステップと、を含むことを特徴とする。 Further, the living body number model generating method of the present invention is a living body number model generating method executed in a computer including at least a storage unit and a control unit, and the storage unit is connected to the living body according to life or death of the living body. Detection intensity storage means for storing detection intensity data obtained by cytometry using fluorescent labels having different degrees of interaction and emission intensity corresponding to the degree of interaction, and the detection intensity data executed in the control unit Based on the histogram generation step for generating a histogram indicating the frequency of each detected intensity section, and the multivariate analysis using the frequency value in each section of the histogram as an explanatory variable. And using a multivariate analysis step to generate a biological number model that is a mathematical formula for determining the presence or absence of the biological body or the number of living bodies. And features.

また、本発明のサイトメーターは、上記記載された生体数モデル生成方法により生成された生体数モデルを用いて、上記検出強度データから上記生体数または上記生体の有無を判定するサイトメーターであることを特徴とする。 Moreover, the cytometer of the present invention is a cytometer that determines the number of living organisms or the presence or absence of the living organism from the detected intensity data using the living organism number model generated by the above-described method for generating the living organism number model. It is characterized by.

また、本発明のプログラムは、記憶部と制御部を少なくとも備えたコンピュータに実行させるためのプログラムであって、上記記憶部は、生体の生死に応じて上記生体との相互作用の程度とそれに応じた発光強度が異なる蛍光標識を用いたサイトメトリーにより得られた、検出強度データを記憶する検出強度記憶手段、を備え、上記制御部において、上記検出強度データに基づいて、検出強度の区分毎の頻度を示すヒストグラムを生成するヒストグラム生成ステップと、上記ヒストグラムの各区分における頻度値を説明変数として多変量解析を行い、得られた上記説明変数に対する各係数を用いて上記生体の有無または生体数を判定する数式である生体数モデルを生成する多変量解析ステップと、を実行させることを特徴とする。 The program according to the present invention is a program for causing a computer including at least a storage unit and a control unit to execute the program, and the storage unit determines the degree of interaction with the living body according to the life and death of the living body, and accordingly. Detection intensity storage means for storing detection intensity data obtained by cytometry using fluorescent labels having different emission intensities, and in the control unit, based on the detection intensity data, A histogram generation step for generating a histogram indicating the frequency, multivariate analysis is performed using the frequency value in each section of the histogram as an explanatory variable, and the presence / absence or the number of living organisms is determined using each coefficient for the obtained explanatory variable. And performing a multivariate analysis step of generating a biological number model as a mathematical expression to be determined.

この発明によれば、生体の生死に応じて生体との相互作用の程度とそれに応じた発光強度が異なる蛍光標識を用いたサイトメトリーにより得られた、検出強度データに基づいて、検出強度の区分毎の頻度を示すヒストグラムを生成し、ヒストグラムの各区分における頻度値を説明変数として多変量解析を行い、得られた説明変数に対する各係数を用いて生体の有無または生体数を判定する数式である生体数モデルを生成する。これにより、本発明は、サイトメトリーによる正確かつ迅速に生体数を計測することができる生体数モデルを提示することができるという効果を奏する。 According to this invention, the classification of the detection intensity based on the detection intensity data obtained by the cytometry using the fluorescent label in which the degree of the interaction with the living body and the emission intensity corresponding to the living body depending on the life or death of the living body are different. This is a mathematical formula for generating a histogram showing the frequency of each, performing multivariate analysis using the frequency value in each section of the histogram as an explanatory variable, and determining the presence or the number of living organisms using each coefficient for the obtained explanatory variable. Generate a number model. Thereby, this invention has the effect that the biological number model which can measure a biological number correctly and rapidly by cytometry can be shown.

また、本発明によれば、上記において、複数の蛍光標識を用いたサイトメトリーにより得られた、各蛍光標識が発する蛍光の検出強度を示す検出強度データに基づいて、区分毎の頻度を示すヒストグラムを複数の蛍光標識において生成し、蛍光標識の区分毎の頻度を説明変数として多変量解析を行う。これにより、本発明は、複数の蛍光標識を検出した検出強度によるヒストグラムを一つのスペクトルとみなして多変量解析を行うことにより、複数の連続量の検出強度データを活かしたまま解析を行って、検出率を向上させることができるという効果を奏する。 Further, according to the present invention, in the above, based on the detection intensity data indicating the detection intensity of the fluorescence emitted by each fluorescent label, obtained by cytometry using a plurality of fluorescent labels, the histogram indicating the frequency for each category Are generated in a plurality of fluorescent labels, and multivariate analysis is performed using the frequency of each fluorescent label as an explanatory variable. Thereby, the present invention performs analysis while taking advantage of a plurality of continuous amounts of detection intensity data by performing a multivariate analysis by regarding a histogram based on detection intensities obtained by detecting a plurality of fluorescent labels as one spectrum, There is an effect that the detection rate can be improved.

また、本発明によれば、上記において、複数の蛍光標識を用いたサイトメトリーにより得られた、各蛍光標識が発する蛍光の検出強度を示す検出強度データに基づいて、複数の蛍光標識のうち一つの蛍光標識の検出強度が所定の範囲にある場合の、他の蛍光標識の区分毎の頻度を示すヒストグラムを生成する。これにより、本発明は、死菌用の蛍光が閾値以下等である場合のデータを用いて、推定精度を向上させることができるという効果を奏する。 Further, according to the present invention, in the above, one of the plurality of fluorescent labels is obtained based on the detection intensity data obtained by cytometry using the plurality of fluorescent labels and indicating the detection intensity of the fluorescence emitted by each fluorescent label. When the detection intensity of one fluorescent label is within a predetermined range, a histogram is generated that indicates the frequency for each of the other fluorescent label sections. Thereby, this invention has the effect that estimation accuracy can be improved using the data in case the fluorescence for killed bacteria is below a threshold value.

また、本発明によれば、上記において、複数の蛍光標識を用いたサイトメトリーにより得られた、各蛍光標識が発する蛍光の検出強度を示す検出強度データに基づいて、同時に計測された蛍光標識の検出強度データ間で加減乗除し、加減乗除した値の区分毎の頻度を示すヒストグラムを生成する。これにより、本発明は、異なるセンサ間のデータを組み合せて、推定精度を向上させることができるという効果を奏する。 Further, according to the present invention, in the above, based on the detection intensity data obtained by cytometry using a plurality of fluorescent labels and indicating the detection intensity of the fluorescence emitted by each fluorescent label, the fluorescent labels simultaneously measured Addition / subtraction / division / division between the detected intensity data, and a histogram indicating the frequency of each of the divided values is generated. Thereby, this invention produces the effect that the estimation precision can be improved combining the data between different sensors.

また、本発明によれば、上記において、多変量解析により得られた生体数モデルについて実測値との誤差を検証しながら、繰り返しヒストグラムの区分の間隔が変更されるよう制御することにより、生体数モデルの最適化を行う。これにより、本発明は、細かい区分間隔のデータを用いて解析を行う場合、性質が同じ情報が違うものとみなされるリスクが高まる一方で、粗い区分間隔のデータを用いて解析を行う場合は、性質が異なる情報が同じとみなされるリスクが高まるというジレンマの間で、ヒストグラムの区分の間隔について最適化を図ることができるという効果を奏する。 Further, according to the present invention, in the above, the number of living bodies is controlled by changing the interval between the sections of the repeated histogram while verifying an error from the actual measurement value of the living body number model obtained by the multivariate analysis. Optimize the model. Thereby, the present invention increases the risk that the information having the same property is regarded as different when performing analysis using the data of the fine segment intervals, while performing analysis using the data of the coarse segment intervals, There is an effect that it is possible to optimize the interval of the histogram division between the dilemmas that the risk that information of different properties is regarded as the same is increased.

また、本発明によれば、上記において、検出強度データに基づいて、検出強度を対数に変換して、変換した対数の区分毎の頻度を示すヒストグラムを生成するので、対数表現にてヒストグラム区分の間隔を設定できるという効果を奏する。 Further, according to the present invention, in the above, based on the detected intensity data, the detected intensity is converted into a logarithm, and a histogram indicating the frequency of each converted logarithm section is generated. The effect is that the interval can be set.

また、本発明によれば、上記において、生成したヒストグラムをスペクトルとして、中心化、尺度化、平滑化、微分、および／または標準化の処理を行うので、スペクトルに含まれるノイズを低減したり、スペクトルに含まれる情報を強調したり、異なるサンプル間のスペクトルの尺度を合わせる等の効果を奏する。 Further, according to the present invention, in the above, since the generated histogram is used as a spectrum, centering, scaling, smoothing, differentiation, and / or standardization processing is performed, so that noise included in the spectrum can be reduced or spectrum can be reduced. Effects such as emphasizing the information contained in the image and adjusting the spectrum scale between different samples.

また、本発明によれば、上記において、教師なしのパターン認識手法、教師ありの判別分析手法、または、教師ありの回帰分析手法を用いて多変量解析を行うことができ、特に、主成分分析、クラスター分析、ＰＬＳ判別分析、ＳＩＭＣＡ、ＳＶＭ判別分析、重回帰分析、ＰＬＳ回帰分析、ＳＶＭ回帰分析、または、局所重みつき回帰により、多変量解析を行うことができるという効果を奏する。 Further, according to the present invention, in the above, multivariate analysis can be performed using an unsupervised pattern recognition method, a supervised discriminant analysis method, or a supervised regression analysis method. It is possible to perform multivariate analysis by cluster analysis, PLS discriminant analysis, SIMCA, SVM discriminant analysis, multiple regression analysis, PLS regression analysis, SVM regression analysis, or local weighted regression.

図１は、本実施形態が適用される生体数モデル生成装置１００の構成の一例を示すブロック図である。FIG. 1 is a block diagram illustrating an example of a configuration of a biological number model generation apparatus 100 to which the present exemplary embodiment is applied. 図２は、マイクロ流路を用いた蛍光標識済生体の蛍光検出方法を模式的に示す図である。FIG. 2 is a diagram schematically showing a fluorescence detection method for a fluorescently labeled living body using a microchannel. 図３は、本実施の形態における生体数モデル生成装置１００の処理の一例を示すフローチャートである。FIG. 3 is a flowchart illustrating an example of processing of the biological number model generation device 100 according to the present embodiment. 図４は、ヒストグラム生成部１０２ｂによる検出強度データからのヒストグラムの変換処理を模式的に示す図である。FIG. 4 is a diagram schematically illustrating a process of converting a histogram from detected intensity data by the histogram generation unit 102b. 図５は、複数の蛍光標識におけるヒストグラムデータがある場合に実行される結合処理を模式的に示す図である。FIG. 5 is a diagram schematically showing a combining process executed when there is histogram data for a plurality of fluorescent labels. 図６は、本実施形態により生成された生体数モデルおよび従来法による推定値と、培養法による実測値との誤差を示す図である。FIG. 6 is a diagram illustrating an error between the biological number model generated according to the present embodiment, an estimated value by the conventional method, and an actually measured value by the culture method. 図７は、本実施例によるデータの流れの一例を示す図である。FIG. 7 is a diagram illustrating an example of a data flow according to the present embodiment. 図８は、チャンネルＡの電圧値のみをヒストグラムに用いた場合のヒストグラム生成処理により得られた結合後のヒストグラムとヒストグラムの各区分に対する係数値を示す図である。FIG. 8 is a diagram showing the combined histogram obtained by the histogram generation process when only the voltage value of channel A is used for the histogram, and the coefficient value for each section of the histogram. 図９は、チャンネルＡの電圧値のみをヒストグラムに用いた場合のヒストグラム生成処理により得られた生体数モデルまたは従来法による推定値と実測値とを比較したグラフ図である。FIG. 9 is a graph comparing the biological number model obtained by the histogram generation processing when only the voltage value of channel A is used for the histogram or the estimated value by the conventional method and the actually measured value. 図１０は、チャンネルＡおよびＢにおける電圧値をヒストグラムに用いた場合のヒストグラム生成処理により得られた結合後のヒストグラムとヒストグラムの各区分に対する係数値を示す図である。FIG. 10 is a diagram showing the combined histogram obtained by the histogram generation process when the voltage values in the channels A and B are used for the histogram, and the coefficient values for each section of the histogram. 図１１は、チャンネルＡおよびＢにおける電圧値をヒストグラムに用いた場合のヒストグラム生成処理により得られた生体数モデルまたは従来法による推定値と実測値とを比較したグラフ図である。FIG. 11 is a graph comparing the biological number model obtained by the histogram generation processing when the voltage values in the channels A and B are used for the histogram or the estimated value by the conventional method and the actually measured value. 図１２は、チャンネルＡおよびＣにおける電圧値をヒストグラムに用いた場合のヒストグラム生成処理により得られた結合後のヒストグラムとヒストグラムの各区分に対する係数値を示す図である。FIG. 12 is a diagram showing the combined histogram obtained by the histogram generation processing when the voltage values in the channels A and C are used for the histogram, and the coefficient values for each section of the histogram. 図１３は、チャンネルＡおよびＣにおける電圧値をヒストグラムに用いた場合のヒストグラム生成処理により得られた生体数モデルまたは従来法による推定値と実測値とを比較したグラフ図である。FIG. 13 is a graph comparing the biological model obtained by the histogram generation process when the voltage values in the channels A and C are used for the histogram or the estimated value by the conventional method and the actually measured value. 図１４は、全てのチャンネルＡ，Ｂ，Ｃにおける電圧値をヒストグラムに用いた場合のヒストグラム生成処理により得られた結合後のヒストグラムとヒストグラムの各区分に対する係数値を示す図である。FIG. 14 is a diagram showing the combined histogram obtained by the histogram generation process when the voltage values in all the channels A, B, and C are used for the histogram, and the coefficient values for each section of the histogram. 図１５は、全てのチャンネルＡ，Ｂ，Ｃにおける電圧値をヒストグラムに用いた場合のヒストグラム生成処理により得られた生体数モデルまたは従来法による推定値と実測値とを比較したグラフ図である。FIG. 15 is a graph comparing the biological model obtained by the histogram generation process when the voltage values in all the channels A, B, and C are used in the histogram or the estimated value by the conventional method and the actually measured value. 図１６は、チャンネルＡおよびＢの電圧値のうち、Ｂ＝０（死菌の蛍光が検出されない）場合のデータをヒストグラムに用いた場合のヒストグラム生成処理により得られた結合後のヒストグラムとヒストグラムの各区分に対する係数値を示す図である。FIG. 16 shows the combined histogram and histogram obtained by the histogram generation process when the data of channel A and B when B = 0 (the fluorescence of dead bacteria is not detected) is used for the histogram. It is a figure which shows the coefficient value with respect to each division. 図１７は、チャンネルＡおよびＢの電圧値のうち、Ｂ＝０（死菌の蛍光が検出されない）場合のデータをヒストグラムに用いた場合のヒストグラム生成処理により得られた生体数モデルまたは従来法による推定値と実測値とを比較したグラフ図である。FIG. 17 shows the biological number model obtained by the histogram generation process when the data of the case where B = 0 (the fluorescence of dead bacteria is not detected) among the voltage values of the channels A and B is used for the histogram, or the conventional method. It is the graph which compared the estimated value and the measured value. 図１８は、チャンネルＡにおける電圧値を同時に計測されたチャンネルＣの電圧値で除した値（Ａ÷Ｃ）をヒストグラムに用いた場合のヒストグラム生成処理により得られた結合後のヒストグラムとヒストグラムの各区分に対する係数値を示す図である。FIG. 18 shows each of the combined histogram and the histogram obtained by the histogram generation process when a value (A ÷ C) obtained by dividing the voltage value in channel A by the voltage value of channel C measured simultaneously is used as the histogram. It is a figure which shows the coefficient value with respect to a division. 図１９は、チャンネルＡにおける電圧値を同時に計測されたチャンネルＣの電圧値で除した値（Ａ÷Ｃ）をヒストグラムに用いた場合のヒストグラム生成処理により得られた生体数モデルまたは従来法による推定値と実測値とを比較したグラフ図である。FIG. 19 shows a biological number model obtained by histogram generation processing when a value (A ÷ C) obtained by dividing the voltage value of channel A by the voltage value of channel C simultaneously measured (A ÷ C) is estimated by the histogram generation processing or the conventional method. It is the graph which compared the value and the measured value. 図２０は、チャンネルＡにおける電圧値を同時に計測されたチャンネルＢの電圧値で減じた値（Ａ−Ｂ）をヒストグラムに用いた場合のヒストグラム生成処理により得られた結合後のヒストグラムとヒストグラムの各区分に対する係数値を示す図である。FIG. 20 shows each of the combined histogram and the histogram obtained by the histogram generation process in the case where the value (AB) obtained by subtracting the voltage value in channel A by the voltage value of channel B measured simultaneously is used as the histogram. It is a figure which shows the coefficient value with respect to a division. 図２１は、チャンネルＡにおける電圧値を同時に計測されたチャンネルＢの電圧値で減じた値（Ａ−Ｂ）をヒストグラムに用いた場合のヒストグラム生成処理により得られた生体数モデルまたは従来法による推定値と実測値とを比較したグラフ図である。FIG. 21 shows a biological number model obtained by histogram generation processing or a conventional method estimated when a value (AB) obtained by subtracting the voltage value of channel A by the voltage value of channel B measured simultaneously is used for the histogram. It is the graph which compared the value and the measured value. 図２２は、チャンネルＡにおける電圧値に同時に計測されたチャンネルＣの電圧値を加えた値（Ａ＋Ｃ）をヒストグラムに用いた場合のヒストグラム生成処理により得られた結合後のヒストグラムとヒストグラムの各区分に対する係数値を示す図である。FIG. 22 shows the combined histogram obtained by the histogram generation process in the case where the value (A + C) obtained by adding the voltage value of channel C simultaneously measured to the voltage value of channel A is used for the histogram, and the respective sections of the histogram. It is a figure which shows a coefficient value. 図２３は、チャンネルＡにおける電圧値に同時に計測されたチャンネルＣの電圧値を加えた値（Ａ＋Ｃ）をヒストグラムに用いた場合のヒストグラム生成処理により得られた生体数モデルまたは従来法による推定値と実測値とを比較したグラフ図である。FIG. 23 shows the biological number model obtained by the histogram generation processing when the value (A + C) obtained by adding the voltage value of channel C simultaneously measured to the voltage value of channel A is used for the histogram, or an estimated value by the conventional method. It is the graph which compared the measured value. 図２４は、チャンネルＡにおける電圧値に同時に計測されたチャンネルＣの電圧値を乗じた値（Ａ×Ｃ）をヒストグラムに用いた場合のヒストグラム生成処理により得られた結合後のヒストグラムとヒストグラムの各区分に対する係数値を示す図である。FIG. 24 shows a histogram of combined histograms and histograms obtained by histogram generation processing in the case where a value (A × C) obtained by multiplying the voltage value of channel A simultaneously with the voltage value of channel C is used for the histogram. It is a figure which shows the coefficient value with respect to a division. 図２５は、チャンネルＡにおける電圧値に同時に計測されたチャンネルＣの電圧値を乗じた値（Ａ×Ｃ）をヒストグラムに用いた場合のヒストグラム生成処理により得られた生体数モデルまたは従来法による推定値と実測値とを比較したグラフ図である。FIG. 25 shows a biological number model obtained by histogram generation processing or a conventional method estimated when a value (A × C) obtained by multiplying the voltage value of channel A simultaneously with the voltage value of channel C is used for the histogram. It is the graph which compared the value and the measured value. 図２６は、チャンネルＡ，Ｂ，Ｃにおける電圧値を様々に組み合わせた値をヒストグラムに用いた場合のヒストグラム生成処理により得られた結合後のヒストグラムとヒストグラムの各区分に対する係数値を示す図である。FIG. 26 is a diagram showing the combined histogram obtained by the histogram generation process when values obtained by variously combining the voltage values in the channels A, B, and C are used for the histogram, and coefficient values for each section of the histogram. . 図２７は、チャンネルＡ，Ｂ，Ｃにおける電圧値を様々に組み合わせた値をヒストグラムに用いた場合のヒストグラム生成処理により得られた生体数モデルまたは従来法による推定値と実測値とを比較したグラフ図である。FIG. 27 is a graph comparing a biological number model obtained by a histogram generation process when values obtained by variously combining voltage values in channels A, B, and C are used in a histogram, or an estimated value obtained by a conventional method and an actual measurement value. FIG. 図２８は、本実施例における最適化処理の一例を示すフローチャートである。FIG. 28 is a flowchart illustrating an example of optimization processing in the present embodiment. 図２９は、検出強度データである電圧データから細かい区分間隔でヒストグラムを生成した場合と、粗い区分間隔でヒストグラムを生成した場合の違いを模式的に示した図である。FIG. 29 is a diagram schematically showing a difference between a case where a histogram is generated with fine segment intervals from voltage data which is detected intensity data and a case where a histogram is generated with coarse segment intervals. 図３０は、チャンネルＢの電圧が検出されない場合（Ｂ＝０）の、チャンネルＡおよびＣの電圧値をヒストグラムに用いて本実施形態を適用した場合に得られた誤差を示す等値図である。FIG. 30 is an isometric view showing errors obtained when the present embodiment is applied using the voltage values of channels A and C in the histogram when the voltage of channel B is not detected (B = 0). . 図３１は、生菌数推定誤差が最小の場合のヒストグラムと、生菌数推定誤差が最大の場合のヒストグラムとを比較した図である。FIG. 31 is a diagram comparing a histogram when the viable cell count estimation error is minimum and a histogram when the viable cell count estimation error is maximum. 図３２は、生菌数推定誤差が最小の場合の生体数モデル、および、生菌数推定誤差が最大の場合の生体数モデルによる推定値と実測値との誤差を示すグラフ図である。FIG. 32 is a graph showing the error between the estimated value and the actual measurement value of the living body number model when the viable cell number estimation error is minimum and the living body number model when the viable cell number estimation error is maximum. 図３３は、ヒストグラムの各サンプリング間隔において、モデル作成時と検証時における生菌数推定誤差を示すグラフ図である。FIG. 33 is a graph showing the viable cell count estimation error at the time of model creation and verification at each sampling interval of the histogram. 図３４は、図３３において、誤差が最も大きい場合の生体数モデルと、誤差が最も小さい場合の数理モデルのそれぞれにおいて、従来法と比較したグラフ図である。FIG. 34 is a graph comparing the conventional method with the biological model when the error is the largest in FIG. 33 and the mathematical model when the error is the smallest. 図３５は、前処理を行わなかった場合のヒストグラム（左図）と生成した生体数モデルによる推定値の誤差を示すグラフ図（右図）である。FIG. 35 is a graph (right diagram) showing an error between a histogram (left diagram) when no preprocessing is performed and an estimated value based on the generated biological number model. 図３６は、中心化（ｍｅａｎｃｅｎｔｅｒｉｎｇ）の前処理を行った場合のヒストグラム（左図）と生成した生体数モデルによる推定値の誤差を示すグラフ図（右図）である。FIG. 36 is a graph (right diagram) showing a histogram (left diagram) in the case where the preprocessing of centering is performed (left diagram) and an error of an estimated value based on the generated number model. 図３７は、尺度化（ｓｃａｌｉｎｇ）の前処理を行った場合のヒストグラム（左図）と生成した生体数モデルによる推定値の誤差を示すグラフ図（右図）である。FIG. 37 is a graph (right diagram) showing a histogram (left diagram) in the case of performing preprocessing for scaling (left diagram) and an error of an estimated value based on the generated number model. 図３８は、平滑化（ｓｍｏｏｔｈｉｎｇ）の前処理を行った場合のヒストグラム（左図）と生成した生体数モデルによる推定値の誤差を示すグラフ図（右図）である。FIG. 38 is a graph (right diagram) showing a histogram (left diagram) in the case of performing smoothing preprocessing and an error of an estimated value based on the generated number model. 図３９は、一次微分（ｆｉｒｓｔｄｅｒｉｖａｔｉｖｅ）の前処理を行った場合のヒストグラム（左図）と生成した生体数モデルによる推定値の誤差を示すグラフ図（右図）である。FIG. 39 is a graph (right diagram) showing an error in an estimated value based on a histogram (left diagram) and a generated biological number model when pre-processing of first derivative (first derivative) is performed. 図４０は、二次微分（ｓｅｃｏｎｄｄｅｒｉｖａｔｉｖｅ）の前処理を行った場合のヒストグラム（左図）と生成した生体数モデルによる推定値の誤差を示すグラフ図（右図）である。FIG. 40 is a graph (right diagram) showing an error of an estimated value based on a histogram (left diagram) and a generated biological number model when preprocessing of second derivative (second derivative) is performed. 図４１は、標準化（Ｎｏｒｍａｌｉｚａｔｉｏｎ）の前処理を行った場合のヒストグラム（左図）と生成した生体数モデルによる推定値の誤差を示すグラフ図（右図）である。FIG. 41 is a graph (right diagram) showing an error between a histogram (left diagram) and an estimated value based on the generated number model when normalization is performed. 図４２は、標準化と二次微分と中心化と尺度化の前処理を組み合わせて行った場合のヒストグラム（左図）と生成した生体数モデルによる推定値の誤差を示すグラフ図（右図）である。FIG. 42 is a graph (left diagram) and a graph (right diagram) showing an error in an estimated value by a generated biological number model when the standardization, second-order differentiation, centering, and scaling preprocessing are performed in combination. is there. 図４３は、主成分分析により多変量解析を行った実施例を示す図である。FIG. 43 is a diagram illustrating an example in which multivariate analysis is performed by principal component analysis. 図４４は、クラスター分析により多変量解析を行った実施例を示す図である。FIG. 44 is a diagram showing an example in which multivariate analysis was performed by cluster analysis. 図４５は、ＰＬＳ判別分析により多変量解析を行った実施例を示す図である。FIG. 45 is a diagram illustrating an example in which multivariate analysis is performed by PLS discriminant analysis. 図４６は、ＳＩＭＣＡにより多変量解析を行った実施例を示す図である。FIG. 46 is a diagram showing an example in which multivariate analysis was performed by SIMCA. 図４７は、ＳＶＭ回帰分析により多変量解析を行った実施例を示す図である。FIG. 47 is a diagram showing an example in which multivariate analysis was performed by SVM regression analysis. 図４８は、ＬＷＲ（局所重みつき回帰）により多変量解析を行った実施例を示す図である。FIG. 48 is a diagram illustrating an example in which multivariate analysis is performed by LWR (local weighted regression).

以下に、本発明にかかる生体数モデル生成装置、生体数モデル生成方法、サイトメーター、および、プログラムの実施の形態を図面に基づいて詳細に説明する。なお、この実施の形態によりこの発明が限定されるものではない。 Hereinafter, embodiments of a biological number model generation device, a biological number model generation method, a cytometer, and a program according to the present invention will be described in detail with reference to the drawings. Note that the present invention is not limited to the embodiments.

特に以下の実施の形態においては、本発明を食品等の製品の品質管理分野に適用した例について説明する場合があるが、この場合に限られず、医療や臨床検査など全ての技術分野において、同様に適用することができる。 In particular, in the following embodiment, an example in which the present invention is applied to the quality control field of products such as foods may be described. However, the present invention is not limited to this, and the same applies to all technical fields such as medical care and clinical examinations. Can be applied to.

［本実施の形態の概要］
以下、本発明にかかる実施形態の概要について説明し、その後、本実施形態の構成および処理等について詳細に説明する。 [Outline of this embodiment]
Hereinafter, an outline of an embodiment according to the present invention will be described, and then a configuration, processing, and the like of the present embodiment will be described in detail.

本実施の形態は、概略的に、以下の基本的特徴を有する。まず、本実施形態は、生体の生死に応じて生体との相互作用の程度とそれに応じた発光強度が異なる蛍光標識を用いたサイトメトリーにより得られた、検出強度データを取得する。ここで、本実施形態において、「生体」とは、細菌や真菌等の菌類や微生物の個体、単細胞生物、ならびに、多細胞生物における個々の細胞等のほか、ミトコンドリアやクロロフィルなどの準生物的生体をも含み、サイトメトリーによって検出可能なあらゆる生体をいう。また、「蛍光標識」は、生体の生死に応じて生体との相互作用の程度とそれに応じた発光強度が異なる標識である。ここで、蛍光標識は、複数の蛍光標識によって、生体の生死が判定できるような組み合せであってもよく、例えば、特開２０１１−９２１０４号公報に記載の多重染色法を用いて細胞または微生物（生菌）を標識してもよい。一例として、全菌（生菌および死菌）を染色する色素としてＤＡＰＩ（４´，６−ｄｉａｍｉｄｉｎｏ−２−ｐｈｅｎｙｌｉｎｄｏｌｅ）等を用いてもよく、死菌を染色する色素としてＰＩ（ＰｒｏｐｉｄｉｕｍＩｏｄｉｄｅ）等を用いてもよい。また、「サイトメトリー」は、フローサイトメトリーやイメージングサイトメトリー等のように個々の生体を検出可能な手法であり、フローサイトメーターやイメージングサイトメーター等により実施される。 The present embodiment schematically has the following basic features. First, in the present embodiment, detection intensity data obtained by cytometry using a fluorescent label having a degree of interaction with a living body and a light emission intensity corresponding to the degree of interaction with the living body depending on whether the living body is alive or dead is acquired. Here, in this embodiment, the “living body” means fungi such as bacteria and fungi, individuals of microorganisms, unicellular organisms, individual cells in multicellular organisms, and quasibiological organisms such as mitochondria and chlorophyll. And any organism that can be detected by cytometry. Further, the “fluorescent label” is a label that differs in the degree of interaction with the living body and the light emission intensity corresponding to the living or dead body. Here, the fluorescent label may be a combination that can determine whether a living organism is alive or not by a plurality of fluorescent labels. For example, cells or microorganisms (using a multiple staining method described in JP2011-92104A) may be used. Live bacteria) may be labeled. As an example, DAPI (4 ′, 6-diamidino-2-phenylindole) or the like may be used as a dye that stains all bacteria (live and dead), and PI (Propidium Iodide) or the like as a dye that stains dead bacteria. May be used. Further, “cytometry” is a technique capable of detecting an individual living body, such as flow cytometry and imaging cytometry, and is performed by a flow cytometer, an imaging cytometer, or the like.

そして、本実施形態は、検出強度データに基づいて、検出強度の区分毎の頻度を示すヒストグラムを生成する。ヒストグラムの作成方法については詳細を後述する。 And this embodiment produces | generates the histogram which shows the frequency for every division of detection intensity based on detection intensity data. Details of the method of creating the histogram will be described later.

そして、本実施形態は、ヒストグラムの各区分における頻度値を説明変数として多変量解析を行い、得られた説明変数に対する各係数を用いて生体の有無または生体数を判定する数式である生体数モデルを生成する。なお、「生体数モデル」との用語は、生体数を判定するモデルに限定されるものではなく、生体の有無を判定するモデルを意味してもよいものである。ここで、本実施形態は、多変量解析手段により得られた生体数モデルについて実測値との誤差を検証しながら、繰り返し、上記のヒストグラムの区分の間隔を変更することにより、生体数モデルの最適化を行ってもよい。 The present embodiment performs a multivariate analysis using the frequency value in each section of the histogram as an explanatory variable, and uses a coefficient for the obtained explanatory variable to determine the presence or absence of the living body or the living body number model. Is generated. The term “biological number model” is not limited to a model for determining the number of living bodies, but may mean a model for determining the presence or absence of a living body. Here, in the present embodiment, the number model obtained by the multivariate analysis unit is repeatedly verified while verifying an error with the actual measurement value, and the interval of the above-mentioned histogram segment is changed to optimize the number model. May also be performed.

以上が本実施形態の概要である。なお、以下に本実施形態をコンピュータ等の制御部を備えた装置によって実施する例について説明するが、本発明は、これに限られず、本実施形態の方法を人によって実施してもよいものである。 The above is the outline of the present embodiment. In addition, although the example which implements this embodiment by the apparatus provided with control parts, such as a computer, is demonstrated below, this invention is not restricted to this, The method of this embodiment may be implemented by a person. is there.

［生体数モデル生成装置の構成］
次に、本発明にかかる本実施形態の生体数モデル生成装置の構成について図１を参照して説明する。図１は、本実施形態が適用される生体数モデル生成装置１００の構成の一例を示すブロック図であり、該構成のうち本発明に関係する部分のみを概念的に示している。 [Configuration of Biological Model Generation Device]
Next, the configuration of the living body number model generation device of the present embodiment according to the present invention will be described with reference to FIG. FIG. 1 is a block diagram showing an example of the configuration of a biological number model generation apparatus 100 to which the present embodiment is applied, and conceptually shows only the portion related to the present invention in the configuration.

図１において生体数モデル生成装置１００は、概略的に、制御部１０２と通信制御インターフェース部１０４と入出力制御インターフェース部１０８と記憶部１０６を備えて構成される。ここで、制御部１０２は、生体数モデル生成装置１００の全体を統括的に制御するＣＰＵ等である。また、入出力制御インターフェース部１０８は、測定部１１０や入力部１１２や出力部１１４等に接続されるインターフェースである。また、記憶部１０６は、各種のデータベースやテーブルなどを格納する装置である。これら生体数モデル生成装置１００の各部は任意の通信路を介して通信可能に接続されている。 In FIG. 1, the biological number model generation apparatus 100 schematically includes a control unit 102, a communication control interface unit 104, an input / output control interface unit 108, and a storage unit 106. Here, the control unit 102 is a CPU or the like that comprehensively controls the whole body number model generation apparatus 100. The input / output control interface unit 108 is an interface connected to the measurement unit 110, the input unit 112, the output unit 114, and the like. The storage unit 106 is a device that stores various databases and tables. Each unit of the biological number model generation apparatus 100 is connected to be communicable via an arbitrary communication path.

記憶部１０６に格納される各種のデータベースやファイル（検出強度ファイル１０６ａ、ヒストグラムファイル１０６ｂ、および、生体数モデルファイル１０６ｃ等）は、固定ディスク装置等のストレージ手段である。例えば、記憶部１０６は、各種処理に用いる各種のプログラムやテーブルやファイルやデータベース等を格納する。 Various databases and files (detection intensity file 106a, histogram file 106b, biological number model file 106c, etc.) stored in the storage unit 106 are storage means such as a fixed disk device. For example, the storage unit 106 stores various programs, tables, files, databases, and the like used for various processes.

これら記憶部１０６の各構成要素のうち、検出強度ファイル１０６ａは、生体の生死に応じて生体との相互作用の程度とそれに応じた発光強度が異なる蛍光標識を行ったサンプルに対してサイトメトリーにより得られた検出強度データを記憶する検出強度記憶手段である。例えば、検出強度ファイル１０６ａは、測定部１１０により取得された生データを検出強度データとして記憶してもよく、ネットワーク３００を介して外部機器２００より受信した検出強度データを記憶してもよい。一例として、検出強度ファイル１０６ａに記憶される検出強度データは、検出手段の性質に従って時系列の時間順や走査された順等の検出順に並べられた、蛍光強度の検出データである。例えば、フローサイトメーターにより得られた検出強度データは、一例として、検出する波長の範囲が予め設定された各チャンネルにおける、時刻に対応付けられた電圧データ（光の強さに比例した電気パルスに応じた強度データ等）であってもよい。また、イメージングサイトメーターにより得られた検出強度データは、一例として、二次元平面上で走査した座標（ｘ座標，ｙ座標）に対応付けた所望の波長範囲の検出値データであってもよい。 Among these components of the storage unit 106, the detection intensity file 106a is obtained by cytometry on a sample that has been subjected to fluorescent labeling that differs in the degree of interaction with the living body and the emission intensity corresponding to the living body's life or death. Detection intensity storage means for storing the obtained detection intensity data. For example, the detection intensity file 106a may store raw data acquired by the measurement unit 110 as detection intensity data, or may store detection intensity data received from the external device 200 via the network 300. As an example, the detection intensity data stored in the detection intensity file 106a is fluorescence intensity detection data arranged in a detection order such as a time-series time order or a scanned order according to the nature of the detection means. For example, the detected intensity data obtained by the flow cytometer is, for example, voltage data (electric pulse proportional to the light intensity) associated with the time in each channel in which the wavelength range to be detected is preset. And corresponding intensity data). Moreover, the detection intensity data obtained by the imaging cytometer may be detection value data of a desired wavelength range associated with coordinates (x coordinate, y coordinate) scanned on a two-dimensional plane as an example.

また、ヒストグラムファイル１０６ｂは、後述するヒストグラム生成部１０２ｂにより生成されたヒストグラムのデータを記憶するヒストグラムデータ記憶手段である。例えば、ヒストグラムファイル１０６ｂに記憶されるヒストグラムデータは、ヒストグラムの各区分に対応付けた頻度値のデータである。 The histogram file 106b is a histogram data storage unit that stores histogram data generated by a histogram generation unit 102b described later. For example, the histogram data stored in the histogram file 106b is frequency value data associated with each section of the histogram.

また、生体数モデルファイル１０６ｃは、後述する多変量解析部１０２ｃにより生成された生体数モデルを記憶する生体数モデル記憶手段である。ここで、生体数モデルファイル１０６ｃに記憶される生体数モデルを示すデータは、ヒストグラムの区分（チャンネル種別や幅や範囲等）に対応する係数を定義付けたデータであってもよい。 The biological number model file 106c is a biological number model storage unit that stores a biological number model generated by a multivariate analysis unit 102c described later. Here, the data indicating the biological number model stored in the biological number model file 106c may be data in which coefficients corresponding to the histogram classification (channel type, width, range, etc.) are defined.

また、図１において、入出力制御インターフェース部１０８は、測定部１１０や入力部１１２や出力部１１４等の制御を行う。ここで、出力部１１４としては、モニタ（家庭用テレビを含む）の他、スピーカを用いることができる（なお、以下においては出力部１１４をモニタとして記載する場合がある）。また、入力部１１２としては、キーボードや、マウス、マイク等を用いることができる。 In FIG. 1, the input / output control interface unit 108 controls the measurement unit 110, the input unit 112, the output unit 114, and the like. Here, as the output unit 114, in addition to a monitor (including a home television), a speaker can be used (hereinafter, the output unit 114 may be described as a monitor). As the input unit 112, a keyboard, a mouse, a microphone, or the like can be used.

また、測定部１１０は、所定の波長範囲の光の強度を検出する光強度検出手段である。例えば、測定部１１０は、蛍光標識により発せられる蛍光の波長を含む範囲の強度を検出してもよい。なお、測定部１１０は、光の強度を光電効果によって電圧パルスとして検出してもよく、例えば、蛍光標識された生体が検出領域を通過したことにより検出された電圧パルスのピーク値やパルス面積を検出強度として検出してもよい。一例として、測定部１１０は、フローサイトメーターであってもよく、イメージングサイトメーターであってもよい。例えば、イメージングサイトメーターとしての測定部１１０は、生体が含まれうるサンプルが散布された平面に対してレーザー光を照射しつつ平面上を走査し、サンプルが発する蛍光の強度データを走査線に沿って取得してもよい。また、フローサイトメーターとしての測定部１１０は、個々の生体を浮遊させた浮遊液を筒状のシース液中に流してレーザー光上に通過させることにより、レーザー光により励起された蛍光の強度を時系列に検出してもよい。このほか、フローサイトメーターとしての測定部１１０は、生体の浮遊液をマイクロ流路に流して励起光上を通過させて蛍光の強度を検出してもよい。ここで、図２は、マイクロ流路を用いた蛍光標識済生体の蛍光検出方法を模式的に示す図である。 The measurement unit 110 is a light intensity detection unit that detects the intensity of light in a predetermined wavelength range. For example, the measurement unit 110 may detect the intensity in a range including the wavelength of the fluorescence emitted by the fluorescent label. The measurement unit 110 may detect the intensity of light as a voltage pulse by the photoelectric effect. For example, the peak value or the pulse area of the voltage pulse detected when the fluorescently labeled biological body passes through the detection region. You may detect as detection intensity. As an example, the measurement unit 110 may be a flow cytometer or an imaging cytometer. For example, the measurement unit 110 as an imaging cytometer scans a plane on which a sample containing a living body is scattered while irradiating a laser beam, and the intensity data of the fluorescence emitted by the sample along the scanning line. May be obtained. In addition, the measurement unit 110 as a flow cytometer allows the suspended liquid in which individual living bodies are suspended to flow in a cylindrical sheath liquid and pass over the laser light, thereby increasing the intensity of fluorescence excited by the laser light. You may detect in time series. In addition, the measurement unit 110 as a flow cytometer may detect the intensity of fluorescence by flowing a biological suspension through the microchannel and passing it over the excitation light. Here, FIG. 2 is a diagram schematically showing a fluorescence detection method for a fluorescently labeled living body using a microchannel.

マイクロキャピラリー方式のフローサイトメトリーでは、シース液を用いるフローサイトメトリーとは異なり、シーケンス制御された空気圧によって、図２に示すように、マイクロキャピラリー中のマイクロ流路に浮遊液を送り込むことによりフロー（流れ）を形成させる。ここで、一例として、励起光は２波長同時に照射し、蛍光は３波長を同時に各チャンネルで検出してもよい。マイクロキャピラリー方式のフローサイトメーターの一例として、非特許文献１に記載の日立エンジニアリング・アンド・サービス社製カセットラボ（Ｒ）ＯＮＥを測定部１１０として用いてもよい。 Unlike flow cytometry using a sheath liquid, microcapillary flow cytometry uses a sequence-controlled air pressure to send a floating liquid into a microchannel in a microcapillary as shown in FIG. Flow). Here, as an example, two wavelengths of excitation light may be irradiated simultaneously, and three wavelengths of fluorescence may be detected simultaneously in each channel. As an example of a microcapillary flow cytometer, a cassette laboratory (R) ONE manufactured by Hitachi Engineering & Service Co., Ltd. described in Non-Patent Document 1 may be used as the measurement unit 110.

また、図１において、制御部１０２は、ＯＳ（ＯｐｅｒａｔｉｎｇＳｙｓｔｅｍ）等の制御プログラムや、各種の処理手順等を規定したプログラム、および、所要データを格納するための内部メモリを有する。そして、制御部１０２は、これらのプログラム等により、種々の処理を実行するための情報処理を行う。制御部１０２は、機能概念的に、データ取得部１０２ａ、ヒストグラム生成部１０２ｂ、多変量解析部１０２ｃ、最適化部１０２ｄ、および、生体数判定部１０２ｆを備える。 In FIG. 1, the control unit 102 has an internal memory for storing a control program such as an OS (Operating System), a program that defines various processing procedures, and necessary data. And the control part 102 performs the information processing for performing various processes by these programs. The control unit 102 includes a data acquisition unit 102a, a histogram generation unit 102b, a multivariate analysis unit 102c, an optimization unit 102d, and a living body number determination unit 102f in terms of functional concept.

このうち、データ取得部１０２ａは、検出強度データを取得する検出強度データ取得手段である。例えば、データ取得部１０２ａは、測定部１１０を介して測定された検出強度データを取得してもよい。なお、測定部１１０を介して測定された生データにアナログ／デジタル変換等のデータ処理が必要な場合は、データ取得部１０２ａは、公知のデータ加工手法を用いて、測定部１１０を介して測定された生データを検出強度データに変換してもよい。また、データ取得部１０２ａは、ネットワーク３００を介して外部機器２００から検出強度データを受信してもよい。なお、本実施形態において、データ取得部１０２ａは、取得した検出強度データを検出強度ファイル１０６ａに格納する。 Among these, the data acquisition unit 102a is detection intensity data acquisition means for acquiring detection intensity data. For example, the data acquisition unit 102a may acquire detection intensity data measured via the measurement unit 110. In addition, when data processing such as analog / digital conversion is necessary for the raw data measured through the measurement unit 110, the data acquisition unit 102a performs measurement through the measurement unit 110 using a known data processing method. The obtained raw data may be converted into detected intensity data. Further, the data acquisition unit 102 a may receive detection intensity data from the external device 200 via the network 300. In the present embodiment, the data acquisition unit 102a stores the acquired detection intensity data in the detection intensity file 106a.

また、ヒストグラム生成部１０２ｂは、検出強度ファイル１０６ａに記憶された検出強度データに基づいて、検出強度の区分毎の頻度を示すヒストグラムを生成し、生成したヒストグラムのデータをヒストグラムファイル１０６ｂに格納するヒストグラム生成手段である。ここで、ヒストグラム生成部１０２ｂは、複数の蛍光標識に対応する検出強度データに基づいて、検出強度の区分毎の頻度を示すヒストグラムを複数の蛍光標識において生成してもよい。例えば、ヒストグラム生成部１０２ｂは、全菌用蛍光色素および死菌用蛍光色素に対応する２波長の検出強度データに基づいて、それぞれの検出強度の区分毎の頻度を示すヒストグラムを生成して、生成したヒストグラムを結合させてもよい。例えば、５つあるフローサイトメーターのチャンネルは、それぞれ１０２４に区分されたヒストグラムデータが得られるので、ヒストグラム生成部１０２ｂは、それらを一列につなげて１サンプルあたり５１２０の変数となる頻度値を得てもよい。ここで、ヒストグラム生成部１０２ｂは、サイトメーターのチャンネル毎に、すなわち、複数の蛍光標識による検出強度毎に、ヒストグラムの区分の間隔を変更してもよい。 In addition, the histogram generation unit 102b generates a histogram indicating the frequency of each detection intensity based on the detection intensity data stored in the detection intensity file 106a, and stores the generated histogram data in the histogram file 106b. It is a generation means. Here, the histogram generation unit 102b may generate a histogram indicating the frequency of each detected intensity for each of the plurality of fluorescent labels based on the detected intensity data corresponding to the plurality of fluorescent labels. For example, the histogram generation unit 102b generates and generates a histogram indicating the frequency of each detection intensity category based on the two-wavelength detection intensity data corresponding to the whole bacteria fluorescent dye and the dead bacteria fluorescent dye. The histograms may be combined. For example, since the histogram data divided into 1024 is obtained for each channel of five flow cytometers, the histogram generator 102b connects them in a row to obtain a frequency value that is 5120 variables per sample. Also good. Here, the histogram generation unit 102b may change the interval of the histogram division for each channel of the cytometer, that is, for each detection intensity by a plurality of fluorescent labels.

また、ヒストグラム生成部１０２ｂは、複数の蛍光標識に対応する検出強度データに基づいて、複数の蛍光標識のうち一つの蛍光標識の検出強度が所定の範囲にある場合の、他の蛍光標識の区分毎の頻度を示すヒストグラムを生成してもよい。例えば、ヒストグラム生成部１０２ｂは、複数の蛍光標識に対応する検出強度データに基づいて、一方の蛍光標識の検出強度が所定の閾値以下である場合の、他方の蛍光標識の区分毎の頻度を示すヒストグラムを生成してもよい。例えば、生体が生の状態で相互作用し易い蛍光標識と、生体が死の状態で相互作用し易い蛍光標識と、を用いたサイトメトリーにより得られた、各蛍光標識が発する蛍光の検出強度を示す検出強度データについて、ヒストグラム生成部１０２ｂは、後者の蛍光標識の検出強度が所定の閾値以下である場合の、前者の蛍光標識の区分毎の頻度を示すヒストグラムを生成してもよい。他の例として、ヒストグラム生成部１０２ｂは、全菌用蛍光標識および死菌蛍光標識を用いたサイトメトリーにより得られた、各蛍光標識が発する蛍光の検出強度を示す検出強度データに基づいて、死菌用蛍光標識の検出強度が所定の閾値以下である場合の、全菌用蛍光標識の区分毎の頻度を示すヒストグラムを生成してもよい。 Further, the histogram generation unit 102b classifies other fluorescent labels when the detected intensity of one fluorescent label is within a predetermined range based on the detected intensity data corresponding to the plurality of fluorescent labels. You may generate | occur | produce the histogram which shows every frequency. For example, the histogram generation unit 102b indicates the frequency for each category of the other fluorescent label when the detected intensity of one fluorescent label is equal to or lower than a predetermined threshold based on the detected intensity data corresponding to the plurality of fluorescent labels. A histogram may be generated. For example, the detection intensity of fluorescence emitted by each fluorescent label obtained by cytometry using a fluorescent label that easily interacts in a living state and a fluorescent label that easily interacts in a dead state. For the detected intensity data shown, the histogram generator 102b may generate a histogram indicating the frequency of each of the former fluorescent labels when the latter detected intensity of the fluorescent label is equal to or lower than a predetermined threshold. As another example, the histogram generation unit 102b performs death based on detection intensity data indicating the detection intensity of fluorescence emitted by each fluorescent label, which is obtained by cytometry using a fluorescent label for whole bacteria and a fluorescent label for dead bacteria. A histogram may be generated that indicates the frequency for each category of the fluorescent label for all bacteria when the detection intensity of the fluorescent label for bacteria is not more than a predetermined threshold.

また、ヒストグラム生成部１０２ｂは、複数の蛍光標識を用いたサイトメトリーにより得られた、各蛍光標識が発する蛍光の検出強度を示す検出強度データに基づいて、同時に計測された蛍光標識の検出強度データ間で加減乗除し、加減乗除した値の区分毎の頻度を示すヒストグラムを生成してもよい。例えば、全菌用蛍光標識および死菌蛍光標識を用いたサイトメトリーにより得られた、各蛍光標識が発する蛍光の検出強度を示す検出強度データに、ヒストグラム生成部１０２ｂは、全菌用蛍光標識の検出強度を、同時刻に検出された死菌用蛍光標識の検出強度で減算ないし除算した値の区分毎の頻度を示すヒストグラムを生成してもよい。 In addition, the histogram generation unit 102b detects the fluorescence label detection intensity data measured simultaneously based on the detection intensity data indicating the detection intensity of the fluorescence emitted by each fluorescence label, which is obtained by cytometry using a plurality of fluorescence labels. It is also possible to generate a histogram indicating the frequency for each category of the value obtained by adding / subtracting / dividing between the values. For example, the histogram generation unit 102b includes the detection intensity data indicating the detection intensity of the fluorescence emitted by each fluorescent label obtained by cytometry using the fluorescent label for whole bacteria and the dead fluorescent label. You may generate | occur | produce the histogram which shows the frequency for every division | segmentation of the value which subtracted thru | or divided the detection intensity by the detection intensity of the fluorescent label for killed bacteria detected at the same time.

ここで、ヒストグラム生成部１０２ｂは、検出強度データに基づいて、検出強度を対数に変換して、変換した対数の区分毎の頻度を示すヒストグラムを生成してもよい。例えば、ヒストグラム生成部１０２ｂは、検出強度に対して等間隔に区分わけしてヒストグラムを生成するのではなく、検出強度の対数（常用対数や自然対数等）に対して等間隔に区分わけしてヒストグラムを生成してもよい。また、片対数のみならず、両対数としてもよい。例えば、ヒストグラム生成部１０２ｂは、検出強度データに基づいて、横軸を検出強度の対数に、縦軸を頻度値の対数にして両対数のヒストグラムを生成してもよい。 Here, the histogram generation unit 102b may convert the detection intensity into a logarithm based on the detection intensity data, and generate a histogram indicating the frequency for each segment of the converted logarithm. For example, the histogram generation unit 102b does not generate a histogram by dividing the detection intensity at equal intervals, but by dividing the logarithm of the detection intensity (common logarithm, natural logarithm, etc.) at equal intervals. A histogram may be generated. Moreover, it is good also as not only a semilogarithm but a both logarithm. For example, the histogram generation unit 102b may generate a log-log histogram based on the detection intensity data, with the horizontal axis representing the logarithm of the detection intensity and the vertical axis representing the logarithm of the frequency value.

また、ヒストグラム生成部１０２ｂは、生成したヒストグラムに対して、後述する多変量解析部１０２ｃによる多変量解析処理を実行する前に、前処理を行ってもよい。例えば、ヒストグラム生成部１０２ｂは、生成したヒストグラムをスペクトルとして、中心化、尺度化、平滑化、微分、および／または標準化などの前処理を行ってもよい。 The histogram generation unit 102b may perform preprocessing on the generated histogram before executing multivariate analysis processing by the multivariate analysis unit 102c described later. For example, the histogram generation unit 102b may perform preprocessing such as centering, scaling, smoothing, differentiation, and / or standardization using the generated histogram as a spectrum.

また、多変量解析部１０２ｃは、ヒストグラム生成部１０２ｂにより生成されたヒストグラムの各区分における頻度値を説明変数として多変量解析を行い、得られた説明変数に対する各係数を用いて生体の有無（生菌の存在有無など）または生体数（生菌数など）を判定する数式である生体数モデルを生成する多変量解析手段である。例えば、多変量解析部１０２ｃは、ヒストグラムファイル１０６ｂに記憶されたヒストグラムデータの各区分の頻度値を説明変数として多変量解析を行い、その結果得られた各説明変数の係数を、区分番号等に対応付けて生体数モデルファイル１０６ｃに格納してもよい。 In addition, the multivariate analysis unit 102c performs multivariate analysis using the frequency values in each section of the histogram generated by the histogram generation unit 102b as explanatory variables, and uses the coefficients for the obtained explanatory variables to determine the presence / absence of a living body (raw This is a multivariate analysis means for generating a biological number model that is a mathematical expression for determining the presence or absence of bacteria or the number of living organisms (viable bacterial count or the like). For example, the multivariate analysis unit 102c performs multivariate analysis using the frequency value of each section of the histogram data stored in the histogram file 106b as an explanatory variable, and the coefficient of each explanatory variable obtained as a result is used as a section number or the like. Correspondingly, the biological number model file 106c may be stored.

ここで、多変量解析部１０２ｃは、以下の数式（理論式）を用いて生体数モデルを生成してもよい。そして、多変量解析部１０２ｃは、生体数が既知のサンプルの希釈系列による各生体数（実測値）を、上記の理論式にあてはめて、理論式のフィッティングを行ってもよい。すなわち、多変量解析部１０２ｃは、各実測値の生体数に、理論式が最も適合（フィッティング）するように、最小二乗法等を用いて理論式の係数の値を調整して決定してもよい。
Here, the multivariate analysis unit 102c may generate a biological number model using the following mathematical formula (theoretical formula). Then, the multivariate analysis unit 102c may perform fitting of the theoretical formula by applying each biological count (actually measured value) based on a dilution series of a sample with a known biological count to the above theoretical formula. That is, the multivariate analysis unit 102c may adjust and determine the value of the coefficient of the theoretical formula using the least square method or the like so that the theoretical formula is best fitted (fitted) to the number of living bodies of each actually measured value. Good.

ここで、多変量解析部１０２ｃは、教師なしのパターン認識手法、教師ありの判別分析手法、または、教師ありの回帰分析手法を用いて、多変量解析を行ってもよい。例えば、多変量解析部１０２ｃは、主成分分析、クラスター分析、ＰＬＳ（ＰａｒｔｉａｌＬｅａｓｔＳｑｕａｒｅｓ）判別分析、ＳＩＭＣＡ（ＳｏｆｔＩｎｄｅｐｅｎｄｅｎｔＭｏｄｅｌｉｎｇｏｆＣｌａｓｓＡｎａｌｏｇｙ）、ＳＶＭ（ＳｕｐｐｏｒｔＶｅｃｔｏｒＭａｃｈｉｎｅ）判別分析、重回帰分析、ＰＬＳ回帰分析、ＳＶＭ回帰分析、および、局所重みつき回帰等により、多変量解析を行ってもよい。このほか、多変量解析部１０２ｃは、特開２０１０−１８５７１９号公報や、特開２００４−２９４３３７号公報、特開２０１０−２６６３８０号公報、『ＪＭＰによる多変量データ活用術』（ＵＲＬ：ｈｔｔｐ：／／ｇｏｏ．ｇｌ／ＮｊＦ３２）、『Ｒによるデータサイエンス − データ解析の基礎から最新手法まで』（ＵＲＬ：ｈｔｔｐ：／／ｇｏｏ．ｇｌ／ＬｓｚＮｖ）等に記載の公知の多変量解析手法を用いて多変量解析を実施してもよい。このように、多変量解析部１０２ｂは、上述した手法を採用することにより、回帰式や多項式や検量線や判別式等の数式やグラフで表現された生体数モデルを生成してもよい。 Here, the multivariate analysis unit 102c may perform multivariate analysis using an unsupervised pattern recognition method, a supervised discriminant analysis method, or a supervised regression analysis method. For example, the multivariate analysis unit 102c performs principal component analysis, cluster analysis, PLS (Partial Last Squares) discriminant analysis, SIMCA (Soft Independent Modeling of Class Analysis), SVM (Support Vector Machine Regression, PL analysis), Multivariate analysis may be performed by analysis, SVM regression analysis, local weighted regression, or the like. In addition, the multivariate analysis unit 102c is disclosed in Japanese Unexamined Patent Application Publication No. 2010-185719, Japanese Unexamined Patent Application Publication No. 2004-294337, Japanese Unexamined Patent Application Publication No. 2010-266380, “Multivariate Data Utilization Method by JMP” (URL: http: // /Go.gl/NjF32), “Data Science by R-From Basics of Data Analysis to Latest Methods” (URL: http://goo.gl/LszNv), etc. A random analysis may be performed. As described above, the multivariate analysis unit 102b may generate a biological number model represented by a mathematical expression or graph such as a regression equation, a polynomial, a calibration curve, or a discriminant by adopting the above-described method.

また、最適化部１０２ｄは、多変量解析部１０２ｃにより得られた生体数モデルによる推定値と実測値との誤差を検証しながら繰り返し、ヒストグラム生成部１０２ｂによりヒストグラムの区分の間隔が変更されるよう制御することにより、生体数モデルの最適化を行うモデル最適化手段である。ここで、最適化部１０２ｄは、クロスバリデーションやダブルクロスバリデーション等の手法に基づいて、ヒストグラム間隔の最適化を行ってもよい。なお、最適化部１０２ｄは、ヒストグラム間隔の最適化処理を、全チャンネル（全蛍光標識の検出強度）一括で行ってもよく、各々のチャンネル（各蛍光標識の検出強度）個別に最適化してもよい。 In addition, the optimization unit 102d repeats while verifying an error between the estimated value and the actual measurement value obtained from the biological number model obtained by the multivariate analysis unit 102c, and the histogram generation unit 102b changes the interval of the histogram division. It is a model optimizing unit that optimizes the biological number model by controlling. Here, the optimization unit 102d may optimize the histogram interval based on a technique such as cross validation or double cross validation. The optimization unit 102d may perform the histogram interval optimization process for all channels (detection intensities of all fluorescent labels) at once, or may optimize each channel (detection intensity of each fluorescent label) individually. Good.

また、生体数判定部１０２ｆは、多変量解析部１０２ｃにより生成された生体数モデルに基づいて、検出強度データから生体の有無（生菌の存在の有無など）または生体数（生菌数など）を判定する生体数判定手段である。 In addition, the living body number determination unit 102f, based on the living body number model generated by the multivariate analysis unit 102c, detects the presence or absence of living organisms (such as the presence or absence of viable bacteria) or the number of living organisms (such as the number of living bacteria) from the detected intensity data. It is a living body number judging means for judging.

以上が、本生体数モデル生成装置１００の構成の一例である。ここで、生体数モデル生成装置１００は、ルータ等の通信装置および専用線等の有線または無線の通信回線を介して、ネットワーク３００に通信可能に接続されてもよい。この場合、通信制御インターフェース部１０４は、通信回線等に接続されるルータ等の通信装置（図示せず）に接続されるインターフェースであり、生体数モデル生成装置１００とネットワーク３００（またはルータ等の通信装置）との間における通信制御を行う。すなわち、通信制御インターフェース部１０４は、他の端末と通信回線を介してデータを通信する機能を有する。図１において、ネットワーク３００は、生体数モデル生成装置１００と外部機器２００とを相互に接続する機能を有し、例えば、インターネット等である。 The above is an example of the configuration of the living body number model generation apparatus 100. Here, the biological number model generation device 100 may be communicably connected to the network 300 via a communication device such as a router and a wired or wireless communication line such as a dedicated line. In this case, the communication control interface unit 104 is an interface connected to a communication device (not shown) such as a router connected to a communication line or the like, and the biological model generation device 100 and the network 300 (or a communication such as a router). Communication control with the device. That is, the communication control interface unit 104 has a function of communicating data with other terminals via a communication line. In FIG. 1, a network 300 has a function of connecting a biological number model generation apparatus 100 and an external device 200 to each other, such as the Internet.

また、生体数モデル生成装置１００は、測定データやパラメータ等に関する外部データベースや、生体数モデル生成装置として機能させるための外部プログラム等を提供する外部機器２００に、ネットワーク３００を介して通信可能に接続されてもよい。 The biological number model generation apparatus 100 is communicably connected via the network 300 to an external device 200 that provides an external database relating to measurement data, parameters, and the like, an external program for causing the biological number model generation apparatus to function, and the like. May be.

図１において、外部機器２００は、ネットワーク３００を介して、生体数モデル生成装置１００と相互に接続され、利用者に対して測定データや理論式やパラメータの値等に関する外部データベースや、情報処理装置を生体数モデル生成装置として機能させるための標的核酸測定プログラム等の外部プログラム等を提供する機能を有する。ここで、外部機器２００は、ＷＥＢサーバやＡＳＰサーバ等として構成してもよい。また、外部機器２００のハードウェア構成は、一般に市販されるワークステーション、パーソナルコンピュータ等の情報処理装置およびその付属装置により構成してもよい。また、外部機器２００の各機能は、外部機器２００のハードウェア構成中のＣＰＵ、ディスク装置、メモリ装置、入力装置、出力装置、通信制御装置等およびそれらを制御するプログラム等により実現される。 In FIG. 1, an external device 200 is mutually connected to the biological number model generation device 100 via a network 300, and an external database or information processing device regarding measurement data, theoretical formulas, parameter values, etc. to the user. Has a function of providing an external program or the like such as a target nucleic acid measurement program for causing the to function as a biological number model generation device. Here, the external device 200 may be configured as a WEB server, an ASP server, or the like. Further, the hardware configuration of the external device 200 may be configured by an information processing apparatus such as a commercially available workstation or a personal computer, and an accessory device thereof. Each function of the external device 200 is realized by a CPU, a disk device, a memory device, an input device, an output device, a communication control device, and the like in the hardware configuration of the external device 200 and a program for controlling them.

［生体数モデル生成装置１００の処理］
次に、このように構成された本実施の形態における生体数モデル生成装置１００の処理の一例について、以下に図３〜図６を参照して詳細に説明する。ここで、図３は、本実施の形態における生体数モデル生成装置１００の処理の一例を示すフローチャートである。 [Process of Biological Model Generation Device 100]
Next, an example of the process of the biological number model generation device 100 according to the present embodiment configured as described above will be described in detail with reference to FIGS. Here, FIG. 3 is a flowchart showing an example of processing of the biological number model generation apparatus 100 in the present embodiment.

図３に示すように、まず、ヒストグラム生成部１０２ｂは、検出強度ファイル１０６ａに記憶された検出強度データに基づいて、検出強度の区分毎の頻度を示すヒストグラムを生成し、生成したヒストグラムのデータをヒストグラムファイル１０６ｂに格納する（ステップＳＡ−１）。ここで、図４は、ヒストグラム生成部１０２ｂによる検出強度データからのヒストグラムの変換処理を模式的に示す図である。なお、図４の検出強度データは、時刻ｔにおいて検出された、全菌用蛍光標識用の全菌チャンネルにおける電圧値と、死菌用蛍光標識用の死菌チャンネルにおける電圧値を示す。なお、検出強度データにおける判定項目は、死菌チャンネルにおける電圧値が０において全菌チャンネルにおける電圧値が正の値ならば生菌１個と判定する従来法の例を示したものであり、本実施形態を実施するために必須の項目ではない。 As shown in FIG. 3, first, the histogram generation unit 102b generates a histogram indicating the frequency for each category of the detected intensity based on the detected intensity data stored in the detected intensity file 106a, and generates the generated histogram data. Store in the histogram file 106b (step SA-1). Here, FIG. 4 is a diagram schematically showing the conversion processing of the histogram from the detected intensity data by the histogram generation unit 102b. In addition, the detection intensity data of FIG. 4 shows the voltage value in the whole bacteria channel for fluorescent labeling for whole bacteria, and the voltage value in the dead bacteria channel for fluorescent labeling for dead bacteria detected at the time t. The determination item in the detection intensity data is an example of a conventional method for determining that there is one viable cell if the voltage value in the dead cell channel is 0 and the voltage value in the whole cell channel is positive. It is not an indispensable item for carrying out the embodiment.

図４に示すように、一例として、ヒストグラム生成部１０２ｂは、サイトメーターのチャンネル毎に（すなわち、複数の蛍光標識による検出強度毎に）、電圧値の区分ごとに頻度値を示すヒストグラムを生成してもよい。なお、ヒストグラム生成部１０２ｂは、複数の蛍光標識に対応する検出強度データに基づいて、検出強度の区分毎の頻度を示すヒストグラムを複数の蛍光標識において生成することに限られない。例えば、ヒストグラム生成部１０２ｂは、複数の蛍光標識に対応する検出強度データに基づいて、一つの蛍光標識の検出強度が所定の範囲にある場合（例えば、所定の閾値以下である場合や、所定の閾値以上である場合）の、他の蛍光標識の区分毎の頻度を示すヒストグラムを生成してもよい。例えば、図４の検出データの例において、ヒストグラム生成部１０２ｂは、全菌用蛍光標識および死菌蛍光標識を用いたサイトメトリーにより得られた、各蛍光標識が発する蛍光の検出強度を示す検出強度データに基づいて、死菌用蛍光標識の検出強度が所定の閾値以下である場合の（例えば電圧値が０である場合の）、全菌用蛍光標識の区分毎の頻度を示すヒストグラムを生成してもよい。また、ヒストグラム生成部１０２ｂは、複数の蛍光標識を用いたサイトメトリーにより得られた、各蛍光標識が発する蛍光の検出強度を示す検出強度データに基づいて、同時に計測された蛍光標識の検出強度データ間で加減乗除し、加減乗除した値の区分毎の頻度を示すヒストグラムを生成してもよい。例えば、全菌用蛍光標識および死菌蛍光標識を用いたサイトメトリーにより得られた、各蛍光標識が発する蛍光の検出強度を示す検出強度データに、ヒストグラム生成部１０２ｂは、全菌用蛍光標識の検出強度を、同時刻に検出された死菌用蛍光標識の検出強度で減算ないし除算した値の区分毎の頻度を示すヒストグラムを生成してもよい。 As shown in FIG. 4, as an example, the histogram generation unit 102b generates a histogram indicating a frequency value for each voltage value category for each channel of the cytometer (that is, for each detection intensity by a plurality of fluorescent labels). May be. Note that the histogram generation unit 102b is not limited to generating a histogram indicating the frequency of each detected intensity for each of the plurality of fluorescent labels based on the detected intensity data corresponding to the plurality of fluorescent labels. For example, the histogram generation unit 102b determines that the detection intensity of one fluorescent label is within a predetermined range based on detection intensity data corresponding to a plurality of fluorescent labels (for example, when the detection intensity is equal to or lower than a predetermined threshold, You may generate | occur | produce the histogram which shows the frequency for every division | segmentation of another fluorescence label | marker (when it is more than a threshold value). For example, in the detection data example of FIG. 4, the histogram generation unit 102 b detects the detection intensity indicating the detection intensity of the fluorescence emitted by each fluorescent label, which is obtained by cytometry using the fluorescent label for whole bacteria and the fluorescent label for dead bacteria. Based on the data, when the detection intensity of the killing fluorescent label is less than a predetermined threshold (for example, when the voltage value is 0), a histogram indicating the frequency for each category of the fluorescent label for all bacteria is generated. May be. In addition, the histogram generation unit 102b detects the fluorescence label detection intensity data measured simultaneously based on the detection intensity data indicating the detection intensity of the fluorescence emitted by each fluorescence label, which is obtained by cytometry using a plurality of fluorescence labels. It is also possible to generate a histogram indicating the frequency for each category of the value obtained by adding / subtracting / dividing between the values. For example, the histogram generation unit 102b includes the detection intensity data indicating the detection intensity of the fluorescence emitted by each fluorescent label obtained by cytometry using the fluorescent label for whole bacteria and the dead fluorescent label. You may generate | occur | produce the histogram which shows the frequency for every division | segmentation of the value which subtracted thru | or divided the detection intensity by the detection intensity of the fluorescent label for killed bacteria detected at the same time.

なお、ヒストグラム生成部１０２ｂは、検出強度データに基づいて、検出強度を対数に変換して、変換した対数の区分毎の頻度を示すヒストグラムを生成してもよい。また、ヒストグラム生成部１０２ｂは、生成したヒストグラムに対して、次のステップＳＡ−２の多変量解析処理の前に、前処理を行ってもよい。例えば、ヒストグラム生成部１０２ｂは、生成したヒストグラムをスペクトルとして、中心化、尺度化、平滑化、微分、および／または標準化などの前処理を行ってもよい。 Note that the histogram generation unit 102b may convert the detected intensity into a logarithm based on the detected intensity data, and generate a histogram indicating the frequency for each segment of the converted logarithm. The histogram generation unit 102b may perform preprocessing on the generated histogram before the multivariate analysis processing in the next step SA-2. For example, the histogram generation unit 102b may perform preprocessing such as centering, scaling, smoothing, differentiation, and / or standardization using the generated histogram as a spectrum.

つづいて、多変量解析部１０２ｃは、ヒストグラム生成部１０２ｂにより生成されヒストグラムファイル１０６ｂに格納されたデータによるヒストグラムの各区分における頻度値を説明変数として多変量解析を行う（ステップＳＡ−２）。ここで、図５は、複数の蛍光標識におけるヒストグラムデータがある場合に実行される結合処理を模式的に示す図である。 Subsequently, the multivariate analysis unit 102c performs multivariate analysis using the frequency value in each section of the histogram based on the data generated by the histogram generation unit 102b and stored in the histogram file 106b as an explanatory variable (step SA-2). Here, FIG. 5 is a diagram schematically showing a combining process executed when there is histogram data for a plurality of fluorescent labels.

図５に示すように、複数の蛍光標識におけるヒストグラムデータがある場合、多変量解析部１０２ｃは、複数のヒストグラムを結合させて、結合させたヒストグラムの各区分における頻度値を説明変数として多変量解析を行ってもよい。ここで、多変量解析部１０２ｃは、教師なしのパターン認識手法、教師ありの判別分析手法、または、教師ありの回帰分析手法を用いて、多変量解析を行ってもよい。例えば、多変量解析部１０２ｃは、主成分分析、クラスター分析、ＰＬＳ（ＰａｒｔｉａｌＬｅａｓｔＳｑｕａｒｅｓ）判別分析、ＳＩＭＣＡ（ＳｏｆｔＩｎｄｅｐｅｎｄｅｎｔＭｏｄｅｌｉｎｇｏｆＣｌａｓｓＡｎａｌｏｇｙ）、ＳＶＭ（ＳｕｐｐｏｒｔＶｅｃｔｏｒＭａｃｈｉｎｅ）判別分析、重回帰分析、ＰＬＳ回帰分析、ＳＶＭ回帰分析、および、局所重みつき回帰等により、多変量解析を行ってもよい。このほか、多変量解析部１０２ｃは、特開２０１０−１８５７１９号公報や、特開２００４−２９４３３７号公報、特開２０１０−２６６３８０号公報、『ＪＭＰによる多変量データ活用術』（ＵＲＬ：ｈｔｔｐ：／／ｇｏｏ．ｇｌ／ＮｊＦ３２）、『Ｒによるデータサイエンス − データ解析の基礎から最新手法まで』（ＵＲＬ：ｈｔｔｐ：／／ｇｏｏ．ｇｌ／ＬｓｚＮｖ）等に記載の公知の多変量解析手法を用いて多変量解析を実施してもよい。一例として、多変量解析部１０２ｃは、以下の数式（理論式）を用いて生体数モデルを生成してもよい。そして、多変量解析部１０２ｃは、生体数が既知のサンプルの希釈系列による各生体数（実測値）を、上記の理論式にあてはめて、理論式のフィッティングを行ってもよい。すなわち、多変量解析部１０２ｃは、各実測値の生体数に、理論式が最も適合（フィッティング）するように、最小二乗法等を用いて理論式の係数の値を調整して決定してもよい。
As shown in FIG. 5, when there is histogram data for a plurality of fluorescent labels, the multivariate analysis unit 102c combines the plurality of histograms, and uses the frequency value in each section of the combined histograms as an explanatory variable for multivariate analysis. May be performed. Here, the multivariate analysis unit 102c may perform multivariate analysis using an unsupervised pattern recognition method, a supervised discriminant analysis method, or a supervised regression analysis method. For example, the multivariate analysis unit 102c performs principal component analysis, cluster analysis, PLS (Partial Last Squares) discriminant analysis, SIMCA (Soft Independent Modeling of Class Analysis), SVM (Support Vector Machine Regression, PL analysis), Multivariate analysis may be performed by analysis, SVM regression analysis, local weighted regression, or the like. In addition, the multivariate analysis unit 102c is disclosed in Japanese Unexamined Patent Application Publication No. 2010-185719, Japanese Unexamined Patent Application Publication No. 2004-294337, Japanese Unexamined Patent Application Publication No. 2010-266380, “Multivariate Data Utilization Method by JMP” (URL: http: // /Go.gl/NjF32), “Data Science by R-From Basics of Data Analysis to Latest Methods” (URL: http://goo.gl/LszNv), etc. A random analysis may be performed. As an example, the multivariate analysis unit 102c may generate a biological number model using the following mathematical formula (theoretical formula). Then, the multivariate analysis unit 102c may perform fitting of the theoretical formula by applying each biological count (actually measured value) based on a dilution series of a sample with a known biological count to the above theoretical formula. That is, the multivariate analysis unit 102c may adjust and determine the value of the coefficient of the theoretical formula using the least square method or the like so that the theoretical formula is best fitted (fitted) to the number of living bodies of each actually measured value. Good.

そして、多変量解析部１０２ｃは、多変量解析処理により、説明変数に対する各係数を用いて生体の有無（生菌の存在有無など）または生体数（生菌数など）を判定する数式である生体数モデルを生成し、生成した生体数モデルを生体数モデルファイル１０６ｃに格納する（ステップＳＡ−３）。例えば、多変量解析部１０２ｃは、ヒストグラムファイル１０６ｂに記憶されたヒストグラムデータの各区分の頻度値を説明変数とした多変量解析の結果、得られた各説明変数の係数を、区分番号等に対応付けて生体数モデルファイル１０６ｃに格納してもよい。 The multivariate analysis unit 102c is a living body that is a mathematical expression that determines the presence / absence of a living body (such as the presence / absence of living bacteria) or the number of living bodies (such as the number of living bacteria) using each coefficient for the explanatory variable by multivariate analysis processing. A number model is generated, and the generated number model is stored in the number model file 106c (step SA-3). For example, the multivariate analysis unit 102c supports the coefficient of each explanatory variable obtained as a result of the multivariate analysis using the frequency value of each division of the histogram data stored in the histogram file 106b as an explanatory variable, to the division number or the like. In addition, the biological number model file 106c may be stored.

そして、最適化部１０２ｄは、多変量解析部１０２ｃにより得られた生体数モデルによる推定値と実測値との誤差を検証し、ステップＳＡ−１に処理を戻してヒストグラム生成部１０２ｂによりヒストグラムの区分の間隔が変更されるよう制御する（ステップＳＡ−４）。このように、最適化部１０２ｄは、多変量解析部１０２ｃにより生成された生体数モデルの検証を行い、繰り返しヒストグラム区分が変更されるよう制御することにより、実測値との誤差が少なくなるように生体数モデルの最適化を行う。ここで、最適化部１０２ｄは、クロスバリデーションやダブルクロスバリデーション等の手法に基づいて、ヒストグラム間隔の最適化を行ってもよい。なお、最適化部１０２ｄは、ヒストグラム間隔の最適化処理を、全チャンネル（全蛍光標識の検出強度）一括で行ってもよく、各々のチャンネル（各蛍光標識の検出強度）個別に最適化してもよい。 Then, the optimization unit 102d verifies an error between the estimated value and the actual measurement value obtained from the biological number model obtained by the multivariate analysis unit 102c, returns the process to Step SA-1, and classifies the histogram by the histogram generation unit 102b. Is controlled to be changed (step SA-4). As described above, the optimization unit 102d verifies the biological number model generated by the multivariate analysis unit 102c and performs control so that the histogram section is repeatedly changed, so that an error from the actual measurement value is reduced. Optimize the number model. Here, the optimization unit 102d may optimize the histogram interval based on a technique such as cross validation or double cross validation. The optimization unit 102d may perform the histogram interval optimization process for all channels (detection intensities of all fluorescent labels) at once, or may optimize each channel (detection intensity of each fluorescent label) individually. Good.

以上が、本実施の形態における生体数モデル生成装置１００の処理の一例である。このように生成された生体数モデルを用いて、生体数モデル生成装置１００は、生体数判定部１０２ｆの処理により、検出強度データから生体の有無（生菌の存在の有無など）または生体数（生菌数など）を正確かつ迅速に判定することができる。ここで、図６は、本実施形態により生成された生体数モデルおよび従来法による推定値と、培養法による実測値との誤差を示す図である。図６の横軸は、培養法における実測値を示し、縦軸は、各手法による推定値を示しており、傾き１の直線上にあるほど誤差が少ない。図中の菱形は、蛍光の検出感度が高い場合における従来のサイトメトリー法によるプロット、図中の正方形は、蛍光の検出感度が低い場合における従来のサイトメトリー法によるプロット、図中の三角形は、本実施形態によるプロットを示す。 The above is an example of the processing of the biological number model generation apparatus 100 in the present embodiment. Using the living body number model generated in this way, the living body number model generating apparatus 100 performs the processing of the living body number determination unit 102f to detect the presence / absence of a living body (such as the presence / absence of viable bacteria) or the number of living bodies (from the detected intensity data) Viable counts etc.) can be determined accurately and quickly. Here, FIG. 6 is a diagram illustrating an error between the number model generated by the present embodiment and the estimated value by the conventional method and the actually measured value by the culture method. The horizontal axis of FIG. 6 shows the actual measurement value in the culture method, and the vertical axis shows the estimated value by each method, and the error is smaller as the slope is on the straight line. The rhombus in the figure is a plot by the conventional cytometry method when the fluorescence detection sensitivity is high, the square in the figure is the plot by the conventional cytometry method when the fluorescence detection sensitivity is low, and the triangle in the figure is 3 shows a plot according to this embodiment.

図６に示すように、従来のサイトメトリー法では、生菌数が少ない場合に、擬陽性が多く、検出限界は、１０^３個を超える。そのため、生菌の有無を正しく判定できない。また、生菌数が多い場合にも、実測値よりも１〜２桁程度、低く見積もってしまうため、正確な生菌数が推定できない。本実施形態によれば、生菌数が多い場合にも少ない場合にも、プロットは、ほぼ傾き１の直線上にあり、実測値との誤差の少ない正確な生菌数を推定することが可能である。従来のサイトメトリー法においては、図４の判定項目にて示したように、連続量である蛍光強度データ（電圧値）から、生菌か死菌かという離散値の判別を行うために誤差や擬陽性が生じ易かった。本実施形態によれば、ヒストグラムをスペクトルとみなして多変量解析を行うことにより、連続量のデータを活かしたまま解析を行うので、従来のサイトメトリー法よりも誤差や擬陽性が少なく正確な生体数の判定や生体の有無の判定を行うことができ、検出率（推定値／実測値）を向上させることができる。
る。 As shown in FIG. 6, in the conventional cytometry method, when the number of viable bacteria is small, there are many false positives, and the detection limit exceeds 10 ³ . Therefore, the presence or absence of viable bacteria cannot be determined correctly. In addition, even when the number of viable bacteria is large, the number of viable bacteria cannot be estimated accurately because it is estimated to be about 1 to 2 digits lower than the actually measured value. According to this embodiment, whether the number of viable bacteria is large or small, the plot is almost on a straight line with a slope of 1, and it is possible to estimate the exact viable cell count with little error from the actual measurement value. It is. In the conventional cytometry method, as shown in the determination item of FIG. 4, in order to discriminate a discrete value whether live or dead from fluorescent intensity data (voltage value) that is a continuous amount, False positives were likely to occur. According to the present embodiment, the multivariate analysis is performed by regarding the histogram as a spectrum, and the analysis is performed while taking advantage of the continuous amount of data. Therefore, an accurate number of living organisms with fewer errors and false positives than the conventional cytometry method. And the presence / absence of a living body can be determined, and the detection rate (estimated value / measured value) can be improved.
The

［実施例］
つづいて、本実施の形態にかかる実施例について、以下に図７〜図４８を参照して説明する。ここで、図７は、本実施例によるデータの流れの一例を示す図である。 [Example]
Next, examples according to the present embodiment will be described below with reference to FIGS. Here, FIG. 7 is a diagram illustrating an example of a data flow according to the present embodiment.

図７に示すように、サイトメトリーにより得られた元データＭＡ−１は、検出強度データである電圧値データＭＡ−２に変換される。つづいて、電圧値を対数に変換することにより（ＭＡ−３）、電圧値データＭＡ−２は、対数の電圧値データＭＡ−４に変換される。そして、本実施例によりヒストグラム変換され（ＭＡ−５）、電圧値データＭＡ−４は、ヒストグラムＭＡ−６に変換される。 As shown in FIG. 7, the original data MA-1 obtained by cytometry is converted into voltage value data MA-2 that is detected intensity data. Subsequently, by converting the voltage value into a logarithm (MA-3), the voltage value data MA-2 is converted into logarithmic voltage value data MA-4. Then, the histogram is converted (MA-5) according to this embodiment, and the voltage value data MA-4 is converted into a histogram MA-6.

一方、実測値側としては、取り纏め表ＭＡ−７に基づいて、ＣＳＶデータに含まれる生菌数（個）データＭＡ−８を抽出し、実測値としてカウント数ＭＡ−９を準備する。 On the other hand, on the actual measurement value side, based on the summary table MA-7, the viable cell count (number) data MA-8 included in the CSV data is extracted, and the count number MA-9 is prepared as the actual measurement value.

本実施例において、ヒストグラムＭＡ−６もカウント数ＭＡ−９も対数変換を行い（ＭＡ−１０）、それぞれ対数表現のヒストグラムＭＡ−１１とカウント数ＭＡ−１２に変換する。 In this embodiment, both the histogram MA-6 and the count number MA-9 are logarithmically converted (MA-10), and converted to a logarithmic expression histogram MA-11 and a count number MA-12, respectively.

そして、本実施例において、ヒストグラムＭＡ−１１の各区分における頻度値を説明変数として、実測値のカウント数ＭＡ−１２を目的変数として、多変量解析を行う（ＭＡ−１３）。 In this embodiment, multivariate analysis is performed using the frequency value in each section of the histogram MA-11 as an explanatory variable and the actual measurement count MA-12 as an objective variable (MA-13).

そして、本実施例において、多変量解析によって得られた生体数モデルによる推定値と実測値との誤差を計算することにより、生体数モデルの検証を行う（ＭＡ−１４）。例えば、クロスバリデーション法に従って、実測値であるカウント数データＭＡ−１２のうち、多変量解析の教師データとするデータセットと、モデル検証用のデータセットとを、シャッフルしながら複数回、多変量解析とモデル検証を繰り返し行ってもよい。 In this embodiment, the biological number model is verified by calculating an error between the estimated value and the actual measurement value obtained from the biological number model obtained by the multivariate analysis (MA-14). For example, according to the cross-validation method, among the count data MA-12 that are actually measured values, a multivariate analysis is performed a plurality of times while shuffling a data set used as teacher data for multivariate analysis and a data set for model verification. Model validation may be repeated.

そして、本実施例において、多変量解析の結果生成された生体数モデルの推定結果（実測値との誤差評価結果等）を得る（ＭＡ−１５）。 In this embodiment, an estimation result (an error evaluation result from the actual measurement value, etc.) of the biological number model generated as a result of the multivariate analysis is obtained (MA-15).

そして、本実施例において、ヒストグラムの区分の間隔を再設定し（ＭＡ−１６）、上述した処理を繰り返すことにより、生体数モデルの最適化を図る。 In this embodiment, the interval between the histogram segments is reset (MA-16), and the above-described processing is repeated to optimize the biological number model.

以上が、本実施例によるデータの流れの一例を示す図である。つづいて、以下に、本実施形態おける、ヒストグラム生成処理、最適化処理、前処理、および、多変量解析処理の詳細について説明する。なお、以下の実施例において特記しない場合は、多変量解析手法として、ＰＬＳ回帰分析を用いている。 The above is a diagram illustrating an example of a data flow according to the present embodiment. Next, details of the histogram generation process, the optimization process, the preprocess, and the multivariate analysis process in this embodiment will be described. In the following examples, unless otherwise specified, PLS regression analysis is used as a multivariate analysis method.

［ヒストグラム生成処理］
本実施形態におけるヒストグラム生成処理の実施例について、図８〜図２７を参照して説明する。 [Histogram generation processing]
An example of the histogram generation process in the present embodiment will be described with reference to FIGS.

従来のフローサイトメトリー法（以下「従来法」と呼ぶ。）では、生菌用蛍光染色剤ａおよびｃの蛍光を捉えるセンサ（Ａ，Ｃチャンネル）において電圧が生じ、かつ、死菌用蛍光染色剤ｂの蛍光を捉えるセンサ（Ｂチャンネル）に電圧が生じていない場合に（すなわち「ＡａｎｄＣｎｏｔＢ」の場合に）、１つの生菌と判定していた（非特許文献１参照）。 In the conventional flow cytometry method (hereinafter referred to as “conventional method”), voltage is generated in the sensors (A and C channels) that capture the fluorescence of the fluorescent stains a and c for viable bacteria, and the fluorescent stain for dead bacteria is used. When no voltage was generated in the sensor (B channel) that captures the fluorescence of the agent b (that is, in the case of “A and C not B”), it was determined as one viable bacterium (see Non-Patent Document 1).

本実施例では、フローサイトメトリーにおいて計測される各チャンネル（Ａ〜Ｃチャンネル）の電圧値をヒストグラム化して、各チャンネルのヒストグラムを結合する。ここで、図８、図１０、図１２、図１４、図１６、図１８、図２０、図２２、図２４、および、図２６の上図は、各ヒストグラム生成処理により得られた結合後のヒストグラムを示す図である。本実施例では、つづいて、結合したヒストグラムにＰＬＳ回帰を適用し、ヒストグラムから生菌数を算出する数式である生体数モデルを構築する。ここで、本実施例において基本とする生体数モデル（理論式）は、ヒストグラムの各区分における頻度値に係数を乗じて足し合わせる以下の多項式とした。ここで、図８、図１０、図１２、図１４、図１６、図１８、図２０、図２２、図２４、および、図２６の下図は、ＰＬＳ回帰の結果得られた、ヒストグラムの各区分（区分番号ｉ＝１〜ｎ、対象チャンネルＡ〜Ｃ、範囲）に対する係数値（回帰係数）を示す図である。
係数
In this embodiment, the voltage values of the channels (A to C channels) measured in the flow cytometry are histogrammed, and the histograms of the channels are combined. Here, the upper diagrams of FIGS. 8, 10, 12, 14, 16, 18, 20, 20, 24, and 26 show the combined results obtained by the respective histogram generation processes. It is a figure which shows a histogram. In the present embodiment, PLS regression is applied to the combined histogram, and a living body number model that is a mathematical formula for calculating the viable cell count from the histogram is constructed. Here, the living body number model (theoretical formula) as a basis in the present embodiment is the following polynomial that is added by multiplying the frequency value in each section of the histogram by a coefficient. 8, 10, 12, 14, 16, 18, 20, 20, 22, 24, and 26, the lower graphs show the respective sections of the histogram obtained as a result of the PLS regression. It is a figure which shows the coefficient value (regression coefficient) with respect to (division number i = 1-n, object channel AC, range).
coefficient

ここで、図９、図１１、図１３、図１５、図１７、図１９、図２１、図２３、図２５、および、図２７は、各ヒストグラム生成処理により得られた生体数モデルまたは従来法による推定値と実測値とを比較したグラフ図である。なお、生体数モデルの生菌数推定精度は以下の指標で評価した。決定係数は、相関係数を二乗した値であって、実測値（培養法によりカウントした生菌数）と推定値の間に、どれだけ直線性があるかを示しており、値が大きければ大きいほど良い（最大値は１）。誤差の標準偏差は、各サンプルの推定誤差を集計した値であって、以下の式で求められ、値は小さければ小さいほど良い。
9, 11, 13, 15, 17, 19, 21, 23, 25, and 27 are biometric models or conventional methods obtained by the respective histogram generation processes. It is the graph which compared the estimated value by and actual value. In addition, the viable count estimation accuracy of the biological number model was evaluated by the following index. The coefficient of determination is a value obtained by squaring the correlation coefficient, and indicates how much linearity there is between the measured value (the number of viable bacteria counted by the culture method) and the estimated value. Larger is better (maximum value is 1). The standard deviation of the error is a value obtained by summing up the estimation errors of each sample, and is obtained by the following formula. The smaller the value, the better.

ここで、図８および図９は、チャンネルＡの電圧値のみをヒストグラムに用いて本実施例を適用した例を示す図である。このデータセットを用いた場合、従来法では、決定係数＝０．６１１，誤差の標準偏差＝２．５３ｌｏｇ（個／ｍｌ）であるところ、本実施例では、決定係数＝０．６９４，誤差の標準偏差＝１．１５ｌｏｇ（個／ｍｌ）という生菌数推定精度が得られた。 Here, FIGS. 8 and 9 are diagrams illustrating an example in which the present embodiment is applied using only the voltage value of the channel A in the histogram. When this data set is used, in the conventional method, the determination coefficient = 0.611 and the standard deviation of the error = 2.53 log (pieces / ml). In this embodiment, the determination coefficient = 0.694 and the error A viable count estimation accuracy of standard deviation = 1.15 log (cells / ml) was obtained.

図１０および図１１は、チャンネルＡおよびＢにおける電圧値をヒストグラムに用いて本実施例を適用した例を示す図である。このデータセットを用いた場合、従来法では、決定係数＝０．８７２，誤差の標準偏差＝１．１５ｌｏｇ（個／ｍｌ）であるところ、本実施例では、決定係数＝０．７５１，誤差の標準偏差＝１．０５ｌｏｇ（個／ｍｌ）という生菌数推定精度が得られた。 10 and 11 are diagrams showing an example in which the present embodiment is applied using the voltage values in channels A and B as histograms. When this data set is used, in the conventional method, the determination coefficient = 0.722, the standard deviation of error = 1.15 log (pieces / ml), but in this embodiment, the determination coefficient = 0.551, A viable cell count estimation accuracy of standard deviation = 1.05 log (cells / ml) was obtained.

図１２および図１３は、チャンネルＡおよびＣにおける電圧値をヒストグラムに用いて本実施例を適用した例を示す図である。このデータセットを用いた場合、従来法では、決定係数＝０．４０４，誤差の標準偏差＝２．５４ｌｏｇ（個／ｍｌ）であるところ、本実施例では、決定係数＝０．６９６，誤差の標準偏差＝１．１５ｌｏｇ（個／ｍｌ）という生菌数推定精度が得られた。 12 and 13 are diagrams showing an example in which the present embodiment is applied using voltage values in channels A and C as histograms. When this data set is used, in the conventional method, the determination coefficient = 0.404 and the standard deviation of the error = 2.54 log (pieces / ml). In this embodiment, the determination coefficient = 0.696 and the error A viable count estimation accuracy of standard deviation = 1.15 log (cells / ml) was obtained.

図１４および図１５は、全てのチャンネルＡ，Ｂ，Ｃにおける電圧値をヒストグラムに用いて本実施例を適用した例を示す図である。このデータセットを用いた場合、従来法では、決定係数＝０．８７２，誤差の標準偏差＝１．１５ｌｏｇ（個／ｍｌ）であるところ、本実施例では、決定係数＝０．７５２，誤差の標準偏差＝１．０４ｌｏｇ（個／ｍｌ）という生菌数推定精度が得られた。 14 and 15 are diagrams showing an example in which the present embodiment is applied using the voltage values in all the channels A, B, and C as histograms. When this data set is used, in the conventional method, the determination coefficient = 0.722, the standard deviation of error = 1.15 log (pieces / ml), but in this embodiment, the determination coefficient = 0.552, A viable cell count estimation accuracy of standard deviation = 1.04 log (cells / ml) was obtained.

図１６および図１７は、チャンネルＡおよびＢの電圧値のうち、Ｂ＝０（死菌の蛍光が検出されない）場合のデータをヒストグラムに用いて本実施例を適用した例を示す図である。このデータセットを用いた場合、従来法では、決定係数＝０．８７２，誤差の標準偏差＝１．１５ｌｏｇ（個／ｍｌ）であることろ、本実施例では、決定係数＝０．９５８，誤差の標準偏差＝０．４５ｌｏｇ（個／ｍｌ）という生菌数推定精度が得られた。 FIG. 16 and FIG. 17 are diagrams showing an example in which the present embodiment is applied using the data in the case where B = 0 (the fluorescence of dead bacteria is not detected) among the voltage values of channels A and B as histograms. When this data set is used, the determination coefficient is 0.872 and the standard deviation of error is 1.15 log (pieces / ml) in the conventional method. In this embodiment, the determination coefficient is 0.958 and the error is The viable cell count estimation accuracy of standard deviation = 0.45 log (cells / ml) was obtained.

図１８および図１９は、チャンネルＡにおける電圧値を同時に計測されたチャンネルＣの電圧値で除した値（Ａ÷Ｃ）をヒストグラムに用いて本実施例を適用した例を示す図である。このデータセットを用いた場合、従来法では、決定係数＝０．４０４，誤差の標準偏差＝２．５４ｌｏｇ（個／ｍｌ）であるところ、本実施例では、決定係数＝０．８１５，誤差の標準偏差＝０．９１ｌｏｇ（個／ｍｌ）という生菌数推定精度が得られた。 FIG. 18 and FIG. 19 are diagrams showing an example in which the present embodiment is applied using a value (A ÷ C) obtained by dividing the voltage value in channel A by the voltage value of channel C, which is simultaneously measured, in the histogram. When this data set is used, in the conventional method, the determination coefficient = 0.404 and the standard deviation of error = 2.54 log (pieces / ml). In this embodiment, the determination coefficient = 0.815, and the error A viable cell count estimation accuracy of standard deviation = 0.91 log (cells / ml) was obtained.

図２０および図２１は、チャンネルＡにおける電圧値を同時に計測されたチャンネルＢの電圧値で減じた値（Ａ−Ｂ）をヒストグラムに用いて本実施例を適用した例を示す図である。このデータセットを用いた場合、従来法では、決定係数＝０．８７２，誤差の標準偏差＝１．１５ｌｏｇ（個／ｍｌ）であるところ、本実施例では、決定係数＝０．８４２，誤差の標準偏差＝０．８３ｌｏｇ（個／ｍｌ）という生菌数推定精度が得られた。 20 and 21 are diagrams showing an example in which the present embodiment is applied using a value (AB) obtained by subtracting the voltage value in channel A by the voltage value in channel B, which is simultaneously measured, as a histogram. When this data set is used, in the conventional method, the coefficient of determination is 0.872 and the standard deviation of the error is 1.15 log (pieces / ml). In this embodiment, the coefficient of determination is 0.842 and the error is A viable cell count estimation accuracy of standard deviation = 0.83 log (cells / ml) was obtained.

図２２および図２３は、チャンネルＡにおける電圧値に同時に計測されたチャンネルＣの電圧値を加えた値（Ａ＋Ｃ）をヒストグラムに用いて本実施例を適用した例を示す図である。このデータセットを用いた場合、従来法では、決定係数＝０．４０４，誤差の標準偏差＝２．５４ｌｏｇ（個／ｍｌ）であるところ、本実施例では、決定係数＝０．５８７，誤差の標準偏差＝１．３４ｌｏｇ（個／ｍｌ）という生菌数推定精度が得られた。 22 and 23 are diagrams showing an example in which the present embodiment is applied using a value (A + C) obtained by adding the voltage value of channel C, which is simultaneously measured to the voltage value of channel A, for the histogram. When this data set is used, in the conventional method, the coefficient of determination is 0.404 and the standard deviation of the error is 2.54 log (pieces / ml). In this embodiment, the coefficient of determination is 0.587 and the error is A viable count estimation accuracy of standard deviation = 1.34 log (cells / ml) was obtained.

図２４および図２５は、チャンネルＡにおける電圧値に同時に計測されたチャンネルＣの電圧値を乗じた値（Ａ×Ｃ）をヒストグラムに用いて本実施例を適用した例を示す図である。このデータセットを用いた場合、従来法では、決定係数＝０．４０４，誤差の標準偏差＝２．５４ｌｏｇ（個／ｍｌ）であるところ、本実施例では、決定係数＝０．８１２，誤差の標準偏差＝０．９０ｌｏｇ（個／ｍｌ）という生菌数推定精度が得られた。 24 and 25 are diagrams illustrating an example in which the present embodiment is applied using a value (A × C) obtained by multiplying the voltage value of channel A by the voltage value of channel C simultaneously measured in the histogram. When this data set is used, in the conventional method, the determination coefficient = 0.404 and the standard deviation of error = 2.54 log (pieces / ml). In this embodiment, the determination coefficient = 0.812, and the error A viable count estimation accuracy of standard deviation = 0.90 log (cells / ml) was obtained.

図２６および図２７は、チャンネルＡ，Ｂ，Ｃにおける電圧値を様々に組み合わせた値をヒストグラムに用いて本実施例を適用した例を示す図である。なお、図２６下図の対象チャンネルの項目に、チャンネルＡ，Ｂ，Ｃにおける電圧値の組み合わせ計算方法を示す。この合計２１チャンネルのデータセットを用いた場合、従来法では、決定係数＝０．８７２，誤差の標準偏差＝１．１５ｌｏｇ（個／ｍｌ）であるところ、本実施例では、決定係数＝０．９６１，誤差の標準偏差＝０．４３ｌｏｇ（個／ｍｌ）という生菌数推定精度が得られた。 26 and 27 are diagrams showing an example in which the present embodiment is applied using values obtained by variously combining voltage values in channels A, B, and C in a histogram. In addition, in the item of the target channel in the lower diagram of FIG. 26, a combination calculation method of voltage values in channels A, B and C is shown. When this 21-channel data set is used, in the conventional method, the determination coefficient = 0.722, the standard deviation of error = 1.15 log (pieces / ml), but in this embodiment, the determination coefficient = 0. 961, standard deviation of error = 0.43 log (cells / ml) was obtained.

以上で、本実施形態におけるヒストグラム生成処理の実施例の説明を終える。このように、一つまたは複数のセンサで捉えられた蛍光の輝度情報をヒストグラムに用いて本実施の形態を適用することができる。特に、特定のセンサで蛍光が捉えられていない場合のデータのみを対象にして処理を行うことで精度向上を図ることができる。また、異なるセンサ間の蛍光輝度値（電圧値）を加減乗除した値をヒストグラム化することで精度向上を図ることも可能となる。 This is the end of the description of the example of the histogram generation processing in the present embodiment. As described above, the present embodiment can be applied using the luminance information of fluorescence captured by one or a plurality of sensors as a histogram. In particular, accuracy can be improved by performing processing only on data when fluorescence is not captured by a specific sensor. It is also possible to improve accuracy by creating a histogram of values obtained by adding, subtracting, and dividing fluorescence luminance values (voltage values) between different sensors.

［最適化処理］
本実施形態における最適化処理の実施例について、図２８〜図３４を参照して説明する。ここで、図２８は、本実施例における最適化処理の一例を示すフローチャートである。 [Optimization processing]
An example of optimization processing in the present embodiment will be described with reference to FIGS. Here, FIG. 28 is a flowchart illustrating an example of optimization processing in the present embodiment.

図２８に示すように、まず、本実施例では、検出強度データのチャンネル毎に区分間隔を設定する（ステップＳＢ−１）。そして、得られたデータサンプルを、キャリブレーションセットと、バリデーションセットに二分割する（ステップＳＢ−２）。 As shown in FIG. 28, first, in this embodiment, the division interval is set for each channel of the detected intensity data (step SB-1). Then, the obtained data sample is divided into a calibration set and a validation set (step SB-2).

そして、キャリブレーションセットに多変量解析を適用し（ステップＳＢ−３）、生菌数を推定する生体数モデルを生成する（ステップＳＢ−４）。 Then, multivariate analysis is applied to the calibration set (step SB-3), and a biological number model for estimating the viable cell count is generated (step SB-4).

そして、得られた生体数モデルをバリデーションセットに適用することにより、推定値の誤差の標準偏差を算出する（ステップＳＢ−５）。 And the standard deviation of the error of an estimated value is calculated by applying the obtained number model to a validation set (step SB-5).

そして、ステップＳＢ−２に処理を戻し、データサンプルを別の組み合わせで、キャリブレーションセットと、バリデーションセットに再分割し、上述したステップＳＢ−２〜ＳＢ−６の処理を繰り返し、交差検定を行う（ステップＳＢ−６）。 Then, the process returns to step SB-2, and the data sample is subdivided into a calibration set and a validation set in another combination, and the above-described steps SB-2 to SB-6 are repeated to perform cross-validation. (Step SB-6).

そして、交差検定が終了すると（ステップＳＢ−６，Ｙｅｓ）、ステップＳＢ−１に処理を戻し、検出強度データのチャンネル毎の区分間隔を再設定し、上述したステップＳＢ−１以降の処理を繰り返し、最適な生体数モデルとなるための区分間隔を最適化する（ステップＳＢ−７）。ここで、図２９は、検出強度データである電圧データから細かい区分間隔でヒストグラムを生成した場合と、粗い区分間隔でヒストグラムを生成した場合の違いを模式的に示した図である。なお、多変量解析手法として、ＰＬＳ回帰分析を用いた。 When the cross-validation is completed (step SB-6, Yes), the process returns to step SB-1, the division interval for each channel of the detected intensity data is reset, and the processes after step SB-1 are repeated. Then, the interval between segments to be an optimal number model is optimized (step SB-7). Here, FIG. 29 is a diagram schematically showing a difference between a case where a histogram is generated with fine segment intervals from voltage data which is detected intensity data and a case where a histogram is generated with coarse segment intervals. In addition, PLS regression analysis was used as a multivariate analysis method.

図２９に示すように、一般的に、細かい区分間隔のデータを用いて解析を行う場合、性質が同じ情報が違うものとみなされるリスクが高まる一方で、粗い区分間隔のデータを用いて解析を行う場合は、性質が異なる情報が同じとみなされるリスクが高まる。前者の例では、色素Ａに染色された菌のシグナルでも、わずかでも明るさが異なるものと判定されてしまい、後者の例では、色素Ｃに染色された菌とクロロフィル蛍光も一緒のものと判定されてしまう。これら両極端の間で、ヒストグラムの区分（階級）の間隔を最適化する必要がある。 As shown in FIG. 29, in general, when analysis is performed using data of fine segment intervals, there is an increased risk that information having the same property is regarded as different, while analysis is performed using data of coarse segment intervals. Doing so increases the risk that information of different nature will be considered the same. In the former example, it is determined that even the signal of the bacteria stained with dye A is slightly different in brightness, and in the latter example, the bacteria stained with dye C and chlorophyll fluorescence are also determined to be the same. Will be. It is necessary to optimize the interval (class) of the histogram between these extremes.

本実施例では、以上のステップＳＢ−１〜ＳＢ−７の処理により、誤差の標準偏差が最小となる最適な各チャンネルの区分間隔と生体数モデルを決定する（ステップＳＢ−８）。ここで、図３０は、チャンネルＢの電圧が検出されない場合（Ｂ＝０）の、チャンネルＡおよびＣの電圧値をヒストグラムに用いて本実施形態を適用した場合に得られた誤差を示す等値図である。図中の横軸は、チャンネルＡのヒストグラム区分の間隔を示し、縦軸は、チャンネルＣのヒストグラム区分の間隔を示している。 In the present embodiment, through the processes of steps SB-1 to SB-7, the optimum segment interval and biological number model that minimize the standard deviation of the error are determined (step SB-8). Here, FIG. 30 shows an equal value indicating an error obtained when the present embodiment is applied using the voltage values of the channels A and C in the histogram when the voltage of the channel B is not detected (B = 0). FIG. In the figure, the horizontal axis indicates the interval between histogram sections of channel A, and the vertical axis indicates the interval between histogram sections of channel C.

図３０に示すように、チャンネルＡおよびＣのヒストグラム区分の間隔の組み合せにより、得られる生体数モデルの誤差の標準偏差は大きく変動する。本実施例の最適化処理によって、誤差が最も小さくなるような区分間隔の組み合わせが見出される。ここで、図３１は、生菌数推定誤差が最小の場合のヒストグラムと、生菌数推定誤差が最大の場合のヒストグラムとを比較した図である。また、図３２は、生菌数推定誤差が最小の場合の生体数モデル、および、生菌数推定誤差が最大の場合の生体数モデルによる推定値と実測値との誤差を示すグラフ図である。グラフの横軸は、培養法による生菌数の実測値を示しており、縦軸は各生体数モデルによる推定値を示している。 As shown in FIG. 30, the standard deviation of the error of the obtained number model varies greatly depending on the combination of the intervals of the histogram sections of channels A and C. By the optimization processing of this embodiment, a combination of segment intervals that minimizes the error is found. Here, FIG. 31 is a diagram comparing a histogram when the viable cell count estimation error is minimum and a histogram when the viable cell count estimation error is maximum. FIG. 32 is a graph showing the difference between an estimated value and an actual measurement value of the living body number model when the viable cell count estimation error is minimum, and the living body number model when the viable cell count estimation error is maximum. . The horizontal axis of the graph shows the actual value of the viable cell count by the culture method, and the vertical axis shows the estimated value by each biological number model.

図３１および図３２に示すように、最適なヒストグラム区分間隔の組み合せで生体数モデルを生成した場合、実測値と推定値のプロットが傾き１の直線上に近くなり、誤差が少なくなることが確認された。ここで、図３３は、ヒストグラムの各サンプリング間隔において、モデル作成時と検証時における生菌数推定誤差を示すグラフ図である。なお、モデル検証時の誤差は、検証１００回における平均値である。 As shown in FIG. 31 and FIG. 32, it is confirmed that the plot of the measured value and the estimated value is close to the straight line with the slope 1 and the error is reduced when the living body number model is generated with the optimum combination of the histogram division intervals. It was done. Here, FIG. 33 is a graph showing the viable cell count estimation error at the time of model creation and verification at each sampling interval of the histogram. The error at the time of model verification is an average value in 100 verifications.

図３３に示すように、本実施例において、クロスバリデーション法によりモデル検証を１００回繰り返し行うことにより、正確な生体数モデルの評価を行うことができる。図３３において、モデル検証時の誤差が最も小さい場合（間隔１．１５付近）に理想的な生体数モデルが得られると期待できる。ここで、図３４は、図３３において、誤差が最も大きい場合の生体数モデルと、誤差が最も小さい場合の数理モデルのそれぞれにおいて、従来法と比較したグラフ図である。 As shown in FIG. 33, in this embodiment, accurate model evaluation can be performed by repeatedly performing model verification 100 times by the cross-validation method. In FIG. 33, it can be expected that an ideal number model is obtained when the error at the time of model verification is the smallest (near the interval of 1.15). Here, FIG. 34 is a graph compared with the conventional method in each of the biological number model when the error is the largest and the mathematical model when the error is the smallest in FIG.

図３４の左図に示すように、誤差が最も大きい場合の生体数モデルでは、生菌数が少ない場合に、擬陽性が多くなる。すなわち、ヒストグラム間隔が細かすぎても、擬陽性が多くなってしまう。図３４の右図に示すように、適度なヒストグラム間隔で生成された生体数モデルでは、生菌数が少ない場合でも擬陽性が少なく、傾き１の理想的な直線に近い場所にプロットされることが確認された。 As shown in the left diagram of FIG. 34, in the living body number model when the error is the largest, the number of false positives increases when the number of living bacteria is small. That is, even if the histogram interval is too small, false positives increase. As shown in the right diagram of FIG. 34, the biological number model generated at an appropriate histogram interval has few false positives even when the number of viable bacteria is small, and is plotted at a location close to an ideal straight line with a slope of 1. confirmed.

以上で、本実施例における最適化処理の一例の説明を終える。 Above, description of an example of the optimization process in a present Example is finished.

［前処理］
本実施形態における前処理の実施例について、図３５〜図４２を参照して説明する。 [Preprocessing]
An example of pre-processing in the present embodiment will be described with reference to FIGS.

本実施例において、スペクトルの前処理とは、近赤外分光法などで用いられるスペクトル形状の変換手法であって、例えば、ヒストグラムをスペクトルとみなして、中心化、尺度化、平滑化、微分、標準化などの処理を行うことをいう。この前処理には、スペクトルに含まれるノイズを低減したり、スペクトルに含まれる情報を強調したり、異なるサンプル間のスペクトルの尺度を合わせるといった効果がある。本実施例では、複数チャンネルの電圧値をヒストグラム化し、それを結合して繋ぎ合わせたものをスペクトルとみなし、このスペクトルに前処理を適用する。なお、以下の実施例では、チャンネルＢの電圧が検出されない場合（Ｂ＝０の時）の、チャンネルＡおよびＣの電圧値をヒストグラムに用いて、ＰＬＳ回帰にて生菌数を推定した。 In the present embodiment, the spectrum preprocessing is a spectral shape conversion technique used in near infrared spectroscopy, for example, considering a histogram as a spectrum, centering, scaling, smoothing, differentiation, Refers to standardization. This preprocessing has the effect of reducing noise contained in the spectrum, enhancing information contained in the spectrum, and adjusting the spectrum scale between different samples. In the present embodiment, the voltage values of a plurality of channels are converted into a histogram, and a combination of them is regarded as a spectrum, and preprocessing is applied to this spectrum. In the following examples, the number of viable bacteria was estimated by PLS regression using the voltage values of channels A and C when the voltage of channel B was not detected (when B = 0) in the histogram.

図３５は、前処理を行わなかった場合のヒストグラム（左図）と生成した生体数モデルによる推定値の誤差を示すグラフ図（右図）である。図３５に示すように、前処理を行わない場合、決定係数＝０．９５７，誤差の標準偏差＝０．３５ｌｏｇ（個／ｍｌ）という生菌数推定精度が得られた。 FIG. 35 is a graph (right diagram) showing an error between a histogram (left diagram) when no preprocessing is performed and an estimated value based on the generated biological number model. As shown in FIG. 35, when pre-processing was not performed, the viable cell count estimation accuracy of a determination coefficient = 0.957 and an error standard deviation = 0.35 log (cells / ml) was obtained.

図３６は、中心化（ｍｅａｎｃｅｎｔｅｒｉｎｇ）の前処理を行った場合のヒストグラム（左図）と生成した生体数モデルによる推定値の誤差を示すグラフ図（右図）である。図３６に示すように、中心化の前処理を行った結果、決定係数＝０．９７５，誤差の標準偏差＝０．２６ｌｏｇ（個／ｍｌ）という生菌数推定精度が得られた。 FIG. 36 is a graph (right diagram) showing a histogram (left diagram) in the case where the preprocessing of centering is performed (left diagram) and an error of an estimated value based on the generated number model. As shown in FIG. 36, as a result of the preprocessing of the centering, the viable cell count estimation accuracy of the determination coefficient = 0.975 and the standard deviation of error = 0.26 log (cells / ml) was obtained.

図３７は、尺度化（ｓｃａｌｉｎｇ）の前処理を行った場合のヒストグラム（左図）と生成した生体数モデルによる推定値の誤差を示すグラフ図（右図）である。図３７に示すように、尺度化の前処理を行った結果、決定係数＝０．９７５，誤差の標準偏差＝０．２８ｌｏｇ（個／ｍｌ）という生菌数推定精度が得られた。 FIG. 37 is a graph (right diagram) showing a histogram (left diagram) in the case of performing preprocessing for scaling (left diagram) and an error of an estimated value based on the generated number model. As shown in FIG. 37, as a result of the pre-processing of the scaling, the viable cell count estimation accuracy of determination coefficient = 0.975 and standard deviation of error = 0.28 log (cells / ml) was obtained.

図３８は、平滑化（ｓｍｏｏｔｈｉｎｇ）の前処理を行った場合のヒストグラム（左図）と生成した生体数モデルによる推定値の誤差を示すグラフ図（右図）である。図３８に示すように、平滑化の前処理を行った結果、決定係数＝０．９７８，誤差の標準偏差＝０．２６ｌｏｇ（個／ｍｌ）という生菌数推定精度が得られた。 FIG. 38 is a graph (right diagram) showing a histogram (left diagram) in the case of performing smoothing preprocessing and an error of an estimated value based on the generated number model. As shown in FIG. 38, as a result of pre-processing for smoothing, the viable cell count estimation accuracy of determination coefficient = 0.978 and standard deviation of error = 0.26 log (cells / ml) was obtained.

図３９は、一次微分（ｆｉｒｓｔｄｅｒｉｖａｔｉｖｅ）の前処理を行った場合のヒストグラム（左図）と生成した生体数モデルによる推定値の誤差を示すグラフ図（右図）である。図３９に示すように、一次微分の前処理を行った結果、決定係数＝０．９７５，誤差の標準偏差＝０．２８ｌｏｇ（個／ｍｌ）という生菌数推定精度が得られた。 FIG. 39 is a graph (right diagram) showing an error in an estimated value based on a histogram (left diagram) and a generated biological number model when pre-processing of first derivative (first derivative) is performed. As shown in FIG. 39, as a result of pre-processing of the first derivative, a viable cell count estimation accuracy of a determination coefficient = 0.975 and an error standard deviation = 0.28 log (cells / ml) was obtained.

図４０は、二次微分（ｓｅｃｏｎｄｄｅｒｉｖａｔｉｖｅ）の前処理を行った場合のヒストグラム（左図）と生成した生体数モデルによる推定値の誤差を示すグラフ図（右図）である。図４０に示すように、二次微分の前処理を行った結果、決定係数＝０．９７６，誤差の標準偏差＝０．２８ｌｏｇ（個／ｍｌ）という生菌数推定精度が得られた。 FIG. 40 is a graph (right diagram) showing an error of an estimated value based on a histogram (left diagram) and a generated biological number model when preprocessing of second derivative (second derivative) is performed. As shown in FIG. 40, as a result of pre-processing of the second derivative, a viable cell count estimation accuracy of a determination coefficient = 0.976 and an error standard deviation = 0.28 log (cells / ml) was obtained.

図４１は、標準化（Ｎｏｒｍａｌｉｚａｔｉｏｎ）の前処理を行った場合のヒストグラム（左図）と生成した生体数モデルによる推定値の誤差を示すグラフ図（右図）である。図４１に示すように、標準化の前処理を行った結果、決定係数＝０．９７７，誤差の標準偏差＝０．２７ｌｏｇ（個／ｍｌ）という生菌数推定精度が得られた。 FIG. 41 is a graph (right diagram) showing an error between a histogram (left diagram) and an estimated value based on the generated number model when normalization is performed. As shown in FIG. 41, as a result of the pre-processing for standardization, the viable cell count estimation accuracy of a determination coefficient = 0.777 and an error standard deviation = 0.27 log (cells / ml) was obtained.

図４２は、標準化と二次微分と中心化と尺度化の前処理を組み合わせて行った場合のヒストグラム（左図）と生成した生体数モデルによる推定値の誤差を示すグラフ図（右図）である。図４２に示すように、標準化と二次微分と中心化と尺度化の前処理を組み合わせて行った結果、決定係数＝０．９８３，誤差の標準偏差＝０．２３ｌｏｇ（個／ｍｌ）という生菌数推定精度が得られた。 FIG. 42 is a graph (left diagram) and a graph (right diagram) showing an error in an estimated value by a generated biological number model when the standardization, second-order differentiation, centering, and scaling preprocessing are performed in combination. is there. As shown in FIG. 42, as a result of combining standardization, second-order differentiation, centering, and scaling preprocessing, a determination coefficient = 0.983 and a standard deviation of error = 0.23 log (pieces / ml) are obtained. Bacteria count estimation accuracy was obtained.

以上で、本実施形態における前処理の実施例の説明を終える。 Above, description of the example of the pre-processing in this embodiment is finished.

［多変量解析処理］
本実施形態における多変量解析処理の実施例について、図４３〜図４８を参照して説明する。 [Multivariate analysis processing]
An example of multivariate analysis processing in the present embodiment will be described with reference to FIGS.

本実施例においては、お茶飲料にて大腸菌を培養した溶液を添加したサンプルをデータに用いた。大腸菌溶液の濃度を、１／１０^３〜１／１０^８に希釈することで、生菌数の希釈系列を調製した。そして、培養法により実測の生菌数をカウントするとともに、フローサイトメーターによって全６８サンプルを計測した。なお、本実施例では、チャンネルＢの電圧が検出されない場合（Ｂ＝０の時）の、チャンネルＡおよびＣの電圧値をヒストグラムに用いた。 In the present Example, the sample which added the solution which culture | cultivated E. coli with the tea drink was used for data. A dilution series of the number of viable bacteria was prepared by diluting the concentration of the E. coli solution to 1/10 ³ to 1/10 ⁸ . Then, the actual number of viable bacteria was counted by a culture method, and all 68 samples were measured by a flow cytometer. In this embodiment, the voltage values of channels A and C when the voltage of channel B is not detected (when B = 0) are used in the histogram.

ここで、本実施形態においては、様々な多変量解析手法を用いることができる（多変量解析手法について特開２０１０−１８５７１９参照）。一例として、教師なしのパターン認識手法としては、主成分分析や、クラスター分析、自己組織化マップなどのパターン分類方法が挙げられる。なお、この教師なしのパターン認識手法は、説明変数（今回の場合はヒストグラムをスペクトル化したデータ）の持つ特徴を可視化する手法であり、目的変数（生菌数など）は用いない。ここで、図４３は、主成分分析により多変量解析を行った実施例を示す図である。 Here, in this embodiment, various multivariate analysis methods can be used (refer to JP 2010-185719 regarding the multivariate analysis method). As an example, unsupervised pattern recognition methods include pattern classification methods such as principal component analysis, cluster analysis, and self-organizing maps. This unsupervised pattern recognition method is a method for visualizing the characteristics of explanatory variables (in this case, data obtained by spectralizing a histogram), and does not use objective variables (viable cell counts, etc.). Here, FIG. 43 is a diagram illustrating an example in which multivariate analysis is performed by principal component analysis.

図４３に示すように、大腸菌添加濃度毎にクラスターが形成されており、判別分析や定量分析による生菌添加濃度の判別や生菌数の定量が可能となることが確認された。また、図４４は、クラスター分析により多変量解析を行った実施例を示す図である。 As shown in FIG. 43, clusters were formed for each E. coli addition concentration, and it was confirmed that discrimination of viable cell addition concentration and quantification of viable cell count by discriminant analysis and quantitative analysis became possible. FIG. 44 is a diagram showing an example in which multivariate analysis was performed by cluster analysis.

図４４に示すように、大腸菌添加濃度が１／１０^７以下のグループ（低濃度）とそれ以外（高濃度）が明確に異なるクラスターを形成しており、両者を判別できることが確認された。 As shown in FIG. 44, it was confirmed that a group (low concentration) having an E. coli addition concentration of 1/10 ⁷ or less and a cluster other than that (high concentration) formed distinctly different, and both could be discriminated.

本実施形態に用いることができる多変量解析手法の他の例では、教師ありの判別手法として、線形判別分析や、ＰａｒｔｉａｌＬｅａｓｔＳｑｕａｒｅｓ（ＰＬＳ）判別分析、ＳｏｆｔＩｎｄｅｐｅｎｄｅｎｔＭｏｄｅｌｉｎｇｏｆＣｌａｓｓＡｎａｌｏｇｙ（ＳＩＭＣＡ），ＳｕｐｐｏｒｔＶｅｃｔｏｒＭａｃｈｉｎｅ（ＳＶＭ）判別分析などを適用することができる。なお、この教師ありの判別手法は、説明変数から定性的な目的変数（生菌の有無）を判別するモデルを構築する手法である。ＳＶＭ判別分析により得られた生体数モデルによる判別結果のクロス集計では、実測値で低濃度の場合（３０サンプル）に、１００％の精度で低濃度と推定でき、実測値で高濃度の場合（３８サンプル）に、１００％の精度で高濃度と推定することができた。ここで、図４５は、ＰＬＳ判別分析により多変量解析を行った実施例を示す図である。 In another example of the multivariate analysis method that can be used in the present embodiment, as a discriminating method with supervision, linear discriminant analysis, Partial Last Squares (PLS) discriminant analysis, Soft Independent Modeling of Class Analysis (SIMCA), Support Vector machine (SVM) discriminant analysis and the like can be applied. This supervised discrimination method is a method for constructing a model for discriminating a qualitative objective variable (presence / absence of viable bacteria) from explanatory variables. In the cross tabulation of the discrimination results based on the number model obtained by the SVM discriminant analysis, it is possible to estimate the low concentration with 100% accuracy when the measured value is low (30 samples), and when the measured value is high ( 38 samples), it was possible to estimate a high concentration with 100% accuracy. Here, FIG. 45 is a diagram showing an example in which multivariate analysis is performed by PLS discriminant analysis.

ＰＬＳ判別分析により得られた生体数モデルによる判別結果のクロス集計では、実測値で低濃度の場合（３０サンプル）に、１００％の精度で低濃度と推定でき、実測値で高濃度の場合（３８サンプル）に、１００％の精度で高濃度と推定することができた。また、図４６は、ＳＩＭＣＡにより多変量解析を行った実施例を示す図である。 In the cross tabulation of the discrimination results based on the number model obtained by the PLS discriminant analysis, it is possible to estimate the low concentration with 100% accuracy when the measured value is low (30 samples), and when the measured value is high ( 38 samples), it was possible to estimate a high concentration with 100% accuracy. Moreover, FIG. 46 is a figure which shows the Example which performed the multivariate analysis by SIMCA.

ＳＩＭＣＡにより得られた生体数モデルによる判別結果のクロス集計では、実測値で低濃度の場合（３０サンプル）に、１００％の精度で低濃度と推定でき、実測値で高濃度の場合（３８サンプル）に、１００％の精度で高濃度と推定することができた。 In the cross tabulation of the discrimination result by the number model obtained by SIMCA, it is possible to estimate the low concentration with 100% accuracy when the measured value is low (30 samples), and when the measured value is high (38 samples) ) To a high concentration with 100% accuracy.

本実施形態に用いることができる多変量解析手法の他の例では、教師ありの回帰分析手法として、重回帰分析や、ＰＬＳ回帰分析、ＳＶＭ回帰分析、ＬｏｃａｌｌｙＷｅｉｇｈｔｅｄＲｅｇｒｅｓｓｉｏｎ（局所重みつき回帰）などを適用することができる。なお、この教師ありの回帰分析手法は、説明変数から定量的な目的変数（生菌数）を定量推定するモデルを構築する手法である。ここで、図４７は、ＳＶＭ回帰分析により多変量解析を行った実施例を示す図である。 In another example of the multivariate analysis method that can be used in the present embodiment, as a supervised regression analysis method, multiple regression analysis, PLS regression analysis, SVM regression analysis, locally weighted regression (local weighted regression), or the like is used. Can be applied. This supervised regression analysis method is a method for constructing a model for quantitatively estimating a quantitative objective variable (viable cell count) from explanatory variables. Here, FIG. 47 is a diagram showing an example in which multivariate analysis was performed by SVM regression analysis.

図４７に示すように、ＳＶＭ回帰分析により、低濃度においても高濃度においても実測値との誤差の少ない生体数モデルが得られた（決定係数＝０．９８０，誤差の標準偏差＝０．１９ｌｏｇ（個／ｍｌ））。また、図４８は、ＬＷＲ（局所重みつき回帰）により多変量解析を行った実施例を示す図である。 As shown in FIG. 47, the SVM regression analysis yielded a biological number model with little error from the actual measurement value at both low and high concentrations (decision coefficient = 0.980, standard deviation of error = 0.19 log). (Pieces / ml)). FIG. 48 is a diagram illustrating an example in which multivariate analysis is performed by LWR (local weighted regression).

図４８に示すように、ＬＷＲ（局所重みつき回帰）により、低濃度においても高濃度においても実測値との誤差の少ない生体数モデルが得られた決定係数＝０．９７５，誤差の標準偏差＝０．０４ｌｏｇ（個／ｍｌ）。 As shown in FIG. 48, LWR (local weighted regression) was used to obtain a biological number model with a small error from an actual measurement value at both low and high concentrations. Determination coefficient = 0.975, standard deviation of error = 0.04 log (pieces / ml).

以上のように、主成分分析や、クラスター分析などのパターン認識手法により多変量解析を行うことで、データが持つ生菌数あるいは生菌数の多少に関する情報を可視化することができることが確認された。また、線形判別分析や、ＰＬＳ判別分析、ＳＩＭＣＡ，ＳＶＭ判別分析などの判別分析により多変量解析を行うことで、生菌数の多少を判定できることが確認された。また、重回帰分析や、ＰＬＳ回帰分析、ＳＶＭ回帰分析、ＬＷＲ（局所重みつき回帰）などの回帰分析により多変量解析を行うことで、生菌数を推定することができることが確認された。 As described above, it was confirmed that information on the number of viable cells or the number of viable cells in data can be visualized by performing multivariate analysis using pattern recognition methods such as principal component analysis and cluster analysis. . In addition, it was confirmed that the number of viable bacteria can be determined by performing multivariate analysis by discriminant analysis such as linear discriminant analysis, PLS discriminant analysis, SIMCA, SVM discriminant analysis and the like. It was also confirmed that the viable cell count could be estimated by performing multivariate analysis by regression analysis such as multiple regression analysis, PLS regression analysis, SVM regression analysis, LWR (local weighted regression).

以上で、本実施形態における実施例の説明を終える。 Above, description of the Example in this embodiment is finished.

［他の実施の形態］
さて、これまで本発明の実施の形態について説明したが、本発明は、上述した実施の形態以外にも、特許請求の範囲に記載した技術的思想の範囲内において種々の異なる実施の形態にて実施されてよいものである。 [Other embodiments]
Although the embodiments of the present invention have been described so far, the present invention is not limited to the above-described embodiments, but can be applied to various different embodiments within the scope of the technical idea described in the claims. It may be implemented.

例えば、生体数モデル生成装置１００がスタンドアローンの形態で処理を行う場合を一例に説明したが、生体数モデル生成装置１００は、クライアント端末（生体数モデル生成装置１００とは別筐体である）からの要求に応じて処理を行い、その処理結果を当該クライアント端末に返却するようにしてもよい。 For example, the case where the biological number model generation apparatus 100 performs processing in a stand-alone form has been described as an example, but the biological number model generation apparatus 100 is a client terminal (a separate housing from the biological number model generation apparatus 100). Processing may be performed in response to a request from the client, and the processing result may be returned to the client terminal.

また、実施の形態において説明した各処理のうち、自動的に行われるものとして説明した処理の全部または一部を手動的に行うこともでき、あるいは、手動的に行われるものとして説明した処理の全部または一部を公知の方法で自動的に行うこともできる。 In addition, among the processes described in the embodiment, all or part of the processes described as being automatically performed can be performed manually, or the processes described as being performed manually can be performed. All or a part can be automatically performed by a known method.

このほか、上記文献中や図面中で示した処理手順、制御手順、具体的名称、各処理の登録データや検索条件等のパラメータを含む情報、画面例、データベース構成については、特記する場合を除いて任意に変更することができる。 In addition, unless otherwise specified, the processing procedures, control procedures, specific names, information including registration data for each processing, parameters such as search conditions, screen examples, and database configurations shown in the above documents and drawings Can be changed arbitrarily.

また、生体数モデル生成装置１００に関して、図示の各構成要素は機能概念的なものであり、必ずしも物理的に図示の如く構成されていることを要しない。 In addition, regarding the biological number model generation apparatus 100, each illustrated component is functionally conceptual and does not necessarily need to be physically configured as illustrated.

例えば、生体数モデル生成装置１００の各装置が備える処理機能、特に制御部１０２にて行われる各処理機能については、その全部または任意の一部を、ＣＰＵ（ＣｅｎｔｒａｌＰｒｏｃｅｓｓｉｎｇＵｎｉｔ）および当該ＣＰＵにて解釈実行されるプログラムにて実現してもよく、また、ワイヤードロジックによるハードウェアとして実現してもよい。尚、プログラムは、後述する、コンピュータに本発明に係る方法を実行させるためのプログラム化された命令を含む、一時的でないコンピュータ読み取り可能な記録媒体に記録されており、必要に応じて１００に機械的に読み取られる。すなわち、ＲＯＭまたはＨＤＤ（ＨａｒｄＤｉｓｋＤｒｉｖｅ）などの記憶部１０６などには、ＯＳ（ＯｐｅｒａｔｉｎｇＳｙｓｔｅｍ）と協働してＣＰＵに命令を与え、各種処理を行うためのコンピュータプログラムが記録されている。このコンピュータプログラムは、ＲＡＭにロードされることによって実行され、ＣＰＵと協働して制御部を構成する。 For example, all or some of the processing functions provided in each device of the biometric model generation device 100, particularly the processing functions performed by the control unit 102, are performed by a CPU (Central Processing Unit) and the CPU. It may be realized by a program to be interpreted and executed as hardware by wired logic. The program is recorded on a non-transitory computer-readable recording medium including a programmed instruction for causing a computer to execute the method according to the present invention, which will be described later. Can be read. That is, in the storage unit 106 such as a ROM or an HDD (Hard Disk Drive), a computer program for giving instructions to the CPU and performing various processes in cooperation with the OS (Operating System) is recorded. This computer program is executed by being loaded into the RAM, and constitutes a control unit in cooperation with the CPU.

また、このコンピュータプログラムは、生体数モデル生成装置１００に対して任意のネットワーク３００を介して接続されたアプリケーションプログラムサーバに記憶されていてもよく、必要に応じてその全部または一部をダウンロードすることも可能である。 The computer program may be stored in an application program server connected to the biological model generation apparatus 100 via an arbitrary network 300, and the computer program may be downloaded in whole or in part as necessary. Is also possible.

また、本発明に係るプログラムを、コンピュータ読み取り可能な記録媒体に格納してもよく、また、プログラム製品として構成することもできる。ここで、この「記録媒体」とは、メモリーカード、ＵＳＢメモリ、ＳＤカード、フレキシブルディスク、光磁気ディスク、ＲＯＭ、ＥＰＲＯＭ、ＥＥＰＲＯＭ、ＣＤ−ＲＯＭ、ＭＯ、ＤＶＤ、および、Ｂｌｕ−ｒａｙＤｉｓｃ等の任意の「可搬用の物理媒体」を含むものとする。 In addition, the program according to the present invention may be stored in a computer-readable recording medium, and may be configured as a program product. Here, the “recording medium” is any memory card, USB memory, SD card, flexible disk, magneto-optical disk, ROM, EPROM, EEPROM, CD-ROM, MO, DVD, Blu-ray Disc, etc. Of “portable physical media”.

また、「プログラム」とは、任意の言語や記述方法にて記述されたデータ処理方法であり、ソースコードやバイナリコード等の形式を問わない。なお、「プログラム」は必ずしも単一的に構成されるものに限られず、複数のモジュールやライブラリとして分散構成されるものや、ＯＳ（ＯｐｅｒａｔｉｎｇＳｙｓｔｅｍ）に代表される別個のプログラムと協働してその機能を達成するものをも含む。なお、実施の形態に示した各装置において記録媒体を読み取るための具体的な構成、読み取り手順、あるいは、読み取り後のインストール手順等については、周知の構成や手順を用いることができる。 The “program” is a data processing method described in an arbitrary language or description method, and may be in any format such as source code or binary code. The “program” is not necessarily limited to a single configuration, but is distributed in the form of a plurality of modules and libraries, or in cooperation with a separate program represented by an OS (Operating System). Including those that achieve the function. Note that a well-known configuration and procedure can be used for a specific configuration for reading a recording medium, a reading procedure, an installation procedure after reading, and the like in each device described in the embodiment.

記憶部１０６に格納される各種のデータベース等（検出強度ファイル１０６ａ〜生体数モデルファイル１０６ｃ）は、ＲＡＭ、ＲＯＭ等のメモリ装置、ハードディスク等の固定ディスク装置、フレキシブルディスク、および、光ディスク等のストレージ手段であり、各種処理やウェブサイト提供に用いる各種のプログラム、テーブル、データベース、および、ウェブページ用ファイル等を格納する。 Various databases and the like (detection intensity file 106a to biological number model file 106c) stored in the storage unit 106 include memory devices such as RAM and ROM, fixed disk devices such as hard disks, flexible disks, and storage means such as optical disks. It stores various programs, tables, databases, web page files, etc. used for various processes and website provision.

また、生体数モデル生成装置１００は、既知のパーソナルコンピュータ、ワークステーション等の情報処理装置として構成してもよく、また、該情報処理装置に任意の周辺装置を接続して構成してもよい。また、生体数モデル生成装置１００は、該情報処理装置に本発明の方法を実現させるソフトウェア（プログラム、データ等を含む）を実装することにより実現してもよい。 In addition, the biological model generation apparatus 100 may be configured as an information processing apparatus such as a known personal computer or workstation, or may be configured by connecting an arbitrary peripheral device to the information processing apparatus. In addition, the biological number model generation apparatus 100 may be realized by installing software (including programs, data, and the like) that realizes the method of the present invention in the information processing apparatus.

更に、装置の分散・統合の具体的形態は図示するものに限られず、その全部または一部を、各種の付加等に応じて、または、機能負荷に応じて、任意の単位で機能的または物理的に分散・統合して構成することができる。すなわち、上述した実施形態を任意に組み合わせて実施してもよく、実施形態を選択的に実施してもよい。 Furthermore, the specific form of distribution / integration of the devices is not limited to that shown in the figure, and all or a part of them may be functional or physical in arbitrary units according to various additions or according to functional loads. Can be distributed and integrated. That is, the above-described embodiments may be arbitrarily combined and may be selectively implemented.

以上詳述に説明したように、本実施形態によれば、正確かつ迅速に生体数を計測することができる、生体数モデル生成装置、生体数モデル生成方法、サイトメーター、および、プログラム、並びに、記録媒体を提供することができる。したがって、茶飲料製造工場など食品製造現場での品質管理や、ミネラルウオーター、動物性タンパクを含む食品等の検査に極めて有用である。特に、本発明は、市販されるフローサイトメトリー装置に実装可能であり、データは市販機（カセットラボＯＮＥ、日立エンジニアリング・アンド・サービス）にて取得可能である。そのほか、フローサイトメトリー装置やイメージングサイトメトリー装置等を用いた品質管理や医療や製薬や創薬や生物学研究や臨床検査などの様々な分野において極めて有用である。 As described above in detail, according to the present embodiment, a biological number model generation device, a biological number model generation method, a cytometer, a program, and a program that can accurately and quickly measure the number of living bodies, A recording medium can be provided. Therefore, it is extremely useful for quality control at food production sites such as tea beverage manufacturing factories, and inspection of foods containing mineral water and animal protein. In particular, the present invention can be implemented in a commercially available flow cytometry apparatus, and data can be acquired by a commercially available machine (Cassette Lab ONE, Hitachi Engineering & Service). In addition, it is extremely useful in various fields such as quality control using a flow cytometry apparatus, an imaging cytometry apparatus, and the like, medical treatment, pharmaceuticals, drug discovery, biological research, and clinical examinations.

１００生体数モデル生成装置
１０２制御部
１０２ａデータ取得部
１０２ｂヒストグラム生成部
１０２ｃ多変量解析部
１０２ｄ最適化部
１０２ｆ生体数判定部
１０４通信制御インターフェース部
１０６記憶部
１０６ａ検出強度ファイル
１０６ｂヒストグラムファイル
１０６ｃ生体数モデルファイル
１０８入出力制御インターフェース部
１１０測定部
１１２入力部
１１４出力部
２００外部機器
３００ネットワーク DESCRIPTION OF SYMBOLS 100 Biological number model production | generation apparatus 102 Control part 102a Data acquisition part 102b Histogram generation part 102c Multivariate analysis part 102d Optimization part 102f Biological number determination part 104 Communication control interface part 106 Storage part 106a Detection intensity file 106b Histogram file 106c Biological number model File 108 Input / output control interface unit 110 Measuring unit 112 Input unit 114 Output unit 200 External device 300 Network

Claims

In a biological number model generation device including at least a storage unit and a control unit,
The storage unit
A detection intensity storage means for storing detection intensity data obtained by cytometry using a fluorescent label in which the degree of interaction with the living body and the emission intensity corresponding to the degree of the living body depending on whether the living body is alive or dead,
With
The control unit
Histogram generation means for generating a histogram indicating the frequency of each detection intensity based on the detection intensity data;
A multivariate analysis is performed using frequency values in each section of the histogram as explanatory variables, and a biological number model that is a mathematical expression for determining the presence or absence of the living body or the number of living bodies is obtained using the coefficients for the obtained explanatory variables. Variable analysis means;
A living body number model generation device characterized by comprising:

In the living body number model generating device according to claim 1,
The detected intensity storage means is
Storing the detection intensity data obtained by cytometry using a plurality of fluorescent labels, indicating the detection intensity of the fluorescence emitted by each of the fluorescent labels,
The histogram generation means includes
Based on the detected intensity data, a histogram showing the frequency for each category is generated in the plurality of fluorescent labels,
The multivariate analysis means is:
The frequency for each category of the fluorescent label is the explanatory variable,
A biological number model generation device characterized by the above.

In the living body number model generation device according to claim 1 or 2,
The detected intensity storage means is
Storing the detection intensity data obtained by cytometry using a plurality of fluorescent labels, indicating the detection intensity of the fluorescence emitted by each of the fluorescent labels,
The histogram generation means includes
Based on the detected intensity data, a histogram is generated that indicates the frequency of each of the other fluorescent labels for each of the sections when the detected intensity of one of the fluorescent labels is within a predetermined range. about,
A biological number model generation device characterized by the above.

In the living body number model generating device according to any one of claims 1 to 3,
The detected intensity storage means is
Storing the detection intensity data obtained by cytometry using a plurality of fluorescent labels, indicating the detection intensity of the fluorescence emitted by each of the fluorescent labels,
The histogram generation means includes
Adding, subtracting, multiplying / dividing between the detected intensity data of the fluorescent label simultaneously measured, and generating a histogram showing the frequency for each category of the added / subtracted / divided values;
A biological number model generation device characterized by the above.

In the living body number model generation device according to any one of claims 1 to 4,
The control unit
By controlling the interval between the sections of the histogram repeatedly generated by the histogram generation means while verifying an error from the actual measurement value for the biological number model obtained by the multivariate analysis means, Model optimization means for optimizing the biological number model,
A living body number model generating apparatus, further comprising:

In the living body number model generating device according to any one of claims 1 to 5,
The histogram generation means includes
Converting the detected intensity into a logarithm based on the detected intensity data, and generating a histogram showing the frequency for each segment of the converted logarithm;
A biological number model generation device characterized by the above.

In the living body number model generation device according to any one of claims 1 to 6,
The histogram generation means includes
Centering, scaling, smoothing, differentiating and / or standardizing the generated histogram as a spectrum,
A biological number model generation device characterized by the above.

In the living body number model generation device according to any one of claims 1 to 7,
The multivariate analysis means is:
Performing the above multivariate analysis using unsupervised pattern recognition techniques, supervised discriminant analysis techniques, or supervised regression analysis techniques,
A biological number model generation device characterized by the above.

A biological number model generation method executed in a computer including at least a storage unit and a control unit,
The storage unit
A detection intensity storage means for storing detection intensity data obtained by cytometry using a fluorescent label in which the degree of interaction with the living body and the emission intensity corresponding to the degree of the living body depending on whether the living body is alive or dead,
With
Executed in the control unit,
Based on the detection intensity data, a histogram generation step for generating a histogram indicating the frequency of each detection intensity category;
A multivariate analysis is performed using frequency values in each section of the histogram as explanatory variables, and a biological number model that is a mathematical expression for determining the presence or absence of the living body or the number of living bodies is obtained using the coefficients for the obtained explanatory variables. A variable analysis step;
A method for generating a biological number model, comprising:

A cytometer that determines the number of living organisms or the presence or absence of the living organism from the detected intensity data, using the biological number model generated by the living organism number model generating method according to claim 9.

A program for causing a computer including at least a storage unit and a control unit to execute the program,
The storage unit
A detection intensity storage means for storing detection intensity data obtained by cytometry using a fluorescent label in which the degree of interaction with the living body and the emission intensity corresponding to the degree of the living body depending on whether the living body is alive or dead,
With
In the control unit,
Based on the detection intensity data, a histogram generation step for generating a histogram indicating the frequency of each detection intensity category;
A multivariate analysis is performed using frequency values in each section of the histogram as explanatory variables, and a biological number model that is a mathematical expression for determining the presence or absence of the living body or the number of living bodies is obtained using the coefficients for the obtained explanatory variables. A variable analysis step;
A program for running