JP7496470B2

JP7496470B2 - Techniques for analyzing and detecting execution artifacts in microwell plates - Patents.com

Info

Publication number: JP7496470B2
Application number: JP2023504097A
Authority: JP
Inventors: マークフェーダーフォゲルソン，ベンジャミン; マクリーン，ピーター; ハク，イムラン; サンダース，マリッサ; フィッシュ，エリック; べイカー，チャールズ; ヴェラ，フワンセバスティアンロドリゲス
Original assignee: リカージョンファーマシューティカルズインコーポレイテッド
Priority date: 2020-07-27
Filing date: 2021-07-19
Publication date: 2024-06-06
Anticipated expiration: 2041-07-19
Also published as: EP4189641A4; JP2023536695A; IL300002A; CA3186058A1; AU2021316176A1; AU2021316176B2; CN116210032A; WO2022026226A1; EP4189641A1

Description

CROSS-REFERENCE TO RELATED APPLICATIONS

この出願は、２０２０年７月２７日に出願された米国特許出願係属番号第１６／９４０，３２０号の利益を主張するものであり、また２０２０年７月２７日に出願された米国特許出願係属番号第１６／９４０，３２５号の利益を主張するものである。これらの関連出願の主題は、参照により本明細書に組み込まれる。 This application claims the benefit of U.S. Patent Application No. 16/940,320, filed July 27, 2020, and also claims the benefit of U.S. Patent Application No. 16/940,325, filed July 27, 2020. The subject matter of these related applications is incorporated herein by reference.

様々な実施形態は、一般に、コンピュータサイエンス及び生化学的分析に関し、より具体的には、マイクロウェルプレートでの実行アーティファクトを分析及び検出するための技法に関する。 Various embodiments relate generally to computer science and biochemical analysis, and more specifically to techniques for analyzing and detecting performance artifacts in microwell plates.

ハイスループットスクリーニングは、研究者が１日あたり数万、または数十万さえもの化学的、生物学的、遺伝的、及び／または薬理検査を行うことを可能にする自動化プロセスである。典型的な実験では、統合システムがマイクロウェルプレートのセットを使用して試験を自動的に実施しており、各プレートには２次元（「２Ｄ」）グリッドのウェルが含まれている。統合されたシステムは、試験用の様々な化合物のサンプルを、ターゲットのサンプルと共に様々なウェルに分注する。化合物とターゲット間のいずれかの反応が発生するまでのインキュベーション期間の後、様々な測定がウェルで実行され、結果が測定値の２Ｄ配列として、実験データセットに保存される。 High-throughput screening is an automated process that allows researchers to perform tens or even hundreds of thousands of chemical, biological, genetic, and/or pharmacological tests per day. In a typical experiment, an integrated system automatically performs tests using a set of microwell plates, each plate containing a two-dimensional ("2D") grid of wells. The integrated system dispenses samples of various compounds to be tested into the various wells along with samples of the target. After an incubation period until any reaction between the compounds and the target occurs, various measurements are performed on the wells and the results are stored in an experimental dataset as a 2D array of measurements.

ハイスループットスクリーニングに関連する１つの課題は、実験データセットに、実験自体の実行に起因する「実行アーティファクト」と呼ばれる特定のエラーが含まれ得る。例えば、分注ノズルが部分的に詰まっていて、そのノズルに割り当てられた特定のウェルにターゲットのサンプルを適切に分注できない場合、それらの特定のウェルで実行された測定では、ターゲットと、ウェルに対応する化合物との間の実際の反応または「完全な」反応を捉えないであろう。特定のウェルで実行された測定から得られた測定値は不正確であり、実際の反応または「完全な」反応を反映していないため、これらの測定値は、実験データセットの全体的な質を低下させる実行アーティファクトと見なされる。一般に、質の劣悪な実験データセットを使用しながらターゲットに対する様々な化合物の有効性について有効な結論を導き出すことは、はるかに困難である。したがって、実験データセットの実行アーティファクトを特定して軽減するために、様々な試みが行われてきた。 One challenge associated with high throughput screening is that the experimental data set may contain certain errors, called "run artifacts", that result from the execution of the experiment itself. For example, if a dispensing nozzle is partially clogged and cannot properly dispense samples of the target into the specific wells assigned to that nozzle, the measurements performed in those specific wells will not capture the actual or "complete" reaction between the target and the compound corresponding to the well. Because the measurements obtained from measurements performed in specific wells are inaccurate and do not reflect the actual or "complete" reaction, these measurements are considered run artifacts that reduce the overall quality of the experimental data set. In general, it is much more difficult to draw valid conclusions about the effectiveness of various compounds against the target while using an experimental data set of poor quality. Thus, various attempts have been made to identify and mitigate run artifacts in experimental data sets.

実行アーティファクトを識別する１つのアプローチでは、人間のレビュー担当者が「ヒートマップ」（測定値の様々な配列、または測定値の様々な配列の視覚的表現）を分析して、実行アーティファクトを示す測定値の異常なパターンを検出しようとする。異常なパターンを特定すると、レビュー担当者は通常、関連するプレートに注釈を付けて、疑わしい実行アーティファクト（複数可）のタイプと重大度を示す。注釈が付けられた情報に基づいて、プレートに関連付けられた測定値を実験データセットから除外できる及び／または再検討し得る。さらに、一部のタイプの実行アーティファクトでは、アーティファクトの根本原因を判定して修正する試みが行われる。 In one approach to identifying execution artifacts, a human reviewer analyzes "heat maps" (various arrays of measurements, or visual representations of various arrays of measurements) to attempt to detect anomalous patterns of measurements that may be indicative of execution artifacts. Upon identifying an anomalous pattern, the reviewer typically annotates the associated plate to indicate the type and severity of the suspected execution artifact(s). Based on the annotated information, measurements associated with the plate may be removed from the experimental dataset and/or reconsidered. Additionally, for some types of execution artifacts, an attempt is made to determine and correct the root cause of the artifact.

上記のアプローチの欠点の１つは、ヒートマップを手動式に分析すると、時間がかかり、エラーが発生しやすいことである。多くの場合、レビュー担当者は、分析プロセスに利用できる時間内に、実験に関連するすべてのヒートマップを適切に精査することができない。利用可能な時間で分析プロセスを完了するために、レビュー担当者は通常、限られた数のヒートマップのみを分析する、及び／またはヒートマップの大まかな分析を実行する。その結果、ヒートマップに反映された実行上の異常が見落とされ得る、または誤解され得る。 One drawback of the above approach is that manually analyzing heatmaps is time-consuming and error-prone. Often, reviewers are unable to adequately scrutinize all heatmaps associated with an experiment in the time available for the analysis process. To complete the analysis process in the available time, reviewers typically analyze only a limited number of heatmaps and/or perform a cursory analysis of the heatmaps. As a result, execution anomalies reflected in the heatmaps may be overlooked or misinterpreted.

もう１つの欠点は、実行アーティファクトの識別が主観的なプロセスであることである。したがって、同様の視覚パターンを示すヒートマップであっても、特定された実行アーティファクトの数及び／またはタイプは、レビュー担当者によって異なり得る。さらに、手動式のレビュープロセスは、実行アーティファクトを一貫して特定するものではないため、実行アーティファクトの経時的な傾向を検出することは、不可能ではないにしても非常に困難である。その結果、実行アーティファクトを減らすための実験プロセス及び／または機器を改善する機会が失われる可能性がある。 Another drawback is that identifying execution artifacts is a subjective process. Thus, even heat maps that show similar visual patterns may vary in the number and/or type of execution artifacts identified by different reviewers. Furthermore, because manual review processes do not consistently identify execution artifacts, it is very difficult, if not impossible, to detect trends in execution artifacts over time. As a result, opportunities to improve experimental processes and/or equipment to reduce execution artifacts may be missed.

上記のことが示すように、当技術分野で必要とされるのは、マイクロウェルプレートを含む実験で実行アーティファクトを分析及び検出するためのより効果的な技術である。 As the above indicates, what is needed in the art are more effective techniques for analyzing and detecting performance artifacts in experiments involving microwell plates.

本発明の一実施形態は、マイクロウェルプレートを含む実験において実行アーティファクトを検出するための方法を設定する。この方法は、第１のマイクロウェルプレートに関連付けられた１つまたは複数のヒートマップに基づいて空間的特徴の１つまたは複数のセットを計算すること、第１の特徴ベクトルを生成するために、空間的特徴の１つまたは複数のセットを集約すること、及び第１の特徴ベクトルを訓練された分類器に入力することであって、それに応じて、第１のマイクロウェルプレートが第１の実行アーティファクトに関連付けられていることを示す第１のラベルを生成する、入力することを含む。 One embodiment of the present invention provides a method for detecting execution artifacts in an experiment involving a microwell plate. The method includes calculating one or more sets of spatial features based on one or more heat maps associated with a first microwell plate, aggregating the one or more sets of spatial features to generate a first feature vector, and inputting the first feature vector into a trained classifier, which in response generates a first label indicating that the first microwell plate is associated with a first execution artifact.

従来技術と比較した開示の技術の少なくとも１つの技術的利点は、開示されている技術を使用して、マイクロウェルプレートを伴う実験で実行アーティファクトをより正確かつ一貫して分析及び検出できることである。とりわけ、開示されている技術により、各マイクロウェルプレートは、マイクロウェルプレートに対して生成されたヒートマップで検出された空間パターンに基づいて、自動的に分類される。したがって、ヒートマップに反映された実行上の異常が見落とされたり誤解されたりする可能性は、従来技術のアプローチに比べて減少する。さらに、マイクロウェルプレートは実行アーティファクトに関して一貫した客観的な方法で分類されるため、経時的な実行アーティファクトの傾向を効果的に検出及び使用して、実験のプロセス及び／または装置を改善することができる。これらの技術的な利点は、従来技術のアプローチに対して１つまたは複数の技術的な改善をもたらす。 At least one technical advantage of the disclosed technology over the prior art is that the disclosed technology can be used to more accurately and consistently analyze and detect execution artifacts in experiments involving microwell plates. In particular, the disclosed technology automatically classifies each microwell plate based on spatial patterns detected in a heat map generated for the microwell plate. Thus, the likelihood that execution anomalies reflected in the heat map will be overlooked or misinterpreted is reduced compared to prior art approaches. Furthermore, because microwell plates are classified in a consistent and objective manner with respect to execution artifacts, trends in execution artifacts over time can be effectively detected and used to improve experimental processes and/or equipment. These technical advantages provide one or more technical improvements over prior art approaches.

多様な実施形態の上段にて列挙された特徴が詳細に理解できるように、上記で簡潔に要約された本発明の概念のより具体的な説明は、多様な実施形態を参照することによって行われ得、特定の実施形態のそれぞれは添付図で示される。しかし、添付の図面が本発明の概念の典型的な実施形態だけを示し、ひいては、決して範囲を限定するものと見なされるべきではなく、他にも同等に効果的な実施形態があるということに留意されたい。 So that the features recited above in the various embodiments can be understood in detail, a more particular description of the inventive concept briefly summarized above can be made by reference to various embodiments, each of which is illustrated in the accompanying drawings. It should be noted, however, that the accompanying drawings illustrate only exemplary embodiments of the inventive concept, and thus should not be considered as limiting the scope in any way, as there are other equally effective embodiments.

様々な実施形態の１つ以上の態様を実装するように構成されているシステムの概念の図である。FIG. 1 is a conceptual diagram of a system configured to implement one or more aspects of various embodiments. 様々な実施形態による、図１の特徴エンジンのより詳細な図である。2 is a more detailed diagram of the feature engine of FIG. 1 in accordance with various embodiments. 様々な実施形態による、図１の出力エンジンのより詳細な図である。2 is a more detailed diagram of the output engine of FIG. 1 in accordance with various embodiments. 様々な実施形態による、マイクロウェルプレートを含む実験において実行アーティファクトを検出するように分類器をトレーニングするための方法ステップの流れ図である。1 is a flow diagram of method steps for training a classifier to detect performance artifacts in experiments involving microwell plates, according to various embodiments. 様々な実施形態による、訓練された分類器を使用するマイクロウェルプレートを含む実験において実行アーティファクトを検出するための方法ステップの流れ図である。1 is a flow diagram of method steps for detecting performance artifacts in experiments involving microwell plates using a trained classifier, in accordance with various embodiments.

以下の説明では、多様な実施形態のさらに十分な理解をもたらすために多数の具体的な詳細を説明する。しかし、本発明の概念がこれらの具体的な詳細の１つ以上なしに実践され得ることは、当業者にとって明らかである。 In the following description, numerous specific details are set forth to provide a more thorough understanding of various embodiments. However, it will be apparent to one of ordinary skill in the art that the concepts of the present invention may be practiced without one or more of these specific details.

開示された技術は、マイクロウェルプレートを使用して実施された実験における実行の異常を自動的に検出するために使用することができる。トレーニング段階では、トレーニングアプリケーションは、ヒートマップセットに基づいて訓練された分類器を生成し、各ヒートマップセットは、異なるマイクロウェルプレートに関連付けられた１つ以上のヒートマップを明示する。例えば、所与のマイクロウェルプレートのヒートマップセットには、細胞数ヒートマップと任意の数の強度ヒートマップを含めることができ、各強度ヒートマップは異なる蛍光色素に関連付けられる。細胞数ヒートマップは、実験中に使用されるマイクロウェルプレートの各ウェルの細胞数を明示できる。所与の強度ヒートマップは、関連する蛍光色素を介して励起された場合に使用される各ウェルの平均強度を明示できる。 The disclosed techniques can be used to automatically detect run anomalies in experiments conducted using microwell plates. In a training phase, a training application generates a trained classifier based on a heatmap set, where each heatmap set specifies one or more heatmaps associated with a different microwell plate. For example, the heatmap set for a given microwell plate can include a cell count heatmap and any number of intensity heatmaps, where each intensity heatmap is associated with a different fluorescent dye. The cell count heatmap can specify the number of cells in each well of the microwell plate used during the experiment. The given intensity heatmap can specify the average intensity of each well used when excited via the associated fluorescent dye.

重要なことに、マイクロウェルプレートに関連する実行アーティファクトは、多くの場合、関連するヒートマップの１つ以上に低周波の空間パターンとして現れる。このため、各ヒートマップに対して、トレーニングアプリケーションはヒートマップにウェーブレット変換を適用して、一連の低周波空間パターンを判定する。マイクロウェルプレートのそれぞれについて、トレーニングアプリケーションは、関連する低周波空間パターンのセットに基づいて特徴ベクトルを生成する。 Importantly, performance artifacts associated with a microwell plate often manifest as low-frequency spatial patterns in one or more of the associated heatmaps. Thus, for each heatmap, the training application applies a wavelet transform to the heatmap to determine a set of low-frequency spatial patterns. For each microwell plate, the training application generates a feature vector based on the set of associated low-frequency spatial patterns.

トレーニングアプリケーションは、クラスタリングアルゴリズムを実行して、特徴ベクトルをクラスタに分割する。各クラスタ内の特徴ベクトルは、他のクラスタの特徴ベクトルよりも互いに類似している。トレーニングアプリケーションは、グラフィカルユーザインターフェース（「ＧＵＩ」）を介してオーバーライドできるクラスタごとに異なるラベルを生成する。例えば、トレーニングアプリケーションは、マイクロウェルプレートに関連付けられているクラスタのラベル「Ｌ１」を自動的に生成でき、このクラスタでは、下から４番目の行は、マイクロウェルプレートの他の行に比べて細胞数が少なく、強度の値も低くなっている。ラベル「Ｌ１」は、ＧＵＩを介してラベル「行の不履行」に更新できる。特徴ベクトル及び関連付けられたラベルに基づいて、トレーニングアプリケーションは分類器をトレーニングして、特徴ベクトルを、予測ラベルと、関連付けられたラベル信頼度にマッピングする。 The training application runs a clustering algorithm to divide the feature vectors into clusters. The feature vectors in each cluster are more similar to each other than to the feature vectors of other clusters. The training application generates a different label for each cluster that can be overridden via a graphical user interface ("GUI"). For example, the training application can automatically generate a label "L1" for a cluster associated with a microwell plate, where the fourth row from the bottom has fewer cells and lower intensity values compared to the other rows of the microwell plate. The label "L1" can be updated via the GUI to the label "row default". Based on the feature vectors and associated labels, the training application trains a classifier to map the feature vectors to a predicted label and associated label confidence.

その後、推論段階で、実験分析アプリケーションは、訓練された分類器を使用して、マイクロウェルプレートのセット用のヒートマップセットに基づいて、マイクロウェルプレートのセットを介して実施される実験に関連する実行の異常を検出及び評価する。実験分析アプリケーションは、マイクロウェルプレートのセット用のヒートマップセットに基づいて、実験全体を表す平均的なヒートマップセットを生成する。実験アプリケーションは、実験に関連付けられた各ヒートマップセット（平均的なヒートマップセットを含む）を訓練された分類器に入力して、予測ラベルとラベル信頼度を生成する。 Then, in the inference phase, the experiment analysis application uses the trained classifier to detect and evaluate anomalies in execution associated with experiments conducted through the set of micro-well plates based on the heatmap set for the set of micro-well plates. The experiment analysis application generates an average heatmap set representative of the entire experiment based on the heatmap set for the set of micro-well plates. The experiment application inputs each heatmap set associated with an experiment (including the average heatmap set) into the trained classifier to generate a predicted label and a label confidence.

マイクロウェルプレートのセットに含まれる各マイクロウェルプレートについて、実験分析アプリケーションは、予測ラベルに関連付けられたクラスタに対してマイクロウェルプレートがどの程度外れているかを示す異常スコアを計算する。実験全体について、実験分析アプリケーションは、実験全体の予測ラベルと等しい予測ラベルを有する実験に関連付けられたマイクロウェルプレートのパーセンテージを明示する、整合するプレートの割合を計算する。次に、実験分析アプリケーションは、任意の数の予測ラベル、ラベルの信頼度、異常スコア、及び整合するプレートの割合を、任意の組み合わせで、任意の数のソフトウェアアプリケーション及び／またはディスプレイに、ＧＵＩを介して与える。このように、実験分析アプリケーションは、実験全体と、関連するマイクロウェルプレートのそれぞれを、実行アーティファクトに関して一貫した客観的な方法で、分類する。 For each micro-well plate in the set of micro-well plates, the experiment analysis application calculates an anomaly score indicating how far the micro-well plate deviates from the cluster associated with the predicted label. For the entire experiment, the experiment analysis application calculates a matching plate percentage, which indicates the percentage of micro-well plates associated with the experiment that have a predicted label equal to the predicted label of the entire experiment. The experiment analysis application then provides any number of predicted labels, label confidences, anomaly scores, and matching plate percentages, in any combination, to any number of software applications and/or displays via a GUI. In this way, the experiment analysis application classifies the entire experiment and each of the associated micro-well plates in a consistent and objective manner with respect to execution artifacts.

システムの概要
図１は、様々な実施形態の１つ以上の態様を実装するように構成されているシステムの概念の図である。説明の目的で、同様の対象物の複数のインスタンスは、対象物を識別する参照番号と、必要に応じてインスタンスを識別する括弧内の英数字（複数可）で示される。示されるように、システム１００は、非限定的に、計算インスタンス１１０（１）及び１１０（２）、表示デバイス１０８（１）及び１０８（２）、ラベル付けされていないトレーニングデータセット１０２、及び実験データセット１０６を含む。 System Overview Figure 1 is a conceptual diagram of a system configured to implement one or more aspects of various embodiments. For purposes of illustration, multiple instances of similar objects are indicated with a reference number that identifies the object and, where appropriate, an alphanumeric character(s) in parentheses that identifies the instance. As shown, the system 100 includes, without limitation, computational instances 110(1) and 110(2), display devices 108(1) and 108(2), an unlabeled training data set 102, and an experimental data set 106.

いくつかの実施形態では、システム１００は、非限定的に、任意の数の計算インスタンス１１０、任意の数の表示デバイス１０８、任意の数のラベル付けされていないトレーニングデータセット１０２、及び任意の数の実験データセット１０６を任意の組み合わせで含むことができる。システム１００の構成要素は、任意の数の共有された地理的位置及び／または任意の数の異なる地理的位置に分散させる、及び／または１つまたは複数のクラウドコンピューティング環境（つまり、カプセル化された共有リソース、ソフトウェア、データなど）に任意の組み合わせで実装され得る。 In some embodiments, the system 100 may include, without limitation, any combination of any number of compute instances 110, any number of display devices 108, any number of unlabeled training data sets 102, and any number of experimental data sets 106. Components of the system 100 may be distributed across any number of shared geographic locations and/or any number of different geographic locations, and/or implemented in one or more cloud computing environments (i.e., encapsulating shared resources, software, data, etc.).

示されるように、計算インスタンス１１０（１）は、プロセッサ１１２（１）及びメモリ１１６（１）を含むが、これらに限定されず、計算インスタンス１１０（２）は、プロセッサ１１２（２）及びメモリ１１６（２）を含むが、これらに限定されない。計算インスタンス１１０（１）及び１１０（２）はまた、本明細書では個別に「計算インスタンス１１０」と称し、集合的に「計算インスタンス１１０」と呼ぶ。プロセッサ１１２（１）及び１１２（２）はまた、本明細書では個別に「プロセッサ１１２」と称し、集合的に「プロセッサ１１２」と呼ぶ。メモリ１１６（１）及び１１６（２）はまた、本明細書では個別に「メモリ１１６」と称し、集合的に「メモリ１１６」と呼ぶ。計算インスタンス１１０のそれぞれは、クラウドコンピューティング環境で実装されてもよく、任意の他の分散コンピューティング環境の一部として実装されても、スタンドアロン方式で実装されてもよい。 As shown, compute instance 110(1) includes, but is not limited to, processor 112(1) and memory 116(1), and compute instance 110(2) includes, but is not limited to, processor 112(2) and memory 116(2). Computation instances 110(1) and 110(2) are also referred to herein individually as "computation instance 110" and collectively as "computation instance 110". Processors 112(1) and 112(2) are also referred to herein individually as "processor 112" and collectively as "processor 112". Memory 116(1) and 116(2) are also referred to herein individually as "memory 116" and collectively as "memory 116". Each of compute instances 110 may be implemented in a cloud computing environment, as part of any other distributed computing environment, or in a stand-alone manner.

プロセッサ１１２は、命令を実行できる任意の命令実行システム、装置、またはデバイスであり得る。例えば、プロセッサ１１２は、中央処理装置、グラフィックス処理装置、コントローラ、マイクロコントローラ、ステートマシン、またはそれらの任意の組み合わせを含むことができる。計算インスタンス１１０のメモリ１１６は、計算インスタンス１１０のプロセッサ１１２による使用のために、ソフトウェアアプリケーション及びデータなどのコンテンツを格納する。いくつかの実施形態では、任意の数の計算インスタンス１１０のそれぞれが、任意の数のプロセッサ１１２及び任意の数のメモリ１１６を任意の組み合わせで含むことができる。 The processor 112 may be any instruction execution system, apparatus, or device capable of executing instructions. For example, the processor 112 may include a central processing unit, a graphics processing unit, a controller, a microcontroller, a state machine, or any combination thereof. The memory 116 of the compute instance 110 stores content such as software applications and data for use by the processor 112 of the compute instance 110. In some embodiments, each of any number of compute instances 110 may include any number of processors 112 and any number of memories 116 in any combination.

計算インスタンス１１０のそれぞれは、クラウドコンピューティング環境で実装されてもよく、任意の他の分散コンピューティング環境の一部として実装されても、スタンドアロン方式で実装されてもよい。特に、任意の数の計算インスタンス１１０（１つを含む）が、任意の技術的に実行可能な方法で、マルチプロセッシング環境を提供することができる。 Each of the computational instances 110 may be implemented in a cloud computing environment, as part of any other distributed computing environment, or in a stand-alone manner. In particular, any number of computational instances 110 (including one) may provide a multiprocessing environment in any technically feasible manner.

メモリ１１６は、ランダムアクセスメモリ、読み取り専用メモリ、フロッピーディスク、ハードディスク、またはローカルまたはリモートの任意の他の形態のデジタル記憶装置など、容易に利用可能なメモリの１つまたは複数とすることができる。いくつかの実施形態では、記憶装置（図示せず）は、メモリ１１６を補足または置換することができる。記憶装置は、プロセッサ１１２にアクセス可能な任意の数及びタイプの外部メモリを含むことができる。例えば、非限定的に、記憶装置は、セキュアデジタルカード、外部フラッシュメモリ、ポータブルコンパクトディスク読み取り専用メモリ、光記憶装置、磁気記憶装置、または上記の任意の適切な組み合わせを含むことができる。 Memory 116 can be one or more of readily available memory, such as random access memory, read-only memory, floppy disk, hard disk, or any other form of digital storage, local or remote. In some embodiments, storage (not shown) can supplement or replace memory 116. Storage can include any number and type of external memory accessible to processor 112. For example, without limitation, storage can include secure digital cards, external flash memory, portable compact disk read-only memory, optical storage, magnetic storage, or any suitable combination of the above.

一部の実施形態では、計算インスタンス１１０は、任意の数（ゼロを含む）及び／またはタイプの入力デバイス、出力デバイス、及び／または入力／出力デバイスに、任意の組み合わせで関連付けることができる。入力デバイスは、ユーザからの入力を受け取ることができる任意のデバイスである。入力デバイスのいくつかの例には、キーボード、マウス、トラックパッド、マイク、ビデオカメラなどがあるが、これらに限定されない。出力デバイスは、ユーザに出力を与えることができる任意のデバイスである。出力デバイスのいくつかの例には、表示デバイス１０８、ヘッドホン、スピーカーなどがあるが、これらに限定されない。入力／出力デバイスは、タッチスクリーンなど、ユーザからの入力の受信とユーザへの出力の両方が可能な任意のデバイスである。 In some embodiments, a computation instance 110 may be associated with any number (including zero) and/or types of input devices, output devices, and/or input/output devices, in any combination. An input device is any device capable of receiving input from a user. Some examples of input devices include, but are not limited to, a keyboard, a mouse, a trackpad, a microphone, a video camera, etc. An output device is any device capable of providing output to a user. Some examples of output devices include, but are not limited to, a display device 108, headphones, speakers, etc. An input/output device is any device capable of both receiving input from a user and providing output to a user, such as a touch screen.

示されるように、いくつかの実施形態では、計算インスタンス１１０（１）は表示デバイス１０８（１）に関連付けられ、計算インスタンス１１０（２）は表示デバイス１０８（２）に関連付けられる。表示デバイス１０８（１）及び１０８（２）はまた、本明細書では個別に「表示デバイス１０８」、また集合的に「表示デバイス１０８」と呼ぶ。表示デバイス１０８は、画像を表示できる任意の装置及び／またはその他の種類の視覚コンテンツとすることができる。表示デバイス１０８のいくつかの例には、液晶ディスプレイ、発光ダイオードディスプレイ、プロジェクションディスプレイ、プラズマディスプレイパネルなどがあるが、これらに限定されない。いくつかの実施形態では、表示デバイス１０８は、ビジュアルコンテンツの表示と入力（例えば、ユーザからの）の受信ができるタッチスクリーンである。 As shown, in some embodiments, computation instance 110(1) is associated with display device 108(1) and computation instance 110(2) is associated with display device 108(2). Display devices 108(1) and 108(2) are also referred to herein individually as "display device 108" and collectively as "display device 108". Display device 108 can be any device capable of displaying images and/or other types of visual content. Some examples of display device 108 include, but are not limited to, a liquid crystal display, a light emitting diode display, a projection display, a plasma display panel, and the like. In some embodiments, display device 108 is a touch screen capable of displaying visual content and receiving input (e.g., from a user).

いくつかの実施形態では、計算インスタンス１１０は、任意の数及び／または種類の他のデバイス（例えば、他の計算インスタンス１１０、入力デバイス、出力デバイス、入力／出力デバイスなど）をユーザデバイスに統合することができる。ユーザデバイスの一部の例には、デスクトップコンピュータ、ラップトップ、スマートフォン、スマートテレビ、ゲームコンソール、タブレットなどが、非限定的に含まれる。 In some embodiments, the computing instance 110 can integrate any number and/or type of other devices (e.g., other computing instances 110, input devices, output devices, input/output devices, etc.) into a user device. Some examples of user devices include, but are not limited to, desktop computers, laptops, smartphones, smart televisions, game consoles, tablets, etc.

一般に、計算インスタンス１１０のそれぞれは、１つまたは複数のアプリケーションを実装するように構成される。説明のみを目的として、各アプリケーションは、単一の計算インスタンス１１０のメモリ１１６に存在し、単一の計算インスタンス１１０のプロセッサ１１２で実行されるものとして記載されている。しかし、いくつかの実施形態では、各アプリケーションの機能は、任意の数の計算インスタンス１１０のメモリ１１６に存在し、任意の数の計算インスタンス１１０のプロセッサ１１２で、任意の組み合わせで実行される任意の数の他のアプリケーションにわたって分散され得る。さらに、任意の数のアプリケーションの機能を単一のアプリケーションに統合できる。 Generally, each of the compute instances 110 is configured to implement one or more applications. For purposes of explanation only, each application is described as residing in the memory 116 of a single compute instance 110 and executing on the processor 112 of a single compute instance 110. However, in some embodiments, the functionality of each application may be distributed across any number of other applications residing in the memory 116 of any number of compute instances 110 and executing in any combination on the processors 112 of any number of compute instances 110. Furthermore, the functionality of any number of applications may be integrated into a single application.

いくつかの実施形態では、任意の数のアプリケーション及び／またはアプリケーションの一部は、１つまたは複数の非一時的コンピュータ可読媒体に格納される。本明細書で使用される「非一時的」という用語は、データストレージの持続性に対する限界（例えば、ＲＡＭ対ＲＯＭ）とは対照的な、メディア自体の限界（つまり、信号ではなく触知可能なもの）である。非一時的コンピュータ可読媒体は、本明細書では「コンピュータ可読媒体」とも呼ばれる。例えば、いくつかの実施形態では、メモリ１１６（１）はコンピュータ可読媒体であり、任意の数のアプリケーション及び／またはアプリケーションの一部は、メモリ１１６（１）に格納される。同じまたは他の実施形態で、メモリ１１６（２）はコンピュータ可読媒体であり、任意の数のアプリケーション及び／またはアプリケーションの一部は、メモリ１１６（２）に格納される。 In some embodiments, the number of applications and/or portions of the applications are stored on one or more non-transitory computer readable media. The term "non-transitory" as used herein is a limitation of the medium itself (i.e., tangible, not signal) as opposed to a limitation on persistence of data storage (e.g., RAM vs. ROM). Non-transitory computer readable media are also referred to herein as "computer readable media." For example, in some embodiments, memory 116(1) is a computer readable medium and the number of applications and/or portions of the applications are stored in memory 116(1). In the same or other embodiments, memory 116(2) is a computer readable medium and the number of applications and/or portions of the applications are stored in memory 116(2).

いくつかの実施形態では、任意の数のアプリケーション及び／またはアプリケーションの一部は、メモリ１１６（１）及び／またはメモリ１１６（２）に格納される前に、１つまたは複数のコンピュータ可読媒体に格納される。例えば、いくつかの実施形態では、任意の数のアプリケーション及び／またはアプリケーションの一部はマシン（サーバーマシンなど）に保存され、任意の数のアプリケーション及び／またはアプリケーションの一部がマシンからメモリ１１６（１）及び／またはメモリ１１６（２）にダウンロードされる。同じまたは他の実施形態において、任意の数のアプリケーション及び／またはアプリケーションの一部は、何らかの形式のポータブルコンピュータ可読媒体に保存され、任意の数のアプリケーション及び／またはアプリケーションの一部は、ポータブルコンピュータ可読媒体からメモリ１１６（１）及び／またはメモリ１１６（２）にダウンロードされる。ポータブルコンピュータ可読媒体の一部の例は、デジタルビデオディスク、メモリディスク、メモリスティックなどを含むが、これらに限定されない。 In some embodiments, the number of applications and/or portions of the applications are stored on one or more computer readable media before being stored in memory 116(1) and/or memory 116(2). For example, in some embodiments, the number of applications and/or portions of the applications are stored on a machine (e.g., a server machine) and the number of applications and/or portions of the applications are downloaded from the machine to memory 116(1) and/or memory 116(2). In the same or other embodiments, the number of applications and/or portions of the applications are stored on some form of portable computer readable medium and the number of applications and/or portions of the applications are downloaded from the portable computer readable medium to memory 116(1) and/or memory 116(2). Examples of portions of portable computer readable media include, but are not limited to, digital video disks, memory disks, memory sticks, etc.

いくつかの実施形態では、本開示の態様は、コンピュータ可読プログラムコーデックが具現化された１つまたは複数のコンピュータ可読媒体で具現化されたコンピュータプログラム製品の形態をとることができる。１つ以上のコンピュータ可読媒体の任意の組み合わせを利用し得る。各コンピュータ可読媒体は、コンピュータ可読信号媒体またはコンピュータ可読記憶媒体であり得る。コンピュータ可読記憶媒体は、例えば、限定ではないが、電子、磁気、光、電磁気、赤外線、もしくは半導体システム、装置、もしくはデバイス、または任意の前述の好適な組み合わせであり得る。コンピュータ可読記憶媒体のさらなる具体例（包括的ではない列挙）は、以下、１つ以上の通信回線を有する電気的接続、ポータブルコンピュータディスケット、ハードディスク、ランダムアクセスメモリ、読み取り専用メモリ、消去可能プログラマブルＲＯＭ、またはフラッシュメモリ）、光ファイバ、ポータブルコンパクトディスク読み取り専用メモリ、光学記憶デバイス、磁気記憶デバイス、または前述の任意の好適な組み合わせを含むであろう。本文書の文脈において、コンピュータ可読記憶媒体は、命令実行システム、装置、もしくはデバイスによる使用のため、またはそれらと接続してプログラムを含むまたは記憶することができる任意の有形媒体であり得る。 In some embodiments, aspects of the disclosure may take the form of a computer program product embodied in one or more computer readable media having computer readable program codecs embodied therein. Any combination of one or more computer readable media may be utilized. Each computer readable medium may be a computer readable signal medium or a computer readable storage medium. A computer readable storage medium may be, for example, but not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any suitable combination of the foregoing. Further specific examples (non-exhaustive enumerations) of computer readable storage media would include the following: an electrical connection having one or more communication lines, a portable computer diskette, a hard disk, a random access memory, a read only memory, an erasable programmable ROM, or a flash memory), an optical fiber, a portable compact disk read only memory, an optical storage device, a magnetic storage device, or any suitable combination of the foregoing. In the context of this document, a computer readable storage medium may be any tangible medium capable of containing or storing a program for use by or in connection with an instruction execution system, apparatus, or device.

計算インスタンス１１０は、ハイスループットスクリーニングを使用して実行されるマイクロウェルプレートを含む実験における実行アーティファクトの根本原因の分析を検出し、促進するように構成される。本明細書で前述しているように、典型的な実験では、統合システムがマイクロウェルプレートのセットを使用して自動的に試験を実施する。本明細書で言及されるように、マイクロウェルプレートは、非限定的に、各々が限定された容積を保持することができる２Ｄグリッドのウェルを含む任意のプレートであり得る。マイクロウェルプレートはまた、一般にマイクロプレート、マルチウェルプレート、及びマルチウェル培養プレートとも呼ばれる。 The computational instance 110 is configured to detect and facilitate root cause analysis of execution artifacts in experiments involving microwell plates performed using high throughput screening. As previously described herein, in a typical experiment, an integrated system automatically performs tests using a set of microwell plates. As referred to herein, a microwell plate can be, without limitation, any plate that includes a 2D grid of wells, each capable of holding a limited volume. Microwell plates are also commonly referred to as microplates, multiwell plates, and multiwell culture plates.

各実験には、任意の数の化学的、生物学的、遺伝的、及び／または薬理学的試験が含まれるが、これらに限定されず、各試験は異なるウェルの中で実施される。典型的な実験では、ウェルへの試験の割り当ては、各マイクロウェルプレート内でランダム化される。試験が完了した後、ウェルに対して様々な測定が実行され、結果が測定値の２Ｄ配列として実験のデータセットに保存される。試験の割り当てはランダム化されているため、各２Ｄ配列内の測定値の分布は、表向きランダムである。 Each experiment may include, but is not limited to, any number of chemical, biological, genetic, and/or pharmacological tests, with each test being performed in a different well. In a typical experiment, the assignment of tests to wells is randomized within each microwell plate. After a test is completed, various measurements are performed on the wells and the results are stored in the experiment's dataset as a 2D array of measurements. Because the assignment of tests is randomized, the distribution of measurements within each 2D array is ostensibly random.

ハイスループットスクリーニングに関連する１つの課題は、実験データセットに、実験自体に起因する「実行アーティファクト」と知られる特定のエラーが含まれ得る。実行アーティファクトの根本原因の一部に、機器の問題、キャリブレーションの問題、環境の変化などがあるが、これらに限定されない。当業者が認識するように、これらのタイプの状況は、測定値の２Ｄ配列の１つまたは複数における低周波数の空間パターンとして現れることが多い。一般的な問題として、関連する実験データセットの質が実行アーティファクトによって大幅に低下している場合、実験に関して有効な結論を引き出すことは、はるかにより困難である。したがって、実験データセットの実行アーティファクトを特定して軽減するために、様々な試みが行われてきた。 One challenge associated with high throughput screening is that experimental data sets may contain certain errors known as "execution artifacts" that originate from the experiment itself. Some of the root causes of execution artifacts include, but are not limited to, instrument issues, calibration issues, and environmental changes. As one skilled in the art will recognize, these types of situations often manifest as low frequency spatial patterns in one or more of the 2D arrays of measurements. As a general problem, it is much more difficult to draw valid conclusions about an experiment when the quality of the associated experimental data set is significantly degraded by execution artifacts. Thus, various attempts have been made to identify and mitigate execution artifacts in experimental data sets.

実行アーティファクトを識別するためのレビューベースのアプローチでは、人間のレビュー担当者がヒートマップを視覚的に分析して、実行アーティファクトを示す測定値の異常なパターンを検出しようとする。そのようなレビューに基づいたアプローチの欠点の１つは、手動式でヒートマップを分析すると時間がかかり、エラーが発生しやすいことである。その結果、ヒートマップに反映された実行上の異常が見落とされたり、誤解されたりする可能性がある。もう１つの欠点は、実行アーティファクトの識別が主観的なプロセスであることである。したがって、同様の視覚パターンを示すヒートマップであっても、特定された実行アーティファクトのタイプ及び／または数は、レビュー担当者によって異なり得る。 In a review-based approach to identifying execution artifacts, human reviewers visually analyze heat maps to attempt to detect anomalous patterns in measurements that are indicative of execution artifacts. One drawback of such review-based approaches is that manually analyzing heat maps is time-consuming and error-prone. As a result, execution anomalies reflected in the heat maps may be overlooked or misinterpreted. Another drawback is that identifying execution artifacts is a subjective process. Thus, even for heat maps that show similar visual patterns, the type and/or number of execution artifacts identified may vary across reviewers.

空間情報に基づく実行アーティファクトの自動識別
上記の問題に対処するために、計算インスタンス１１０（１）は、トレーニングアプリケーション１２０を含むが、これに限定されない。以下に説明するように、トレーニングアプリケーション１２０は、マイクロウェルプレートに関連付けられた特徴ベクトル１３８を、マイクロウェルプレートに関連付けられる実行アーティファクトのタイプ及び／または重大度を示す予測ラベル１８６に自動的にマッピングする、訓練された分類器１７０を生成する。いくつかの実施形態では、訓練された分類器１７０は、予測ラベル１８６が正確である可能性に相関するラベル信頼度１８８も生成する。 Automatic Identification of Performance Artifacts Based on Spatial Information To address the above problems, computational instance 110(1) includes, but is not limited to, a training application 120. As described below, training application 120 generates a trained classifier 170 that automatically maps feature vectors 138 associated with micro-well plates to predicted labels 186 indicative of the type and/or severity of performance artifacts associated with the micro-well plates. In some embodiments, trained classifier 170 also generates a label confidence 188 that correlates to the likelihood that predicted label 186 is accurate.

説明のみを目的として、「特徴ベクトル１３８」は、特定のインスタンスがいずれかの図に示されているかどうかに関係なく、特徴ベクトル１３８の任意のインスタンスを指す。「予測ラベル１８６」は、特定のインスタンスがいずれかの図に示されているかどうかに関係なく、予測ラベル１８６の任意のインスタンスを指す。「ラベル信頼度１８８」は、特定のインスタンスがいずれかの図に示されているかどうかに関係なく、ラベル信頼度１８８の任意のインスタンスを指す。 For purposes of explanation only, "feature vector 138" refers to any instance of feature vector 138, regardless of whether the particular instance is shown in any figure. "Predicted label 186" refers to any instance of predicted label 186, regardless of whether the particular instance is shown in any figure. "Label confidence 188" refers to any instance of label confidence 188, regardless of whether the particular instance is shown in any figure.

トレーニングアプリケーション１２０は、計算インスタンス１１０（１）のメモリ１１６（１）に存在し、計算インスタンス１１０（１）のプロセッサ１１２（１）で実行する。いくつかの実施形態では、トレーニングアプリケーション１２０は、ラベル付けされていないトレーニングデータセット１０２に基づいて、訓練された分類器１７０を生成する。図示のように、ラベル付けされていないトレーニングデータセット１０２は、ヒートマップセット１０４（１）～１０４（Ｈ）を含むが、これに限定されず、それにおいてＨは任意の正の整数であり得る。ヒートマップセット１０４（１）～１０４（Ｈ）のそれぞれは、異なるマイクロウェルプレートに関連付けられ、Ｈマイクロウェルプレートは、以前に実施された任意の数の実験に関連付けることができる。説明のみを目的として、「ヒートマップセット１０４」は、ヒートマップセット１０４（ヒートマップセット１０４（１）～１０４（Ｈ）のそれぞれを含む）のいずれかのインスタンスを、特定のインスタンスがいずれかの図に示されているかどうかに関係なく、指している。 The training application 120 resides in the memory 116(1) of the computational instance 110(1) and executes on the processor 112(1) of the computational instance 110(1). In some embodiments, the training application 120 generates a trained classifier 170 based on an unlabeled training data set 102. As shown, the unlabeled training data set 102 includes, but is not limited to, heatmap sets 104(1)-104(H), where H can be any positive integer. Each of the heatmap sets 104(1)-104(H) is associated with a different microwell plate, and the H microwell plates can be associated with any number of previously performed experiments. For purposes of explanation only, "heatmap set 104" refers to any instance of the heatmap set 104 (including each of the heatmap sets 104(1)-104(H)), regardless of whether a particular instance is shown in any figure.

ヒートマップセット１０４は、非限定的に、Ｆ個のヒートマップ（図１には図示せず）を含み、それにおいてＦは１以上の整数である。各ヒートマップは、測定値の２Ｄ配列または測定値の２Ｄ配列の視覚的表現であり、各測定値は関連するマイクロウェルプレートに含まれる異なるウェルに対応する。本明細書で言及される場合、「測定値」は、任意の関連するマイクロウェルプレートで実行される測定の数値及び／またはタイプに基づいて、導き出すことができる。一般に、ヒートマップに含まれる測定値の空間的な位置は、関連するマイクロウェルプレートに含まれるウェルの２Ｄグリッド内の対応するウェルの空間的な位置に相関する。いくつかの実施形態では、各ヒートマップは、任意の技術的に実現可能な方法において、いずれかの値の数及び／またはタイプを明示することができる。 Heatmap set 104 includes, without limitation, F heatmaps (not shown in FIG. 1 ), where F is an integer equal to or greater than 1. Each heatmap is a 2D array of measurements or a visual representation of a 2D array of measurements, each measurement corresponding to a different well contained in an associated microwell plate. As referred to herein, a “measurement” can be derived based on the number and/or type of measurements performed on any associated microwell plate. In general, the spatial location of measurements included in a heatmap correlates to the spatial location of the corresponding well within a 2D grid of wells contained in the associated microwell plate. In some embodiments, each heatmap can indicate the number and/or type of any values in any technically feasible manner.

いくつかの実施形態では、各ヒートマップは測定値配列に置き換えられ、ヒートマップセット１０４は測定値配列のセットに置き換えられる。各測定値配列は、任意の数の測定値を含むが、これに限定されない。所与の測定値配列に含まれる各測定値について、測定値配列は、任意の技術的に実現可能な方法で、関連するマイクロウェルプレートの対応するウェルを示す。いくつかの他の実施形態では、ヒートマップセット１０４は、任意の数の測定値に置き換えられ、測定値のそれぞれについて、測定値のタイプ及び関連するマイクロウェルプレートの対応するウェルが、任意の技術的に実現可能な方法で示される。本明細書に記載の技術は、適宜修正される。 In some embodiments, each heat map is replaced with a measurement array, and the heat map set 104 is replaced with a set of measurement arrays. Each measurement array may include, but is not limited to, any number of measurements. For each measurement included in a given measurement array, the measurement array indicates the corresponding well of the associated microwell plate in any technically feasible manner. In some other embodiments, the heat map set 104 is replaced with any number of measurements, and for each measurement, the type of measurement and the corresponding well of the associated microwell plate are indicated in any technically feasible manner. The techniques described herein may be modified as appropriate.

実行アーティファクトの周知であるタイプの１つは、マイクロウェルプレートの周囲にあるウェルに関連する「エッジアーティファクト」である。マイクロウェルプレートの周囲に配置されたウェル内で実施されたいずれかの試験の結果は、通常、物理的及び環境的変動（蒸発など）によって損なわれる。このため、いくつかの実施形態では、試験はマイクロウェルプレートの外周にあるウェルの中では実施されない。したがって、マイクロウェルプレートに含まれるウェルの２Ｄグリッドのサイズは、関連するヒートマップのサイズよりも大きくなる。例えば、いくつかの実施形態では、各マイクロウェルプレートは、非限定的に、３２×４８グリッドのウェルを含み、ヒートマップのそれぞれは、３０×４６という配列の測定値である。 One well-known type of execution artifact is the "edge artifact" associated with wells at the periphery of a microwell plate. The results of any tests performed in wells located at the periphery of a microwell plate are typically corrupted by physical and environmental variations (such as evaporation). For this reason, in some embodiments, tests are not performed in wells at the periphery of a microwell plate. Thus, the size of the 2D grid of wells contained in the microwell plate is larger than the size of the associated heat map. For example, in some embodiments, each microwell plate includes, without limitation, a 32 x 48 grid of wells, and each of the heat maps is a measurement of a 30 x 46 array.

ヒートマップセット１０４に含まれる各ヒートマップは、関連するマイクロウェルプレートで実行される異なるタイプの測定に対応する。例えば、いくつかの実施形態では、各ヒートマップセット１０４は、細胞数に対応するヒートマップと、６つの異なる画像化チャネルに対応する６つのヒートマップとを含むが、これらに限定されない。各ヒートマップは、任意の技術的に実現可能な方法で生成できる。例えば、いくつかの実施形態では、画像化チャネルに対応するヒートマップは、画像化チャネルに関連する蛍光色素を介して励起されたときの各ウェルの平均強度を特定する。 Each heatmap included in heatmap set 104 corresponds to a different type of measurement performed on the associated microwell plate. For example, in some embodiments, each heatmap set 104 includes, but is not limited to, a heatmap corresponding to cell count and six heatmaps corresponding to six different imaging channels. Each heatmap may be generated in any technically feasible manner. For example, in some embodiments, the heatmap corresponding to an imaging channel identifies the average intensity of each well when excited via a fluorescent dye associated with the imaging channel.

トレーニングアプリケーション１２０は、任意の技術的に実現可能な方法で、ラベル付けされていないトレーニングデータセット１０２を取得する。例えば、また非限定的に、トレーニングアプリケーション１２０は、ラベル付けされていないトレーニングデータセット１０２をメモリ１１６から読み取る、ラベル付けされていないトレーニングデータセット１０２を入力として受け取る、などを行うことができる。いくつかの実施形態では、トレーニングアプリケーション１２０は、非限定的に、ラベル付けされていないトレーニングデータセット１０２に、任意の数（ゼロを含む）及び／またはタイプの前処理操作を行う。前処理操作のタイプのいくつかの例には、未定義の測定値の補間、極端な測定値のクリッピング、及びヒートマップセット１０４（１）～１０４（Ｈ）の中及び／または全体の正規化が、非限定的に含まれる。 The training application 120 obtains the unlabeled training data set 102 in any technically feasible manner. For example and without limitation, the training application 120 may read the unlabeled training data set 102 from the memory 116, receive the unlabeled training data set 102 as input, etc. In some embodiments, the training application 120 performs any number (including zero) and/or type of preprocessing operations on the unlabeled training data set 102, without limitation. Some examples of types of preprocessing operations include, without limitation, interpolation of undefined measurements, clipping of extreme measurements, and normalization within and/or across the heatmap sets 104(1)-104(H).

示されるように、トレーニングアプリケーション１２０は、非限定的に、特徴エンジン１３０（１）～１３０（Ｈ）（Ｈは、ラベル付けされていないトレーニングデータセット１０２に含まれるヒートマップセット１０４の総数である）、クラスタリングエンジン１４０、ラベル付けエンジン１５０、及びトレーニングエンジン１６０を含む。特徴エンジン１３０（１）～１３０（Ｈ）は、単一の特徴エンジン１３０（明示的には示されていない）の異なるインスタンスである。説明のみを目的として、本明細書で使用される「特徴エンジン１３０」は、特定のインスタンスがいずれかの図に示されているかどうかに関係なく、特徴エンジン１３０の任意のインスタンスを指す。 As shown, the training application 120 includes, without limitation, feature engines 130(1)-130(H) (where H is the total number of heatmap sets 104 included in the unlabeled training dataset 102), a clustering engine 140, a labeling engine 150, and a training engine 160. The feature engines 130(1)-130(H) are different instances of a single feature engine 130 (not explicitly shown). For purposes of explanation only, "feature engine 130" as used herein refers to any instance of feature engine 130, regardless of whether a particular instance is shown in any figure.

図示のように、トレーニングアプリケーション１２０は、ヒートマップセット１０４（１）～１０４（Ｈ）を特徴エンジン１３０（１）～１３０（Ｈ）にそれぞれ入力する。これに応答して、特徴エンジン１３０（１）～１３０（Ｈ）は、特徴ベクトル１３８（１）～１３８（Ｈ）をそれぞれ出力する。いくつかの実施形態では、トレーニングアプリケーション１２０は、特徴エンジン１３０のＨ個未満のインスタンスを含み、トレーニングアプリケーション１２０は、ヒートマップセット１０４（１）～１０４（Ｈ）を特徴エンジン１３０の任意の数のインスタンスに順次、同時に、またはそれらを任意に組み合わせて入力する。例えば、いくつかの実施形態では、トレーニングアプリケーション１２０は、ヒートマップセット１０４（１）～１０４（Ｈ）を特徴エンジン１３０の単一のインスタンスに順次入力する。それに応答して、特徴エンジン１３０の単一のインスタンスは、特徴ベクトル１３８（１）～１３８（Ｈ）を順次出力する。 As shown, the training application 120 inputs the heatmap sets 104(1)-104(H) to the feature engines 130(1)-130(H), respectively. In response, the feature engines 130(1)-130(H) output feature vectors 138(1)-138(H), respectively. In some embodiments, the training application 120 includes fewer than H instances of the feature engine 130, and the training application 120 inputs the heatmap sets 104(1)-104(H) to any number of instances of the feature engine 130 sequentially, simultaneously, or in any combination thereof. For example, in some embodiments, the training application 120 inputs the heatmap sets 104(1)-104(H) sequentially to a single instance of the feature engine 130. In response, the single instance of the feature engine 130 sequentially outputs feature vectors 138(1)-138(H).

特徴ベクトル１３８（１）～１３８（Ｈ）は、非限定的に、それぞれ、ヒートマップセット１０４（１）～１０４（Ｈ）に関連する空間パターンに関連する任意の量及び／またはタイプの情報を明示する。したがって、特徴ベクトル１３８（１）～１３８（Ｈ）はそれぞれ、非限定的に、異なるマイクロウェルプレートに関連する空間パターンを表す。いくつかの実施形態では、特徴ベクトル１３８（１）～１３８（Ｈ）のそれぞれは、非限定的に、任意の数の空間的特徴及び／または任意の数の他のタイプの機能を、任意の組み合わせで含む。 Feature vectors 138(1)-138(H) may each, without limitation, represent any amount and/or type of information associated with a spatial pattern associated with heatmap set 104(1)-104(H). Thus, feature vectors 138(1)-138(H) may each, without limitation, represent a spatial pattern associated with a different microwell plate. In some embodiments, each of feature vectors 138(1)-138(H) may include, without limitation, any number of spatial features and/or any number of other types of features, in any combination.

特徴エンジン１３０は、ヒートマップセット１０４に関連付けられた特徴ベクトル１３８を生成するために、ヒートマップセット１０４に対して任意の数及び／またはタイプの操作を実行することができる。いくつかの実施形態では、図２に関連して以下でより詳細に説明するように、特徴エンジン１３０は、ヒートマップセット１０４に含まれるヒートマップのそれぞれにウェーブレット変換を適用して、マルチレベルウェーブレット分解を生成する。次いで、特徴エンジン１３０は、マルチレベルウェーブレット分解の２つの最低レベルから特徴を抽出し、抽出された特徴を連結して特徴ベクトル１３８を生成する。 The feature engine 130 may perform any number and/or types of operations on the heatmap set 104 to generate a feature vector 138 associated with the heatmap set 104. In some embodiments, as described in more detail below in connection with FIG. 2, the feature engine 130 applies a wavelet transform to each of the heatmaps included in the heatmap set 104 to generate a multi-level wavelet decomposition. The feature engine 130 then extracts features from the two lowest levels of the multi-level wavelet decomposition and concatenates the extracted features to generate the feature vector 138.

当業者が認識するように、ウェーブレット変換はヒートマップの特定の部分にわたって局所空間情報を抽出し、マルチレベルウェーブレット分解の２つの最低レベルは低周波空間パターンを表す。さらに、本明細書で前述したように、実験に関連する環境の変化は、１つまたは複数のヒートマップにおける低周波の空間パターンとして現れることがよくある。したがって、ウェーブレット変換に基づいて生成される特徴ベクトル１３８は、対応するマイクロウェルプレートに関連する実行アーティファクトのタイプ及び重大度に相関する。 As one skilled in the art will recognize, the wavelet transform extracts local spatial information over a particular portion of the heat map, with the two lowest levels of the multi-level wavelet decomposition representing low frequency spatial patterns. Furthermore, as previously described herein, environmental changes associated with an experiment often manifest as low frequency spatial patterns in one or more heat maps. Thus, the feature vector 138 generated based on the wavelet transform correlates to the type and severity of execution artifacts associated with the corresponding micro-well plate.

図示のように、クラスタリングエンジン１４０は、特徴ベクトル１３８（１）～１３８（Ｈ）に基づいてクラスタセット１４８を生成する。クラスタセット１４８は、非限定的に、任意の数及び／またはタイプのクラスタ（図１には示されていない）を含む。特徴ベクトル１３８（１）～１３８（Ｈ）は、特徴ベクトル１３８（１）～１３３（Ｈ）間の類似性に基づいてクラスタ間に分配される。各クラスタは、非限定的に、他のクラスタに含まれる特徴ベクトル１３８よりも互いに類似している１つまたは複数の特徴ベクトル１３８を明示する。各クラスタは、クラスタに含まれる特徴ベクトル１３８が導出されたマイクロウェルプレートに関連付けられる。クラスタリングエンジン１４０は、クラスタセット１４８を生成するために、任意の技術的に実現可能な方法で、特徴ベクトル１３８（１）～１３８（Ｈ）に基づいた任意の数及び／またはタイプのクラスタリングアルゴリズムを実行することができる。 As shown, the clustering engine 140 generates a cluster set 148 based on the feature vectors 138(1)-138(H). The cluster set 148 includes, without limitation, any number and/or type of clusters (not shown in FIG. 1). The feature vectors 138(1)-138(H) are distributed among the clusters based on the similarity between the feature vectors 138(1)-133(H). Each cluster defines, without limitation, one or more feature vectors 138 that are more similar to each other than the feature vectors 138 included in the other clusters. Each cluster is associated with the microwell plate from which the feature vectors 138 included in the cluster were derived. The clustering engine 140 may perform any number and/or type of clustering algorithms based on the feature vectors 138(1)-138(H) in any technically feasible manner to generate the cluster set 148.

いくつかの実施形態では、クラスタリングエンジン１４０は、特徴ベクトル１３８（１）～１３８（Ｈ）及び経験的に判定されたクラスタの総数に基づいて凝集クラスタリングアルゴリズムを実行する。いくつかの実施形態では、クラスタリングエンジン１４０は、特徴ベクトル１３８（１）～１３８（Ｈ）及び距離の閾値に基づいて凝集クラスタリングアルゴリズムを実行する。いくつかの他の実施形態では、クラスタリングエンジン１４０は、重心ベースのクラスタリングアルゴリズム（例えば、ｋ平均法アルゴリズム）、密度ベースのクラスタリングアルゴリズム、または分布ベースのクラスタリングアルゴリズムを実行する。 In some embodiments, the clustering engine 140 performs an agglomerative clustering algorithm based on the feature vectors 138(1)-138(H) and an empirically determined total number of clusters. In some embodiments, the clustering engine 140 performs an agglomerative clustering algorithm based on the feature vectors 138(1)-138(H) and a distance threshold. In some other embodiments, the clustering engine 140 performs a centroid-based clustering algorithm (e.g., a k-means algorithm), a density-based clustering algorithm, or a distribution-based clustering algorithm.

ラベル付けエンジン１５０は、クラスタセット１４８及びラベル付けされていないトレーニングデータセット１０２に基づいて、ラベルデータセット１５６及びラベル付けされたトレーニングデータセット１５８を生成する。図３に関連してより詳細に説明するように、いくつかの実施形態では、クラスタセット１４８に含まれるクラスタのそれぞれについて、ラベルデータセット１５６は、ラベル付けされたクラスタを非限定的に含む。ラベル付けされた各クラスタには、クラスタ、クラスタラベル、及び任意選択で平均ヒートマップセットが、非限定的に含まれる。クラスタラベルは、クラスタを一意に識別するラベルを明示する。ラベル付けエンジン１５０は、任意の技術的に実行可能な方法でクラスタラベルを判定することができる。例えば、いくつかの実施形態では、ラベル付けエンジン１５０は、最初のクラスタラベルをデフォルトの整数（例えば１）に設定し、その後、後続のクラスタラベルごとに整数を増大させる。 The labeling engine 150 generates a label dataset 156 and a labeled training dataset 158 based on the cluster set 148 and the unlabeled training dataset 102. As described in more detail in connection with FIG. 3, in some embodiments, for each cluster included in the cluster set 148, the label dataset 156 includes, but is not limited to, a labeled cluster. Each labeled cluster includes, but is not limited to, a cluster, a cluster label, and optionally a set of average heatmaps. The cluster label specifies a label that uniquely identifies the cluster. The labeling engine 150 can determine the cluster label in any technically feasible manner. For example, in some embodiments, the labeling engine 150 sets the first cluster label to a default integer (e.g., 1) and then increments the integer for each subsequent cluster label.

図３に関連してより詳細に説明するように、クラスタの平均ヒートマップセットは、クラスタの全体的な表現であり、Ｆの平均ヒートマップを含むが、これらに限定されず、それにおいてＦは、ヒートマップセット１０４（１）～１０４（Ｈ）の各々に含まれるヒートマップの数である。ラベル付けエンジン１５０は、任意の技術的に実現可能な方法で、クラスタの平均ヒートマップセットを計算することができる。例えば、いくつかの実施形態では、ラベル付けエンジン１５０は、クラスタに割り当てられたヒートマップセット１０４に含まれる対応するヒートマップのトリミングされた平均に等しいクラスタの平均ヒートマップのそれぞれを設定する。いくつかの実施形態では、平均ヒートマップは、平均測定値配列に置き換えられる。 As described in more detail in connection with FIG. 3, the average heatmap set of a cluster is a global representation of the cluster, including, but not limited to, F average heatmaps, where F is the number of heatmaps included in each of the heatmap sets 104(1)-104(H). The labeling engine 150 may calculate the average heatmap set of a cluster in any technically feasible manner. For example, in some embodiments, the labeling engine 150 sets each of the average heatmaps of a cluster equal to a trimmed average of the corresponding heatmaps included in the heatmap set 104 assigned to the cluster. In some embodiments, the average heatmaps are replaced with average measurement arrays.

いくつかの実施形態では、ラベル付けエンジン１５０は、ラベルデータセット１５６に基づいて、ラベル付けされたトレーニングデータセット１５８を生成する。ラベル付けされたトレーニングデータセット１５８は、特徴ベクトル１３８（１）～１３８（Ｈ）と、特徴ベクトル１３８（１）～１３８（Ｈ）のそれぞれについてのラベル（図示せず）とを含むが、これらに限定されない。所与の特徴ベクトル１３８のラベルは、特徴ベクトル１３８を含むクラスタのクラスタラベルである。 In some embodiments, the labeling engine 150 generates a labeled training dataset 158 based on the label dataset 156. The labeled training dataset 158 includes, but is not limited to, feature vectors 138(1)-138(H) and a label (not shown) for each of the feature vectors 138(1)-138(H). The label for a given feature vector 138 is the cluster label of the cluster that includes the feature vector 138.

いくつかの実施形態では、破線の矢印で示されるように、ラベル付けエンジン１５０は、表示デバイス１０８（１）を介してラベルＧＵＩ１５２を表示する。ラベル付けエンジン１５０は、任意の技術的に実現可能な方法で、任意の量及び／またはタイプのデータを表示するための任意のタイプの表示を、ラベルＧＵＩ１５２を介して生成することができる。いくつかの実施形態では、ラベル付けエンジン１５０は、ラベルＧＵＩ１５２を介して、ラベルデータセット１５６、ラベル付けされたトレーニングデータセット１５８及び／またはラベル付けされていないトレーニングデータセット１０２の任意の部分（一部または全部を含む）を、任意の組み合わせで任意の時点表示することができる。例えば、いくつかの実施形態では、ラベル付けエンジン１５０は、ラベル付けされたトレーニングデータセット１５８を生成する前に、クラスタセット１４８、任意の数のヒートマップセット１０４（１）～１０４（Ｈ）、及び／または任意の数のクラスタに対する平均ヒートマップセットの視覚的表現を表示する。 In some embodiments, as indicated by the dashed arrow, the labeling engine 150 displays the label GUI 152 via the display device 108(1). The labeling engine 150 may generate any type of display via the label GUI 152 for displaying any amount and/or type of data in any technically feasible manner. In some embodiments, the labeling engine 150 may display any portion (including some or all) of the label dataset 156, the labeled training dataset 158, and/or the unlabeled training dataset 102 in any combination at any time via the label GUI 152. For example, in some embodiments, the labeling engine 150 displays a visual representation of the cluster set 148, any number of heatmap sets 104(1)-104(H), and/or an average heatmap set for any number of clusters prior to generating the labeled training dataset 158.

ラベル付けエンジン１５０は、任意かの技術的に実現可能な方法で、クラスタセット１４８の任意のタイプの視覚的表現を生成することができる。いくつかの実施形態では、ラベル付けエンジン１５０は、特徴ベクトル１３８（１）～１３８（Ｈ）に基づいてｔ分布確率的近傍埋め込み（「ｔ－ＳＮＥ」）アルゴリズムを実行して、変換された出力を生成する。次いで、ラベル付けエンジン１５０は、変換された出力の散布図を表示し、特徴ベクトル１３８（１）～１３８（Ｈ）を表す点は、関連するクラスタラベルに基づいて着色される。 The labeling engine 150 can generate any type of visual representation of the cluster set 148 in any technically feasible manner. In some embodiments, the labeling engine 150 runs a t-distributed stochastic neighborhood embedding ("t-SNE") algorithm based on the feature vectors 138(1)-138(H) to generate a transformed output. The labeling engine 150 then displays a scatter plot of the transformed output, where the points representing the feature vectors 138(1)-138(H) are colored based on the associated cluster label.

いくつかの実施形態では、ラベル付けエンジン１５０は、ラベルＧＵＩ１５２を介して受け取った入力に基づいて、ラベルデータセット１５６を修正する。ラベル付けエンジン１５０がラベルデータセット１５６に対して行うことができる修正のいくつかの例には、同じタイプの実行アーティファクトを表すクラスタをマージすること、クラスタ間でヒートマップセット１０４（及び関連するマイクロウェルプレート）を再分配すること、クラスタレベルを修正することなどが、非限定的に含まれる。特に、クラスタラベルを更新して、空間パターンのタイプ、実行アーティファクトのタイプ、実行アーティファクトの重大度などを識別することができる。クラスタリングエンジン１４０がラベルデータセット１５６を修正した後、ラベル付けエンジン１５０はラベル付けされたトレーニングデータセット１５８を修正及び／または再生成して、ラベルデータセット１５６へ修正を反映させる。 In some embodiments, the labeling engine 150 modifies the label dataset 156 based on input received via the label GUI 152. Some examples of modifications the labeling engine 150 can make to the label dataset 156 include, but are not limited to, merging clusters that represent the same type of execution artifact, redistributing the heatmap set 104 (and associated micro-well plates) among clusters, modifying cluster levels, etc. In particular, the cluster labels can be updated to identify the type of spatial pattern, the type of execution artifact, the severity of the execution artifact, etc. After the clustering engine 140 modifies the label dataset 156, the labeling engine 150 modifies and/or regenerates the labeled training dataset 158 to reflect the modifications to the label dataset 156.

同じまたは他の実施形態では、ラベル付けエンジン１５０は、クラスタリングエンジン１４０に、技術的に実現可能な方法で、任意の数及び／またはタイプの基準に基づいて、クラスタセット１４８を反復的に修正させることができる。例えば、いくつかの実施形態では、ラベル付けエンジン１５０は、クラスタリングエンジン１４０に、ユーザのフィードバックに基づいて、クラスタリングアルゴリズムに関連するパラメータ（例えば、クラスタの総数、距離の閾値など）を修正させることができる。クラスタリングエンジン１４０がクラスタセット１４８を再生成した後、ラベル付けエンジン１５０は、ラベルデータセット１５６及び／またはラベル付けされたトレーニングデータセット１５８を修正及び／または再生成する。 In the same or other embodiments, the labeling engine 150 may cause the clustering engine 140 to iteratively revise the cluster set 148 based on any number and/or type of criteria in a technically feasible manner. For example, in some embodiments, the labeling engine 150 may cause the clustering engine 140 to revise parameters associated with the clustering algorithm (e.g., total number of clusters, distance threshold, etc.) based on user feedback. After the clustering engine 140 regenerates the cluster set 148, the labeling engine 150 revise and/or regenerates the label dataset 156 and/or the labeled training dataset 158.

いくつかの実施形態では、ラベルデータセット１５６は、非限定的に、クラスタ、クラスタラベル、及び平均ヒートマップセットの代わりに、またはこれらに加えて、訓練された分類器１７０を介して識別された、ラベル付けされていないトレーニングデータセット１０２、クラスタセット１４８、ラベル付けされたトレーニングデータセット１５８、及び／または実行アーティファクトに関連する任意の量の情報を含み得る。同じまたは他の実施形態では、ラベル付けエンジン１５０は、ラベルＧＵＩ１５２を介して、ラベル付けされていないトレーニングデータセット１０２、クラスタセット１４８、及び／またはいずれかの技術的に実現可能な方法でラベル付けされたトレーニングデータセット１５８から得られた任意の量の情報を表示する。 In some embodiments, the label dataset 156 may include, without limitation, any amount of information related to the unlabeled training dataset 102, the cluster set 148, the labeled training dataset 158, and/or the execution artifacts identified via the trained classifier 170, instead of or in addition to the clusters, cluster labels, and average heatmap set. In the same or other embodiments, the labeling engine 150 displays, via the label GUI 152, any amount of information obtained from the unlabeled training dataset 102, the cluster set 148, and/or the labeled training dataset 158 in any technically feasible manner.

示されるように、トレーニングエンジン１６０は、ラベル付けされたトレーニングデータセット１５８に基づいて、訓練された分類器１７０を生成する。より正確には、トレーニングエンジン１６０は、異なる特徴ベクトル１３８を異なる予測ラベル１８６にマッピングするように分類子をトレーニングし、それにおいて予測ラベル１８６のそれぞれは、ラベルデータセット１５６に含まれるクラスタラベルの１つに等しくなる。いくつかの実施形態では、トレーニングエンジン１６０はまた、予測ラベル１８６のそれぞれについてラベル信頼度１８８を計算するように分類子をトレーニングし、ラベル信頼度１８８は、予測ラベル１８６が正確である可能性と相関する。 As shown, the training engine 160 generates a trained classifier 170 based on the labeled training dataset 158. More precisely, the training engine 160 trains the classifier to map different feature vectors 138 to different predicted labels 186, where each of the predicted labels 186 is equal to one of the cluster labels included in the label dataset 156. In some embodiments, the training engine 160 also trains the classifier to calculate a label confidence 188 for each of the predicted labels 186, where the label confidence 188 correlates with the likelihood that the predicted label 186 is correct.

当業者が認識するように、クラスタリングアルゴリズムの結果に基づいて分類子をトレーニングするプロセスは「帰納的クラスタリング」と呼ばれ、任意の量及びタイプの関連付けられる動作は、本明細書では「帰納的クラスタリング動作」と呼ばれる。トレーニングエンジン１６０は、ラベル付けされたトレーニングデータセット１５８に基づいて、任意のタイプの訓練された分類器１７０を生成するための任意の数及び／またはタイプの教師付き機械学習アルゴリズムを実行することができる。いくつかの実施形態では、訓練された分類器１７０は、トレーニングされたランダムフォレスト、トレーニングされたニューラルネットワーク、トレーニングされた判定木、トレーニングされたサポートベクターマシン、または他のいずれかの技術的に実行可能な訓練された機械学習モデルである。 As one skilled in the art will recognize, the process of training a classifier based on the results of a clustering algorithm is referred to as "inductive clustering," and any amount and type of associated operations are referred to herein as "inductive clustering operations." The training engine 160 may execute any number and/or type of supervised machine learning algorithms to generate any type of trained classifier 170 based on the labeled training dataset 158. In some embodiments, the trained classifier 170 is a trained random forest, a trained neural network, a trained decision tree, a trained support vector machine, or any other technically feasible trained machine learning model.

いくつかの実施形態では、トレーニングアプリケーション１２０は、任意の数及び／またはタイプの教師なし機械学習操作、教師付き機械学習操作、半教師付き機械学習操作、及び／または強化学習操作をいずれかの組み合わせで実行して、入力された特徴ベクトル１３８を予測ラベル１８６に一緒にマッピングする、任意の数及び／またはタイプのトレーニングされたモデルを生成することができる。例えば、いくつかの実施形態では、ラベル付けされていないトレーニングデータセット１０２は、手動式にラベル付けされたトレーニングデータセットに置き換えられ、クラスタリングエンジン１４０及びラベル付けエンジン１５０はシステム１００から省かれる。 In some embodiments, the training application 120 may perform any number and/or type of unsupervised machine learning operations, supervised machine learning operations, semi-supervised machine learning operations, and/or reinforcement learning operations in any combination to generate any number and/or type of trained models that jointly map input feature vectors 138 to predicted labels 186. For example, in some embodiments, the unlabeled training data set 102 is replaced with a manually labeled training data set, and the clustering engine 140 and labeling engine 150 are omitted from the system 100.

好都合なことに、ラベルデータセット１５６に含まれるクラスタラベルのそれぞれは、任意の数（０を含む）及び／またはタイプの実行上の異常に関連付けられ、予測ラベル１８６の各々は、実行上の異常に対して、関連付けられる特徴ベクトル１３８を分類する。したがって、訓練された分類器１７０は、クラスタセット１４８に関連付けられる任意の数及び／またはタイプの実行上の異常に対し、入力された特徴ベクトル１３８に関連付けられるマイクロウェルプレートを自動的に分類する。 Advantageously, each of the cluster labels included in the label dataset 156 is associated with any number (including zero) and/or type of execution anomalies, and each of the predicted labels 186 classifies the associated feature vector 138 with respect to the execution anomalies. Thus, the trained classifier 170 automatically classifies the micro-well plate associated with the input feature vector 138 with respect to any number and/or type of execution anomalies associated with the cluster set 148.

いくつかの実施形態では、ヒートマップセット１０４に関連付けられた特徴ベクトル１３８は、実行アーティファクトの識別に関連するヒートマップセット１０４に関連する任意の量及びタイプの空間的な情報をそれぞれが表す、任意の数及び／またはタイプの特徴のセットに、置き換えることができる。トレーニングエンジン１６０は、特徴ベクトル１３８の代わりに特徴のセットを非限定的に含むラベル付けされたトレーニングデータセット１５８に基づいて、訓練された分類器１７０を生成する。 In some embodiments, the feature vector 138 associated with the heatmap set 104 can be replaced with any number and/or type of set of features, each representing any amount and type of spatial information associated with the heatmap set 104 relevant to identifying execution artifacts. The training engine 160 generates a trained classifier 170 based on a labeled training dataset 158 that includes, but is not limited to, a set of features in place of the feature vector 138.

いくつかの実施形態では、トレーニングアプリケーション１２０は、訓練された分類器１７０、ラベルデータセット１５６、及び任意選択で、ラベル付けされていないトレーニングデータセット１０２を実験分析アプリケーション１８０に送信する。いくつかの実施形態では、トレーニングアプリケーション１２０は、実験分析アプリケーション１８０の代わりにまたはそれに加えて、訓練された分類器１７０、ラベルデータセット１５６、及びラベル付けされていないトレーニングデータセット１０２のいずれかを、任意の組み合わせで、任意の数及び／またはタイプのソフトウェアアプリケーションに送信する。同じまたは他の実施形態では、トレーニングアプリケーション１２０は、訓練された分類器１７０、ラベルデータセット１５６、及びラベル付けされていないトレーニングデータセット１０２のいずれかを、実験分析アプリケーション１８０及び／または任意の数の他のソフトウェアアプリケーションによってアクセス可能な任意のメモリに任意の組み合わせで格納する。 In some embodiments, the training application 120 transmits the trained classifier 170, the label dataset 156, and optionally the unlabeled training dataset 102 to the experimental analysis application 180. In some embodiments, the training application 120 transmits any of the trained classifier 170, the label dataset 156, and the unlabeled training dataset 102, in any combination, to any number and/or type of software application instead of or in addition to the experimental analysis application 180. In the same or other embodiments, the training application 120 stores any of the trained classifier 170, the label dataset 156, and the unlabeled training dataset 102, in any combination, in any memory accessible by the experimental analysis application 180 and/or any number of other software applications.

いくつかの実施形態では、トレーニングアプリケーション１２０は、ラベルデータセット１５６に基づいて参照ガイドを生成し、任意選択で、実験分析アプリケーション１８０及び／または任意の数及び／またはタイプの他のソフトウェアアプリケーションに参照ガイドを提示する。例えば、いくつかの実施形態では、トレーニングアプリケーション１２０は、ラベルデータセット１５６に含まれる平均ヒートマップセット及びクラスタラベルに基づいて参照ガイドを生成する。 In some embodiments, the training application 120 generates a reference guide based on the label dataset 156 and optionally presents the reference guide to the experiment analysis application 180 and/or any number and/or type of other software applications. For example, in some embodiments, the training application 120 generates a reference guide based on the average heatmap set and cluster labels included in the label dataset 156.

実験分析アプリケーション１８０は、訓練された分類器１７０を使用して、実験データセット１０６に含まれる実行アーティファクトを検出し、検出された実行アーティファクトの根本原因の分析を促進する。いくつかの実施形態では、実験分析アプリケーション１８０はまた、ラベルデータセット１５６、及び任意選択で、ラベル付けされていないトレーニングデータセット１０２を使用して、検出された実行アーティファクトの根本原因の分析を促進する。説明のみを目的として、トレーニングアプリケーション１２０と実験分析アプリケーション１８０の両方に含まれる同様の対象物ごとに、実験分析アプリケーション１８０に含まれるインスタンスを特定する括弧内の英数字（複数可）にプライムマークが付けられる。さらに、トレーニングアプリケーション１２０にも含まれる実験分析アプリケーション１８０に含まれる各対象物の機能は、トレーニングアプリケーション１２０のコンテキストで対象物について説明された機能と同じである。 The experiment analysis application 180 uses the trained classifier 170 to detect execution artifacts included in the experiment data set 106 and facilitates analysis of the root cause of the detected execution artifacts. In some embodiments, the experiment analysis application 180 also uses the label data set 156 and, optionally, the unlabeled training data set 102 to facilitate analysis of the root cause of the detected execution artifacts. For purposes of illustration only, for each similar object included in both the training application 120 and the experiment analysis application 180, a prime is added to the alphanumeric character(s) in parentheses that identify the instance included in the experiment analysis application 180. Furthermore, the functionality of each object included in the experiment analysis application 180 that is also included in the training application 120 is the same as the functionality described for the object in the context of the training application 120.

実験データセット１０６は、マイクロウェルプレートのセットを介して行われる単一の実験の結果を表す。実験データセット１０６は、ヒートマップセット１０４（１’）～１０４（Ｅ’）を含むが、これに限定されず、Ｅは任意の正の整数であり得る。ヒートマップセット１０４（１’）～１０４（Ｅ’）のそれぞれは、実験に関連する異なるマイクロウェルプレートを表す。斜体で示されているように、ヒートマップセット１０４（１’）～１０４（Ｅ’）は、それぞれ１～Ｅで示されるマイクロウェルプレートに関連付けられている。本明細書で前述したように、ヒートマップセット１０４は、非限定的に、Ｆ個のヒートマップ（図１には図示せず）を含み、それにおいてＦは１以上の整数である。 The experimental data set 106 represents the results of a single experiment conducted over a set of microwell plates. The experimental data set 106 includes, but is not limited to, heatmap sets 104(1')-104(E'), where E can be any positive integer. Each of the heatmap sets 104(1')-104(E') represents a different microwell plate associated with the experiment. As indicated in italics, the heatmap sets 104(1')-104(E') are associated with microwell plates designated 1-E, respectively. As previously described herein, the heatmap set 104 includes, but is not limited to, F heatmaps (not shown in FIG. 1), where F is an integer equal to or greater than 1.

説明のみを目的として、実験分析アプリケーション１８０は、単一の実験データセット１０６の文脈で本明細書に記載されている。いくつかの実施形態では、実験分析アプリケーション１８０の任意の数（１つを含む）のインスタンスが、任意の数の実験データセット１０６に含まれる実行アーティファクトを、順次、同時に、またはそれらの任意の組み合わせで検出する。 For purposes of illustration only, the experimental analysis application 180 is described herein in the context of a single experimental data set 106. In some embodiments, any number (including one) of instances of the experimental analysis application 180 detects execution artifacts contained in any number of experimental data sets 106, either sequentially, simultaneously, or any combination thereof.

実験分析アプリケーション１８０は、計算インスタンス１１０（２）のメモリ１１６（２）に存在し、計算インスタンス１１０（２）のプロセッサ１１２（２）で実行する。いくつかの実施形態では、実験分析アプリケーション１８０の任意の数のインスタンスが、任意の数の計算インスタンス１１０のメモリ１１６に存在し、計算インスタンス１１０のプロセッサで実行することができる。示されるように、実験分析アプリケーション１８０は、入力エンジン１８２、特徴エンジン１３０（０’）～１３０（Ｅ’）、訓練された分類器１７０（０’）～１７０（Ｅ’）、及び出力エンジン１９０を含むが、それらに限定されない。 The experiment analysis application 180 resides in memory 116(2) of the computational instance 110(2) and executes on processor 112(2) of the computational instance 110(2). In some embodiments, any number of instances of the experiment analysis application 180 may reside in memory 116 of any number of the computational instances 110 and execute on the processors of the computational instances 110. As shown, the experiment analysis application 180 includes, but is not limited to, an input engine 182, feature engines 130(0')-130(E'), trained classifiers 170(0')-170(E'), and an output engine 190.

入力エンジン１８２は、いずれかの技術的に実現可能な方法で、実験データセット１０６を取得する。いくつかの実施形態では、入力エンジン１８２は、非限定的に、任意の数（ゼロを含む）及び／またはタイプの実験データセット１０６に対する前処理操作を実行する。入力エンジン１８２が実行できる前処理操作のタイプのいくつかの例は、未定義の測定値の補間、極端な測定値のクリッピング、及びヒートマップセット１０４（１’）～１０４（Ｅ’）内及び／または全体の正規化を含み得るが、これらに限定されない。 The input engine 182 acquires the experimental data sets 106 in any technically feasible manner. In some embodiments, the input engine 182 performs pre-processing operations on any number (including zero) and/or type of experimental data sets 106, without limitation. Some examples of the types of pre-processing operations that the input engine 182 can perform may include, but are not limited to, interpolation of undefined measurements, clipping of extreme measurements, and normalization within and/or across heatmap sets 104(1')-104(E').

実験データセット１０６を取得し、任意選択で前処理した後、入力エンジン１８２は、ヒートマップセット１０４（１’）～１０４（Ｅ’）に基づいてヒートマップセット１０４（０’）を生成する。ヒートマップセット１０４（０’）は、実験データセット１０６に関連付けられた、存在しない「平均」のマイクロウェルプレートを表す。入力エンジン１８２は、いずれかの技術的に実行可能な方法でヒートマップセット１０４（０’）を生成することができる。例えば、いくつかの実施形態では、入力エンジン１８２は、ヒートマップセット１０４（０’）に含まれるＦ個のヒートマップのそれぞれを、ヒートマップセット１０４（１’）～１０４（Ｅ’）に含まれる対応するヒートマップの平均に等しく設定する。いくつかの他の実施形態では、入力エンジン１８２は、ヒートマップセット１０４（０’）に含まれるＦ個のヒートマップのそれぞれを、ヒートマップセット１０４（１’）～１０４（Ｅ’）に含まれる対応するヒートマップのトリミングされた平均に等しく設定する。いくつかの実施形態では、入力エンジン１８２はヒートマップセット１０４（０’）を計算せず、本明細書に記載の技術はそれに応じて修正される。 After obtaining and optionally preprocessing the experimental dataset 106, the input engine 182 generates a heatmap set 104(0') based on the heatmap sets 104(1')-104(E'). The heatmap set 104(0') represents a non-existent "average" micro-well plate associated with the experimental dataset 106. The input engine 182 may generate the heatmap set 104(0') in any technically feasible manner. For example, in some embodiments, the input engine 182 sets each of the F heatmaps included in the heatmap set 104(0') equal to the average of the corresponding heatmaps included in the heatmap sets 104(1')-104(E'). In some other embodiments, the input engine 182 sets each of the F heatmaps included in the heatmap set 104(0') equal to a trimmed average of the corresponding heatmaps included in the heatmap sets 104(1')-104(E'). In some embodiments, the input engine 182 does not calculate the heatmap set 104(0'), and the techniques described herein are modified accordingly.

図示のように、実験分析アプリケーション１８０は、ヒートマップセット１０４（０’）～１０４（Ｅ’）を特徴エンジン１３０（０’）～１３０（Ｅ’）にそれぞれ入力する。これに応じて、特徴エンジン１３０（０’）～１３０（Ｅ）は、特徴ベクトル１３８（０’）～１３８（Ｅ’）をそれぞれ出力する。いくつかの実施形態では、実験分析アプリケーション１８０は、特徴エンジン１３０のＥ個未満のインスタンスを含み、実験分析アプリケーション１８０は、ヒートマップセット１０４（０’）～１０４（Ｅ’）を特徴エンジン１３０の任意の数のインスタンスに順次、同時に、またはそれらを任意に組み合わせて入力する。 As shown, the experiment analysis application 180 inputs the heat map sets 104(0')-104(E') to the feature engines 130(0')-130(E'), respectively. In response, the feature engines 130(0')-130(E) output feature vectors 138(0')-138(E'), respectively. In some embodiments, the experiment analysis application 180 includes fewer than E instances of the feature engine 130, and the experiment analysis application 180 inputs the heat map sets 104(0')-104(E') to any number of instances of the feature engine 130 sequentially, simultaneously, or in any combination thereof.

実験分析アプリケーション１８０は、特徴ベクトル１３８（０’）～１３８（Ｅ’）を訓練された分類器１７０（０’）～１７０（Ｅ’）にそれぞれ入力する。これに応答して、訓練された分類器１７０（０’）～１７０（Ｅ’）は、予測ラベル１８６（０）～１８６（Ｅ）をそれぞれ出力する。いくつかの実施形態では、訓練された分類器１７０（０’）～１７０（Ｅ’）はまた、ラベル信頼度１８８（０）～１８８（Ｅ）をそれぞれ出力する。 The experimental analysis application 180 inputs feature vectors 138(0')-138(E') to trained classifiers 170(0')-170(E'), respectively. In response, trained classifiers 170(0')-170(E') output predicted labels 186(0)-186(E), respectively. In some embodiments, trained classifiers 170(0')-170(E') also output label confidences 188(0)-188(E), respectively.

いくつかの実施形態では、実験分析アプリケーション１８０は、訓練された分類器１７０のＥ個未満のインスタンスを含み、実験分析アプリケーション１８０は、ヒートマップセット１０４（０’）～１０４（Ｅ’）を特徴エンジン１３０の任意の数のインスタンスに順次、同時に、またはそれらを任意に組み合わせて入力する。同じまたは他の実施形態では、実験分析アプリケーション１８０は、いずれかの技術的に実行可能な方法で、任意の数の特徴ベクトル１３８に基づいて、任意の数の予測ラベル１８６、及び任意選択で任意の数のラベル信頼度１８８を生成するように、訓練された分類器１７０を構成することができる。 In some embodiments, the experiment analysis application 180 includes fewer than E instances of the trained classifier 170, and the experiment analysis application 180 inputs the heatmap sets 104(0')-104(E') to any number of instances of the feature engine 130 sequentially, simultaneously, or in any combination thereof. In the same or other embodiments, the experiment analysis application 180 can configure the trained classifier 170 to generate any number of predicted labels 186, and optionally any number of label confidences 188, based on any number of feature vectors 138 in any technically feasible manner.

いくつかの実施形態では、トレーニングアプリケーション１２０は、訓練された分類器１７０の代わりに、任意のタイプの訓練された機械学習モデルを生成する。トレーニングアプリケーション１２０は、いずれかの技術的に実行可能な方法で、ラベル付けされたトレーニングデータセット１５８に基づいて、訓練された機械学習モデルを生成することができる。続いて、実験分析アプリケーション１８０は、訓練された機械学習モデルに基づいて、いずれかの技術的に実現可能な方法で、予測ラベル１８６（０）～１８６（Ｅ）を生成する。例えば、いくつかの実施形態では、実験分析アプリケーション１８０は、特徴ベクトル１３８（０’）～１３８（Ｅ’）を、訓練された機械学習モデルの任意の数のインスタンスに入力する。これに応じて、訓練された機械学習モデルのインスタンス（複数可）は、予測ラベル１８６（０）～１８６（Ｅ）を、また任意選択でラベル信頼度１８８（０）－１８８（Ｅ）を出力する。 In some embodiments, the training application 120 generates any type of trained machine learning model instead of the trained classifier 170. The training application 120 can generate the trained machine learning model based on the labeled training dataset 158 in any technically feasible manner. The experiment analysis application 180 then generates the predicted labels 186(0)-186(E) in any technically feasible manner based on the trained machine learning model. For example, in some embodiments, the experiment analysis application 180 inputs the feature vectors 138(0')-138(E') into any number of instances of the trained machine learning model. In response, the instance(s) of the trained machine learning model output the predicted labels 186(0)-186(E) and, optionally, the label confidences 188(0)-188(E).

予測ラベル１８６（０）～１８８（Ｅ）のそれぞれは、クラスタラベルの１つであり、任意の数の他の予測ラベル１８６とは異なり得る。予測ラベル１８６（０）は、実験データセット１０６に関連する実験全体の推定上の分類であり、ラベル信頼度１８８（０）は、予測ラベル１８６（０）が実験全体に適用される可能性に相関する。予測ラベル１８６（１）～１８６（Ｅ）は、それぞれマイクロウェルプレート１～Ｅの推定上の分類である。ラベル信頼度１８８（１）～１８８（Ｅ）は、予測ラベル１８６（１）～１８６（Ｅ）がそれぞれマイクロウェルプレート１～Ｅに適用される可能性に相関する。 Each of the predicted labels 186(0)-188(E) is one of the cluster labels and may be different from any number of the other predicted labels 186. The predicted label 186(0) is a putative classification of the entire experiment associated with the experimental dataset 106, and the label confidence 188(0) correlates to the likelihood that the predicted label 186(0) applies to the entire experiment. The predicted labels 186(1)-186(E) are putative classifications of the microwell plates 1-E, respectively. The label confidences 188(1)-188(E) correlate to the likelihood that the predicted labels 186(1)-186(E) apply to the microwell plates 1-E, respectively.

図３に関連してより詳細に説明されるように、出力エンジン１９０は、予測ラベル１８６（０）～１８６（Ｅ）、ラベル信頼度１８８（０）～１８８（Ｅ）、ラベルデータセット１５６、及び（任意選択で）ラベル付けされていないトレーニングデータセット１０２に基づいて、実験の概要１９６とプレートの概要１９８（１）～１９８（Ｅ）を生成する。実験の概要１９６は、実行アーティファクトまたはその欠如に関連する、実験全体に関する任意の量の情報を提示する。プレートの概要１９８（１）～１９８（Ｅ）は、実行アーティファクトまたはその欠如に関連する、マイクロウェルプレート１～Ｅに関する任意の量の情報をそれぞれ提示する。 As described in more detail in connection with FIG. 3, the output engine 190 generates an experiment summary 196 and plate summaries 198(1)-198(E) based on the predicted labels 186(0)-186(E), the label confidences 188(0)-188(E), the label dataset 156, and (optionally) the unlabeled training dataset 102. The experiment summary 196 presents any amount of information about the experiment as a whole, related to performance artifacts or lack thereof. The plate summaries 198(1)-198(E) present any amount of information about microwell plates 1-E, respectively, related to performance artifacts or lack thereof.

いくつかの実施形態では、実験の概要１９６は、予測ラベル１８６（０）、ラベル信頼度１８８（０）、及び整合するプレートの割合（図１には図示せず）を含むが、これらに限定されない。出力エンジン１９０は、予測ラベル１８６（０）に等しい予測ラベル１８６（１）～１８６（Ｅ）のパーセンテージに等しい整合するプレートの割合を設定する。いくつかの実施形態では、整合するプレートの割合は、実験データセット１０６に関連付けられたマイクロウェルプレートのうち何枚が予測ラベル１８６（０）にも関連付けられているかを示す、整合するプレート数に置き換えられる。 In some embodiments, the experimental summary 196 includes, but is not limited to, a predicted label 186(0), a label confidence 188(0), and a percentage of matching plates (not shown in FIG. 1). The output engine 190 sets the percentage of matching plates equal to the percentage of predicted labels 186(1)-186(E) that are equal to predicted label 186(0). In some embodiments, the percentage of matching plates is replaced with a number of matching plates, which indicates how many of the micro-well plates associated with the experimental dataset 106 are also associated with predicted label 186(0).

同じまたは他の実施形態において、１からＥの間の整数ｘに対するプレートの概要１９８（ｘ）は、予測ラベル１８６（ｘ）、ラベル信頼度１８８（ｘ）、及びマイクロウェルプレートｘに対する異常スコアを含むが、これらに限定されない。マイクロウェルプレートｘの異常スコアは、予測ラベル１８６（ｘ）に関連付けられたクラスタに対して、マイクロウェルプレートｘがどの程度離れているかを示す。出力エンジン１９０は、いずれかの技術的に実行可能な方法で異常スコアを計算することができる。 In the same or other embodiments, the plate summary 198(x) for an integer x between 1 and E includes, but is not limited to, a predicted label 186(x), a label confidence 188(x), and an anomaly score for micro-well plate x. The anomaly score for micro-well plate x indicates how far away micro-well plate x is from the cluster associated with the predicted label 186(x). The output engine 190 may calculate the anomaly score in any technically feasible manner.

例えば、いくつかの実施形態では、出力エンジン１９０は、特徴エンジン１３０を使用して、予測ラベル１８６（ｘ）に関連付けられたクラスタの平均ヒートマップセットに関連付けられた特徴ベクトル１３８を計算する。次に、出力エンジン１９０は、特徴ベクトル１３８（ｘ’）と、平均ヒートマップセットに関連付けられた特徴ベクトル１３８との間の非類似度を計算する。いくつかの実施形態では、異常スコアは、予測ラベル１８６（ｘ）に関連付けられたクラスタに関してマイクロウェルプレートｘがどの程度類似しているかを示す、マイクロウェルプレートｘの類似性スコアに置き換えられる。 For example, in some embodiments, the output engine 190 uses the feature engine 130 to calculate a feature vector 138 associated with the average heatmap set of the cluster associated with the predicted label 186(x). The output engine 190 then calculates a dissimilarity between the feature vector 138(x') and the feature vector 138 associated with the average heatmap set. In some embodiments, the anomaly score is replaced with a similarity score for micro-well plate x, which indicates how similar micro-well plate x is with respect to the cluster associated with the predicted label 186(x).

一部の実施形態では、破線の矢印で示されるように、出力エンジン１９０は、表示デバイス１０８（２）を介して分析ＧＵＩ１９２を表示する。出力エンジン１９０は、いずれかの技術的に実現可能な方法で、任意の量及び／またはタイプのデータを表示するための任意のタイプの表示を、解析ＧＵＩ１９２を介して生成することができる。いくつかの実施形態では、出力エンジン１９０は、解析ＧＵＩ１９２を介して、実験の概要１９６、プレートの概要１９８（１）～１９８（Ｅ）、ラベルデータセット１５６、及び／またはラベル付けされていないトレーニングデータセット１０２を任意の組み合わせで任意の時点で使用できる。例えば、出力エンジン１９０は、解析ＧＵＩ１９２を介して、予測ラベル１８６（１）～１８６（Ｅ’）がヒートマップセット１０４（１’）～１０４（Ｅ’）にそれぞれ割り当てられることを示すことができる。いくつかの実施形態では、出力エンジン１９０は、クラスタセット１４８、任意の数のヒートマップセット１０４（１）～１０４（Ｈ）、及び／またはいずれかの技術的に実現可能な方法で任意の数のクラスタのための平均ヒートマップセットの視覚的表現を表示する。 In some embodiments, the output engine 190 displays an analysis GUI 192 via the display device 108(2), as indicated by the dashed arrow. The output engine 190 can generate any type of display via the analysis GUI 192 for displaying any amount and/or type of data in any technically feasible manner. In some embodiments, the output engine 190 can use, via the analysis GUI 192, the experiment summary 196, the plate summary 198(1)-198(E), the label dataset 156, and/or the unlabeled training dataset 102 in any combination at any time. For example, the output engine 190 can indicate, via the analysis GUI 192, that predicted labels 186(1)-186(E') are assigned to the heatmap sets 104(1')-104(E'), respectively. In some embodiments, the output engine 190 displays a visual representation of the cluster set 148, any number of heatmap sets 104(1)-104(H), and/or the average heatmap set for any number of clusters in any technically feasible manner.

いくつかの実施形態では、出力エンジン１９０は、トレーニングアプリケーション１２０に、解析ＧＵＩ１９２を介して受け取った入力に基づいて、クラスタセット１４８、ラベルデータセット１５６、ラベル付けされたトレーニングデータセット１５８、及び／または訓練された分類器１７０を、反復的に修正及び／または再生成させる。出力エンジン１９０が、分析ＧＵＩ１９２を介して受け取った入力に基づいてトレーニングアプリケーション１２０に実施させることができる修正のいくつかの例は、非限定的に、クラスタをマージすること、クラスタ間でヒートマップセット（及び関連するマイクロウェルプレート）を再分配すること、クラスタラベルを修正すること、などを含む。 In some embodiments, the output engine 190 causes the training application 120 to iteratively modify and/or regenerate the cluster set 148, the label dataset 156, the labeled training dataset 158, and/or the trained classifier 170 based on input received via the analysis GUI 192. Some examples of modifications that the output engine 190 may cause the training application 120 to perform based on input received via the analysis GUI 192 include, without limitation, merging clusters, redistributing heatmap sets (and associated microwell plates) among clusters, modifying cluster labels, etc.

有利なことに、訓練された分類器１７０は、ヒートマップセット１０４（１’）～１０４（Ｅ’）を自動的かつ客観的に分類するので、訓練された分類器１７０を使用して、実験データセット１０６に関連する実験における実行アーティファクトを、効率的かつ正確に識別することができる。さらに、異常スコアは、各マイクロウェルプレートが重大な実行アーティファクトがないクラスタに属しているか、既知の実行アーティファクトがあるマイクロウェルプレートのクラスタに属しているか、新しいタイプの実行アーティファクトまたはその他の異常に関連付けられているかについての洞察を提示する。したがって、異常スコアは根本原因の分析を促進する。 Advantageously, because the trained classifier 170 automatically and objectively classifies the heatmap sets 104(1')-104(E'), the trained classifier 170 can be used to efficiently and accurately identify execution artifacts in the experiments associated with the experimental dataset 106. Furthermore, the anomaly scores provide insight into whether each micro-well plate belongs to a cluster without significant execution artifacts, belongs to a cluster of micro-well plates with known execution artifacts, or is associated with a new type of execution artifact or other anomaly. Thus, the anomaly scores facilitate root cause analysis.

さらに、実験の概要１９６及びプレートの概要１９８（１）～１９８（Ｅ）は、実行の異常に関する客観的な情報を提示するので、実験の概要１９６及びプレートの概要１９８（１）～１９８（Ｅ）は、実行の異常に関連する傾向を経時的に、また実験全体で効率的に検出するべく使用することができる。検出された傾向に基づいて、ユーザは、将来の実験データセット１０６に含まれる実行上の異常の数を減らすために、実験のプロセス及び／または装置に修正を加えることができる。 Furthermore, because the experiment summary 196 and plate summaries 198(1)-198(E) present objective information regarding execution anomalies, the experiment summary 196 and plate summaries 198(1)-198(E) can be used to efficiently detect trends related to execution anomalies over time and across experiments. Based on the detected trends, a user can make modifications to the experimental process and/or equipment to reduce the number of execution anomalies included in future experimental data sets 106.

本明細書で説明する技術は、限定的ではなく例示的であり、本発明のより広い精神及び範囲から逸脱することなく変更できることに留意されたい。記載された実施形態の範囲及び精神から逸脱することのない、トレーニングアプリケーション１２０、特徴エンジン１３０、クラスタリングエンジン１４０、ラベル付けエンジン１５０、訓練された分類器１７０、実験分析アプリケーション１８０、入力エンジン１８２、及び出力エンジン１９０によって得られる機能の多くの修正及び変形は、当業者にとって明白である。 It should be noted that the techniques described herein are illustrative rather than limiting and may be modified without departing from the broader spirit and scope of the present invention. Many modifications and variations of the functionality provided by the training application 120, feature engine 130, clustering engine 140, labeling engine 150, trained classifier 170, experimental analysis application 180, input engine 182, and output engine 190 will be apparent to those skilled in the art without departing from the scope and spirit of the described embodiments.

本明細書に示されるシステム１００は例示であり、変形及び修正が可能であることが理解されよう。例えば、いくつかの実施形態では、本明細書に記載のラベル付けエンジン１５０によって得られる機能は、クラスタリングエンジン１４０に統合される。同じまたは他の実施形態において、トレーニングアプリケーション１２０によって得られる機能及び実験分析アプリケーション１８０によって得られる機能は、単一のアプリケーションに統合される。さらに、図１の様々な構成要素の間の接続トポロジーは、必要に応じて修正することができる。 It will be understood that the system 100 depicted herein is illustrative and that variations and modifications are possible. For example, in some embodiments, the functionality provided by the labeling engine 150 described herein is integrated into the clustering engine 140. In the same or other embodiments, the functionality provided by the training application 120 and the functionality provided by the experiment analysis application 180 are integrated into a single application. Additionally, the connection topology between the various components of FIG. 1 can be modified as desired.

図２は、様々な実施形態による、図１の特徴エンジン１３０のより詳細な図である。示されるように、特徴エンジン１３０は、ヒートマップセット１０４に基づいて特徴ベクトル１３８を生成する。説明のみを目的として、ヒートマップセット１０４は、３０×４６の使用されるウェルを有するマイクロウェルプレートの７つの異なるタイプの測定に関連する測定値を反映している。その結果、ヒートマップセット１０４は、非限定的に、合計９６６０個の測定値（図示せず）を含む。 2 is a more detailed diagram of the feature engine 130 of FIG. 1, in accordance with various embodiments. As shown, the feature engine 130 generates a feature vector 138 based on the heatmap set 104. For illustrative purposes only, the heatmap set 104 reflects measurements associated with seven different types of measurements of a microwell plate having 30×46 used wells. As a result, the heatmap set 104 includes, without limitation, a total of 9660 measurements (not shown).

示されるように、ヒートマップセット１０４は、非限定的に、ヒートマップ２１０（０）～２１０（６）を含む。説明のみを目的とし、斜体で示されているように、ヒートマップ２１０（０）は、１３８０個の使用されるウェルの細胞数を特定する１３８０個の測定値を非限定的に含む２Ｄ配列である。ヒートマップ２１０（１）～２１０（６）のそれぞれは、異なる画像化チャネルの１３８０の使用されるウェルの強度を特定する１３８０の測定値を、非限定的に含む２Ｄ配列である。他の実施形態では、ヒートマップセット１０４は、非限定的に、任意の数のヒートマップ２１０を含むことができ、ヒートマップ２１０のそれぞれは、任意のタイプの測定に対応することができる。 As shown, the heatmap set 104 includes, without limitation, heatmaps 210(0)-210(6). For illustrative purposes only, as shown in italics, heatmap 210(0) is a 2D array including, without limitation, 1380 measurements specifying cell counts for 1380 used wells. Each of heatmaps 210(1)-210(6) is a 2D array including, without limitation, 1380 measurements specifying intensities for 1380 used wells in different imaging channels. In other embodiments, the heatmap set 104 can include, without limitation, any number of heatmaps 210, and each of the heatmaps 210 can correspond to any type of measurement.

特徴エンジン１３０は、空間情報エクストラクタ２２０（０）～２２０（６）及び集約エンジン２８０を非限定的に含む。空間情報エクストラクタ２２０（０）～２２０（６）は、それぞれヒートマップ２１０（０）～２１０（６）に基づいて、空間的特徴セット２７０（０）～２７０（６）をそれぞれ生成する。いくつかの実施形態では、特徴エンジン１３０は、空間情報エクストラクタ２２０の任意の数のインスタンスを含むことができ、空間情報エクストラクタ２２０のインスタンスは、ヒートマップ２１０（０）～２１０（６）にそれぞれ基づいて、同時に、順次、またはそれらの任意の組み合わせで、空間的特徴セット２７０（０）～２７０（６）を生成することができる。同じまたは他の実施形態では、空間的特徴セット２７０（０）～２７０（Ｈ）のそれぞれは、非限定的に、任意の数の空間的特徴及び／または任意の数の他のタイプの特徴を、任意の組み合わせで含む。空間的特徴セット２７０（０）～２７０（６）のそれぞれはまた、本明細書では「空間的特徴のセット」とも呼ばれる。 The feature engine 130 includes, but is not limited to, spatial information extractors 220(0)-220(6) and an aggregation engine 280. The spatial information extractors 220(0)-220(6) generate spatial feature sets 270(0)-270(6), respectively, based on the heat maps 210(0)-210(6). In some embodiments, the feature engine 130 can include any number of instances of the spatial information extractor 220, which can generate spatial feature sets 270(0)-270(6) simultaneously, sequentially, or in any combination thereof, based on the heat maps 210(0)-210(6), respectively. In the same or other embodiments, each of the spatial feature sets 270(0)-270(H) can include, but is not limited to, any number of spatial features and/or any number of other types of features, in any combination. Each of spatial feature sets 270(0)-270(6) is also referred to herein as a "spatial feature set."

示されるように、空間情報エクストラクタ２２０（０）は、非限定的に、プリプロセッサ２３０（０）、ウェーブレット変換２４０（０）、マルチレベルウェーブレット分解２５０（０）、及び特徴エクストラクタ２６０（０）を含む。一般に、空間情報エクストラクタ２２０（ｙ）は、０から６までの整数ｙに対して、非限定的に、プリプロセッサ２３０（ｙ）、ウェーブレット変換２４０（ｙ）、マルチレベルウェーブレット分解２５０（ｙ）、及び特徴エクストラクタ２６０（ｙ）を含む。プリプロセッサ２３０は、プリプロセッサ２３０が任意選択であることを示すために破線のボックスを使用して示されている。 As shown, spatial information extractor 220(0) includes, but is not limited to, a pre-processor 230(0), a wavelet transform 240(0), a multi-level wavelet decomposition 250(0), and a feature extractor 260(0). In general, spatial information extractor 220(y) includes, but is not limited to, a pre-processor 230(y), a wavelet transform 240(y), a multi-level wavelet decomposition 250(y), and a feature extractor 260(y), for integer y between 0 and 6. Pre-processor 230 is shown using a dashed box to indicate that pre-processor 230 is optional.

プリプロセッサ２３０（ｙ）は、ヒートマップ２１０（ｙ）の任意の数及び／またはタイプの前処理操作を実行する。プリプロセッサ２３０（ｙ）が実行できる前処理操作のタイプのいくつかの例には、未定義の測定値の補間、極端な測定値のクリッピング、及びヒートマップ２１０（ｙ）内での正規化を非限定的に含む。いくつかの実施形態では、プリプロセッサ２３０（ｙ）は、空間情報エクストラクタ２２０（ｙ）から省かれる。同じまたは他の実施形態では、特徴エンジン１３０、トレーニングアプリケーション１２０、及び／または入力エンジン１８２は、ヒートマップセット１０４での任意の数及び／またはタイプの処理操作を実行できる。 The pre-processor 230(y) performs any number and/or type of pre-processing operations on the heatmap 210(y). Some examples of types of pre-processing operations the pre-processor 230(y) can perform include, without limitation, interpolation of undefined measurements, clipping of extreme measurements, and normalization within the heatmap 210(y). In some embodiments, the pre-processor 230(y) is omitted from the spatial information extractor 220(y). In the same or other embodiments, the feature engine 130, the training application 120, and/or the input engine 182 can perform any number and/or type of processing operations on the heatmap set 104.

プリプロセッサ２３０（ｙ）がヒートマップ２１０（ｙ）の前処理をした後、空間情報エクストラクタ２２０（ｙ）はウェーブレット変換２４０（ｙ）をヒートマップ２１０（ｙ）に適用して、マルチレベルウェーブレット分解２５０（ｙ）を生成する。空間情報エクストラクタ２２０（ｙ）は、任意の技術的に実現可能な方法で、任意のタイプのウェーブレット変換２４０（ｙ）をヒートマップ２１０（ｙ）に適用することができる。例えば、いくつかの実施形態では、空間情報エクストラクタ２２０（ｙ）は、ウェーブレット変換モジュールに含まれる２Ｄ離散ウェーブレット変換関数への関数の呼び出しを実行する。関数の呼び出しを介して、空間情報エクストラクタ２２０（ｙ）は、カスケードフィルタバンクアルゴリズムを使用してハーウェーブレット変換を計算する前に、リフレクトパディングを使用してヒートマップ２１０（ｙ）を外挿するように２Ｄ離散ウェーブレット変換関数を構成する。関数の呼び出しは、近似、水平の詳細、垂直の詳細、及び対角の詳細な係数を含むがこれらに限定されないマルチレベルウェーブレット分解２５０（ｙ）を返す。 After the pre-processor 230(y) pre-processes the heat map 210(y), the spatial information extractor 220(y) applies a wavelet transform 240(y) to the heat map 210(y) to generate a multi-level wavelet decomposition 250(y). The spatial information extractor 220(y) can apply any type of wavelet transform 240(y) to the heat map 210(y) in any technically feasible manner. For example, in some embodiments, the spatial information extractor 220(y) performs a function call to a 2D discrete wavelet transform function included in a wavelet transform module. Through the function call, the spatial information extractor 220(y) configures the 2D discrete wavelet transform function to extrapolate the heat map 210(y) using reflect padding before computing the herwavelet transform using a cascaded filter bank algorithm. The function call returns a multi-level wavelet decomposition 250(y) that includes, but is not limited to, the approximation, horizontal detail, vertical detail, and diagonal detail coefficients.

特徴エクストラクタ２６０（ｙ）は、マルチレベルウェーブレット分解２５０（ｙ）に基づいて空間的特徴セット２７０（ｙ）を生成する。特徴エクストラクタ２６０（ｙ）は、いずれかの技術的に実行可能な方法で、空間的特徴セット２７０（ｙ）を生成することができる。いくつかの実施形態では、特徴エクストラクタ２６０（ｙ）は、マルチレベルウェーブレット分解２５０から任意の数及び／またはタイプの特徴を抽出し、次いで抽出された特徴に対して任意の数及び／またはタイプの後処理操作を実行して、空間的特徴セット２７０（ｙ）を生成する。 The feature extractor 260(y) generates the spatial feature set 270(y) based on the multi-level wavelet decomposition 250(y). The feature extractor 260(y) may generate the spatial feature set 270(y) in any technically feasible manner. In some embodiments, the feature extractor 260(y) extracts any number and/or type of features from the multi-level wavelet decomposition 250 and then performs any number and/or type of post-processing operations on the extracted features to generate the spatial feature set 270(y).

例えば、いくつかの実施形態では、特徴エクストラクタ２６０（ｙ）は、マルチレベルウェーブレット分解２５０（ｙ）の２つの最低レベルから特徴を抽出し、空間的特徴セット２７０（ｙ）を生成する。いくつかの実施形態では、特徴エクストラクタ２６０（ｙ）は、マルチレベルウェーブレット分解２５０（ｙ）の２つの最低レベルから特徴を抽出し、次いで抽出された特徴のそれぞれを正規化して、空間的特徴セット２７０（ｙ）を生成する。他の実施形態では、特徴エクストラクタ２６０（ｙ）は、マルチレベルウェーブレット分解２５０（ｙ）の３つの最低レベルから特徴のセットを抽出し、主成分分析を使用して特徴のセットを圧縮し、得られた圧縮された特徴のセットに等しい空間的特徴セット２７０（ｙ）を設定する。特徴のセットは、本明細書では「特徴セット」とも呼ばれる。 For example, in some embodiments, feature extractor 260(y) extracts features from the two lowest levels of multilevel wavelet decomposition 250(y) to generate spatial feature set 270(y). In some embodiments, feature extractor 260(y) extracts features from the two lowest levels of multilevel wavelet decomposition 250(y) and then normalizes each of the extracted features to generate spatial feature set 270(y). In other embodiments, feature extractor 260(y) extracts a set of features from the three lowest levels of multilevel wavelet decomposition 250(y), compresses the set of features using principal component analysis, and sets spatial feature set 270(y) equal to the resulting compressed set of features. The set of features is also referred to herein as a "feature set."

図示のように、集約エンジン２８０は、空間的特徴セット２７０（０）～２７０（６）に基づいて特徴ベクトル１３８を生成する。集約エンジン２８０は、いずれかの技術的に実現可能な方法で、特徴ベクトル１３８を生成することができる。例えば、いくつかの実施形態では、集約エンジン２８０は、空間的特徴セット２７０（０）～２７０（６）を連結して、特徴ベクトル１３８を生成する。特徴ベクトル１３８は、本明細書では「特徴のセット」とも呼ばれる。 As shown, aggregation engine 280 generates feature vector 138 based on spatial feature sets 270(0)-270(6). Aggregation engine 280 may generate feature vector 138 in any technically feasible manner. For example, in some embodiments, aggregation engine 280 concatenates spatial feature sets 270(0)-270(6) to generate feature vector 138. Feature vector 138 is also referred to herein as a "set of features."

実行アーティファクトの根本原因分析の促進
図３は、様々な実施形態による、図１の出力エンジン１９０のより詳細な図である。図示のように、出力エンジン１９０は、非限定的に、実験の概要１９６及びプレートの概要１９８（１）～１９８（Ｅ）を含み、Ｅは任意の正の整数であり得る。出力エンジン１９０は、予測ラベル１８６（０）～１８６（Ｅ）、ラベル信頼度１８８（０）～１８８（Ｅ）、ラベルデータセット１５６、及び（任意選択で）ラベル付けされていないトレーニングデータセット１０２に基づいて、実験の概要１９６とプレートの概要１９８（１）～１９８（Ｅ）を生成する。 Facilitating Root Cause Analysis of Execution Artifacts Figure 3 is a more detailed diagram of the output engine 190 of Figure 1, in accordance with various embodiments. As shown, the output engine 190 includes, without limitation, an experiment summary 196 and plate summaries 198(1)-198(E), where E can be any positive integer. The output engine 190 generates the experiment summary 196 and plate summaries 198(1)-198(E) based on the predicted labels 186(0)-186(E), the label confidences 188(0)-188(E), the label dataset 156, and (optionally) the unlabeled training dataset 102.

ラベルデータセット１５６は、非限定的に、ラベル付けされたクラスタ３４０（１）～３４０（Ｃ）を含み、Ｃは任意の正の整数であり得る。示されるように、ラベル付きクラスタ３４０（Ｃ）は、非限定的に、クラスタ３３０（Ｃ）、クラスタラベル３４２（Ｃ）、及び平均ヒートマップセット３４４（Ｃ）を含む。より一般的には、ｚが１からＣまでの整数であるラベル付きクラスタ３４０（ｚ）は、非限定的に、クラスタ３３０（ｚ）、クラスタラベル３４２（ｚ）、及び平均ヒートマップセット３４４（ｚ）を含む。クラスタ３３０（ｚ）は、いずれかの技術的に実現可能な方法でクラスタ３３０（ｚ）に割り当てられる特徴ベクトル１３８を明示する。斜体で示すように、いくつかの実施形態では、クラスタ３３０（ｚ）は、クラスタ３３０（ｚ）に割り当てられた特徴ベクトル１３８のリストを明示する。説明のみを目的として、クラスタ３３０（１）～３３０（Ｃ）は、本明細書では個別に「クラスタ３３０」、また集合的に「クラスタ３３０」とも呼ばれる。また、クラスタラベル３４２（１）～３４２（Ｃ）は、本明細書では個別に「クラスタラベル３４２」、また集合的に「クラスタラベル３４２」とも呼ばれる。 The label dataset 156 includes, but is not limited to, labeled clusters 340(1)-340(C), where C can be any positive integer. As shown, the labeled cluster 340(C) includes, but is not limited to, a cluster 330(C), a cluster label 342(C), and an average heatmap set 344(C). More generally, the labeled cluster 340(z), where z is an integer from 1 to C, includes, but is not limited to, a cluster 330(z), a cluster label 342(z), and an average heatmap set 344(z). The cluster 330(z) specifies the feature vectors 138 assigned to the cluster 330(z) in any technically feasible manner. As shown in italics, in some embodiments, the cluster 330(z) specifies the list of feature vectors 138 assigned to the cluster 330(z). For purposes of explanation only, clusters 330(1)-330(C) are also referred to herein individually as "cluster 330" and collectively as "cluster 330." Additionally, cluster labels 342(1)-342(C) are also referred to herein individually as "cluster label 342" and collectively as "cluster label 342."

図１に関連して本明細書で前述したように、予測ラベル１８６（０）及びラベル信頼度１８８（０）は、マイクロウェルプレート１～Ｅを介して行われる実験に関連する、存在しない平均マイクロウェルプレートに関連する。予測ラベル１８６（１）～１８６（Ｅ）及びラベル信頼度１８８（１）～１８８（Ｅ）は、それぞれマイクロウェルプレート１～Ｅに関連付けられている。予測ラベル１８６（０）～１８６（Ｅ）のそれぞれは、クラスタラベル３４２（１）～３４２（Ｃ）の１つと等しく、任意の数の他の予測ラベル１８６とは異なり得る。 1, predicted label 186(0) and label confidence 188(0) are associated with a non-existent average micro-well plate associated with an experiment conducted via micro-well plates 1-E. Predicted labels 186(1)-186(E) and label confidence 188(1)-188(E) are associated with micro-well plates 1-E, respectively. Each of predicted labels 186(0)-186(E) is equal to one of cluster labels 342(1)-342(C) and may differ from any number of other predicted labels 186.

図示されているように、いくつかの実施形態では、実験の概要１９６は、予測ラベル１８６（０）、ラベル信頼度１８８（０）、及び整合するプレートの割合３１０を含むが、これらに限定されない。出力エンジン１９０は、いずれかの技術的に実行可能な方法で整合するプレートの割合３１０を計算することができる。例えば、いくつかの実施形態では、出力エンジン１９０は、予測ラベル１８６（０）に等しい予測ラベル１８６（１）～１８６（Ｅ）のパーセンテージに等しい整合するプレートの割合を設定する。 As shown, in some embodiments, the experiment summary 196 includes, but is not limited to, a predicted label 186(0), a label confidence 188(0), and a percentage of matching plates 310. The output engine 190 may calculate the percentage of matching plates 310 in any technically feasible manner. For example, in some embodiments, the output engine 190 sets the percentage of matching plates equal to the percentage of predicted labels 186(1)-186(E) that are equal to the predicted label 186(0).

１からＥの整数ｘに対するプレートの概要１９８（ｘ）は、予測ラベル１８６（ｘ）、ラベル信頼度１８８（ｘ）、及び異常スコア３２０（ｘ）を含むが、これらに限定されない。異常スコア３２０（ｘ）は、予測ラベル１８６（ｘ）に関連付けられたクラスタ３３０に対して、マイクロウェルプレートｘがどの程度離れているかを示す。出力エンジン１９０は、いずれかの技術的に実行可能な方法で異常スコア３２０（ｘ）を計算することができる。 The plate summary 198(x) for an integer x between 1 and E includes, but is not limited to, a predicted label 186(x), a label confidence 188(x), and an anomaly score 320(x). The anomaly score 320(x) indicates how far away the micro-well plate x is from the cluster 330 associated with the predicted label 186(x). The output engine 190 may calculate the anomaly score 320(x) in any technically feasible manner.

説明のみを目的として、いくつかの実施形態では、予測ラベル１８６（１）は、クラスタラベル３４２（Ｃ）「行の不履行」に等しく、出力エンジン１９０は、異常スコア３２０（ｘ）を、特徴ベクトル１３８（ｘ）と、平均ヒートマップセット３４４（Ｃ）に関連する特徴ベクトル１３８との間の非類似度に等しく設定する。出力エンジン１９０は、平均ヒートマップセット３４４（Ｃ）に関連付けられた特徴ベクトル１３８、及び特徴ベクトル１３８（ｘ）と、平均ヒートマップセット３４４（Ｃ）に関連付けられた特徴ベクトル１３８との間の非類似度を、任意の技術的に実現可能な方法で計算することができる。 For illustrative purposes only, in some embodiments, predicted label 186(1) is equal to cluster label 342(C) "row default" and output engine 190 sets anomaly score 320(x) equal to the dissimilarity between feature vector 138(x) and feature vector 138 associated with average heatmap set 344(C). Output engine 190 may calculate feature vector 138 associated with average heatmap set 344(C) and the dissimilarity between feature vector 138(x) and feature vector 138 associated with average heatmap set 344(C) in any technically feasible manner.

示されるように、いくつかの実施形態では、出力エンジン１９０は、分析ＧＵＩ１９２を生成して表示する。説明のみを目的として、分析ＧＵＩ１９２は、例示的なユーザの入力に基づく特定の時点で示されている。示されるように、分析ＧＵＩ１９２は、非限定的に、マイクロウェルプレートのいずれも選択しないように構成されたプレートスライダ３７０と、クラスタラベル３４２（Ｃ）の行の不履行に関連付けられたラベル付きクラスタ３４０（Ｃ）を選択するように構成されたクラスタスライダ３８０、及びクラスタ表示ペイン３９０を含む。 As shown, in some embodiments, the output engine 190 generates and displays an analysis GUI 192. For purposes of illustration only, the analysis GUI 192 is shown at a particular point in time based on an exemplary user's input. As shown, the analysis GUI 192 includes, without limitation, a plate slider 370 configured to select none of the micro-well plates, a cluster slider 380 configured to select the labeled cluster 340(C) associated with the row default of the cluster label 342(C), and a cluster display pane 390.

クラスタスライダ３８０は、ラベル付きクラスタ３４０（Ｃ）を選択するように構成されているので、出力エンジン１９０は、ラベル付きクラスタ３４０（Ｃ）に関連する情報をクラスタ表示ペイン３９０に集める。示されるように、出力エンジン１９０は、平均ヒートマップセット３４４（Ｃ）を表示し、実験データセット１０６に関連付けられた実験に含まれるマイクロウェルプレートのサブセットが、行の不履行のクラスタラベル３４２（Ｃ）を有するクラスタ３３０（Ｃ）に割り当てられることを明示する。より具体的には、出力エンジン１９０は、マイクロウェルプレート１、３、４５～５２、及び５８を含むマイクロウェルプレートのサブセットがクラスタラベル３４２（Ｃ）及び平均ヒートマップセット３４４（Ｃ）に関連付けられることを特定する。 Because the cluster slider 380 is configured to select the labeled cluster 340(C), the output engine 190 populates the cluster display pane 390 with information related to the labeled cluster 340(C). As shown, the output engine 190 displays the average heat map set 344(C) to clearly indicate that a subset of the micro-well plates included in the experiment associated with the experimental dataset 106 are assigned to the cluster 330(C) having the row default cluster label 342(C). More specifically, the output engine 190 identifies that a subset of the micro-well plates including micro-well plates 1, 3, 45-52, and 58 are associated with the cluster label 342(C) and the average heat map set 344(C).

図３に示すように、平均ヒートマップセット３４４（Ｃ）は、トレーニングアプリケーション１２０によってクラスタ３３０（Ｃ）に割り当てられた１４８のマイクロウェルプレートの画像化チャネル１～６の平均細胞数及び平均強度を視覚的に示すヒートマップ２１０（０）～２１０（６）を含む。図３に示される各ヒートマップ２１０において、下から４行目の測定値は異常に低く、行の不履行に関連する実行アーティファクトである。 As shown in FIG. 3, average heatmap set 344(C) includes heatmaps 210(0)-210(6) that visually show the average cell counts and average intensities for imaging channels 1-6 of 148 microwell plates assigned to cluster 330(C) by training application 120. In each heatmap 210 shown in FIG. 3, the measurements in the fourth row from the bottom are anomalously low and are a performance artifact associated with row defaults.

クラスタ表示ペイン３９０に基づいて、マイクロウェルプレート１、３、４５～５２、及び５８に関連する測定値を実験データセット１０６から除外することができる。さらに、根本原因の分析により、特定のディスペンスノズルが部分的に詰まっていると結論付けることができる。 Based on the cluster display pane 390, measurements associated with micro-well plates 1, 3, 45-52, and 58 can be excluded from the experimental data set 106. Further, a root cause analysis can conclude that a particular dispense nozzle is partially clogged.

図４は、様々な実施形態による、マイクロウェルプレートを含む実験において実行アーティファクトを識別するように分類子をトレーニングするための方法ステップの流れ図である。方法ステップは、図１～３のシステムを参照して説明されているが、当業者は、方法ステップを任意の順序で実施するように構成された任意のシステムが、本発明の範囲にあることを理解するであろう。 Figure 4 is a flow diagram of method steps for training a classifier to identify execution artifacts in experiments involving microwell plates, according to various embodiments. The method steps are described with reference to the systems of Figures 1-3, but one of ordinary skill in the art will understand that any system configured to perform the method steps in any order is within the scope of the present invention.

示されるように、方法４００はステップ４０２を開始し、これにおいて、ラベル付けされていないトレーニングデータセット１０２に含まれるヒートマップ２１０のそれぞれについて、空間情報エクストラクタ２２０が異なる空間的特徴セット２７０を生成する。ステップ４０４で、ｘが１からＨの間の整数である、ラベル付けされていないトレーニングデータセット１０２に含まれる、ヒートマップセット１０４（ｘ）のそれぞれについて、集約エンジン２８０は、ヒートマップセット１０４（ｘ）に関連付けられる空間的特徴セット２７０に基づいて、関連付けられた特徴ベクトル１３８（ｘ）を生成する。 As shown, the method 400 begins at step 402, where for each heatmap 210 included in the unlabeled training data set 102, the spatial information extractor 220 generates a distinct spatial feature set 270. At step 404, for each heatmap set 104(x) included in the unlabeled training data set 102, where x is an integer between 1 and H, the aggregation engine 280 generates an associated feature vector 138(x) based on the spatial feature set 270 associated with the heatmap set 104(x).

ステップ４０６で、クラスタリングエンジン１４０は、特徴ベクトル１３８（１）～１３８（Ｈ）に基づいてクラスタリング操作を実行して、クラスタセット１４８を生成する。ステップ４０８で、ラベル付けエンジン１５０は、クラスタセット１４８に基づいてラベルデータセット１５６を生成する。ステップ４１０で、ラベル付けエンジン１５０は、ラベルデータセット１５６に基づいてラベル付けされたトレーニングデータセット１５８を生成し、任意選択で、ラベルＧＵＩ１５２を表示する。 In step 406, the clustering engine 140 performs a clustering operation based on the feature vectors 138(1)-138(H) to generate a cluster set 148. In step 408, the labeling engine 150 generates a label dataset 156 based on the cluster set 148. In step 410, the labeling engine 150 generates a labeled training dataset 158 based on the label dataset 156, and optionally displays a label GUI 152.

ステップ４１２で、ラベル付けエンジン１５０は、ラベル付けエンジン１５０がラベルＧＵＩ１５２を介して何らかの入力を受け取ったかどうかを判定する。ステップ４１２で、ラベル付けエンジン１５０が、ラベル付けエンジン１５０がラベルＧＵＩ１５２を介していずれかの入力を受け取っていないと判定した場合、方法４００はステップ４１６に直接進む。 At step 412, the labeling engine 150 determines whether the labeling engine 150 has received any input via the label GUI 152. If at step 412 the labeling engine 150 determines that the labeling engine 150 has not received any input via the label GUI 152, the method 400 proceeds directly to step 416.

しかし、ステップ４１２で、ラベル付けエンジン１５０が、ラベル付けエンジン１５０がラベルＧＵＩ１５２を介して入力を受け取っていないと判定した場合、方法４００はステップ４１４に直接進む。ステップ４１４で、ラベル付けエンジン１５０は、任意の数のクラスタセット１４８、ラベルデータセット１５６、及び／またはラベル付きトレーニングデータセット１５８を、入力に基づく任意の組み合わせで更新する。 However, if at step 412 the labeling engine 150 determines that the labeling engine 150 has not received input via the label GUI 152, then the method 400 proceeds directly to step 414. At step 414, the labeling engine 150 updates any number of the cluster sets 148, the label dataset 156, and/or the labeled training dataset 158 in any combination based on the input.

ステップ４１６で、トレーニングエンジン１６０は、訓練された分類器１７０を生成するために、ラベル付けされたトレーニングデータセット１５８に基づいて機械学習操作を実行する。本明細書で言及されるように、機械学習操作は、経験から学習する、及び／またはデータにアクセスし、データを使用して学習することができるソフトウェアによって実行される、及び／またはそれに関連付けられる、任意のタイプの操作であり得る。機械学習操作のいくつかの例は、非限定的に、教師なし機械学習操作、教師あり機械学習操作、半教師あり機械学習操作、及び強化学習操作を含む。 At step 416, the training engine 160 performs machine learning operations based on the labeled training dataset 158 to generate a trained classifier 170. As referred to herein, a machine learning operation may be any type of operation performed by and/or associated with software that can learn from experience and/or access and use data to learn. Some examples of machine learning operations include, without limitation, unsupervised machine learning operations, supervised machine learning operations, semi-supervised machine learning operations, and reinforcement learning operations.

ステップ４１８で、トレーニングアプリケーション１２０は、訓練された分類器１７０及びラベルデータセット１５６を実験分析アプリケーション１８０及び／または任意の数の他のソフトウェアアプリケーションに提示する。いくつかの実施形態では、トレーニングアプリケーション１２０はまた、ラベル付けされていないトレーニングデータセット１０２を実験分析アプリケーション１８０及び／または任意の数の他のソフトウェアアプリケーションに提示する。その後、方法４００は終了する。 At step 418, the training application 120 presents the trained classifier 170 and the label dataset 156 to the experimental analysis application 180 and/or any number of other software applications. In some embodiments, the training application 120 also presents the unlabeled training dataset 102 to the experimental analysis application 180 and/or any number of other software applications. The method 400 then ends.

図５は、様々な実施形態による、訓練された分類器を使用するマイクロウェルプレートを含む実験において実行アーティファクトを検出するための方法ステップの流れ図である。方法ステップは、図１～３のシステムを参照して説明されているが、当業者は、方法ステップを任意の順序で実施するように構成された任意のシステムが、本発明の範囲にあることを理解するであろう。 Figure 5 is a flow diagram of method steps for detecting execution artifacts in experiments involving microwell plates using a trained classifier, according to various embodiments. The method steps are described with reference to the systems of Figures 1-3, but one of ordinary skill in the art will understand that any system configured to perform the method steps in any order is within the scope of the present invention.

示されるように、方法５００はステップ５０２で始まり、入力エンジン１８２は、実験データセット１０６に含まれるヒートマップセット１０４（１’）～１０４（Ｅ’）に基づいて、実験全体を表すヒートマップセット１０４（０’）を生成する。ステップ５０４で、実験データセット１０６に関連付けられたヒートマップ２１０のそれぞれについて、空間情報エクストラクタ２２０は、異なる空間的特徴セット２７０を生成する。ステップ５０６で、ｘが０とＥの間の整数である、ヒートマップセット１０４（ｘ’）のそれぞれについて、集約エンジン２８０は、ヒートマップセット１０４（ｘ’）に関連付けられる空間的特徴セット２７０に基づいて、関連付けられた特徴ベクトル１３８（ｘ’）を生成する。 As shown, the method 500 begins at step 502, where the input engine 182 generates a heatmap set 104(0') representing the entire experiment based on the heatmap sets 104(1')-104(E') included in the experimental dataset 106. At step 504, for each of the heatmaps 210 associated with the experimental dataset 106, the spatial information extractor 220 generates a different spatial feature set 270. At step 506, for each of the heatmap sets 104(x'), where x is an integer between 0 and E, the aggregation engine 280 generates an associated feature vector 138(x') based on the spatial feature set 270 associated with the heatmap set 104(x').

ステップ５０８で、実験分析アプリケーション１８０は、訓練された分類器１７０を使用して、ｘが０からＥの間の整数である、特徴ベクトル１３８（ｘ’）の各々を、予測ラベル１８６（ｘ）に、任意選択で、ラベル信頼度１８８（ｘ）にマッピングする。ステップ５１０で、出力エンジン１９０は、予測ラベル１８６（０）～１８６（Ｅ）、及び任意選択で、ラベル信頼度１８８（０）～１８８（Ｅ）、及び／またはラベルデータセット１５６に基づいて、実験の概要１９６とプレートの概要１９８（１）～１９８（Ｅ）を生成する。ステップ５１２で、実験分析アプリケーション１８０は、任意の数の実験の概要１９６の任意の部分、プレートの概要１９８（１）～１９８（Ｅ）、及び／またはラベルデータセット１５６を、任意の数及び／またはタイプのソフトウェアアプリケーションに、いずれかの組み合わせで提示する。その後、方法５００は終了する。 In step 508, the experiment analysis application 180 uses the trained classifier 170 to map each of the feature vectors 138(x'), where x is an integer between 0 and E, to a predicted label 186(x) and, optionally, to a label confidence 188(x). In step 510, the output engine 190 generates an experiment summary 196 and a plate summary 198(1)-198(E) based on the predicted labels 186(0)-186(E) and, optionally, the label confidences 188(0)-188(E), and/or the label dataset 156. In step 512, the experiment analysis application 180 presents any portion of any number of the experiment summaries 196, the plate summaries 198(1)-198(E), and/or the label dataset 156 to any number and/or type of software applications, in any combination. The method 500 then ends.

要するに、開示された技術を使用して、マイクロウェルプレートを含む実験で実行での異常を正確かつ一貫して検出するように分類子をトレーニングすることができる。いくつかの実施形態では、トレーニングアプリケーションは、ラベル付けされていないトレーニングデータセットに基づいて、訓練された分類器を生成する。ラベル付けされていないトレーニングデータセットには、任意の数のヒートマップセットが含まれるが、これらに限定されず、各ヒートマップセットは、異なるマイクロウェルプレートに関連付けられた測定値を表す。トレーニングアプリケーションには、非限定的に、特徴エンジン、クラスタリングエンジン、ラベリングエンジン、及びトレーニングエンジンが、非限定的に含まれる。トレーニングアプリケーションは、ラベル付けされていないトレーニングデータセットに含まれる各ヒートマップセットの特徴ベクトルを生成するように特徴エンジンを構成する。 In summary, the disclosed techniques can be used to train a classifier to accurately and consistently detect anomalies in runs in experiments involving microwell plates. In some embodiments, the training application generates a trained classifier based on an unlabeled training dataset. The unlabeled training dataset includes, but is not limited to, any number of heatmap sets, each heatmap set representing measurements associated with a different microwell plate. The training application includes, but is not limited to, a feature engine, a clustering engine, a labeling engine, and a training engine. The training application configures the feature engine to generate a feature vector for each heatmap set included in the unlabeled training dataset.

特定のヒートマップセットの特徴ベクトルを生成するために、特徴エンジンは、ヒートマップセットに含まれる各ヒートマップに、ウェーブレット変換を適用して、マルチレベルウェーブレット分解を生成する。次に、特徴エンジンは、各マルチレベルウェーブレット分解の２つの最低レベルから特徴を抽出して、空間的特徴セットを生成する。特徴エンジンは、空間的特徴セットを集約して、ヒートマップセットに関連付けられた特徴ベクトルを生成する。 To generate a feature vector for a particular heatmap set, the feature engine applies a wavelet transform to each heatmap in the heatmap set to generate a multilevel wavelet decomposition. The feature engine then extracts features from the two lowest levels of each multilevel wavelet decomposition to generate a spatial feature set. The feature engine aggregates the spatial feature sets to generate a feature vector associated with the heatmap set.

クラスタリングエンジンは、特徴ベクトルに基づいて凝集クラスタリングアルゴリズムを実行し、任意の数の特徴ベクトルのクラスタを含むがこれに限定されないクラスタセットを生成する。ラベル付けエンジンは、クラスタごとにラベル付けされたクラスタを含むラベルデータセットを生成するが、これに限定されない。ラベル付けされた各クラスタには、関連するクラスタ、クラスタラベル、及びクラスタを表す平均ヒートマップセットが、非限定的に含まれる。ラベル付けエンジンは、任意選択でラベルＧＵＩを表示する。ラベル付けエンジンは、ラベルＧＵＩを介して受け取った入力に基づいてラベルデータセットを更新できる。 The clustering engine performs an agglomerative clustering algorithm based on the feature vectors to generate a cluster set including, but not limited to, clusters of any number of feature vectors. The labeling engine generates a label dataset including, but not limited to, labeled clusters for each cluster. Each labeled cluster includes, but is not limited to, an associated cluster, a cluster label, and a set of average heatmaps representing the cluster. The labeling engine optionally displays a label GUI. The labeling engine can update the label dataset based on input received via the label GUI.

ラベル付けエンジンは、ラベルデータセットに基づいて、ラベル付けされたトレーニングデータセットを生成する。ラベル付けされたトレーニングデータセットには、各特徴ベクトルと関連するクラスタラベルが、非限定的に含まれる。ラベル付けされたトレーニングデータセットに基づいて、トレーニングエンジンは分類子をトレーニングして、特徴ベクトルを予測ラベル（つまり、クラスタラベルの１つ）及び関連するラベル信頼度にマッピングする。トレーニングアプリケーションは、訓練された分類器と、任意選択でラベルデータセットを実験分析アプリケーション及び／または任意の数の他のソフトウェアアプリケーションに送信する。 The labeling engine generates a labeled training dataset based on the label dataset. The labeled training dataset includes, but is not limited to, a cluster label associated with each feature vector. Based on the labeled training dataset, the training engine trains a classifier to map the feature vector to a predicted label (i.e., one of the cluster labels) and an associated label confidence. The training application sends the trained classifier and, optionally, the label dataset to an experimental analysis application and/or any number of other software applications.

いくつかの実施形態では、実験分析アプリケーションは、訓練された分類器及びラベルデータセットを使用して、実験データセットにおける実行の異常を検出及び評価する。実験データセットは、マイクロウェルプレートを使用して実施された実験に関連付けられており、任意の数のヒートマップセットを含むが、これに限定されない。各ヒートマップセットは、異なるマイクロウェルプレートに関連付けられた測定値を表す。実験分析アプリケーションには、入力エンジン、特徴エンジン、訓練された分類器、及び出力エンジンが、非限定的に含まれる。 In some embodiments, the experimental analysis application uses the trained classifier and the label dataset to detect and evaluate performance anomalies in an experimental dataset. The experimental dataset is associated with an experiment performed using a micro-well plate and includes, but is not limited to, any number of heatmap sets. Each heatmap set represents measurements associated with a different micro-well plate. The experimental analysis application includes, but is not limited to, an input engine, a feature engine, a trained classifier, and an output engine.

入力エンジンは、実験データセットに基づいて「平均」ヒートマップセットを生成し、特徴エンジンを使用して平均ヒートマップセットの特徴ベクトルを生成する。また、入力エンジンは特徴エンジンを使用して、実験データセットに含まれる各ヒートマップセットの特徴ベクトルを生成する。実験分析エンジンは、各特徴ベクトルを訓練された分類器に入力して、予測ラベルとラベル信頼度を生成する。その後、出力エンジンは、予測ラベル、ラベルの信頼度、及びラベルデータセットに基づいて、プレートの概要と実験の概要を生成する。 The input engine generates an "average" heatmap set based on the experimental dataset and uses the feature engine to generate a feature vector for the average heatmap set. The input engine also uses the feature engine to generate a feature vector for each heatmap set in the experimental dataset. The experiment analysis engine inputs each feature vector into a trained classifier to generate a predicted label and a label confidence. The output engine then generates a plate summary and an experiment summary based on the predicted labels, label confidence, and label dataset.

各プレートの概要は、異なるマイクロウェルプレートに関連付けられており、関連付けられた特徴ベクトルの予測ラベル、関連付けられたラベルの信頼度、及び異常スコアが、非限定的に含まれる。異常スコアは、予測ラベルに関連付けられたクラスタに対して、マイクロウェルプレートがどの程度離れているかを示す。実験の概要は、実験全体を表し、平均ヒートマップセットに関連付けられた特徴ベクトルの「平均」予測ラベル、関連付けられたラベルの信頼度、及び整合するプレートの割合が、非限定的に含まれる。整合するプレートの割合は、予測ラベルの平均に等しい予測ラベルを有する実験データセットに関連付けられたマイクロウェルプレートのパーセンテージを明示する。実験分析アプリケーションは、任意の数のソフトウェアアプリケーションを提示し、及び／または分析ＧＵＩを介して、任意の数のプレートの概要、実験の概要、及び／またはラベルデータセットの任意の部分を表示する。 Each plate summary is associated with a different microwell plate and includes, but is not limited to, a predicted label for the associated feature vector, a confidence for the associated label, and an anomaly score. The anomaly score indicates how far the microwell plate is from the cluster associated with the predicted label. The experiment summary represents the entire experiment and includes, but is not limited to, an "average" predicted label for the feature vector associated with the average heatmap set, a confidence for the associated label, and a percentage of matching plates. The percentage of matching plates indicates the percentage of microwell plates associated with the experimental dataset that have a predicted label equal to the average of the predicted labels. The experiment analysis application presents any number of software applications and/or displays any number of plate summaries, experiment summaries, and/or any portion of the label dataset via an analysis GUI.

先行技術に対する開示された技術の少なくとも１つの技術的利点は、実験分析アプリケーションが訓練された分類器を使用して、マイクロウェルプレートを含む実験における実行アーティファクトを、より正確かつ一貫して分析及び検出できることである。実験分析アプリケーションは、訓練された分類器を使用して、関連付けられたヒートマップの空間パターンに基づいてマイクロウェルプレートを自動的に分類するため、ヒートマップに反映された実行上の異常が見落とされたり誤解されたりする可能性が、従来のアプローチに比べて減少する。さらに、各クラスタの異常スコアと平均ヒートマップセットを計算することで、根本原因の分析と新しいタイプの実行アーティファクトの識別の両方を促進する。先行技術のアプローチとは異なり、訓練された分類器は、客観的かつ一貫して、経時的な実行上の異常及び異なる実験に対してマイクロウェルプレートを分類する。その結果、実行上の異常の傾向を効率的に検出し、実験プロセス及び／または装置を改善するために使用できる。これらの技術的利点により、従来技術のアプローチに対して１つまたは複数の技術的改善が得られる。 At least one technical advantage of the disclosed technology over the prior art is that an experiment analysis application can use the trained classifier to more accurately and consistently analyze and detect execution artifacts in experiments involving microwell plates. Because the experiment analysis application uses the trained classifier to automatically classify microwell plates based on the spatial patterns of the associated heat maps, the likelihood that execution anomalies reflected in the heat maps will be overlooked or misinterpreted is reduced compared to prior approaches. Furthermore, computing anomaly scores and average heat map sets for each cluster facilitates both root cause analysis and identification of new types of execution artifacts. Unlike prior art approaches, the trained classifier objectively and consistently classifies microwell plates for execution anomalies over time and for different experiments. As a result, trends in execution anomalies can be efficiently detected and used to improve experimental processes and/or equipment. These technical advantages result in one or more technical improvements over prior art approaches.

１．いくつかの実施形態では、マイクロウェルプレートを伴う実験において実行アーティファクトを検出するためのコンピュータ実装方法は、第１のマイクロウェルプレートに関連付けられた１つまたは複数のヒートマップに基づいて空間的特徴の１つまたは複数のセットを計算すること、第１の特徴ベクトルを生成するために、空間的特徴の１つまたは複数のセットを集約すること、及び前記第１の特徴ベクトルを訓練された分類器に入力することであって、それに応じて、前記第１のマイクロウェルプレートが第１の実行アーティファクトに関連付けられていることを示す第１のラベルを生成する、入力することを含む。 1. In some embodiments, a computer-implemented method for detecting execution artifacts in an experiment involving a microwell plate includes calculating one or more sets of spatial features based on one or more heat maps associated with a first microwell plate, aggregating the one or more sets of spatial features to generate a first feature vector, and inputting the first feature vector into a trained classifier, which in response generates a first label indicating that the first microwell plate is associated with a first execution artifact.

２．前記第１の特徴ベクトル及び前記第１のラベルに関連付けられた特徴ベクトルの第１のクラスタに基づいて、異常スコアを計算することをさらに含む、条項１に記載のコンピュータ実装方法。 2. The computer-implemented method of claim 1, further comprising calculating an anomaly score based on the first feature vector and a first cluster of feature vectors associated with the first label.

３．複数のヒートマップに基づいて、実験に関連する第２の特徴ベクトルを計算することであって、前記複数のヒートマップは、前記第１のマイクロウェルプレートに関連する前記１つまたは複数のヒートマップを含む、前記計算すること、及び前記第２の特徴ベクトルを前記訓練された分類器に入力することであって、それに応じて、前記第１のラベルを含む複数のラベルに関して前記実験を分類する第２のラベルを生成する、前記入力することをさらに含む、条項１または２に記載のコンピュータ実装方法。 3. The computer-implemented method of claim 1 or 2, further comprising: calculating a second feature vector associated with an experiment based on a plurality of heat maps, the plurality of heat maps including the one or more heat maps associated with the first microwell plate; and inputting the second feature vector into the trained classifier to generate a second label that classifies the experiment with respect to a plurality of labels including the first label accordingly.

４．前記第１のマイクロウェルプレートを含む複数のマイクロウェルプレートに基づいて、実験に関連する第２の特徴ベクトルを計算すること、前記第２の特徴ベクトルを前記訓練された分類器に入力することであって、それに応じて前記第１のラベルを生成する、前記入力すること、及び前記複数のマイクロウェルプレートに含まれるマイクロウェルプレートのうちいくつが前記第１のラベルに関連付けられているかを示すパーセンテージを計算することをさらに含む、条項１～３のいずれかに記載のコンピュータ実装方法。 4. The computer-implemented method of any one of clauses 1 to 3, further comprising: calculating a second feature vector associated with an experiment based on a plurality of microwell plates including the first microwell plate; inputting the second feature vector into the trained classifier, which generates the first label accordingly; and calculating a percentage indicating how many of the microwell plates in the plurality of microwell plates are associated with the first label.

５．実験に関連付けられたマイクロウェルプレートのサブセットも前記第１のラベルに関連付けられていることを判定することであって、前記マイクロウェルプレートのサブセットは前記第１のマイクロウェルプレートを含む、前記判定すること、及びグラフィカルユーザインターフェース（「ＧＵＩ」）を介して、前記第１のラベルに関連付けられた平均ヒートマップを表示すること、及び前記マイクロウェルプレートのサブセットが前記平均ヒートマップに関連付けられているという前記ＧＵＩを介した表示を生成すること、をさらに含む、条項１～４のいずれかに記載のコンピュータ実装方法。 5. The computer-implemented method of any one of clauses 1-4, further comprising: determining that a subset of microwell plates associated with an experiment is also associated with the first label, the subset of microwell plates including the first microwell plate; and displaying, via a graphical user interface ("GUI"), an average heat map associated with the first label; and generating an indication via the GUI that the subset of microwell plates is associated with the average heat map.

６．ＧＵＩを介した、前記第１のラベルが前記１つまたは複数のヒートマップに関連付けられているという表示を生成することをさらに含む、条項１～５のいずれかに記載のコンピュータ実装方法。 6. The computer-implemented method of any one of clauses 1 to 5, further comprising generating an indication via a GUI that the first label is associated with the one or more heatmaps.

７．前記訓練された分類器が、前記第１のラベルが前記第１のマイクロウェルプレートに適用される可能性を示すラベル信頼度をさらに生成する、条項１～６のいずれかに記載のコンピュータ実装方法。 7. The computer-implemented method of any one of clauses 1 to 6, wherein the trained classifier further generates a label confidence indicating the likelihood that the first label applies to the first microwell plate.

８．前記１つまたは複数の空間的特徴のセットを計算することが、マルチレベルウェーブレット分解を生成するために、前記１つまたは複数のヒートマップに含まれる第１のヒートマップにウェーブレット変換を適用すること、及び前記マルチレベルウェーブレット分解の少なくとも最低のレベルから第１の空間的特徴のセットを抽出することを含む、条項１～７のいずれかに記載のコンピュータ実装方法。 8. The computer-implemented method of any one of clauses 1 to 7, wherein computing the one or more sets of spatial features includes applying a wavelet transform to a first heat map included in the one or more heat maps to generate a multi-level wavelet decomposition, and extracting a first set of spatial features from at least a lowest level of the multi-level wavelet decomposition.

９．前記１つまたは複数のヒートマップに含まれる第１のヒートマップが複数の強度を明示し、前記複数の強度に含まれる各強度が、前記第１のマイクロウェルプレートに含まれている異なるウェルに関連付けられる、条項１～８のいずれかに記載のコンピュータ実装方法。 9. The computer-implemented method of any one of clauses 1 to 8, wherein a first heat map in the one or more heat maps exhibits a plurality of intensities, and each intensity in the plurality of intensities is associated with a different well in the first microwell plate.

１０．前記訓練された分類器が、ランダムフォレスト、ニューラルネットワーク、判定木、または１つ以上の誘導クラスタリング操作によってトレーニングされたサポートベクターマシンを含む、条項１～９のいずれかに記載のコンピュータ実装方法。 10. The computer-implemented method of any one of clauses 1 to 9, wherein the trained classifier comprises a random forest, a neural network, a decision tree, or a support vector machine trained by one or more guided clustering operations.

１１．いくつかの実施形態では、１つまたは複数の非一時的コンピュータ可読媒体は、１つまたは複数のプロセッサによって実行されるとき、前記１つまたは複数のプロセッサに、第１のマイクロウェルプレートに関連する１つまたは複数の測定値に基づいて空間的特徴の１つまたは複数のセットを計算するステップ、前記空間的特徴の１つまたは複数のセットに基づいて第１の特徴ベクトルを生成するステップ、及び前記第１の特徴ベクトルを訓練された機械学習モデルに入力するステップであって、それに応じて、前記第１のマイクロウェルプレートが第１の実行アーティファクトに関連付けられることを示す第１のラベルを生成する、前記入力するステップを行うことによって、マイクロウェルプレートを伴う実験で実行アーティファクトを検出させる命令を含む。 11. In some embodiments, one or more non-transitory computer-readable media include instructions that, when executed by one or more processors, cause the one or more processors to detect an execution artifact in an experiment involving a micro-well plate by performing the steps of: calculating one or more sets of spatial features based on one or more measurements associated with a first micro-well plate; generating a first feature vector based on the one or more sets of spatial features; and inputting the first feature vector into a trained machine learning model, which in response generates a first label indicating that the first micro-well plate is associated with a first execution artifact.

１２．前記第１の特徴ベクトル及び前記第１のラベルに関連付けられた特徴ベクトルの第１のクラスタに基づいて、異常スコアを計算することをさらに含む、条項１１に記載の１つまたは複数の非一時的コンピュータ可読媒体。 12. The one or more non-transitory computer-readable media of clause 11, further comprising: calculating an anomaly score based on the first feature vector and a first cluster of feature vectors associated with the first label.

１３．複数の測定値配列に基づいて、実験に関連する第２の特徴ベクトルを計算することであって、前記複数の測定値配列は、前記第１のマイクロウェルプレートに関連する前記１つまたは複数の測定値配列を含む、前記計算すること、及び前記第２の特徴ベクトルを前記訓練された機械学習モデルに入力することであって、それに応じて、前記第１のラベルを含む複数のラベルに関して前記実験を分類する第２のラベルを生成する、前記入力することをさらに含む、条項１１または１２に記載の１つまたは複数の非一時的コンピュータ可読媒体。 13. The one or more non-transitory computer-readable media of clause 11 or 12, further comprising: calculating a second feature vector associated with an experiment based on a plurality of measurement sequence, the plurality of measurement sequence including the one or more measurement sequence associated with the first microwell plate; and inputting the second feature vector into the trained machine learning model to generate a second label that classifies the experiment with respect to a plurality of labels including the first label accordingly.

１４．前記第１のマイクロウェルプレートを含む複数のマイクロウェルプレートに基づいて、実験に関連する第２の特徴ベクトルを計算すること、前記第２の特徴ベクトルを前記訓練された機械学習モデルに入力することであって、それに応じて前記第１のラベルを生成する、前記入力すること、及び前記複数のマイクロウェルプレートに含まれるマイクロウェルプレートのうちいくつが前記第１のラベルに関連付けられているかを示すパーセンテージを計算することをさらに含む、条項１１～１３のいずれかに記載の１つまたは複数の非一時的コンピュータ可読媒体。 14. One or more non-transitory computer-readable media according to any one of clauses 11 to 13, further comprising: calculating a second feature vector associated with an experiment based on a plurality of microwell plates including the first microwell plate; inputting the second feature vector into the trained machine learning model, which generates the first label accordingly; and calculating a percentage indicating how many of the microwell plates in the plurality of microwell plates are associated with the first label.

１５．実験に関連付けられたマイクロウェルプレートのサブセットも前記第１のラベルに関連付けられていることを判定することであって、前記マイクロウェルプレートのサブセットは前記第１のマイクロウェルプレートを含む、前記判定すること、及びグラフィカルユーザインターフェース（「ＧＵＩ」）を介して、前記第１のラベルに関連付けられた平均測定値配列を表示すること、及び前記マイクロウェルプレートのサブセットが前記平均測定値配列に関連付けられているという前記ＧＵＩを介した表示を生成すること、をさらに含む、条項１１～１４のいずれかに記載の１つまたは複数の非一時的コンピュータ可読媒体。 15. One or more non-transitory computer-readable media according to any of clauses 11-14, further comprising: determining that a subset of microwell plates associated with an experiment is also associated with the first label, the subset of microwell plates including the first microwell plate; and displaying, via a graphical user interface ("GUI"), an average measurement sequence associated with the first label; and generating an indication via the GUI that the subset of microwell plates is associated with the average measurement sequence.

１６．前記マルチレベルウェーブレット分解を生成するために、前記空間的特徴の１つまたは複数のセットを計算することは、前記１つまたは複数の測定値配列に含まれる第１の測定値配列にウェーブレット変換を適用すること、及び前記マルチレベルウェーブレット分解の少なくとも最低のレベルから第１の空間的特徴のセットを抽出することを含む、条項１１～１５のいずれかに記載の１つまたは複数の非一時的コンピュータ可読媒体。 16. One or more non-transitory computer-readable media according to any one of clauses 11 to 15, wherein computing the one or more sets of spatial features to generate the multi-level wavelet decomposition comprises applying a wavelet transform to a first measurement sequence included in the one or more measurement sequence, and extracting a first set of spatial features from at least a lowest level of the multi-level wavelet decomposition.

１７．前記空間的特徴の１つまたは複数のセットを計算することは、前記１つまたは複数の測定値配列に含まれる第１の測定値配列に基づいて第１の空間情報を計算すること、及び前記第１の空間情報から第１の低周波の空間的特徴のセットを抽出することを含む、条項１１～１６のいずれかに記載の１つまたは複数の非一時的コンピュータ可読媒体。 17. One or more non-transitory computer-readable media according to any one of clauses 11 to 16, wherein calculating the one or more sets of spatial features comprises calculating first spatial information based on a first measurement sequence included in the one or more measurement sequence, and extracting a first set of low frequency spatial features from the first spatial information.

１８．前記１つまたは複数の測定値配列に含まれる第１の測定値配列は、複数の細胞数を明示し、前記複数のセルに含まれる各細胞数は、前記第１のマイクロウェルプレートに含まれる別のウェルに関連付けられる、条項１１～１７のいずれかに記載の１つまたは複数の非一時的コンピュータ可読媒体。 18. One or more non-transitory computer-readable media according to any one of clauses 11 to 17, wherein a first measurement sequence in the one or more measurement sequence specifies a number of cells, each number of cells in the number of cells being associated with a different well in the first microwell plate.

１９．第１のマイクロウェルプレートが第１の実験に関連付けられ、第２のマイクロウェルプレートが第２の実験及び第２の特徴ベクトルに関連付けられ、前記第２の特徴ベクトルを前記訓練された機械学習モデルに入力することであって、それに応じて、前記第２のマイクロウェルプレートが第２の実行アーティファクトに関連付けられていることを示す第２のラベルを生成する、前記入力することをさらに含む、条項１１～１８のいずれかに記載の１つまたは複数の非一時的コンピュータ可読媒体。 19. The one or more non-transitory computer-readable media of any of clauses 11-18, further comprising: inputting the second feature vector into the trained machine learning model, the first microwell plate being associated with a first experiment and the second microwell plate being associated with a second experiment and a second feature vector, and in response, generating a second label indicating that the second microwell plate is associated with a second execution artifact.

２０．いくつかの実施形態では、システムが、命令を格納する１つまたは複数のメモリと、１つまたは複数のプロセッサであって、前記命令を実行するときに、第１のマイクロウェルプレートに関連する複数の測定値に基づいて複数の空間的特徴を計算するステップ、前記複数の空間的特徴に基づいて特徴のセットを生成するステップ、及び前記特徴のセット及び訓練された機械学習モデルに基づいて第１のラベルを計算するステップであって、前記第１のラベルは、前記第１のマイクロウェルプレートが第１の実行アーティファクトに関連付けられていることを示す、前記計算するステップを行う前記１つまたは複数のメモリに結合される、前記１つまたは複数のプロセッサと、を含む。 20. In some embodiments, a system includes one or more memories storing instructions; and one or more processors coupled to the one or more memories that, when executing the instructions, perform the steps of: calculating a plurality of spatial features based on a plurality of measurements associated with a first micro-well plate; generating a set of features based on the plurality of spatial features; and calculating a first label based on the set of features and a trained machine learning model, the first label indicating that the first micro-well plate is associated with a first execution artifact.

請求項のいずれかに記載されている請求項の要素のいずれか及び／または本願に記載されたいずれかの要素のいずれかの組み合わせ及びすべての組み合わせは、何らかの形で、実施形態及び保護の意図された範囲内に入る。 Any and all combinations of any of the claim elements recited in any claim and/or any of the elements recited in this application, in any way, are within the intended scope of the embodiments and protection.

様々な実施形態の説明は、例証の目的で提示されているが、包括的に、または開示される実施形態に限定されることは意図されていない。多くの修正及び変形例は、説明される実施形態の範囲及び主旨から逸脱することなく当業者に明白である。 The descriptions of various embodiments are presented for purposes of illustration, but are not intended to be exhaustive or limited to the disclosed embodiments. Many modifications and variations will be apparent to those of ordinary skill in the art without departing from the scope and spirit of the described embodiments.

本実施形態の態様は、システム、方法、またはコンピュータプログラム製品として具体化され得る。したがって、本開示の態様は、全体的にハードウェアの実施形態、全体的にソフトウェアの実施形態（ファームウェア、存在ソフトウェア、マイクロコードなどを含む）、またはすべて一般的に「モジュール」もしくは「システム」または「コンピュータ」と称され得るソフトウェア及びハードウェアの態様を組み合わせる実施形態の形態をとり得る。さらに、本開示で説明される任意のハードウェア及び／またはソフトウェアの技術、プロセス、機能、構成要素、エンジン、モジュール、またはシステムは、回路または回路のセットとして実装され得る。 Aspects of the present embodiments may be embodied as a system, method, or computer program product. Thus, aspects of the present disclosure may take the form of an entirely hardware embodiment, an entirely software embodiment (including firmware, existing software, microcode, etc.), or an embodiment combining software and hardware aspects, which may all be generally referred to as a "module" or "system" or "computer." Additionally, any hardware and/or software techniques, processes, functions, components, engines, modules, or systems described in this disclosure may be implemented as a circuit or set of circuits.

本明細書で前述したように、本開示の態様は、コンピュータ可読プログラムコーデックが具現化された１つまたは複数のコンピュータ可読媒体で具現化されたコンピュータプログラム製品の形をとることができる。１つまたは複数のコンピュータ可読媒体の任意の組み合わせを利用することができる。各コンピュータ可読媒体は、コンピュータ可読信号媒体またはコンピュータ可読記憶媒体であり得る。コンピュータ可読記憶媒体は、例えば、限定するものではないが、電子、磁気、光、電磁気、赤外線、もしくは半導体のシステム、装置、もしくはデバイス、または任意の前述の好適な組み合わせであり得る。コンピュータ可読記憶媒体のより多くの具体例は、１つ以上の通信回線を有する電気的接続、ポータブルコンピュータディスケット、ハードディスク、ランダムアクセスメモリ、読み取り専用メモリ、消去可能プログラマブルＲＯＭ、またはフラッシュメモリ）、光ファイバ、コンパクトディスク読み取り専用メモリ、光学記憶デバイス、磁気記憶デバイス、または前述の任意の好適な組み合わせを含むであろう。本文書の文脈において、コンピュータ可読記憶媒体は、命令実行システム、装置、もしくはデバイスによる使用のために、またはそれらと接続してプログラムを含むまたは記憶することができる任意の有形媒体であり得る。 As previously described herein, aspects of the disclosure may take the form of a computer program product embodied in one or more computer readable media having computer readable program codecs embodied therein. Any combination of one or more computer readable media may be utilized. Each computer readable medium may be a computer readable signal medium or a computer readable storage medium. A computer readable storage medium may be, for example, but not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any suitable combination of the foregoing. More specific examples of computer readable storage media would include an electrical connection having one or more communication lines, a portable computer diskette, a hard disk, a random access memory, a read only memory, an erasable programmable ROM, or a flash memory), an optical fiber, a compact disk read only memory, an optical storage device, a magnetic storage device, or any suitable combination of the foregoing. In the context of this document, a computer readable storage medium may be any tangible medium capable of containing or storing a program for use by or in connection with an instruction execution system, apparatus, or device.

本開示の態様は、本開示の実施形態に従った方法、装置（システム）、及びコンピュータプログラム製品のフローチャート図及び／またはブロック図を参照して上段にて説明されている。フローチャート図及び／またはブロック図の各ブロック、及びフローチャート図及び／またはブロック図のブロックの組み合わせは、コンピュータプログラム命令によって実施できることが理解される。これらのコンピュータプログラム命令は、機械を生成するために、汎用コンピュータ、専用コンピュータ、または他のプログラム可能なデータ処理装置のプロセッサに提示され得る。命令は、コンピュータまたは他のプログラム可能なデータ処理装置のプロセッサを介して実行されるとき、フローチャート及び／またはブロック図ブロック（複数可）で明示されている機能／動作の実装を可能にする。係るプロセッサは、限定ではなく、汎用プロセッサ、専用プロセッサ、アプリケーション特有プロセッサ、またはフィールドプログラム可能ゲートアレイであり得る。 Aspects of the present disclosure are described above with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems), and computer program products according to embodiments of the present disclosure. It will be understood that each block of the flowchart illustrations and/or block diagrams, and combinations of blocks in the flowchart illustrations and/or block diagrams, can be implemented by computer program instructions. These computer program instructions can be presented to a processor of a general purpose computer, special purpose computer, or other programmable data processing apparatus to generate a machine. The instructions, when executed via a processor of a computer or other programmable data processing apparatus, enable implementation of the functions/operations specified in the flowchart and/or block diagram block(s). Such a processor can be, but is not limited to, a general purpose processor, a special purpose processor, an application specific processor, or a field programmable gate array.

図面のフローチャート及びブロック図は、本開示の様々な実施形態に従ったシステム、方法、装置、及びコンピュータプログラム製品の可能である実施態様のアーキテクチャ、機能、及び動作を示す。この点で、フローチャートまたはブロック図の各ブロックは、規定された論理関数（複数可）を実装するための１つ以上の実行可能命令を含むモジュール、セグメント、またはコードの一部を表し得る。また、いくつかの実施態様では、ブロックで留意される機能は、図で留意される順序とは違う順序で起こり得ることを留意されたい。例えば、連続して示される２つのブロックは、実際に、実質的に同時に実行され得る、または、ブロックは、時々、関与する機能に応じて、逆の順序で実行され得る。また、ブロック図及び／またはフローチャート図の各ブロック、及びブロック図及び／またはフローチャート図のブロックの組み合わせは、規定の機能もしくは動作、または特殊目的ハードウェア及びコンピュータ命令の組み合わせを行う特殊目的ハードウェアベースシステムによって実施され得ることが留意される。 The flowcharts and block diagrams in the drawings illustrate the architecture, functionality, and operation of possible implementations of systems, methods, apparatus, and computer program products according to various embodiments of the present disclosure. In this regard, each block in the flowchart or block diagram may represent a module, segment, or portion of code that includes one or more executable instructions for implementing a specified logical function(s). It should also be noted that in some implementations, the functions noted in the blocks may occur in a different order than the order noted in the figures. For example, two blocks shown in succession may in fact be executed substantially simultaneously, or the blocks may sometimes be executed in reverse order depending on the functionality involved. It should also be noted that each block of the block diagrams and/or flowchart diagrams, and combinations of blocks in the block diagrams and/or flowchart diagrams, may be implemented by a special-purpose hardware-based system that performs the specified functions or operations, or a combination of special-purpose hardware and computer instructions.

上記は本開示の実施形態を対象としているが、本開示の他の及びさらなる実施形態は、その基本的範囲から逸脱することなく考案され得、その範囲は、以下の特許請求の範囲によって判定される。
以下、本発明の好ましい実施形態を項分け記載する。
実施形態１
マイクロウェルプレートを伴う実験で実行アーティファクトを検出するためのコンピュータ実装方法であって、
第１のマイクロウェルプレートに関連付けられた１つまたは複数のヒートマップに基づいて、空間的特徴の１つまたは複数のセットを計算すること、
第１の特徴ベクトルを生成するために、前記空間的特徴の１つまたは複数のセットを集約すること、及び
前記第１の特徴ベクトルを訓練された分類器に入力することであって、それに応じて、前記第１のマイクロウェルプレートが第１の実行アーティファクトに関連付けられることを示す第１のラベルを生成する、前記入力すること、
を含む前記方法。
実施形態２
前記第１の特徴ベクトル及び前記第１のラベルに関連付けられた特徴ベクトルの第１のクラスタに基づいて、異常スコアを計算することをさらに含む、実施形態１に記載のコンピュータ実装方法。
実施形態３
前記第１のマイクロウェルプレートを含む複数のマイクロウェルプレートに基づいて、実験に関連する第２の特徴ベクトルを計算すること、
前記第２の特徴ベクトルを前記訓練された分類器に入力することであって、それに応じて前記第１のラベルを生成する、前記入力すること、及び
前記複数のマイクロウェルプレートに含まれるマイクロウェルプレートのうちいくつが前記第１のラベルに関連付けられているかを示すパーセンテージを計算すること、
をさらに含む、実施形態１に記載のコンピュータ実装方法。
実施形態４
前記訓練された分類器が、前記第１のラベルが前記第１のマイクロウェルプレートに適用される可能性を示すラベル信頼度をさらに生成する、実施形態１に記載のコンピュータ実装方法。
実施形態５
前記１つまたは複数のヒートマップに含まれる第１のヒートマップが複数の強度を明示し、前記複数の強度に含まれる各強度が、前記第１のマイクロウェルプレートに含まれている異なるウェルに関連付けられる、実施形態１に記載のコンピュータ実装方法。
実施形態６
前記訓練された分類器は、
前記第１のマイクロウェルプレートに関連付けられた第１のヒートマップに基づいて第１の空間情報を計算すること、
前記第１の空間情報に基づいて第１の特徴のセットを計算すること、及び
前記第１の特徴のセットに基づいて１つまたは複数の機械学習操作を実行して、前記訓練された分類器を生成することであって、前記訓練された分類器は、複数の実行アーティファクトに関連する複数のラベルに関して、異なるマイクロウェルプレートに関連する特徴のセットを分類する、前記実行すること、
によりトレーニングされる、実施形態１に記載のコンピュータ実装方法。
実施形態７
前記訓練された分類器が、前記複数のラベルに含まれるラベルのラベル信頼度を推定することによって、特定のマイクロウェルプレートの所与の特徴のセットを分類し、前記ラベル信頼度が、前記ラベルが前記特定のマイクロウェルプレートに適用される可能性を示す、実施形態６に記載のコンピュータ実装方法。
実施形態８
前記第１の空間情報はマルチレベルウェーブレット分解を含み、前記第１の特徴のセットを計算することは、
前記マルチレベルウェーブレット分解の少なくとも最低のレベルから第１の複数の空間的特徴を抽出すること、及び
前記第１の複数の空間的特徴を第２の複数の空間的特徴と集約して、前記第１の特徴のセットを生成することであって、前記第２の複数の空間的特徴は、前記第１のマイクロウェルプレートにまた関連付けられる第２のヒートマップから導出される、前記集約すること、
を含む、実施形態６に記載のコンピュータ実装方法。
実施形態９
前記第１のヒートマップが複数の細胞数を特定し、前記複数の細胞数に含まれる各細胞数が、前記第１のマイクロウェルプレートに含まれる異なるウェルに関連付けられる、実施形態６に記載のコンピュータ実装方法。
実施形態１０
前記１つまたは複数の機械学習操作に含まれる第１の機械学習操作は、教師あり機械学習操作、教師なし機械学習操作、半教師あり機械学習操作、または強化学習操作を含む、実施形態６に記載のコンピュータ実装方法。
実施形態１１
１つまたは複数のプロセッサによって実行されるとき、前記１つまたは複数のプロセッサに、
第１のマイクロウェルプレートに関連する１つまたは複数の測定値に基づいて空間的特徴の１つまたは複数のセットを計算するステップ、
前記空間的特徴の１つまたは複数のセットに基づいて第１の特徴ベクトルを生成するステップ、及び
前記第１の特徴ベクトルを訓練された機械学習モデルに入力するステップであって、それに応じて、前記第１のマイクロウェルプレートが第１の実行アーティファクトに関連付けられることを示す第１のラベルを生成する、前記入力するステップ、
を行うことによって、マイクロウェルプレートを伴う実験で実行アーティファクトを検出させる命令を含む、１つまたは複数の非一時的コンピュータ可読媒体。
実施形態１２
複数の測定値配列に基づいて、実験に関連する第２の特徴ベクトルを計算することであって、前記複数の測定値配列は、前記第１のマイクロウェルプレートに関連する前記１つまたは複数の測定値配列を含む、前記計算すること、及び
前記第２の特徴ベクトルを前記訓練された機械学習モデルに入力することであって、それに応じて、前記第１のラベルを含む複数のラベルに関して前記実験を分類する第２のラベルを生成する、前記入力すること、
をさらに含む、実施形態１１に記載の１つまたは複数の非一時的コンピュータ可読媒体。
実施形態１３
実験に関連付けられたマイクロウェルプレートのサブセットも前記第１のラベルに関連付けられていることを判定することであって、前記マイクロウェルプレートのサブセットは前記第１のマイクロウェルプレートを含む、前記判定すること、及び
グラフィカルユーザインターフェース（「ＧＵＩ」）を介して、前記第１のラベルに関連付けられた平均測定値配列を表示すること、及び
前記マイクロウェルプレートのサブセットが前記平均測定値配列に関連付けられているという前記ＧＵＩを介した表示を生成すること、
をさらに含む、実施形態１１に記載の１つまたは複数の非一時的コンピュータ可読媒体。
実施形態１４
前記空間的特徴の１つまたは複数のセットを計算することは、
マルチレベルウェーブレット分解を生成するために、前記１つまたは複数の測定値配列に含まれる第１の測定値配列にウェーブレット変換を適用すること、及び
前記マルチレベルウェーブレット分解の少なくとも最低のレベルから第１の空間的特徴のセットを抽出すること、
を含む、実施形態１１に記載の１つまたは複数の非一時的コンピュータ可読媒体。
実施形態１５
前記空間的特徴の１つまたは複数のセットを計算することは、
前記１つまたは複数の測定値配列に含まれる第１の測定値配列に基づいて第１の空間情報を計算すること、及び
前記第１の空間情報から第１の低周波の空間的特徴のセットを抽出すること、
を含む、実施形態１１に記載の１つまたは複数の非一時的コンピュータ可読媒体。
実施形態１６
前記訓練された分類器が、
第１のマイクロウェルプレートに関連付けられた第１の測定値配列に基づいて１つまたは複数の空間パターンを判定すること、
前記１つまたは複数の空間パターンに基づいて、第１の特徴のセットを計算すること、及び
前記第１の特徴のセットに基づいて１つまたは複数の機械学習操作を実行して、訓練された分類器を生成することであって、前記訓練された分類器は、複数の実行アーティファクトに関連付けられる複数のラベルに関して、異なるマイクロウェルプレートに関連する特徴のセットを分類する、前記実行すること、
によりトレーニングされる、実施形態１１に記載の１つまたは複数の非一時的コンピュータ可読媒体。
実施形態１７
前記１つまたは複数の機械学習操作を実行する前に、複数の特徴のセットに対してクラスタリングアルゴリズムを実行して、前記複数のラベルを生成することをさらに含む、実施形態１６に記載の１つまたは複数の非一時的コンピュータ可読媒体。
実施形態１８
前記１つまたは複数の空間パターンを判定することは、前記第１の測定値配列にウェーブレット変換を適用することを含む、実施形態１６に記載の１つまたは複数の非一時的コンピュータ可読媒体。
実施形態１９
前記１つまたは複数の機械学習操作に含まれる第１の機械学習操作は、教師あり機械学習操作、教師なし機械学習操作、半教師あり機械学習操作、または強化学習操作を含む、実施形態１６に記載の１つまたは複数の非一時的コンピュータ可読媒体。
実施形態２０
システムであって、
命令を格納する１つまたは複数のメモリと、
１つまたは複数のプロセッサであって、前記命令を実行するときに、
第１のマイクロウェルプレートに関連する複数の測定値に基づいて複数の空間的特徴を計算するステップ、
前記複数の空間的特徴に基づいて特徴のセットを生成するステップ、及び
前記特徴のセット及び訓練された機械学習モデルに基づいて第１のラベルを計算するステップであって、前記第１のラベルは、前記第１のマイクロウェルプレートが第１の実行アーティファクトに関連付けられていることを示す、前記計算するステップ、
を行う前記１つまたは複数のメモリに結合される、前記１つまたは複数のプロセッサと、を含む、前記システム。
While the forgoing is directed to embodiments of the present disclosure, other and further embodiments of the present disclosure may be devised without departing from the basic scope thereof, which scope is determined by the following claims.
Preferred embodiments of the present invention will be described below in detail.
EMBODIMENT 1
1. A computer-implemented method for detecting performance artifacts in an experiment involving a microwell plate, comprising:
calculating one or more sets of spatial features based on the one or more heat maps associated with the first microwell plate;
aggregating the one or more sets of spatial features to generate a first feature vector; and
inputting the first feature vector into a trained classifier, which in response generates a first label indicating that the first micro-well plate is associated with a first execution artifact;
The method comprising:
EMBODIMENT 2
2. The computer-implemented method of embodiment 1, further comprising: calculating an anomaly score based on the first feature vector and a first cluster of feature vectors associated with the first label.
EMBODIMENT 3
calculating a second feature vector associated with an experiment based on a plurality of microwell plates including the first microwell plate;
inputting the second feature vector into the trained classifier, whereby generating the first label in response thereto; and
calculating a percentage indicating how many of the micro-well plates in the plurality of micro-well plates are associated with the first label;
2. The computer-implemented method of embodiment 1, further comprising:
EMBODIMENT 4
2. The computer-implemented method of embodiment 1, wherein the trained classifier further generates a label confidence indicating the likelihood that the first label is applied to the first micro-well plate.
EMBODIMENT 5
2. The computer-implemented method of embodiment 1, wherein a first heat map in the one or more heat maps exhibits a plurality of intensities, each intensity in the plurality of intensities being associated with a different well in the first micro-well plate.
EMBODIMENT 6
The trained classifier may be
calculating first spatial information based on a first heat map associated with the first microwell plate;
calculating a first set of features based on the first spatial information; and
performing one or more machine learning operations based on the first set of features to generate the trained classifier, the trained classifier classifying sets of features associated with different micro-well plates with respect to a plurality of labels associated with a plurality of execution artifacts;
2. The computer-implemented method of embodiment 1, wherein the computer-implemented method is trained by:
EMBODIMENT 7
7. The computer-implemented method of embodiment 6, wherein the trained classifier classifies a given set of features of a particular micro-well plate by estimating a label confidence of a label included in the plurality of labels, the label confidence indicating the likelihood that the label applies to the particular micro-well plate.
EMBODIMENT 8
The first spatial information includes a multi-level wavelet decomposition, and computing the first set of features includes:
extracting a first plurality of spatial features from at least a lowest level of the multilevel wavelet decomposition; and
aggregating the first plurality of spatial features with a second plurality of spatial features to generate the first set of features, the second plurality of spatial features being derived from a second heat map also associated with the first micro-well plate;
7. The computer-implemented method of embodiment 6, comprising:
EMBODIMENT 9
7. The computer-implemented method of embodiment 6, wherein the first heat map identifies a plurality of cell counts, each cell count within the plurality of cell counts being associated with a different well within the first microwell plate.
EMBODIMENT 10
7. The computer-implemented method of claim 6, wherein a first machine learning operation included in the one or more machine learning operations includes a supervised machine learning operation, an unsupervised machine learning operation, a semi-supervised machine learning operation, or a reinforcement learning operation.
EMBODIMENT 11
When executed by one or more processors, the one or more processors are
calculating one or more sets of spatial features based on one or more measurements associated with the first micro-well plate;
generating a first feature vector based on the one or more sets of spatial features; and
inputting the first feature vector into a trained machine learning model, which in response generates a first label indicating that the first micro-well plate is associated with a first execution artifact;
One or more non-transitory computer-readable media comprising instructions for detecting execution artifacts in an experiment involving a microwell plate by performing the steps of:
EMBODIMENT 12
calculating a second feature vector associated with an experiment based on a plurality of measurement arrays, the plurality of measurement arrays including the one or more measurement arrays associated with the first micro-well plate; and
inputting the second feature vector into the trained machine learning model, whereby a second label is generated in response that classifies the experiment with respect to a plurality of labels including the first label;
12. The one or more non-transitory computer-readable media of embodiment 11, further comprising:
EMBODIMENT 13
determining that a subset of microwell plates associated with an experiment are also associated with the first label, the subset of microwell plates including the first microwell plate; and
displaying, via a graphical user interface ("GUI"), an average measurement array associated with the first label; and
generating an indication via the GUI that the subset of micro-well plates is associated with the average measurement array;
12. The one or more non-transitory computer-readable media of embodiment 11, further comprising:
EMBODIMENT 14
Calculating the one or more sets of spatial features comprises:
applying a wavelet transform to a first measurement array in the one or more measurement arrays to generate a multi-level wavelet decomposition; and
extracting a first set of spatial features from at least a lowest level of the multilevel wavelet decomposition;
12. One or more non-transitory computer-readable media as recited in embodiment 11, comprising:
EMBODIMENT 15
Calculating the one or more sets of spatial features comprises:
calculating first spatial information based on a first measurement array included in the one or more measurement arrays; and
extracting a first set of low frequency spatial features from the first spatial information;
12. One or more non-transitory computer-readable media as recited in embodiment 11, comprising:
EMBODIMENT 16
The trained classifier comprises:
determining one or more spatial patterns based on a first array of measurements associated with the first microwell plate;
calculating a first set of features based on the one or more spatial patterns; and
performing one or more machine learning operations based on the first set of features to generate a trained classifier, the trained classifier classifying sets of features associated with different micro-well plates with respect to a plurality of labels associated with a plurality of execution artifacts;
12. The one or more non-transitory computer-readable media of embodiment 11, wherein the one or more non-transitory computer-readable media are trained by:
EMBODIMENT 17
17. The one or more non-transitory computer-readable media of embodiment 16, further comprising: prior to performing the one or more machine learning operations, running a clustering algorithm on a set of features to generate the plurality of labels.
EMBODIMENT 18
17. The one or more non-transitory computer-readable media of embodiment 16, wherein determining the one or more spatial patterns comprises applying a wavelet transform to the first array of measurements.
EMBODIMENT 19
17. The one or more non-transitory computer-readable media of embodiment 16, wherein a first machine learning operation included in the one or more machine learning operations includes a supervised machine learning operation, an unsupervised machine learning operation, a semi-supervised machine learning operation, or a reinforcement learning operation.
EMBODIMENT 20
1. A system comprising:
one or more memories for storing instructions;
One or more processors, which when executing the instructions,
calculating a plurality of spatial features based on a plurality of measurements associated with the first microwell plate;
generating a set of features based on the plurality of spatial features; and
calculating a first label based on the set of features and a trained machine learning model, the first label indicating that the first micro-well plate is associated with a first execution artifact;
and the one or more processors coupled to the one or more memories for performing the

Claims

1. A computer-implemented method for detecting performance artifacts in an experiment involving a microwell plate, comprising:
calculating one or more sets of spatial features based on one or more heat maps associated with a first micro-well plate, the first heat map having a plurality of values associated with a plurality of wells of the first micro-well plate ;
generating a first feature vector based on the one or more sets of spatial features; and inputting the first feature vector into a trained classifier, which in response generates a first label indicating that the first micro-well plate is associated with a first execution artifact.
The method comprising:

The computer-implemented method of claim 1, further comprising: calculating an anomaly score based on the first feature vector and a first cluster of feature vectors associated with the first label.

calculating a second feature vector associated with an experiment based on a plurality of microwell plates including the first microwell plate;
inputting the second feature vector into the trained classifier, whereby generating the first label in response thereto; and calculating a percentage indicating how many of the micro-well plates in the plurality of micro-well plates are associated with the first label.
The computer-implemented method of claim 1 , further comprising:

The computer-implemented method of claim 1, wherein the trained classifier further generates a label confidence indicating the likelihood that the first label applies to the first microwell plate.

2. The computer-implemented method of claim 1, wherein the first heat map comprises a plurality of intensity values , each intensity value within the plurality of intensity values being associated with a different well contained within the first micro-well plate.

The trained classifier may be
calculating first spatial information based on the first heat map associated with the first micro-well plate;
calculating a first set of features based on the first spatial information; and performing one or more machine learning operations based on the first set of features to generate the trained classifier, the trained classifier classifying sets of features associated with different micro-well plates with respect to a plurality of labels associated with a plurality of execution artifacts.
The computer-implemented method of claim 1 , wherein the training is performed by:

The computer-implemented method of claim 6, wherein the trained classifier classifies a given set of features of a particular micro-well plate by estimating a label confidence of a label included in the plurality of labels, the label confidence indicating the likelihood that the label applies to the particular micro-well plate.

The first spatial information includes a multi-level wavelet decomposition, and computing the first set of features includes:
extracting a first plurality of spatial features from at least a lowest level of the multi-level wavelet decomposition; and aggregating the first plurality of spatial features with a second plurality of spatial features to generate the first feature set, the second plurality of spatial features being derived from a second heat map also associated with the first micro-well plate.
The computer-implemented method of claim 6 , comprising:

The computer-implemented method of claim 6, wherein the first heat map identifies a plurality of cell counts, each cell count in the plurality of cell counts being associated with a different well in the first microwell plate.

The computer-implemented method of claim 6, wherein a first machine learning operation included in the one or more machine learning operations includes a supervised machine learning operation, an unsupervised machine learning operation, a semi-supervised machine learning operation, or a reinforcement learning operation.

When executed by one or more processors, the one or more processors are
calculating one or more sets of spatial features based on one or more measurements associated with a first micro-well plate, the one or more measurements including a first measurement having a plurality of values associated with a plurality of wells of the first micro-well plate ;
generating a first feature vector based on the one or more sets of spatial features; and inputting the first feature vector into a trained machine learning model, which in response generates a first label indicating that the first micro-well plate is associated with a first execution artifact.
One or more non-transitory computer-readable media comprising instructions for detecting execution artifacts in an experiment involving a microwell plate by performing the steps of:

1. A system comprising:
one or more memories for storing instructions;
One or more processors, which when executing the instructions,
calculating a plurality of spatial features based on a plurality of measurements associated with a first micro-well plate, the measurements being associated with a plurality of wells of the first micro-well plate ;
generating a set of features based on the plurality of spatial features; and calculating a first label based on the set of features and a trained machine learning model, the first label indicating that the first micro-well plate is associated with a first execution artifact.
and the one or more processors coupled to the one or more memories for performing the