JP2005004507A

JP2005004507A - Information processing system using base sequence-related information

Info

Publication number: JP2005004507A
Application number: JP2003167724A
Authority: JP
Inventors: Takamasa Kato; 孝昌加藤; Tateo Morimoto; 健郎森本
Original assignee: Hitachi Ltd
Current assignee: Hitachi Ltd
Priority date: 2003-06-12
Filing date: 2003-06-12
Publication date: 2005-01-06
Anticipated expiration: 2023-06-12
Also published as: WO2004111908A1; JP3677275B2

Abstract

<P>PROBLEM TO BE SOLVED: To provide individual related information and information usable to run a business according to base sequence-related information. <P>SOLUTION: This system is provided with: a requester-side terminal which requests information showing featured trends as to a specified individual; and a statistical analysis terminal having a 1st storage device, which stores position information and base sequence-related information for every individual so that they are made correspond to each other, and a 2nd storage device, which stores individual related information regarding individual organisms for every individual organism. <P>COPYRIGHT: (C)2005,JPO&NCIPI

Description

【０００１】
【発明の属する技術分野】
本発明は、例えば通信回線網を介して情報を提供する情報処理システムに関する。
【０００２】
【従来の技術】
現在、ヒトを始めとする各種生物のゲノム塩基配列が急速に決定されており、様々なデータベースにゲノム塩基配列情報が蓄積されている。例えば、インターネット等の情報通信網を介して、各種研究機関や研究者がデータベースに蓄積されたゲノム塩基配列情報を利用できるようなシステムの構築がなされつつある。
【０００３】
同時に、このようなゲノム塩基配列情報に含まれる塩基配列を用いて、ゲノム創薬の研究や遺伝情報の解析等が盛んに行われており、一塩基多型に代表されるような個体間における塩基配列の相違が注目されている。一般に、個体間における塩基配列の相違とは、所定の塩基の相違が個体種中１％以上の頻度で存在すると定義される多型と、所定の塩基の相違が個体種中１％未満であるバリエーションとを意味している。特に、多型には、個体間における１個の塩基の相違である一塩基多型（ＳＮＰ；ＳｉｎｇｌｅＮｕｃｌｅｏｔｉｄｅＰｏｌｙｍｏｒｐｈｉｓｍ）、１から数十塩基（数千塩基の場合もある）が欠失又は挿入している挿入／欠失多型、２から数十塩基を１単位とする配列の繰り返し回数が相違するＶＮＴＲ（ＶａｒｉａｂｌｅＮｕｍｂｅｒｏｆＴａｎｄｅｍＲｅｐｅａｔ）やマイクロサテライト多型（繰り返し配列が２〜４塩基程度のもの）が知られている。
【０００４】
このような多型は、個体間におけるタンパク質のアミノ酸配列の相違や、個体間における所定の遺伝子に関する発現効率の相違等に影響を及ぼすことがある。このような影響により、例えば、所定の疾病に対する罹患可能性が個体間で異なったり、所定の薬剤に対する感受性が個体間で異なることが知られている。
【０００５】
ところが、多型等の個体間における塩基配列情報の相違を有効に利用して、各個体にとって有益な意味情報を提供するようなシステムは構築されていないのが現状である。
【０００６】
【発明が解決しようとする課題】
そこで、本発明は、このような現状に鑑み、塩基配列関連情報に基づいて、個体関連情報を提供できる塩基配列に関する情報処理方法、並びに、塩基配列関連情報に基づいて、事業を行うために使用できる情報を提供できる塩基配列に関する情報処理方法を提供することを目的とする。
【０００７】
【課題を解決するための手段】
上述した目的を達成した本発明に係る塩基配列に関する情報処理システムは、所定の個体について特徴的な傾向を示す情報を要求する要求者側端末と、位置情報と塩基配列関連情報とを関連付けて個体毎に格納した第１の記憶装置と、個体に関する個体関連情報を個体毎に格納した第２の記憶装置とを有する統計解析用端末とを備える。本システムにおいて、統計解析用端末は、要求者側端末から送出された位置情報を取得し、第１の記憶装置及び第２の記憶装置を検索して、要求者側端末から取得した位置情報と、当該位置情報に関連付けられた複数の塩基配列関連情報のうち１の塩基配列関連情報とからなるセットを有する個体に特徴的な傾向を示す個体関連情報を取得する。
【０００８】
また、本発明に係る塩基配列に関する情報処理システムは、所定の集団に含まれる個体を特定するための情報を収集し、当該集団における、所定の位置情報に対応する塩基配列関連情報の保有割合を用いて、事業を行う事業機関が備える要求者側端末と、位置情報と塩基配列関連情報とを関連付けて個体毎に格納した記憶装置を有する統計処理端末とを備える。本システムにおいて、統計処理端末は、要求者側端末から集団に含まれる個体を特定するための情報と保有割合を求める対象となる塩基配列関連情報に対応する位置情報とを取得し、記憶装置を検索し、集団における、取得した位置情報と当該位置情報に関連付けられた複数の塩基配列関連情報のうち１の塩基配列関連情報とからなるセット毎の保有割合を算出する。そして、本システムにおいて、要求者側端末は、集団における、統計処理端末で算出した保有割合を取得し、当該保有割合に基づいて事業を行うための情報を作成する。
【０００９】
なお、本発明に係る塩基配列に関する情報処理システムは、制御装置、送受信装置及び記憶装置等のハードウェアを備えるコンピュータに、各ステップを実行させるプログラムとして実現することができる。また、本発明に係る塩基配列に関する情報処理システムは、制御装置、送受信装置及び記憶装置等のハードウェアを備えるコンピュータに、各ステップを実行させるプログラムを記録した記録媒体として実現することもできる。さらに、本発明に係る塩基配列に関する情報処理システムは、各ステップを実行する制御装置、送受信装置及び記憶装置等のハードウェアを備える情報処理装置として実現することもできる。
その他、本発明は、請求項各項に記載されている通りの構成を有するものである。
【００１０】
【発明の実施の形態】
以下、図面を参照して本発明を詳細に説明する。
＜第１の実施の形態＞
先ず、本発明を適用した第１の実施の形態として、特定の多型番地及び多型パターンのセットを有する個体に特徴的な傾向を示す情報を提供する情報処理システムについて説明する。
【００１１】
以下の説明においては、特定の多型番地及び多型パターンのセットを有する個体に特徴的な傾向を示す情報を提供する情報処理システムについて説明するが、説明の都合上、簡略化したモデルとして説明する。情報処理システムは、図１に示すように、インターネット等の通信回線網１と、通信回線網１に接続された共用コンピュータ２と、通信回線網１に接続された少なくとも１以上の個人用コンピュータ３と、通信回線網１に接続された統計解析用コンピュータＳＴとを備え、通信回線網１を介して共用コンピュータ２と個人用コンピュータ３と統計解析用コンピュータＳＴとの間のデータ通信を可能としている。
【００１２】
共用コンピュータ２は、図２に示すように、当該共用コンピュータ２の動作を全て制御するＣＰＵ４と、情報及びプログラムの実行指示等を入力できるキーボード及びマウス等の入力装置５と、ディスプレイ装置等の表示装置６と、一時的な情報及び書き換え不可能な情報等が記録されるメモリー７と、各種データを格納しているデータベース８と、これらメモリー７及びデータベース８に対して所定の情報を書き込む記録装置９と、通信回線網１を介して個人用コンピュータ３との間で情報の送受信を行う送受信装置１７とから構成されている。
【００１３】
共用コンピュータ２におけるメモリー７は、それぞれ異なる種類の情報を記録するメモリー部Ａ１０及びメモリー部Ｂ１１と、例えば個人用コンピュータ３や表示装置６に表示させる画像データを記録した画面メモリー１２と、本システムを動作させるための処理プログラム１３とから構成されている。なお、共用コンピュータ２においては、画面メモリー１２及び処理プログラム１３等を内部のメモリー７に有さず、通信回線網１を介して共用コンピュータ２と接続された外部記憶装置（図示せず）に有するものであってもよい。
【００１４】
共用コンピュータ２におけるデータベース８（記憶装置）は、多型番地、多型パターン及び意味情報が記録されたメインＤＢ１４と、メモリー部Ａ１０に記録された情報を保存する保管用ＤＢ−Ａ１５と、メモリー部Ｂ１１に記録された情報を保存する保管用ＤＢ−Ｂ１６とから構成されている。メインＤＢ１４は、図３に示すように、多型番地と、当該多型番地で取りうる複数の多型パターンと、当該複数の多型パターンそれぞれを意味づける意味情報とが関連付けられて記録されている。また、メインＤＢ１４には、複数の多型番地における多型パターンの組合せ（例えば、ハプロタイプ）を意味づける意味情報が記録されていても良い。
【００１５】
ここで、「多型番地（位置情報）」とは、少なくとも、塩基配列における多型が存在する位置を意味する。なお、一般的に多型とは、例えば、いわゆるＳＮＰ（ｓｉｎｇｌｅｎｕｃｌｅｏｔｉｄｅｐｏｌｙｍｏｒｐｈｉｓｍ）、ＲＦＬＰ（ｒｅｓｔｒｉｃｔｉｏｎｆｒａｇｍｅｎｔｌｅｎｇｔｈｏｆｐｏｌｙｍｏｒｐｈｉｓｍ）、ＶＮＴＲ（ｖａｒｉａｂｌｅｎｕｍｂｅｒｏｆｔａｎｄｅｍｒｅｐｅａｔ）、マイクロサテライト等を含んでいる。しかし、本明細書において使用する「多型」は、これらに限定されず、個体種中１％未満の頻度でしか存在しない塩基及び塩基配列の変化（バリエーション）も含む意味とする。したがって、「多型番地」は、個体種中１％未満の頻度でしか存在しない塩基及び塩基配列の変化を示す、塩基配列における位置も含む意味である。すなわち、「多型番地」とは、数値、文字及び記号等を組み合わせて、多型等を示す位置を表すものである。多型番地は、特に限定されないが、例えば、染色体番号と多型が存在する遺伝子を表す記号と当該遺伝子における多型の存在位置を示す数値との組合せにより表記することもできるし、多型が存在する遺伝子を示す記号と当該遺伝子における多型の存在位置を示す数値との組合せであってもよい。
【００１６】
また、多型番地は、多型毎に付与される多型固有の表記であっても良い。多型番地として多型固有の表記を使用する場合、多型番地は塩基配列中の位置を直接的には示さないが、多型固有の表記に基づいて間接的に位置を知ることができる。したがって、「多型番地」は、多型固有の表記も含む意味である。
【００１７】
「多型パターン（塩基配列関連情報）」とは、個体間において相違する塩基配列の情報であり、少なくとも、多型における塩基又は塩基配列のパターンを含む意味である。さらに「多型パターン」は、多型に限らず、個体種中１％未満の頻度でしか存在しない塩基及び塩基配列のパターンも含む意味である。
【００１８】
例えば、Ａ又はＧを取ることが知られている多型番地において、「多型パターン」は、「Ａ」及び「Ｇ」のいずれかで表される。また、「多型パターン」は、相同染色体におけるヘテロ接合体又はホモ接合体を示すものであってもよい。この場合、例えば、Ａ又はＧを取ることが知られている多型番地において、「多型パターン」は、「ＡＡ」、「ＧＧ」及び「ＡＧ」のいずれかで表現できる。さらに、「多型パターン」は、所定の多型番地で取りうるパターンを直接的に表記するものではなく、間接的に表記するものであっても良い。すなわち、「多型パターン」は、例えば、Ａ又はＧを取ることが知られている多型番地において「Ａ」を取る場合に「アレル１」とし、「Ｇ」を取る場合に「アレル２」と表記してもよい。また、「多型パターン」が上述したように「ＡＡ」、「ＧＧ」及び「ＡＧ」のいずれかで表現できる場合、例えば、「ＡＡ」で表現できるときに「α」、「ＧＧ」で表現できるときに「β」、「ＡＧ」で表現できるときに「γ」と表記してもよい。その他「多型パターン」の表記例としては、多型がマイクロサテライトの場合には「繰り返し数」を表す数値で、多型が挿入、欠失型の場合には「有／無」を表す記号で表記してもよい。
【００１９】
また更に、各多型番地における「多型パターン」は、所定の規則や取り決めに従って、例えば、「多型１」、「多型２」、「多型３」等と表記されても良い。例えば、各多型番地において、「多型パターン」が取り得る頻度の高い順に、「多型１」、「多型２」、「多型３」と表記できる。この場合、例えば、各多型番地におけるそれぞれの「多型１」は必ずしも同じ内容を表すものではない。すなわち、例えば、ある多型番地の「多型１」は最も取り得る頻度が高い「ＡＡ」を表し、別の多型番地「多型１」は最も取り得る頻度が高い「ＧＧ」を表すことになる。なお、本実施の形態においては、「多型パターン」を「多型１」、「多型２」及び「多型３」等のいずれかで表記する方法を使用している。ところで、本システムにおいては、多型パターンは、暗号化されていても暗号化されていなくても差し支えない。
【００２０】
ここで、「意味情報」とは、「多型パターン」に関連づけられた情報であり、例えば、薬剤に対する応答性、薬剤に対する副作用、疾患及び障害に対するリスク、体質・性質、体質・性質等に基づく生活習慣アドバイス、タンパク質相互作用など、「多型パターン」の相違に起因する様々な情報を意味する。
【００２１】
なお、「意味情報」としては、「多型パターン」の相違に起因する様々な情報を直接表しても良く、また、当該情報を意味する記号などを用いて間接的に表しても良い。「意味情報」は、ゲノム・遺伝子に関する研究が進むことにより種類が増加するとともに訂正が行われる種類の情報であり、常にバージョンアップすることが好ましい。すなわち、「意味情報」は、ゲノム・遺伝子の研究成果を用いてデータベースを更新することによって、蓄積量が増加・減少してより精度の高いものとなる。
【００２２】
なお、直接「多型パターン」には関連づけられていないが「意味情報」から更に導き出される情報は、「意味情報に関連する情報」である。「意味情報」が「疾患に対するリスク」である場合、当該リスクがある一定の水準を超えたときに、例えば特定の「健康診断検査項目」が導き出される。この特定の「健康診断検査項目」が「意味情報に関連する情報」である。
【００２３】
本実施の形態において意味情報は、図３に示すように、少なくとも、所定の「多型番地」及び「多型パターン」に関連づけられた「多型パターンに対する注釈情報」としてメインＤＢ１４に記録されている。また、意味情報には、所定の「多型番地」に対応する「多型分類」及び「分類（疾患名）」等が関連づけられている。すなわち、所定の「多型番地」が所定の「多型パターン」である場合、疾患名の種類と当該疾患に対する罹患可能性を示す注釈情報（意味情報）を得ることができる。したがって、例えば、意味情報は、複数の多型番地に対応するそれぞれの多型パターンの組合せ（例えば、ハプロタイプ）に対して関連付けることもできる。すなわち、複数の多型番地における多型パターンの組合せ毎に、所定の疾患に対する異なる罹患可能性を示す注釈情報（意味情報）を関連付けることができる。この場合、複数の多型番地が所定の多型パターンの組合せである場合、所定の疾患に対する罹患可能性を示す注釈情報（意味情報）を得ることができる。
【００２４】
また、意味情報には、所定の基準で決定した「公開レベル」を関連づけることもできる。例えば、「公開レベル」を決定する際の基準としては、意味情報、すなわちここでは「分類（疾患名）」の罹患可能性を公開することによる個人に対する不測の不利益等を考慮して定めることができる。詳細には、共用コンピュータ２において、法律、規則又は自らの行動基準若しくは利用者との契約等に鑑みて、公開することが相応しくない意味情報については、公開しないような「公開レベル」を決定することができる。この場合、本システムでは、公開不可を意味する「公開レベル」に関連付けられた罹患可能性を示す注釈情報については、利用者に対して開示することはない。これにより、利用者に対して不測の不利益となりうる意味情報を与えることや、契約者以外に意味情報が開示されることを防止できる。
【００２５】
なお、利用者がインフォームドコンセント等により、所定の「公開レベル」を関連づけた意味情報の開示を容認することにより、利用者に対して、所定の「公開レベル」が関連づけられた意味情報を公開するようなシステムであってもよい。
【００２６】
また、「公開レベル」は、例えば「１，２，３、…」又は「ａ，ｂ，ｃ，…」といった３以上の複数の段階として設定することができる。この場合、共用コンピュータ２側では、利用者の年齢、資格及び利用者との契約の有無等、利用者の種類に応じてレベルを設定することができる。なお、インフォームドコンセント等によって、所定の公開レベル以上（又は未満）の公開レベルに関連付けられた罹患可能性を示す注釈情報のみが利用者側に対して提供されるように、当該利用者側が公開レベルを選択することもできる。
【００２７】
なお、データベース８において、保管用ＤＢ−Ｂ１６には、例えば、本システムを利用する要求者個人の遺伝情報である塩基配列関連情報といったデータを記録することができる。また、保管用ＤＢ−Ａ１５には、例えば、本システムを利用する要求者を特定する情報といったデータを記録することができる。このように、保管用ＤＢ−Ａ１５及び保管用ＤＢ−Ｂ１６に、個人の遺伝情報と個人を特定する情報とを分けて記録することによって、要求者の遺伝情報と、要求者を特定するデータとを関連付け難くなる。
【００２８】
なお、共用コンピュータ２は、データベース８を内部に有するものに限定されず、通信回線網１を介して共用コンピュータ２に接続された外部データベース（図示せず）を有するものであってもよい。また、共用コンピュータ２は、内部に複数のデータベース８を有するものであってもよいし、内部のデータベース８と通信回線網１を介して共用コンピュータ２に接続された外部データベースとを有するものであっても良い。
【００２９】
統計解析用コンピュータＳＴは、図４に示すように、当該統計解析用コンピュータＳＴの動作を全て制御するＣＰＵ３０と、情報及びプログラムの実行指示等を入力できるキーボード、マウス等の入力装置３１と、ディスプレイ装置等の表示装置３２と、一時的な情報及び書換え可能な情報や書換え不可能な情報等が記録されるメモリー３３と、各種データを格納しているデータベース３４と、これらメモリー３３及びデータベース３４に対して所定の情報を書き込む記録装置３５と、通信回線網１を介して共用コンピュータ２及び個人用コンピュータ３との間で情報の送受信を行う送受信装置３６とから構成されている。
【００３０】
統計解析用コンピュータＳＴにおけるメモリー３３は、個人用コンピュータ３から送られた多型パターン等を一時的に記録したり、統計解析結果等を記録するメモリー部３７と、例えば個人用コンピュータ３や共用コンピュータ２や表示装置３２に表示させる画像データを記録した画面メモリー３８と、本システムを動作させ、複数の個人（個体）に関する「個体関連情報」と複数の個人（個体）に関する「ゲノム関連情報」とを用いて、例えば、所定の疾患に対する罹患可能性を示す注釈情報といった意味情報を創出する処理プログラム３９とが記録されている。なお、当該処理プログラム３９によれば、先ず、複数の個人（個体）に関する「ゲノム関連情報」と複数の個人（個体）に関する「個体関連情報」とを集積した後、当該「ゲノム関連情報」と当該「個体関連情報」とを用いて統計的に処理することで、所定の疾患に対する罹患可能性を示す注釈情報（意味情報）等を得ることができる。なお、統計解析用コンピュータＳＴにおいては、処理プログラム３９やメモリー部３７を内部のメモリー３３に有さず、通信回線網１を介して統計解析用コンピュータＳＴに接続された外部記憶装置（図示せず）に有するものであっても良い。
【００３１】
ここで、「個体関連情報」とは、個体の性質、心理的状態、体質、体調、健康状態、病歴、生活習慣、行動・思考パターン、癖及び嗜好等、所定の個体に関するあらゆる情報が含まれる。例えば、個体関連情報としては、予め準備された質問等に対して利用者等の個人（個体）が回答することから得られる情報、或いは、例えば、利用者等が医療機関や検査機関等で検査を受けることによって得られる情報を挙げることができる。
また、「ゲノム関連情報」とは、所定の個体に関する複数の「多型パターン」をそれぞれ所定の「多型番地」に関連付けたデータの群（塩基配列関連情報群）である。
【００３２】
統計解析用コンピュータＳＴにおけるデータベース３４（記憶装置）は、複数の個人（個体）に関する「ゲノム関連情報」を各個人（個体）毎に記録した「ゲノム関連情報ＤＢ４０」（図５に示す）を有している。なお、「ゲノム関連情報ＤＢ４０」に記録された「ゲノム関連情報」は、直接個人（個体）を特定する情報と結びつかないように匿名化されていることが望ましい。
【００３３】
また、データベース３４は、個人用コンピュータ３より受信した「個体関連情報」を各個人（個体）毎に記録した「個体関連情報ＤＢ４１」（図６に示す）を有している。なお、「個体関連情報ＤＢ４１」に記録された「個体関連情報」は、直接個人（個体）を特定する情報と結びつかないように匿名化されていることが望ましい。
【００３４】
なお、統計解析用コンピュータＳＴは、前記データベース３４を内部に有するものに限定されず、通信回線網１を介して統計解析用コンピュータＳＴに接続された外部データベース（図示せず）に対してアクセスするものであっても良い。
【００３５】
個人用コンピュータ３は、図７に示すように、当該個人用コンピュータ３の動作を全て制御するＣＰＵ２０と、情報及びプログラムの実行指示等を入力できるキーボード及びマウス等の入力装置２１と、ディスプレイ装置等の表示装置２２と、一時的な情報及び書き換え可能な情報等が記録されるメモリー２３と、ゲノム関連情報記録媒体２４からデータを読み取る読取り装置２５と、通信回線網１を介して共用コンピュータ２との間で情報の送受信を行う送受信装置２９とから構成されている。なお、個人用コンピュータ３は、通常のコンピュータに限定されず、例えば、携帯電話、個人携帯端末及びその他の移動体通信機器等、いかなる形態であってもよい。
【００３６】
個人用コンピュータ３におけるメモリー２３は、ゲノム関連情報記録媒体２４からの情報等を記録するメモリー部２６を有し、本情報処理システムを動作させる処理プログラム２７が記録されている。
【００３７】
ゲノム関連情報記録媒体２４には、個人のゲノム関連情報２８が記録されている。ゲノム関連情報記録媒体２４としては、例えば、磁気ディスクや磁気カード等の磁気記録媒体、光磁気記録方式や相変化記録方式等を適用した光学式記録媒体、半導体メモリー等を挙げることができる。また、このゲノム関連情報記録媒体２４は、カード状、ディスク状、スティック状、テープ状又はドラム状等いかなる形態であってもよい。さらに、このゲノム関連情報記録媒体２４は、単一の個人（個体）のゲノム関連情報２８を記録したものであってもよいが、複数の個人（個体）に関する複数のゲノム関連情報２８を記録したものであってもよい。
【００３８】
ゲノム関連情報記録媒体２４に含まれるゲノム関連情報２８とは、少なくとも、「多型番地」及び個人（個体）の塩基配列を解析した結果として得られる所定の多型番地における「多型パターン」を意味する。また、ゲノム関連情報２８には、既往症、特徴、カルテ情報、健康診断結果といった各種情報を含んでいてもよい。
【００３９】
ゲノム関連情報記録媒体２４には、ゲノム関連情報２８として、例えば、図８に示すように、データＩとしてゲノム関連情報２８に固有の個別番号「Ｇｎｏ．」（ジーナンバー）及び生年月日等の個人情報を記録し、データＩＩとして多型番地及び多型パターンを記録し、データＩＩＩとして既往症を記録し、データＩＶとして特徴を記録し、データＶとしてカルテ情報等を記録する。すなわち、ゲノム関連情報２８は、データＩ、データＩＩ、データＩＩＩ、データＩＶ及びデータＶから構成されている。データＩ及びデータＩＩには必須の情報が含まれており、データＩＩＩ、データＩＶ及びデータＶには付加的な情報から構成されている。
【００４０】
ゲノム関連情報２８においては、塩基配列上の位置に対応する「多型番地」と、当該多型番地における「多型パターン」とをリンクさせて記録している。また、データＩＩには、所定の多型番地における付加的な情報を「コメント」として、「多型番地」にリンクさせて記録していてもよい。なお、データＩＩには、所定の個体に関する全塩基配列を記録しても良い。データＩＩに全塩基配列を記録した場合であっても、データＩＩ内に「多型番地」及び「多型パターン」が含まれることとなる。
【００４１】
なお、本発明において、個人用コンピュータ３及びゲノム関連情報記録媒体２４は、それぞれ図７及び図８に示したような構成に限定されず、例えば、ゲノム関連情報記録媒体が処理プログラムを有するメモリー部を備え、個人用コンピュータが当該ゲノム関連情報記録媒体を装着して処理プログラムを動作させるような構成であってもよい。この場合、個人用コンピュータは、ゲノム関連情報記録媒体のメモリー部に記録された処理プログラムに従って動作できる。
【００４２】
以上のように構成された情報処理システムにおいては、統計解析用コンピュータＳＴのメモリー３３が有する処理プログラム３９及び個人用コンピュータ３のメモリー２３に記録された処理プログラム２７が例えば、図９に示すようなフローチャートに従って情報処理動作することによって、個体関連情報を蓄積して個体関連情報ＤＢ４１を構築することができる。なお、図９に示すフローチャートにおいて、「（統）」と記載したステップは統計解析用コンピュータＳＴにおける処理を意味し、「（個）」と記載したステップは個人用コンピュータ３における処理を意味している。
【００４３】
本情報処理システムは、ゲノム関連情報記録媒体２４を所持する各個人が個人用コンピュータ３を用いて通信回線網１を介して統計解析用コンピュータＳＴにアクセスし、各個人（個体）に関する「個体関連情報」を統計解析用コンピュータＳＴの「個体関連情報ＤＢ４１」に、「ゲノム関連情報ＤＢ４０」の中のゲノム関連情報と関連付けて登録するシステムである。なお、本情報処理システムは、複数人のゲノム関連情報２８がそれぞれ記録されたゲノム関連情報記録媒体２４を用い、各個人がゲノム関連情報記録媒体２４にアクセスするようなシステムであってもよい。
【００４４】
このとき、先ず要求者は、ステップ１（Ｓ１）で、通信回線網１を介して統計解析用コンピュータＳＴにアクセスし、統計解析用コンピュータＳＴに対して個体関連情報の登録を行う旨の意思表示を行う。ステップ１では、統計解析用コンピュータＳＴが提供するウェブページにアクセスして前記意思表示を行っても良いし、統計解析用コンピュータＳＴに対して電子メール等を用いてアクセスして前記意思表示を行っても良い。なお、本例においては、要求者が自らの個体関連情報を登録する場合について述べるが、これに限定されず、要求者が自分以外の個人（個体）に関する個体関連情報を登録することもできる。
【００４５】
次に、統計解析用コンピュータＳＴは、前記意思表示を個人用コンピュータ３から受信した後、ステップ２（Ｓ２）で、画面メモリー３８から、図１０に示すような「個体関連情報収集画面」を読み出し、個人用コンピュータ３の表示装置２２に表示する。ステップ２では、統計解析用コンピュータＳＴが提供するウェブページに個人用コンピュータ３がアクセスした状態で個体関連情報収集画面を表示しても良いし、個人用コンピュータ３に対して送信した個体関連情報収集画面データに基づいて個人用コンピュータ３の表示装置２２に表示しても良い。
【００４６】
次に、個人用コンピュータ３は、ステップ３（Ｓ３）で、個体関連情報収集画面に従って、要求者自身の個体関連情報を入力する。すなわち、要求者は、個体関連情報収集画面に表示されている設問に対して回答し、当該個体関連情報収集画面に回答内容を入力する。或いは、要求者は、個体関連情報収集画面に表示されている設問に対する回答を、当該個体関連情報収集画面とは異なる回答用画面に入力しても良い。また、ステップ３では、個体関連情報として利用者（要求者）等が医療機関や検査機関等で検査を受けることによって得た情報を入力しても良い。
【００４７】
次に、個人用コンピュータ３は、ステップ４（Ｓ４）で、個体関連情報収集画面に表示されている設問に対する回答（個体関連情報）を統計解析用コンピュータＳＴに対して送信する。ステップ４では、回答を入力した個体関連情報収集画面データ又は回答を入力した回答用画面データを、通信回線網１を介して送信することで前記設問に対する回答を統計解析用コンピュータＳＴに対して送信することができる。また、ステップ４では、個体関連情報として利用者（要求者）等が医療機関や検査機関等で検査を受けることによって得た情報を統計解析用コンピュータＳＴに対して送信することもできる。
【００４８】
次に、統計解析用コンピュータＳＴは、個人用コンピュータ３から個体関連情報を受信した後、ステップ５（Ｓ５）で、個人用コンピュータ３に対して、複数の多型番地を送信する。ステップ５で送信する複数の多型番地は、既定の多型番地であっても良いし、ランダムに選択した多型番地であっても良い。
【００４９】
次に、個人用コンピュータ３は、統計解析用コンピュータＳＴから複数の多型番地を受信した後、ステップ６（Ｓ６）で、読取り装置２５を駆動してゲノム関連情報記録媒体２４にアクセスする。次に、個人用コンピュータ３は、ステップ７（Ｓ７）で、統計解析用コンピュータＳＴより受信した複数の多型番地について、それぞれ対応する多型パターンを読み出す。そして、個人用コンピュータ３は、ステップ８（Ｓ８）で、ステップ７で読み出した多型パターンを対応する多型番地と関連付けて統計解析用コンピュータＳＴに対して送信する。すなわち、ステップ８では、統計解析用コンピュータＳＴより受信した複数の多型番地それぞれについて、対応する多型パターンを関連付けて送信する。なお、ステップ８で個人用コンピュータ３から統計解析用コンピュータＳＴに送信する際には、「Ｇｎｏ．」のような個人（個体）を特定しうる情報を送信しないことが望ましい。
【００５０】
ところで、本実施の形態では、ステップ５で統計解析用コンピュータＳＴが送信した多型番地を個人用コンピュータ３で受信し、受信した多型番地に対応する多型パターンをステップ８で個人用コンピュータ３が統計解析用コンピュータＳＴに対して送信している。しかしながら、これに限定されず、例えばステップ４において、個人用コンピュータ３が統計解析用コンピュータＳＴに対して「個体関連情報」を送信するとともに、既定の多型番地と当該多型番地に対応する多型パターンとを自発的に統計解析用コンピュータＳＴに対して送信してもよい。この場合は、上述したステップ５〜８までは行われず、ステップ４の後に以下のステップ９以降のステップを同様に行うことになる。
【００５１】
次に、統計解析用コンピュータＳＴは、個人用コンピュータ３から多型番地及び多型パターンを受信した後、ステップ９（Ｓ９）で、ゲノム関連情報ＤＢ４０にアクセスする。なお、ステップ９では、統計解析用コンピュータＳＴがゲノム関連情報ＤＢ４０を有さず、外部の機関がゲノム関連情報ＤＢ４０を有する場合には、通信回線網１を介して当該外部の機関が有するゲノム関連情報ＤＢ４０にアクセスする。
【００５２】
次に、統計解析用コンピュータＳＴは、ステップ１０（Ｓ１０）で、個人用コンピュータ３より受信した複数の多型番地及び多型パターンの組合せに基づいてゲノム関連情報ＤＢ４０を検索し、ゲノム関連情報ＤＢ４０に格納されたゲノム関連情報の中から、受信した複数の多型番地及び多型パターンの組合せを有する個人（個体）に関するゲノム関連情報を特定する。言い換えると、ステップ１０では、ゲノム関連情報ＤＢ４０に登録された複数の個人（個体）に関する複数のゲノム関連情報の中から要求者（要求者が送信した「個体関連情報」に関連する個人（個体））に関するゲノム関連情報を特定する。ステップ１０では、例えば、特定した要求者、すなわち個人（個体）に関するゲノム関連情報に対して「整理Ｎｏ．」を付与してもよい。或いは、ステップ１０では、ゲノム関連情報ＤＢ４０に登録している個人（個体）に関するゲノム関連情報に対して予め付与された「整理Ｎｏ．」を抽出しても良い。
【００５３】
次に、統計解析用コンピュータＳＴでは、ステップ１１（Ｓ１１）で、ステップ１０で特定した要求者（要求者が送信した「個体関連情報」に関連する個人（個体））に関するゲノム関連情報と、個人用コンピュータ３より受信した個体関連情報とを関連付ける。具体的には、ステップ１０で特定した要求者（要求者が送信した「個体関連情報」に関連する個人（個体））に関するゲノム関連情報に付与した「整理Ｎｏ．」、又はステップ１０で特定した要求者（要求者が送信した「個体関連情報」に関連する個人（個体））に関するゲノム関連情報について抽出した「整理Ｎｏ．」を、個体関連情報に関連付けて格納した、図６に示すような「個体関連情報ＤＢ４１」を作成する。
【００５４】
或いは、ステップ１０では、統計解析用コンピュータＳＴがゲノム関連情報ＤＢ４０を有する場合には、ゲノム関連情報ＤＢ４０に格納されている要求者（要求者が送信した「個体関連情報」に関連する個人（個体））に関するゲノム関連情報に対して、個人用コンピュータ３より受信した個体関連情報を直接関連付けて格納しても良い。
【００５５】
以上、図９に示すフローチャートに従えば、例えば所定の要求者について、当該要求者に関する個体関連情報を、当該要求者に関するゲノム関連情報に関連付けて統計解析用コンピュータＳＴに登録することができる。また、複数の要求者が図９に示すフローチャートに従ってそれぞれ個体関連情報を登録することによって、統計解析用コンピュータＳＴは、複数の個人（個体）に関する個体関連情報を有することとなる。言い換えると、統計解析用コンピュータＳＴは、複数の個体に関するゲノム関連情報についてそれぞれ個体関連情報を関連付けて格納したデータベースを作成することができる。
【００５６】
以上のように、本システムによれば、統計解析用コンピュータＳＴの「ゲノム関連情報ＤＢ４０」に、予め匿名化して記録しておいたゲノム関連情報に対して、後から匿名化した個体関連情報を統計解析用コンピュータＳＴに送信した場合であっても、当該ゲノム関連情報と当該個体関連情報とをリンクさせることができる。
【００５７】
なお、図９に示したフローチャートにおいては、統計解析用コンピュータＳＴは、ステップ５で送信した複数の多型番地について、個人用コンピュータ３より多型パターンを受信し、ステップ１０でゲノム関連情報ＤＢ４０を検索することによって要求者（要求者が送信した「個体関連情報」に関連する個人（個体））に関するゲノム関連情報を特定している。しかしながら、本情報処理システムにおいては、要求者（要求者が送信した「個体関連情報」に関連する個人（個体））に関するゲノム関連情報の特定に際してこの方法に限定されず、例えば、所定の「多型番地」及び「多型パターン」の組合せを、順次個人用コンピュータ３から統計解析用コンピュータＳＴに対して送信し、統計解析用コンピュータＳＴが要求者の特定を行っても良い。この場合、個人用コンピュータ３から所定の「多型番地」及び「多型パターン」の組合せを統計解析用コンピュータＳＴに対して順次自発的に送信しても良いし、統計解析用コンピュータＳＴから所定の「多型番地」に対応する「多型パターン」の提出を個人用コンピュータ３に対して順次要求し、個人用コンピュータ３が当該要求に対応した「多型パターン」を「多型番地」に関連付けて順次送信しても良い。
【００５８】
詳細には、統計解析用コンピュータＳＴは、要求者に関する１又は複数の「多型番地」及び「多型パターン」の組合せを受け取るステップと、受け取った「多型番地」及び「多型パターン」の組合せと一致する「多型番地」及び「多型パターン」の組合せを有する個体に関するゲノム関連情報をゲノム関連情報ＤＢ４０から検索するステップとを、検索の結果として１の個人（個体）に関するゲノム関連情報を要求者（要求者が送信した「個体関連情報」に関連する個人（個体））に関するゲノム関連情報として特定するまで繰り返すことで、ゲノム関連情報ＤＢ４０に含まれる所定の個人（個体）に関するゲノム関連情報を要求者（要求者が送信した「個体関連情報」に関連する個人（個体））に関するゲノム関連情報として特定することができる。
【００５９】
ところで、共用コンピュータ２は、メインＤＢ１４を用いて、以下のようにして利用者に対して所定の疾病の罹患可能性等の意味情報を提供することができる。すなわち、共用コンピュータ２は、利用者が「物品及び／又はサービスの要求」として、例えば、所定の疾病に関する自分の罹患可能性を教えて欲しいと要求する場合に、当該罹患可能性等の意味情報を提供できる。
【００６０】
なお、本情報処理システムにおいて、「物品及び／又はサービス」としては、所定の疾病の罹患可能性に限定されず、例えば、個人（個体）の体質に適合した医薬品、食品及び嗜好品等の物品や、個人（個体）の体質・性質に適合した情報等のサービスを含む意味である。
【００６１】
共用コンピュータ２が利用者に対して所定の疾病の罹患可能性を提供する情報処理システムにおいては、共用コンピュータ２のメモリー７に記録された処理プログラム１３及び個人用コンピュータ３のメモリー２３に記録された処理プログラム２７が例えば、図１１及び図１２に示すようなフローチャートに従って情報処理動作する。なお、図１１及び図１２に示すフローチャートにおいて、「（共）」と記載したステップは共用コンピュータ２における処理を意味し、「（個）」と記載したステップは個人用コンピュータ３における処理を意味している。
【００６２】
本情報処理システムは、ゲノム関連情報記録媒体２４を所持する各個人が個人用コンピュータ３を用いて通信回線網１を介して共用コンピュータ２にアクセスし、共用コンピュータ２のメインＤＢ１４に記録されている意味情報を利用するシステムである。なお、本情報処理システムは、複数人のゲノム関連情報２８がそれぞれ記録されたゲノム関連情報記録媒体２４を用い、各個人がゲノム関連情報記録媒体２４にアクセスするようなシステムであってもよい。
【００６３】
この場合、先ず、ステップＡ１（ＳＡ１）で、要求者が本システムを利用するにあたり、メモリー２３に記録されている処理プログラム２７を起動する。処理プログラム２７によって、個人用コンピュータ３の読取り装置２５を駆動してゲノム関連情報記録媒体２４にアクセスし、ゲノム関連情報記録媒体２４においてデータＩとして記録されている「Ｇｎｏ．」を読み出す。読み出した「Ｇｎｏ．」は、メモリー部２６に格納する。
【００６４】
次に、ステップＡ２（ＳＡ２）では、処理プログラム２７によって表示装置２２に表示された画面イメージに基づいて、要求者が提供を受けたい情報、例えば、「大腸がんの罹患可能性」（要求情報）を個人用コンピュータ３に入力するとともに、個人用コンピュータ３から通信回線網１を経由して共用コンピュータ２に「大腸がんの罹患可能性」及び「Ｇｎｏ．」を送信する。或いは、個人用コンピュータ３から通信回線網１を経由して共用コンピュータ２に対して、「大腸がんの罹患可能性」及び「Ｇｎｏ．」を書き込む。
【００６５】
次に、ステップＡ３（ＳＡ３）では、共用コンピュータ２が「大腸がんの罹患可能性」及び「Ｇｎｏ．」を受信する。受信した「大腸がんの罹患可能性」及び「Ｇｎｏ．」は、メモリー部Ａ１０に要求情報として格納する。
【００６６】
次に、ステップＡ４（ＳＡ４）では、要求情報を受信すると、メモリー７に記録されている処理プログラム１３を起動してメインＤＢ１４にアクセスする。なお、この処理プログラム１３は、共用コンピュータ２における処理を行うものである。
【００６７】
次に、ステップＡ５（ＳＡ５）では、処理プログラム１３に従って、メインＤＢ１４に記録されている「分類（疾患名）」を検索し、要求された「大腸がんの罹患可能性」（大腸がん）と一致するものを抽出する。
【００６８】
ステップＡ６（ＳＡ６）では、メインＤＢ１４に記録されているデータのなかから「大腸がんの罹患可能性」と一致した「分類（疾患名）」（大腸がん）に関連づけられた「多型番地」を読み出す。読み出した「多型番地」は、メモリー部Ａ１０に要求情報に関連づけた位置情報として格納する。すなわち、メモリー部Ａ１０には、所定の「Ｇｎｏ．」に対して「大腸がんの罹患可能性」及び「多型番地」が記録されることとなる。
【００６９】
次に、ステップＡ７（ＳＡ７）では、メモリー部Ａ１０に記録されている「Ｇｎｏ．」及び「多型番地」を個人用コンピュータ３に送信するとともに、送信する「多型番地」に対応する「多型パターン」を提出する命令情報を個人用コンピュータ３に送信する。また、このとき、要求情報の種類によっては、必要に応じて既往症や特徴等の付加的な情報の提出を命令してもよい。
【００７０】
次に、ステップＡ８（ＳＡ８）では、共用コンピュータ２から送信された「Ｇｎｏ．」、「多型番地」及び命令情報を受信する。受信した「Ｇｎｏ．」及び「多型番地」は、メモリー部２６に記録される。
【００７１】
次に、ステップＡ９（ＳＡ９）では、受信した命令情報に従って、ゲノム関連情報記録媒体２４に記録されているデータＩＩにアクセスする。ステップＡ１０（ＳＡ１０）では、処理プログラム２７に従ってゲノム関連情報記録媒体２４に記録されているデータＩＩを検索し、命令された多型番地の多型パターンを読み出し、多型番地と多型パターンとを関連づけてメモリー部２６に記録する。このとき、データＩに対してアクセスし、ステップＡ８で受信した「Ｇｎｏ．」が正しいか否かを確認することが好ましい。また、ステップＡ１０では、多型パターンのほかにデータＩＩＩ、データＩＶ及びデータＶに記録されている付加的な情報も同時に読み出し、必要に応じてメモリー部２６に記録してもよい。
【００７２】
次に、ステップＡ１１（ＳＡ１１）では、メモリー部２６に一時的に記録した多型番地に関連付けられた多型パターン及び必要に応じて記録された付加的な情報を、「Ｇｎｏ．」とともに通信回線網１を介して共用コンピュータ２に対して出力する。ステップＡ１２（ＳＡ１２）では、多型番地に関連付けられた多型パターン及び必要に応じて記録された付加的な情報を共用コンピュータ２で受信し、受信した多型パターンを多型番地と関連付けてメモリー部Ａ１０に記録する。
【００７３】
また、本例では、ステップＡ７において、共用コンピュータ２が「多型パターン」の提出を命令する命令情報を送出し、ステップＡ１０において、個人用コンピュータ３は命令情報に従って多型パターンをゲノム関連情報記録媒体２４から読み出している。しかしながら、本システムは、ステップＡ７において当該命令情報を送出しないシステムであってもよい。この場合、ステップＡ１０において、個人用コンピュータ３は、処理プログラム２７に従って、ステップＡ８で受信した多型番地に基づいてデータＩＩを検索し、受信した多型番地の多型パターンを読み出す。そして、個人用コンピュータ３は、ステップＡ１１で多型パターン等を共用コンピュータ２に対して出力する。この場合でも、共用コンピュータ２は、ステップＡ１２において、「大腸がんの罹患可能性」と一致した「分類（疾患名）」に関連づけられた「多型番地」の多型パターンを得ることができる。
【００７４】
次に、ステップＡ１３（ＳＡ１３）では、メインＤＢ１４にアクセスし、受信した多型番地及び多型パターンと一致するものを検索する。具体的には、メインＤＢ１４において、一つの多型番地に対して複数の多型パターンが記録されており、受信した多型番地及びその多型パターンがメインＤＢ１４においてどの多型パターンに一致しているのかを検索する。
【００７５】
次に、ステップＡ１４（ＳＡ１４）では、処理プログラム１３に従って、受信した多型パターンと一致した多型パターンに関連づけられている大腸がんに対する罹患可能性を読み出す。すなわち、ステップＡ１４では、要求者が提出した多型番地及び多型パターンに従って、要求者の大腸がんに対する罹患可能性を読み出すことができる。読み出した罹患可能性は、要求者の「Ｇｎｏ．」と関連づけてメモリー部Ａ１０に格納する。このとき、大腸がんに対する罹患可能性を、付加的な情報により補正したかたちで格納してもよいし、付加的な情報から得られるその他の情報を要求者の「Ｇｎｏ．」に関連づけて格納しても良い。
【００７６】
次に、ステップＡ１５（ＳＡ１５）では、メモリー部Ａ１０に格納した要求者の「Ｇｎｏ．」及び罹患可能性を意味情報として、通信回線網１を介して個人用コンピュータ３に対して送信する。ステップＡ１６（ＳＡ１６）では、個人用コンピュータ３が要求者の「Ｇｎｏ．」及び罹患可能性（意味情報）を受信する。受信した意味情報は、メモリー部２６に記録される。
【００７７】
次に、ステップＡ１７（ＳＡ１７）では、処理プログラム２７に従って、メモリー部２６に記録された意味情報から大腸がんに対する罹患可能性を表示装置２２に表示する。なお、ステップＡ１５からステップＡ１７の代わりに共用コンピュータ２が処理プログラム１３に従って意味情報を表示する画面を読み出し（作成し）、通信回線網１を経由して個人用コンピュータ３の表示装置２２に表示させることもできる。この場合においても、共用コンピュータ２から個人用コンピュータ３に対して意味情報が送信されたものとする。これにより、要求者は、ゲノム関連情報記録媒体２４に記録したゲノム関連情報２８を用いて大腸がんに対する罹患可能性を得ることができる。
【００７８】
さらに、本情報処理システムにおいては、共用コンピュータ２のメモリー７に記録された処理プログラム１３及び個人用コンピュータ３のメモリー２３に記録された処理プログラム２７が例えば、図１３に示すようなフローチャートに従って情報処理動作するものであってもよい。なお、図１３に示すフローチャートにおいても、「（共）」と記載したステップは共用コンピュータ２における処理を意味し、「（個）」と記載したステップは個人用コンピュータ３における処理を意味している。
【００７９】
ここでは、先ず、ステップＣ１（ＳＣ１）で、要求者が本システムを利用するにあたり、メモリー２３に記録されている処理プログラム２７を起動する。処理プログラム２７によって、個人用コンピュータ３の読取り装置２５を駆動してゲノム関連情報記録媒体２４にアクセスし、ゲノム関連情報記録媒体２４においてデータＩとして記録されている「Ｇｎｏ．」、データＩＩとして記録されている全ての「多型番地」及び「多型パターン」を読み出す。読み出した「Ｇｎｏ．」、「多型番地」及び「多型パターン」は、メモリー部２６に格納する。
【００８０】
次に、ステップＣ２（ＳＣ２）では、処理プログラム２７によって表示装置２２に表示された画面イメージに基づいて、要求者が提供を受けたい情報、例えば、「大腸がんの罹患可能性」（要求情報）を個人用コンピュータ３に入力するとともに、個人用コンピュータ３から通信回線網１を経由して共用コンピュータ２に「大腸がんの罹患可能性」と、メモリー部２６に記録されている「Ｇｎｏ．」、「多型番地」及び「多型パターン」とを送信する。
【００８１】
次に、ステップＣ３（ＳＣ３）では、共用コンピュータ２が「大腸がんの罹患可能性」、「Ｇｎｏ．」、「多型番地」及び「多型パターン」を受信する。受信した「大腸がんの罹患可能性」は要求情報としてメモリー部Ａ１０に記録され、「Ｇｎｏ．」、「多型番地」及び「多型パターン」も、メモリー部Ａ１０に格納される。共用コンピュータ２は、要求情報を受信すると処理プログラム１３を起動する。そして、ステップＣ４（ＳＣ４）では、処理プログラム１３に従って、メインＤＢ１４にアクセスする。
【００８２】
次に、ステップＣ５（ＳＣ５）では、処理プログラム１３に従って、メインＤＢ１４に記録されている「分類（疾患名）」を検索し、要求された「大腸がんの罹患可能性」（大腸がん）と一致するものを抽出する。
【００８３】
ステップＣ６（ＳＣ６）では、処理プログラム１３に従って、メインＤＢ１４にアクセスし、メインＤＢ１４から「大腸がん」に分類された「多型番地」、当該多型番地に対する全ての「多型パターン」、及び当該多型パターンに対する「罹患可能性」を読み出す。読み出した「多型番地」、「多型パターン」及び「罹患可能性」は、メモリー部Ａ１０に格納される。
【００８４】
次に、ステップＣ７（ＳＣ７）では、ステップＣ３で受信した「多型番地」及び「多型パターン」に基づいて、ステップＣ６でメモリー部Ａ１０に格納したデータを検索し、受信した「多型パターン」と一致した多型パターンに関連付けられた罹患可能性をメモリー部Ａ１０から抽出する。
【００８５】
ステップＣ８（ＳＣ８）では、ステップＣ７の結果、すなわち、ステップＣ３で受信した情報に含まれる多型パターンがメインＤＢ１４のいずれの多型パターンと一致するかに基づいて抽出した罹患可能性を、通信回線網１を介して個人用コンピュータ３に対して送信する。このとき、共用コンピュータ２は、抽出した罹患可能性を要求者の「Ｇｎｏ．」とともに送信する。
【００８６】
次に、ステップＣ９（ＳＣ９）では、共用コンピュータ２から送信された「Ｇｎｏ．」及び「罹患可能性」（意味情報）を受信する。受信した「Ｇｎｏ．」及び「罹患可能性」は、メモリー部２６に記録される。このとき、ゲノム関連情報記録媒体２４に記録されているデータＩにアクセスし、受信した「Ｇｎｏ．」が正しいか否かを確認することができる。
【００８７】
次に、ステップＣ１０（ＳＣ１０）では、処理プログラム２７に従って、メモリー部２６に記録された意味情報から大腸がんに対する罹患可能性を表示装置２２に表示する。なお、ステップＣ８からステップＣ１０の代わりに、共用コンピュータ２が処理プログラム１３に従って意味情報を表示する画面を読み出し（作成し）、通信回線網１を経由して個人用コンピュータ３の表示装置２２に表示させることもできる。この場合においても、共用コンピュータ２から個人用コンピュータ３に対して意味情報が送信されたものとする。これにより、要求者は、ゲノム関連情報記録媒体２４に記録したゲノム関連情報２８を用いて大腸がんに対する罹患可能性を得ることができる。
【００８８】
以上のように、図１１及び図１２に示したフローチャート或いは図１３に示したフローチャートに従って、要求者は意味情報を得ることができる。
さらに、本情報処理システムにおいては、要求者がゲノム関連情報記録媒体２４及び前記ゲノム関連情報記録媒体からデータＩＩに含まれる情報を除いた記録媒体のいずれも有さず、通信回線網１を介して個人用コンピュータ３と接続したゲノム関連情報記録媒体２４を備えるものであっても良い。このようなシステムの場合、要求者は、通信回線網１を介してゲノム関連情報記録媒体２４にアクセスし、ゲノム関連情報記録媒体２４に記録された「多型番地」及び「多型パターン」等の情報を個人用コンピュータ３にダウンロードできる。なお、この場合、ゲノム関連情報記録媒体２４は、複数の個人に関するゲノム関連情報を個人毎（「Ｇｎｏ．」毎）に記録したものであっても良い。
【００８９】
さらにまた、本発明は、上述したような共用コンピュータ２がメインＤＢ１４を有するような構成に限定されず、例えば、共用コンピュータ２と通信回線網１を介して接続されたメインＤＢ１４を備える情報処理システムにも適用される。この場合、共用コンピュータ２は、図１１及び図１２に示したフローチャート、或いは図１３に示したフローチャートにおいて、メインＤＢ１４に対して通信回線網１を介してアクセスする。この場合でも、本情報処理システムによれば、図１１及び図１２に示したフローチャート或いは図１３に示したフローチャートに従って要求者が所望の意味情報を得ることができる。
【００９０】
特に、この場合、共用コンピュータ２は、異なる機関又は団体が有する複数のメインＤＢ１４に対して通信回線網１を介してアクセスし、これら複数のメインＤＢ１４に含まれる意味情報を使用して、要求者に対する情報提供を行うことが可能となる。すなわち、本情報処理システムにおいては、図１１及び図１２に示したフローチャートにおけるステップＡ５で、図１３に示したフローチャートにおけるステップＣ５で、共用コンピュータ２が大腸がんの罹患可能性に関する情報を意味情報として有する様々なメインＤＢ１４にアクセスする。これにより、本情報処理システムによれば、要求者は、様々なメインＤＢ１４に含まれる情報に基づいて、大腸がんの罹患可能性に関する情報を得ることができる。
【００９１】
また、本システムは、図１１及び図１２に示したフローチャート或いは図１３に示したフローチャートにおいて、共用コンピュータ２が、いわゆるエージェントに対して、少なくとも個人用コンピュータ３から受け取った要求情報を送信し、意味情報（本例においては、「大腸がんに関する罹患可能性」）を、当該エージェントを介して得るものであってもよい。
【００９２】
このように、本システムにおいて、共用コンピュータ２は、図１１及び図１２に示したフローチャート或いは図１３に示したフローチャートに従って、物品及び／又はサービスを提供するために必要な「多型番地」及び「多型パターン」のセットを個人用コンピュータ３から受け取ることとなる。共用コンピュータ２は、多数の個人用コンピュータ３から受け取った「多型番地」及び「多型パターン」のセットを蓄積することができる。共用コンピュータ２は、蓄積した「多型番地」及び「多型パターン」のセットを種類毎にその数をカウントして、物品及び／又はサービスの提供と当該セットの種類との関係として、当該セットの種類毎に割合を求めることができる。
【００９３】
また、共用コンピュータ２は、統計解析用コンピュータＳＴのゲノム関連情報ＤＢ４０にアクセスし、蓄積した「多型番地」における多型パターンの種類毎の割合をゲノム関連情報ＤＢ４０から算出する。そして、共用コンピュータ２は、ゲノム関連情報ＤＢ４０から算出した割合と、蓄積したセットから求めたセットの種類毎の割合とを比較し、それぞれの割合から有意差を検討する。このように、共用コンピュータ２は、蓄積した「多型番地」と「多型パターン」のセットの中で、ゲノム関連情報ＤＢ４０における割合と有意に異なる割合のセットを抽出することができる。
【００９４】
例えば、特定の物品提供に際して、多型番地「１０００００」における多型パターンを受け取ったとする。なお、多型番地「１０００００」における多型パターンの種類は「多型１」、「多型２」及び「多型３」からなるとする。一定期間における物品提供によって、多型番地「１０００００」及びその多型パターンのセットを受け取ることにより、例えば、多型番地「１０００００」と多型パターン「多型１」のセット（以下セット１）、多型番地「１０００００」と多型パターン「多型２」のセット（以下セット２）、多型番地「１０００００」と多型パターン「多型３」のセット（以下セット３）をそれぞれ蓄積することができる。例えば、共用コンピュータ２において、セット１を５００個、セット２を３００個、セット３を２００個蓄積したとする。すなわち、蓄積した多型番地「１０００００」と多型パターンのセットのうち、セット１の割合は５０％であり、セット２の割合は３０％であり、セット３の割合は２０％であることになる。
【００９５】
一方、共用コンピュータ２は、ゲノム関連情報ＤＢ４０にアクセスし、ゲノム関連情報ＤＢ４０からセット１〜３の割合を、例えばセット１を７０％、セット２を２０％、セット３を１０％として算出したとする。この割合と、蓄積した多型番地「１０００００」と多型パターンのセットから求めたセットの種類毎の割合とを比較すると、例えば、蓄積したセット１〜３のなかで、セット１の割合が有意に少ないことが判る。
【００９６】
このような知見を得ることによって、共用コンピュータ２は、特定の物品提供の利用者には、セット１を有する者が有意に少なかったと認識することができる。そこで、共用コンピュータ２は、以下のようにして、「多型番地」及び「多型パターン」の特定のセット（例えば、セット１）を有する個体（個人）に特徴的な傾向を示す情報を得ることができる。
【００９７】
すなわち、図１４に示すフローチャートに従って情報処理することによって、共用コンピュータ２は特定のセットを有する個体（個人）に特徴的な傾向を示す情報を得ることができる。なお、図１４において、「（共）」と記載したステップは共用コンピュータ２における処理を意味し、「（統）」と記載したステップは統計解析用コンピュータＳＴにおける処理を意味している。
【００９８】
先ず、ステップＡで、共用コンピュータ２は、蓄積した「多型番地」と「多型パターン」のセットの全部又は一部を統計解析用コンピュータＳＴに送信し、送信した「多型番地」と「多型パターン」のセットを有する個体に特徴的な傾向を示す情報を提供するよう要求する。例えば、共用コンピュータ２は、多型番地「１０００００」と多型パターン「多型１」のセット（セット１）を送信することで、セット１を有する個体に特徴的な傾向を示す情報を提供するよう要求する。
【００９９】
次に、ステップＢで、統計解析用コンピュータＳＴは、「多型番地」と「多型パターン」のセットを受信し、「ゲノム関連情報ＤＢ４０」にアクセスし、受信した「多型番地」と「多型パターン」のセットを有する整理Ｎｏ（個体）を全て特定する。
【０１００】
次に、ステップＣで、統計解析用コンピュータＳＴは、個体関連情報ＤＢ４１にアクセスし、ステップＢで特定した整理Ｎｏ（個体）に関連付けられた個体関連情報を全て特定する。
【０１０１】
次に、ステップＤで、統計解析用コンピュータＳＴは、個体関連情報ＤＢ４１に含まれる個体関連情報の傾向と、ステップＣで特定した個体関連情報の傾向とを比較し、ステップＣで特定した個体関連情報の中でステップＢで特定した個体間において特に共通性の高い個体関連情報を抽出する。
【０１０２】
例えば、個体関連情報ＤＢ４１に含まれる全ての情報（個体）において、色の嗜好で紫色の好みがある割合が４０％であったとして、ステップＣで特定した個体関連情報において「紫色を好む」個体が５０％であったとすると、セット１を有する個体は紫色を好みやすい傾向があると言える。言い換えると、全体的には４０％の人しか紫色を好まないのに、セット１を有する個体の群では５０％もの人が紫色を好んでいることになる。
【０１０３】
このように、ステップＤでは、ステップＢで特定した整理Ｎｏ（個体）の群と、個体関連情報ＤＢ４１に含まれる全ての整理Ｎｏ（個体）の群とにおいて、所定の個体関連情報（紫色を好む）に該当する割合が統計的に有意に異なる個体関連情報を全て抽出する。次に、ステップＥで、統計解析用コンピュータＳＴは、ステップＤで抽出した全ての個体関連情報を共用コンピュータ２に対して送信する。
【０１０４】
次に、ステップＦで、共用コンピュータ２は、統計解析用コンピュータＳＴから個体関連情報を受信する。これにより、共用コンピュータ２は、ステップＡで送信した「多型番地」及び「多型パターン」のセットを有する個体（個人）に特徴的な傾向を示す情報として、個体関連情報を得ることができる。
【０１０５】
例えば、ステップＡにおいて共用コンピュータ２が特定の物品提供の利用者には、セット１を有する者が有意に少なかったとの認識に基づいて、セット１を統計解析用コンピュータＳＴに送信した場合、ステップＤで受信した個体関連情報の内容に応じて、例えば当該特定の物品の販売戦略を立てることができる。
【０１０６】
また、本システムは、ステップＡにおいて、共用コンピュータ２に代わって個人用コンピュータ３から、自己の「多型番地」及び「多型パターン」の特定のセットを送信するものであっても良い。この場合、統計解析用コンピュータＳＴは、受信したセットを有する個体に特徴的な傾向を示す情報として、個体関連情報を個人用コンピュータ３に対して送信する。これにより、個人用コンピュータ３では、自己のセットと同じセットを有する個体に特徴的な個体関連情報を取得することができる。このとき、例えば、本システムを利用する個人の側（個人用コンピュータ３）で所定の遺伝的体質・性質等に関係のある「多型番地」又は「多型番地の組合せ」を予め把握していれば、その「多型番地」又は「多型番地の組合せ」を統計解析用コンピュータＳＴに送信することによって、本システムを利用した個人は、取得した個体関連情報及びそれに関する情報に基づいて、例えば未だ発症していない生活習慣病に罹患する可能性が分かることとなり、自ら自発的に生活習慣を改めることができるなど、医療機関にかからずとも自ら予防医療的な行為が可能となる。
【０１０７】
さらに、本システムにおいて、共用コンピュータ２は、ステップＡで特定の「多型番地」のみを送信し、当該多型番地と当該多型番地において取りうる多型パターンとの全てのセットについて、特徴的な傾向を示す情報を提供するよう要求することもできる。この場合、統計解析用コンピュータＳＴは、ステップＥでセット毎に当該セットに特徴的な傾向を示す情報として、個体関連情報を共用コンピュータ２に対して送信する。この場合、共用コンピュータ２は、全てのセットのそれぞれについて特徴的な個体関連情報を得ることができるため、例えば、特定のテーラーメイド型の（要求者の特定の多型番地における多型パターンの相違に基づき提供され分けるべき）物品の提供に際して、当該物品の提供のために必要な多型番地における多型パターン毎に異なるきめ細かな広告宣伝・販売戦略を展開することができる。
【０１０８】
さらにまた、テーラーメイド型の物品でなく通常の物品であっても、特定の物品の購入者に多く保有されている「多型番地」及び「多型パターン」のセットが判明すれば、物品の提供者側から統計解析用コンピュータＳＴに対して当該セットを送信し、ステップＢからステップＦを行うことで、当該提供者側は当該購入者に多く見られる特徴的な個体関連情報を得ることができる。この個体関連情報を利用すれば、当該提供者側は、有効なマーケティング・広告宣伝・販売戦略を展開することができる。
【０１０９】
＜第２の実施の形態＞
次に、本発明を適用した第２の実施の形態として、例えば、保険会社（事業機関）が生命保険や医療保険等の保険料（以下、単に「保険料」と称する）を算出する際の基準となる情報を提供する情報処理システムについて説明する。
【０１１０】
以下の説明においては、保険会社が保険料を算出する際の基準となる情報を提供する情報処理システムについて説明するが、説明の都合上、簡略化したモデルとして説明する。本実施の形態として説明する情報処理システムは、上述した第１の実施の形態における情報処理システムと同様な構成及び用語については同じ名称、符号及び定義を使用することによって、その構成、動作及び用語の説明を省略する。
【０１１１】
本実施の形態において、保険会社は、図１に示した通信回線網１を介して共用コンピュータ２と統計解析用コンピュータＳＴとの間のデータ通信を可能とする保険会社端末を有している。保険会社端末は、図１５に示すフローチャートに従って情報処理することによって、保険会社端末は保険料を算出する際の基準となる情報を得ることができる。なお、図１５において、「（保）」と記載したステップは保険会社端末における処理を意味し、「（統）」と記載したステップは統計解析用コンピュータＳＴにおける処理を意味している。
【０１１２】
先ず、ステップＩ−１（ＳＩ−１）で保険会社は、保険会社端末を利用して、保険契約時に契約者の「多型番地」及び当該多型番地における「多型パターン」のセット（個体を特定するための情報）を取得する。このとき、当該セットは、統計解析用コンピュータＳＴの側で、契約者に相当する「匿名化された個体」を特定するための情報であって、完全に匿名化した状態で取得することが好ましい。また、当該セットとしては、他の個体との区別が可能となる種類・量のセットを取得する。さらに、取得するセットに含まれる多型番地及び多型パターンは、メインＤＢ１４において意味情報が関連付けられていないものであることが望ましい。さらにまた、保険会社は、例えば、保険会社に属する携帯端末などを用いて、契約者のゲノム関連情報記録媒体２４から当該セットを取得し、通信回線網１を介して保険会社端末に対してセットを送信することができるが、このとき、「Ｇｎｏ．」等の個人の識別を可能とする情報は取得・送信しないことが望ましい。これにより、保険会社においては、保険契約者と多型番地及び多型パターンのセットをリンクさせることができなくなる。
【０１１３】
次に、ステップＩ−２（ＳＩ−２）で、保険会社端末は、ステップＩ−１を複数の契約者について実行し、複数の契約者から取得した前記セットを契約者毎に区別して蓄積する。ステップＩ−２によって、保険会社端末には、契約者を特定することはできないが、他の契約者（個体）とは区別できる多型番地及び多型パターンのセットを、整理Ｎｏ．を付すなどして、契約者毎に区別した状態で蓄積したテーブルを作成することができる。
【０１１４】
次に、ステップＩ−３（ＳＩ−３）で、保険会社は、ステップＩ−２で蓄積した多型番地及び多型パターンのセット群、及び保険料の算定に関わる意味情報に関連付けられた多型番地を統計解析用コンピュータＳＴに対し送信し、送信した多型番地における多型パターン毎の割合を要求する。ここで、保険料の算定に関わる意味情報に関連付けられた多型番地とは、ある疾患の罹患可能性に関与する多型番地や、個人の体質に関与する多型番地等を挙げることができる。これら保険料の算定に関わる意味情報に関連付けられた多型番地は、例えば、共用コンピュータ２のメインＤＢ１４にアクセスすることによって取得することもできるし、保険会社が独自に所有する情報に基づいて設定しても良い。
【０１１５】
次に、ステップＩ−４（ＳＩ−４）で、統計解析用コンピュータＳＴは、保険会社端末からセット群及び多型番地を受信し、ゲノム関連情報ＤＢ４０にアクセスする。次に、ステップＩ−５（ＳＩ−５）で、統計解析用コンピュータＳＴは、受信した各セットに基づいてゲノム関連情報ＤＢ４０を検索し、各セットと一致する整理Ｎｏ．を抽出する。すなわち、ステップＩ−５によって、保険契約者の全てについてゲノム関連情報を特定できることとなる。
【０１１６】
次に、ステップＩ−６（ＳＩ−６）で、統計解析用コンピュータＳＴは、ステップＩ−５で特定した全てのゲノム関連情報のなかから、ステップＩ−４で受信した多型番地における多型パターン毎の割合を算出する。具体的に、ステップＩ−４で多型番地「０００００１」を受信したとし、多型番地「０００００１」における多型パターンが「多型１」、「多型２」及び「多型３」であるとする。この場合、ステップステップＩ−５で特定したゲノム関連情報に基づいて、多型番地「０００００１」が「多型１」である割合、多型番地「０００００１」が「多型２」である割合、多型番地「０００００１」が「多型３」である割合を算出する。
【０１１７】
次に、ステップＩ−７（ＳＩ−７）で、統計解析用コンピュータＳＴは、ステップＩ−６で算出した、ステップＩ−４で受信した多型番地における多型パターン毎の割合を保険会社端末に対して送信する。
【０１１８】
次に、ステップＩ−８（ＳＩ−８）で、保険会社端末は、統計解析用コンピュータＳＴから、保険料の算定に関わる意味情報に関連付けられた多型番地における多型パターン毎の割合を受信する。言い換えると、保険会社では、保険契約者から構成される集団において、保険料の算定に関わる意味情報に関連付けられた多型番地における多型パターン毎の割合を知ることができる。
【０１１９】
そして、保険会社は、ステップＩ−９（ＳＩ−９）において、ステップＩ−８で受信した割合に基づいて保険料率を算出することができる。例えば、ステップＩ−３において、ある疾患の罹患可能性に関与する多型番地や、個人の体質に関与する多型番地を統計解析用コンピュータＳＴに送信した場合、保険会社は、保険契約者からなる集団における「ある疾患の罹患可能性」や「体質」の傾向を知ることができ、これら「ある疾患の罹患可能性」や「体質」の傾向に基づいて保険料率を算出することができる。
【０１２０】
以上のように、本システムによれば、保険会社は、保険契約者からなる集団における遺伝的な傾向に基づいて保険料率を算出することができるため、合理的（経済的に誤算のない）保険料率を算定することができる。特に、本システムによれば、保険会社が、保険契約者個人について、保険料の算定に関わる意味情報に関連付けられた多型番地及び多型パターンを知ることがないため、多型パターンに基づく差別を防止することができる。
【０１２１】
したがって、保険会社は、保険契約者の遺伝子情報（多型パターン）に左右されることのない安定した事業を展開することができる。一方、保険契約者にとっては、従来の生命・医療保険システムと同様に、同じ条件で、同じ保険サービス内容を享受する場合には、遺伝子情報の如何に拘わらず全ての人が同じ保険料を支払うこととなり、平等性が確保される。以上のことから、本システムを利用することによって、テーラーメイド型社会が到来した後においても、本来あるべき健全な生命・医療保険システムを堅持できる。
【０１２２】
なお、本発明は保険会社が保険料率を算出する際の基準となる情報を提供する情報処理システムへの適用に限定されるものではない。すなわち、本発明は、例えば、学校教育等の教育者側が、上述した方法に準じて、教育対象の集団における遺伝的傾向の構成割合を取得することができる。そして、教育者側は、この遺伝的傾向の構成割合に基づいて教育活動を行うことで、教育対象の集団に適した教育サービスを提供することができる。
【０１２３】
このように、本発明は、多型パターンに基づいて、個体毎に異なる条件でサービスを提供することが差別に繋がったり、倫理的に問題があるような事業機関（例えば、保険会社）や、個体毎に異なるサービスを提供することが事実上困難であるような事業機関（例えば、教育機関）に対して適用することが大いに有効かつ効果的である。
【０１２４】
【発明の効果】
以上、詳細に説明したように、本発明によれば、塩基配列関連情報に基づいて個体関連情報を提供できる情報処理システム、及び塩基配列関連情報に基づいて保険料率の算定に使用できる情報提供できる情報処理システムを構築することができる。
【図面の簡単な説明】
【図１】本発明を適用した情報処理システムの構成を概略的に示す概略構成図である。
【図２】共用コンピュータの構成を概略的に示す概略構成図である。
【図３】メインＤＢに記録されたデータの一例を示す構成図である。
【図４】統計解析用コンピュータの構成を概略的に示す概略構成図である。
【図５】ゲノム関連情報ＤＢに記録されたデータの一例を示す構成図である。
【図６】個体関連情報ＤＢに記録されたデータの一例を示す構成図である。
【図７】個人用コンピュータの構成を概略的に示す概略構成図である。
【図８】ゲノム関連情報記録媒体に記録されたデータの一例を示す構成図である。
【図９】個体関連情報とゲノム関連情報とを関連連付ける情報処理システムにおいて、統計解析用コンピュータ及び個人用コンピュータでの処理を示すフローチャートである。
【図１０】個体関連情報収集画面の一例として示す画面イメージである。
【図１１】所定の疾病に対する罹患可能性を提供するシステムにおいて、共用コンピュータ及び個人用コンピュータでの処理を示すフローチャートである。
【図１２】図１６の続きであり、所定の疾病に対する罹患可能性を提供するシステムにおいて、共用コンピュータ及び個人用コンピュータでの処理を示すフローチャートである。
【図１３】所定の疾病に対する罹患可能性を提供するシステムにおいて、共用コンピュータ及び個人用コンピュータでの他の処理を示すフローチャートである。
【図１４】特定の多型番地及び多型パターンからなるセットを有する個体（個人）に特徴的な傾向を示す情報を得るための処理を示すフローチャートである。
【図１５】保険料を算出する際の基準となる情報を得るための処理を示すフローチャートである。
【符号の説明】
１…通信回線網、２…共用コンピュータ２…個人用コンピュータ、ＳＴ…統計解析用コンピュータ[0001]
BACKGROUND OF THE INVENTION
The present invention relates to an information processing system that provides information via, for example, a communication network.
[0002]
[Prior art]
Currently, genome base sequences of various organisms including humans are rapidly determined, and genome base sequence information is accumulated in various databases. For example, a system that allows various research institutions and researchers to use genome base sequence information stored in a database via an information communication network such as the Internet is being constructed.
[0003]
At the same time, research on genomic drug discovery and analysis of genetic information are being actively conducted using the base sequences included in such genomic base sequence information. Differences in the base sequence are attracting attention. In general, the difference in the base sequence between individuals is a polymorphism that is defined as having a predetermined base difference at a frequency of 1% or more in the individual species, and a predetermined base difference is less than 1% in the individual species. Means variation. In particular, in the polymorphism, a single nucleotide polymorphism (SNP: Single Nucleotide Polymorphism), which is a single base difference between individuals, is deleted or inserted. Insertion / deletion polymorphism, VNTR (variable number of tandem repeat) and microsatellite polymorphism (repetition sequence is about 2 to 4 bases) with different number of repetitions of sequences of 2 to several tens of bases as one unit )It has been known.
[0004]
Such polymorphisms may affect differences in protein amino acid sequences between individuals, differences in expression efficiency of a given gene between individuals, and the like. Due to such influence, it is known that, for example, the morbidity possibility with respect to a predetermined disease varies among individuals, and the sensitivity to a predetermined drug varies among individuals.
[0005]
However, at present, no system has been constructed that effectively uses the difference in base sequence information among individuals such as polymorphisms to provide useful semantic information for each individual.
[0006]
[Problems to be solved by the invention]
Therefore, in view of the current situation, the present invention is an information processing method related to a base sequence that can provide individual related information based on the base sequence related information, and is used for conducting business based on the base sequence related information. It aims at providing the information processing method regarding the base sequence which can provide the information which can be performed.
[0007]
[Means for Solving the Problems]
An information processing system related to a base sequence according to the present invention that has achieved the above-described object is an individual by associating a requester terminal that requests information indicating a characteristic tendency with respect to a predetermined individual, position information, and base sequence related information. A statistical analysis terminal having a first storage device stored for each individual and a second storage device storing individual related information regarding the individual for each individual. In this system, the statistical analysis terminal acquires the positional information transmitted from the requester side terminal, searches the first storage device and the second storage device, and acquires the positional information acquired from the requester side terminal. Individual-related information indicating a tendency characteristic of an individual having a set of one base sequence-related information among a plurality of base sequence-related information associated with the position information is acquired.
[0008]
In addition, the information processing system related to the base sequence according to the present invention collects information for identifying individuals included in the predetermined group, and determines the holding ratio of the base sequence related information corresponding to the predetermined position information in the group. And a requester side terminal provided in a business organization that conducts business, and a statistical processing terminal having a storage device that stores the positional information and the base sequence related information in association with each other. In this system, the statistical processing terminal obtains information for specifying individuals included in the group from the requester side terminal and position information corresponding to the base sequence related information for which the holding ratio is obtained, and stores the storage device. A search is performed, and a holding ratio for each set including the acquired position information and one base sequence related information among a plurality of pieces of base sequence related information associated with the position information in the group is calculated. In this system, the requester side terminal acquires the holding ratio calculated by the statistical processing terminal in the group, and creates information for conducting a business based on the holding ratio.
[0009]
In addition, the information processing system regarding the base sequence according to the present invention can be realized as a program that causes a computer including hardware such as a control device, a transmission / reception device, and a storage device to execute each step. The information processing system related to the base sequence according to the present invention can also be realized as a recording medium recording a program that causes a computer including hardware such as a control device, a transmission / reception device, and a storage device to execute each step. Furthermore, the information processing system related to the base sequence according to the present invention can also be realized as an information processing device including hardware such as a control device, a transmission / reception device, and a storage device that execute each step.
In addition, this invention has the structure as described in each claim.
[0010]
DETAILED DESCRIPTION OF THE INVENTION
Hereinafter, the present invention will be described in detail with reference to the drawings.
<First Embodiment>
First, as a first embodiment to which the present invention is applied, an information processing system that provides information indicating a tendency characteristic to an individual having a specific polymorphism area and a set of polymorphic patterns will be described.
[0011]
In the following description, an information processing system that provides information indicating a tendency characteristic to an individual having a specific polymorphism area and a set of polymorphic patterns will be described. However, for convenience of explanation, it is described as a simplified model. To do. As shown in FIG. 1, the information processing system includes a communication network 1 such as the Internet, a shared computer 2 connected to the communication network 1, and at least one or more personal computers 3 connected to the communication network 1. And a statistical analysis computer ST connected to the communication network 1, and enables data communication among the shared computer 2, the personal computer 3, and the statistical analysis computer ST via the communication network 1. .
[0012]
As shown in FIG. 2, the shared computer 2 includes a CPU 4 that controls all operations of the shared computer 2, an input device 5 such as a keyboard and a mouse that can input information and program execution instructions, and a display device and the like. Device 6, memory 7 for recording temporary information and non-rewritable information, database 8 for storing various data, and recording device for writing predetermined information in memory 7 and database 8 9 and a transmission / reception device 17 for transmitting / receiving information to / from the personal computer 3 via the communication line network 1.
[0013]
The memory 7 in the shared computer 2 includes a memory unit A10 and a memory unit B11 that record different types of information, a screen memory 12 that records image data to be displayed on the personal computer 3 or the display device 6, for example, and this system. And a processing program 13 for operating. The shared computer 2 does not have the screen memory 12 and the processing program 13 in the internal memory 7 but in an external storage device (not shown) connected to the shared computer 2 via the communication line network 1. It may be a thing.
[0014]
The database 8 (storage device) in the shared computer 2 includes a main DB 14 in which a multi-model area, a polymorphic pattern, and semantic information are recorded, a storage DB-A 15 that stores information recorded in the memory unit A10, and a memory unit. It consists of a storage DB-B16 that stores information recorded in B11. As shown in FIG. 3, the main DB 14 is recorded in association with a multi-model number area, a plurality of polymorph patterns that can be taken in the multi-model area, and semantic information that signifies each of the plurality of polymorph patterns. Yes. The main DB 14 may also record semantic information that means a combination of polymorphic patterns (for example, haplotypes) in a plurality of multimodel numbers.
[0015]
Here, “multi-model number location (position information)” means at least a position where a polymorphism exists in the base sequence. In general, the polymorphism includes, for example, so-called SNP (single nucleotide polymorphism), RFLP (restriction fragment length of polymorphism), VNTR (variable number of tentative rewrite), and so on. However, the “polymorphism” used in the present specification is not limited to these, and includes a base and a change (variation) in the base sequence that are present in the individual species with a frequency of less than 1%. Therefore, “multi-model address” means to include a position in a base sequence that indicates a change in base and base sequence that exists only at a frequency of less than 1% in an individual species. That is, the “multi-model number place” represents a position indicating a polymorphism or the like by combining numerical values, characters, symbols, and the like. The polymorphism location is not particularly limited, but for example, it can be expressed by a combination of a chromosome number and a symbol representing a gene in which the polymorphism exists and a numerical value indicating the location of the polymorphism in the gene. A combination of a symbol indicating an existing gene and a numerical value indicating the position of polymorphism in the gene may be used.
[0016]
The polymorphism number may be a polymorphic notation assigned to each polymorphism. When a polymorphism-specific notation is used as the polymorphism location, the polymorphism location does not directly indicate the position in the base sequence, but the position can be indirectly known based on the polymorphism-specific notation. Therefore, “multi-model number place” means to include notation unique to the polymorphism.
[0017]
“Polymorphism pattern (base sequence related information)” is information on a base sequence that differs between individuals, and means that it includes at least a base or base sequence pattern in the polymorphism. Furthermore, the “polymorphism pattern” is not limited to polymorphism, and includes a pattern of bases and base sequences that are present only at a frequency of less than 1% in individual species.
[0018]
For example, in a polymorphism area that is known to take A or G, the “polymorphism pattern” is represented by either “A” or “G”. Further, the “polymorphic pattern” may indicate a heterozygote or a homozygote in a homologous chromosome. In this case, for example, in a multi-model area that is known to take A or G, the “polymorphic pattern” can be expressed by any one of “AA”, “GG”, and “AG”. Furthermore, the “polymorphic pattern” does not directly represent a pattern that can be taken in a predetermined polymorphism area, but may indirectly represent it. That is, the “polymorphic pattern” is, for example, “allele 1” when taking “A” in a polymorphism area known to take A or G, and “allele 2” when taking “G”. May be written. Further, when the “polymorphic pattern” can be expressed by any of “AA”, “GG”, and “AG” as described above, for example, when it can be expressed by “AA”, it can be expressed by “α”, “GG”. When it can be expressed as “β”, when it can be expressed as “AG”, “γ” may be expressed. Other examples of “polymorphism pattern” are “repetition number” when the polymorphism is microsatellite, and “present / absent” symbol when the polymorphism is insertion or deletion. It may be written as
[0019]
Furthermore, the “polymorphic pattern” in each polymorphism area may be described as, for example, “polymorphic 1”, “polymorphic 2”, “polymorphic 3” or the like according to a predetermined rule or agreement. For example, “polymorphism 1”, “polymorphism 2”, and “polymorphism 3” can be expressed in descending order of the frequency that the “polymorphism pattern” can take in each polymorphism area. In this case, for example, each “polymorph 1” in each multi-model number area does not necessarily represent the same content. That is, for example, “polymorphism 1” of a certain model number area represents “AA” having the highest frequency, and another “model number 1” represents “GG” having the highest frequency. become. In the present embodiment, a method is used in which “polymorphism pattern” is represented by any one of “polymorphism 1”, “polymorphism 2”, “polymorphism 3”, and the like. By the way, in the present system, the polymorphic pattern may be encrypted or not encrypted.
[0020]
Here, the “semantic information” is information associated with the “polymorphism pattern”, and is based on, for example, responsiveness to drugs, side effects on drugs, risks to diseases and disorders, constitutions / properties, constitutions / properties, etc. It means various information resulting from differences in "polymorphic patterns" such as lifestyle advice and protein interactions.
[0021]
The “semantic information” may directly represent various information resulting from the difference in the “polymorphic pattern”, or may be indirectly represented using a symbol or the like meaning the information. The “semantic information” is information of a type that is corrected as the number of types increases as research on the genome / gene progresses, and it is preferable to always upgrade the version. In other words, the “semantic information” becomes more accurate by updating / accumulating the database using the research results of the genome / genes, thereby increasing / decreasing the accumulated amount.
[0022]
Information that is not directly related to the “polymorphic pattern” but is further derived from the “semantic information” is “information related to the semantic information”. When the “semantic information” is “risk for a disease”, when the risk exceeds a certain level, for example, a specific “health checkup item” is derived. This specific “health examination item” is “information related to semantic information”.
[0023]
In the present embodiment, the semantic information is recorded in the main DB 14 as “annotation information for a polymorphic pattern” associated with at least a predetermined “polymorphic pattern place” and “polymorphic pattern” as shown in FIG. Yes. The semantic information is associated with “polymorphic classification” and “classification (disease name)” corresponding to a predetermined “multi-model number location”. That is, when the predetermined “multi-model number” is a predetermined “polymorphic pattern”, it is possible to obtain annotation information (semantic information) indicating the type of disease name and the morbidity of the disease. Therefore, for example, the semantic information can be associated with a combination (for example, haplotype) of each polymorphic pattern corresponding to a plurality of polymorphism addresses. That is, annotation information (semantic information) indicating different morbidity with respect to a predetermined disease can be associated with each combination of polymorphic patterns in a plurality of polymorphism locations. In this case, when a plurality of multi-model numbers are a combination of predetermined polymorphic patterns, annotation information (semantic information) indicating the possibility of suffering from a predetermined disease can be obtained.
[0024]
Further, the “publication level” determined by a predetermined standard can be associated with the semantic information. For example, the criteria for determining the “publication level” should be determined in consideration of unforeseen disadvantages to individuals due to the disclosure of the semantic information, that is, here the possibility of “classification (disease name)”. Can do. Specifically, in the shared computer 2, in accordance with laws, rules, own behavior standards, or contracts with users, etc., a “publication level” is determined so that semantic information that is not appropriate to be disclosed is not disclosed. be able to. In this case, in the present system, the annotation information indicating the possibility of morbidity associated with the “public level” meaning that the information cannot be disclosed is not disclosed to the user. As a result, it is possible to prevent the user from being given semantic information that can be an unexpected disadvantage and to disclose the semantic information to anyone other than the contractor.
[0025]
In addition, when the user accepts the disclosure of the semantic information associated with the predetermined “public level” through informed consent, the semantic information associated with the predetermined “public level” is disclosed to the user. It may be a system that does this.
[0026]
Further, the “public level” can be set as a plurality of three or more stages such as “1, 2, 3,...” Or “a, b, c,. In this case, on the shared computer 2 side, the level can be set according to the type of the user, such as the user's age, qualification, and whether there is a contract with the user. The informed consent etc. is disclosed by the user side so that only annotation information indicating the possibility of morbidity associated with a public level higher than (or less than) the predetermined public level is provided to the user side. You can also select the level.
[0027]
In the database 8, for example, data such as base sequence related information that is genetic information of the individual requester who uses the system can be recorded in the storage DB-B 16. The storage DB-A 15 can record data such as information for specifying a requester who uses the system. As described above, the genetic information of the requester and the data specifying the requester are separately recorded in the storage DB-A 15 and the storage DB-B 16 by recording the individual genetic information and the information specifying the individual separately. It becomes difficult to relate.
[0028]
The shared computer 2 is not limited to the one having the database 8 therein, and may have an external database (not shown) connected to the shared computer 2 via the communication line network 1. The shared computer 2 may have a plurality of databases 8 inside, or may have an internal database 8 and an external database connected to the shared computer 2 via the communication line network 1. May be.
[0029]
As shown in FIG. 4, the statistical analysis computer ST includes a CPU 30 that controls all operations of the statistical analysis computer ST, an input device 31 such as a keyboard and a mouse that can input information and program execution instructions, and a display. A display device 32 such as a device, a memory 33 in which temporary information, rewritable information and non-rewritable information are recorded, a database 34 storing various data, and the memory 33 and the database 34 On the other hand, a recording device 35 for writing predetermined information and a transmission / reception device 36 for transmitting / receiving information between the shared computer 2 and the personal computer 3 via the communication network 1 are configured.
[0030]
The memory 33 in the statistical analysis computer ST temporarily stores a polymorphic pattern or the like sent from the personal computer 3 or records a statistical analysis result or the like, and a personal computer 3 or a shared computer, for example. 2 and screen memory 38 in which image data to be displayed on the display device 32 is recorded, and this system is operated, and “individual related information” regarding a plurality of individuals (individuals) and “genome related information” regarding a plurality of individuals (individuals); For example, a processing program 39 for creating semantic information such as annotation information indicating the possibility of suffering from a predetermined disease is recorded. According to the processing program 39, first, “genome related information” regarding a plurality of individuals (individuals) and “individual related information” regarding a plurality of individuals (individuals) are accumulated, By performing statistical processing using the “individual-related information”, it is possible to obtain annotation information (semantic information) indicating the morbidity of a predetermined disease. In the statistical analysis computer ST, the processing program 39 and the memory unit 37 are not included in the internal memory 33, but an external storage device (not shown) connected to the statistical analysis computer ST via the communication network 1. ).
[0031]
Here, “individual-related information” includes all information related to a predetermined individual such as the individual's properties, psychological state, constitution, physical condition, health condition, medical history, lifestyle, behavior / thought pattern, habit, and preferences. . For example, as individual-related information, information obtained from an individual (individual) such as a user answering a prepared question or the like, or, for example, a user is inspected by a medical institution or an inspection institution The information obtained by receiving can be mentioned.
The “genome related information” is a group of data (base sequence related information group) in which a plurality of “polymorphism patterns” related to a predetermined individual are associated with a predetermined “multi-model area”.
[0032]
The database 34 (storage device) in the statistical analysis computer ST has a “genome related information DB 40” (shown in FIG. 5) in which “genome related information” regarding a plurality of individuals (individuals) is recorded for each individual (individual). is doing. The “genome related information” recorded in the “genome related information DB 40” is preferably anonymized so as not to be directly linked to information for specifying an individual (individual).
[0033]
The database 34 also has an “individual related information DB 41” (shown in FIG. 6) in which the “individual related information” received from the personal computer 3 is recorded for each individual (individual). The “individual-related information” recorded in the “individual-related information DB 41” is preferably anonymized so as not to be directly linked to information for specifying an individual (individual).
[0034]
The statistical analysis computer ST is not limited to the one having the database 34 therein, and accesses an external database (not shown) connected to the statistical analysis computer ST via the communication network 1. It may be a thing.
[0035]
As shown in FIG. 7, the personal computer 3 includes a CPU 20 that controls all operations of the personal computer 3, an input device 21 such as a keyboard and a mouse that can input information and program execution instructions, a display device, and the like. Display device 22, memory 23 in which temporary information and rewritable information are recorded, reading device 25 that reads data from genome-related information recording medium 24, and shared computer 2 via communication network 1 And a transmission / reception device 29 for transmitting / receiving information between them. The personal computer 3 is not limited to a normal computer, and may be in any form such as a mobile phone, a personal mobile terminal, and other mobile communication devices.
[0036]
The memory 23 in the personal computer 3 has a memory unit 26 for recording information from the genome related information recording medium 24, and a processing program 27 for operating the information processing system is recorded therein.
[0037]
In the genome related information recording medium 24, individual genome related information 28 is recorded. Examples of the genome-related information recording medium 24 include a magnetic recording medium such as a magnetic disk and a magnetic card, an optical recording medium to which a magneto-optical recording method, a phase change recording method, and the like are applied, and a semiconductor memory. The genome-related information recording medium 24 may be in any form such as card, disk, stick, tape, or drum. Further, the genome-related information recording medium 24 may be one that records genome-related information 28 of a single individual (individual), but records a plurality of genome-related information 28 related to a plurality of individuals (individuals). It may be a thing.
[0038]
The genome-related information 28 included in the genome-related information recording medium 24 includes at least a “polymorphism pattern” in a predetermined multi-model area obtained as a result of analyzing the base sequence of the “multi-model area” and an individual (individual). means. In addition, the genome-related information 28 may include various types of information such as past illness, characteristics, medical record information, and health checkup results.
[0039]
In the genome related information recording medium 24, for example, as shown in FIG. 8, the individual number “Gno.” (G number) unique to the genome related information 28 and the date of birth, etc. Personal information is recorded, a multi-model area and a polymorphic pattern are recorded as data II, a past disease is recorded as data III, a feature is recorded as data IV, and medical record information and the like are recorded as data V. That is, the genome related information 28 includes data I, data II, data III, data IV, and data V. Data I and data II contain essential information, and data III, data IV and data V are composed of additional information.
[0040]
In the genome related information 28, the “multi-model number location” corresponding to the position on the base sequence and the “polymorphism pattern” in the multi-model location are linked and recorded. Further, in the data II, additional information in a predetermined multi-model address may be recorded as a “comment” linked to the “multi-model address”. Data II may record the entire base sequence related to a predetermined individual. Even when the entire base sequence is recorded in the data II, the “polymorphic pattern” and the “polymorphic pattern” are included in the data II.
[0041]
In the present invention, the personal computer 3 and the genome-related information recording medium 24 are not limited to the configurations shown in FIGS. 7 and 8, respectively. For example, the memory section in which the genome-related information recording medium has a processing program. The personal computer may be configured to operate the processing program by mounting the genome-related information recording medium. In this case, the personal computer can operate according to the processing program recorded in the memory section of the genome related information recording medium.
[0042]
In the information processing system configured as described above, the processing program 39 included in the memory 33 of the statistical analysis computer ST and the processing program 27 recorded in the memory 23 of the personal computer 3 are, for example, as shown in FIG. By performing the information processing operation according to the flowchart, the individual related information DB 41 can be constructed by accumulating the individual related information. In the flowchart shown in FIG. 9, the step described as “(T)” means processing in the statistical analysis computer ST, and the step described as “(Piece)” means processing in the personal computer 3. Yes.
[0043]
In this information processing system, each individual who possesses the genome-related information recording medium 24 accesses the statistical analysis computer ST through the communication network 1 using the personal computer 3, and "individual relations" The information is registered in the “individual related information DB 41” of the statistical analysis computer ST in association with the genome related information in the “genome related information DB 40”. The information processing system may be a system in which each individual accesses the genome-related information recording medium 24 using the genome-related information recording medium 24 in which the genome-related information 28 of a plurality of persons is recorded.
[0044]
At this time, first, in step 1 (S1), the requester accesses the statistical analysis computer ST via the communication network 1, and displays an intention to register the individual related information to the statistical analysis computer ST. I do. In step 1, the intention display may be performed by accessing a web page provided by the statistical analysis computer ST, or the intention display may be performed by accessing the statistical analysis computer ST using e-mail or the like. May be. In this example, the requester registers his / her individual related information. However, the present invention is not limited to this, and the requester can also register individual related information related to an individual (individual) other than himself / herself.
[0045]
Next, after receiving the intention display from the personal computer 3, the statistical analysis computer ST reads the “individual-related information collection screen” as shown in FIG. 10 from the screen memory 38 in step 2 (S2). And displayed on the display device 22 of the personal computer 3. In step 2, the individual related information collection screen may be displayed in a state where the personal computer 3 accesses the web page provided by the statistical analysis computer ST, or the individual related information collection transmitted to the personal computer 3 may be displayed. You may display on the display apparatus 22 of the personal computer 3 based on screen data.
[0046]
Next, in step 3 (S3), the personal computer 3 inputs the individual related information of the requester itself according to the individual related information collection screen. In other words, the requester answers the question displayed on the individual related information collection screen, and inputs the response content on the individual related information collection screen. Alternatively, the requester may input an answer to the question displayed on the individual related information collection screen on an answer screen different from the individual related information collection screen. In step 3, information obtained by a user (requester) or the like undergoing an examination at a medical institution or an examination institution may be input as individual related information.
[0047]
Next, in step 4 (S4), the personal computer 3 transmits an answer (individual related information) to the question displayed on the individual related information collection screen to the statistical analysis computer ST. In step 4, the individual-related information collection screen data to which the answer is input or the answer screen data to which the answer is input is transmitted via the communication network 1 so that the answer to the question is transmitted to the statistical analysis computer ST. can do. In step 4, information obtained by a user (requester) or the like undergoing an examination at a medical institution or examination institution as individual related information can be transmitted to the statistical analysis computer ST.
[0048]
Next, after receiving the individual-related information from the personal computer 3, the statistical analysis computer ST transmits a plurality of multi-model numbers to the personal computer 3 in step 5 (S5). The plurality of multi-model numbers transmitted in step 5 may be predetermined multi-model numbers or may be randomly selected multi-model numbers.
[0049]
Next, after receiving a plurality of multiple model numbers from the statistical analysis computer ST, the personal computer 3 drives the reader 25 to access the genome related information recording medium 24 in step 6 (S6). Next, in step 7 (S7), the personal computer 3 reads out the corresponding polymorphic patterns for a plurality of polymorphism addresses received from the statistical analysis computer ST. Then, in step 8 (S8), the personal computer 3 associates the polymorphic pattern read out in step 7 with the corresponding polymorphic address and transmits it to the statistical analysis computer ST. That is, in step 8, for each of a plurality of multi-model numbers received from the statistical analysis computer ST, a corresponding polymorph pattern is associated and transmitted. When transmitting from the personal computer 3 to the statistical analysis computer ST in step 8, it is desirable not to transmit information that can identify an individual (individual) such as “Gno.”.
[0050]
By the way, in the present embodiment, the personal computer 3 receives the multi-model number transmitted from the statistical analysis computer ST in step 5, and the polymorphic pattern corresponding to the received multi-model address is stored in step 8. Is transmitted to the statistical analysis computer ST. However, the present invention is not limited to this. For example, in step 4, the personal computer 3 transmits “individual-related information” to the statistical analysis computer ST, and a multi-address corresponding to the predetermined multi-model number and the multi-model number The mold pattern may be spontaneously transmitted to the statistical analysis computer ST. In this case, steps 5 to 8 described above are not performed, and the following steps after step 9 are similarly performed after step 4.
[0051]
Next, the statistical analysis computer ST receives the multi-model area and the polymorphism pattern from the personal computer 3, and then accesses the genome-related information DB 40 in step 9 (S9). In step 9, if the statistical analysis computer ST does not have the genome-related information DB 40 and the external organization has the genome-related information DB 40, the genome-related information that the external organization has via the communication line network 1 is used. The information DB 40 is accessed.
[0052]
Next, in step 10 (S10), the statistical analysis computer ST searches the genome related information DB 40 based on the combination of a plurality of polymorphism locations and polymorphic patterns received from the personal computer 3, and the genome related information DB 40 The genome related information related to an individual (individual) having a combination of a plurality of received polymorphic addresses and polymorphic patterns is identified from the genome related information stored in the. In other words, in step 10, the requester (individual (individual) related to the “individual-related information” transmitted by the requester) from among a plurality of genome-related information related to a plurality of individuals (individuals) registered in the genome-related information DB 40. ) To identify genome related information. In step 10, for example, “organization No.” may be assigned to the genome-related information regarding the specified requester, that is, the individual (individual). Alternatively, in Step 10, “Organization No.” previously assigned to the genome related information related to the individual (individual) registered in the genome related information DB 40 may be extracted.
[0053]
Next, in the statistical analysis computer ST, in step 11 (S11), genome-related information regarding the requester identified in step 10 (individual (individual) related to “individual-related information” transmitted by the requester), The individual related information received from the computer 3 is associated. Specifically, “Organization No.” given to the genome related information related to the requester specified in Step 10 (the individual (individual) related to the “individual related information” transmitted by the requester) or specified in Step 10 As shown in FIG. 6, the “rearrangement No.” extracted for the genome related information related to the requester (the individual (individual) related to the “individual related information” transmitted by the requester) is stored in association with the individual related information. “Individual-related information DB 41” is created.
[0054]
Alternatively, in step 10, if the statistical analysis computer ST has the genome-related information DB 40, the requester stored in the genome-related information DB 40 (individual (individual) related to the "individual-related information" transmitted by the requester). The individual related information received from the personal computer 3 may be stored in direct association with the genome related information regarding)).
[0055]
As described above, according to the flowchart shown in FIG. 9, for example, for a predetermined requester, individual-related information regarding the requester can be registered in the statistical analysis computer ST in association with the genome-related information regarding the requester. Further, when the plurality of requesters register the individual related information according to the flowchart shown in FIG. 9, the statistical analysis computer ST has the individual related information regarding the plurality of individuals (individuals). In other words, the statistical analysis computer ST can create a database in which individual related information is stored in association with genome related information related to a plurality of individuals.
[0056]
As described above, according to the present system, individual-related information that has been anonymized later is compared with genome-related information that has been previously anonymized and recorded in the “genome-related information DB 40” of the statistical analysis computer ST. Even when it is transmitted to the computer ST for statistical analysis, the genome related information and the individual related information can be linked.
[0057]
In the flowchart shown in FIG. 9, the statistical analysis computer ST receives the polymorphism pattern from the personal computer 3 for the plurality of polymorphism addresses transmitted in step 5, and stores the genome related information DB 40 in step 10. The genome-related information regarding the requester (the individual (individual) related to the “individual-related information” transmitted by the requester) is specified by searching. However, the present information processing system is not limited to this method for specifying genome-related information regarding a requester (an individual (individual) related to “individual-related information” transmitted by the requester). The combination of “model number location” and “polymorphic pattern” may be sequentially transmitted from the personal computer 3 to the statistical analysis computer ST, and the statistical analysis computer ST may identify the requester. In this case, a predetermined combination of “multi-model number” and “polymorphic pattern” may be transmitted from the personal computer 3 to the statistical analysis computer ST in sequence, or from the statistical analysis computer ST. Next, the personal computer 3 is sequentially requested to submit the “polymorphic pattern” corresponding to the “multi-model area”, and the personal computer 3 changes the “polymorphic pattern” corresponding to the request to the “multi-model area”. The data may be transmitted in association with each other.
[0058]
Specifically, the statistical analysis computer ST receives a combination of one or a plurality of “multi-model area” and “polymorphic pattern” related to the requester, and the received “multi-model area” and “polymorphic pattern”. A step of searching the genome-related information DB 40 for genome-related information relating to an individual having a combination of “multi-model location” and “polymorphic pattern” that matches the combination, and as a result of the search, genome-related information relating to one individual (individual) Is repeated until it is identified as genome-related information related to the requester (individual (individual) related to the “individual-related information” transmitted by the requester). Identifying information as genome-related information about the requester (the individual (individual) related to the “individual-related information” sent by the requester) It can be.
[0059]
By the way, the shared computer 2 can use the main DB 14 to provide semantic information such as the morbidity of a predetermined disease to the user as follows. That is, when the user requests the user to teach his / her morbidity regarding a predetermined disease, for example, as a “request for goods and / or services”, the shared computer 2 has semantic information such as morbidity. Can provide.
[0060]
In this information processing system, “articles and / or services” are not limited to the possibility of suffering from a predetermined disease. For example, articles such as pharmaceuticals, foods, and luxury goods that are suitable for the individual (individual) constitution. It also includes services such as information suitable for the constitution and nature of individuals (individuals).
[0061]
In the information processing system in which the shared computer 2 provides the user with the possibility of suffering a predetermined disease, the processing program 13 recorded in the memory 7 of the shared computer 2 and the memory 23 of the personal computer 3 are recorded. The processing program 27 performs an information processing operation according to flowcharts as shown in FIGS. 11 and 12, for example. In the flowcharts shown in FIG. 11 and FIG. 12, a step described as “(co)” means a process in the shared computer 2, and a step described as “(piece)” means a process in the personal computer 3. ing.
[0062]
In this information processing system, each individual who possesses the genome related information recording medium 24 accesses the shared computer 2 via the communication network 1 using the personal computer 3 and is recorded in the main DB 14 of the shared computer 2. It is a system that uses semantic information. The information processing system may be a system in which each individual accesses the genome-related information recording medium 24 using the genome-related information recording medium 24 in which the genome-related information 28 of a plurality of persons is recorded.
[0063]
In this case, first, in step A1 (SA1), when the requester uses this system, the processing program 27 recorded in the memory 23 is activated. The reading program 25 of the personal computer 3 is driven by the processing program 27 to access the genome related information recording medium 24 and “Gno.” Recorded as data I in the genome related information recording medium 24 is read. The read “Gno.” Is stored in the memory unit 26.
[0064]
Next, in step A2 (SA2), information that the requester wants to receive based on the screen image displayed on the display device 22 by the processing program 27, for example, “possibility of colorectal cancer” (request information) ) Is input to the personal computer 3, and “possibility of colorectal cancer” and “Gno.” Are transmitted from the personal computer 3 to the shared computer 2 via the communication network 1. Alternatively, “possibility of colorectal cancer” and “Gno.” Are written from the personal computer 3 to the shared computer 2 via the communication network 1.
[0065]
Next, in step A3 (SA3), the shared computer 2 receives “possibility of colorectal cancer” and “Gno.”. The received “possibility of colorectal cancer” and “Gno.” Are stored as request information in the memory unit A10.
[0066]
Next, in step A4 (SA4), when the request information is received, the processing program 13 recorded in the memory 7 is activated and the main DB 14 is accessed. The processing program 13 performs processing in the shared computer 2.
[0067]
Next, in step A5 (SA5), “classification (disease name)” recorded in the main DB 14 is searched according to the processing program 13, and the requested “possibility of colorectal cancer” (colorectal cancer). That match is extracted.
[0068]
In step A6 (SA6), the “multi-model address” associated with the “classification (disease name)” (colon cancer) that matches the “morbidity of colorectal cancer” from the data recorded in the main DB 14. "Is read out. The read “multi-model number address” is stored in the memory unit A10 as position information associated with the request information. That is, “possibility of colorectal cancer” and “multi-model number address” are recorded in the memory unit A10 for a predetermined “Gno.”.
[0069]
Next, in step A7 (SA7), “Gno.” And “multi-model number address” recorded in the memory unit A10 are transmitted to the personal computer 3 and “multi-model number address” corresponding to the transmitted “multi-model number address” is transmitted. The command information for submitting the “type pattern” is transmitted to the personal computer 3. At this time, depending on the type of request information, it may be instructed to submit additional information such as a past illness or a feature as necessary.
[0070]
Next, in step A8 (SA8), “Gno.”, “Multi-model address” and command information transmitted from the shared computer 2 are received. The received “Gno.” And “multi-model address” are recorded in the memory unit 26.
[0071]
Next, in step A9 (SA9), the data II recorded in the genome related information recording medium 24 is accessed according to the received command information. In step A10 (SA10), the data II recorded in the genome-related information recording medium 24 is searched according to the processing program 27, the commanded polymorphism pattern of the polymorphism area is read, and the polymorphism area and polymorphism pattern are obtained. The data is recorded in the memory unit 26 in association with each other. At this time, it is preferable to access the data I and confirm whether or not “Gno.” Received in step A8 is correct. In Step A10, in addition to the polymorphic pattern, additional information recorded in the data III, data IV, and data V may be simultaneously read and recorded in the memory unit 26 as necessary.
[0072]
Next, in step A11 (SA11), the polymorphic pattern associated with the polymorphism number temporarily recorded in the memory unit 26 and the additional information recorded as necessary are transmitted together with “Gno.” To the communication line. The data is output to the shared computer 2 via the network 1. In step A12 (SA12), the shared computer 2 receives the polymorphism pattern associated with the polymorphism address and additional information recorded as necessary, and associates the received polymorphism pattern with the polymorphism area and stores the memory. Record in part A10.
[0073]
In this example, in step A7, the shared computer 2 sends out command information instructing the submission of the “polymorphic pattern”. In step A10, the personal computer 3 records the polymorphic pattern in accordance with the command information in the genome related information recording. Reading from the medium 24. However, this system may be a system that does not transmit the command information in step A7. In this case, in step A10, the personal computer 3 searches the data II based on the multi-model number received in step A8 according to the processing program 27, and reads the received polymorphic pattern of the multi-model number. Then, the personal computer 3 outputs a polymorphic pattern or the like to the shared computer 2 at step A11. Even in this case, the shared computer 2 can obtain the polymorphism pattern of “multi-model area” associated with “classification (disease name)” that matches “possibility of colorectal cancer” in step A12. .
[0074]
Next, in step A13 (SA13), the main DB 14 is accessed, and a search is made for a match with the received polymorphism address and polymorphism pattern. Specifically, in the main DB 14, a plurality of polymorphic patterns are recorded for one polymorphism address, and the received polymorphism pattern and the polymorphism pattern match which polymorphic pattern in the main DB14. To find out.
[0075]
Next, in step A14 (SA14), according to the processing program 13, the possibility of suffering from colorectal cancer associated with the polymorphic pattern that matches the received polymorphic pattern is read out. That is, in step A14, according to the multi-model number area and polymorphism pattern submitted by the requester, it is possible to read out the possibility of suffering from colorectal cancer of the requester. The read possibility of morbidity is stored in the memory unit A10 in association with the requester “Gno.”. At this time, the possibility of suffering from colorectal cancer may be stored in a form corrected by additional information, or other information obtained from the additional information is stored in association with the requester “Gno.”. You may do it.
[0076]
Next, in step A15 (SA15), “Gno.” Of the requester stored in the memory unit A10 and the morbidity possibility are transmitted as semantic information to the personal computer 3 via the communication network 1. In step A16 (SA16), the personal computer 3 receives the requester's “Gno.” And the morbidity (semantic information). The received semantic information is recorded in the memory unit 26.
[0077]
Next, in step A17 (SA17), according to the processing program 27, the possibility of suffering from colorectal cancer is displayed on the display device 22 from the semantic information recorded in the memory unit 26. Note that instead of steps A15 to A17, the shared computer 2 reads (creates) a screen for displaying semantic information according to the processing program 13, and displays it on the display device 22 of the personal computer 3 via the communication network 1. You can also Also in this case, it is assumed that semantic information is transmitted from the shared computer 2 to the personal computer 3. Thereby, the requester can obtain the possibility of suffering from colorectal cancer using the genome related information 28 recorded in the genome related information recording medium 24.
[0078]
Furthermore, in this information processing system, the processing program 13 recorded in the memory 7 of the shared computer 2 and the processing program 27 recorded in the memory 23 of the personal computer 3 are processed according to the flowchart shown in FIG. It may operate. In the flowchart shown in FIG. 13, a step described as “(co-)” means a process in the shared computer 2, and a step described as “(number)” means a process in the personal computer 3. .
[0079]
Here, first, in step C1 (SC1), when the requester uses this system, the processing program 27 recorded in the memory 23 is activated. The reading program 25 of the personal computer 3 is driven by the processing program 27 to access the genome related information recording medium 24, and “Gno.” Recorded as data I in the genome related information recording medium 24 is recorded as data II. All “polymorphic pattern locations” and “polymorphic patterns” are read out. The read “Gno.”, “Multi-model address”, and “polymorph pattern” are stored in the memory unit 26.
[0080]
Next, in step C2 (SC2), information that the requester wants to receive based on the screen image displayed on the display device 22 by the processing program 27, for example, “possibility of colorectal cancer” (request information). ) Is input to the personal computer 3, and “possibility of colorectal cancer” is recorded in the shared computer 2 from the personal computer 3 via the communication network 1 and “Gno. "," Multi-model number "and" polymorphic pattern ".
[0081]
Next, in step C3 (SC3), the shared computer 2 receives “possibility of colorectal cancer”, “Gno.”, “Multi-model address”, and “polymorphism pattern”. The received “possibility of colorectal cancer” is recorded as request information in the memory unit A10, and “Gno.”, “Multi-model number area”, and “polymorphic pattern” are also stored in the memory unit A10. The shared computer 2 activates the processing program 13 when receiving the request information. In step C4 (SC4), the main DB 14 is accessed according to the processing program 13.
[0082]
Next, in step C5 (SC5), “classification (disease name)” recorded in the main DB 14 is searched according to the processing program 13, and the requested “possibility of colorectal cancer” (colorectal cancer). That match is extracted.
[0083]
In step C6 (SC6), the main DB 14 is accessed according to the processing program 13, and the “multi-model area” classified as “colon cancer” from the main DB 14, all “polymorphic patterns” for the multi-model area, and The “morbidity” for the polymorphic pattern is read out. The read “multi-model number”, “polymorphic pattern”, and “morbidity” are stored in the memory unit A10.
[0084]
Next, in step C7 (SC7), the data stored in the memory unit A10 in step C6 is searched based on the “multi-model address” and “polymorphic pattern” received in step C3, and the received “polymorphic pattern” is received. ”Is extracted from the memory part A10.
[0085]
In step C8 (SC8), the possibility of morbidity extracted based on the result of step C7, that is, the polymorphic pattern included in the information received in step C3 matches which polymorphic pattern in the main DB 14 is communicated. The data is transmitted to the personal computer 3 via the line network 1. At this time, the shared computer 2 transmits the extracted morbidity possibility together with the requester “Gno.”.
[0086]
Next, in step C9 (SC9), “Gno.” And “morbidity” (semantic information) transmitted from the shared computer 2 are received. The received “Gno.” And “morbidity” are recorded in the memory unit 26. At this time, it is possible to access the data I recorded in the genome related information recording medium 24 and confirm whether or not the received “Gno.” Is correct.
[0087]
Next, in step C10 (SC10), according to the processing program 27, the possibility of suffering from colorectal cancer is displayed on the display device 22 from the semantic information recorded in the memory unit 26. Instead of steps C8 to C10, the shared computer 2 reads (creates) a screen for displaying semantic information according to the processing program 13, and displays it on the display device 22 of the personal computer 3 via the communication network 1. It can also be made. Also in this case, it is assumed that semantic information is transmitted from the shared computer 2 to the personal computer 3. Thereby, the requester can obtain the possibility of suffering from colorectal cancer using the genome related information 28 recorded in the genome related information recording medium 24.
[0088]
As described above, the requester can obtain the semantic information in accordance with the flowcharts shown in FIGS. 11 and 12 or the flowchart shown in FIG.
Further, in this information processing system, the requester does not have either the genome-related information recording medium 24 or the recording medium excluding the information included in the data II from the genome-related information recording medium. In addition, a genome-related information recording medium 24 connected to the personal computer 3 may be provided. In the case of such a system, the requester accesses the genome-related information recording medium 24 via the communication network 1 and records “multi-model number” and “polymorphic pattern” recorded in the genome-related information recording medium 24. Can be downloaded to the personal computer 3. In this case, the genome-related information recording medium 24 may be one in which genome-related information related to a plurality of individuals is recorded for each individual (for each “Gno.”).
[0089]
Furthermore, the present invention is not limited to the configuration in which the shared computer 2 has the main DB 14 as described above. For example, the information processing system including the main DB 14 connected to the shared computer 2 via the communication network 1. Also applies. In this case, the shared computer 2 accesses the main DB 14 via the communication network 1 in the flowcharts shown in FIGS. 11 and 12 or the flowchart shown in FIG. Even in this case, according to the information processing system, the requester can obtain desired semantic information according to the flowcharts shown in FIGS. 11 and 12 or the flowchart shown in FIG. 13.
[0090]
In particular, in this case, the shared computer 2 accesses a plurality of main DBs 14 possessed by different organizations or organizations via the communication network 1, and uses the semantic information included in the plurality of main DBs 14 to request the requester. It is possible to provide information for. That is, in this information processing system, in step A5 in the flowcharts shown in FIG. 11 and FIG. 12, and in step C5 in the flowchart shown in FIG. To access the various main DBs 14. Thereby, according to the information processing system, the requester can obtain information on the possibility of suffering from colorectal cancer based on information included in various main DBs 14.
[0091]
Further, in the system shown in FIGS. 11 and 12, or in the flowchart shown in FIG. 13, the shared computer 2 transmits at least the request information received from the personal computer 3 to a so-called agent, Information (in this example, “possibility of being affected by colorectal cancer”) may be obtained via the agent.
[0092]
As described above, in the present system, the shared computer 2 can perform the “multi-model address” and the “multi-model address” and the “necessary to provide goods and / or services” according to the flowcharts shown in FIGS. A set of “polymorphic patterns” is received from the personal computer 3. The shared computer 2 can store a set of “multi-model number” and “polymorphic pattern” received from a large number of personal computers 3. The shared computer 2 counts the number of the accumulated “multi-model area” and “polymorphic pattern” sets for each type, and sets the set as the relationship between the provision of goods and / or services and the type of the set. The ratio can be obtained for each type.
[0093]
Further, the shared computer 2 accesses the genome-related information DB 40 of the statistical analysis computer ST, and calculates from the genome-related information DB 40 the ratio for each type of polymorphic pattern in the accumulated “multi-model area”. Then, the shared computer 2 compares the ratio calculated from the genome related information DB 40 with the ratio for each type of set obtained from the accumulated set, and examines a significant difference from each ratio. As described above, the shared computer 2 can extract a set having a ratio significantly different from the ratio in the genome-related information DB 40 from the set of the “multi-model area” and “polymorphism pattern”.
[0094]
For example, when providing a specific article, it is assumed that a polymorph pattern at the multi model number “100,000” is received. It is assumed that the types of polymorphism patterns in the polymorphism area “100000” are “polymorphism 1”, “polymorphism 2”, and “polymorphism 3”. By receiving the set of the multi-model number “100000” and the polymorphic pattern pattern by providing the article for a certain period of time, for example, a set of the multi-model number “100,000” and the polymorphic pattern “polymorph 1” (hereinafter set 1), A set of the multi-model area “100000” and the polymorphic pattern “polymorph 2” (hereinafter set 2), and a set of the multi-model area “100000” and the polymorph pattern “polymorph 3” (hereinafter set 3) are stored. Can do. For example, in the shared computer 2, it is assumed that 500 sets 1, 300 sets 2, and 200 sets 3 are accumulated. That is, of the accumulated set of the polymorphism number “100000” and the polymorphic pattern, the ratio of the set 1 is 50%, the ratio of the set 2 is 30%, and the ratio of the set 3 is 20%. Become.
[0095]
On the other hand, the shared computer 2 accesses the genome related information DB 40 and calculates the ratio of the sets 1 to 3 from the genome related information DB 40 as, for example, 70% for the set 1, 20% for the set 2, and 10% for the set 3. To do. When this ratio is compared with the ratio for each type of set obtained from the set polymorphic pattern number “100000” and the set of polymorphic patterns, for example, the ratio of set 1 is significant among the stored sets 1 to 3 It turns out that there are few.
[0096]
By obtaining such knowledge, the shared computer 2 can recognize that there are significantly fewer persons who have the set 1 among users who provide specific articles. Therefore, the shared computer 2 obtains information indicating a characteristic tendency of individuals (individuals) having a specific set (for example, set 1) of “polymorphic pattern” and “polymorphic pattern” as follows. be able to.
[0097]
That is, by performing information processing according to the flowchart shown in FIG. 14, the shared computer 2 can obtain information indicating a tendency characteristic of individuals (individuals) having a specific set. In FIG. 14, a step described as “(co-)” means a process in the shared computer 2, and a step described as “(superior)” means a process in the statistical analysis computer ST.
[0098]
First, in step A, the shared computer 2 transmits all or a part of the accumulated “multi-model number address” and “polymorphic pattern” set to the statistical analysis computer ST, and transmits the transmitted “multi-model number address” and “ Requests to provide information showing a characteristic trend for individuals with a set of “polymorphic patterns”. For example, the shared computer 2 provides information indicating a characteristic tendency to an individual having the set 1 by transmitting a set (set 1) of the polymorphism area “100000” and the polymorphism pattern “polymorphism 1”. Request that.
[0099]
Next, in step B, the statistical analysis computer ST receives a set of “multi-model address” and “polymorphism pattern”, accesses the “genome related information DB 40”, and receives the received “multi-model address” and “ All the arrangement numbers (individuals) having the set of “polymorphic pattern” are specified.
[0100]
Next, in step C, the statistical analysis computer ST accesses the individual related information DB 41 and specifies all the individual related information associated with the organization number (individual) specified in step B.
[0101]
Next, in step D, the statistical analysis computer ST compares the tendency of the individual related information included in the individual related information DB 41 with the tendency of the individual related information specified in step C, and the individual related specified in step C is compared. Among the information, individual-related information that is particularly highly common among individuals specified in step B is extracted.
[0102]
For example, in all the information (individuals) included in the individual related information DB 41, it is assumed that the ratio of purple preference for color preference is 40%. In the individual related information specified in Step C, the individual who likes “purple” Is 50%, it can be said that individuals with set 1 tend to prefer purple. In other words, while only 40% of people like purple overall, 50% of people with the set 1 prefer purple.
[0103]
As described above, in step D, predetermined group-related information (purple color is preferred) in the group of arrangement numbers (individuals) specified in step B and the group of all arrangement numbers (individuals) included in the individual-related information DB 41. ) To extract all the individual related information with statistically significantly different ratios. Next, in step E, the statistical analysis computer ST transmits all the individual related information extracted in step D to the shared computer 2.
[0104]
Next, in step F, the shared computer 2 receives individual related information from the statistical analysis computer ST. Thereby, the shared computer 2 can obtain individual-related information as information indicating a characteristic tendency of an individual (individual) having the set of “multi-model area” and “polymorphic pattern” transmitted in step A. .
[0105]
For example, when the shared computer 2 transmits the set 1 to the statistical analysis computer ST based on the recognition that the number of persons having the set 1 is significantly less than that of the user who provides the specific article in step A, the step D For example, according to the content of the individual related information received at, a sales strategy for the specific article can be established.
[0106]
In addition, in step A, the system may transmit a specific set of its own “multi-model address” and “polymorphic pattern” from the personal computer 3 instead of the shared computer 2. In this case, the statistical analysis computer ST transmits individual related information to the personal computer 3 as information indicating a characteristic tendency of the individual having the received set. Thereby, the personal computer 3 can acquire individual-related information characteristic of an individual having the same set as its own set. At this time, for example, the person using this system (personal computer 3) knows in advance “multi-model address” or “combination of multi-model addresses” which is related to a predetermined genetic constitution / property. Then, by sending the “multi-model address” or “combination of multi-model addresses” to the statistical analysis computer ST, the individual using this system can obtain the individual related information and information related thereto, For example, the possibility of suffering from a lifestyle-related disease that has not yet developed can be understood, and the lifestyle habit can be voluntarily changed.
[0107]
Further, in this system, the shared computer 2 transmits only a specific “multi-model number address” in step A, and all the sets of the multi-model number and the polymorphic patterns that can be taken in the multi-model number area are characteristic. It may also be requested to provide information indicating a trend. In this case, the statistical analysis computer ST transmits individual related information to the shared computer 2 as information indicating a tendency characteristic of the set for each set in step E. In this case, since the shared computer 2 can obtain characteristic individual-related information for each of all sets, for example, a specific tailor-made type (a difference in a polymorphic pattern in a specific multi-model number of a requester) When providing an article (which should be provided and divided based on), it is possible to develop a detailed advertising / sales strategy that is different for each polymorph pattern in the multi-model area necessary for the provision of the article.
[0108]
Furthermore, even if it is not a tailor-made type article but a normal article, provision of the article will be provided if the set of “multi-model pattern” and “poly-type pattern” that are often held by the purchaser of the specific article is found. By transmitting the set to the statistical analysis computer ST from the buyer side and performing Step B to Step F, the provider side can obtain characteristic individual-related information often seen in the purchaser. . By using this individual related information, the provider side can develop an effective marketing / advertisement / sales strategy.
[0109]
<Second Embodiment>
Next, as a second embodiment to which the present invention is applied, for example, when an insurance company (business institution) calculates insurance premiums for life insurance and medical insurance (hereinafter simply referred to as “insurance premiums”). An information processing system that provides reference information will be described.
[0110]
In the following description, an information processing system that provides information serving as a reference when an insurance company calculates an insurance premium will be described. However, for the convenience of explanation, it will be described as a simplified model. The information processing system described as the present embodiment uses the same names, symbols, and definitions for the same configurations and terms as those of the information processing system in the first embodiment described above, so that the configurations, operations, and terms The description of is omitted.
[0111]
In the present embodiment, the insurance company has an insurance company terminal that enables data communication between the shared computer 2 and the statistical analysis computer ST via the communication line network 1 shown in FIG. The insurance company terminal processes the information according to the flowchart shown in FIG. 15, whereby the insurance company terminal can obtain information serving as a reference when calculating the insurance premium. In FIG. 15, a step described as “(protect)” means a process in the insurance company terminal, and a step described as “(supervision)” means a process in the statistical analysis computer ST.
[0112]
First, in step I-1 (SI-1), the insurance company uses the insurance company terminal to set a set of “multi-type pattern” of the contractor and the “poly-type pattern” in the multi-type area at the time of insurance contract (individual Information for identifying). At this time, the set is information for specifying an “anonymized individual” corresponding to the contractor on the statistical analysis computer ST side, and is preferably obtained in a completely anonymized state. . Further, as the set, a set of types / quantities that can be distinguished from other individuals is acquired. Furthermore, it is desirable that the polymorphism area and polymorphism pattern included in the set to be acquired are those that are not associated with semantic information in the main DB 14. Furthermore, the insurance company acquires the set from the contractor's genome-related information recording medium 24 using, for example, a portable terminal belonging to the insurance company, and sets it to the insurance company terminal via the communication line network 1. However, at this time, it is desirable not to acquire or transmit information such as “Gno.” That enables individual identification. Thereby, in the insurance company, it becomes impossible to link the policyholder and the set of the multi-model area and the poly-pattern.
[0113]
Next, in step I-2 (SI-2), the insurance company terminal executes step I-1 for a plurality of contractors, and accumulates the sets acquired from the plurality of contractors for each contractor. . By the step I-2, the insurance company terminal cannot identify the contractor but sets a set of multi-model address and polymorphism pattern that can be distinguished from other contractors (individuals). It is possible to create a table that is stored in a state that is distinguished for each contractor.
[0114]
Next, in step I-3 (SI-3), the insurance company sets the polymorphism area and polymorphic pattern set group accumulated in step I-2 and the polymorphism associated with the semantic information related to the calculation of the insurance premium. The model number is transmitted to the statistical analysis computer ST, and the ratio of each polymorphic pattern in the transmitted multi-model number is requested. Here, the multi-model number associated with the semantic information related to the calculation of the insurance premium can include a multi-model number related to the possibility of suffering a certain disease, a multi-model number related to the personal constitution, and the like. . The multi-model address associated with the semantic information related to the calculation of the insurance premium can be acquired by accessing the main DB 14 of the shared computer 2, for example, or set based on information uniquely owned by the insurance company You may do it.
[0115]
Next, in step I-4 (SI-4), the statistical analysis computer ST receives the set group and the multi-model number address from the insurance company terminal, and accesses the genome related information DB 40. Next, in step I-5 (SI-5), the statistical analysis computer ST searches the genome related information DB 40 based on each received set, and arranges the arrangement number corresponding to each set. To extract. That is, in step I-5, genome related information can be specified for all policyholders.
[0116]
Next, in step I-6 (SI-6), the statistical analysis computer ST selects the polymorphism in the multi-model number received in step I-4 from all the genome related information specified in step I-5. The ratio for each pattern is calculated. Specifically, assuming that the multi-model number address “000001” is received in Step I-4, the polymorphism patterns in the multi-model number address “000001” are “polymorph 1”, “polymorph 2”, and “polymorph 3”. And In this case, based on the genome-related information identified in step I-5, the ratio of the multi-model number “000001” is “polymorphism 1”, the ratio of the multi-model number “000001” is “polymorphism 2”, The ratio that the multi-model area “000001” is “polymorph 3” is calculated.
[0117]
Next, in step I-7 (SI-7), the statistical analysis computer ST calculates the ratio of each polymorphic pattern in the polymorphism pattern received in step I-4, calculated in step I-6, to the insurance company terminal. Send to.
[0118]
Next, in step I-8 (SI-8), the insurance company terminal receives from the statistical analysis computer ST the ratio of each polymorphic pattern in the multimodel area associated with the semantic information related to the calculation of the insurance premium. To do. In other words, the insurance company can know the ratio of each polymorphism pattern in the multimodel area associated with the semantic information related to the calculation of the insurance premium in the group composed of the policyholders.
[0119]
And an insurance company can calculate an insurance premium rate based on the ratio received at step I-8 in step I-9 (SI-9). For example, in Step I-3, when the multi-model number area involved in the morbidity of a certain disease or the multi-model number area related to the personal constitution is transmitted to the statistical analysis computer ST, the insurance company receives the policy from the policyholder. The tendency of “morbidity of a certain disease” and “constitution” in a certain group can be known, and the insurance premium rate can be calculated based on the tendency of “morbidity of certain disease” and “constitution”.
[0120]
As described above, according to this system, insurance companies can calculate insurance premium rates based on genetic trends in a group of policyholders. The rate can be calculated. In particular, according to this system, the insurance company does not know the polymorphism pattern and polymorphism pattern associated with the semantic information related to the calculation of insurance premiums for individual policyholders. Can be prevented.
[0121]
Therefore, the insurance company can develop a stable business that is not affected by the genetic information (polymorphism pattern) of the policyholder. On the other hand, for policyholders, if they enjoy the same insurance service contents under the same conditions as in the conventional life / medical insurance system, everyone pays the same premium regardless of the genetic information. As a result, equality is ensured. From the above, by using this system, even after a tailor-made society has arrived, it is possible to maintain a sound life / medical insurance system that should be.
[0122]
Note that the present invention is not limited to application to an information processing system that provides information serving as a reference when an insurance company calculates an insurance premium rate. That is, according to the present invention, for example, an educator side such as school education can acquire the composition ratio of the genetic tendency in the education target population according to the method described above. And the educator side can provide the educational service suitable for the group to be educated by carrying out educational activities based on the composition ratio of this genetic tendency.
[0123]
As described above, the present invention is based on the polymorphism pattern, providing a service under different conditions for each individual leads to discrimination, or an ethical problem (for example, an insurance company), It is highly effective and effective to apply to business institutions (for example, educational institutions) where it is practically difficult to provide different services for each individual.
[0124]
【The invention's effect】
As described above in detail, according to the present invention, it is possible to provide information processing system that can provide individual related information based on base sequence related information, and information that can be used for calculation of insurance premium rate based on base sequence related information. An information processing system can be constructed.
[Brief description of the drawings]
FIG. 1 is a schematic configuration diagram schematically showing a configuration of an information processing system to which the present invention is applied.
FIG. 2 is a schematic configuration diagram schematically showing a configuration of a shared computer.
FIG. 3 is a configuration diagram showing an example of data recorded in a main DB.
FIG. 4 is a schematic configuration diagram schematically showing a configuration of a statistical analysis computer.
FIG. 5 is a block diagram showing an example of data recorded in a genome related information DB.
FIG. 6 is a configuration diagram showing an example of data recorded in an individual related information DB.
FIG. 7 is a schematic configuration diagram schematically showing a configuration of a personal computer.
FIG. 8 is a configuration diagram showing an example of data recorded on a genome-related information recording medium.
FIG. 9 is a flowchart showing processing in a statistical analysis computer and a personal computer in an information processing system for associating individual related information and genome related information.
FIG. 10 is a screen image shown as an example of an individual related information collection screen.
FIG. 11 is a flowchart showing processing in a shared computer and a personal computer in a system for providing morbidity for a predetermined disease.
FIG. 12 is a continuation of FIG. 16 and is a flowchart showing processing in a shared computer and a personal computer in a system for providing morbidity for a predetermined disease.
FIG. 13 is a flowchart showing another process in the shared computer and the personal computer in the system for providing the possibility of suffering from a predetermined disease.
FIG. 14 is a flowchart showing a process for obtaining information indicating a tendency characteristic of an individual (individual) having a set made up of a specific polymorphism area and a polymorphic pattern.
FIG. 15 is a flowchart showing a process for obtaining information serving as a reference when calculating an insurance premium.
[Explanation of symbols]
DESCRIPTION OF SYMBOLS 1 ... Communication line network, 2 ... Shared computer 2 ... Personal computer, ST ... Statistical analysis computer

Claims

A requester's terminal that requests information indicating a characteristic trend for a given individual;
A first storage device that stores position information indicating a position in a base sequence and base sequence related information in association with each individual, and a second storage device that stores individual related information about the individual for each individual; Acquires location information sent from the requester side terminal, searches the first storage device and the second storage device, and associates the location information acquired from the requester side terminal with the location information. A terminal for statistical analysis that acquires individual related information indicating a tendency characteristic of an individual having a set consisting of one base sequence related information among a plurality of base sequence related information obtained,
An information processing system relating to a base sequence.

Obtaining a position information indicating a position in the base sequence;
The first storage device that stores the position information and the base sequence related information in association with each other is searched, and the position information acquired in step a and the plurality of base sequence related information associated with the position information A step b for specifying a set consisting of 1 base sequence related information;
The second storage device storing the individual related information about the individual for each individual is searched, the tendency of the individual related information in the individual having the set specified in the step b, and all or the stored in the second storage device An information processing method relating to a base sequence, comprising the step c of comparing individual-related information trends in some individuals and obtaining individual-related information indicating a characteristic tendency of individuals having the set specified in the step b .

In the step a, the base sequence related information associated with the position information is acquired together with the position information,
3. The information processing according to claim 2, wherein in the step c, individual related information indicating a tendency characteristic of an individual having a set including the position information and the base sequence related information acquired in the step a is acquired. Method.

In the step c, a statistical distribution using the individual related information about all or some of the individuals stored in the second storage device as a parameter and the individual related information in the individual having the set specified in the step b 3. The information processing method according to claim 2, wherein individual related information showing a significant difference is extracted by comparing with a statistical distribution as a number.

A business that collects information for identifying individuals included in a given group and conducts business using the percentage of base sequence related information corresponding to the given position information that means the position in the base sequence in that group. The requester's terminal provided by the institution;
A storage device that stores position information and base sequence related information in association with each individual and includes a base sequence for which information for specifying individuals included in the group and the holding ratio are obtained from the requester side terminal The position information corresponding to the related information is acquired, the storage device is searched, and the base position related information of the plurality of base sequence related information associated with the acquired position information and the position information in the group is acquired. And a statistical processing terminal that calculates the holding ratio for each set consisting of
The information processing system using a base sequence in which the requester side terminal acquires the holding ratio calculated by the statistical processing terminal in the group and creates information for performing the business based on the holding ratio.

Receiving step a for receiving information for identifying individuals included in the predetermined group and position information indicating a position in the base sequence;
Search the storage device that stores the positional information and the base sequence related information in association with each individual, identify individuals included in the group using information for identifying the individual received in step a, A step b for calculating a holding ratio for each set consisting of the position information received in step a and one base sequence related information among a plurality of base sequence related information associated with the position information;
Information processing method using a base sequence having

7. The information processing method according to claim 6, wherein the information for specifying the individual is base sequence related information associated with position information different from the position information received in step a.

The information processing method according to claim 6, wherein the position information received in step a is position information used to create information for conducting a business.