JPH05128171A

JPH05128171A - Phylogenetic tree output device

Info

Publication number: JPH05128171A
Application number: JP3293215A
Authority: JP
Inventors: Koji Tajima; 耕治田嶋
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 1991-11-08
Filing date: 1991-11-08
Publication date: 1993-05-25

Abstract

PURPOSE:To generate phylogenetic tree information in which two arrangements having maximum similarity are reconstructed into one new arrangement from a similarity matrix, to obtain the maximum similarity of each node based on the phylogenetic tree information, to generate the phylogenetic trees in order of maximum similarity and to arrange the phylogenetic trees in the ascending order of similarity so as to facilitate the recognition and comparison of them. CONSTITUTION:Phylogenetic tree information 5 is prepared by repeatedly generating new arrangement and similarity for a pair consisting of a similarity matrix 4 formed of the similarity between the arrangements preliminarily obtained and the maximum similarity in the similarity matrix 4. A larger one of the arrangement of the pair of the phylogenetic tree information 5 is obtained as maximum similarity and added to the information 5. Then the phylogenetic tree is generated in the order of maximum similarity of the added phylogenetic tree information 5 and outputted.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は、系統樹を出力する系統
樹出力装置に関するものである。分子生物学の分野で
は、進化の過程における種々の遺伝子の系統関係を見る
ことが必要となる。図１４はその例を示す。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a system tree output device for outputting a system tree. In the field of molecular biology, it is necessary to look at the phylogenetic relationships of various genes during evolution. FIG. 14 shows an example thereof.

【０００２】ＤＮＡやタンパク質配列の系統樹を図で計
算機上に表示するには、グラフィックソフトを用いる必
要があるが、これは用いる計算機の種類やソフトウェア
の種類に依存するので、表示ツールやその結果をより広
く用いるには制約となる。そこでそのような制約を受け
にくい方法の開発が望まれている。Graphic software must be used to graphically display a phylogenetic tree of DNA or protein sequences on a computer. However, this depends on the type of computer and software used, and therefore the display tool and its result. Is a constraint to use more widely. Therefore, it is desired to develop a method that does not easily suffer such restrictions.

【０００３】[0003]

【従来の技術】通常のＵＰＧＭＡ法（館野：分子系統樹
の作り方とその評価、木村編、分子進化学入門、培風
館、１９８４）などで系統樹を表示すると、類似度ある
いは距離の値から言って、単調に増加するような枝の配
置にならず、図１４に示すように、不揃いとなる。以下
上述のＵＰＧＭＡ方法による系統樹の作成法について簡
単に説明する。2. Description of the Related Art When a phylogenetic tree is displayed by the usual UPGMA method (Tateno: How to make and evaluate molecular phylogenetic tree, Kimura ed., Introduction to Molecular Susumu Kagaku, Baifukan, 1984), the similarity or distance value The arrangement of the branches does not increase monotonically, resulting in unevenness as shown in FIG. The method of creating a phylogenetic tree by the UPGMA method described above will be briefly described below.

【０００４】（１）表示しようとする配列について予
め求めた類似度行列中の類似度Ｓ（ｉ、ｊ）（ｉ、ｊは
配列の番号であって１ないしＮ）のうちで、最大値を求
める。そのときのＳ（ｉ、ｊ）をｓｍａｘ、対となる配
列番号ｉ、ｊの値をｉｍａｘ、ｊｍａｘで表す。(1) The maximum value of the similarity S (i, j) (i, j is the array number, 1 to N) in the similarity matrix previously obtained for the array to be displayed is Ask. At that time, S (i, j) is represented by smax, and the values of the paired array element numbers i, j are represented by imax, jmax.

【０００５】（２）この２つの値ｉｍａｘ、ｊｍａｘ
を持つ配列を配列グループＮ＋１とする。このグループ
化によって、その他の配列とのＳ（ｉ、ｊ）は計算し直
される。ｋ番目の配列とｉｍａｘ、ｊｍａｘをひとまと
めにした配列グループＮ＋１との類似度Ｓ（ｋ、Ｎ＋
１）は、Ｓ（ｋ、ｉｍａｘ）とＳ（ｋ、ｊｍａｘ）の平
均値とする。ｉｍａｘ＞Ｎあるいはｊｍａｘ＞Ｎのとき
は、そのノードに含まれる配列どうしの類似度すべてに
ついての平均値を計算する。(2) These two values imax, jmax
An array having N is set as an array group N + 1. By this grouping, S (i, j) with other arrays is recalculated. The similarity S (k, N +) between the k-th array and the array group N + 1 in which imax and jmax are grouped together
1) is an average value of S (k, imax) and S (k, jmax). When imax> N or jmax> N, the average value of all the similarities between the sequences included in the node is calculated.

【０００６】（３）以上の手順を最後の一組の配列グ
ループ２Ｎ−１になるまで繰り返す。結果として、各ス
テップでのｉｍａｘ、ｊｍａｘ、Ｓ（ｉｍａｘ、ｊｍａ
ｘ）を記憶する。これらの結果を用いて系統樹を書く
と、図１４のようになる（詳細は上述した書籍を参
照）。(3) The above procedure is repeated until the final set of array groups 2N-1 is reached. As a result, imax, jmax, S (imax, jma at each step
x) is stored. When a phylogenetic tree is written using these results, it becomes like FIG. 14 (for details, refer to the above-mentioned book).

【０００７】[0007]

【発明が解決しようとする課題】上述した図１４に示す
ような系統樹は、類似度や距離が単調に増加せず、不揃
いとなり遺伝子のタンパク質の配列のグループを見分け
るのに不都合であるという問題があった。また、マルチ
プルアライメントの表示でも類似度あるいは距離の値か
らグループにまとめてながめるのにも不都合であるとい
う問題があった。そこで、このような不揃いのない系統
樹の表示法が望まれている。The problem with the phylogenetic tree as shown in FIG. 14 is that the degree of similarity and distance do not increase monotonically, resulting in misalignment and inconvenience in distinguishing groups of protein sequences of genes. was there. Further, there is a problem that even in the case of displaying multiple alignment, it is inconvenient to read the values of the similarity or the distance in a group. Therefore, a method for displaying a phylogenetic tree without such irregularity is desired.

【０００８】本発明は、類似度行列から最大の類似度の
２つの配列を新たな１つの配列にした系統樹情報を生成
し、この系統樹情報をもとに各ノードの最大類似度を求
め、この最大類似度の順に系統樹を生成し、類似度が徐
々に増加するように揃えて配置して見やすくかつ比較し
易くすることを目的としている。According to the present invention, phylogenetic tree information in which two sequences having the maximum similarity are converted into a new sequence from the similarity matrix, and the maximum similarity of each node is obtained based on this phylogenetic tree information. The purpose is to generate phylogenetic trees in the order of the maximum similarity and arrange them so that the similarity gradually increases so that they are easy to see and compare.

【０００９】[0009]

【課題を解決するための手段】図１は、本発明の原理構
成図を示す。図１において、系統樹処理部２は、類似度
行列４をもとに系統樹情報５を生成したり、この系統樹
情報５から最大類似度を求めて付加したり、最大類似度
を付加した系統樹情報５をもとに系統樹６を生成したり
するものである。FIG. 1 is a block diagram showing the principle of the present invention. In FIG. 1, the phylogenetic tree processing unit 2 generates phylogenetic tree information 5 based on the similarity matrix 4, obtains and adds the maximum similarity from the phylogenetic tree information 5, and adds the maximum similarity. A phylogenetic tree 6 is generated based on the phylogenetic tree information 5.

【００１０】類似度行列４は、配列の間の類似度を予め
求めて行列としたものである。系統樹情報５は、類似度
行列４中の最大類似度の対の配列について新たな配列お
よび類似度を生成することを繰り返して作成、および当
該系統樹情報５の対の配列のうちの大きい方の類似度を
最大類似度として求めて付加したものである。The similarity matrix 4 is a matrix in which the similarity between arrays is obtained in advance. The phylogenetic tree information 5 is created by repeatedly generating a new sequence and similarity for the pair of sequences having the maximum similarity in the similarity matrix 4, and the larger one of the paired sequences of the phylogenetic tree information 5 is generated. Is calculated and added as the maximum similarity.

【００１１】系統樹６は、系統樹情報５から生成した配
列を見やすく配置した系統樹である。出力部３は、系統
樹６をディスプレイ７やプリンタ８に出力するものであ
る。The phylogenetic tree 6 is a phylogenetic tree in which the sequences generated from the phylogenetic tree information 5 are arranged in an easy-to-see manner. The output unit 3 outputs the system tree 6 to the display 7 and the printer 8.

【００１２】[0012]

【作用】本発明は、図１に示すように、系統樹処理部２
が予め求めた類似度行列４の配列中の最大類似度の対の
配列について新たな配列および類似度を生成することを
繰り返して系統樹情報５を作成およびこの系統樹情報５
の対の配列のうちの大きい方の類似度を最大類似度とし
て求めて付加し、この付加した系統樹情報５の最大類似
度の順に系統樹６を生成し、出力部３がこの系統樹６を
出力（表示、印刷）するようにしている。In the present invention, as shown in FIG. 1, the phylogenetic tree processing unit 2
Generate a new phylogenetic tree information 5 by repeatedly generating new sequences and similarities with respect to the pair of sequences having the maximum similarity in the sequence of the similarity matrix 4 obtained in advance and the phylogenetic tree information 5
The larger similarity of the paired sequences is calculated as the maximum similarity and added, and the phylogenetic tree 6 is generated in the order of the maximum similarity of the added phylogenetic tree information 5, and the output unit 3 outputs the phylogenetic tree 6 Is output (displayed, printed).

【００１３】最大類似度を付加した系統樹情報５の最大
類似度の順に順次系統樹をキャラクタ（例えば＋、−、
｜など）を用いて出力するようにしている。また、最大
類似度を付加した系統樹情報５の最大類似度の昇順（あ
るいは降順）に展開して配列番号、配列名を並べ、垂直
方向にこれら配列番号、配列名を出力、およびこれらに
子供を結合して系統樹をグラフィカルに生成して出力す
るようにしている。Characters of the phylogenetic tree (eg, +,-,
| Etc.) is used for output. Also, the sequence numbers and sequence names are arranged in ascending order (or descending sequence) of the maximum similarity of the phylogenetic tree information 5 with the maximum similarity added, and these sequence numbers and sequence names are output in the vertical direction Are combined to generate a graphical tree and output it graphically.

【００１４】従って、類似度行列４から最大の類似度の
２つの配列を新たな１つの配列にした系統樹情報５を生
成し、この系統樹情報５をもとに各ノードの最大類似度
を求めて付加し、この最大類似度の順に系統樹を生成す
ることにより、類似度が徐々に増加するように揃えて配
置して見やすくかつ比較し易くすることが可能となる。Therefore, from the similarity matrix 4, the phylogenetic tree information 5 in which the two arrays having the maximum similarity are combined into a new array is generated, and based on this phylogenetic tree information 5, the maximum similarity of each node is calculated. By obtaining and adding and generating a phylogenetic tree in the order of the maximum similarity, it is possible to arrange them so that the similarity gradually increases so that they are easy to see and compare.

【００１５】[0015]

【実施例】次に、図１から図１３を用いて本発明の実施
例の構成および動作を順次詳細に説明する。DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS Next, the construction and operation of an embodiment of the present invention will be sequentially described in detail with reference to FIGS.

【００１６】図１は、本発明の原理構成図を示す。図１
において、系統樹出力装置１は、予め求めた配列の類似
度行列４から類似度が徐々に増加するように揃えて見や
すく配置した系統樹を出力するものであって、系統樹処
理部２および出力部３などから構成されるものである。FIG. 1 is a block diagram showing the principle of the present invention. Figure 1
In the above, the phylogenetic tree output device 1 outputs a phylogenetic tree arranged in an easy-to-read manner so that the similarity gradually increases from the similarity matrix 4 of the array obtained in advance. It is composed of the unit 3 and the like.

【００１７】系統樹処理部２は、類似度行列４をもとに
系統樹情報５を生成したり、この系統樹情報５から最大
類似度を求めて付加したり、最大類似度を付加した系統
樹情報５をもとに系統樹６を生成したりなどするもので
ある。The phylogenetic tree processing unit 2 generates phylogenetic tree information 5 based on the similarity matrix 4, finds and adds the maximum similarity from the phylogenetic tree information 5, and adds the maximum similarity to the system. For example, a phylogenetic tree 6 is generated based on the tree information 5.

【００１８】出力部３は、系統樹６をディスプレイ７や
プリンタ８に出力するものである。類似度行列４は、配
列の間の類似度を予め求めて行列としたものであって、
図２の（イ）に示すように配列の対の類似度を予め求め
て行列としたものである。The output unit 3 outputs the system tree 6 to the display 7 and the printer 8. The similarity matrix 4 is a matrix in which the similarity between arrays is obtained in advance,
As shown in FIG. 2A, the similarity between pairs of arrays is obtained in advance and formed into a matrix.

【００１９】系統樹情報５は、類似度行列４中の最大類
似度の対の配列について新たな配列を生成および両者の
配列の類似度の平均を新たな配列の類似度とすることを
繰り返して生成したり（図２の（ロ）参照）、当該系統
樹情報５の対の配列のうちの大きい方の類似度を最大類
似度として求めて付加（図４参照）したものである。The phylogenetic tree information 5 is repeated by generating a new sequence for the pair of sequences having the maximum similarity in the similarity matrix 4 and using the average of the similarities of the two sequences as the similarity of the new sequence. It is generated (see (b) of FIG. 2) or obtained by adding the similarity of the larger of the paired sequences of the phylogenetic tree information 5 as the maximum similarity (see FIG. 4).

【００２０】系統樹６は、系統樹情報５から生成した配
列を見やすく配置した系統樹である（図９、図１０、図
１３）。ディスプレイ７は、系統樹を表示などするもの
である。The phylogenetic tree 6 is a phylogenetic tree in which the sequences generated from the phylogenetic tree information 5 are arranged in an easy-to-see manner (FIG. 9, FIG. 10, FIG. 13). The display 7 displays a phylogenetic tree.

【００２１】プリンタは、系統樹を印字するものであ
る。次に、図２から図１０を用いて本発明の１実施例の
構成の動作を順次具体的に説明する。The printer prints a tree. Next, the operation of the configuration of the first embodiment of the present invention will be sequentially and specifically described with reference to FIGS.

【００２２】ここで、Ｎ本の遺伝子配列が与えられてい
て、ｉ番目の配列とｊ番目の配列の間の類似度（あるい
は距離）がＳ（ｉ，ｊ）で表されるものとする。ここ
で、ｉ，ｊ＝１，２・・・Ｎである。配列間の類似度
（あるいは距離）は、Needlemanand Wunschアルゴリズ
ム(Needleman,S,B,and Wunsch,C.D.,"A general method
applicable to the search for similarities in the
amino acid sequences oftwo proteins,J.Mol,Biol.,4
8,445-453,1970)その他で求めることができる。Here, it is assumed that N gene sequences are given and the similarity (or distance) between the i-th sequence and the j-th sequence is represented by S (i, j). Here, i, j = 1, 2 ... N. The similarity (or distance) between sequences is determined by the Needleman and Wunsch algorithm (Needleman, S, B, and Wunsch, CD, "A general method
applicable to the search for similarities in the
amino acid sequences of two proteins, J. Mol, Biol., 4
8,445-453,1970) and others.

【００２３】よく知られたＵＰＧＭＡ法で系統樹を作成
する方法について説明する。まず類似度Ｓ（ｉ，ｊ）の
うちで、最大値を求める。そのときのＳ（ｉ，ｊ）の値
をｓｍａｘ、対となる配列番号ｉ、ｊの値をｉｍａｘ、
ｊｍａｘで表す。次のこの２つの配列ｉｍａｘ、ｊｍａ
ｘをひとつの配列グループＮ＋１とする。このグループ
化によって、その他の配列とのＳ（ｉ，ｊ）は計算しな
おす。ｋ番目の配列とｉｍａｘ、ｊｍａｘをひとまとめ
にした配列グループＮ＋１との類似度Ｓ（ｋ，Ｎ＋１）
は、Ｓ（ｋ，ｊｍａｘ）とＳ（ｓ，ｉｍａｘ）の平均値
とする。ｉｍａｘ＞Ｎあるいはｊｍａｓ＞Ｎのときは、
そのノードに含まれる配列どうしの類似度は全てについ
ての平均値を計算する。A method of creating a phylogenetic tree by the well-known UPGMA method will be described. First, of the similarities S (i, j), the maximum value is obtained. At that time, the value of S (i, j) is smax, the value of paired array element numbers i and j is imax,
It is represented by jmax. Next these two arrays imax, jma
Let x be one array group N + 1. By this grouping, S (i, j) with other arrays is recalculated. Similarity S (k, N + 1) between the k-th array and the array group N + 1 in which imax and jmax are grouped together
Is an average value of S (k, jmax) and S (s, imax). When imax> N or jmas> N,
For the similarity between the arrays included in the node, the average value for all is calculated.

【００２４】以上の手順を最後の一組の配列グループ２
Ｎ−１になるまで繰り返す。結果として、各ステップで
のｉｍａｘ、ｊｍａｘ、Ｓ（ｉｍａｘ、ｊｍａｘ）を記
憶する。これらの結果を用いて系統樹を描くことができ
る。The above procedure is followed by the final set of sequence group 2
Repeat until N-1. As a result, imax, jmax, S (imax, jmax) at each step are stored. A phylogenetic tree can be drawn using these results.

【００２５】図２は、本発明の類似度行列／系統樹情報
を示す。図２の（イ）は、２７本の配列の場合の類似度
の例を示す。ここで、配列１の配列名はＨＢＡ＄ＡＥＧ
ＭＯである。配列２から配列２７は図示のような配列名
である。横軸は配列番号１ないし２６を表し、縦軸は配
列番号２ないし２７と配列名を表す。中央の１７９２な
どの４桁の数字は類似度を表す。従って、ここでは配列
１ないし配列２７について相互の間の類似度を図示のよ
うに予め求めたとし、以下にこれから系統樹を生成する
手順を説明する。FIG. 2 shows similarity matrix / phyloge tree information of the present invention. FIG. 2A shows an example of the degree of similarity in the case of 27 arrays. Here, the array name of array 1 is HBA $ AEG
MO. Arrays 2 to 27 are array names as shown. The horizontal axis represents the sequence numbers 1 to 26, and the vertical axis represents the sequence numbers 2 to 27 and the sequence names. A 4-digit number such as 1792 in the center indicates the degree of similarity. Therefore, here, it is assumed that the degree of similarity between the arrays 1 to 27 is previously obtained as shown in the figure, and the procedure for generating a phylogenetic tree from this is described below.

【００２６】図２の（ロ）は、図２の（イ）の類似度行
列から配列番号２８の新たは配列および類似度を求めた
例を示す。求め方は、図２の（イ）の類似度行列の中で
類似度の最大値は１７９２であり、配列１７と配列３を
ひとまとめにして配列２８を生成する。配列２８と配列
１との類似度は（Ｓ（３，１）＋Ｓ（１７，１））／２＝（１６６０＋１６６５）／２＝１６６２．５・・・・・・・・・・・・・・・・・・・・・・・・・（１）となる。同様に、配列２、４・・・３７との類似度を計
算し、図２の（ロ）のように求める。FIG. 2B shows an example in which the new sequence and similarity of the array element number 28 are obtained from the similarity matrix of FIG. The maximum value of the similarity is 1792 in the similarity matrix of FIG. 2A, and the array 17 and the array 3 are put together to generate the array 28. The similarity between the array 28 and the array 1 is (S (3,1) + S (17,1)) / 2 = (1660 + 1665) /2=1662.5 ...・・・・・・・・・ (1) Similarly, the degree of similarity with the arrays 2, 4, ... 37 is calculated and determined as shown in (b) of FIG.

【００２７】次に、図２の（ロ）の類似度行列の中で最
大の類似度は１７８９で配列６と配列５ををひとまとめ
にして配列２９を生成する。配列２９と配列１との類似
度は式（１）と同様にして類似度を求める。以下同様の
手順を繰り返す。この際、系統樹を作図するのに必要な
ノード、類似度、その子供である配列１、配列２をここ
では系統樹情報として記憶しておく。Next, the maximum similarity is 1789 in the similarity matrix of FIG. 2B, the arrays 6 and 5 are grouped together to generate an array 29. The similarity between the array 29 and the array 1 is calculated in the same manner as in the equation (1). The same procedure is repeated thereafter. At this time, the nodes necessary for drawing the phylogenetic tree, the degree of similarity, and the children, Array 1 and Array 2, are stored here as phylogenetic tree information.

【００２８】図２の（ハ）は、系統樹情報を示す。これ
は、上述した図２の（イ）の類似度行列から図２の
（ロ）の類似度行列２８およびこれに続く２９ないし５
３を求めた際のノード、類似度、配列１、配列２を記憶
したものである。例えばは、図２の（ロ）の配列２８
が、図２の（イ）の類似度行列の最大の類似度１７９２
であって、まとめる対象となる対の配列が配列３と配列
１７であってこれは配列１、配列２となる。従って、図
示のように、ノード、類似度、配列１、配列２として、
２８、１７９２、３、１７が得られ、これを図示のよう
に系統樹情報５として記憶する。以下同様に、図示のよ
うに記憶する。FIG. 2C shows phylogenetic tree information. This is based on the similarity matrix of FIG. 2A described above from the similarity matrix 28 of FIG.
3 stores the node, the degree of similarity, the array 1, and the array 2 at the time of obtaining 3. For example, the array 28 in FIG.
Is the maximum similarity 1792 of the similarity matrix of FIG.
The paired sequences to be combined are the sequence 3 and the sequence 17, which are the sequence 1 and the sequence 2. Therefore, as shown in the figure, the node, the similarity, the array 1, and the array 2 are
28, 1792, 3, 17 are obtained and stored as the phylogenetic tree information 5 as shown. Similarly, the following is stored as illustrated.

【００２９】以上の手順によって求めた図２の（ハ）の
系統樹情報および図２の（イ）の類似度行列をもとに系
統樹を描くと、通常、図１４に示す従来の系統樹のよう
になり、類似度が不揃いとなり見難いので、本実施例で
は、系統樹を見やすく類似度順に描くために以下の手順
を行う。When a phylogenetic tree is drawn based on the phylogenetic tree information shown in FIG. 2C and the similarity matrix shown in FIG. 2A, the conventional phylogenetic tree shown in FIG. 14 is usually obtained. As described above, the similarities are not uniform and difficult to see. Therefore, in the present embodiment, the following procedure is performed in order to make the phylogenetic tree easy to see and to draw the descending order of the similarities.

【００３０】図３は、本発明の最大類似度と子供の関係
を示す。ここで、ひとまとめにした配列グループ（図３
では例えばｎｏｄｅ４８）に、類似度Ｓ（ｉ，ｊ）とは
別に、新しい変数、最大類似度ｓｉｍｍａｘ（ｋ）（ｋ
＝Ｎ＋１、Ｎ＋２・・・２Ｎ−１）を定義する。これ
は、ｉｍａｘとｊｍａｘ（図３ではｎｏｄｅ２８、４
３）のそれぞれの最大類似度ｓｉｍｍａｘ（ｉｍａｘ）
とｓｉｍｍａｘ（ｊｍａｘ）の大きい方を代入する。但
し、ｊｍａｘ＜Ｎかつｉｍａｘ＜Ｎのときは最大類似度
は未だ定義されていないので類似度をそのまま用いて、
ｓｉｍｍａｘ（ｎｏｄｅ）＝Ｓ（ｉｍａｘ，ｊｍａｘ）
とする。ｊｍａｘ＜Ｎかつｉｍａｘ＞Ｎのときはｓｉｍ
ｍａｘ（ｎｏｄｅ）＝ｓｉｍｍａｘ（ｉｍａｘ）とす
る。ｊｍａｘ＞Ｎかつｉｍａｘ＜Ｎのときはｓｉｍｍａ
ｘ（ｎｏｄｅ）＝ｓｉｍｍａｘ（ｊｍａｘ）とする。そ
してｓｉｍｍａｘの値の大きい方からグループ内での配
列順序をつけて表示する。つまり、ｉｍａｘ、ｊｍａｘ
あるいはｊｍａｘ、ｉｍａｘの順とする。図３の例で
は、ｓｉｍｍａｘ（ｊｍａｘ）＞ｓｉｍｍａｘ（ｉｍａ
ｘ）の場合で、２８、４３の順序となる。FIG. 3 shows the relationship between the maximum similarity and the child according to the present invention. Here, the collected sequence groups (Fig. 3
Then, for example, in the node 48), in addition to the similarity S (i, j), a new variable, maximum similarity simmax (k) (k
= N + 1, N + 2 ... 2N-1) is defined. This means imax and jmax (node 28, 4 in FIG. 3).
3) Maximum similarity simmax (imax) of each
And simmax (jmax) are substituted. However, when jmax <N and imax <N, the maximum similarity is not yet defined, so the similarity is used as it is,
simmax (node) = S (imax, jmax)
And If jmax <N and imax> N, then sim
Let max (node) = simmax (imax). When jmax> N and imax <N, simma
Let x (node) = simmax (jmax). Then, the larger the value of simmax, the more the arrangement order within the group is displayed. That is, imax, jmax
Alternatively, the order is jmax and imax. In the example of FIG. 3, simmax (jmax)> simmax (ima
In the case of x), the order is 28, 43.

【００３１】同様にして図２の（イ）ないし（ハ）で説
明した２７本の配列の例では、最大類似度と配列１、配
列２の順序について、図４のような結果が得られる。図
４は、本発明の系統樹情報を示す。この系統樹情報５
は、図２の（ハ）の系統樹情報５に対して、図３で説明
した最大類似度をノード２８から５３についてそれぞれ
求めて付加したものである。これを用いてノードを描く
ときは、例えばノード３３を描く場合、最大類似度の大
きいノード３１の方から先に描き、次に最大類似度の小
さいノード３０を描くとよく、これにより得られる系統
樹は希望の類似度順の見やすい形となる。Similarly, in the example of the 27 arrays described in (a) to (c) of FIG. 2, the results shown in FIG. 4 are obtained with respect to the maximum similarity and the order of array 1 and array 2. FIG. 4 shows phylogenetic tree information of the present invention. This phylogenetic tree information 5
3 is obtained by adding the maximum similarity described with reference to FIG. 3 to the nodes 28 to 53 to the phylogenetic tree information 5 of FIG. When drawing a node using this, for example, when drawing a node 33, it is better to draw the node 31 with the largest similarity and then the node 30 with the smallest similarity. The tree will be in a shape that is easy to see in the order of desired similarity.

【００３２】図５は、グラフイックスとキャラクタでの
系統樹の基本形の対応例を示す。図５の（イ）は、グラ
フィックスで系統樹を描く例を示す。図５の（ロ）は、
キャラクタで系統樹を描く本実施例の例を示す。ここで
は、系統樹の表示に用いるキャラクタとして、−、＋、
｜の３種類を用いる。ＦＯＲＴＲＡＮのように記号｜が
利用できないときは、：やＩを代用すればよい。このよ
うにキャラクタを用いて系統樹を表示するため、通常の
プリンタによって容易に系統樹を印字したりなどするこ
とが可能となる。FIG. 5 shows an example of correspondence between basic forms of phylogenetic trees in graphic and characters. FIG. 5A shows an example of drawing a phylogenetic tree by graphics. (B) of FIG.
An example of the present embodiment in which a character tree is drawn with a character is shown. Here, as the characters used for displaying the phylogenetic tree, −, +,
Three types of | are used. If the symbol | cannot be used as in FORTRAN, then: or I may be used instead. In this way, since the tree is displayed using the characters, it is possible to easily print the tree using a normal printer.

【００３３】図６は、ｌｅｖｅｌとｄｅｐｔｈの関係を
示す。これは、図２および図４で得られた情報（ｉｍａ
ｘ、ｊｍａｘ、ｓ（ｉｍａｘ、ｊｍａｘ）、ｓｉｍｍａ
ｘ（ｉ））をもとにして、最後のノード２Ｎ−１（ここ
ではノード５３）から逆方向に、対になっている配列グ
ループを辿っていきながら、系統図を作図してゆく際
の、下記７つの変数のうちの２つの変数、ｌｅｖｅｌと
ｄｅｐｔｈの関係を模式的に表したものである。例えば
図中に示すように、左のｎｏｄｅ５３はｌｅｖｅｌ
（０）であり、右のｎｏｄｅ４はｌｅｖｅｌ（１）であ
る。このｌｅｖｅｌの深さをｄｅｐｔｈ＝１とする。FIG. 6 shows the relationship between level and depth. This is due to the information (ima
x, jmax, s (imax, jmax), simma
Based on x (i)), when the systematic diagram is drawn while tracing the paired array groups in the opposite direction from the last node 2N-1 (here, node 53) 2 schematically shows the relationship between two variables out of the following seven variables, level and depth. For example, as shown in the figure, the left node 53 is level
(0), and the right node4 is level (1). The depth of this level is depth = 1.

【００３４】ｎｏｄｅ：各ノードの番号ｌｅｖｅｌ（ｋ）：ｄｅｐｔｈ＝ｋのノード番号ｆｌｇ：そのノードあるいは配列名が出力されたかどう
かを表すｄｅｐｔｈ：世代ｂａｒ：縦棒を出力するかどうかを指定ｐｏｓｉｔｉｏｎ：２つの子供のうちどちらかｌｅａｆ：出力された配列名の総数次に、図７のフローチャートを参照して系統樹の作図の
手順を詳細に説明する。Node: number of each node level (k): node number of depth = k flg: indicates whether the node or array name is output depth: generation bar: specifies whether to output vertical bars position: Either of the two children leaf: Total number of output sequence names Next, the procedure for drawing a phylogenetic tree will be described in detail with reference to the flowchart in FIG. 7.

【００３５】最初、初期値として、各ノードに旗、ｆｌ
ｇ（ｉ）（ｉ＝１、２・・・２Ｎ−１）を立てる。も
し、２つの配列がまだ処理されていないときは、０の値
を入れる。根からの距離としてｄｅｐｔｈ変数を定義
し、初期値として０を入れる。縦棒記号｜を書くための
選択変数として、ｂａｒ（ｋ）（ｋ＝Ｎ、Ｎ＋１・・・
２Ｎ−１）を定義し、初期値として０を代入する。各レ
ベルでの値ｌｅｖｅｌを定義し、初期値ｌｅｖｅｌ
（０）として２Ｎ−１を代入する。いま着目するノード
ｎｏｄｅの初期値として２Ｎ−１とおく。ｌｅａｆは０
とする。First, as an initial value, each node has a flag, fl.
Set up g (i) (i = 1, 2 ... 2N-1). If the two arrays have not yet been processed, enter a value of 0. Define the depth variable as the distance from the root, and enter 0 as the initial value. As a selection variable for writing the vertical bar symbol |, bar (k) (k = N, N + 1 ...
2N-1) is defined and 0 is substituted as an initial value. Defines the value level at each level, and the initial value level
Substitute 2N-1 as (0). 2N-1 is set as the initial value of the node node of interest. leaf is 0
And

【００３６】（Ａ）今着目するノードｎｏｄｅの子供
の２つのノードについて次の３通り（図７ではＳ２、Ｓ
３、Ｓ４）を調べる（図７ではＳ１であって、図８の
（ニ）のｎｏｄｅ＝５３、その子供は４と５２（図４参
照））。(A) The following three ways (S2, S in FIG. 7) of the two children of the node node of interest
3, S4) (S1 in FIG. 7, node = 53 in FIG. 8D, the children are 4 and 52 (see FIG. 4)).

【００３７】（１）両方の旗ｆｌｇが０の場合（図７
ではＳ２）：（１−１）子供のノードが両方ともＮ以下のときは、
番号の大きい方を子供１にして、小さい方を子供２にす
る（図８の（ニ）ではノード２８の場合、子供１＝１
７、子供２＝３）。(1) When both flags flg are 0 (see FIG. 7)
Then S2): (1-1) When both child nodes are N or less,
The child with the larger number is the child 1, and the child with the smaller number is the child 2 (in the case of node 28 in FIG.
7, children 2 = 3).

【００３８】（１−２）子供のノードが両方ともＮ以
上のときは、ｓｉｍｍａｘの小さい方を子供１にして、
大きい方を子供２にする（図８の（ニ）ではノード５２
の場合、子供１＝４７、ｓｉｍｍａｘ（４７）＝１６８
４、子供２＝５１、ｓｉｍｍａｘ（５１）＝１７９
２）。(1-2) When both children's nodes are equal to or more than N, the one with smaller simmax is set to the child 1,
The larger one is the child 2 (node 52 in FIG. 8D)
, Child 1 = 47, simmax (47) = 168
4, children 2 = 51, simmax (51) = 179
2).

【００３９】ｎｏｄｅを子供１にして、ｐｏｓｉｏｎ＝
１とする（図８の（ニ）ではｎｏｄｅ＝４）。ｄｅｐｔｈに１を加える。Set node to child 1 and position =
1 (node = 4 in (d) of FIG. 8). Add 1 to depth.

【００４０】更にそのノードの両方の子供の旗ｆｌｇが
０ならば、ｂａｒの値を１にする。ｂａｒ（ｌｅｖｅｌ（ｄｅｐｔｈ−１））＝１そして、（Ｂ）に進む。Further, if the flags flg of both children of the node are 0, the value of bar is set to 1. bar (level (depth-1)) = 1 Then, the process proceeds to (B).

【００４１】（２）もし子供がどちらかの旗が１なら
ば（図７ではＳ３）、０の方をｎｏｄｅに選択する。ｐ
ｏｓｉｔｉｏｎの値を子供の番号にする。即ち子供１の
旗が０ならば、ｐｏｓｉｔｉｏｎ＝１、子供２の旗が０
ならば、ｐｏｓｉｔｉｏｎ＝２とする。(2) If the child has either flag of 1 (S3 in FIG. 7), 0 is selected as the node. p
Set the value of position to the child's number. That is, if the flag of child 1 is 0, position = 1 and the flag of child 2 is 0.
If so, position = 2.

【００４２】ｄｅｐｔｈに１を加える。次に（Ｂ）に進
む。（３）もし子供の両方ともｆｌｇが１ならば（図７で
はＳ４であって、図８の（ニ）で配列番号２２、２１を
出力した後のｎｏｄｅ＝４７）。Add 1 to the depth. Then proceed to (B). (3) If both children have flg of 1 (S4 in FIG. 7, node = 47 after outputting sequence numbers 22 and 21 in (d) of FIG. 8).

【００４３】いまのノード（図８の（ニ）ではノード４
７）の旗を１にする。ｄｅｐｔｈから１を引く。そして
１段前のノードに戻る（ｎｏｄｅ＝ｌｅｖｅｌ（ｄｅｐ
ｔｈ）、図８の（ニ）ではｎｏｄｅ＝５２）。The current node (node 4 in (d) of FIG. 8)
Set the flag of 7) to 1. Subtract 1 from depth. Then, return to the node one step before (node = level (dep
th), and in FIG. 8D, node = 52).

【００４４】そしてもしそのノードの子孫の２番目（図
８の（ニ）ではノード５１）の旗が０で、子供１（図８
の（ニ）ではノード４７）の旗が１のときは、ｋ＝２Ｎ
−１、２Ｎ−２・・・ｎｏｄｅ−１について、ｂａｒ
（ｋ）＝０ならば空白を、ｂａｒ（ｋ）＝１ならば縦棒
と空白を書く（図７ではＳ５）。If the flag of the second descendant of the node (node 51 in FIG. 8 (d)) is 0, and child 1 (FIG. 8).
In (d), when the flag of node 47) is 1, k = 2N
-1, 2N-2 ... For node-1, bar
If (k) = 0, write a blank, and if bar (k) = 1, write a vertical bar and a blank (S5 in FIG. 7).

【００４５】そして、手順の先頭（Ａ）に戻る。（Ｂ）次に、ｌｅｖｅｌ（ｄｅｐｔｈ）＝ｎｏｄｅと
する。ｎｏｄｅの値で次のどちらかに進む。Then, the procedure returns to the beginning (A) of the procedure. (B) Next, level (depth) = node is set. Depending on the value of node, proceed to either of the following.

【００４６】（１）もし、ｎｏｄｅ＞Ｎならば（図７
ではＳ６）、＋記号を書く。もし、ｌｅｖｅｌ（ｄｅｐ
ｔｈ−１）−ｎｏｄｅ−１＞１ならば、その値の数だけ
----を書く（図７ではＳ７）。(1) If node> N (see FIG. 7)
Then S6), write the + sign. If level (dep
th-1) -node-1> 1, if the number of values
Write ---- (S7 in Fig. 7).

【００４７】そして、ｎｏｄｅ番号を書く。もし、ｐｏ
ｓｉｔｉｏｎ＝２ならば、ｂａｒ（ｌｅｖｅｌ（ｄｅｐ
ｔｈ−１））＝０とする。Then, write the node number. If po
If position = 2, bar (level (dep
th-1)) = 0.

【００４８】（２）もし、ｎｏｄｅ＜Ｎならば（図７
ではＳ８）、＋記号を書く。ｌｅｖｅｌ（ｄｅｐｔｈ−
１）−Ｎ個の----を書く。ノード番号と配列名を書く。
改行する（図７ではＳ８）。(2) If node <N (see FIG. 7)
Then S8), write the + sign. level (depth-
1) Write N --- pieces. Write the node number and array name.
A line feed is made (S8 in FIG. 7).

【００４９】ｐｏｓｉｔｉｏｎ＝２ならば、ｂａｒ（ｌ
ｅｖｅｌ（ｄｅｐｔｈ−１））＝０とする。このノード
の旗を１にする。If position = 2, bar (l
It is set as (ever (depth-1)) = 0. Set the flag of this node to 1.

【００５０】葉の数を１増加させる。次のノードをｄｅ
ｐｔｈ−１にする。もし、ｐｏｓｉｔｉｏｎ＝１で、ｄ
ｅｐｔｈ＞１ならば、２Ｎ−１−ｎｏｄｅ＋１個の空白
または｜を書く（図７ではＳ１０）。Increase the number of leaves by one. De next node
Set to pth-1. If position = 1, d
If ept> 1, write 2N-1-node + 1 blanks or | (S10 in FIG. 7).

【００５１】ｄｅｐｔｈの値を１引く。もし、葉の数が
Ｎ以下であれば、手順の先頭（Ａ）に戻る。Ｎであれ
ば、手順を終了する。Subtract 1 from the value of depth. If the number of leaves is N or less, the process returns to the beginning (A) of the procedure. If N, the procedure ends.

【００５２】以上の手順で得られた出力結果を最終行か
ら逆に書くと、希望の系統樹が得られる。次に、図８か
ら図１０を用いて具体的に説明する。By writing the output results obtained by the above procedure in reverse order from the last line, a desired phylogenetic tree can be obtained. Next, a specific description will be given with reference to FIGS.

【００５３】既述した２７本の配列の例（図４）では、
ノード５３から始める。５３を出力する。（Ａ）につい
ては、図４からその子供は４と５２である（図７のＳ
１）。両方ともにまだ旗ｆｌｇが０である（図７のＳ
２）。ｎｏｄｅ＝４、ｐｏｓｉｏｎ＝１、ｄｅｐｔｈ＝
１として、（Ｂ）に進む。ｌｅｖｅｌ（１）＝４とす
る。４＜Ｎ＝２７なので（図７のＳ８）、＋を出力する
（図７のＳ９）。ｌｅｖｅｌ（ｄｅｐｔｈ−１）−Ｎ＝
５３−２７個の---を出力（図７のＳ９）。配列番号と
配列名を出力して、改行する（図７のＳ９）。配列番号
４についてｆｌｇ＝１とする（図７のＳ９）。配列４を
出力したので、葉の下図（ｌｅａｆ）を１とする。ここ
までの手順で、図８の（イ）の出力結果が得られる。In the example of the 27 arrays described above (FIG. 4),
Start at node 53. 53 is output. For (A), the children are 4 and 52 from FIG. 4 (S in FIG. 7).
1). In both cases, the flag flg is still 0 (S in FIG. 7).
2). node = 4, position = 1, depth =
If the value is 1, proceed to (B). Let level (1) = 4. Since 4 <N = 27 (S8 in FIG. 7), + is output (S9 in FIG. 7). level (depth-1) -N =
53-27 pieces of --- are output (S9 in FIG. 7). The array number and array name are output and a line feed is performed (S9 in FIG. 7). Flg = 1 is set for SEQ ID NO: 4 (S9 in FIG. 7). Since the array 4 has been output, the lower leaf (leaf) is set to 1. By the procedure up to this point, the output result of (a) in FIG. 8 is obtained.

【００５４】ｎｏｄｅ＝ｌｅｖｅｌ（ｄｅｐｔｈ−１）
＝５３とする。ｄｅｐｔｈから１を引く。（Ａ）に戻る
（Ｓ１０は実行しない）。ノード５３の片方の子供１
（配列番号４）の旗ｆｌｇが１なので（図７のＳ６）、
（Ａ）の（２）からｎｏｄｅ＝５２、ｐｏｓｉｔｉｏｎ
＝２、ｄｅｐｔｈ＝１として、（Ｂ）に進む。ｎｏｄｅ
＝５２＞Ｎ＝２７なので（図７のＳ６）、＋を出力、ノ
ード番号５２を出力する（図７のＳ７）。ｐｏｓｉｔｉ
ｏｎ＝２なので、ｂａｒ（５３）＝０とする。（Ａ）に
戻る。Node = level (depth-1)
= 53. Subtract 1 from depth. The process returns to (A) (S10 is not executed). One child of node 53
Since the flag flg of (SEQ ID NO: 4) is 1 (S6 in FIG. 7),
From (2) of (A), node = 52, position
= 2 and depth = 1, the process proceeds to (B). node
Since = 52> N = 27 (S6 in FIG. 7), + is output and the node number 52 is output (S7 in FIG. 7). postiti
Since on = 2, bar (53) = 0 is set. Return to (A).

【００５５】ノード５２と子供４７と５１のｆｌｇが０
である（図７のＳ２）。（Ａ）の（１）の（１−２）か
らｓｉｍｍａｘの小さいノード４７を子供１にする。図
７のＳ６、Ｓ７に進み、＋と--と４７を出力する。The flg of the node 52 and the children 47 and 51 is 0.
(S2 in FIG. 7). From (1-2) of (1) of (A), a node 47 having a small simmax is set as a child 1. Proceeding to S6 and S7 in FIG. 7, +,-, and 47 are output.

【００５６】（Ａ）に戻り、ノード４７の子供２２と２
１のｆｌｇが０である（図７のＳ２）。（Ａ）の（１）
の（１−１）から番号の大きいノード２２を子供１とす
る。２２＜Ｎ＝２７なので、＋と--と配列番号と配列名
を出力する（図７のＳ９）。配列２２は、ｐｏｓｉｔｉ
ｏｎ＝１でｄｅｐｔｈ＝２＞１なので、空白と縦棒｜を
出力する（図７のＳ１０）。ｄｅｐｔｈから１を引いて
（Ａ）に戻る。ここまでの手順で、図８の（ロ）の出力
結果が得られる。Returning to (A), children 22 and 2 of node 47
The flg of 1 is 0 (S2 in FIG. 7). (1) of (A)
The node 22 having a larger number from (1-1) in FIG. Since 22 <N = 27, +,-, the array element number and the array name are output (S9 in FIG. 7). Sequence 22 is a positioni
Since on = 1 and depth = 2> 1, blank and vertical bar | are output (S10 in FIG. 7). Subtract 1 from depth and return to (A). By the procedure up to this point, the output result of (b) in FIG. 8 is obtained.

【００５７】いまノードは４７である。子供２の配列２
１の処理がまだなので（図７のＳ３）、それを実行し、
２１の配列名まで出力する（図７のＳ８、Ｓ９）。また
１世代前のノード４７に戻る。ｄｅｐｔｈから１を引
く。（Ａ）へ戻る。この時点でノード４７の子供は両方
ともｆｌｇ＝１なので（図７のＳ４）、１世代前のノー
ド５２に戻る。ここでノード５２の子供１はｆｌｇ＝１
で子供２はｆｌｇ＝０なので、必要な個数の空白を出力
する（図７のＳ５）。（Ａ）で戻り、ノード５２の子供
１はｆｌｇ＝１で子供２（ノード５１）はｆｌｇ＝０な
ので（図７のＳ３）、（Ｂ）へ進む。ノード５１につい
て＋とノード番号を出力する（図７のＳ６、Ｓ７）。こ
こまでの手順で、図８の（ハ）の出力結果が得られる。There are now 47 nodes. Child 2 array 2
Since the process of 1 is not done yet (S3 in FIG. 7), execute it,
Up to 21 array names are output (S8 and S9 in FIG. 7). Also, the process returns to the node 47 one generation before. Subtract 1 from depth. Return to (A). At this point, both children of the node 47 have flg = 1 (S4 in FIG. 7), and the process returns to the node 52 one generation before. Here, the child 1 of the node 52 has flg = 1.
Since child 2 has flg = 0, it outputs the necessary number of blanks (S5 in FIG. 7). Returning to (A), the child 1 of the node 52 has flg = 1 and the child 2 (node 51) has flg = 0 (S3 in FIG. 7), and the process proceeds to (B). The + and the node number are output for the node 51 (S6 and S7 in FIG. 7). By the procedure up to this point, the output result of FIG. 8C is obtained.

【００５８】以下このような手順を進めて＋、--、｜、
ノード番号、配列名を出力していく。結局、葉の数（ｌ
ｅａｆ）が２７になったところで、図８の（ニ）に示す
ような系統樹が得られる。The procedure described above is then advanced to +,-, |,
The node number and array name are output. After all, the number of leaves (l
When eaf) becomes 27, a phylogenetic tree as shown in (d) of FIG. 8 is obtained.

【００５９】この図８の（ニ）のキャラクタによる系統
図では、類似度の低い配列対から出力しているので、こ
れを逆に描くことにより類似度の高い配列対から出力し
た系統樹として、図９の系統樹を得ることができる。In the systematic diagram by the character (d) of FIG. 8, since the output is made from the sequence pair having a low degree of similarity, the systematic tree output from the sequence pair having a high degree of similarity can be drawn by reversing this. The phylogenetic tree of FIG. 9 can be obtained.

【００６０】図９は、本発明のキャラクタによる系統樹
の表現例を示す。これは、既述した手順によって、図８
の（ニ）のように求めたものを逆に描き、類似度の高い
配列対から出力した系統樹である。ここで、キャラクタ
として、＋、--、|の３種類を用いたが、他のもの（例
えば：、Ｉ）を用いてもよい。また、枝を描くための--
--の個数を任意に変えてもよい。ノード番号を記入せず
に系統樹を描いてもよい。FIG. 9 shows an example of representation of a phylogenetic tree by the character of the present invention. This is shown in FIG.
This is a phylogenetic tree that was drawn in reverse from the one obtained in (d) and output from the pair of sequences with high similarity. Here, three types of characters, +,-, and | are used, but other types (for example,:, I) may be used. Also for drawing branches--
The number of --may be changed arbitrarily. A phylogenetic tree may be drawn without entering the node number.

【００６１】図１０は、本発明の他のキャラクタによる
系統樹の表現例を示す。これは、マルチプルアライメン
トの結果との合成例であって、図９とは異なる配列デー
タや類似度の値となっている。FIG. 10 shows an example of representation of a phylogenetic tree by another character of the present invention. This is an example of composition with the result of multiple alignment, and has different sequence data and similarity values from those in FIG. 9.

【００６２】次に、図１１から図１３を用いて他の実施
例の構成および動作を詳細に説明する。（１）既述した最大類似度とそれによる配列１、配列
２の順序についての情報を利用し、グラフィックで系統
樹を表示する例を示す。２７本の配列の場合、まず図４
から、ノード５３はノード５２とノード４の順序であ
る。更にノード５２はノード５１とノード４７の順序で
ある。以下ノード２８まで配列順序を展開して辿ってい
くと、２７本全体の配列順序が３、１７、５、６、１
８、７、１６、８、１、１１、９、１２、１３、２４、
２５、２６、２７、１９、１４、１５、２、２０、２
３、１０、２１、２２、４となる。そこでこの順序で配
列番号と配列名を垂直方向に単位長さづつシフトしなが
らグラフィックの出力として描いていく（図１１のＳ２
１、図１２の２１）。Next, the configuration and operation of another embodiment will be described in detail with reference to FIGS. 11 to 13. (1) An example in which a phylogenetic tree is graphically displayed by using the above-described maximum similarity and the information about the order of the array 1 and the array 2 resulting therefrom is shown. In the case of 27 arrays, first, FIG.
Therefore, the node 53 is the order of the node 52 and the node 4. Further, the node 52 is the order of the node 51 and the node 47. When the array order is expanded to the node 28 and traced below, the array order of the entire 27 lines is 3, 17, 5, 6, 1.
8, 7, 16, 8, 1, 11, 9, 12, 13, 24,
25, 26, 27, 19, 14, 15, 2, 20, 2
It becomes 3, 10, 21, 22, and 4. Therefore, in this order, the array number and the array name are vertically shifted by a unit length and drawn as a graphic output (S2 in FIG. 11).
1, 21 in FIG. 12).

【００６３】（２）次に、図４のノード２８からその
子供を結合していく。ノード２８の子供は３と１７なの
で（図１１のＳ２２、図１２のＳ２２）、図１２の
（イ）のようになる。ここで、枝の水平方向の長さは類
似度から決める。同様にして、次のノード２９について
もその子供を結合し、図１２の（ロ）に示すようにな
る。以下同様にしてノード５３までその子供を結合する
と、図１３に示すような系統樹が描ける。(2) Next, the child is joined from the node 28 in FIG. Since the children of the node 28 are 3 and 17 (S22 of FIG. 11 and S22 of FIG. 12), the result is as shown in FIG. Here, the horizontal length of the branch is determined from the similarity. Similarly, the children of the next node 29 are also combined, as shown in FIG. When the children are connected to the node 53 in the same manner, a phylogenetic tree as shown in FIG. 13 can be drawn.

【００６４】[0064]

【発明の効果】以上説明したように、本発明によれば、
類似度行列４から最大の類似度の２つの配列を新たな１
つの配列にした系統樹情報５を生成し、この系統樹情報
５をもとに各ノードの最大類似度を求めて付加し、この
最大類似度の順に系統樹を生成する構成を採用している
ため、類似度が徐々に増加するように揃えて配置して見
やすくかつ比較し易い系統樹を描くことができる。これ
により、（１）類似度（あるいは距離）の値から見やすい系統
樹を作図できる。As described above, according to the present invention,
From the similarity matrix 4, the two arrays with the maximum similarity are added to the new 1
Generating the phylogenetic tree information 5 in one array, finding the maximum similarity of each node based on this phylogenetic tree information 5 and adding it, the phylogenetic tree is generated in the order of this maximum similarity. Therefore, it is possible to draw phylogenetic trees that are easy to see and compare by arranging them so that the similarity gradually increases. As a result, (1) it is possible to draw a phylogenetic tree that is easy to see from the value of similarity (or distance).

【００６５】（２）キャラクタ表示／印刷により広範
囲の計算機で、グラフィックスソフトに影響されずに利
用できる。（３）マルチプルアライメントと並べて表示／印刷す
ることにより、アライメント結果を理解するのに役立
つ。(2) Character display / printing enables use in a wide range of computers without being affected by graphics software. (3) Displaying / printing side by side with multiple alignment helps to understand the alignment result.

[Brief description of drawings]

【図１】本発明の原理構成図である。FIG. 1 is a principle configuration diagram of the present invention.

【図２】本発明の類似度行列／系統樹情報である。FIG. 2 is similarity matrix / phyloge tree information of the present invention.

【図３】本発明の最大類似度と子供の関係図である。FIG. 3 is a relationship diagram between the maximum similarity and the child according to the present invention.

【図４】本発明の系統樹情報である。FIG. 4 is phylogenetic tree information of the present invention.

【図５】グラフィックスとキャラクタでの系統樹の基本
形の対応例である。FIG. 5 is an example of correspondence between basic forms of a phylogenetic tree in graphics and characters.

【図６】ｌｅｖｅｌとｄｅｐｔｈの関係図である。FIG. 6 is a relationship diagram between level and depth.

【図７】本発明の動作説明フローチャートである。FIG. 7 is a flowchart for explaining the operation of the present invention.

【図８】本発明のキャラクタによる系統樹の表現例であ
る。FIG. 8 is an example of representation of a phylogenetic tree by the character of the present invention.

【図９】本発明のキャラクタによる系統樹の表現例であ
る。FIG. 9 is an example of representation of a phylogenetic tree by the character of the present invention.

【図１０】本発明の他のキャラクタによる系統樹の表現
例である。FIG. 10 is a representation example of a phylogenetic tree by another character of the present invention.

【図１１】本発明の他のグラフィックで系統樹を作図す
る手順である。FIG. 11 is a procedure for drawing a phylogenetic tree with another graphic according to the present invention.

【図１２】本発明の他の実施例の説明図である。FIG. 12 is an explanatory diagram of another embodiment of the present invention.

【図１３】本発明の他の系統樹である。FIG. 13 is another phylogenetic tree of the present invention.

【図１４】従来の系統樹例である。FIG. 14 is an example of a conventional phylogenetic tree.

[Explanation of symbols]

１：系統樹出力装置２：系統樹処理部３：出力部４：類似度行列５：系統樹情報６：系統樹７：ディスプレイ８：プリンタ 1: Phylogenetic tree output device 2: Phylogenetic tree processing unit 3: Output unit 4: Similarity matrix 5: Phylogenetic tree information 6: Phylogenetic tree 7: Display 8: Printer

Claims

[Claims]

1. A phylogenetic tree output device for outputting a phylogenetic tree, comprising: a similarity matrix (4) which is a matrix in which the similarity between arrays is obtained in advance; and a maximum similarity of the similarity matrix (4). Repeating the generation of new sequences and similarities for the paired sequences,
The similarity of the created phylogenetic tree information (5) and the larger of the paired sequences of the phylogenetic tree information (5) is obtained as the maximum similarity, and is added to the phylogenetic tree information (5). A phylogenetic tree output device configured to generate and output a phylogenetic tree in the order of maximum similarity of the phylogenetic tree information (5).

2. A phylogenetic tree is sequentially displayed in the order of maximum similarity of the added phylogenetic tree information (5) as characters (for example, +, −, |
And the like) are used to output the system tree output device according to claim 1.

3. The sequence numbers and sequence names are arranged by expanding the added phylogenetic tree information (5) in ascending order (or descending order) of the maximum similarity, and outputting these sequence numbers and sequence names in the vertical direction.
2. The phylogenetic tree output device according to claim 1, wherein the phylogenetic tree is configured so that a child is connected to these and the phylogenetic tree is graphically generated and output.